1mth202 Soln

Uploaded by

Bakr Aladrisy

Lecture Notes on Discrete Mathematics

July 21, 2018

Contents

1 Basic Set Theory
1.1 Basic Set Theory
1.1.1 Union and Intersection of Sets
1.1.2 Set Difference, Set Complement and the Power Set
1.2 Relations and Functions
1.2.1 Composition of Functions
1.2.2 Equivalence Relation
1.3 Advanced topics in Set Theory and Relations∗
1.3.1 Families of Sets
1.3.2 More on Relations

2 Peano Axioms and Countability
2.1 Peano Axioms and the set of Natural Numbers
2.1.1 Addition, Multiplication and its properties
2.1.2 Well Ordering in N
2.1.3 Applications
2.2 Finite and Infinite Sets
2.3 Countable and Uncountable sets
2.3.1 Cantor’s Lemma
2.3.2 Creating Bijections
2.3.3 Schröder-Bernstein Theorem
2.4 Integers and Modular Arithmetic
2.5 Construction of Integers and Rationals∗
2.5.1 Construction of Integers
2.5.2 Construction of Rational Numbers

3 Partial Orders, Lattices and Boolean Algebra
3.1 Partial Orders
3.2 Lattices
3.3 Boolean Algebras

4 Basic Counting
4.1 Permutations and Combinations
4.1.1 Multinomial theorem
4.2 Circular Permutations
4.3 Solutions in Non-negative Integers
4.4 Set Partitions
4.5 Lattice Paths and Catalan Numbers
4.6 Some Generalizations

5 Advanced Counting Principles
5.1 Pigeonhole Principle
5.2 Principle of Inclusion and Exclusion
5.3 Generating Functions
5.4 Recurrence Relation
5.5 Generating Function from Recurrence Relation

6 Introduction to Logic
6.1 Propositional Logic
6.2 Predicate Logic∗

7 Graphs
7.1 Basic Concepts
7.2 Connectedness
7.3 Isomorphism in Graphs
7.4 Trees
7.5 Connectivity
7.6 Eulerian Graphs
7.7 Hamiltonian Graphs
7.8 Bipartite Graphs
7.9 Matching in Graphs
7.10 Ramsey Numbers
7.11 Degree Sequence
7.12 Planar Graphs
7.13 Vertex Coloring
7.14 Representing graphs with Matrices
7.14.1 More Exercises

Index

Chapter 1

Basic Set Theory

We will use the following notation throughout the book.

1. The empty set, denoted ∅, is the set that has no element.

2. N := {1, 2, . . .}, the set of Natural numbers;

3. W := {0, 1, 2, . . .}, the set of whole numbers;

4. Z := {. . . , −2, −1, 0, 1, 2, . . .}, the set of Integers;

5. Q := {p/q : p, q ∈ Z, q ≠ 0}, the set of Rational numbers;

6. R := the set of Real numbers; and

7. C := the set of Complex numbers.

Note that, with this convention, the integer 0 is a whole number but not a natural number. This
chapter will be devoted to understanding set theory, relations, functions and the principle of
mathematical induction. We start with basic set theory.

1.1 Basic Set Theory

Over the last two centuries, mathematicians have become used to the idea of considering a
collection of objects/numbers as a single entity. These entities are what are typically called sets.
The technique of using the concept of a set to answer questions is hardly new; it has been in use
since ancient times. However, the rigorous treatment of sets began only in the 19th century, with
the German mathematician Georg Cantor, who was largely responsible for ensuring that the set
had a home in mathematics. Cantor developed the concept of the set during his study of
trigonometric series, which led him to what is now known as the limit point or the derived set
operator. He developed the transfinite numbers, of which the ordinals and cardinals are two types.
His new and pathbreaking ideas were not well received by his contemporaries. Further, from his
definition of a set, a number of contradictions and paradoxes arose. One of the most famous is
Russell’s Paradox, discovered by Bertrand Russell in 1901. This paradox, amongst others, set the
stage for the development of axiomatic set theory. The interested reader may refer to Katz [8].
In this book, we will consider the intuitive or naive viewpoint of sets.
The notion of a set is taken as a primitive and so we will not try to define it explicitly. On
the contrary, we will give it an informal description and then go on to establish the properties
of a set.
A set can be described intuitively as a collection of distinct objects. The objects are called the
elements or members of the set. Here, we assume that one can always say whether or not a given
object belongs to a set.
The objects can be just about anything, from real physical things to abstract mathematical
objects. The principal distinguishing feature of a set is that its objects are “distinct” or
“uniquely identifiable.”
Any object of the collection comprising a set is referred to as an element of the set. So, if S is a
set and x is an element of S, we denote it by x ∈ S. If x is not an element of S, we denote it by
x ∉ S.
A set is typically denoted by curly braces, { }.

Example 1.1.1. 1. X = {apple, tomato, orange}. Hence, orange ∈ X, but potato ∉ X.

2. X = {a₁, a₂, . . . , a₁₀}. Then, a₁₀₀ ∉ X.

3. Observe that the sets {1, 2, 3}, {3, 1, 2} and { digits in the number 12321} are the same as
the order in which the elements appear doesn’t matter.

We now address the idea of distinctness of elements of a set, which comes with its own subtleties.

Example 1.1.2. 1. Consider a collection of identical red balls in a basket. Is it a set?

Ans: This is a set because in principle, the balls in the basket are uniquely identifiable. For
example, we can paint a different number on them.

2. Consider the list of digits 1, 2, 1, 4, 2. Is it a set?

Ans: No, it is not a set as there is no way to distinguish the first 1 from the next. Same holds
for the number 2.

3. Let X = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10}. Then X is the set of first 10 natural numbers. Or
equivalently, X is the set of integers between 0 and 11.

Definition 1.1.3. [Empty Set] The set S that contains no element is called the empty set
or the null set denoted by { } or ∅.

An object x is an element or a member of a set S, written x ∈ S, if x satisfies the rule that
defines the membership for S. With this notation, one has three main ways for specifying a set.
They are:

1. Listing all its elements (list notation), e.g., X = {2, 4, 6, 8, 10}. Then X is the set of even
integers between 0 and 12.

2. Stating a property with notation (predicate notation), e.g.,

(a) X = {x : x is a prime number}. This is read as “X is the set of all x such that x
is a prime number”. Here x is a variable and stands for any object that meets the
criteria after the colon.
(b) The set X = {2, 4, 6, 8, 10} in the predicate notation can be written as
i. X = {x : 0 < x ≤ 10, x is an even integer }, or
ii. X = {x : 1 < x < 11, x is an even integer }, or
iii. X = {x : 2 ≤ x ≤ 10, x is an even integer } etc.

(c) X = {x : x is a student in IITK and x is older than 30}.

Note that the above expressions are certain rules that help in defining the elements of the
set X. In general, one writes X = {x : p(x)} or X = {x | p(x)} to denote the set of all
elements x (variable) such that property p(x) holds. Note that, in the above, the colon “:” is
sometimes replaced by the vertical bar “|”.

3. Defining a set of rules which generate its members (recursive notation), e.g., let X = {x :
x is an even integer greater than 3}. Then, X can also be written as

(a) 4 ∈ X.
(b) whenever x ∈ X then x + 2 ∈ X.
(c) no other element different from those above belongs to X.

Thus, in the recursive notation, the first rule is the basis of the recursion, the second rule gives
a method to generate new element(s) from the elements already determined, and the third
rule restricts the defined set to the elements generated by the first two rules. The
third rule should always be there but, in practice, it is often left implicit. At this stage, one
should make it explicit.
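
The three notations above can be mirrored in a short Python sketch. It is only an illustration: the helper name `generate` and the upper bound are assumptions introduced here so that the recursively generated set stays finite.

```python
# 1. List notation.
X_list = {2, 4, 6, 8, 10}

# 2. Predicate notation: {x : 0 < x <= 10, x is an even integer}.
X_pred = {x for x in range(1, 11) if x % 2 == 0}

# 3. Recursive notation: a basis element and a generating rule.
#    The bound is an assumption that makes the loop terminate.
def generate(basis, step, bound):
    members = set()
    x = basis                 # rule (a): the basis belongs to the set
    while x <= bound:
        members.add(x)        # rule (b): from x we obtain x + step
        x += step
    return members            # rule (c): nothing else belongs to the set

X_rec = generate(basis=2, step=2, bound=10)
print(X_list == X_pred == X_rec)

# The even integers greater than 3, cut off at 12 for display.
print(sorted(generate(basis=4, step=2, bound=12)))
```

All three descriptions produce the same Python `set`, which also reflects the fact that order and repetition play no role in a set.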

Definition 1.1.4. [Subset and Equality] Let X and Y be two sets.

1. Let Z be a set such that x ∈ X whenever x ∈ Z. Then Z is said to be a subset of
the set X, denoted Z ⊆ X.

2. If X ⊆ Y and Y ⊆ X, then X and Y are said to be equal, denoted X = Y .

Example 1.1.5. 1. Let X be a set. Then X ⊆ X. Also, ∅ ⊆ X holds vacuously, as ∅ has no
element. Thus, the empty set is a subset of every set.

2. We know that N ⊆ W ⊆ Z ⊆ Q ⊆ R ⊆ C.

3. Note that ∅ ∉ ∅.

4. Let X = {a, b, c}. Then a ∈ X but {a} ⊆ X. Also, {{a}} ⊈ X.

5. If S ⊆ T and S ≠ T then S is called a proper subset of T. That is, there exists an
element a ∈ T such that a ∉ S.

In the next two subsections, we mention set operations that help us in generating new sets
from existing sets.

1.1.1 Union and Intersection of Sets

Definition 1.1.6. [Set Union and Intersection] Let X and Y be two sets.
1. The union of X and Y , denoted by X ∪ Y , is the set whose elements are the elements of
X as well as the elements of Y . Specifically, X ∪ Y = {x | x ∈ X or x ∈ Y }.
2. The intersection of X and Y, denoted by X ∩ Y, is the set that contains only the common
elements of X and Y. Specifically, X ∩ Y = {x | x ∈ X and x ∈ Y}. The sets X and Y
are said to be disjoint if X ∩ Y = ∅.
Example 1.1.7. 1. Let A = {1, 2, 4, 18} and B = {x : x is an integer, 0 < x ≤ 5}. Then,

A ∪ B = {1, 2, 3, 4, 5, 18} and A ∩ B = {1, 2, 4}.

2. Let S = {x ∈ R : 0 ≤ x ≤ 1} and T = {x ∈ R : .5 ≤ x < 7}. Then,

S ∪ T = {x ∈ R : 0 ≤ x < 7} and S ∩ T = {x ∈ R : .5 ≤ x ≤ 1}.

3. Let A = {{b, c}, {{b}, {c}}, b} and B = {a, b, c}. Then

A ∩ B = {b} and A ∪ B = {a, b, c, {b, c}, {{b}, {c}} }.

We now state a few properties related to union and intersection of sets. The proof of only
the first distributive law is presented. The readers are supposed to provide proofs of the other
results.

Lemma 1.1.8. Let R, S and T be three sets. Then,

1. Obvious properties:
(a) S ∪ T = T ∪ S and S ∩ T = T ∩ S (union and intersection are commutative operations).
(b) R ∪ (S ∪ T ) = (R ∪ S) ∪ T and R ∩ (S ∩ T ) = (R ∩ S) ∩ T (union and intersection
are associative operations).
(c) S ⊆ S ∪ T, T ⊆ S ∪ T .
(d) S ∩ T ⊆ S, S ∩ T ⊆ T .
(e) S ∪ ∅ = S, S ∩ ∅ = ∅.
(f ) S ∪ S = S ∩ S = S.

2. Distributive laws (combines union and intersection):

(a) R ∪ (S ∩ T) = (R ∪ S) ∩ (R ∪ T) (union distributes over intersection).
(b) R ∩ (S ∪ T) = (R ∩ S) ∪ (R ∩ T) (intersection distributes over union).

Proof. Let x ∈ R ∪ (S ∩ T). Then, x ∈ R or x ∈ S ∩ T. If x ∈ R then clearly, x ∈ R ∪ S
and x ∈ R ∪ T. Thus, x ∈ (R ∪ S) ∩ (R ∪ T). If x ∉ R but x ∈ S ∩ T, then x ∈ S and
x ∈ T. Hence, x ∈ R ∪ S and x ∈ R ∪ T. Thus, x ∈ (R ∪ S) ∩ (R ∪ T). Hence, we see that
R ∪ (S ∩ T) ⊆ (R ∪ S) ∩ (R ∪ T).
Now, let y ∈ (R ∪ S) ∩ (R ∪ T). Then, y ∈ R ∪ S and y ∈ R ∪ T. Now, if y ∈ R ∪ S then either
y ∈ R or y ∈ S or both. If y ∈ R then clearly y ∈ R ∪ (S ∩ T). If y ∉ R then the conditions
y ∈ R ∪ S and y ∈ R ∪ T imply that y ∈ S and y ∈ T. Thus, y ∈ S ∩ T and hence
y ∈ R ∪ (S ∩ T). This shows that (R ∪ S) ∩ (R ∪ T) ⊆ R ∪ (S ∩ T) and hence we get a
complete proof of the first distributive law.
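
As a sanity check (not a proof), both distributive laws can be tested exhaustively over all subsets of a small set. The three-element universe and the helper `subsets` below are assumptions made for this sketch.

```python
from itertools import combinations

def subsets(universe):
    """Return all subsets of `universe` as frozensets."""
    items = list(universe)
    return [frozenset(c)
            for r in range(len(items) + 1)
            for c in combinations(items, r)]

U = {1, 2, 3}                 # assumed small universe
all_sets = subsets(U)         # 2^3 = 8 subsets

# R ∪ (S ∩ T) = (R ∪ S) ∩ (R ∪ T)
first_law = all(R | (S & T) == (R | S) & (R | T)
                for R in all_sets for S in all_sets for T in all_sets)

# R ∩ (S ∪ T) = (R ∩ S) ∪ (R ∩ T)
second_law = all(R & (S | T) == (R & S) | (R & T)
                 for R in all_sets for S in all_sets for T in all_sets)

print(first_law, second_law)
```

A finite check of this kind can expose a misstated identity quickly, but only the element-chasing argument above establishes the law for arbitrary sets.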

Exercise 1.1.9. 1. Complete the proof of Lemma 1.1.8.

2. Prove the following statements:
(a) S ∪ (S ∩ T) = S ∩ (S ∪ T) = S.
(b) S ⊆ T if and only if S ∪ T = T.
(c) If R ⊆ T and S ⊆ T then R ∪ S ⊆ T.
(d) If R ⊆ S and R ⊆ T then R ⊆ S ∩ T.
(e) If S ⊆ T then R ∪ S ⊆ R ∪ T and R ∩ S ⊆ R ∩ T.
(f) If S ∪ T ≠ ∅ then either S ≠ ∅ or T ≠ ∅.
(g) If S ∩ T ≠ ∅ then both S ≠ ∅ and T ≠ ∅.
(h) S = T if and only if S ∪ T = S ∩ T.

1.1.2 Set Difference, Set Complement and the Power Set

Definition 1.1.10. [Set Difference, Symmetric Difference] Let X and Y be two sets.
1. The set difference of X and Y, denoted by X \ Y, is defined by X \ Y = {x ∈ X : x ∉ Y}.

2. The symmetric difference of X and Y, denoted by X∆Y, is defined by
X∆Y = (X \ Y) ∪ (Y \ X).

Example 1.1.11. 1. Let A = {1, 2, 4, 18} and B = {x : x is an integer, 0 < x ≤ 5}. Then,

A \ B = {18}, B \ A = {3, 5} and A∆B = {3, 5, 18}.

2. Let S = {x ∈ R : 0 ≤ x ≤ 1} and T = {x ∈ R : .5 ≤ x < 7}. Then,

S \ T = {x ∈ R : 0 ≤ x < .5} and T \ S = {x ∈ R : 1 < x < 7}.

3. Let A = {{b, c}, {{b}, {c}}, b} and B = {a, b, c}. Then

A \ B = {{b, c}, {{b}, {c}}}, B \ A = {a, c} and A∆B = {a, c, {b, c}, {{b}, {c}}}.
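
The first item of this example can be checked with Python sets, where `-` is set difference and `^` is symmetric difference. This is a minimal sketch using the sets A and B of item 1.

```python
A = {1, 2, 4, 18}
B = set(range(1, 6))          # integers x with 0 < x <= 5

print(sorted(A - B))          # A \ B
print(sorted(B - A))          # B \ A
print(sorted(A ^ B))          # A ∆ B

# The symmetric difference agrees with its definition (A \ B) ∪ (B \ A).
print(A ^ B == (A - B) | (B - A))
```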

In many set theory problems, all sets are defined to be subsets of some reference set, referred
to as the universal set, denoted mostly by U . We now define the complement of a set.

Definition 1.1.12. [Set complement] Let U be the universal set and X ⊆ U. Then, the
complement of X, denoted by X′, is defined as X′ = {x ∈ U : x ∉ X}.

We now state a few properties that directly follow from the definition and hence the proofs
are omitted.

Lemma 1.1.13. Let U be the universal set and S, T ⊆ U. Then,

1. U′ = ∅ and ∅′ = U.
2. S ∪ S′ = U and S ∩ S′ = ∅.
3. S ∪ U = U and S ∩ U = S.
4. (S′)′ = S.
5. S ⊆ S′ if and only if S = ∅.
6. S ⊆ T if and only if T′ ⊆ S′.
7. S = T′ if and only if S ∩ T = ∅ and S ∪ T = U.
8. S \ T = S ∩ T′ and T \ S = T ∩ S′.
9. S∆T = (S ∪ T) \ (S ∩ T).
10. De-Morgan’s Laws:
(a) (S ∪ T)′ = S′ ∩ T′.
(b) (S ∩ T)′ = S′ ∪ T′.

The De-Morgan’s laws help us to convert arbitrary set expressions into those that involve
only complements and unions or only complements and intersections.
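
A few parts of Lemma 1.1.13 can be spot-checked in Python once a universal set is fixed. The universe `U`, the sample sets `S` and `T`, and the helper `complement` are assumptions chosen for this sketch; a check on one example does not replace the proofs.

```python
U = set(range(10))            # assumed universal set {0, 1, ..., 9}
S = {0, 1, 2, 3}
T = {2, 3, 4, 5}

def complement(X, universe=U):
    """Complement of X relative to the universal set."""
    return universe - X

# De-Morgan's laws.
print(complement(S | T) == complement(S) & complement(T))
print(complement(S & T) == complement(S) | complement(T))

# Parts 4, 8 and 9 of the lemma.
print(complement(complement(S)) == S)
print(S - T == S & complement(T))
print(S ^ T == (S | T) - (S & T))
```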

Definition 1.1.14. [Power Set] Let X be a subset of a set Ω. Then the set that contains all
subsets of X is called the power set of X and is denoted by P(X) or 2^X.

Example 1.1.15. 1. Let X = ∅. Then P(∅) = {∅, X} = {∅}.

2. Let X = {∅}. Then P(X) = {∅, X} = {∅, {∅}}.

3. Let X = {a, b, c}. Then P(X) = {∅, {a}, {b}, {c}, {a, b}, {a, c}, {b, c}, {a, b, c}}.

4. Let X = {{b, c}, {{b}, {c}}}. Then P(X) = {∅, {{b, c}}, {{{b}, {c}}}, {{b, c}, {{b}, {c}}} }.
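
For a finite set, the power set can be enumerated by taking all combinations of every size. The function `power_set` below is a sketch written for this purpose (it is not a standard-library function); subsets are returned as `frozenset` objects so that they can themselves be elements of a set.

```python
from itertools import chain, combinations

def power_set(X):
    """Return the set of all subsets of the finite set X."""
    items = list(X)
    all_combos = chain.from_iterable(
        combinations(items, r) for r in range(len(items) + 1))
    return {frozenset(c) for c in all_combos}

empty = frozenset()
print(power_set(set()) == {empty})                            # P(∅) = {∅}
print(power_set({empty}) == {empty, frozenset({empty})})      # P({∅}) = {∅, {∅}}
print(len(power_set({"a", "b", "c"})))                        # 2^3 subsets
```

The count in the last line illustrates why the notation 2^X is used: a set with n elements has 2^n subsets.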

1.2 Relations and Functions

We start with the definition of the cartesian product of two sets and use it to define relations.
Note that this is another method to construct new sets from given set(s).

Definition 1.2.1. [Cartesian Product] Let X and Y be two sets. Then their cartesian
product, denoted by X × Y, is defined as X × Y = {(a, b) : a ∈ X, b ∈ Y}. Thus,

(a₁, b₁) = (a₂, b₂) if and only if a₁ = a₂ and b₁ = b₂.

Example 1.2.2. 1. Let A = {a, b, c} and B = {1, 2, 3, 4}. Then

A × A = {(a, a), (a, b), (a, c), (b, a), (b, b), (b, c), (c, a), (c, b), (c, c)}.
A × B = {(a, 1), (a, 2), (a, 3), (a, 4), (b, 1), (b, 2), (b, 3), (b, 4), (c, 1), (c, 2), (c, 3), (c, 4)}.

2. The Euclidean plane, denoted by R², is R² = R × R = {(x, y) : x, y ∈ R}.

3. By convention, ∅ × B = A × ∅ = ∅. In fact, A × B = ∅ if and only if A = ∅ or B = ∅.

4. One can use the product construction several times, e.g., if X, Y and Z are sets then

X × Y × Z = {(x, y, z) : x ∈ X, y ∈ Y, z ∈ Z} = (X × Y ) × Z = X × (Y × Z).
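
Finite cartesian products can be built with `itertools.product`. The sketch below reproduces A × B from item 1 and illustrates item 4: in Python the nested pair ((x, y), z) and the triple (x, y, z) are different objects, so the equality (X × Y) × Z = X × Y × Z is realised here through the obvious flattening. The small sets X, Y and Z are assumptions for illustration.

```python
from itertools import product

A = ["a", "b", "c"]
B = [1, 2, 3, 4]

AxB = set(product(A, B))
print(len(AxB))                       # |A × B| = |A| · |B|
print(set(product([], B)) == set())   # ∅ × B = ∅

# Item 4: identify ((x, y), z) with (x, y, z).
X, Y, Z = [0, 1], ["p", "q"], ["u", "v"]
flat = set(product(X, Y, Z))
nested = set(product(product(X, Y), Z))
flattened = {(x, y, z) for ((x, y), z) in nested}
print(flattened == flat)
```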

Exercise 1.2.3. Let A, B, C and D be non-empty sets. Then, prove the following statements:
1. A × (B ∪ C) = (A × B) ∪ (A × C).
2. A × (B ∩ C) = (A × B) ∩ (A × C).
3. (A × B) ∩ (C × D) = (A ∩ C) × (B ∩ D).
4. (A × B) ∪ (C × D) ⊆ (A ∪ C) × (B ∪ D). Give an example to show that the reverse inclusion
need not hold.
Ans: Let A = {1}, B = {2}, C = {3} and D = {4}. Then, A × B = {(1, 2)} and
C × D = {(3, 4)}. So, (A × B) ∪ (C × D) = {(1, 2), (3, 4)}, whereas (A ∪ C) × (B ∪ D) =
{(1, 2), (1, 4), (3, 2), (3, 4)}.

Definition 1.2.4. [Relation] Let X and Y be two non-empty sets. A relation R from X to
Y is a subset of X × Y . We write xRy to mean (x, y) ∈ R ⊆ X × Y . Thus, for any two sets
X and Y , the sets ∅ and X × Y are always relations from X to Y . A relation from X to X is
called a relation on X.

Example 1.2.5. 1. Let X be any non-empty set and consider the set P(X). Then one can
define a relation R on P(X) by R = {(S, T ) ∈ P(X) × P(X) : S ⊆ T }.

2. Let A = {a, b, c, d}. Then, some of the relations R on A are:

(a) R = A × A.
(b) R = {(a, a), (b, b), (c, c), (d, d), (a, b), (a, c), (b, c)}.
(c) R = {(a, a), (b, b), (c, c)}.
(d) R = {(a, a), (a, b), (b, a), (b, b), (c, d)}.
(e) R = {(a, a), (a, b), (b, a), (a, c), (c, a), (c, c), (b, b)}.
(f) R = {(a, b), (b, c), (a, c), (d, d)}.

To draw pictures for relations on a set X, we first put a node for each element x ∈ X and
label it x. For each (x, y) ∈ R, we draw a directed line from x to y. If (x, x) ∈ R then a
loop is drawn at x. The figures for some of these relations are given in Figure 1.1.

3. Let A = {1, 2, 3}, B = {a, b, c} and let R = {(1, a), (1, b), (2, c)}. Figure 1.2 represents the
relation R.¹

4. Let A = Z, the set of integers. Then

R = {(x, y) : x, y ∈ Z and y = x + 5m, for some m ∈ Z}

is a relation on Z. If we try to draw a picture for this relation then, apart from the loops, there
is no arrow between any two elements of {1, 2, 3, 4, 5}.
¹ We use pictures to help our understanding; they are not part of any proof.

[Figure 1.1: Graphic representation of some of the relations in Example 2. Panels: A × A, Example 2.b, Example 2.c.]

[Figure 1.2: Graphic representation of the relation in Example 3.]

5. Let A = Z, the set of integers. For a fixed positive integer n, let

R = {(x, y) : x, y ∈ Z and y = x + nm, for some m ∈ Z}.

Then, R is a relation on Z. Apart from the loops, a picture for this relation has no arrow
between any two elements of {1, 2, 3, . . . , n}.
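
Since Z is infinite, the relation of items 4 and 5 can only be inspected on a finite window. The sketch below does this for n = 5; the window from −10 to 10 and the function name `congruence_relation` are assumptions made for illustration.

```python
def congruence_relation(n, window):
    """Pairs (x, y) from `window` with y = x + n*m for some integer m."""
    return {(x, y) for x in window for y in window if (y - x) % n == 0}

window = range(-10, 11)       # assumed finite window of Z
R = congruence_relation(5, window)

# Arrows between two different elements of {1, 2, 3, 4, 5}: there are none.
arrows = {(x, y) for (x, y) in R
          if x != y and 1 <= x <= 5 and 1 <= y <= 5}
print(arrows)

# Every element is related to itself (take m = 0), so each node has a loop.
print(all((x, x) in R for x in window))

print((1, 6) in R, (2, -8) in R, (1, 2) in R)
```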

Definition 1.2.6. [Inverse Relation] Let X and Y be two non-empty sets and let R be a
relation in X × Y. Then, the inverse relation, denoted by R⁻¹, is a subset of Y × X and is
defined by R⁻¹ = {(b, a) ∈ Y × X : (a, b) ∈ R}. So, for all a ∈ X and b ∈ Y,

aRb if and only if bR⁻¹a.

Example 1.2.7. 1. If R = {(1, a), (1, b), (2, c)} then R⁻¹ = {(a, 1), (b, 1), (c, 2)}.

2. Let R = {(a, b), (b, c), (a, c)} be a relation on A = {a, b, c}. Then R⁻¹ = {(b, a), (c, b), (c, a)}.

Definition 1.2.8. [Partial Function, Pre-image, Image] Let X and Y be two non-empty
sets and let f be a relation in X × Y.

1. Then, f is called a partial function from X to Y, denoted by f : X → Y, if for every
a ∈ X and b, b′ ∈ Y the condition (a, b), (a, b′) ∈ f implies that b = b′. In such a case, one
writes f(a) = b, i.e., f(a) = b if b is the unique element of Y such that (a, b) ∈ f. Note
that it may happen that for a particular choice of a ∈ X, (a, b) ∉ f, for any b ∈ Y. In this
case, one says that f(a) is undefined.

2. Let f : X → Y be a partial function and let f(x) = y. Then, x is called a pre-image of
y and y is called an image of x. Also, for any set Z, one defines

f(Z) := {b : f(x) = b, for some x ∈ Z}.

Thus, note that f(Z) = ∅ if Z ∩ X = ∅.

Example 1.2.9. Let A = {a, b, c, d} and B = {1, 2, 3, 4} and X = {3, 4, b, c}.

1. If R₁ = {(a, 1), (b, 1), (c, 2)} is a relation in A × B then
(a) R₁ is a partial function.
(b) R₁(a) = 1, R₁(b) = 1, R₁(c) = 2. Also, R₁(d) is undefined. Thus, R₁({d}) = ∅.
(c) R₁(X) = {1, 2}.
(d) R₁⁻¹({1}) = {a, b} and R₁⁻¹(2) = c as R₁⁻¹ = {(1, a), (1, b), (2, c)}. For x ∈ X, R₁⁻¹(x)
is not defined and hence R₁⁻¹(X) = ∅.

2. If R₂ = {(a, 1), (b, 4), (c, 2), (d, 3)} is a relation in A × B then
(a) R₂ is a partial function.
(b) R₂(a) = 1, R₂(b) = 4, R₂(c) = 2 and R₂(d) = 3.
(c) R₂(X) = {2, 4}.
(d) R₂⁻¹(1) = a, R₂⁻¹(2) = c, R₂⁻¹(3) = d and R₂⁻¹(4) = b. Also, R₂⁻¹(X) = {b, d}.
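
The computations of this example can be replayed by storing a relation as a Python set of pairs. The helper names `is_partial_function`, `image` and `inverse` are hypothetical, introduced only for this sketch.

```python
R1 = {("a", 1), ("b", 1), ("c", 2)}
R2 = {("a", 1), ("b", 4), ("c", 2), ("d", 3)}
X = {3, 4, "b", "c"}

def is_partial_function(rel):
    """True when no first coordinate is paired with two second coordinates."""
    firsts = [a for (a, _) in rel]
    return len(firsts) == len(set(firsts))

def image(rel, Z):
    """rel(Z) = {b : (a, b) in rel for some a in Z}."""
    return {b for (a, b) in rel if a in Z}

def inverse(rel):
    """The inverse relation {(b, a) : (a, b) in rel}."""
    return {(b, a) for (a, b) in rel}

print(is_partial_function(R1), is_partial_function(inverse(R1)))
print(image(R1, X), image(R1, {"d"}))
print(image(inverse(R1), {1}), image(inverse(R1), X))
print(image(R2, X), image(inverse(R2), X))
```

The inverse of R₁ pairs 1 with both a and b, so it is a relation but not a partial function, whereas the inverse of R₂ is again a partial function.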

Definition 1.2.10. [Domain, Range, Function] Let X and Y be two non-empty sets and let
f : X → Y be a partial function.

1. Then, the domain¹ of f, denoted by dom f := {a : (a, b) ∈ f}, is the set of all pre-images
of f.
2. Then, the range of f, denoted by rng f := {b : (a, b) ∈ f}, is the collection of images of f.
3. If dom f = X then the partial function f is called a total function on X, or a function
from X to Y.

Convention:
Let p(x) be a polynomial in x with integer coefficients. Then, by writing ‘f : Z → Z is
a function defined by f(x) = p(x)’, we mean the function f = {(a, p(a)) : a ∈ Z}. For
example, the function f(x) = x² stands for the set {(a, a²) : a ∈ Z}.

Example 1.2.11. 1. For A = {a, b, c, d} and B = {1, 3, 5}, let f = {(a, 5), (b, 1), (d, 5)} be a
relation in A × B. Then, f is a partial function with dom f = {a, b, d} and rng f = {1, 5}.
Further, we can define a function g : {a, b, d} → {1, 5} by g(a) = 5, g(b) = 1 and g(d) = 5.
Also, using g, one obtains the relation g⁻¹ = {(1, b), (5, a), (5, d)}.
2. Note that the following relations f : Z → Z are indeed functions.

(a) f = {(x, 1) | x is even} ∪ {(x, 5) | x is odd}.

¹ The domain set is the set from which we define our relations, whereas dom f is the domain of
the particular partial function f. They are different.

(b) f = {(x, −1) | x ∈ Z}.

(c) f = {(x, x (mod 10)) | x ∈ Z}, where x (mod 10) gives the remainder when 10
divides x.
(d) f = {(x, 1) | x < 0} ∪ {(0, 0)} ∪ {(x, −1) | x > 0}.

Remark 1.2.12. 1. If X = ∅, then by convention, one assumes that there is a function,
called the empty function, from X to Y.

2. If Y = ∅ and X ≠ ∅, then it can be easily observed that there is no function from X to Y.

3. Individual relations and functions are also sets. Therefore, one can have equality between
relations and functions, i.e., they are equal if and only if they contain the same set of
pairs. For example, let A = {−1, 0, 1}. Then, the functions f, g, h : A → A defined by
f(x) = x, g(x) = x|x| and h(x) = x³ are equal as the three functions correspond to the
relation R = {(−1, −1), (0, 0), (1, 1)} on A.

4. Some books use the word ‘map’ in place of ‘function’. So, both the words are used
interchangeably throughout the book.

5. Throughout the book, whenever the phrase ‘let f : X → Y be a function’ is used, it will be
assumed that both X and Y are nonempty sets.
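
Item 3 of the remark can be seen concretely by writing each function as its set of pairs. The sketch below also shows, on a larger domain chosen here only for contrast, that the same three formulas no longer define equal functions.

```python
A = [-1, 0, 1]

f = {(x, x) for x in A}
g = {(x, x * abs(x)) for x in A}
h = {(x, x ** 3) for x in A}

print(f == g == h)                    # same set of pairs on A
print(f == {(-1, -1), (0, 0), (1, 1)})

# On the (assumed) larger domain {-2, ..., 2} the formulas differ.
A2 = [-2, -1, 0, 1, 2]
f2 = {(x, x) for x in A2}
g2 = {(x, x * abs(x)) for x in A2}
print(f2 == g2)
```

Equality of functions is therefore a statement about pairs, not about the formulas used to describe them.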

The following is an immediate consequence of the definition.

Proposition 1.2.13. Let f be a non-empty relation in A × B and S be any set. Then,

1. f(S) ≠ ∅ if and only if dom(f) ∩ S ≠ ∅.

2. f⁻¹(S) ≠ ∅ if and only if rng(f) ∩ S ≠ ∅.

Proof. We prove only one direction of each equivalence. The other direction is left for the reader.
Part 1: Since f(S) ≠ ∅, one can find a ∈ S ∩ A and b ∈ B such that (a, b) ∈ f. This, in turn,
implies that a ∈ dom(f) ∩ S (as a ∈ S).
Part 2: Since rng(f) ∩ S ≠ ∅, one can find b ∈ rng(f) ∩ S and a ∈ A such that (a, b) ∈ f. This,
in turn, implies that a ∈ f⁻¹(b) ⊆ f⁻¹(S).

Some important functions are now defined.

Definition 1.2.14. [Identity and Zero functions] Let X be a non-empty set.

1. Then the relation Id := {(x, x) : x ∈ X} is called the identity relation on X.
2. Then the function f : X → X defined by f (x) = x, for all x ∈ X, is called the identity
function and is denoted by Id.
3. Then the function f : X → R with f (x) = 0, for all x ∈ X, is called the zero function
and is denoted by 0.
Exercise 1.2.15. 1. Do the following relations represent functions? If yes, why?

(a) Let f : Z → Z be defined by
i. f = {(x, 1) | 2 divides x} ∪ {(x, 5) | 3 divides x}.
ii. f = {(x, 1) | x ∈ S} ∪ {(x, −1) | x ∈ S′}, where S = {n² : n ∈ Z} and S′ = Z \ S.
iii. f = {(x, x³) | x ∈ Z}.
(b) Let f : R⁺ → R be defined by f = {(x, ±√x) | x ∈ R⁺}.
(c) Let f : R → R be defined by f = {(x, √x) | x ∈ R}.
(d) Let f : R → C be defined by f = {(x, √x) | x ∈ R}.
(e) Let f : R∗ → R be defined by f = {(x, logₑ|x|) | x ∈ R∗}.
(f) Let f : R → R be defined by f = {(x, tan x) | x ∈ R}.

2. Let f : X → Y be a function. Then f⁻¹ is a relation in Y × X and the following results
hold for f⁻¹.
(a) f⁻¹(A ∪ B) = f⁻¹(A) ∪ f⁻¹(B), for each A, B ⊆ Y.
(b) f⁻¹(A ∩ B) = f⁻¹(A) ∩ f⁻¹(B), for each A, B ⊆ Y.
(c) f⁻¹(∅) = ∅.
(d) f⁻¹(Y) = X.
(e) f⁻¹(B′) = (f⁻¹(B))′, for each B ⊆ Y, where B′ is the complement of B in Y and
(f⁻¹(B))′ is the complement of f⁻¹(B) in X.
Ans: Note that x ∈ f⁻¹(B′) ⇔ f(x) ∈ B′ = Y \ B, i.e., x ∈ f⁻¹(B′) ⇔ f(x) ∈ Y
but f(x) ∉ B. So, x ∈ f⁻¹(B′) ⇔ x ∈ f⁻¹(Y) = X but x ∉ f⁻¹(B). Or equivalently,
x ∈ f⁻¹(B′) ⇔ x ∈ X \ f⁻¹(B) = (f⁻¹(B))′.

Definition 1.2.16. [One-one/Injection] A function f : X → Y is called one-one (also called
an injection), if f(x) ≠ f(y) is true for each pair x ≠ y in X. Equivalently, f is one-one if
x = y is true for each pair x, y ∈ X for which f(x) = f(y).

Example 1.2.17. 1. Let A be a non-empty set. Then the identity map, Id, is one-one.
2. Let ∅ ≠ A ⊊ B. Then f(x) = x is a one-one map from A to B.
3. The function f : Z → Z defined by f(x) = x² is not one-one as f(−1) = f(1) = 1.
4. The function f : {1, 2, 3} → {a, b, c, d} defined by f (1) = c, f (2) = b and f (3) = a, is
one-one. It can be checked that there are 24 one-one functions f : {1, 2, 3} → {a, b, c, d}.
5. There is no one-one function from the set {1, 2, 3} to its proper subset {1, 2}.
Ans: Suppose there is a function f : {1, 2, 3} → {1, 2} which is one-one. Then, by the
definition of one-one, f (1), f (2) and f (3) are three distinct elements in {1, 2}, which has
exactly two distinct elements. Thus, a contradiction.
6. There are one-one functions from the set N of natural numbers to its proper subset
{2, 3, . . .}. One of them is given by f (1) = 4, f (2) = 3, f (3) = 2 and f (n) = n + 1,
for all n ≥ 4.
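
The count in item 4 and the claim in item 5 can be confirmed by brute force, listing every function between the two finite sets. The helpers `all_functions` and `is_one_one` are hypothetical names for this sketch; a function is stored as a dictionary.

```python
from itertools import product

def all_functions(domain, codomain):
    """Yield every total function domain -> codomain as a dict."""
    domain = list(domain)
    for values in product(list(codomain), repeat=len(domain)):
        yield dict(zip(domain, values))

def is_one_one(f):
    """Distinct inputs must have distinct outputs."""
    return len(set(f.values())) == len(f)

# Item 4: one-one functions from {1, 2, 3} to {a, b, c, d}.
injections = [f for f in all_functions([1, 2, 3], "abcd") if is_one_one(f)]
print(len(injections))

# Item 5: no one-one function from {1, 2, 3} to {1, 2}.
print(any(is_one_one(f) for f in all_functions([1, 2, 3], [1, 2])))
```

The first count agrees with the direct argument: there are 4 choices for f(1), then 3 for f(2), then 2 for f(3).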

Definition 1.2.18. [Restriction function] Let f : X → Y be a function and A ⊆ X, A ≠ ∅.
Then, by f_A, we denote the function f_A = {(x, y) : (x, y) ∈ f, x ∈ A}, called the restriction
of f to A.

Example 1.2.19. Define f : R → R as f(x) = 1, if x is irrational and f(x) = 0, if x is rational.
Then, f_Q : Q → R is the constant 0 function.

Proposition 1.2.20. Let f : X → Y be a one-one function and Z be a nonempty subset of X.
Then, f_Z is also one-one.

Proof. Suppose f_Z(x) = f_Z(y), for some x, y ∈ Z. Then, by definition of f_Z, we have
f(x) = f(y). As f is one-one, we get x = y. Thus, f_Z is one-one.

Definition 1.2.21. [Onto/Surjection] A function f : X → Y is called onto (also called a
surjection), if f⁻¹(b) ≠ ∅, for each b ∈ Y. Equivalently, f : X → Y is onto if ‘each b ∈ Y has
some pre-image in X’.

Example 1.2.22. 1. Let A be a non-empty set. Then the identity map, Id, is onto.

2. Let ∅ ≠ A ⊊ B. Then the map f : A → B defined by f(x) = x is not onto, as A ⊊ B.

3. There are 6 onto functions from {1, 2, 3} to {1, 2}. For example, f(1) = 1, f(2) = 2, and
f(3) = 2 is one such function.

4. Let ∅ ≠ A ⊊ B. Choose a ∈ A. Then the map g defined by g(y) = y, if y ∈ A, and
g(y) = a, if y ∈ B \ A, is an onto map from B to A.

5. There is no onto function from the set {1, 2} to its proper superset {1, 2, 3}.

6. There are onto functions from the set {2, 3, . . .} to its proper superset N, the set of natural
numbers. One of them is given by f (n) = n − 1, for all n ≥ 2.

Definition 1.2.23. [Bijection/One-One Correspondence, Equivalent Set] Let X and Y be
two sets. A function f : X → Y is said to be a bijection if f is one-one as well as onto. The
sets X and Y are said to be equivalent if there exists a bijection f : X → Y.

Example 1.2.24. 1. The function f : {1, 2, 3} → {a, b, c} defined by f (1) = c, f (2) = b and
f (3) = a, is a bijection. Thus, the set {a, b, c} is equivalent to {1, 2, 3}.

2. Let A be a non-empty set. Then the identity map, Id, is a bijection. Thus, the set A is
equivalent to itself.

3. The set N is equivalent to {2, 3, . . .}. Indeed the function f : N → {2, 3, . . .} defined by
f (1) = 3, f (2) = 2 and f (n) = n + 1, for all n ≥ 3 is a bijection.
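
For finite sets, the statements about onto functions in Example 1.2.22 and the first bijection in Example 1.2.24 can be verified by enumeration. As before, the helper names are hypothetical and functions are stored as dictionaries.

```python
from itertools import product

def all_functions(domain, codomain):
    """Yield every total function domain -> codomain as a dict."""
    domain = list(domain)
    for values in product(list(codomain), repeat=len(domain)):
        yield dict(zip(domain, values))

def is_onto(f, codomain):
    """Every element of the codomain must have some pre-image."""
    return set(f.values()) == set(codomain)

def is_bijection(f, codomain):
    """One-one (no repeated values) as well as onto."""
    return is_onto(f, codomain) and len(set(f.values())) == len(f)

# Onto functions from {1, 2, 3} to {1, 2}.
onto_count = sum(is_onto(f, [1, 2]) for f in all_functions([1, 2, 3], [1, 2]))
print(onto_count)

# No onto function from {1, 2} to {1, 2, 3}.
print(any(is_onto(f, [1, 2, 3]) for f in all_functions([1, 2], [1, 2, 3])))

# The map 1 -> c, 2 -> b, 3 -> a is a bijection onto {a, b, c}.
f = {1: "c", 2: "b", 3: "a"}
print(is_bijection(f, "abc"))
```

The count of onto functions matches 2³ − 2: of the 8 functions into {1, 2}, only the two constant ones fail to be onto.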

Exercise 1.2.25. 1. Define f : N → Z by f = {(x, −x/2) | x is even} ∪ {(x, (x + 1)/2) | x is odd}.
Is f one-one? Is it onto?

2. Define f : N → Z and g : Z → Z by f = {(x, 2x) | x ∈ N} and g = {(x, x/2) | x is even} ∪
{(x, 0) | x is odd}. Are f and g one-one? Are they onto?

3. Let A be the class of subsets of {1, 2, . . . , 9} of size 5 and B be the class of 5 digit numbers
with strictly increasing digits. For a ∈ A, define f(a) to be the number obtained by arranging
the elements of a in increasing order. Is f one-one and onto?

1.2.1 Composition of Functions

Definition 1.2.26. [Composition of relations] Let f and g be two relations such that rng f ⊆
dom g. Then, the composition of f and g, denoted by g ◦ f , is defined as
g ◦ f = {(x, z) : (x, y) ∈ f and (y, z) ∈ g for some y ∈ rng(f) ⊆ dom(g)}.

It is a relation. In case both f and g are functions, (g ◦ f)(x) = g(f(x)), as (x, z) ∈ g ◦ f
implies that there exists y such that y = f(x) and z = g(y). Similarly, one defines f ◦ g if
rng g ⊆ dom f.

Example 1.2.27. Take f = {(β, a), (3, b), (3, c)} and g = {(a, 3), (b, β), (c, β)}. Then, g ◦ f =
{(3, β), (β, 3)} and f ◦ g = {(a, b), (a, c), (b, a), (c, a)}.
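
The computation in Example 1.2.27 can be reproduced mechanically. The sketch below stores a relation as a Python set of pairs; the function name `compose` is our own choice, and the symbol β is represented by the string "β".

```python
def compose(g, f):
    """Return g o f = {(x, z) : (x, y) in f and (y, z) in g for some y}."""
    return {(x, z) for (x, y) in f for (y2, z) in g if y == y2}

f = {("β", "a"), (3, "b"), (3, "c")}
g = {("a", 3), ("b", "β"), ("c", "β")}

print(compose(g, f) == {(3, "β"), ("β", 3)})                               # True
print(compose(f, g) == {("a", "b"), ("a", "c"), ("b", "a"), ("c", "a")})   # True
```

Note that the order of the arguments follows the notation g ◦ f: the relation f is applied first.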

The proof of the next result is omitted as it follows directly from the definition.

Proposition 1.2.28. [Algebra of composition of functions] Let f : A → B, g : B → C and

h : C → D be functions.
1. Then, (h ◦ g) ◦ f : A → D and h ◦ (g ◦ f ) : A → D are functions. Moreover, (h ◦ g) ◦ f =
h ◦ (g ◦ f ) (associativity holds).
2. If f and g are injections then g ◦ f : A → C is an injection.

3. If f and g are surjections then g ◦ f : A → C is a surjection.

4. If f and g are bijections then g ◦ f : A → C is a bijection.

5. [Extension] If f and h are bijections with dom f ∩ dom h = ∅ and rng f ∩ rng h = ∅, then the function f ∪ h from A ∪ C
to B ∪ D defined by f ∪ h = {(a, f(a)) : a ∈ A} ∪ {(c, h(c)) : c ∈ C} is a bijection.
6. Let A and B be sets with at least two elements each and let f : A → B be a bijection.
Then, the number of bijections from A to B is at least 2.

Theorem 1.2.29. [Properties of identity function] Let A and B be two nonempty sets and
Id : A → A be the identity function. Then, for any two functions f : A → B and g : B → A

1. the map f ◦ Id = f .

2. the map Id ◦ g = g.

Proof. Part 1: By definition, (f ◦ Id)(a) = f (Id(a)) = f (a), for all a ∈ A. Hence, f ◦ Id = f .

Part 2: The readers are advised to supply the proof.

We now give a very important bijection principle.

Theorem 1.2.30. [bijection principle] Let f : A → B and g : B → A be functions such that
(g ◦ f)(a) = a, for each a ∈ A. Then
1. f is one-one and
2. g is onto.

Proof. Let (g ◦ f)(a) = a, for each a ∈ A. To prove the first part, let us assume that f(a_1) = f(a_2),
for some a_1, a_2 ∈ A. Then, using the given condition,

a_1 = (g ◦ f)(a_1) = g(f(a_1)) = g(f(a_2)) = (g ◦ f)(a_2) = a_2.

Thus, f is one-one and this completes the proof of the first part.

For the second part, let a ∈ A. As (g ◦ f)(a) = a, we see that for b = f(a), one has g(b) =
g(f(a)) = (g ◦ f)(a) = a. Thus, we have found b ∈ B such that g(b) = a. Hence, g is onto and
this completes the required proof.
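
To see that the bijection principle gives nothing more than what it states, consider the following small check. It is only an illustration: the sets A and B and the maps f and g are our own choices, with functions stored as Python dictionaries.

```python
def is_one_one(f):
    """f (a dict) is one-one when no value is repeated."""
    return len(set(f.values())) == len(f)

def is_onto(f, codomain):
    """f (a dict) is onto when its values cover the codomain."""
    return set(f.values()) == set(codomain)

A = {1, 2}
B = {"p", "q", "r"}
f = {1: "p", 2: "q"}            # f : A -> B
g = {"p": 1, "q": 2, "r": 1}    # g : B -> A

# Hypothesis of the theorem: (g o f)(a) = a for each a in A.
left_inverse = all(g[f[a]] == a for a in A)

print(left_inverse, is_one_one(f), is_onto(g, A))   # True True True
print(is_onto(f, B), is_one_one(g))                 # False False
```

Here f is one-one and g is onto, as the theorem predicts, but f is not onto and g is not one-one.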

Exercise 1.2.31. 1. Let f, g : N → N be defined by f = {(x, 2x) | x ∈ N} and g = {(x, x/2) |
x is even} ∪ {(x, 0) | x is odd}. Then, verify that g ◦ f is the identity map on N, whereas
f ◦ g maps even numbers to themselves and maps odd numbers to 0.
2. Let f : X → Y be a function. Then, prove that f^{-1} : Y → X is a function if and only if
f is a bijection.
Ans: Let f^{-1} be a function. Then, for each y ∈ Y, there is a unique x ∈ X such that
f^{-1}(y) = x. Thus, by definition (y, x) ∈ f^{-1} and hence (x, y) ∈ f, or equivalently, f(x) = y.
So, f is onto.
To prove f is one-one, let us assume that f(x_1) = f(x_2), for some x_1, x_2 ∈ X. We need to
show x_1 = x_2. Since f^{-1} is a function, the image of each element of Y is unique. As
f(x_1) = f(x_2), the images of f(x_1) and f(x_2) under f^{-1} are the same element of X. But, by
definition f^{-1}(f(x_1)) = x_1 and f^{-1}(f(x_2)) = x_2 and hence x_1 = x_2. This completes the
proof that f is one-one.
Now, let us assume that f is a bijection. We need to prove that f^{-1} is a function. So, we need
to show that dom f^{-1} = Y and for each y ∈ Y there is a unique x ∈ X such that f^{-1}(y) = x.
Since f is onto, rng f = Y and hence dom f^{-1} = rng f = Y. As f is one-one, by definition
f(x_1) ≠ f(x_2) whenever x_1 ≠ x_2. Hence, no element of Y is the image under f of two distinct
elements of X, and so for each y ∈ Y there is a unique x ∈ X such that f^{-1}(y) = x.
Thus, f^{-1} is a function.
3. Define f : N × N → N by f(m, n) = 2^{m−1}(2n − 1). Is f a bijection?
4. Let f : X → Y be a bijection and A ⊆ X. Is f(A′) = (f(A))′?
5. Let f : X → Y and g : Y → X be two functions such that
(a) (f ◦ g)(y) = y holds, for each y ∈ Y .
(b) (g ◦ f )(x) = x holds, for each x ∈ X.

Show that f is a bijection and g = f^{-1}. Can we conclude the same without assuming the
second condition?
Ans: If (x, y) ∈ f, then y = f(x), so g(y) = g(f(x)) = x, that is, (y, x) ∈ g. Similarly,
(x, y) ∈ g ⇒ (y, x) ∈ f. So, g = f^{-1}.
To show that f is one-one, note that f(x) = f(y) ⇒ g(f(x)) = g(f(y)) ⇒ x = y.
To show that f is onto, let y ∈ Y. Then, there is g(y) ∈ X such that f(g(y)) = y.
We cannot conclude the same without condition (b). Take f : X = J_2 → J_1 = Y defined as f(1) =
1 = f(2). Take g : J_1 → J_2 defined by g(1) = 1. Then, f(g(y)) = y holds, for each y ∈ Y,
whereas f is not a bijection.

1.2.2 Equivalence Relation

Now that we have seen quite a few examples of relations, let us look at some of the properties
that are of interest in mathematics.

Definition 1.2.32. [Relations on Set] Let R be a relation on a non-empty set A. Then R

is said to be

1. reflexive if (a, a) ∈ R, for all a ∈ A.

2. symmetric if (b, a) ∈ R whenever (a, b) ∈ R.

3. anti-symmetric if, for all a, b ∈ A, (a, b), (b, a) ∈ R implies a = b.

4. transitive if, for all a, b, c ∈ A, (a, b), (b, c) ∈ R implies (a, c) ∈ R.
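
For a relation on a finite set, the four properties of Definition 1.2.32 can be tested directly. The following is a minimal sketch; the function names and the sample relation `leq` (the usual ≤ on {1, 2, 3}) are our own choices for illustration.

```python
def is_reflexive(R, A):
    """(a, a) is in R for every a in A."""
    return all((a, a) in R for a in A)

def is_symmetric(R):
    """(b, a) is in R whenever (a, b) is in R."""
    return all((b, a) in R for (a, b) in R)

def is_antisymmetric(R):
    """(a, b) and (b, a) both in R forces a = b."""
    return all(a == b for (a, b) in R if (b, a) in R)

def is_transitive(R):
    """(a, b) and (b, c) in R forces (a, c) in R."""
    return all((a, d) in R for (a, b) in R for (c, d) in R if b == c)

A = {1, 2, 3}
leq = {(a, b) for a in A for b in A if a <= b}

print(is_reflexive(leq, A), is_symmetric(leq),
      is_antisymmetric(leq), is_transitive(leq))   # True False True True
```

Only reflexivity needs the underlying set A as an argument; the other three properties depend on the pairs of R alone.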

Exercise 1.2.33. For relations defined in Example 1.2.5, determine which of them are

1. reflexive.

Ans: 1, 2.2a, 2.2b, 4, 5.

2. symmetric.

Ans: 2.2a, 2.2c, 2.2e, 4, 5.

3. anti-symmetric.
Ans: 1.

4. transitive.
Ans: 1, 2.2a, 2.2b, 2.2c, 2.2d, 2.2f, 4, 5.

We are now ready to define a relation that appears quite frequently in mathematics. Before
doing so, let us either use the symbol ∼ or ∼^R for relation. That is, if a, b ∈ A then we represent
(a, b) ∈ R by either a ∼ b or a ∼^R b.

Definition 1.2.34. [Equivalence Relation, Equivalence Class] Let ∼ be a relation on a

non-empty set A. Then ∼ is said to form an equivalence relation if ∼ is reflexive, symmetric
and transitive. The equivalence class containing a ∈ A, denoted [a], is defined as [a] := {b ∈
A : b ∼ a}.
Example 1.2.35. 1. Consider the relations on A that appear in Example 1.2.5. Then,

(a) Example 1.2.5.1 is not an equivalence relation (the relation is not symmetric).
(b) Example 1.2.5.2.2a is an equivalence relation with [a] = {a, b, c, d} as the only equivalence
class.
(c) Other relations in Example 1.2.5.2 are not equivalence relations.

(d) Example 1.2.5.4 is an equivalence relation with the equivalence classes as
i. [0] = {. . . , −15, −10, −5, 0, 5, 10, . . .}.
ii. [1] = {. . . , −14, −9, −4, 1, 6, 11, . . .}.
iii. [2] = {. . . , −13, −8, −3, 2, 7, 12, . . .}.
iv. [3] = {. . . , −12, −7, −2, 3, 8, 13, . . .}.
v. [4] = {. . . , −11, −6, −1, 4, 9, 14, . . .}.
(e) Example 1.2.5.5 is an equivalence relation with the equivalence classes as
i. [0] = {. . . , −3n, −2n, −n, 0, n, 2n, . . .}.
ii. [1] = {. . . , −3n + 1, −2n + 1, −n + 1, 1, n + 1, 2n + 1, . . .}.
iii. [2] = {. . . , −3n + 2, −2n + 2, −n + 2, 2, n + 2, 2n + 2, . . .}.
iv. [n − 2] = {. . . , −2n − 2, −n − 2, −2, n − 2, 2n − 2, 3n − 2, . . .}.
v. [n − 1] = {. . . , −2n − 1, −n − 1, −1, n − 1, 2n − 1, 3n − 1, . . .}.

2. Let R = {(a, a), (b, b), (c, c)} be a relation on A = {a, b, c}. Then, R forms an equivalence
relation with three equivalence classes, namely [a] = {a}, [b] = {b} and [c] = {c}.
3. Let R = {(a, a), (b, b), (c, c), (a, c), (c, a)} be a relation on A = {a, b, c}. Then, R forms an
equivalence relation with two equivalence classes, namely [a] = [c] = {a, c} and [b] = {b}.

Proposition 1.2.36. [Equivalence relation divides a set into disjoint classes] Let ∼ be an
equivalence relation on X.
1. Then any two equivalence classes are either disjoint or identical.

2. Further, X = ⋃_{a∈X} [a].

Thus, an equivalence relation ∼ on X divides X into disjoint equivalence classes.

Proof. If the equivalence classes [a] and [b] are disjoint, then there is nothing to prove. So, let
us assume that there are two equivalence classes, say [a] and [b], that intersect. Hence, there
exists c ∈ X such that c ∈ [a] ∩ [b]. That is, c ∼ a and c ∼ b.
As ∼ is symmetric, a ∼ c as well. Now, ∼ is transitive, with a ∼ c and c ∼ b and so a ∼ b.
Hence, if x ∼ a, then the above argument implies that x ∼ b. Thus, [a] ⊆ [b]. A similar argument
implies that [b] ⊆ [a] as symmetry with c ∼ b implies b ∼ c and the transitivity with b ∼ c and
c ∼ a implies b ∼ a. Thus, whenever two equivalence classes intersect, they are indeed equal.
For the second part, note that for each x ∈ X, the equivalence class [x] containing x is well
defined, and x ∈ [x] as ∼ is reflexive. Thus, if we take the union over all x ∈ X, we get X = ⋃_{x∈X} [x].
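
Proposition 1.2.36 can be observed on a small example. The sketch below computes the classes [a] = {b : b ∼ a} for the relation of Example 1.2.35.3 and checks that they are nonempty, pairwise disjoint and cover X. The function names are our own.

```python
def equivalence_classes(R, X):
    """Return the distinct classes [a] = {b in X : (b, a) in R} as frozensets."""
    return {frozenset(b for b in X if (b, a) in R) for a in X}

def is_partition(blocks, X):
    """Check that the blocks are nonempty, pairwise disjoint and cover X."""
    blocks = list(blocks)
    nonempty = all(len(b) > 0 for b in blocks)
    disjoint = all(blocks[i].isdisjoint(blocks[j])
                   for i in range(len(blocks))
                   for j in range(i + 1, len(blocks)))
    covers = set().union(*blocks) == set(X)
    return nonempty and disjoint and covers

X = {"a", "b", "c"}
R = {("a", "a"), ("b", "b"), ("c", "c"), ("a", "c"), ("c", "a")}

classes = equivalence_classes(R, X)
print(classes == {frozenset({"a", "c"}), frozenset({"b"})})   # True
print(is_partition(classes, X))                               # True
```

Since the classes are collected in a set, identical classes such as [a] and [c] are automatically stored only once.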

Exercise 1.2.37. Determine the equivalence relation among the relations given below. Further,
for each equivalence relation, determine its equivalence classes.
1. R = {(a, b) ∈ Z^2 | a ≤ b} on Z?
Ans: No. R is not symmetric.
2. R = {(a, b) ∈ Z∗ × Z∗ | a divides b}, where Z∗ = Z \ {0} on Z∗ ?
Ans: No. R is not symmetric.
3. For x = (x_1, x_2), y = (y_1, y_2) ∈ R^2 and R∗ = R \ {0}, let

(a) R = {(x, y) ∈ R^2 × R^2 | |x|^2 = x_1^2 + x_2^2 = y_1^2 + y_2^2 = |y|^2}.
Ans: Yes. For t ≥ 0, the equivalence classes are [(t, 0)] = {x : x_1^2 + x_2^2 = t^2} (all
concentric circles with center (0, 0)).
(b) R = {(x, y) ∈ R^2 × R^2 | x = αy for some α ∈ R∗}.
Ans: Yes. The equivalence classes are [(0, 0)] = {(0, 0)}, [(0, 1)] = {(0, a) : a ∈ R∗}
and for t ∈ R, [(1, t)] = {(x, tx) : x ∈ R∗}.
(c) R = {(x, y) ∈ R^2 × R^2 | 4x_1^2 + 9x_2^2 = 4y_1^2 + 9y_2^2}.
Ans: Yes. For t ≥ 0, the equivalence classes are [(t, 0)] = {x : 4x_1^2 + 9x_2^2 = 4t^2} (all
ellipses with center (0, 0) and eccentricity √5/3).
(d) R = {(x, y) ∈ R^2 × R^2 | x − y = α(1, 1) for some α ∈ R∗}.
Ans: No. R is not reflexive.
(e) Fix c ∈ R. Now, define R = {(x, y) ∈ R^2 × R^2 | y_2 − x_2 = c(y_1 − x_1)}.
Ans: Yes. For t ∈ R, the equivalence classes are [(t, 0)] = {(x, c(x − t)) : x ∈ R} (all
lines with slope c).
(f) R = {(x, y) ∈ R^2 × R^2 | |x_1| + |x_2| = α(|y_1| + |y_2|)}, for some positive real number α.
Ans: Yes. There are only two equivalence classes, namely [(0, 0)] = {(0, 0)} and
[(1, 0)] = R^2 \ {(0, 0)}.
(g) R = {(x, y) ∈ R^2 × R^2 | x_1 x_2 = y_1 y_2}.
Ans: Yes. The equivalence classes are [(0, 0)] = X-axis ∪ Y-axis and for each t ∈
R∗, [(1, t)] = {(x, t/x) : x ∈ R∗} (all rectangular hyperbolas with X-axis and Y-axis as
asymptotes).

4. For x = (x_1, x_2), y = (y_1, y_2) ∈ R^2, let S = {x ∈ R^2 | x_1^2 + x_2^2 = 1}. Then is the relation
given below an equivalence relation on S?
(a) R = {(x, y) ∈ S × S | x_1 = y_1, x_2 = −y_2}.
Ans: No. R is neither reflexive nor transitive.
(b) R = {(x, y) ∈ S × S | x = −y}.
Ans: No. R is neither reflexive nor transitive.

Definition 1.2.38. [Partition of a set] Let X be a non-empty set. Then a partition of X is
a collection of pairwise disjoint, non-empty subsets of X whose union is X.

Example 1.2.39. Let X = {a, b, c, d, e}.

1. If R is an equivalence relation on X with

R = {(a, a), (b, b), (c, c), (d, d), (e, e), (a, b), (b, a), (c, e), (e, c)}

then its equivalence classes are [a] = [b] = {a, b}, [c] = [e] = {c, e} and [d] = {d}.

2. Let {a}, {b, c, d}, {e} be a partition of X. Then verify that

R = {(a, a), (b, b), (c, c), (d, d), (e, e), (b, c), (c, d), (b, d), (c, b), (d, c), (d, b)}

is an equivalence relation with [a] = {a}, [b] = {b, c, d} and [e] = {e}.
The next proposition follows directly from Proposition 1.2.36 and hence the proof is
omitted. It answers the question: “if a partition of a non-empty set X is given, then does
there exist an equivalence relation on X such that the disjoint equivalence classes are exactly
the elements of the partition?”
Proposition 1.2.40. [Constructing equivalence relation from equivalence classes] Let f be
an equivalence relation on X ≠ ∅ whose disjoint equivalence classes are {[a] : a ∈ A}, for some
index set A. Then,

f = (⋃_{x∈X} {(x, x)}) ∪ (⋃_{a∈A} {(x, y) : x, y ∈ [a], x ≠ y}).
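
The formula of Proposition 1.2.40 can be tried on the partition of Example 1.2.39.2. The following sketch builds the relation as the union of the diagonal pairs and the off-diagonal pairs inside each block; the function name `relation_from_partition` is our own.

```python
def relation_from_partition(blocks):
    """Build the equivalence relation whose classes are the given blocks."""
    X = set().union(*blocks)
    diagonal = {(x, x) for x in X}
    off_diagonal = {(x, y) for b in blocks for x in b for y in b if x != y}
    return diagonal | off_diagonal

blocks = [{"a"}, {"b", "c", "d"}, {"e"}]
R = relation_from_partition(blocks)

expected = {("a", "a"), ("b", "b"), ("c", "c"), ("d", "d"), ("e", "e"),
            ("b", "c"), ("c", "d"), ("b", "d"), ("c", "b"), ("d", "c"), ("d", "b")}
print(R == expected)   # True
```

The result agrees with the relation R written out in Example 1.2.39.2.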
Exercise 1.2.41. 1. Let X and Y be two nonempty sets and f : X → Y be a relation. Let
Id_X and Id_Y be the identity relations on X and Y, respectively. Then,
(a) is it necessary that f^{-1} ◦ f ⊆ Id_X?
Ans: No. Take X = Y = {1, 2} and f = {(1, 1), (2, 1)}.
(b) is it necessary that f^{-1} ◦ f ⊇ Id_X?
Ans: No. Take X = Y = {1, 2} and f = {(1, 1)}.
(c) is it necessary that f ◦ f^{-1} ⊆ Id_Y?
Ans: No. Take X = Y = {1, 2} and f = {(1, 1), (1, 2)}.
(d) is it necessary that f ◦ f^{-1} ⊇ Id_Y?
Ans: No. Take X = Y = {1, 2} and f = {(1, 1)}.

2. Suppose now that f is a function. Then,

(a) is it necessary that f ◦ f^{-1} ⊆ Id_Y?
Ans: Yes. Let (y_1, y_2) ∈ f ◦ f^{-1}. So, there exists z ∈ X such that (y_1, z) ∈ f^{-1}
and (z, y_2) ∈ f. Thus, (z, y_1), (z, y_2) ∈ f. As f is a function, y_1 = y_2. Hence,
(y_1, y_2) ∈ Id_Y.
(b) is it necessary that Id_X ⊆ f^{-1} ◦ f?
Ans: Yes. Let x ∈ X. As f is a function, there exists y ∈ Y such that (x, y) ∈ f.
Then, by definition (y, x) ∈ f^{-1}. Hence, (x, x) ∈ f^{-1} ◦ f. Thus, Id_X ⊆ f^{-1} ◦ f.

3. Take A ≠ ∅. Is A × A an equivalence relation on A? If yes, what are the equivalence
classes?
Ans: Yes. It has only one equivalence class, namely [a] = {x : x ∈ A}.
4. On a nonempty set A, what is the smallest equivalence relation (in the sense that every
other equivalence relation will contain this equivalence relation; recall that a relation is a
set)?
Ans: Each equivalence class can contain only one element, i.e., the equivalence relation is
given by ⋃_{x∈A} {(x, x)}.

Exercise 1.2.42. [Optional]

1. Let X = {1, 2, 3, 4, 5} and let f be a relation on X. By checking whether f is reflexive or

not, whether f is symmetric or not and whether f is transitive or not, we see that there
are 8 types of relations on X. Give one example for each type.

2. Let A = B = {1, 2, 3}. Then, what is the number of

(a) relations from A to B?
Ans: As each relation is a subset of A × B and A × B has 9 elements, there are 2^9
distinct relations from A to B.
(b) relations f from {1, 2, 3} to {a, b, c} such that dom f = {1, 3}?
Ans: There are 7 = 2^3 − 1 nonempty relations from {1} to {a, b, c}. There are (2^3)^2 = 64
relations from {1, 3} to {a, b, c}. Out of them, one is empty, 7 have dom f = {1} and 7
have dom f = {3}. So, the answer is 49.
(c) relations f from {1, 2, 3} to itself such that f = f^{-1}?
Ans: The condition f = f^{-1} implies that f is symmetric. So, any subset of

{(1, 1), (1, 2), (1, 3), (2, 2), (2, 3), (3, 3)}

defines a unique symmetric relation (just add in the opposite points). So, the answer is
2^6.
(d) single valued relations from {1, 2, 3} to itself? How many of them are functions?
Ans: 3 with dom f = {1}, 3^2 with dom f = {1, 2} and 3^3 with dom f = {1, 2, 3}. In
total, 3\binom{3}{1} + 3^2\binom{3}{2} + 3^3\binom{3}{3}. Among them the number of functions is 3^3.
(e) equivalence relations on {1, 2, 3, 4, 5}.
Ans: The number of equivalence relations with 5 equivalence classes is 1.
The number of equivalence relations with 4 equivalence classes is \binom{5}{2}.
The number of equivalence relations with 3 equivalence classes is \binom{5}{3} + \binom{5}{2}\binom{3}{2}/2.
The number of equivalence relations with 2 equivalence classes is \binom{5}{4} + \binom{5}{3}.
The number of equivalence relations with 1 equivalence class is 1.
Total is 52.

3. Let f, g be two non-equivalence relations on R. Then, is it possible to have f ◦ g as an

equivalence relation? Give reasons for your answer.

4. Let f, g be two equivalence relations on R. Then, prove/disprove the following statements.

(a) f ◦ g is necessarily an equivalence relation.

(b) f ∩ g is necessarily an equivalence relation.
(c) f ∪ g is necessarily an equivalence relation.
(d) f ∪ g′ is necessarily an equivalence relation.
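
The count of 52 equivalence relations on {1, 2, 3, 4, 5} in Exercise 1.2.42.2e can be cross-checked by listing all partitions of a five element set, since equivalence relations on a set correspond to its partitions. The sketch below is only one possible way to do this; the generator `partitions` is our own helper.

```python
from collections import Counter

def partitions(items):
    """Yield every partition of the list `items` as a list of blocks."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for p in partitions(rest):
        # Put `first` into each existing block in turn ...
        for i in range(len(p)):
            yield p[:i] + [[first] + p[i]] + p[i + 1:]
        # ... or into a new block of its own.
        yield [[first]] + p

# Number of partitions of {1, ..., 5}, grouped by the number of blocks.
by_size = Counter(len(p) for p in partitions([1, 2, 3, 4, 5]))

print(sorted(by_size.items()))   # [(1, 1), (2, 15), (3, 25), (4, 10), (5, 1)]
print(sum(by_size.values()))     # 52
```

The counts 1, 15, 25, 10 and 1 agree with the case-by-case answer given in the exercise.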

1.3 Advanced topics in Set Theory and Relations∗

1.3.1 Families of Sets
Definition 1.3.1. [Family of sets] Let A be a set. For each x ∈ A, take a new set A_x. Then,
the collection

{A_x}_{x∈A} := {A_x | x ∈ A}

is a family of sets indexed by elements of A (index set). Unless otherwise mentioned, we
assume that the index set for a class of sets is nonempty.

Definition 1.3.2. [Union / Intersection of families of sets] Let {B_α}_{α∈S} be a nonempty
class of sets. We define their
1. union as ⋃_{α∈S} B_α = {x | x ∈ B_α, for some α}, and
2. intersection as ⋂_{α∈S} B_α = {x | x ∈ B_α, for all α}.

[Convention] Union of an empty class is ∅. The intersection of an empty class of subsets of a
set X is X.¹
Example 1.3.3. 1. Take A = {1, 2, 3}, A_1 = {1, 2}, A_2 = {2, 3} and A_3 = {4, 5}. Then,

{A_α | α ∈ A} = {A_1, A_2, A_3} = {{1, 2}, {2, 3}, {4, 5}}.

Thus, ⋃_{α∈A} A_α = {1, 2, 3, 4, 5} and ⋂_{α∈A} A_α = ∅.
2. Take A = N and A_n = {n, n + 1, . . .}. Then, the family

{A_α | α ∈ A} = {A_1, A_2, . . .} = {{1, 2, . . .}, {2, 3, . . .}, . . .}.

Thus, ⋃_{α∈A} A_α = N and ⋂_{α∈A} A_α = ∅.
3. Verify that ⋂_{n∈N} [−1/n, 2/n] = {0}.
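
For a finite index set, the union and intersection of a family can be computed directly. The sketch below stores the family of Example 1.3.3.1 as a Python dictionary from indices to sets; the function names are our own, and the intersection helper assumes a nonempty family, in line with the convention of Definition 1.3.1.

```python
def family_union(family):
    """Union of all the sets of the family (a dict index -> set)."""
    return set().union(*family.values())

def family_intersection(family):
    """Intersection of all the sets of a nonempty family."""
    return set.intersection(*family.values())

family = {1: {1, 2}, 2: {2, 3}, 3: {4, 5}}

print(family_union(family) == {1, 2, 3, 4, 5})   # True
print(family_intersection(family) == set())      # True
```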

We now give a set of important rules some of whose proofs are left for the reader.

Theorem 1.3.4. [Algebra of union and intersection] Let {A_α}_{α∈L} be a nonempty class of
subsets of X and B be any set. Then, the following statements are true.

1. B ∩ (⋃_{α∈L} A_α) = ⋃_{α∈L} (B ∩ A_α).

2. B ∪ (⋂_{α∈L} A_α) = ⋂_{α∈L} (B ∪ A_α).

Ans:

x ∈ B ∪ (⋂_{α∈L} A_α) ⇔ x ∈ B or x ∈ ⋂_{α∈L} A_α ⇔ x ∈ B or x ∈ A_α, for all α ∈ L
⇔ x ∈ B ∪ A_α, for all α ∈ L ⇔ x ∈ ⋂_{α∈L} (B ∪ A_α).
¹ The way we see this convention is as follows: First we agree that the intersection of an empty class of subsets
is a subset of X. Now, let x ∈ X such that x ∉ ⋂_{α∈S} B_α. This implies that there exists an α ∈ S such that x ∉ B_α.
Since S is empty, such an α does not exist.
3. (⋃_{α∈L} A_α)′ = ⋂_{α∈L} A′_α.
Ans:

x ∈ (⋃_{α∈L} A_α)′ ⇔ x ∈ X and x ∉ ⋃_{α∈L} A_α ⇔ x ∈ X and x ∉ A_α, for all α ∈ L
⇔ x ∈ A′_α, for all α ∈ L ⇔ x ∈ ⋂_{α∈L} A′_α.

4. (⋂_{α∈L} A_α)′ = ⋃_{α∈L} A′_α.

Proof. We give the proofs for Part 1 and 4. For Part 1, we see that

x ∈ B ∩ (⋃_{α∈L} A_α) ⇔ x ∈ B and x ∈ ⋃_{α∈L} A_α ⇔ x ∈ B and x ∈ A_α, for some α ∈ L
⇔ x ∈ B ∩ A_α, for some α ∈ L ⇔ x ∈ ⋃_{α∈L} (B ∩ A_α).

For Part 4, we have

x ∈ (⋂_{α∈L} A_α)′ ⇔ x ∉ ⋂_{α∈L} A_α ⇔ x ∉ A_α, for some α ∈ L ⇔ x ∈ A′_α, for some α ∈ L
⇔ x ∈ ⋃_{α∈L} A′_α.

Proceed along similar lines to complete the proofs of the other parts.
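
The four identities of Theorem 1.3.4 can be tested on a small finite family. This is only a numerical sanity check and not a proof; the universe X, the family and the set B below are arbitrary choices of ours.

```python
X = set(range(10))                            # the universe
family = [{0, 1, 2, 3}, {2, 3, 4}, {3, 5, 7}]
B = {1, 3, 5, 9}

def complement(S):
    """Complement of S inside the universe X."""
    return X - S

union = set().union(*family)
inter = set.intersection(*family)

part1 = (B & union) == set().union(*(B & A for A in family))
part2 = (B | inter) == set.intersection(*(B | A for A in family))
part3 = complement(union) == set.intersection(*(complement(A) for A in family))
part4 = complement(inter) == set().union(*(complement(A) for A in family))

print(part1, part2, part3, part4)   # True True True True
```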

Exercise 1.3.5. 1. Consider {A_x}_{x∈R}, where A_x = [x, x + 1]. What is ⋃_{x∈R} A_x and ⋂_{x∈R} A_x?
Ans: R and ∅, respectively.
2. For x ∈ [0, 1] write Zx := {zx | z ∈ Z} and A_x = R \ Zx. What is ⋃_{x∈R} A_x and ⋂_{x∈R} A_x?
Ans: R \ {0} and ∅, respectively.
3. Write the closed interval [1, 2] = ⋂_{n∈N} I_n, where I_n are open intervals.
4. Write R as a union of infinite number of pairwise disjoint infinite sets.
5. Write the set {1, 2, 3, 4} as the intersection of infinite number of infinite sets.
6. Suppose that A∆B = B. Is A = ∅?
7. Prove Theorem 1.3.4.

1.3.2 More on Relations

Proposition 1.3.6. [Properties of union and intersection under a relation] Let f : X → Y
be a relation and {A_α}_{α∈L} ⊆ P(X). Then, the following statements hold.

1. f(⋃_{α∈L} A_α) = ⋃_{α∈L} f(A_α).

2. f(⋂_{α∈L} A_α) ⊆ ⋂_{α∈L} f(A_α). Give an example where the inclusion is strict.

Proof. Part 1:

y ∈ f(⋃_{α∈L} A_α) ⇔ (x, y) ∈ f, for some x ∈ ⋃_{α∈L} A_α ⇔ (x, y) ∈ f with x ∈ A_α, for some α ∈ L
⇔ y ∈ f(A_α), for some α ∈ L ⇔ y ∈ ⋃_{α∈L} f(A_α).

For Part 2, we assume that ⋂_{α∈L} A_α ≠ ∅. Then,

y ∈ f(⋂_{α∈L} A_α) ⇔ (x, y) ∈ f, for some x ∈ ⋂_{α∈L} A_α ⇔ (x, y) ∈ f with x ∈ A_α, for all α ∈ L
⇒ y ∈ f(A_α), for all α ∈ L ⇔ y ∈ ⋂_{α∈L} f(A_α).

Thus, the required result follows.

Remark 1.3.7. It is important to note the following in the proof of the above proposition:
‘y ∈ f(A_α), for all α ∈ L’ implies that ‘for each α ∈ L, we can find some x_α ∈ A_α such that
(x_α, y) ∈ f’. That is, the x_α’s need not be the same. This gives you an idea to construct a
counterexample.
Define f : {1, 2, 3, 4} → {a, b} by f = {(1, a), (2, a), (2, b), (3, b), (4, b)}. Take A_1 = {1, 3} and
A_2 = {1, 2, 4} and verify that the inclusion in Part 2 of Proposition 1.3.6 is strict. Also, find the
x_i’s for b.
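
The verification asked for in Remark 1.3.7 can also be carried out mechanically. In the sketch below the relation f is a set of pairs and `image` is our own helper computing f(A) = {y : (x, y) ∈ f for some x ∈ A}.

```python
def image(f, A):
    """Return f(A) = {y : (x, y) in f for some x in A}."""
    return {y for (x, y) in f if x in A}

f = {(1, "a"), (2, "a"), (2, "b"), (3, "b"), (4, "b")}
A1, A2 = {1, 3}, {1, 2, 4}

lhs = image(f, A1 & A2)              # f(A1 ∩ A2)
rhs = image(f, A1) & image(f, A2)    # f(A1) ∩ f(A2)
print(lhs == {"a"}, rhs == {"a", "b"}, lhs < rhs)   # True True True

# Elements of A1 and of A2 that are related to b.
witnesses_A1 = {x for (x, y) in f if y == "b" and x in A1}
witnesses_A2 = {x for (x, y) in f if y == "b" and x in A2}
print(witnesses_A1 == {3}, witnesses_A2 == {2, 4})  # True True
```

The element b lies in both f(A_1) and f(A_2), but through different elements of A_1 and A_2, none of which lies in A_1 ∩ A_2 = {1}.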

Exercise 1.3.8. [Important]

1. Let f : X → Y be a single valued relation, A ⊆ X, B ⊆ Y and {B_β}_{β∈I} be a nonempty
family of subsets of Y. Then, show that
(a) f^{-1}(⋂_{β∈I} B_β) = ⋂_{β∈I} f^{-1}(B_β).
(b) f^{-1}(⋃_{β∈I} B_β) = ⋃_{β∈I} f^{-1}(B_β).
(c) f^{-1}(B′) = dom f \ f^{-1}(B).
(d) f(f^{-1}(B) ∩ A) = B ∩ f(A). Note that this equality fails if f is not single valued.
Ans: If (x, y), (x, z) ∈ f, for some y ≠ z, take A = {x}, B = {z}.

2. Let f : X → Y be one-one and {A_α}_{α∈L} be a nonempty family of subsets of X. Is
f(⋂_{α∈L} A_α) = ⋂_{α∈L} f(A_α)?
3. Show that each set can be written as a union of finite sets.
4. Give an example of an equivalence relation on N for which there are 7 equivalence classes,
out of which exactly 5 are infinite.
Ans: Let O denote the odd natural numbers. Let A_i = 2^i O, i = 0, 1, 2, 3. Put A_4 = {16},
A_5 = {32} and A_6 = N \ ⋃_{i=0}^{5} A_i. Define the relation R on N as xRy if x and y are both in
the same A_i. That is, R = ⋃_{i=0}^{6} (A_i × A_i). This is an equivalence relation.
5. Show that union of finitely many finite sets is a finite set.
Chapter 2

Peano Axioms and Countability

2.1 Peano Axioms and the set of Natural Numbers

In this section, we state the Peano axioms. When these axioms were proposed
by Peano and others, their goal was to provide the fewest axioms that would generate the
natural numbers that we are familiar with. The intuition here is to first assert the existence of
at least one natural number and then define a successor function to determine the rest.

P1. 1 ∈ N, i.e., 1 is a natural number. (One can also consider 0 ∈ N).

At this point, we are guaranteed the existence of at least one natural number. We now
use the successor function to generate other natural numbers. So, we define a function S
whose domain is N.

P2. If x ∈ N then S(x) ∈ N, i.e., the successor of a natural number is also a natural number.
Here, S(x) is referred to as the successor of x. Intuitively, one can think of S(x) as x + 1.
However, at this stage we have no formal idea as to what ‘+’ is. Further, we are very far
away from establishing N, the way we know it. So far, nothing rules out S(1) = 1; in this
case, all the previous conditions are satisfied. Of course, we want to avoid this! So, in
some sense, we want to ensure that 1 is not the successor of any natural number.

P3. For any x ∈ N, S(x) ≠ 1, i.e., the pre-image of 1 under S is empty. Thus, at this stage N
contains at least two natural numbers 1, S(1). If we stop here, we cannot construct N, the
way we know it. For example, taking N = {1, S(1)} with S(x) ≠ 1, for all x ∈ N, forces us to
have S(S(1)) = S(1). But we want N, the set of natural numbers, and hence we certainly
require that S is injective.

P4. For every x, y ∈ N, the condition S(x) = S(y) implies that x = y.

Remark 2.1.1. [Consequences of P4] As a first step, it eliminates the possibility that
N = {1, S(1)} as S(1) ≠ 1 from Axiom P3. Indeed, S(S(1)) ≠ 1 by Axiom P3, and S(S(1)) = S(1)
would give S(1) = 1 by Axiom P4. Thus, S(S(1)) ∉ {1, S(1)}. So, denote S(1) = 2 and S(2) = 3.
A repetition of the above argument will imply that S(3) ∉ {1, 2, 3}. So, denote S(3) = 4.
Similarly, denote S(4) = 5, . . .. Continuing this pattern, we get {1, 2, 3, . . .} ⊆ N.
Hence, these axioms so far have pushed our formal definition of N to include all the usual
elements (natural numbers).


The question arises, what disallows us from having {1, 2, 3, . . .} ∪ {a, b} = N, for certain
two symbols a and b. Note that it is possible to define S on {1, 2, 3, . . .} as above and also
to say that S(a) = b and S(b) = a. This clearly satisfies all the axioms defined above.

So, we need another axiom to exclude versions where N is ‘too large’. Now, taking inspi-
ration from induction, we define the following.

Definition 2.1.2. [Inductive set] A set X is said to be inductive if

1. either 1 ∈ X or 0 ∈ X or both,
2. x ∈ X implies that S(x) ∈ X.

The name “inductive” comes from 1 ∈ X (base step) and the second condition being
the inductive step. Based on the above definition, the last Peano axiom is the Axiom of
Induction.

P5. If X is an inductive set then N ⊆ X.

The previous axioms ensured that {1, 2, . . .} ⊆ N. Also {1, 2, . . .} is an inductive set and
hence the last axiom implies that N ⊆ {1, 2, . . .}. Thus, N = {1, 2, . . .}.

Now that we have axiomatically established the set of natural numbers, can we also establish
the arithmetic in N, the most important property for which natural numbers are known? The
arithmetic in N that touches every aspect of our lives is clearly addition and multiplication. So,
let us carefully define addition and multiplication using the Peano axioms and the successor
function.
Using only the Peano axioms, we first prove a small result and then use it to define addition
‘+’ of two natural numbers.

Lemma 2.1.3. If n ∈ N and n ≠ 1, then there exists m ∈ N such that S(m) = n.

Proof. Let X = {x ∈ N : x = 1 or x = S(y) for some y ∈ N}. By definition, 1 ∈ X. Also, each
n ∈ X is an element of N, so S(n) ∈ N is the successor of an element of N and hence S(n) ∈ X.
Thus, for each n ∈ X, S(n) ∈ X and hence by the axiom of induction X = N. Therefore, each
n ∈ N with n ≠ 1 is of the form S(m), for some m ∈ N.

2.1.1 Addition, Multiplication and its properties

Now, we use the recursion rule to define addition ‘+’.

Definition 2.1.4. [Addition] We use the following two assignments to define addition.
1. For each n ∈ N, assign n + 1 = S(n).
2. For each m, n ∈ N, assign n + S(m) = S(n + m).
Remark 2.1.5. 1. We have introduced ‘+’ by certain assignments which require justification.
Note that ‘assign’ actually translates into function.

2. By Lemma 2.1.3, we know that any natural number x ≠ 1 is of the form S(y), for some
natural number y and hence we have defined addition for all natural numbers.

On similar lines, we define multiplication ‘·’ and again Lemma 2.1.3 will assure us that we
have defined multiplication for each natural number.

Definition 2.1.6. [Multiplication] We use the following two assignments to define multiplica-
tion.
1. For each n ∈ N, assign n · 1 = n.
2. For each m, n ∈ N, assign n · S(m) = n · m + n.
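
The two recursive definitions can be tried out in a small model. The sketch below is only an illustration under our own assumptions: natural numbers are modelled by Python integers starting at 1, the successor S is modelled by adding one, and `pred` plays the role of Lemma 2.1.3. The functions `add` and `mul` use only S, `pred` and the two assignments of Definitions 2.1.4 and 2.1.6.

```python
ONE = 1

def S(n):
    """Successor in the model (an assumption of this sketch)."""
    return n + 1

def pred(n):
    """For n != 1, return the m with S(m) = n (Lemma 2.1.3)."""
    assert n != ONE
    return n - 1

def add(n, m):
    """n + 1 = S(n) and n + S(m) = S(n + m)."""
    if m == ONE:
        return S(n)
    return S(add(n, pred(m)))

def mul(n, m):
    """n · 1 = n and n · S(m) = n · m + n."""
    if m == ONE:
        return n
    return add(mul(n, pred(m)), n)

print(add(3, 4), mul(3, 4))   # 7 12

# Spot-check a few familiar laws on small numbers.
small = range(1, 6)
print(all(add(n, add(m, k)) == add(add(n, m), k)
          for n in small for m in small for k in small))   # True
print(all(add(n, m) == add(m, n) for n in small for m in small))   # True
print(all(mul(n, add(m, k)) == add(mul(n, m), mul(n, k))
          for n in small for m in small for k in small))   # True
```

Such checks on finitely many numbers do not replace the induction proofs; they only confirm that the recursive definitions behave as expected in this model.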

To get a feeling for why the above definitions on N agree with our existing concept of natural numbers,
we shall use only the above axioms to prove some of the familiar properties.
1. [Associativity of addition] For every n, m, k ∈ N, n + (m + k) = (n + m) + k.
Proof. Let X = {k ∈ N : for all m, n ∈ N, n + (m + k) = (n + m) + k}. To show that
X = N.
By definition, 1 ∈ X as for each n, m ∈ N, n + (m + 1) = n + S(m) = S(n + m) = (n + m) + 1.
Now, let z ∈ X and let us show that S(z) ∈ X. Since z ∈ X

n + (m + z) = (n + m) + z, for all n, m ∈ N. (2.1)

Thus, by definition and Equation (2.1), we see that

n + (m + S(z)) = n + S(m + z) = S(n + (m + z)) = S((n + m) + z) = (n + m) + S(z), for all n, m ∈ N.

Hence, S(z) ∈ X and thus by Axiom P5, X = N.

2. [Commutativity of addition] For every x, y ∈ N, x + y = y + x.

Proof. Let X = {k ∈ N : for all n ∈ N, n + k = k + n}. To show that X = N.
We first show that 1 ∈ X. To do so, we define Y = {n ∈ N : n + 1 = 1 + n}
and prove that Y = N. This in turn will imply that 1 ∈ X.
Firstly, 1 + 1 = 1 + 1 and hence 1 ∈ Y. Now, let y ∈ Y. To show S(y) ∈ Y. But, y ∈ Y
implies that 1 + y = y + 1 and hence

1 + S(y) = S(1 + y) = S(y + 1) = S(S(y)) = S(y) + 1.

Thus, S(y) ∈ Y and hence by Axiom P5, Y = N. Therefore, we finally conclude that
1 ∈ X.
Now, let z ∈ X. To show S(z) ∈ X. But, z ∈ X implies that n + z = z + n, for all
n ∈ N. Thus, using 1 ∈ X, n + z = z + n, for all n ∈ N and associativity, one has

n + S(z) = n + (z + 1) = (n + z) + 1 = (z + n) + 1 = 1 + (z + n) = (1 + z) + n = S(z) + n,

for all n ∈ N. Hence, S(z) ∈ X and thus by Axiom P5, X = N.


3. [Distributive Law] For every n, m, k ∈ N, n · (m + k) = n · m + n · k.

Proof. Let X = {k ∈ N : for all m, n ∈ N, n · (m + k) = n · m + n · k}. To show that
X = N.
1 ∈ X as for each n, m ∈ N,

n · (m + 1) = n · S(m) = n · m + n = n · m + n · 1.

Now, let z ∈ X and let us show that S(z) ∈ X. Since z ∈ X

n · (m + z) = n · m + n · z, for all n, m ∈ N. (2.2)

Thus, by definition and Equation (2.2), we see that

n · (m + S(z)) = n · S(m + z) = n · (m + z) + n = (n · m + n · z) + n = n · m + (n · z + n) = n · m + n · S(z),

for all n, m ∈ N. Hence, S(z) ∈ X and thus by Axiom P5, X = N.

Exercise 2.1.7. The readers are now required to prove the following using only the above
properties:
1. [Uniqueness of addition] For every m, n, k ∈ N, whenever m = n then m + k = n + k.
Ans: Define X = {k ∈ N : whenever m = n then m + k = n + k}. If m = n then
S(m) = S(n) as S is a function. Hence, by definition,

m + 1 = S(m) = S(n) = n + 1

and thus 1 ∈ X. Now, let k ∈ X. Then, m + k = n + k and hence S(m + k) = S(n + k)
as S is a function. So, m + S(k) = S(m + k) = S(n + k) = n + S(k) and hence S(k) ∈ X.
Thus, by Axiom P5, X = N.

2. [Cancellation Law] For every x, y ∈ N, if x + z = y + z for some z ∈ N then x = y.

Ans: Let X = {k ∈ N : if m, n ∈ N satisfy n + k = m + k then n = m}. To show that
X = N. Clearly, 1 ∈ X as m + 1 = n + 1 implies that S(m) = S(n). But, S is injective
(Axiom P4) and hence m = n.
Now, let z ∈ X. To show, S(z) ∈ X. So, let us assume that m + S(z) = n + S(z). Thus, by
definition, S(m + z) = S(n + z) and since S is injective m + z = n + z. As z ∈ X, we get
m = n and hence S(z) ∈ X. Thus, by Axiom P5, X = N.

3. [Associative Law for multiplication] For every x, y, z ∈ N, x · (y · z) = (x · y) · z.

Ans: Let X = {k ∈ N : for all m, n ∈ N, m · (n · k) = (m · n) · k}. To show that X = N.
Clearly, 1 ∈ X as by definition, m · (n · 1) = m · n = (m · n) · 1.
Now, let z ∈ X. To show, S(z) ∈ X. As z ∈ X implies that m · (n · z) = (m · n) · z, one has

m · (n · S(z)) = m · (n · z + n) = m · (n · z) + m · n = (m · n) · z + m · n = (m · n) · S(z).

Thus, by Axiom P5, X = N.


4. [Multiplication by 1] For each n ∈ N, 1 · n = n.

Ans: Let X = {n ∈ N : 1 · n = n}. By definition, 1 ∈ X as 1 · 1 = 1. Then, for any z ∈ X
(implies 1 · z = z),

1 · S(z) = 1 · (z + 1) = 1 · z + 1 · 1 = z + 1 = S(z).

Thus, S(z) ∈ X and hence by Axiom P5, X = N.

5. [Second Distributive Law] For every n, m, k ∈ N, (m + n) · k = m · k + n · k.

Ans: Let X = {k ∈ N : for all m, n ∈ N, (m + n) · k = m · k + n · k}. To show that X = N.

1 ∈ X as for each n, m ∈ N,

(m + n) · 1 = m + n = m · 1 + n · 1.

Then, for z ∈ X and n, m ∈ N,

(m + n) · S(z) = (m + n) · (z + 1) = (m + n) · z + (m + n) · 1
= m · z + n · z + m · 1 + n · 1 (as z ∈ X)
= m · z + m · 1 + n · z + n · 1 = m · (z + 1) + n · (z + 1)
= m · S(z) + n · S(z).

Hence, S(z) ∈ X and thus by Axiom P5, X = N.

6. [Commutativity of multiplication] For each m, n ∈ N, n · m = m · n.

Ans: Let X = {n ∈ N : for all m ∈ N, m · n = n · m}. From the previous item, 1 ∈ X as
m · 1 = m = 1 · m, for all m ∈ N. Then, for any z ∈ X (implies m · z = z · m, for all m ∈ N),

m · S(z) = m · (z + 1) = m · z + m · 1 = z · m + 1 · m = (z + 1) · m = S(z) · m.

Thus, S(z) ∈ X and hence by Axiom P5, X = N.

7. [Uniqueness of multiplication] For every m, n, k ∈ N, whenever m = n then m · k = n · k.

Ans: Define X = {k ∈ N : whenever m = n then m · k = n · k}. If m = n then

m·1=m=n=n·1

and thus 1 ∈ X. Now, let k ∈ X. Then, m · k = n · k and hence by uniqueness of addition

m · k + m = n · k + m = n · k + n and therefore,

m · S(k) = m · (k + 1) = m · k + m · 1 = m · k + m = n · k + n
= n · k + n · 1 = n · (k + 1) = n · S(k)

and hence S(k) ∈ X. Thus, by Axiom P5, X = N.


8. [Multiplicative Cancellation] For every x, y ∈ N, if x · z = y · z for some z ∈ N then x = y.

Ans: Let X = {n ∈ N : if m, k ∈ N satisfy n · k = m · k then n = m}. To show that X = N.
We first show that 1 ∈ X. So, assume that 1 · k = m · k for some m, k ∈ N. To show, m = 1.
Suppose, on the contrary, that m ≠ 1. Then, by Lemma 2.1.3, there exists y ∈ N such that
m = S(y). Hence,

1 · k = m · k = S(y) · k = y · k + k = y · k + 1 · k

and 1 + 1 · k = 1 + y · k + 1 · k. Thus, by additive cancellation

1 = 1 + y · k = y · k + 1 = S(y · k),

a contradiction to S(ℓ) ≠ 1, for all ℓ ∈ N. Hence, m = 1 and therefore 1 ∈ X.

Now, let z ∈ X. So, for any m, k ∈ N, if z · k = m · k then z = m. To show S(z) ∈ X, we
need to show that if S(z) · k = m · k for some m, k ∈ N, then S(z) = m.
If m = 1 then 1 · k = S(z) · k and, as 1 ∈ X, this gives S(z) = 1, a contradiction to S(ℓ) ≠ 1, for
all ℓ ∈ N. Thus, m ≠ 1 and hence by Lemma 2.1.3, there exists y ∈ N such that m = S(y).
Therefore, S(z) · k = m · k = S(y) · k and hence z · k + k = y · k + k. Thus, by additive
cancellation, z · k = y · k and since z ∈ X, we get z = y. As S is a function, this gives
m = S(y) = S(z) and hence S(z) ∈ X. Thus, by Axiom P5, X = N.
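
The laws proved in this list can be spot-checked on small numbers. The following is only a sketch and not a proof: Python integers stand in for the elements of N, and the recursive definitions n + 1 = S(n), n + S(m) = S(n + m), n · 1 = n and n · S(m) = n · m + n are assumed to be the ones used earlier in the notes. The names `S`, `pred`, `add`, `mul` and `check_laws` are illustrative only.

```python
def S(n):
    """Successor function on N = {1, 2, 3, ...}."""
    return n + 1


def pred(n):
    """The y with S(y) = n; it exists only for n != 1 (Lemma 2.1.3)."""
    if n == 1:
        raise ValueError("1 is not a successor")
    return n - 1


def add(n, m):
    """Assumed definition: n + 1 = S(n) and n + S(m) = S(n + m)."""
    if m == 1:
        return S(n)
    return S(add(n, pred(m)))


def mul(n, m):
    """Assumed definition: n * 1 = n and n * S(m) = n * m + n."""
    if m == 1:
        return n
    return add(mul(n, pred(m)), n)


def check_laws(limit):
    """Spot-check commutativity, distributivity and cancellation up to limit."""
    numbers = range(1, limit + 1)
    for m in numbers:
        for n in numbers:
            if mul(m, n) != mul(n, m):
                return False
            for k in numbers:
                if mul(add(m, n), k) != add(mul(m, k), mul(n, k)):
                    return False
                if mul(m, k) == mul(n, k) and m != n:
                    return False
    return True
```

A call such as `check_laws(5)` only confirms the laws for arguments up to 5; the general statements still rest on Axiom P5.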


2.1.2 Well Ordering in N

In this subsection, we introduce the ordering on N. So, for any m, n ∈ N, we need to define
what n < m means.

Definition 2.1.8. [Ordering in N] Let m, n ∈ N. Then, we say n < m (in words, n is less than
m) if there exists a k ∈ N such that m = n + k. Further, n ≤ m if either n < m or n = m.

Lemma 2.1.9. [Transitivity] Let x, y, z ∈ N such that x < y and y < z. Then x < z.

Proof. Since x < y, there exists k ∈ N such that y = x + k. Similarly, y < z gives the existence
of ℓ ∈ N such that z = y + ℓ. Hence, z = y + ℓ = (x + k) + ℓ = x + (k + ℓ) = x + t, where
t = k + ℓ ∈ N as k, ℓ ∈ N. Thus, by definition x < z.

Exercise 2.1.10. Let x, y, z ∈ N. Then prove that

1. whenever x ≤ y and y < z then x < z.
2. whenever x < y and y ≤ z then x < z.
3. whenever x ≤ y and y ≤ z then x ≤ z.
4. whenever x < y then x + z < y + z and x · z < y · z.

Lemma 2.1.11. For all m, n ∈ N, m ≠ m + n.

Proof. Let X = {m ∈ N : m ≠ m + 1}. Clearly, 1 ∈ X as 1 ≠ 1 + 1 = S(1) (Axiom P3). Now,
let n ∈ X. On the contrary, assume that S(n) ∉ X. Then, S(n) = S(n) + 1 = S(S(n)). As S
is injective (Axiom P4), we get n = S(n) = n + 1, a contradiction to n ∈ X. So, S(n) ∈ X and
hence by Axiom P5, X = N. Thus, n ≠ n + 1 for all n ∈ N.
Now, define X = {k ∈ N : for all m ∈ N, m ≠ m + k}. Then, by the previous paragraph,
1 ∈ X. So, assume k ∈ X and try to show that S(k) ∈ X. Or equivalently, we need to show that

m ≠ m + S(k) = S(m + k), for all m ∈ N.

So, let us define Y = {m ∈ N : m ≠ S(m + k)}. Clearly, 1 ∈ Y as by Axiom P3, 1 ≠ S(ℓ), for
any ℓ ∈ N. So, let m ∈ Y. To show, S(m) ∈ Y.
On the contrary, assume that S(m) ∉ Y. So, by definition of Y, S(m) = S(S(m) + k). As S
is injective (Axiom P4), the previous step gives m = S(m) + k = m + 1 + k = m + (1 + k) =
m + (k + 1) = (m + k) + 1 = S(m + k), a contradiction to m ∈ Y. Hence, S(m) ∈ Y and, by
Axiom P5, Y = N. Thus, S(k) ∈ X and, again by Axiom P5, X = N.

Lemma 2.1.12. [Well ordering in N] For all m, n ∈ N, exactly one of the following is true:
1. n < m,
2. n = m,
3. n > m.

Proof. As a first step, we show that if one of the above holds then the other two cannot hold.
So, let us assume that n < m. Then, by definition, there exists k ∈ N such that m = n + k.
Then, by Lemma 2.1.11, n ≠ n + k = m and hence n ≠ m. If m < n, then n = m + ℓ, for some
ℓ ∈ N. Thus,
n = m + ℓ = (n + k) + ℓ = n + (k + ℓ), with k + ℓ ∈ N,

a contradiction to Lemma 2.1.11.
The readers should prove the other parts of the first step. Now, to complete the proof, let us
fix n ∈ N and define X = {m ∈ N : either m < n or m = n or n < m}. We now show that
1 ∈ X.
If n = 1 then 1 = 1 and hence 1 ∈ X. If n ≠ 1 then there exists y ∈ N such that n = S(y) =
y + 1 = 1 + y and hence by the definition of order, 1 < n. Thus, 1 ∈ X. Let us now assume that
m ∈ X and prove that S(m) ∈ X. As m ∈ X, either m < n or m = n or n < m. We will
consider all three cases and in each case show that S(m) ∈ X.
If m < n then n = m + k, for some k ∈ N. Further, if k = 1 then n = m + 1 and S(m) = n.
Thus, S(m) ∈ X. If k ≠ 1 then there exists ℓ ∈ N such that S(ℓ) = k. Then,

n = m + k = m + S(ℓ) = m + (ℓ + 1) = m + (1 + ℓ) = (m + 1) + ℓ = S(m) + ℓ

and hence S(m) < n. Thus, S(m) ∈ X.

If m = n then S(m) = m + 1 = n + 1 and hence n < S(m). Thus S(m) ∈ X.
If n < m then m = n + ℓ, for some ℓ ∈ N. Thus, S(m) = S(n + ℓ) = (n + ℓ) + 1 = n + (ℓ + 1)
and hence n < S(m). Therefore, S(m) ∈ X and the proof of each case is complete. Thus, by
Axiom P5, X = N.
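
The order of Definition 2.1.8 can be tested directly from its definition. The sketch below is an illustration, not part of the proof: `less_than` searches for a witness k with m = n + k, and `trichotomy_holds` checks that exactly one of the three cases of Lemma 2.1.12 occurs. Both names are hypothetical, and Python integers stand in for elements of N.

```python
def less_than(n, m):
    """n < m if and only if m = n + k for some k in N = {1, 2, ...}."""
    # Any witness k is at most m - 1, so a bounded search is enough.
    return any(m == n + k for k in range(1, m))


def trichotomy_holds(n, m):
    """Exactly one of n < m, n = m, m < n should be true."""
    cases = [less_than(n, m), n == m, less_than(m, n)]
    return sum(cases) == 1
```

For instance, `less_than(2, 5)` finds the witness k = 3, while `less_than(3, 3)` finds none, in agreement with Lemma 2.1.11.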

We are now in a position to state two important principles, namely the Well ordering principle
and the principle of mathematical induction.

Theorem 2.1.13. [Well ordering principle in N (or N ∪ {0})] Every non-empty subset X of
N has a least element.

Proof. We first prove that for each n ∈ N, the statement “every non-empty subset of {1, 2, . . . , n}
has a least element” holds. To prove this let

A = {n ∈ N : every non-empty subset of {1, 2, . . . , n} has a least element}.

Clearly 1 ∈ A as 1 itself is the least element of {1}, the only non-empty subset of {1}. Let
n ∈ A. To show, S(n) = n + 1 ∈ A.
So, let X be a non-empty subset of {1, 2, . . . , n + 1}. If X = {n + 1} then it has n + 1 as
its least element. If X ≠ {n + 1} then B = {1, 2, . . . , n} ∩ X is a non-empty
subset of {1, 2, . . . , n}. As n ∈ A, B has a least element, say k. Then, by the definition of B, k
is also the least element of X. Thus, A is an inductive set and by Axiom P5, A = N.
Finally, let X be any non-empty subset of N and pick n ∈ X. Then X ∩ {1, 2, . . . , n} is a
non-empty subset of {1, 2, . . . , n} and hence has a least element, say k. Every element of X
outside {1, 2, . . . , n} is greater than n ≥ k. Thus, k is the least element of X.
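
Computationally, the Well ordering principle says that an upward search through 1, 2, 3, . . . for a member of a non-empty subset of N always stops. The sketch below assumes the subset is described by a membership test `contains`, a hypothetical callable introduced only for this illustration.

```python
from itertools import count


def least_element(contains):
    """Return the least n in {1, 2, ...} for which contains(n) is true.

    The search terminates only when the described set is non-empty;
    for an empty set the loop never ends.
    """
    for n in count(1):
        if contains(n):
            return n
```

For example, `least_element(lambda n: n * n > 50)` inspects 1, 2, . . . in order and stops at the first n whose square exceeds 50.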

What is also interesting about the Well ordering principle is that it is logically equivalent to
the principle of mathematical induction, which is stated next. One can obtain a direct proof of
the principle of mathematical induction by defining an inductive set and then using Axiom P5.
Here, we use the Well ordering principle to prove the principle of mathematical induction.

Theorem 2.1.14. [Principle of mathematical induction (PMI)] Let P (n) be a statement

(proposition) dependent on a natural number n ∈ N. Assume that

1. base step: P (1) is true,
2. induction step: for each n ∈ N, the statement P (n) is true implies P (n + 1) is true.

Then, P (n) is true for all n ∈ N.

Proof. Let X ⊆ N such that 1 ∈ X and if k ∈ X then S(k) = k + 1 ∈ X. To show that X = N.

If N \ X = ∅ then we are done. So, let us assume that N \ X ≠ ∅. Then, N \ X is a non-empty
subset of N and hence by the Well ordering principle, it has a least element, say k. As 1 ∈ X,
k ≠ 1. Then, by Lemma 2.1.3, there exists y ∈ N such that k = S(y) = y + 1. Thus, y < k.
Since k is the least element of N \ X, y ∉ N \ X. So, y ∈ X and hence by the definition of the
set X, k = S(y) ∈ X, a contradiction as k ∈ N \ X. Therefore, N \ X = ∅, i.e., X = N.

We now prove that the principle of mathematical induction implies the Well ordering principle.
Proof: Let P (n) be the statement “Any subset of natural numbers containing an element k,
with k ≤ n, has a least element”.
Define X = {n ∈ N : P (n) is true}. Clearly, 1 ∈ X as P (1) is trivially true. So, let us assume
that y ∈ X and show that S(y) = y + 1 ∈ X.
As y ∈ X, the statement “if there is a subset E of N containing an element t, with t ≤ y, then E
contains a least element” is true. Now, let Y ⊆ N with Y containing an element t ≤ S(y) = y + 1.
If Y has no element which is less than y + 1 then t = y + 1 is the least element of Y.
If Y has an element t < y + 1, then B = Y ∩ {1, 2, . . . , y} is non-empty and it contains the
element t ≤ y. Thus, B is a subset of N containing an element t, with t ≤ y, and hence B
contains a least element. Therefore, by definition of B, the least element of B is also the least
element of Y and hence Y contains a least element. Thus, in either case, y + 1 ∈ X.
Thus, by the principle of mathematical induction, X = N. Now, let T be any non-empty
subset of N. Since T is non-empty, there exists an m ∈ N such that m ∈ T . Thus, T is a subset
of N containing an element t, with t = m ≤ m. As P (m) is true, the set T has a least element
and thus, one has the Well ordering principle.

Exercise 2.1.15. Prove that for all m, n ∈ N, S(m) + n = S(m + n).

2.1.3 Applications
Let us now go back to the definition of addition: n + 1 = S(n), n + S(m) = S(n + m), for all
n, m ∈ N. The word ‘assign’ means that we actually have a function that does the assignment.
We will now prove a theorem, commonly known as the recursive theorem, that will help us in
actually defining the addition function as an application.

Theorem 2.1.16. [Recursive Theorem] Let α be a fixed natural number and let f : N → N
be a function. Then, there exists a unique function g : N → N such that

g(1) = α and g(S(x)) = f (g(x)), for all x ∈ N.

Proof. [Existence of g] Since we want a function g : N → N, we are essentially looking for
a subset of N × N. By g(1) = α, we mean (1, α) ∈ g. Further, g(S(x)) = f(g(x)) means if
y = g(x), or equivalently, if (x, y) ∈ g then (S(x), f(y)) ∈ g. Using this understanding, let us
construct g. So, let

X = {A ⊆ N × N : (1, α) ∈ A and (x, y) ∈ A implies that (S(x), f(y)) ∈ A}.

Clearly, X ≠ ∅ as N × N ∈ X. So, define

\[ g = \bigcap_{A \in X} A. \]

Then, (1, α) ∈ g as (1, α) ∈ A, for all A ∈ X. Now, let (x, y) ∈ g. Then, (x, y) ∈ A, for all
A ∈ X. Hence, by definition of X, (S(x), f(y)) ∈ A, for all A ∈ X. Thus, whenever (x, y) ∈ g,
we see that (S(x), f(y)) ∈ g. Therefore, g ∈ X and by definition (intersection of all A ∈ X), g
is the smallest element of X.
We now claim that g : N → N is a function. So, we show that dom(g) = N and each element
of the domain has exactly one image under g.
Let Y = {n ∈ N : there exists z ∈ N for which (n, z) ∈ g}. As (1, α) ∈ g, we get 1 ∈ Y. So,
let n ∈ Y. To show S(n) ∈ Y. As n ∈ Y, there exists z ∈ N such that (n, z) ∈ g. Hence, by
definition of g, (S(n), f(z)) ∈ g, i.e., S(n) ∈ Y and therefore by Axiom P5, Y = N. In other
words, dom(g) = N.
As a next step, we prove that for each element of the domain, there is exactly one image under
g. So, define
Z = {n ∈ N : whenever (n, y) ∈ g and (n, z) ∈ g then y = z}.

1 ∈ Z as 1 ∉ Z implies that there exist z1, z2 ∈ N, z1 ≠ z2, such that (1, z1), (1, z2) ∈ g. At
least one of them differs from α, say z2 ≠ α. Then, the relation h = g \ {(1, z2)} ⊊ g and h ∈ X,
as (1, α) ∈ h and 1 ≠ S(x) for any x ∈ N (Axiom P3). This contradicts the minimality of g. Hence,
(1, z1), (1, z2) ∈ g implies z1 = z2.
So, now let us assume that n ∈ Z. We need to show that S(n) ∈ Z. So, let if possible
S(n) ∉ Z. As n ∈ Z, there exists a unique m ∈ N such that (n, m) ∈ g. Hence, by definition,
(S(n), f(m)) ∈ g. But, we have assumed that S(n) ∉ Z. Therefore, there exists z ∈ N, z ≠ f(m),
such that (S(n), z) ∈ g. But in this case, we again have h = g \ {(S(n), z)} ⊊ g with h ∈ X:
indeed, (1, α) ∈ h as S(n) ≠ 1, and for (x, y) ∈ h the pair (S(x), f(y)) can equal (S(n), z) only
if x = n (Axiom P4), y = m and f(m) = z, which is not the case.
This contradicts the minimality of g. Hence, f(m) = z. Thus, S(n) ∈ Z and thus by Axiom P5,
Z = N.
As a final step in this proof, we show that g is unique. So, let g1, g2 be two functions such that
g1(1) = g2(1) = α, g1(S(k)) = f(g1(k)) and g2(S(k)) = f(g2(k)). Define V = {n ∈ N : g1(n) =
g2(n)}. Then, 1 ∈ V. Also, n ∈ V implies that g1(n) = g2(n) and hence g1(S(n)) = f(g1(n)) =
f(g2(n)) = g2(S(n)). Thus, S(n) ∈ V and thus by Axiom P5, V = N. This completes the proof
of the recursive theorem.

Example 2.1.17. As an application of the recursion theorem, we re-define addition and
multiplication of natural numbers. Note that the uniqueness of the function g helps us in the sense
that we can either guess the function and then verify it or inductively define the function
g.
1. Let f : N → N be defined by f (x) = S(x), for all x ∈ N. Now, fix m ∈ N. Then, by the
recursion theorem, there exists a unique function g : N → N such that

g(1) = m and g(S(n)) = f (g(n)), for all n ∈ N.

Thus, g(n + 1) = g(S(n)) = S(g(n)) = g(n) + 1, for all n ∈ N. So, let us verify that the
unique function g satisfies g(n + 1) = m + n, for all n ∈ N.
Clearly, g(1) = m and by definition of g, m + S(n) = g(S(n) + 1) = g(S(S(n))) =
S(g(S(n))) = S(g(n + 1)) = S(m + n). Thus, we get the required addition function.
2. Fix m ∈ N and define f : N → N by f (n) = n + m, for all n ∈ N. Then, by the recursion
theorem, there exists a unique g : N → N such that g(1) = m and g(S(n)) = f (g(n)), for
all n ∈ N. Thus, let us verify that the unique function g satisfies g(n) = m · n, for all
n ∈ N.
Clearly, g(1) = m = m · 1 and

g(S(n)) = f (g(n)) = g(n) + m = m · n + m · 1 = m · (n + 1) = m · S(n).

Thus, we get the required multiplication function.

3. Fix m ∈ N and define f : N → N by f(n) = m · n, for all n ∈ N. Then, by the recursion
theorem, there exists a unique g : N → N such that g(1) = m and g(S(n)) = f(g(n)), for
all n ∈ N. Thus, let us verify that the unique function g satisfies $g(n) = m^n$, for all n ∈ N.
Clearly, $g(1) = m = m^1$ and

\[ g(S(n)) = f(g(n)) = m \cdot g(n) = m \cdot m^n = m^{n+1} = m^{S(n)}. \]

Thus, we get the required exponentiation function.
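
The recursion theorem can be mirrored by a small sketch that builds g from α and f, and then recovers the three functions of Example 2.1.17. This is an illustration under stated assumptions: Python integers stand in for N, the helper `recursive_definition` and the names `plus`, `times`, `power` are hypothetical, and the built-in `+` and `*` are used inside f for brevity rather than the Peano definitions.

```python
def recursive_definition(alpha, f):
    """Return g with g(1) = alpha and g(S(x)) = f(g(x)) for all x."""
    def g(x):
        value = alpha
        for _ in range(x - 1):  # apply f exactly x - 1 times
            value = f(value)
        return value
    return g


def plus(m):
    """Item 1: f = S and g(1) = m, so that g(n + 1) = m + n."""
    return recursive_definition(m, lambda x: x + 1)


def times(m):
    """Item 2: f(n) = n + m and g(1) = m, so that g(n) = m * n."""
    return recursive_definition(m, lambda x: x + m)


def power(m):
    """Item 3: f(n) = m * n and g(1) = m, so that g(n) = m ** n."""
    return recursive_definition(m, lambda x: m * x)
```

Note the shift in item 1: `plus(3)(5)` is g(5) = g(4 + 1) = 3 + 4, whereas `times(3)(5)` and `power(2)(10)` give 3 · 5 and 2^10 directly.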

By now, the readers should have got a glimpse of the work required to axiomatically construct
N, the set of natural numbers. Similarly, the construction of integers from natural numbers and
the construction of rational numbers from integers require quite a lot of work. These construc-
tions are very helpful in understanding advanced algebra. But, we will skip their constructions
for the time being and try to understand the numbers using the well-ordering principle and the
principle of mathematical induction.

Theorem 2.1.18. [Archimedean property for positive integers] Let x, y ∈ N. Then, there
exists n ∈ N such that nx ≥ y.

Proof. On the contrary assume that such an n ∈ N does not exist. That is, nx < y for every
n ∈ N. Now, consider the set S = {y − nx | n ∈ N ∪ {0}}. Then y ∈ S and hence S is a nonempty
subset of N ∪ {0}. Therefore, by the well-ordering principle (Theorem 2.1.13), S contains its least
element, say y − mx. Then, by assumption, the integer y − (m + 1)x ≥ 0, y − (m + 1)x ∈ S, and
y − (m + 1)x < y − mx, a contradiction to the minimality of y − mx. Thus, our assumption is
invalid and hence the required result follows.
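
The Archimedean property guarantees that the following search stops. The sketch returns the least n with nx ≥ y; the function name `archimedean_witness` is hypothetical, and x, y are assumed to be positive Python integers.

```python
def archimedean_witness(x, y):
    """Return the least n in N with n * x >= y, for positive integers x, y."""
    n = 1
    while n * x < y:  # Theorem 2.1.18 ensures this loop terminates
        n += 1
    return n
```

For x = 3 and y = 10 the loop rejects n = 1, 2, 3 (as 3, 6, 9 are less than 10) and stops at n = 4.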

Theorem 2.1.19. [Another form of PMI] Let S ⊆ Z be a set which satisfies

1. k0 ∈ S and

2. k + 1 ∈ S whenever {k0 , k0 + 1, . . . , k} ⊆ S.

Then {k0 , k0 + 1, . . .} ⊆ S.

Proof. Consider T = {x − (k0 − 1) | x ∈ S, x ≥ k0 }. Then 1 ∈ T as k0 ∈ S and 1 = k0 − (k0 − 1).

Now, let {1, 2, . . . , k} ⊆ T . Then, {k0 , k0 + 1, . . . , k0 + k − 1} ⊆ S. Hence by the hypothesis,
(k0 + k − 1) + 1 = k0 + k ∈ S. Therefore, by definition of T , we have k + 1 ∈ T and hence using
the strong form of PMI, T = N. Thus, the required result follows.

The next result gives the equivalence of the weak form of PMI with the strong form of PMI.

Theorem 2.1.20. [Equivalence of PMI in weak form and PMI in strong form] Fix a natural
number k0 and let P (n) be a statement about a natural number n. Suppose that P means the
statement ‘P (n) is true for each n ∈ N, n ≥ k0 ’. Then ‘P can be proved using the weak form of
PMI’ if and only if ‘P can be proved using the strong form of PMI’.

Proof. Let us assume that the statement P has been proved using the weak form of PMI. Hence,
P (k0 ) is true. Further, whenever P (n) is true, we are able to establish that P (n + 1) is true.
Therefore, we can establish that P (n + 1) is true if P (k0 ), . . . , P (n) are true. Hence, P can be
proved using the strong form of PMI.
So, now let us assume that the statement P has been proved using the strong form of PMI.
Now, define Q(n) to mean ‘P(ℓ) holds for ℓ = k0, k0 + 1, . . . , n’. Notice that Q(k0) is true.
Suppose that Q(n) is true (this means that P(ℓ) is true for ℓ = k0, k0 + 1, . . . , n). By hypothesis,
we know that P has been proved using the strong form of PMI. That is, P(n + 1) is true whenever
P(ℓ) is true for ℓ = k0, k0 + 1, . . . , n. This, in turn, means that Q(n + 1) is true. Hence, by the
weak form of PMI, Q(n) is true for all n ≥ k0 . Thus, we are able to prove P using the weak
form of PMI.

Example 2.1.21. [Wrong use of PMI: Can you find the error?] The following is an incorrect
proof of ‘if a set of n balls contains a green ball then all the balls in the set are green’. Find the
error.

Proof. The statement holds trivially for n = 1. Assume that the statement is true for n ≤ k.
Take a collection $B_{k+1}$ of k + 1 balls that contains at least one green ball. From $B_{k+1}$, pick a
collection $B_k$ of k balls that contains at least one green ball. Then by the induction hypothesis,
each ball in $B_k$ is green. Now, remove one ball from $B_k$ and put the ball which was left out in
the beginning. Call it $B_k'$. Again by induction hypothesis, each ball in $B_k'$ is green. Thus, each
ball in $B_{k+1}$ is green. Hence by PMI, our proof is complete.

Exercise 2.1.22. [Optional]

1. Let x ∈ R with x ≠ 1. Then prove that
\[ 1 + x + x^2 + \cdots + x^n = \sum_{k=0}^{n} x^k = \frac{x^{n+1} - 1}{x - 1}. \]
2. Let a, a + d, a + 2d, . . . , a + (n − 1)d be the first n terms of an arithmetic progression. Then,
\[ S = \sum_{i=0}^{n-1} (a + id) = a + (a + d) + \cdots + (a + (n-1)d) = \frac{n}{2}\bigl(2a + (n-1)d\bigr). \]
3. Let $a, ar, ar^2, \ldots, ar^{n-1}$ be the first n terms of a geometric progression, with r ≠ 1. Then,
\[ S = a + ar + \cdots + ar^{n-1} = \sum_{i=0}^{n-1} ar^i = a\,\frac{r^n - 1}{r - 1}. \]
4. Prove that

(a) 6 divides $n^3 - n$, for all n ∈ N.
(b) 7 divides $n^7 - n$, for all n ∈ N.
(c) 3 divides $2^{2n} - 1$, for all n ∈ N.
(d) 9 divides $2^{2n} - 3n - 1$, for all n ∈ N.
(e) 10 divides $n^9 - n$, for all n ∈ N.
(f) 12 divides $2^{2n+2} - 3n^4 + 3n^2 - 4$, for all n ∈ N.
(g) $1^3 + 2^3 + \cdots + n^3 = \left(\frac{n(n+1)}{2}\right)^2$.
5. Determine a formula for 1 · 2 + 2 · 3 + 3 · 4 + · · · + (n − 1) · n and prove it.
6. Determine a formula for 1 · 2 · 3 + 2 · 3 · 4 + 3 · 4 · 5 + · · · + (n − 1) · n · (n + 1) and prove it.
7. Determine a formula for 1 · 3 · 5 + 2 · 4 · 6 + · · · + n · (n + 2) · (n + 4) and prove it.
8. [Informative] For all n ≥ 32, there exist nonnegative integers x and y such that n =
5x + 9y. [Hint: Prove it first for the starting 5 numbers.]
9. [Informative] Prove that, for all n ≥ 40, there exist nonnegative integers x and y such
that n = 5x + 11y.
10. For every positive integer n ≥ 5 prove that $2^n > n^2 > 2n + 1$.
11. [Informative] Prove that for µ > 0,
\[ \prod_{l=1}^{p} (1 + l\mu) \ge 1 + \frac{p(p+1)}{2}\,\mu + \frac{1}{2}\left[\frac{p^2(p+1)^2}{4} - \frac{p(p+1)(2p+1)}{6}\right]\mu^2. \]
Ans: Ignoring the $\mu^3$ term,
\[ \prod_{l=1}^{k} (1 + l\mu) \ge (1 + k\mu)\left[1 + \mu\sum_{l=1}^{k-1} l + \frac{\mu^2}{2}\sum_{l=1}^{k-1} (l^3 - l^2)\right] \ge 1 + \mu\sum_{l=1}^{k} l + \frac{\mu^2}{2}\sum_{l=1}^{k} (l^3 - l^2). \]
12. [Informative] By an L-shaped piece, we mean a piece of the type shown in the picture.
Consider a $2^n \times 2^n$ square with one unit square cut. See the picture given below.

[Figure: an L-shaped piece; a 4 × 4 square with a unit square cut]

Show that a $2^n \times 2^n$ square with one unit square cut can be covered with L-shaped pieces.
13. [Informative] Verify that $(k+1)^5 - k^5 = 5k^4 + 10k^3 + 10k^2 + 5k + 1$. Now, put k = 1, 2, . . . , n
and add to get
\[ (n+1)^5 - 1 = 5\sum_{k=1}^{n} k^4 + 10\sum_{k=1}^{n} k^3 + 10\sum_{k=1}^{n} k^2 + 5\sum_{k=1}^{n} k + \sum_{k=1}^{n} 1. \]
Now, use the formulas for $\sum_{k=1}^{n} k^3$, $\sum_{k=1}^{n} k^2$, $\sum_{k=1}^{n} k$ and $\sum_{k=1}^{n} 1$ to get an expression for $\sum_{k=1}^{n} k^4$.
14. [Informative: A more general result than AM-GM]
(a) Let $a_1, \ldots, a_9$ be nonnegative real numbers such that the sum $a_1 + \cdots + a_9 = 5$. Assume
that $a_1 \ne a_2$. Consider $\frac{a_1 + a_2}{2}, \frac{a_1 + a_2}{2}, a_3, \ldots, a_9$ and argue that
\[ a_1 \cdots a_9 \le \left(\frac{a_1 + a_2}{2}\right)^2 a_3 \cdots a_9. \]

(b) Let $a_1, \ldots, a_n$ be any nonnegative real numbers such that the sum $a_1 + \cdots + a_n = r_0$.
Argue that the highest value of $a_1 \cdots a_n$ is obtained when $a_1 = \cdots = a_n = r_0/n$.
(c) Let $a_1, \ldots, a_n$ be fixed nonnegative real numbers such that the sum $a_1 + \cdots + a_n = r_0$.
Conclude from the previous item that $(r_0/n)^n \ge a_1 \cdots a_n$, the AM-GM inequality.
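
Before attempting the induction proofs in Exercise 2.1.22, it can help to test the statements numerically. The sketch below checks items 4(a) to 4(f) and the representations n = ax + by of items 8 and 9 for small n. It is a sanity check only and proves nothing; the function names are hypothetical.

```python
def divisibility_checks(limit):
    """Check items 4(a)-(f) for n = 1, ..., limit."""
    claims = [
        (6, lambda n: n**3 - n),
        (7, lambda n: n**7 - n),
        (3, lambda n: 2**(2 * n) - 1),
        (9, lambda n: 2**(2 * n) - 3 * n - 1),
        (10, lambda n: n**9 - n),
        (12, lambda n: 2**(2 * n + 2) - 3 * n**4 + 3 * n**2 - 4),
    ]
    return all(expr(n) % d == 0
               for d, expr in claims
               for n in range(1, limit + 1))


def representable(n, a, b):
    """Is n = a*x + b*y for some nonnegative integers x and y?"""
    return any((n - a * x) % b == 0 for x in range(n // a + 1))
```

The bounds 32 and 40 in items 8 and 9 are sharp: `representable(31, 5, 9)` and `representable(39, 5, 11)` both fail, which is why the induction has to start where it does.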

2.2 Finite and Infinite Sets

We now discuss the size of sets. A useful way to compare two sets is through their size. In
particular, we will be concerned about those sets whose size exceeds the size of the set N, the
set of natural numbers.
To start with, for a fixed positive integer n, let us write [n] = {1, 2, . . . , n}. We are
now ready to prove a few results which will be quite useful in this section.

Lemma 2.2.1. [One-One order preserving map from a one-one map] Fix a positive integer
n and let f : {1, 2, . . . , n} → N be a one-one function. Let rng f = {f (x) : x ∈ {1, 2, . . . , n}}, the
image of f in N. Then, there exists a function g : {1, 2, . . . , n} → rng f such that g is one-one
and g preserves order, i.e., x < y implies that g(x) < g(y), for all x, y ∈ {1, 2, . . . , n}.

Proof. We use induction to prove this result. The result is clearly true for n = 1 as g : [1] →
{f(1)} given by g(1) = f(1) is a one-one and order preserving map. So, let the result be true
for n = k and suppose we have been given a one-one map f : [k + 1] → N. We need to construct
the function g which is one-one and preserves order.
As rng f is a non-empty subset of N, by the well-ordering principle, rng f contains a least
element, say α ∈ N, with f(x) = α for some x ∈ [k + 1]. Now, define h : [k] →
rng f \ {α} by
\[ h(y) = \begin{cases} f(y) & \text{if } y < x \\ f(y+1) & \text{if } y \ge x. \end{cases} \]

Then, h is one-one as f is one-one and by definition, h is onto. So, by the induction hypothesis, there
exists a map g1 : [k] → rng h = rng f \ {α} such that g1 is one-one and order preserving.
Thus, the required map g : [k + 1] → rng f is given by

\[ g(y) = \begin{cases} \alpha & \text{if } y = 1 \\ g_1(y-1) & \text{if } y \ge 2. \end{cases} \]

Verify that g is indeed one-one and order preserving and hence the required result follows.
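
The inductive construction in the proof can be followed step by step: take the least element α of the range as g(1), discard it, and repeat on what remains. The sketch below assumes a one-one f given as the list f(1), . . . , f(n); the function name `order_preserving_version` is hypothetical.

```python
def order_preserving_version(f_values):
    """Given [f(1), ..., f(n)] for a one-one f, return [g(1), ..., g(n)].

    g has the same range as f, is one-one, and satisfies g(x) < g(y)
    whenever x < y.
    """
    if len(set(f_values)) != len(f_values):
        raise ValueError("f is not one-one")
    remaining = list(f_values)
    g_values = []
    while remaining:
        alpha = min(remaining)   # least element of the remaining range
        g_values.append(alpha)   # it becomes the next value of g
        remaining.remove(alpha)  # pass to rng f \ {alpha}, as in the proof
    return g_values
```

In effect g lists rng f in increasing order, so the result agrees with sorting the values of f.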

As an application of Lemma 2.2.1, we prove the following result.

Lemma 2.2.2. [Injection] Let f : [m] → {1, 2, . . . , n} be a one-one function for some m, n ∈ N.
Then m ≤ n.

Proof. As {1, 2, . . . , n} ⊆ N, by Lemma 2.2.1 there exists a function g : [m] → rng f ⊆

{1, 2, . . . , n} which is one-one and order preserving. We claim that g(x) ≥ x, for all x ∈ [m].
Suppose the claim is false. Then, the set S = {ℓ ∈ [m] : g(ℓ) < ℓ} is a non-empty subset of N.
Hence by the well ordering principle, S contains a least element, say k ∈ [m] such that g(k) < k.
Clearly k ≠ 1 as g(1) ≥ 1. So, we see that g(k − 1) ≥ k − 1 and g(k) < k. But, g is order
preserving and hence
g(k) > g(k − 1) ≥ k − 1,
a contradiction to g(k) < k. Thus, the claim is true.
As g is order preserving and g(x) ∈ {1, 2, . . . , n}, one has n ≥ g(m) ≥ m. Thus, n ≥ m and
hence the required result follows.

As an immediate corollary one has the following result. The proof is left for the reader.

Lemma 2.2.3. [Bijection] Let f : [m] → {1, 2, . . . , n} be a bijection for some m, n ∈ N. Then
m = n.

Ans: If f : [m] → {1, 2, . . . , n} is one-one and onto then the function f⁻¹ : {1, 2, . . . , n} → [m]
is also one-one. By Lemma 2.2.2, f being one-one implies m ≤ n and f⁻¹ being one-one implies n ≤ m. Hence, m = n.
The following remark helps us to define cardinality of a finite set.

Remark 2.2.4. [Cardinality of a finite set] Let X be a finite set and suppose there exist
m, n ∈ N and bijections f : [m] → X and g : {1, 2, . . . , n} → X. As g is a bijection, g⁻¹ : X →
{1, 2, . . . , n} is also a bijection and hence the map g⁻¹ ◦ f : [m] → {1, 2, . . . , n} is a bijection.
Thus, by Lemma 2.2.3, m = n. Thus we see that if X is a finite set then the number m for which
there is a bijection f : [m] → X is unique. This number m is called the cardinality of X and
is generally denoted by |X|. Hence, for any positive integer m, |[m]| = m.

We now assemble a few important facts on cardinality.

Fact 2.2.5. 1. Let X and Y be two disjoint sets and let f : X → {1, 2, . . . , n} and g : Y →
[m] be two bijections. Then, the function h : X ∪ Y → [m + n] defined by
\[ h(x) = \begin{cases} f(x) & \text{if } x \in X \\ g(x) + n & \text{if } x \in Y \end{cases} \]
is a bijection.

2. Fix n ≥ 2 and let f : X → {1, 2, . . . , n} be a bijection such that for a fixed element a ∈ X,
one has f(a) = k. Then g : X \ {a} → {1, 2, . . . , n − 1}, defined by
\[ g(x) = \begin{cases} f(x) & \text{if } f(x) \le k - 1 \\ f(x) - 1 & \text{if } f(x) \ge k + 1 \end{cases} \]
is a bijection.
3. For any positive integer n and k, there is no bijection from {1, 2, . . . , n} to [n + k].
Proof. Use Lemma 2.2.3.

4. Any subset of {1, 2, . . . , n} is finite.

Proof. We use PMI to prove this. It is true for n = 1. Let the result be true for
{1, 2, . . . , n − 1}. Now, let S ⊆ {1, 2, . . . , n}. If n ∉ S, then S ⊆ {1, 2, . . . , n − 1} and
hence, by the induction hypothesis, S is finite. If n ∈ S, let T = S \ {n}. Then T ⊆ {1, 2, . . . , n − 1}
and so, by the induction hypothesis, T is finite. Hence by Fact 2.2.5.1, S is finite as S is the
disjoint union of T and {n}.

5. Any subset of a finite set is finite.

Proof. Let |S| = n, for some n ∈ N. Then, there is a bijection f : S → {1, 2, . . . , n}. Let
T ⊆ S. If T is empty then there is nothing to prove. Else, consider the restriction fT : T → f(T)
of f to T. This map is a bijection. By Fact 2.2.5.4, f(T) ⊆ {1, 2, . . . , n} is finite, so there is a
bijection from f(T) to [k], for some k ∈ N. Composing it with fT gives a bijection from T to [k]
and hence T is finite.

6. The set N is not finite.

Proof. Assume that the set N is finite and |N| = n, for some natural number n. So, there is a
bijection f : N → {1, 2, . . . , n}. But, {1, 2, . . . , n, n + 1} ⊆ N and therefore the restriction of f
to {1, 2, . . . , n, n + 1} is a one-one map from [n + 1] to {1, 2, . . . , n}. Thus, by Lemma 2.2.2,
n + 1 ≤ n, a contradiction.
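
The bijections of Fact 2.2.5.1 and Fact 2.2.5.2 are easy to try out on small sets. In the sketch below a bijection is stored as a Python dictionary from elements to their labels; the names `union_bijection` and `remove_point` are hypothetical and the inputs are assumed to satisfy the hypotheses of the two facts.

```python
def union_bijection(f, g, n):
    """Fact 2.2.5.1: f maps X onto [n], g maps Y onto [m], X and Y disjoint.

    Returns h on the union of X and Y with h = f on X and h = g + n on Y.
    """
    h = dict(f)
    for y, value in g.items():
        h[y] = value + n
    return h


def remove_point(f, a):
    """Fact 2.2.5.2: f maps X onto [n]; returns g mapping X minus {a} onto [n - 1]."""
    k = f[a]
    # Labels below k are kept; labels above k are shifted down by one.
    return {x: (v if v < k else v - 1) for x, v in f.items() if x != a}
```

For instance, with f = {'a': 1, 'b': 2} and g = {'c': 1, 'd': 2, 'e': 3}, the map `union_bijection(f, g, 2)` uses each label in [5] exactly once.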

Exercise 2.2.6. 1. Let X and Y be two disjoint sets with |X| = m and |Y | = n. Then
|X ∪ Y | = m + n.
2. Let X be a nonempty finite set and let Y ⊆ X. Then |Y| ≤ |X|. In particular, if Y ⊊ X
then |Y| < |X|.
3. Let X be a finite nonempty set and α be a fixed symbol. Now, consider the set Y = {(α, a) |
a ∈ X}. Then |X| = |Y |.
Ans: Verify that f : X → Y defined by f (a) = (α, a), for all a ∈ X, is a bijection.
4. Let X be a nonempty finite set. Then, for any set Y , |X| = |X \ Y | + |X ∩ Y |.
5. Let X and Y be two finite sets then |X ∪ Y | = |X| + |Y | − |X ∩ Y |.
Proof. We know X ∪ Y = (X \ Y ) ∪ (X ∩ Y ) ∪ (Y \ X). As the sets X \ Y , X ∩ Y , and
Y \ X are finite and pairwise disjoint, the result follows from Exercise 2.2.6.1.

To proceed with the next definition, recall that the sets X and Y are said to be equivalent if
there exists a bijection between X and Y.

Definition 2.2.7. [Finite/Countably finite and Infinite/Countably infinite sets]

1. A set X is said to be finite or countably finite if either X is empty or X is equivalent
to [m], for some m ∈ N. A set which is not finite is called an infinite set.
2. A set which is either finite or is equivalent to N is called a countable set. In particular,
a set which is equivalent to N is called a countably infinite set.

We now give a few useful criteria to determine whether a set is finite or infinite.
Fact 2.2.8. 1. Let X be an infinite set and Y be a finite set. Then X \ Y is also infinite. In
particular, if a ∈ X, then X \ {a} is also infinite.
2. A set X is infinite if and only if there is a one-one function f : N → X.
Proof. Let X be infinite. So X ≠ ∅. Let $a_1 \in X$. Put $f(1) = a_1$ and $X_1 = X \setminus \{a_1\}$.
By Fact 2.2.8.1, $X_1$ is infinite. Assume that we have defined f(1), . . . , f(k) and obtained
$X_k = X_{k-1} \setminus \{a_k\}$. As $X_{k-1}$ was infinite, by Fact 2.2.8.1, $X_k$ is also infinite. Hence $X_k \ne \emptyset$.
Let $a_{k+1} \in X_k$. Define $f(k+1) = a_{k+1}$ and $X_{k+1} = X_k \setminus \{a_{k+1}\}$. By applying induction,
f gets defined on N. Notice that by construction $a_{k+1} \notin \{a_1, \ldots, a_k\}$. Hence f is one-one.
Conversely, let f : N → X be one-one. Then f : N → f(N) is a bijection. Thus, N is
equivalent to f(N). So, X contains f(N), a countably infinite set. Thus, using Fact 2.2.5.5,
X is infinite as well.

3. A set is infinite if and only if it is equivalent to a proper subset of itself.

Proof. Let S be an infinite set. Then, by Fact 2.2.8.2, there is a one-one function f : N → S.
Now define a map g : S → S \ {f(1)} by
\[ g(x) = \begin{cases} x & \text{if } x \notin f(\mathbb{N}) \\ f(k+1) & \text{if } x = f(k). \end{cases} \]

Then, g is indeed a bijection. Thus, S is equivalent to its proper subset S \ {f(1)}.

Conversely, let T be a proper subset of a set S such that S and T are equivalent. Suppose S
is finite. Then, by Fact 2.2.5.5, T is finite with |T | < |S|. But, by Remark 2.2.4 |S| = |T |,
a contradiction.

Exercise 2.2.9. 1. Let X be an infinite set and let Y ⊇ X. Then Y is also infinite.
Ans: If Y is finite then by Fact 2.2.5.5, X is finite, contradicting the infiniteness of X.
2. Define f : N → Z by
\[ f(x) = \begin{cases} -\dfrac{x}{2} & \text{if } x \text{ is even} \\[4pt] \dfrac{x-1}{2} & \text{if } x \text{ is odd.} \end{cases} \]
Prove that f gives an equivalence between N and Z. Thus, Z is a countably infinite set.
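
The map f of the last exercise can be tabulated to see how it alternates between the nonnegative and the negative integers. The sketch below also gives a candidate inverse, named `f_inverse` here as an assumption of this illustration; checking the two round trips on a finite window supports, but does not replace, the proof asked for in the exercise.

```python
def f(x):
    """f(x) = -x/2 for even x and (x - 1)/2 for odd x, with x in N."""
    return -(x // 2) if x % 2 == 0 else (x - 1) // 2


def f_inverse(z):
    """Candidate inverse: nonnegative z goes to 2z + 1, negative z to -2z."""
    return 2 * z + 1 if z >= 0 else -2 * z
```

The first few values f(1), f(2), f(3), . . . run through 0, −1, 1, −2, 2, . . ., so every integer is reached exactly once.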

2.3 Countable and Uncountable sets

In the previous section we learnt that N is a countably infinite set. We now show that the set
N × N is also countably infinite.

Lemma 2.3.1. The set N × N is countably infinite.

Proof. Verify that N × N is equivalent to the set A = {(x, y) ∈ N × N : y ≤ x} by using the map
g : N × N → A defined by g(x, y) = (x + y − 1, y). Further, use the map f : A → N defined by
$f(x, y) = \frac{x(x-1)}{2} + y$ to show that A is equivalent to N.

We now present another proof using the even and odd numbers. Note that this idea can be
suitably generalized to replace 2 by any prime number.
Alternate. Define a map h : N × N → N by $h(x, y) = 2^{x-1}(2y - 1)$. Then, h is one-one as
h(x, y) = h(m, n) if and only if $2^{x-1}(2y - 1) = 2^{m-1}(2n - 1)$. Now, if x = m then 2y − 1 = 2n − 1
and hence y = n. Therefore (x, y) = (m, n). If x > m then $2^{x-m}(2y - 1) = 2n - 1$, a contradiction
as the left hand side is an even number whereas the right hand side is an odd number. The case
x < m is similar.
h is onto as every x ∈ N can be uniquely written as $x = 2^{r-1}(2n - 1)$, for some r, n ≥ 1.
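
Both proofs give explicit maps that can be tested on a finite window. In the sketch below, `pair_by_diagonals` is the composition f ◦ g from the first proof, `pair_by_powers_of_two` is the map h from the alternate proof, and `unpair` recovers (x, y) from h(x, y) by stripping factors of 2. All three names are hypothetical.

```python
def pair_by_diagonals(x, y):
    """f(g(x, y)) with g(x, y) = (x + y - 1, y) and f(x, y) = x(x - 1)/2 + y."""
    s = x + y - 1
    return s * (s - 1) // 2 + y


def pair_by_powers_of_two(x, y):
    """h(x, y) = 2^(x - 1) * (2y - 1)."""
    return 2 ** (x - 1) * (2 * y - 1)


def unpair(n):
    """Inverse of h: write n as 2^(x - 1) times an odd number 2y - 1."""
    x = 1
    while n % 2 == 0:
        n //= 2
        x += 1
    return x, (n + 1) // 2
```

The diagonal map sends the pairs with x + y ≤ d + 1 onto an initial segment of N, while h spreads pairs out much faster, since its value doubles with each step in x.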

Exercise 2.3.2. Let $Q^+ = \left\{ \frac{m}{n} : m, n \in \mathbb{N}, \gcd(m, n) = 1 \right\}$ and $Q^- = \{-x : x \in Q^+\}$.
1. Then prove that Q+ is countably infinite.
2. Thus conclude that Q− is countably infinite as well.
3. Therefore, prove that Q is countably infinite.

2.3.1 Cantor’s Lemma

To proceed further, we present Cantor’s experiment. To do so, recall that for any set X, P(X)
denotes the power set of X, i.e., P(X) is the set containing all subsets of X.

Cantor’s Experiment for the student: Why does it happen?

Take a plain paper.
1. On the left draw an oval (of vertical length) and write the elements of {1, 2, 3, 4}
inside it, one below the other. On the right draw a similar but large oval and write
the elements of P({1, 2, 3, 4}) inside it, one below the other.
2. Now draw a directed line from 1 (on the left) to any element on the right. Repeat
this for 2, 3 and 4. We have drawn a function. Call it f .
3. Notice that f(1), f(2), f(3) and f(4) are sets. Find out the set X = {i : i ∉ f(i)}.
Locate this set on the right.
4. It is guaranteed that you do not have a directed line touching X. Why?

Lemma 2.3.3. [Cantor] Let S be a set and f : S → P(S) be a function. Then, there exists
A ∈ P(S) which does not have a pre-image. That is, there is no surjection from S to P(S).

Proof. On the contrary assume that there exists f : S → P(S) such that f is a surjection. Now,
consider the set A = {x ∈ S : x ∉ f(x)} ∈ P(S). As f is a surjection, there exists s ∈ S with
f(s) = A. So, A = f(s) = {x ∈ S : x ∉ f(x)}. We now show that s belongs neither to A nor to
its complement A′ = S \ A.
If s ∈ A then, by definition of A, s ∉ f(s) = A. Similarly, s ∉ A means that s ∈ f(s) = A.
Thus, s ∉ A ∪ A′ = S, a contradiction.
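
For a small set, Cantor's Lemma can be confirmed by brute force, which also answers the question posed in Cantor's experiment. The sketch below runs over every function f : S → P(S) for a finite S and checks that the set A = {x ∈ S : x ∉ f(x)} is never a value of f. The function names are hypothetical, and the exhaustive check is feasible only for very small S.

```python
from itertools import combinations, product


def power_set(elements):
    """All subsets of the given finite collection, as frozensets."""
    items = list(elements)
    return [frozenset(c)
            for r in range(len(items) + 1)
            for c in combinations(items, r)]


def diagonal_set(f):
    """A = {x in S : x not in f(x)} for f given as a dict from S to subsets."""
    return frozenset(x for x in f if x not in f[x])


def no_function_hits_its_diagonal(elements):
    """Check every f : S -> P(S) and confirm that A is never in the image."""
    items = list(elements)
    subsets = power_set(items)
    for images in product(subsets, repeat=len(items)):
        f = dict(zip(items, images))
        if diagonal_set(f) in f.values():
            return False
    return True
```

With S = {1, 2, 3} there are 8³ = 512 functions to inspect, and for each of them the set A is missed, exactly as the proof predicts.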

Remark 2.3.4. [Uncountable set] Cantor’s Lemma states that one cannot have a bijection
between a set and its power set. So, the sets N and P(N) cannot be equivalent. Thus, the set
P(N) is infinite but cannot be countably infinite. The sets that are not countable are called
uncountable sets.
Definition 2.3.5. 1. [Enumeration] Let A be a countably infinite set. Then, by definition,
there is a bijection f : N → A. So, we can list all the elements of A as f (1), f (2), . . .. This
list is called an enumeration of the elements of A.
2. [Sequence] An infinite sequence of a non-empty set X is a function f : N → X and is
represented by $\{f_i\}_{i \in \mathbb{N}} = \{f_1, f_2, \ldots\}$.
Example 2.3.6. 1. Let S be the set of all 0-1-sequences, i.e., S is the collection of all
functions x : N → {0, 1}. Or equivalently,

S = {x : x = {x1, x2, . . .} where for each i ∈ N, xi ∈ {0, 1}}.

Define f : S → P(N) as

f(x) = f({x1, x2, . . .}) = {n : xn = 1}.

Then f is a bijection. Hence, S is uncountable by Cantor’s lemma.


2. Let T = {x ∈ (0, 1) | x has a decimal expansion containing the digits 0 and 1 only}. Then
T is uncountable.
Proof. One proof follows by the previous idea.
Alternate. [Cantor’s diagonalization] If T is countably infinite, let x1 , x2 , · · · be an
enumeration of T . Let xn = .xn1 xn2 · · · , where xni ∈ {0, 1} is the i-th digit of xn . Put ynn = 1, if xnn = 0
and ynn = 0, otherwise. Consider the number y = .y11 y22 · · · ∈ T . Notice that for each n,
y ≠ xn , as y and xn differ in the n-th digit. That is, y ∈ T but it is not in the enumeration list. This is a contradiction.
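
The diagonal construction can be tried on a finite list. This is a sketch under the assumption that we only look at the first n digits of n given 0-1 strings; the name `diagonal_flip` and the sample rows are ours.

```python
def diagonal_flip(rows):
    """Flip the n-th digit of the n-th row (rows are 0-1 strings of length >= len(rows))."""
    return "".join("1" if rows[n][n] == "0" else "0" for n in range(len(rows)))


# A hypothetical (finite) beginning of an enumeration x1, x2, x3, x4.
rows = ["0101", "1110", "0011", "1000"]
y = diagonal_flip(rows)
print(y)  # differs from the n-th row in the n-th digit
```

Whatever the rows are, the string produced differs from the n-th row at position n, which is exactly why y cannot appear in the list.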

2.3.2 Creating Bijections

Experiment 1:
Make a horizontal list of the elements of N using ‘· · · ’ only once. Now, horizontally list
the elements of Z just below the list of N using ‘· · · ’ once. Draw vertical lines to supply a
bijection from N to Z. Can you supply another by changing the second list a little bit?

Experiment 2:
Suppose that you have an open interval (a, b). Its center is c = (a + b)/2 and the distance of the
center from one end is l/2 = (b − a)/2. View this as a line segment on the real line. Stretch (a, b)
uniformly without disturbing the center and make its length equal to L.
Where is c now (in R)? Where is c − l/2? Where is c + l/2? Where is c − α × l/2, for a fixed
α ∈ (−1, 1)?

Now, use the above idea to find a bijection from (a, b) to (s, t). [Hint: Fix the center first.]
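
One possible outcome of the experiment, written as a sketch: send the center of (a, b) to the center of (s, t) and scale the signed distance from the center. The function name `stretch` is ours, and floating point numbers are used only for illustration.

```python
def stretch(a, b, s, t):
    """Return a map h : (a, b) -> (s, t) obtained by moving the center and scaling."""
    def h(x):
        # alpha in (-1, 1) records where x sits relative to the center of (a, b).
        alpha = (x - (a + b) / 2) / ((b - a) / 2)
        return (s + t) / 2 + alpha * (t - s) / 2
    return h


h = stretch(0, 1, 3, 5)
print(h(0.5), h(0.25), h(0.75))
```

The map is linear with non-zero slope (t − s)/(b − a), so it is one-one, and its inverse is obtained by interchanging the roles of (a, b) and (s, t).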

Exercise 2.3.7. 1. Supply two bijections from (1, ∞) to (5, ∞), one by ‘scaling’ and the
other by ‘translating’.
2. Take reciprocal to supply a bijection from (0, 1) to (1, ∞). You can also use the exponential
function to get this.
3. Supply a bijection from (−1, 1) to (−∞, ∞).
4. Supply a bijection from (0, 1) to R.
5. Supply a bijection from (0, 1) × (0, 1) to R × R.

Train-Seat argument to find a bijection

Let f : P = (0, 1) → T = (3, 5) be a bijection. Imagine elements of P as PERSONS and
elements of T as seats in a TRAIN. So, f assigns a seat to each person and the train is full.
1. Now suppose a new person 0 is arriving. He wants a seat. To manage it, let us un-seat
two persons 1/2, 1/3. So, two seats f (1/2), f (1/3) are vacant. But we have 3 persons to take
those seats. Giving each person a seat is not possible.
2. Suppose that we un-seat 1/2, 1/3, · · · , 1/30. Can we manage it?
3. Suppose that we un-seat 1/2, 1/3, · · · . Can we manage it now?
4. What do we do if we had two new persons arriving? Fifty new persons arriving? A
set {a1 , a2 , · · · } of new persons arriving?

The readers are required to prove the next theorem.

Theorem 2.3.8. Let A be a set containing the set {a1 , a2 , . . .} of distinct elements and let f : A → B be a bijection.
Then, prove that, for any collection

1. {c1 , . . . , ck } of distinct elements that are outside A, the function

\[
h(x) = \begin{cases}
f(x) & \text{if } x \in A \setminus \{a_1, a_2, \ldots\}, \\
f(a_{i+k}) & \text{if } x = a_i,\ i \in \mathbb{N}, \\
f(a_i) & \text{if } x = c_i,\ i = 1, 2, \ldots, k
\end{cases}
\]

is a bijection from A ∪ {c1 , . . . , ck } to B.

2. {c1 , c2 , . . .} of distinct elements that are outside A, the function

\[
h(x) = \begin{cases}
f(x) & \text{if } x \in A \setminus \{a_1, a_2, \ldots\}, \\
f(a_{2n-1}) & \text{if } x = a_n,\ n \in \mathbb{N}, \\
f(a_{2n}) & \text{if } x = c_n,\ n \in \mathbb{N}
\end{cases}
\]

is a bijection from A ∪ {c1 , c2 , . . .} to B.
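
As an illustration of the first part, here is a sketch for one new person. We assume A = B = (0, 1), f the identity map, ai = 1/(i + 1) and c1 = 0 (these choices are ours), so that h becomes a map from [0, 1) to (0, 1). Exact rationals are used, so only rational inputs are handled.

```python
from fractions import Fraction


def h(x):
    """Train-seat map from [0, 1) to (0, 1) with a_i = 1/(i+1), c_1 = 0, f = identity."""
    x = Fraction(x)
    if not (0 <= x < 1):
        raise ValueError("x must lie in [0, 1)")
    if x == 0:                      # the new person c_1 takes the seat of a_1 = 1/2
        return Fraction(1, 2)
    if x.numerator == 1:            # x = a_i = 1/(i+1) moves to the seat of a_(i+1)
        return Fraction(1, x.denominator + 1)
    return x                        # everybody else keeps the seat


print(h(0), h(Fraction(1, 2)), h(Fraction(2, 3)))
```

Every un-seated person 1/2, 1/3, . . . shifts one place down the list, which frees exactly one seat for the newcomer.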


Exercise 2.3.9. Use Theorem 2.3.8 to give bijections from A to B, where

1. A = [0, 1) and B = (0, 1).

2. A = (0, 1) ∪ {1, 2, 3, 4} and B = (0, 1).

3. A = (0, 1) ∪ N and B = (0, 1).

4. A = [0, 1] and B = [0, 1] \ {1/1, 1/3, 1/5, · · · }.

5. A = R and B = R \ N.

6. A = (0, 1) and B = R \ N.

7. A = [0, 1] and B = R \ N.

8. A = (0, 1) and B = (1, 2) ∪ (3, 4).

9. A = R \ Z and B = R \ N.
 √
 x√ if x ∈ R \ Z, x √
 ∈
/N 2
Ans: h(x) = n 2 if x = (2n + 1) 2 is a bijection from R \ Z to R \ N.
 √
n if x = 2n 2


 √
 x√ if x ∈ R \ Z, x ∈
 /N 2
√
Alternatively, h(x) = n 2 if x = (2n − 1) 2 is a bijection from R \ N to R \ Z.
 √
−n if x = 2(n + 1) 2


2.3.3 Schröder-Bernstein Theorem

Creating bijections from injections
Let X = Y = N. Take injections f : X → Y and g : Y → X defined as f (x) = x + 2 and
g(x) = x + 1. In the picture, we have X on the left and Y on the right. If (x, y) ∈ f , we
draw a solid line joining x and y. If (y, x) ∈ g, we draw a dotted line joining y and x.

[Figure 2.1: Graphic representation of functions f and g. The elements 1, 2, 3, . . . of X are listed on the left and those of Y on the right.]

We want to create a bijection h from X to Y by erasing some of these lines.

1. No dotted line reaches 1 ∈ X, as 1 ∉ g(Y ). Thus, h(1) must be f (1) = 3. So, the dotted line (3, 4) cannot be used for h.
2. So, h(4) must be 6. So, the dotted line (6, 7) cannot be used for h.
3. So, h(7) must be 9. Continue two more steps to realize what is happening.

Thus, the bijection h : X → Y is given by
\[
h(x) = \begin{cases}
f(x), & \text{if } x = 3n - 2,\ n \in \mathbb{N}, \\
g^{-1}(x), & \text{otherwise.}
\end{cases}
\]
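
The erasing procedure above can be imitated mechanically. The sketch below is for this particular pair f (x) = x + 2, g(x) = x + 1 on N only; the helper names (`f_inv`, `g_inv`, `uses_f`) are ours. A point x uses the solid line exactly when tracing back alternately through g and f ends at a point of X that no dotted line reaches.

```python
def f(x):
    return x + 2


def g(y):
    return y + 1


def g_inv(x):
    """Pre-image of x under g, or None if x is not in g(N)."""
    return x - 1 if x >= 2 else None


def f_inv(y):
    """Pre-image of y under f, or None if y is not in f(N)."""
    return y - 2 if y >= 3 else None


def uses_f(x):
    """Trace x back through g and then f; stop when a pre-image is missing."""
    while True:
        y = g_inv(x)
        if y is None:        # the chain started in X, outside g(Y)
            return True
        x_prev = f_inv(y)
        if x_prev is None:   # the chain started in Y, outside f(X)
            return False
        x = x_prev


def h(x):
    return f(x) if uses_f(x) else g_inv(x)


print([h(x) for x in range(1, 10)])
```

On the first 30 natural numbers the values of h are exactly 1, . . . , 30, and the points that use f are those of the form 3n − 2, in agreement with the formula for h.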

Exercise 2.3.10. Take X = Y = N. Supply bijections using the given injections f : X → Y

and g : Y → X.
1. f (x) = x + 1 and g(x) = x + 2.
2. f (x) = x + 1 and g(x) = x + 3.
3. f (x) = x + 1 and g(x) = 2x.

Theorem 2.3.11. [Schröder-Bernstein: Creating a bijection] Let A and B be two non-empty

sets and let f : A → B and g : B → A be injections. Then, there exists a bijection from A to B.

Proof. If g is onto, we have nothing to prove. So, assume that g is not onto. Put O = A \ g(B),
φ = g ◦ f and E = O ∪ φ(O) ∪ φ²(O) ∪ · · · . Use φ⁰(O) to denote O. Notice that
\[
g\bigl(f(E)\bigr) = \phi(E) = \phi\Bigl(\bigcup_{n=0}^{\infty} \phi^n(O)\Bigr) = \bigcup_{n=1}^{\infty} \phi^n(O) = E \setminus O,
\]
as g does not map to O. Hence, g maps f (E) to E \ O bijectively. Recall that O is the set of
points in A that are not in the image of g, O ⊆ E and g has already mapped f (E) onto E \ O.
Hence, g must map f (E)′ to E′ bijectively. So, the function
\[
h(x) = \begin{cases}
g^{-1}(x) & \text{if } x \in E', \\
f(x) & \text{if } x \in E,
\end{cases}
\]
is a bijection from A to B.
Alternate. If g is onto, we have nothing to prove. So, assume that g is not onto. Put
O = A \ g(B), φ = g ◦ f and E = O ∪ φ(O) ∪ φ²(O) ∪ · · · . Use φ⁰(O) to denote O. Notice that
\[
\phi(E) = g\bigl(f(E)\bigr) = \phi\Bigl(\bigcup_{n=0}^{\infty} \phi^n(O)\Bigr) = \bigcup_{n=1}^{\infty} \phi^n(O) = E \setminus O,
\]
as g does not map to O. Observe that φ : E → E \ O is a bijection. Define h : A → A \ O as
\[
h(x) = \begin{cases}
x, & \text{if } x \in A \setminus E, \\
\phi(x), & \text{if } x \in E.
\end{cases}
\]
Then, note that h is a bijection and hence h⁻¹ ◦ g is a bijection from B to A.

Alternate. Let F = {T ⊆ A | g (f (T )′) ⊆ T ′}, where ′ denotes the complement (in A or in B, as appropriate).

[Figure 2.2: Depiction of Schröder-Bernstein Theorem. The map f sends T to f (T ) and the map g sends f (T )′ to g(f (T )′).]

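Note that ∅ ∈ F . Put U = ∪_{T ∈F} T . Then, U ∈ F , as
\[
g\bigl(f(U)'\bigr) = g\Bigl(f\Bigl(\bigcup_{T \in F} T\Bigr)'\Bigr) = g\Bigl(\Bigl(\bigcup_{T \in F} f(T)\Bigr)'\Bigr) = g\Bigl(\bigcap_{T \in F} f(T)'\Bigr) = \bigcap_{T \in F} g\bigl(f(T)'\bigr) \subseteq \bigcap_{T \in F} T' = U'.
\]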

Thus, U is the maximal element of F . We claim that U ′ ⊆ g (f (U )′). To see this, take
x ∈ U ′ \ g (f (U )′) and put V = U ∪ {x}. Then, f (U ) ⊆ f (V ) and so f (V )′ ⊆ f (U )′. Thus
\[
g\bigl(f(V)'\bigr) \subseteq g\bigl(f(U)'\bigr) \subseteq U' \cap \{x\}' = V',
\]
a contradiction to the maximality of U in F . So, g (f (U )′) = U ′. Now, define h : A → B as
\[
h(x) = \begin{cases}
f(x) & \text{if } x \in U, \\
g^{-1}(x) & \text{else.}
\end{cases}
\]

It is easy to see that h is a bijection.

The next two results are applications of Schröder-Bernstein theorem.

Lemma 2.3.12. [Infinite iff countably infinite subset]


1. Let X be an infinite subset of N. Then, X is countably infinite.

2. Let X = {a1 , a2 , . . .} be countably infinite and Y ⊆ X. Then Y is countable.
3. A set X is infinite if and only if X has a countably infinite subset.

Proof. Part 1. Since X is infinite, by Fact 2.2.8.2 there is a one-one function f : N → X. Also,
X ⊆ N and hence Id : X → N is a one-one function. Hence, by Schröder-Bernstein theorem,
there exists a bijection from X to N and the required result follows.
Part 2. Since X is countably infinite, by definition, there exists a bijection f : X → N. If
Y is finite then by definition, it is countable. So, assume that Y is infinite. As f is one-one,
f (Y ) is an infinite subset of N and hence by the first part, f (Y ) is countably infinite. So, let
g : f (Y ) → N be a bijection. Then the map g ◦ f gives a bijection from Y to N and hence the
required result follows.
Part 3. Since X is infinite, by Fact 2.2.8.2 there is a one-one function f : N → X. Thus, f (N)
is a countably infinite subset of X. Conversely, suppose that X has a countably infinite subset and, on the contrary, assume that X is finite. Then, by Fact 2.2.5.5,
every subset of X is finite, a contradiction to the assumption that X has a countably infinite
subset. Thus, the required result follows.

As a corollary, we have the following result.

Corollary 2.3.13. Let X be uncountable and X ⊆ Y . Then Y is uncountable.

Proof. If Y is countable, then by Lemma 2.3.12, X must be countable, a contradiction.


Theorem 2.3.14. [Power set and uncountability] If S is infinite, then P(S) is uncountable.

Proof. As S is infinite, by Fact 2.2.8.2, there is a one-one map, say f : N → S. Now, define a
map g : P(N) → P(S) as g(A) = f (A), for all A ∈ P(N). Then, g is clearly one-one and hence
g(P(N)) is uncountable (as P(N) is uncountable). Hence P(S), being a superset of g(P(N)), is
uncountable, by Corollary 2.3.13.

Theorem 2.3.15. [Countable union of countable sets] Countable union of countable sets
(union of a countable class of countable sets) is countable.

Proof. Let {Ai }i∈N be a countable class of countable sets and put X = ∪_{i∈N} Ai . If X is finite then we
are done. So, let X be infinite. Hence, by Fact 2.2.8.2, there is a one-one map f : N → X. Fix an enumeration of each Ai and define
g : X → N as g(x) = 2^i 3^k , if i is the smallest positive integer for which x ∈ Ai and x appears
at the k-th position in the enumeration of Ai . Then g is one-one. Now, by Schröder-Bernstein
theorem X is equivalent to N.
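
The coding used in the proof can be tried on a small example. This is a sketch only: the three finite lists below stand in for the enumerations of A1 , A2 , A3 and are our own choice, as is the function name `code`.

```python
def code(x, families):
    """Return 2**i * 3**k, where i is the first family containing x and k its position there."""
    for i, enumeration in enumerate(families, start=1):
        if x in enumeration:
            k = enumeration.index(x) + 1   # positions are counted from 1
            return 2 ** i * 3 ** k
    raise ValueError("x does not belong to any family")


# Hypothetical enumerations of A1, A2, A3 (note that the sets overlap).
families = [[1, 2, 3], [3, 4], [5, 1, 6]]
X = sorted({x for enumeration in families for x in enumeration})
print({x: code(x, families) for x in X})
```

Distinct elements get distinct codes because the pair (i, k) can be read off from the prime factorization of 2^i 3^k.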

Theorem 2.3.16. [Power set of N equivalent to R] The set P(N) is equivalent to [0, 1).
Furthermore, P(N) is equivalent to R.

Proof. We already know a one-one map f : P(N) → [0, 1) (see Examples 2.3.6.1 and 2.3.6.2).
Let r ∈ (0, 1). Consider the nonterminating binary representation of r. Denote by Fr the set of
positions of 1 in this representation. Now, define g : [0, 1) → P(N) by g(r) = Fr , if r ≠ 0 and
g(0) = ∅. Then g is one-one. Now, by Schröder-Bernstein theorem P(N) is equivalent to [0, 1).
The next statement follows as [0, 1) is equivalent to (0, 1) (see Exercise 2.3.17.5) and (0, 1) is
equivalent to R.

Exercise 2.3.17. 1. Give a one-one function from N to Q. Define f from Q to N as
\[
f(x) = \begin{cases}
2^r 3^s & \text{if } x = \frac{r}{s},\ \gcd(r, s) = 1,\ r > 0,\ s > 0, \\
5^r 3^s & \text{if } x = \frac{-r}{s},\ \gcd(r, s) = 1,\ r > 0,\ s > 0, \\
1 & \text{if } x = 0.
\end{cases}
\]
Argue that f is one-one. Apply Schröder-Bernstein theorem to prove that Q is equivalent
to N.
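
A quick numerical check of the map f from Q to N may help before arguing that it is one-one. The sketch below relies on the fact that Python's `Fraction` stores a rational in lowest terms with a positive denominator; the sample of rationals is our own choice.

```python
from fractions import Fraction


def f(x):
    """The map Q -> N of the exercise: 2^r 3^s, 5^r 3^s or 1."""
    x = Fraction(x)                       # stored in lowest terms, denominator > 0
    r, s = abs(x.numerator), x.denominator
    if x > 0:
        return 2 ** r * 3 ** s
    if x < 0:
        return 5 ** r * 3 ** s
    return 1


sample = {Fraction(p, q) for p in range(-6, 7) for q in range(1, 7)}
print(f(Fraction(1, 2)), f(Fraction(-1, 2)), f(0))
```

On this sample no two rationals share a value of f , which is what unique factorization predicts.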
2. Give a one-one map from (0, 1) to (0, 1) × (0, 1). For each x ∈ (0, 1), let .x1 x2 · · · be the
nonterminating decimal representation¹ of x. For x = .x1 x2 x3 · · · , y = .y1 y2 y3 · · · , define
f (x, y) = .x1 y1 x2 y2 x3 y3 · · · . Argue that f is an injection from (0, 1) × (0, 1) to (0, 1).
Hence, show that (0, 1) is equivalent to (0, 1) × (0, 1). Hence, show that R × R is equivalent
to R.
3. Fix k ∈ N. Supply a one-one map from N to N^k , the k-fold cartesian product of N. Now,
use k distinct primes to supply a one-one map from N^k to N. Hence, conclude that N^k is
equivalent to N.

4. Supply a bijection from (0, 1) to (1, 2) ∪ (3, 4) ∪ (5, 6) ∪ (7, 8) ∪ · · · .

Ans: (0, 1) = ∪_{n∈N} [1/(n + 1), 1/n). We already know that [1/(n + 1), 1/n) is equivalent to any open interval.
5. Show using Schröder-Bernstein that (0, 1) is equivalent to (0, 1].
6. Let X be a set such that f : N → X is an onto function. Then, either X is a finite set or
X is countably infinite.
7. Let X = {a1 , a2 , . . .} be a countably infinite set and let Y ⊆ X. Then, Y is countable.
8. [Cardinal numbers in brief ]
(a) Cardinal numbers are symbols which are associated with sets such that equivalent
sets get the same symbol. By $\overline{A}$ we denote the cardinal number associated with A.
i. If there is an injection f : A → B, then we write $\overline{A} \le \overline{B}$. By $\overline{A} \ge \overline{B}$, we mean
that $\overline{B} \le \overline{A}$.
ii. If there is a bijection f : A → B, then we write $\overline{A} = \overline{B}$.
iii. We write $\overline{\{1, 2, \ldots, n\}}$ as n and $\overline{\emptyset}$ as 0. Thus, for a finite set A, we have $\overline{A} = |A|$.
iv. We use ℵ0 to denote $\overline{\mathbb{N}}$. If $x = \overline{A}$ is a cardinal number, by $2^x$ we mean $\overline{P(A)}$.
(b) Facts about cardinal numbers:
i. If x, y, z are cardinal numbers such that x ≤ y and y ≤ z, then x ≤ z. In other
words it says, if there is a one-one map from A to B and a one-one map from B
to C, then there is a one-one map from A to C.
ii. Let x be any cardinal number. Then $x < 2^x$, that is, $x \le 2^x$ and $x \ne 2^x$. This is Cantor’s lemma.
¹ Recall that every nonzero real number has a unique nonterminating decimal representation.

iii. The cardinal numbers we know till now are 0, 1, 2, 3, . . . , $\aleph_0 = \overline{\mathbb{N}}$, $2^{\aleph_0} = \overline{\mathbb{R}}$, $2^{2^{\aleph_0}}$, . . ..
iv. The cardinal numbers $\aleph_0 = \overline{\mathbb{N}}$, $2^{\aleph_0} = \overline{\mathbb{R}}$, $2^{2^{\aleph_0}}$, . . . are called the infinite cardinal
numbers.
v. The ‘generalized continuum hypothesis’ says that there is no cardinal number
strictly between an infinite cardinal number x and $2^x$.

9. Let A be the set of all infinite sequences formed using 0, 1 and B be the set of all infinite
sequences formed using 0, 1, 2. Which one has larger cardinality and why?
Ans: For (x) = x1 , x2 , · · · ∈ A, let us define f (x) = .x1 x2 · · · (binary). Then f : A → [0, 1]
is a surjection and hence $\overline{A} \ge \overline{[0, 1]}$. For (y) = y1 , y2 , · · · ∈ B, let us define g(y) = .y1 y2 · · ·
(decimal). Then g : B → [0, 1] is one-one. So, $\overline{B} \le \overline{[0, 1]}$ and hence $\overline{B} \le \overline{A}$. Also,
IdA : A → B is an injection. Thus, $\overline{A} \le \overline{B}$. Hence, they have the same cardinality.
10. Write R as a union of pairwise disjoint sets of size 5.
Ans: Note that R = (−∞, 2] ∪ (2, 3] ∪ (3, 6] ∪ (6, 7) ∪ [7, ∞) and these five pairwise disjoint sets have the
same cardinality. Let f, g, h and t be bijections from (−∞, 2] to (2, 3], (3, 6], (6, 7), [7, ∞),
respectively. Then R = ∪_{r∈(−∞,2]} {r, f (r), g(r), h(r), t(r)}.

11. Let S be a countable set of points on the unit circle in R². Consider the line segments
Ls with one end at the origin and the other end at a point s ∈ S. Fix these lines. We
are allowed to rotate the circle anticlockwise (the lines do not move). Let T be another
countable set of points on the unit circle. Can we rotate the circle by an angle θ so that
no line Ls touches any of the points of T ?
Ans: For t ∈ T and s ∈ S, let θts ∈ [0, 2π) be the angle of rotation required so that the point t touches the line Ls . The set of all
θts is countable and the set [0, 2π) is uncountable. Thus, yes it is indeed possible.
12. A complex number is algebraic if it is a root of a polynomial equation with integer coefficients.
All other numbers are transcendental. Show that the set of algebraic numbers
is countable.
Ans: For each point a = (a0 , a1 , a2 , . . . , ak ) ∈ Z^k × (Z \ {0}), let Sa be the set of roots of the
polynomial equation a0 + a1 x + · · · + ak x^k = 0. Take
\[
A_k = \bigcup_{a \in \mathbb{Z}^k \times (\mathbb{Z} \setminus \{0\})} S_a \quad \text{and} \quad A = \bigcup_{k=1}^{\infty} A_k.
\]
Then A is the set of all algebraic numbers. The set A is countable as each Sa is finite, so each Ak is countable,
and the union is over a countable set.
13. Give a bijection from R to R \ Q.
Ans: Recall that Q can be enumerated. First get a bijection from R \ Q to itself. Now, use
train-seat argument to adjust Q.

2.4 Integers and Modular Arithmetic

In this section, we study some properties of integers. We start with the ‘division algorithm’.
Lemma 2.4.1. [Division algorithm] Let a and b be two integers with b > 0. Then there exist
unique integers q, r such that a = qb + r, where 0 ≤ r < b. The integer q is called the quotient
and r, the remainder.

Proof. Existence: Take S = {a + bx | x ∈ Z} ∩ N0 . Then a + |a|b ∈ S. Hence, S is a nonempty

subset of N0 . Therefore, by Well-Ordering Principle, S contains its minimum, say s0 . So,
s0 = a + bx0 , for some x0 ∈ Z. Notice that s0 ≥ 0. We claim that s0 < b.
If s0 ≥ b then s0 − b ≥ 0 and hence s0 − b = a + b(x0 − 1) ∈ S, a contradiction to s0 being the
minimum element of S. Put q = −x0 and r = s0 . Thus, we have obtained q and r such that
a = qb + r with 0 ≤ r < b.
Uniqueness: Assume that there exist integers q1 , q2 , r1 and r2 satisfying a = q1 b+r1 , 0 ≤ r1 < b,
a = q2 b+r2 , and 0 ≤ r2 < b. Without loss of generality, we assume r1 ≤ r2 . Then, 0 ≤ r2 −r1 < b.
Notice that r2 − r1 = (q1 − q2 )b. So, 0 ≤ (q1 − q2 )b < b. But the only integer multiple of b which
lies in [0, b) is 0. Hence, q1 − q2 = 0. Thus, r1 = r2 as well. This completes the proof.

Definition 2.4.2. [Divisibility]

1. [Divisor] Let a, b ∈ Z with b ≠ 0. If a = bc, for some c ∈ Z then b is said to divide (be a
divisor of) a and is denoted b | a.
Discussion: If a is a nonzero integer then the set of positive divisors of a is always nonempty
(as 1 | a) and finite (as a positive divisor of a is less than or equal to |a|).

2. [Greatest common divisor/Highest common factor] Let a and b be two nonzero integers.
Then the set S of their common positive divisors is nonempty and finite. Thus, S contains
its greatest element. This element is called the greatest common divisor of a and b and
is denoted gcd(a, b). In some books, the gcd is also called the highest common factor.

3. [Relatively prime/Co-prime integers] An integer a is said to be relatively prime to an
integer b if gcd(a, b) = 1. Or, two integers a and b are said to be co-prime if gcd(a, b) = 1.

The next remark follows directly from the definition and the division algorithm.

Remark 2.4.3. Let a, b ∈ Z \ {0} and d = gcd(a, b). Then, for any positive common divisor c
of a and b, one has c | d.

The next result is often stated as ‘the gcd(a, b) is a linear combination of a and b’.

Theorem 2.4.4. [Bézout’s identity] Let a and b be two nonzero integers. Then, there exist
integers x0 , y0 such that d = ax0 + by0 , where d = gcd(a, b).

Proof. Consider the set S = {ax + by | x, y ∈ Z} ∩ N. Then, either a ∈ S or −a ∈ S. Thus, S is

a nonempty subset of N. Hence, by Well-ordering principle, S contains its least element, say d.
As d ∈ S, we have d = ax0 + by0 , for some x0 , y0 ∈ Z. We claim that d = gcd(a, b).
Note that d is positive. Let c be any positive common divisor of a and b. Then c | ax0 +by0 = d
as x0 , y0 ∈ Z. We now show that d | a and d | b.
By division algorithm, there exist integers q and r such that a = dq + r, with 0 ≤ r < d. Thus,
we need to show that r = 0.
On the contrary, assume that r > 0. Then

r = a − dq = a − q(ax0 + by0 ) = a(1 − qx0 ) + b(−qy0 ) ∈ {ax + by | x, y ∈ Z}.


Hence, r is a positive integer in S which is strictly less than d. This contradicts the fact that d
is the least element of S. Thus, r = 0 and hence d|a. Similarly, d|b.

The division algorithm gives us an idea to algorithmically compute the greatest common divisor
of two integers, commonly known as the Euclid’s algorithm.

Discussion 2.4.5. 1. Let a, b ∈ Z\{0}. By division algorithm, a = |b|q +r, for some integers
q, r ∈ Z with 0 ≤ r < |b|. Then,

gcd(a, b) = gcd(a, |b|) = gcd(|b|, r).

To show the second equality, note that r = a − |b|q and hence gcd(a, |b|) | r. Thus,
gcd(a, |b|) | gcd(|b|, r). Similarly, gcd(|b|, r) | gcd(a, |b|) as a = |b|q + r.

2. We can now apply the above idea repeatedly to find the greatest common divisor of two
given nonzero integers. This is called the Euclid’s algorithm. For example, to find
gcd(155, −275), we proceed as follows

−275 = (−2) · 155 + 35 (so, gcd(−275, 155) = gcd(155, 35))

155 = 4 · 35 + 15 (so, gcd(155, 35) = gcd(35, 15))
35 = 2 · 15 + 5 (so, gcd(35, 15) = gcd(15, 5))
15 = 3 · 5 (so, gcd(15, 5) = 5).

To write 5 = gcd(155, −275) in the form 155x0 + (−275)y0 , notice that

5 = 35−2·15 = 35−2(155−4·35) = 9·35−2·155 = 9(−275+2·155)−2·155 = 9·(−275)+16·155.

Also, note that 275 = 5·55 and 155 = 5·31 and thus, 5 = (9+31x)·(−275)+(16+55x)·155,
for all x ∈ Z. Therefore, we see that there are infinitely many choices for the pair
(x, y) ∈ Z², for which d = ax + by.

3. [Euclid’s algorithm] In general, given two nonzero integers a and b, the algorithm proceeds
as follows:

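\[
\begin{aligned}
a &= b q_0 + r_0 \text{ with } 0 \le r_0 < b, & b &= r_0 q_1 + r_1 \text{ with } 0 \le r_1 < r_0, \\
r_0 &= r_1 q_2 + r_2 \text{ with } 0 \le r_2 < r_1, & r_1 &= r_2 q_3 + r_3 \text{ with } 0 \le r_3 < r_2, \\
&\;\;\vdots & &\;\;\vdots \\
r_{\ell-1} &= r_\ell q_{\ell+1} + r_{\ell+1} \text{ with } 0 \le r_{\ell+1} < r_\ell, & r_\ell &= r_{\ell+1} q_{\ell+2}.
\end{aligned}
\]

The process will take at most b − 1 steps as 0 ≤ r0 < b. Also, note that gcd(a, b) = r_{ℓ+1}
and r_{ℓ+1} can be recursively obtained, using backtracking. That is,
\[
r_{\ell+1} = r_{\ell-1} - r_\ell q_{\ell+1} = r_{\ell-1} - q_{\ell+1}(r_{\ell-2} - r_{\ell-1} q_\ell) = r_{\ell-1}(1 + q_{\ell+1} q_\ell) - q_{\ell+1} r_{\ell-2} = \cdots.
\]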
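
The backtracking can be carried along with the divisions, so that the coefficients x0 , y0 of Bézout's identity are available at the end. The following is a sketch of this standard 'extended' version of the algorithm; the function name is ours, and Python's floor division is used for the quotients, so the intermediate remainders may be negative even though the final gcd is made non-negative.

```python
def extended_gcd(a, b):
    """Return (d, x, y) with d = gcd(a, b) >= 0 and a*x + b*y == d."""
    old_r, r = a, b
    old_x, x = 1, 0
    old_y, y = 0, 1
    while r != 0:
        q = old_r // r
        # Invariant: a*old_x + b*old_y == old_r and a*x + b*y == r.
        old_r, r = r, old_r - q * r
        old_x, x = x, old_x - q * x
        old_y, y = y, old_y - q * y
    if old_r < 0:
        old_r, old_x, old_y = -old_r, -old_x, -old_y
    return old_r, old_x, old_y


d, x, y = extended_gcd(155, -275)
print(d, 155 * x + (-275) * y)
```

The pair (x, y) returned need not be the pair (16, 9) found by hand above; as noted earlier, there are infinitely many valid pairs.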

Exercise 2.4.6. 1. Let a, b ∈ N with gcd(a, b) = d. Then gcd(a/d, b/d) = 1.

2. Prove that the system 15x + 12y = b has a solution for x, y ∈ Z if and only if 3 divides b.

3. [Linear Diophantine Equation] Let a, b, c ∈ Z \ {0}. Then the linear system ax + by = c,

in the unknowns x, y ∈ Z has a solution if and only if gcd(a, b) divides c. Furthermore,
determine all pairs (x, y) ∈ Z × Z such that ax + by is indeed c.

4. Prove that gcd(a, bc) = 1 if and only if gcd(a, b) = 1 and gcd(a, c) = 1, for any three
nonzero integers a, b and c.

5. Euclid’s algorithm can sometimes be applied to check whether two numbers, which are functions
of an unknown integer n, are relatively prime or not. For example, we can use the
algorithm to prove that gcd(2n + 3, 5n + 7) = 1 for every n ∈ Z.

6. [Informative] Suppose a milkman has only 3 cans of sizes 7, 9 and 16 liters. What is the
minimum number of operations required to deliver 1 liter of milk to a customer? Explain.

To proceed further, we need the following definitions.

Definition 2.4.7. [Prime/Composite numbers]

1. [Unity] The positive integer 1 is called the unity (or the unit element) of Z.

2. [Prime] A positive integer p is said to be a prime, if p has exactly two positive divisors,
namely, 1 and p.

3. [Composite] A positive integer r is called composite if r ≠ 1 and r is not a prime.

We are now ready to prove an important result that helps us in proving the fundamental
theorem of arithmetic.

Lemma 2.4.8. [Euclid’s lemma] Let p be a prime and let a, b ∈ Z. If p | ab then either p | a
or p | b.

Proof. If p | a, we are done. So, assume that p ∤ a. As p is a prime, gcd(p, a) = 1. Thus, we can
find integers x, y such that 1 = ax + py. As p | ab, we have

p | abx + pby = b(ax + py) = b · 1 = b.

Thus, if p|ab then either p|a or p|b.

One also has the following result.

Corollary 2.4.9. Let n be an integer such that n | ab and gcd(n, a) = 1. Then n | b.

Proof. As gcd(n, a) = 1, there exists x0 , y0 ∈ Z such that nx0 + ay0 = 1. Hence, b = aby0 + nbx0 .
As n divides ab, n divides aby0 + n(bx0 ) = b. Thus, the required result follows.

Now, we are ready to prove the fundamental theorem of arithmetic that states that ‘every
positive integer greater than 1 is either a prime or is a product of primes. This product is
unique, except for the order in which the prime factors appear’.

Theorem 2.4.10. [Fundamental theorem of arithmetic] Let n ∈ N with n ≥ 2. Then
there exist prime numbers p1 > p2 > · · · > pk and positive integers s1 , s2 , . . . , sk such that
n = p1^{s1} p2^{s2} · · · pk^{sk} , for some k ≥ 1. Moreover, if n also equals q1^{t1} q2^{t2} · · · qℓ^{tℓ} , for distinct primes
q1 > q2 > · · · > qℓ and positive integers t1 , t2 , . . . , tℓ then k = ℓ and for each i, 1 ≤ i ≤ k, pi = qi
and si = ti .

Proof. We prove the result using the strong form of the principle of mathematical induction.
The result is clearly true for n = 2. So, let the result be true for all m, 2 ≤ m ≤ n − 1. If n is a
prime, then we have nothing to prove. Else, n has a prime divisor p. Then apply induction on
n/p to get the required result. The uniqueness of the factorization follows by repeatedly applying Euclid’s lemma (Lemma 2.4.8).

Theorem 2.4.11. [Euclid: Infinitude of primes] The number of primes is infinite.

Proof. On the contrary assume that the number of primes is finite, say p1 = 2, p2 = 3, . . . , pk .
Now, consider the positive integer N = p1 p2 · · · pk + 1. Then, we see that none of the primes
p1 , p2 , . . . , pk divides N which contradicts Theorem 2.4.10. Thus, the result follows.

Proposition 2.4.12. [Primality testing] Let n ∈ N with n ≥ 2. Suppose that no prime
p ≤ √n divides n. Then, n is prime.

Proof. On the contrary, suppose n = xy, for 2 ≤ x, y < n. Then, either x ≤ √n or y ≤ √n. Without loss of
generality, assume x ≤ √n. If x is a prime, then x is a prime divisor of n with x ≤ √n, a contradiction. Else, take a prime divisor of x to get
a contradiction.
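
The proposition gives the usual trial-division test. In the sketch below we divide by every integer from 2 up to √n rather than only by the primes in that range; this is a simplification of ours which does some redundant work but gives the same answer.

```python
from math import isqrt


def is_prime(n):
    """Trial division: n >= 2 is prime if no integer in [2, sqrt(n)] divides it."""
    if n < 2:
        return False
    for p in range(2, isqrt(n) + 1):
        if n % p == 0:
            return False
    return True


print([n for n in range(2, 30) if is_prime(n)])
```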

Exercise 2.4.13. [Informative] Prove that there are infinitely many primes of the form 4n−1.

Definition 2.4.14. [Least common multiple] Let a, b ∈ Z. Then the least common multiple
of a and b, denoted lcm(a, b), is the smallest positive integer that is a multiple of both a and b.

Theorem 2.4.15. Let a, b ∈ N. Then, gcd(a, b) · lcm(a, b) = ab. Thus, lcm(a, b) = ab if and
only if gcd(a, b) = 1.

Proof. Let d = gcd(a, b). Then d = as + bt, for some s, t ∈ Z, a = a1 d, b = b1 d, for some
a1 , b1 ∈ N. We need to show that lcm(a, b) = a1 b1 d = ab1 = a1 b, which is clearly a multiple of
both a and b. Let c ∈ N be any common multiple of a and b. To show, a1 b1 d divides c. Note
that
\[
\frac{c}{a_1 b_1 d} = \frac{cd}{(a_1 d) \cdot (b_1 d)} = \frac{c(as + bt)}{ab} = \frac{c}{b}\, s + \frac{c}{a}\, t \in \mathbb{Z}
\]
as c/a, c/b ∈ Z and s, t ∈ Z. Thus, a1 b1 d divides c and hence a1 b1 d = lcm(a, b) is indeed the
smallest. Thus, the required result follows.
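
The identity gives a practical way to compute the least common multiple from the gcd. A minimal sketch, for positive integers only, using `math.gcd` from the Python standard library; the helper `lcm_by_search` is ours and is there only to compare against the definition.

```python
from math import gcd


def lcm(a, b):
    """lcm(a, b) = ab / gcd(a, b) for positive integers a and b."""
    return a // gcd(a, b) * b


def lcm_by_search(a, b):
    """The definition: the smallest positive multiple of a that b divides."""
    m = a
    while m % b != 0:
        m += a
    return m


print(lcm(12, 18), lcm_by_search(12, 18))
```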

Definition 2.4.16. [Modular Arithmetic] Fix a positive integer n. Then, ‘an integer a is said
to be congruent to an integer b modulo n’, denoted a ≡ b (mod n), if n divides a − b.
Example 2.4.17. 1. It can be easily verified that any two even (odd) integers are equivalent
modulo 2 as 2 | 2(l − m) = 2l − 2m (2 | 2(l − m) = ((2l + 1) − (2m + 1))).
2. The numbers ±10 and 22 are equivalent modulo 4 as 4 | 12 = 22 − 10 and 4 | 32 = 22 − (−10).
3. Let n be a fixed positive integer and let S = {0, 1, 2, . . . , n − 1}.
(a) Then, by division algorithm, for any a ∈ Z there exists a unique b ∈ S such that
a ≡ b (mod n). The number b is called the residue of a modulo n.
(b) Thus, the set of integers, Z = ∪_{a=0}^{n−1} {a + kn : k ∈ Z}, i.e., every integer is congruent to
an element of S. The set S is taken as the standard representative for the set of
residue classes modulo n.

Theorem 2.4.18. Let n be a positive integer. Then, the following results hold.
1. Let a ≡ b (mod n) and b ≡ c (mod n), for some a, b, c ∈ Z. Then, a ≡ c (mod n).
2. Let a ≡ b (mod n), for some a, b ∈ Z. Then, a ± c ≡ b ± c (mod n) and ac ≡ bc (mod n),
for all c ∈ Z.
3. Let a ≡ b (mod n) and c ≡ d (mod n), for some a, b, c, d ∈ Z. Then, a ± c ≡ b ± d (mod n)
and ac ≡ bd (mod n). In particular, a^m ≡ b^m (mod n), for all m ∈ N.
4. Let ac ≡ bc (mod n), for some non-zero a, b, c ∈ Z. Then, a ≡ b (mod n), whenever
gcd(c, n) = 1. In general, a ≡ b (mod n/gcd(c, n)).
Proof. We will only prove two parts. The readers should supply the proof of other parts.
Part 3: Note that ac − bd = ac − bc + bc − bd = c(a − b) + b(c − d). Thus, n | ac − bd, whenever
n | a − b and n | c − d.
In particular, taking c = a and d = b and repeatedly applying the above result, one has
a^m ≡ b^m (mod n), for all m ∈ N.

Part 4: Let gcd(c, n) = d. Then, there exist non-zero c1 , n1 ∈ Z with c = c1 d, n = n1 d and gcd(c1 , n1 ) = 1. Thus,
n | ac − bc means that n1 d | c1 d(a − b). This, in turn implies that n1 | c1 (a − b). Hence, by
Corollary 2.4.9, we get n/gcd(c, n) = n1 | a − b.

Before coming to the next result, we look at the following examples.

Example 2.4.19. 1. Note that 3 · 9 + 13 · (−2) ≡ 1 (mod 13). So, the system 9x ≡ 4
(mod 13) has the solution

x ≡ x · 1 ≡ x · (3 · 9 + 13 · (−2)) ≡ 3 · 9x ≡ 3 · 4 ≡ 12 (mod 13).

2. Verify that 9 · (−5) + 23 · (2) = 1. Hence, the system 9x ≡ 1 (mod 23) has the solution

x ≡ x · 1 ≡ x (9 · (−5) + 23 · (2)) ≡ (−5) · (9x) ≡ −5 ≡ 18 (mod 23).

3. The system 3x ≡ 15 (mod 30) has solutions x = 5, 15, 25, whereas the system 7x ≡ 15 (mod 30) has
only the solution x = 15. Also, verify that the system 3x ≡ 5 (mod 30) has no solution.

Theorem 2.4.20. [Linear Congruence] Let n be a positive integer and let a and b be non-zero
integers. Then, the system ax ≡ b (mod n) has at least one solution if and only if gcd(a, n) | b.
Moreover, if d = gcd(a, n) then ax ≡ b (mod n) has exactly d solutions in {0, 1, 2, . . . , n − 1}.

Proof. Let x0 be a solution of ax ≡ b (mod n). Then, by definition, ax0 − b = nq, for some
q ∈ Z. Thus, b = ax0 − nq. But, gcd(a, n) | a, n and hence gcd(a, n) | ax0 − nq = b.
Suppose d = gcd(a, n) | b. Then, b = b1 d, for some b1 ∈ Z. Also, by Euclidean algorithm,
there exists x0 , y0 ∈ Z such that ax0 + ny0 = d. Hence,

a(x0 b1 ) ≡ b1 (ax0 ) ≡ b1 (ax0 + ny0 ) ≡ b1 d ≡ b (mod n).

This completes the proof of the first part.

To proceed further, assume that x1 , x2 are two solutions. Then, ax1 ≡ ax2 (mod n) and
hence, by Theorem 2.4.18.4, x1 ≡ x2 (mod n/d). Thus, we can find x2 ∈ {0, 1, . . . , n/d − 1} such that
x = x2 + k(n/d) is a solution of ax ≡ b (mod n), for 0 ≤ k ≤ d − 1. Verify that these x’s are distinct
and lie between 0 and n − 1. Hence, the required result follows.
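
The proof translates into a short procedure for listing all solutions in {0, 1, . . . , n − 1}. The sketch below assumes Python 3.8 or later, where the built-in `pow(a, -1, m)` returns the inverse of a modulo m; the function name is ours.

```python
from math import gcd


def solve_linear_congruence(a, b, n):
    """Return all x in {0, ..., n-1} with a*x ≡ b (mod n), in increasing order."""
    d = gcd(a, n)
    if b % d != 0:
        return []                          # no solution unless gcd(a, n) divides b
    a1, b1, n1 = a // d, b // d, n // d
    # Solve a1*x ≡ b1 (mod n1); a1 is invertible modulo n1 as gcd(a1, n1) = 1.
    x0 = 0 if n1 == 1 else (pow(a1, -1, n1) * b1) % n1
    return [x0 + k * n1 for k in range(d)]


print(solve_linear_congruence(3, 15, 30))
print(solve_linear_congruence(3, 5, 30))
print(solve_linear_congruence(9, 4, 13))
```

The three calls reproduce the systems of Example 2.4.19: three solutions, no solution and a single solution, respectively.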

Exercise 2.4.21. 1. Prove Theorem 2.4.18.

2. Determine the solutions of the system 3x ≡ 5 (mod 65).
3. Determine the solutions of the system 5x ≡ 95 (mod 100).
4. Prove that the system 3x ≡ 4 (mod 28) is equivalent to the system x ≡ 20 (mod 28).
5. Prove that the pair of systems 3x ≡ 4 (mod 28) and 4x ≡ 2 (mod 27) is equivalent to
the pair x ≡ 20 (mod 28) and x ≡ 14 (mod 27). Hence, prove that the above system is
equivalent to solving either 20 + 28k ≡ 14 (mod 27) or 14 + 27k ≡ 20 (mod 28) for the
unknown quantity k. Thus, verify that k = 21 is the solution for the first case and k = 22
for the other. Hence x = 20 + 28 · 21 = 608 = 14 + 22 · 27 is a solution of the above pair.

6. Let p be a prime. Then, prove that p divides $\binom{p}{k} = \frac{p!}{k!(p-k)!}$, for 1 ≤ k ≤ p − 1.
7. [Informative] Let p be a prime. Then, the set
(a) Zp = {0, 1, 2, . . . , p − 1} has the following properties:
i. for every a, b ∈ Zp , a + b (mod p) ∈ Zp .
ii. for every a, b ∈ Zp , a + b = b + a (mod p).
iii. for every a, b, c ∈ Zp , a + (b + c) ≡ (a + b) + c (mod p).
iv. for every a ∈ Zp , a + 0 ≡ a (mod p).
v. for every a ∈ Zp , a + (p − a) ≡ 0 (mod p).
(b) Z∗p = {1, 2, . . . , p − 1} has the following properties:
i. for every a, b ∈ Z∗p , a · b (mod p) ∈ Z∗p .
ii. for every a, b ∈ Z∗p , a · b ≡ b · a (mod p).
iii. for every a, b, c ∈ Z∗p , a · (b · c) ≡ (a · b) · c (mod p).
iv. for every a ∈ Z∗p , a · 1 ≡ a (mod p).
v. for every a ∈ Z∗p , there exists b ∈ Z∗p such that a · b ≡ 1 (mod p). To see this, note that gcd(a, p) = 1. Hence,
by Euclid’s algorithm, there exist x, y ∈ Z such that ax + py = 1. Define b ≡ x
(mod p). Then,

a · b ≡ a · x ≡ a · x + p · y ≡ 1 (mod p).


In algebra, any set, say F, in which ‘addition’ and ‘multiplication’ can be defined
in such a way that the above properties are satisfied is called a field. So,
Zp = {0, 1, 2, . . . , p − 1} is an example of a field. In general, the well known examples
of fields are:
i. Q, the set of rational numbers.
ii. R, the set of real numbers.
iii. C, the set of complex numbers.
(c) From now on let p be an odd prime.
i. Consider the equation x² ≡ 1 (mod p). Since p is a prime, the only solutions in Zp
are x = 1, p − 1.
ii. Then, for a ∈ {2, 3, . . . , p − 2}, the number b ∈ Z∗p that satisfies a · b ≡ 1 (mod p)
also satisfies b ∈ {2, 3, . . . , p − 2} and b ≠ a.
iii. Thus, for 1 ≤ i ≤ (p − 3)/2, we have pairs {ai , bi } that are pairwise disjoint and satisfy
ai · bi ≡ 1 (mod p). Moreover, ∪_{i=1}^{(p−3)/2} {ai , bi } = {2, 3, . . . , p − 2}.
iv. Hence, 2 · 3 · · · · · (p − 2) ≡ 1 (mod p).
v. We thus have the following famous theorem called the Wilson’s Theorem: Let
p be a prime. Then (p − 1)! ≡ −1 (mod p). Proof. The case p = 2 is immediate. For an odd prime p, from the previous
step, we have

(p − 1)! ≡ 1 · (p − 1) · 2 · 3 · · · · · (p − 2) ≡ −1 · 1 ≡ −1 (mod p).

vi. (Primality Testing) Let n ≥ 2 be an integer. Then, (n − 1)! ≡ −1 (mod n) if
and only if n is a prime.
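
The last statement can be tried out directly, although it is far too slow to be a practical primality test. A sketch, with a function name of our own choosing, which reduces modulo n after every multiplication so that the numbers stay small:

```python
def wilson_is_prime(n):
    """Check whether (n - 1)! ≡ -1 (mod n), for an integer n >= 2."""
    if n < 2:
        return False
    factorial_mod_n = 1
    for k in range(2, n):
        factorial_mod_n = (factorial_mod_n * k) % n
    return factorial_mod_n == n - 1        # n - 1 represents -1 modulo n


print([n for n in range(2, 40) if wilson_is_prime(n)])
```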

Theorem 2.4.22. [Chinese remainder theorem] Fix a positive integer m and let n1 , n2 , . . . , nm
be pairwise co-prime positive integers. Then, the linear system

x ≡ a1 (mod n1 )
x ≡ a2 (mod n2 )
..
.
x ≡ am (mod nm )

has a unique solution modulo M = n1 n2 · · · nm .

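Proof. For 1 ≤ k ≤ m, define Mk = M/nk , where M = n1 n2 · · · nm . Then, gcd(Mk , nk ) = 1 and
hence there exist integers xk , yk such that Mk xk + nk yk = 1 for 1 ≤ k ≤ m. Then

Mk xk ≡ 1 (mod nk ) and Mk xk ≡ 0 (mod nℓ ) for ℓ ≠ k.

Define x0 = Σ_{k=1}^{m} Mk xk ak . Then, it can be easily verified that x0 satisfies the required
congruence relations. For uniqueness, if x and x′ both satisfy the system then nk divides x − x′
for each k and, as the nk ’s are pairwise co-prime, M divides x − x′ .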
2.4. INTEGERS AND MODULAR ARITHMETIC 59

Example 2.4.23. Let us come back to Exercise 2.4.21.5. In this case, M = 28 · 27 = 756, M1 =
27 and M2 = 28. Therefore, x1 = −1 and x2 = 1. Thus,

x0 = 27 · (−1) · 20 + 28 · 1 · 14 ≡ −540 + 392 ≡ −148 ≡ 608 (mod 756).
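
The construction in the proof translates directly into a short program. The sketch below is illustrative only: the function name `crt` is ours, and it assumes Python 3.8 or later for `math.prod` and for `pow` with exponent −1. The system x ≡ 20 (mod 28), x ≡ 14 (mod 27) is the one suggested by the numbers in the example above; the original exercise is not restated in this section, so treat it as an assumption.

```python
from math import prod


def crt(residues, moduli):
    """Solve x = a_k (mod n_k) for pairwise co-prime moduli n_k."""
    M = prod(moduli)
    x0 = 0
    for a_k, n_k in zip(residues, moduli):
        M_k = M // n_k
        x_k = pow(M_k, -1, n_k)  # inverse of M_k modulo n_k
        x0 += M_k * x_k * a_k
    return x0 % M


print(crt([20, 14], [28, 27]))
```

The inverse returned by `pow` may differ from the x1 = −1 used in the example by a multiple of n1, but the final answer modulo M is the same.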

Exercise 2.4.24. 1. Find the smallest positive integer which when divided by 4 leaves a
remainder 1 and when divided by 9 leaves a remainder 2.
2. Find the smallest positive integer which when divided by 8 leaves a remainder 4 and when
divided by 15 leaves a remainder 10.
3. Does there exist a positive integer n such that

n≡4 (mod 14), n ≡ 6 (mod 18)?

Give reasons for your answer. What if we replace 6 or 4 with an odd number?
4. [Informative] Let n be a positive integer. Then, the set
(a) Zn = {0, 1, 2, . . . , n − 1} has the following properties:
i. for every a, b ∈ Zn , a + b (mod n) ∈ Zn .
ii. for every a, b ∈ Zn , a + b ≡ b + a (mod n).
iii. for every a, b, c ∈ Zn , a + (b + c) ≡ (a + b) + c (mod n).
iv. for every a ∈ Zn , a + 0 ≡ a (mod n).
v. for every a ∈ Zn , a + (n − a) ≡ 0 (mod n).
vi. for every a, b ∈ Zn , a · b (mod n) ∈ Zn .
vii. for every a, b ∈ Zn , a · b ≡ b · a (mod n).
viii. for every a, b, c ∈ Zn , a · (b · c) ≡ (a · b) · c (mod n).
ix. for every a ∈ Zn , a · 1 ≡ a (mod n).
In algebra, a set R on which ‘addition’ and ‘multiplication’ can be defined in
such a way that the above properties are satisfied, together with the distributive
law, is called a commutative ring with unity. So, Zn = {0, 1, 2, . . . , n − 1} is an
example of a commutative ring with unity. Other well known examples of commutative
rings with unity are:
i. Z, the set of integers.
ii. Q, the set of rational numbers.
iii. R, the set of real numbers.
iv. C, the set of complex numbers.
(b) Now, let m and n be two co-prime positive integers. Then, by the above, the sets
Zm , Zn , and Zmn are commutative rings with unity. In the following, we show that
there is a one-to-one correspondence (ring isomorphism) between Zm × Zn and Zmn .
To do so, define

f : Zmn → Zm × Zn by f (x) = (x (mod m), x (mod n)) for all x ∈ Zmn .

Then, defining ‘addition’ and ‘multiplication’ in Zm × Zn component-wise and using

Theorem 2.4.18, we have the following:
i. f (x + y) = f (x) + f (y), for all x, y ∈ Zmn .
ii. f (x · y) = f (x) · f (y), for all x, y ∈ Zmn .

iii. for every (a, b) ∈ Zm × Zn , by CRT, there exists a unique x ∈ Zmn such that

x ≡ a (mod m) and x ≡ b (mod n).

iv. also, |Zm × Zn | = |Zmn | = mn.

Hence, we have obtained the required one-one correspondence, commonly known as
the ring isomorphism. That is, the two rings Zm × Zn and Zmn are isomorphic.
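
For small m and n the claims above can be verified by brute force. The following sketch (the function name `crt_map_properties` is ours) checks whether f(x) = (x mod m, x mod n) is a bijection and whether it respects component-wise addition and multiplication. It is a sanity check, not a proof.

```python
def crt_map_properties(m, n):
    """Return (bijective, additive, multiplicative) for f on Z_mn."""
    N = m * n

    def f(x):
        return (x % m, x % n)

    def add(u, v):  # component-wise addition in Z_m x Z_n
        return ((u[0] + v[0]) % m, (u[1] + v[1]) % n)

    def mul(u, v):  # component-wise multiplication in Z_m x Z_n
        return ((u[0] * v[0]) % m, (u[1] * v[1]) % n)

    elements = range(N)
    bijective = len({f(x) for x in elements}) == N
    additive = all(f((x + y) % N) == add(f(x), f(y))
                   for x in elements for y in elements)
    multiplicative = all(f((x * y) % N) == mul(f(x), f(y))
                         for x in elements for y in elements)
    return bijective, additive, multiplicative


print(crt_map_properties(4, 9))   # co-prime moduli
print(crt_map_properties(4, 6))   # moduli with a common factor
```

When m and n are not co-prime the map still respects both operations, but it fails to be one-one, which is exactly where the co-primality hypothesis is used.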

2.5 Construction of Integers and Rationals∗

This section contains two subsections. In the first subsection, we construct integers from natural
numbers and prove a few properties, such as addition, multiplication and subtraction. The
second subsection generalizes the ideas in the first subsection to construct rationals and then
study a few properties of rationals.

2.5.1 Construction of Integers

To start with let X = N × N. We define a relation ‘∼’ on X by

(a, b) ∼ (c, d) if a + d = b + c for all a, b, c, d ∈ N.

Then, verify that ∼ is indeed an equivalence relation on X. Let Z denote the collection of
all equivalence classes under this relation. So, if [x], [y] ∈ Z then [x] is an equivalence class
containing x = (x1 , x2 ), for some x1 , x2 ∈ N and [y] is an equivalence class containing y =
(y1 , y2 ), for some y1 , y2 ∈ N. Now, using the successor function S defined in Axiom P2, observe
that Z consists of all equivalence classes of the form

1. [(1, 1)] = {(n, n) : for all n ∈ N},

2. for a fixed element m ∈ N, [(1, S(m))] = {(n, m + n) : for all n ∈ N}, and

3. for a fixed element m ∈ N, [(S(m), 1)] = {(m + n, n) : for all n ∈ N}.

Definition 2.5.1. [Addition in Z] Let [x] = [(x1 , x2 )], [y] = [(y1 , y2 )] ∈ Z, for some x1 , x2 , y1 , y2 ∈
N. Then, one defines addition in Z, denoted by ⊕, as

[x] ⊕ [y] = [(x1 , x2 )] ⊕ [(y1 , y2 )] = [(x1 + y1 , x2 + y2 )]. (2.3)

Note that we have defined a map ⊕ : Z × Z → Z which takes two non-empty sets, say
[(x1 , x2 )] and [(y1 , y2 )], and gives the set [(x1 + y1 , x2 + y2 )], namely “the addition of the two”, as the
image. Thus, we need to verify that different representatives of the same pair of classes in the
domain give rise to the same set in the range. This process of defining a map using representatives and
then verifying that the image is independent of the representatives chosen is characterized by
saying that “the map is well-defined”. So, let us now prove that ⊕ is well-defined.

Lemma 2.5.2. The map ⊕ defined in Equation (2.3) is well-defined.

2.5. CONSTRUCTION OF INTEGERS AND RATIONALS∗ 61

Proof. Let [(u1 , u2 )] = [(v1 , v2 )] and [(x1 , x2 )] = [(y1 , y2 )] be two equivalence classes in Z. Then,
by definition

[(u1 , u2 )] ⊕ [(x1 , x2 )] = [(u1 + x1 , u2 + x2 )] and [(v1 , v2 )] ⊕ [(y1 , y2 )] = [(v1 + y1 , v2 + y2 )].

For well-definedness, we need to show that [(u1 +x1 , u2 +x2 )] = [(v1 +y1 , v2 +y2 )]. Or equivalently,
we need to show that u1 + x1 + v2 + y2 = u2 + x2 + v1 + y1 .
But, the equality of the equivalence classes [(u1 , u2 )] = [(v1 , v2 )] and [(x1 , x2 )] = [(y1 , y2 )]
implies
u1 + v2 = u2 + v1 and x1 + y2 = x2 + y1 .

Thus, adding the two and using the commutativity of addition in N, we get u1 + x1 + v2 + y2 =
u2 + x2 + v1 + y1 . Thus, the required result follows.
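
The well-definedness just proved can also be illustrated numerically. In the sketch below (all names are ours) an integer is modelled by a pair of natural numbers, and the sum is checked to land in the same class for several choices of representatives. A finite check of this kind only illustrates the lemma; it does not replace the proof.

```python
def equivalent(p, q):
    """(a, b) ~ (c, d) if a + d == b + c."""
    (a, b), (c, d) = p, q
    return a + d == b + c


def add(p, q):
    """Component-wise addition of representatives, as in Equation (2.3)."""
    return (p[0] + q[0], p[1] + q[1])


def representatives(p, count=6):
    """A few other pairs in the class of p: (a + k, b + k) ~ (a, b)."""
    return [(p[0] + k, p[1] + k) for k in range(count)]


def addition_is_well_defined(p, q):
    expected = add(p, q)
    return all(equivalent(add(u, x), expected)
               for u in representatives(p)
               for x in representatives(q))


print(addition_is_well_defined((1, 4), (5, 2)))
```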

In this particular case, one can also check the following statements to verify the well-definedness
of ⊕, i.e., one needs to show that for all `, m, n, r ∈ N the following statements hold.
1. [(1, 1)] ⊕ [(n, n)] = [(m, m)] ⊕ [(n, n)].
2. [(1, S(m))] ⊕ [(1, S(`))] = [(r, m + r)] ⊕ [(n, ` + n)].
3. [(S(m), 1)] ⊕ [(S(`), 1)] = [(m + r, r)] ⊕ [(` + n, n)].
4. [(1, 1)] ⊕ [(1, S(m))] = [(n, n)] ⊕ [(r, m + r)] = [(1, 1)] ⊕ [(r, m + r)] = [(r, r)] ⊕ [(1, S(m))].
5. [(1, 1)] ⊕ [(S(m), 1)] = [(n, n)] ⊕ [(m + r, r)] = [(1, 1)] ⊕ [(m + r, r)] = [(r, r)] ⊕ [(S(m), 1)].
6. [(1, S(m))] ⊕ [(S(`), 1)] = [(r, m + r)] ⊕ [(` + n, n)] and so on.

We give the argument for the fourth statement. The readers are supposed to provide arguments
for other statements.
1. [(1, 1)] ⊕ [(1, S(m))] = [(2, S(m) + 1)] = [(n + r, m + n + r)] as using commutativity and
associativity of addition of natural numbers, one has

2 + m + n + r = m + 2 + n + r = (m + 1) + 1 + n + r = S(m) + 1 + n + r.

Hence, [(1, 1)] ⊕ [(1, S(m))] = [(n + r, m + n + r)] = [(n, n)] ⊕ [(r, m + r)].
2. [(1, 1)] ⊕ [(r, m + r)] = [(r + 1, m + r + 1)] = [(r + 1, r + S(m))] = [(r, r)] ⊕ [(1, S(m))].

On similar lines, we now define multiplication among elements of Z.

Definition 2.5.3. [Multiplication of Integers] Let [x] = [(x1 , x2 )], [y] = [(y1 , y2 )] ∈ Z, for
some x1 , x2 , y1 , y2 ∈ N. Then, one defines multiplication in Z, denoted by ⊙, as

[x] ⊙ [y] = [(x1 , x2 )] ⊙ [(y1 , y2 )] = [(x1 y1 + x2 y2 , x1 y2 + x2 y1 )]. (2.4)

Since we are talking about multiplication between two sets using their representatives, we need
to verify that the multiplication is indeed well-defined. So, the readers are required to prove
“well-definedness” of multiplication. The readers can now prove all the properties of addition
and multiplication in Z by using the corresponding properties of natural numbers.

Exercise 2.5.4. Let [x], [y], [z] ∈ Z and let us denote [0] = [(1, 1)]. Then, prove that
1. [Associativity of addition] ([x] ⊕ [y]) ⊕ [z] = [x] ⊕ ([y] ⊕ [z]).
2. [Commutativity of addition] [x] ⊕ [y] = [y] ⊕ [x].
3. [Existence of the zero element] [x] ⊕ [0] = [x].
4. [Cancellation property holds] If [x] ⊕ [y] = [x] ⊕ [z] then [y] = [z]. This implies that the
zero element is unique.
5. [Existence of additive inverse] for every [x] = [(x1 , x2 )], the equivalence class [(x2 , x1 )],
denoted by −[x], satisfies [x] ⊕ (−[x]) = [0]. Now, use the cancellation property in Z to
show that the additive inverse is unique.
6. [Distributive laws] ([x] ⊕ [y]) ⊙ [z] = ([x] ⊙ [z]) ⊕ ([y] ⊙ [z]).
7. [Associativity of multiplication] ([x] ⊙ [y]) ⊙ [z] = [x] ⊙ ([y] ⊙ [z]).
8. [Commutativity of multiplication] [x] ⊙ [y] = [y] ⊙ [x].
9. [Existence of the identity element] [x] ⊙ [1] = [x], where [1] = [(S(1), 1)].
10. [Cancellation property holds] If [x] ⊙ [y] = [x] ⊙ [z] with [x] ≠ [0] then [y] = [z]. This
implies that the identity element is unique.
11. [x] ⊙ [0] = [0].

As a last property, we show that a copy of N naturally sits inside Z.

Lemma 2.5.5. Consider the map f : N → Z defined by f (n) = [(S(n), 1)], for all n ∈ N. Then,
for all a, b ∈ N
1. f is one-one,
2. f (a + b) = f (a) ⊕ f (b), and
3. f (a · b) = f (a) ⊙ f (b).

Proof. Part 1.: Suppose f (a) = f (b) for some a, b ∈ N. Then, by definition, [(S(a), 1)] =
[(S(b), 1)], or equivalently, S(a) + 1 = S(b) + 1. Now, use cancellation in N to get S(a) = S(b).
Thus, a = b as S is an injective map.
Part 2.: By definition, f (a + b) = [(S(a + b), 1)] and

f (a) ⊕ f (b) = [(S(a), 1)] ⊕ [(S(b), 1)] = [(S(a) + S(b), 1 + 1)] = [(S(a) + b + 1, 1 + 1)]
= [(S(a + b) + 1, 1 + 1)] = [(S(a + b), 1)] = f (a + b).

Part 3.: By definition, f (a · b) = [(S(a · b), 1)] and

f (a) ⊙ f (b) = [(S(a), 1)] ⊙ [(S(b), 1)] = [(S(a) · S(b) + 1 · 1, S(a) · 1 + 1 · S(b))]
= [(S(a) · S(b) + 1, S(a) + S(b))] = [(S(a · b), 1)] = f (a · b)

as S(a)·S(b)+1+1 = S(a)·b+S(a)·1+1+1 = a·b+1·b+S(a)+1+1 = S(a·b)+S(b)+S(a).

Thus, we have indeed shown that N sits inside Z as f (N) and that the addition and multiplication
operations are preserved by f (the map f commutes with the addition operation and the
multiplication operation). So, from now on, the symbols + and · will be used for addition and
multiplication in integers. Further, as n ∈ N is identified with f (n) = [(S(n), 1)], we would like
to associate the symbol ‘−’ as n = S(n) − 1 and −n = 1 − S(n). We proceed to do this in the
next few paragraphs.

Definition 2.5.6. [Order in Z] Let [x] = [(x1 , x2 )], [y] = [(y1 , y2 )] ∈ Z, for some x1 , x2 , y1 , y2 ∈
N. Then, the order in Z is defined by saying that [x] < [y] if x1 + y2 < y1 + x2 . Further,
[x] ≤ [y] if either [x] = [y] or [x] < [y].

We again need to check for well-definedness. So, let [(u1 , u2 )] = [(v1 , v2 )] and [(x1 , x2 )] =
[(y1 , y2 )] be two equivalence classes in Z with [(u1 , u2 )] < [(x1 , x2 )]. We need to show that
[(v1 , v2 )] < [(y1 , y2 )]. Or equivalently, v1 + y2 < y1 + v2 whenever u1 + v2 = v1 + u2 , x1 + y2 = y1 + x2
and u1 + x2 < x1 + u2 . Thus,

v1 + y2 + x1 + u2 = v1 + u2 + x1 + y2 = u1 + v2 + y1 + x2 = y1 + v2 + u1 + x2
< y1 + v2 + x1 + u2 .

Hence, by the order property in N (see Exercise 2.1.10), v1 + y2 < y1 + v2 . Thus, the above
definition is well-defined. At this stage, one would like to verify that the function f defined in
Lemma 2.5.5 preserves the order as well.

Lemma 2.5.7. Consider the map f : N → Z defined by f (n) = [(S(n), 1)], for all n ∈ N. Then,
for all a, b ∈ N, a < b if and only if f (a) < f (b).

Proof. Using Exercise 2.1.10 a < b if and only if a + 1 + 1 < b + 1 + 1, or equivalently, a < b if and
only if S(a) + 1 < S(b) + 1. Thus, a < b if and only if f (a) = [(S(a), 1)] < [(S(b), 1)] = f (b).

Definition 2.5.8. [Positive elements in Z] Let [x] = [(x1 , x2 )] ∈ Z. Then, [x] is said to be
positive if [0] < [x] and is said to be non-negative if [0] ≤ [x]. In general, we write [x] > [0]
to mean [x] is positive and [x] ≥ [0] for [x] being non-negative.

Lemma 2.5.9. Let [x] = [(x1 , x2 )] ∈ Z. Then, [x] > [0] if and only if x1 > x2 .

Proof. By definition, [(x1 , x2 )] > [0] = [(1, 1)] if and only if x1 + 1 > x2 + 1. Or equivalently,
using Exercise 2.1.10, one obtains [(x1 , x2 )] > [(1, 1)] if and only if x1 > x2 .

Exercise 2.5.10. 1. Prove the following results for any [x] ∈ Z.

(a) [x] > [0] if and only if [x] = [(S(n), 1)] = f (n), for some n ∈ N.
(b) [x] > [0] if and only if −[x] < [0].

2. [y] > [z], for some [y], [z] ∈ Z if and only if [y] + [x] > [z] + [x].
3. If [y] > [z], for some [y], [z] ∈ Z then [y] · [x] > [z] · [x], whenever [x] > [0].

Thus, Z = N ∪ {0} ∪ (−N) and hence from now on, in place of using equivalence class to
represent the elements of Z, we will just use natural numbers, their negatives and the zero
element to represent Z, the set of integers. Thus, whenever we define functions or operations on
Z then we don’t have to worry about well-definedness. Let us now discuss the “absolute value
function”, namely the modulus function.

Definition 2.5.11. A function g : Z → N ∪ {0} is called the absolute value (or modulus) function if

1. g(n) = n if n ≥ 0,
2. g(n) = −n, if n < 0.

This function is denoted by | · |. Thus, |m| = m if m ≥ 0, and |m| = −m if m < 0. Further, by
Exercise 2.5.10.1, observe that |m| ≥ 0 for all m ∈ Z.

For a better understanding of this function, we prove the following two results.

Lemma 2.5.12. For any x ∈ Z, −|x| ≤ x ≤ |x|. Further, if x ≥ 0 and −x ≤ y ≤ x for some
y ∈ Z then |y| ≤ x.

Proof. Let x ≥ 0. Then, by definition |x| = x and hence x ≤ |x|. As |x| = x, the other inequality
−|x| ≤ x reduces to −x ≤ x. Or equivalently, we need to show that 0 = x + (−x) ≤ x + x = 2x,
which is indeed true. If x < 0 then we see that |x| > 0 > x and hence x ≤ |x|. Note that the
condition −|x| ≤ x is equivalent to the condition |x| + x ≥ 0 (use Exercise 2.5.10.2) which is
indeed true as by definition x + |x| = x + (−x) = 0.
For the second part, we again consider two cases, namely, y ≥ 0 and y < 0. If y ≥ 0 then
|y| = y and hence the condition y ≤ x implies |y| ≤ x. If y < 0 then |y| = −y. Further,
using Exercise 2.5.10.2, the condition −x ≤ y is equivalent to the condition 0 ≤ y + x which in
turn is equivalent to −y ≤ x. Hence |y| = −y ≤ x. Thus, the required result follows.

As a direct application of Lemma 2.5.12, one obtains the triangle inequality.

Lemma 2.5.13. [Triangle inequality in Z] Let x, y ∈ Z. Then |x + y| ≤ |x| + |y|.

Proof. Using Lemma 2.5.12, one has −|x| ≤ x ≤ |x| and −|y| ≤ y ≤ |y|. Hence,

−|x| + (−|y|) ≤ x + y ≤ |x| + |y|.

Now, use the associativity and commutativity of addition to get

0 = −|x| + (−|y|) + |x| + |y| = −(|x| + |y|) + (|x| + |y|)

and hence the uniqueness of the additive inverse implies −|x| + (−|y|) = −(|x| + |y|). Thus, the
required result follows from the second part of Lemma 2.5.12.

This finishes most of the results on the basic operations related with integers.

2.5.2 Construction of Rational Numbers

In this subsection, we will describe the construction of rational numbers and prove a few prop-
erties, such as addition, multiplication, subtraction and division by non-zero element.
So, let us start with denoting Z∗ = Z \ {0} and defining an equivalence relation on X = Z × Z∗
and then doing everything afresh as was done for the set of integers. Define a relation ‘∼’ on X
by
(a, b) ∼ (c, d) if a · d = b · c for all a, c ∈ Z, b, d ∈ Z∗ .

Then, verify that ∼ is indeed an equivalence relation on X. Let Q denote the collection of all
equivalence classes under this relation. This set is called the “set of rational numbers”. In this
set, we define addition and multiplication, using the addition and multiplication in Z, as follows:

1. [Addition in Q] Let [x] = [(x1 , x2 )], [y] = [(y1 , y2 )] ∈ Q. Then, one defines addition in
Q, denoted by ⊕, as

[x] ⊕ [y] = [(x1 , x2 )] ⊕ [(y1 , y2 )] = [(x1 · y2 + x2 · y1 , x2 · y2 )].

2. [Multiplication in Q] Let [x] = [(x1 , x2 )], [y] = [(y1 , y2 )] ∈ Q. Then, one defines
multiplication in Q, denoted by ⊙, as

[x] ⊙ [y] = [(x1 , x2 )] ⊙ [(y1 , y2 )] = [(x1 · y1 , x2 · y2 )].

The readers are advised to verify the well-definedness of the above operations in Q. Further,
if we define the map f : Z → Q by f (a) = [(a, 1)] then it can be easily verified that the
map f is one-one and it preserves addition and multiplication. Thus, Z sits inside Q as
f (Z). So, again one replaces the symbols ‘⊕’ and ‘⊙’ by ‘+’ and ‘·’. Sometimes, even ‘·’ is not
used for multiplication. We also note that the element 0 ∈ Z corresponds to [(0, 1)] = [(0, x)],
for all x ∈ Z∗ . Hence, if [(x1 , x2 )] ∈ Q with [(x1 , x2 )] ≠ 0 then x1 ≠ 0.
Thus, verify that for each [(x1 , x2 )] ∈ Q with x1 ≠ 0, the element [(x2 , x1 )] ∈ Q satisfies
[(x1 , x2 )] · [(x2 , x1 )] = 1. As the next operation, one defines division in Q as follows.

Definition 2.5.14. [Division in Q] Let [x] = [(x1 , x2 )], [y] = [(y1 , y2 )] ∈ Q with y1 ≠ 0. Then,
one defines division in Q, denoted by /, as

[x]/[y] = [(x1 , x2 )]/[(y1 , y2 )] = [(x1 y2 , x2 y1 )].

Note that x2 y1 ∈ Z∗ as x2 , y1 ≠ 0.

The readers are advised to verify the well-definedness of division defined above. Before
proceeding further with other important properties of rational numbers, the readers should verify
all the properties related with addition, subtraction, multiplication and division by a non-zero
element. The next result, even though it doesn’t seem important, helps us to define order in
rational numbers.

Lemma 2.5.15. [Representation of an element of Q] Let [x] ∈ Q. Then [x] = [(y1 , y2 )], for
some y1 , y2 ∈ Z such that y2 > 0.

Proof. Let [x] = [(x1 , x2 )], for some x1 , x2 ∈ Z. If x2 > 0, we are done. Else, using Ex-
ercise 2.5.10.1, we know that −x2 > 0. Then, by the definition of equivalence class [x] =
[(x1 , x2 )] = [(−x1 , −x2 )]. Hence, the required result follows.

So, now we proceed with the definition of order in Q.

Definition 2.5.16. [Order in Q] Let [x] = [(x1 , x2 )], [y] = [(y1 , y2 )] ∈ Q, for some x1 , x2 , y1 , y2 ∈
Z with x2 , y2 > 0. Then the order in Q is defined by [x] > [y] if x1 y2 > x2 y1 .

We again need to verify the well-definedness of order in Q. Also, as before, [x] ≥ [y] means
either [x] = [y] or [x] > [y]. As a final result of this section, we prove the following result.

Lemma 2.5.17. [Existence of a rational between two rationals] Let [x], [y] ∈ Q with [x] < [y].
Then, there exists a rational number [z] such that [x] < [z] < [y].

Proof. Let [x] = [(x1 , x2 )] and [y] = [(y1 , y2 )], for some x1 , x2 , y1 , y2 ∈ Z with x2 , y2 > 0. Since
[x] < [y], we have x1 y2 < x2 y1 and hence 2x1 y2 < x1 y2 + x2 y1 < 2x2 y1 . Further, 2x2 y2 > 0 and hence
let us take [z] = [(x1 y2 + x2 y1 , 2x2 y2 )]. Then, it can be easily verified that [x] < [z] < [y] as
x2 , y2 > 0 and the cancellation property with respect to multiplication holds in Z.
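
The element [z] in the proof is the average of [x] and [y]. A small sketch (names are ours) with rationals stored as pairs (numerator, denominator) having positive denominator, and with the order tested by cross-multiplication as in Definition 2.5.16:

```python
def less(x, y):
    """[x] < [y] for pairs with positive second entry: x1*y2 < x2*y1."""
    return x[0] * y[1] < x[1] * y[0]


def between(x, y):
    """The representative (x1*y2 + x2*y1, 2*x2*y2) used in the proof."""
    return (x[0] * y[1] + x[1] * y[0], 2 * x[1] * y[1])


x, y = (1, 3), (1, 2)
z = between(x, y)
print(z, less(x, z), less(z, y))
```

For x = (1, 3) and y = (1, 2) this gives z = (5, 12), that is, 1/3 < 5/12 < 1/2.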

Chapter 3

Partial Orders, Lattices and Boolean Algebra

3.1 Partial Orders

Let X be a non-empty set and let f be a relation on X. Then, recall from Definition 1.2.32 that
f is anti-symmetric if (x, y) ∈ f and x ≠ y implies (y, x) ∉ f . That is, both (x, y) and (y, x)
cannot be in f , whenever x and y are distinct.

Definition 3.1.1. [Partial order] Let X be a non-empty set. A relation f on X is called a
partial order if f is reflexive, transitive and anti-symmetric. Further, two elements, namely
a, b ∈ X, are said to be comparable if either (a, b) ∈ f or (b, a) ∈ f .

Example 3.1.2. 1. Let X = {1, 2, 3, 4, 5}.

(a) The identity relation Id is reflexive, transitive and anti-symmetric. So, it is a partial
order. But, no two distinct elements of X are comparable.
(b) The relation Id ∪ {(1, 2)} is also a partial order. Here 1 and 2 are comparable.
(c) The relation Id ∪ {(1, 2), (2, 1)} is reflexive and transitive. But it is not anti-symmetric,
as (1, 2) and (2, 1) are both in the given relation.
(d) The relation Id ∪ {(1, 2), (3, 4)} is also a partial order. Here, 1, 2 are comparable and
3, 4 are comparable.

2. Let X = N. Then f = {(a, b) : a divides b} is a partial order.

3. Let X be a nonempty collection of sets. Then f = {(A, B) | A, B ∈ X, A ⊆ B} is a partial
order on X.
4. On R the set f = {(a, b) : a − b ≤ 0} is a partial order. It is called the usual partial
order on R. List 5 elements of f . Usual partial order on a subset of R is defined similarly.
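
For a finite set X, the three defining properties can be tested directly. The sketch below (the function name `is_partial_order` is ours) stores a relation as a set of ordered pairs and checks the finite examples above.

```python
def is_partial_order(X, f):
    """Check that the relation f (a set of pairs) is a partial order on X."""
    reflexive = all((x, x) in f for x in X)
    antisymmetric = all((y, x) not in f for (x, y) in f if x != y)
    transitive = all((x, w) in f
                     for (x, y) in f for (z, w) in f if y == z)
    return reflexive and antisymmetric and transitive


X = {1, 2, 3, 4, 5}
Id = {(x, x) for x in X}
print(is_partial_order(X, Id))                      # Example 1(a)
print(is_partial_order(X, Id | {(1, 2)}))           # Example 1(b)
print(is_partial_order(X, Id | {(1, 2), (2, 1)}))   # Example 1(c)
print(is_partial_order(X, Id | {(1, 2), (3, 4)}))   # Example 1(d)
```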

Exercise 3.1.3. Give a partial order on {1, 2, 3, 4, 5} with the

1. maximum number of elements in it.
Ans: {(1, 1), (2, 2), (3, 3), (4, 4), (5, 5), (1, 2), (1, 3), (1, 4), (1, 5), (2, 3), (2, 4), (2, 5),
(3, 4), (3, 5), (4, 5)}

68 CHAPTER 3. PARTIAL ORDERS, LATTICES AND BOOLEAN ALGEBRA

2. minimum number of elements in it.

Ans: {(1, 1), (2, 2), (3, 3), (4, 4), (5, 5)}

Definition 3.1.4. Let X be a non-empty set.

1. [Partially ordered set (poset)] The tuple (X, f ) is called a partially ordered set (in
short, poset) if f is a partial order on X. It is common to use ≤ instead of f . We say
x ≤ y to mean that (x, y) ∈ f or x and y are related or x and y are comparable. We say
x < y to mean that x ≤ y and x 6= y.
2. [Linear/Total/Complete order] A partial order f on X is called a linear/complete/total
order if either (x, y) ∈ f or (y, x) ∈ f , for each pair x, y ∈ X, i.e., each pair of elements
of X are comparable.
3. [Linearly ordered set] The poset (X, f ) is said to be a linearly ordered set if f is a
linear order on X. You may imagine the elements of a linearly ordered set as points on a
line.
4. [Chain and its height] A linearly ordered subset of a poset is called a chain. The
maximum size of a chain is called the height of a poset.
5. [Anti-chain and its width] Let (X, f ) be a poset and A ⊆ X. Suppose that no two
elements in A are comparable. Then A is called an anti-chain. The maximum size of an
anti-chain is called the width of the poset.
6. [Strictly ordered set] The tuple (X, f ) is said to be a (strictly) ordered set if f is
anti-symmetric and transitive.

Example 3.1.5. 1. The poset in Example 3.1.2.1a has height 1 (resp. chain is {1}) and
width 5 (respectively, anti-chain is {1, 2, 3, 4, 5}).
2. The poset in Example 3.1.2.1b has height 2 (resp. chain is {1, 2}) and width 4 (resp.
anti-chain is {2, 3, 4, 5} or {1, 3, 4, 5}).
3. The poset in Example 3.1.2.1d has height 2 (resp. chain is {1, 2} or {3, 4}) and width 3
(resp. anti-chain is {1, 3, 5}). Find other anti-chains.
4. The set N with the usual order is a linearly ordered set.
5. If (X, f ) is a nonempty linearly ordered set, then the height of X is |X| and the width of
X is 1.
6. The set N with a ≤ b if a divides b, is not linearly ordered. However, the set {1, 2, 4, 8, 16}
is a chain. This is just a completely ordered subset of the poset. There are larger chains,
for example, {2^k | k = 0, 1, 2, . . .}. This poset has height |N| and width |N|.
7. The poset (P({1, 2, 3, 4, 5}), ⊆) is not linearly ordered. However, {∅, {1, 2}, {1, 2, 3, 4, 5}} is
a chain in it. So, is {∅, {2}, {2, 3}, {2, 3, 4}, {2, 3, 4, 5}, {1, 2, 3, 4, 5}}. Its height is 6. What
is its width?
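
For very small posets, height and width can be found by exhaustive search over subsets. The sketch below (names are ours) does this for the posets of Example 3.1.2.1. The search examines every subset, so it is only feasible for a handful of elements.

```python
from itertools import combinations


def comparable(f, a, b):
    return (a, b) in f or (b, a) in f


def height_and_width(X, f):
    """Largest chain size and largest anti-chain size, by brute force."""
    elements = list(X)
    height = width = 0
    for r in range(1, len(elements) + 1):
        for subset in combinations(elements, r):
            pairs = list(combinations(subset, 2))
            if all(comparable(f, a, b) for a, b in pairs):
                height = max(height, r)   # subset is a chain
            if all(not comparable(f, a, b) for a, b in pairs):
                width = max(width, r)     # subset is an anti-chain
    return height, width


X = {1, 2, 3, 4, 5}
Id = {(x, x) for x in X}
print(height_and_width(X, Id))
print(height_and_width(X, Id | {(1, 2)}))
print(height_and_width(X, Id | {(1, 2), (3, 4)}))
```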

Definition 3.1.6. [Lexicographic/Dictionary ordering] Let (Σ, ≤) be a nonempty finite
linearly ordered set (like the English alphabet with a ≤ b ≤ c ≤ · · · ≤ z) and Σ∗ be the
collection of all words formed using the elements of Σ. For a ≡ a1 a2 · · · an , b ≡ b1 b2 · · · bm ∈ Σ∗ ,
for some m, n ∈ N, define a ≤ b if
3.1. PARTIAL ORDERS 69

(a) a1 < b1 or

(b) ai = bi for i = 1, . . . , k for some k < min{m, n} and ak+1 < bk+1 or

(c) ai = bi for i = 1, . . . , n = min{m, n}.

Then (Σ∗ , ≤) is a linearly ordered set. This ordering is called the lexicographic or dictionary
ordering. Sometimes Σ is called the ‘alphabet set’ and Σ∗ is called the ‘dictionary’.
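
The three conditions (a), (b) and (c) amount to: compare the words at the first position where they differ and, if there is no such position, the shorter word comes first. A minimal sketch (the name `lex_leq` is ours), compared against Python's built-in string comparison, which on lowercase words agrees with this dictionary order:

```python
def lex_leq(a, b):
    """Return True if word a <= word b in the dictionary order."""
    for x, y in zip(a, b):
        if x != y:
            return x < y          # conditions (a) and (b)
    return len(a) <= len(b)       # condition (c): a is a prefix of b


words = ["a", "ab", "abc", "b", "ba", "cab", "cat", "car"]
print(sorted(words))
print(all(lex_leq(a, b) == (a <= b) for a in words for b in words))
```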

Exercise 3.1.7. Let D1 be the dictionary of words made from a, b, c and D2 be the dictionary
of words made from a, b, d. Are these two sets equivalent?

Discussion 3.1.8. [Directed graph representation of a finite poset] Often we represent

a nonempty finite poset (X, ≤) by a picture. The process is described below.
(a) Put a dot/node for each element of X and label it.
(b) If a ≤ b, then join the dot/node for a and the dot/node for b by an arrow (a directed
line).
(c) Put a loop at the dot/node of a, for each a ∈ X.

1. A directed graph representation of A = {1, 2, 3, 9, 18} with the ‘divides’ relation (a ≤ b if
a | b) is given below.

[Figure: directed graph on the nodes 1, 2, 3, 9 and 18]

Definition 3.1.9. [Hasse diagram] The Hasse diagram of a nonempty finite poset (X, ≤) is
a picture drawn in the following way.
1. Each element of X is represented by a point and is labeled with the element.

2. If a < b then the point representing a must appear at a lower height than the point
representing b and further the two points are joined by a line.

3. If a < b and b < c then the line between a and c is removed.

Later, we shall show that for every nonempty finite poset (X, ≤), a Hasse diagram can be
drawn.

Example 3.1.10. Hasse diagram for A = {1, 2, 3, 9, 18} with the ‘divides’ relation.

[Figure: Hasse diagram of A under the ‘divides’ relation]
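
The lines that survive in a Hasse diagram are exactly the pairs a < b with no element strictly between them. The sketch below (the name `hasse_edges` is ours) computes these pairs for A = {1, 2, 3, 9, 18} under the ‘divides’ relation. It describes which points are joined, not how the picture is laid out.

```python
def hasse_edges(X, leq):
    """Pairs (a, b) with a < b and no c strictly between a and b."""
    edges = []
    for a in X:
        for b in X:
            if a != b and leq(a, b):
                has_middle = any(c != a and c != b and leq(a, c) and leq(c, b)
                                 for c in X)
                if not has_middle:
                    edges.append((a, b))
    return sorted(edges)


A = [1, 2, 3, 9, 18]
divides = lambda a, b: b % a == 0
print(hasse_edges(A, divides))
```

For this poset the surviving lines join 1 to 2, 1 to 3, 2 to 18, 3 to 9 and 9 to 18.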

Exercise 3.1.11. Draw the Hasse diagram for {1, 2, 3} × {1, 2, 3, 4} under lexicographic order.

Proposition 3.1.12. Let F be a nonempty family of single valued relations such that, for all
f, g ∈ F, either f ⊆ g or g ⊆ f , that is, F is linearly ordered. Let h = ∪_{f∈F} f . Then the
following are true.

1. h is single valued.

2. dom(h) = ∪_{f∈F} dom(f ).

3. rng(h) = ∪_{f∈F} rng(f ).

4. If every element of F is one-one (from its domain to its range) then h is also one-one.

Proof. We shall only prove the first two items.

1. Let x ∈ dom(h) and suppose (x, y), (x, z) ∈ h with y ≠ z. Then there are f, g ∈ F, such
that (x, y) ∈ f and (x, z) ∈ g. As F is a chain, either f ⊆ g or g ⊆ f , say f ⊆ g. Then
(x, y), (x, z) ∈ g and so g is not single valued, a contradiction.

2. Note that x ∈ dom(h) means (x, y) ∈ h for some y. This means (x, y) ∈ f for some f ∈ F.
That is, x ∈ dom(f ), for some f ∈ F. This means x ∈ ∪_{f∈F} dom(f ). The reverse inclusion
follows by reading these steps backwards.

Definition 3.1.13. 1. [Bounds] Let (X, f ) be a poset and A ⊆ X. We say x ∈ X is an
upper bound of A if for each z ∈ A, (z, x) ∈ f . In words, it means ‘each element of A is
≤ x’. The term lower bound is defined analogously.

2. [Maximal] An element x ∈ A is a maximal element of A if, whenever z ∈ A satisfies
(x, z) ∈ f , we have x = z. In other words, ‘no element in A is strictly larger
than x’. The term minimal is defined analogously.

3. [Maximum] An element x ∈ A is called the maximum of A if x is an upper bound
of A. In other words, it is ‘an upper bound of A which is contained in A’. Such an
element, when it exists, is unique. The term minimum is defined analogously.

4. [Least upper bound] An element x ∈ X is called the least upper bound (lub) of A if
x is an upper bound of A and for each upper bound y of A, we have (x, y) ∈ f . In other
words, ‘x is the minimum/least of the set of all upper bounds of A’. The term greatest
lower bound (glb) is defined analogously.

Example 3.1.14. Consider the two posets described by the following picture.

[Figure 3.1: Posets X and Y. In X = {a, b, c}, the element a lies below b and c. In
Y = {a, b, c, d}, the element a lies below b and c, and both b and c lie below d.]

1. Consider the poset X = {a, b, c} in Figure 3.1. If A = X then

(a) the maximal elements of A are b and c,
(b) the only minimal element of A is a,
(c) a is the lower bound of A in X,
(d) A has no upper bound in X,
(e) A has no maximum element,
(f) a is the minimum element of A,
(g) no element of X is the lub of A and
(h) a is the glb of A in X.

2. Consider the posets in Figure 3.1. Then, the following table illustrates different definitions.
Note that X = {a, b, c} and Y = {a, b, c, d}.

                               A = {b, c} ⊆ X    A = {a, c} ⊆ X    A = {b, c} ⊆ Y
Maximal element(s) of A        b, c              c                 b, c
Minimal element(s) of A        b, c              a                 b, c
Lower bound(s) of A in X/Y     a                 a                 a
Upper bound(s) of A in X/Y     doesn’t exist     c                 d
Maximum element of A           doesn’t exist     c                 doesn’t exist
Minimum element of A           doesn’t exist     a                 doesn’t exist
lub of A in X/Y                doesn’t exist     c                 d
glb of A in X/Y                a                 a                 a
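
The entries of the table can be recomputed from the definitions. In the sketch below (all names are ours) each poset is given as a set of pairs (x, y) meaning x ≤ y. The order relations for X and Y are read off from Figure 3.1 as described above, which is our reading of the picture.

```python
def upper_bounds(X, leq, A):
    return {x for x in X if all((z, x) in leq for z in A)}


def lower_bounds(X, leq, A):
    return {x for x in X if all((x, z) in leq for z in A)}


def maximal_elements(leq, A):
    return {x for x in A if all(z == x for z in A if (x, z) in leq)}


def minimal_elements(leq, A):
    return {x for x in A if all(z == x for z in A if (z, x) in leq)}


def lub(X, leq, A):
    ub = upper_bounds(X, leq, A)
    least = [x for x in ub if all((x, y) in leq for y in ub)]
    return least[0] if least else None  # None means 'does not exist'


def glb(X, leq, A):
    lb = lower_bounds(X, leq, A)
    greatest = [x for x in lb if all((y, x) in leq for y in lb)]
    return greatest[0] if greatest else None


X = {"a", "b", "c"}
leq_X = {(x, x) for x in X} | {("a", "b"), ("a", "c")}
Y = {"a", "b", "c", "d"}
leq_Y = {(y, y) for y in Y} | {("a", "b"), ("a", "c"), ("a", "d"),
                               ("b", "d"), ("c", "d")}

print(upper_bounds(X, leq_X, {"b", "c"}), lub(X, leq_X, {"b", "c"}))
print(upper_bounds(Y, leq_Y, {"b", "c"}), lub(Y, leq_Y, {"b", "c"}))
```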

Exercise 3.1.15. Determine the maximal elements, minimal elements, lower bounds, upper
bounds, maximum, minimum, lub and glb of A in the following posets (X, f ).
1. Take X = Z with usual order and A = Z.
Ans: We have no maximal element, no minimal element, no lower bounds and no upper
bounds.
2. Take X = N, f = {(i, i) : i ∈ N} and A = {4, 5, 6, 7}.
Ans: We have no upper bounds, no lower bounds. Each of 4, 5, 6, 7 is a maximal element
(and also a minimal element) of A. There is no maximum and no minimum.

Discussion 3.1.16. [Bounds of empty set] Let (X, f ) be a nonempty poset. Then each x ∈ X
is an upper bound for ∅ as well as a lower bound for ∅. So, an lub for ∅ may or may not exist.

For example, if X = {1, 2, 3} and f is the usual order, then lub ∅ = 1. Whereas, if X = Z and f
is the usual order, then an lub for ∅ does not exist. Similar statements hold for glb.
Definition 3.1.17. [Well order] A linear order f on X is said to be a well order if each
nonempty subset A of X has a minimal element (in A). We call (X, f ) a well ordered set to
mean that f is a well order on X. Note that ‘a minimal element’, if it exists, is ‘a minimum’ in
this case.
Example 3.1.18.
1. The set Z with usual ordering is not well ordered, as {−1, −2, . . .} is a nonempty subset
with no minimal element.
2. The ordering 0 ≤ 1 ≤ −1 ≤ 2 ≤ −2 ≤ 3 ≤ −3 ≤ · · · describes a well order on Z.
3. The set N with the usual ordering is well ordered.
4. The set R with the usual ordering is not well ordered as the set (0, 1) doesn’t have its
minimal element in (0, 1).
Exercise 3.1.19. Consider the dictionary order on N². Show that this is a well order.
Ans: For ∅ ≠ K ⊆ N², let m be the smallest element of K1 = {x | (x, y) ∈ K}. Let n be the
smallest element of K2 = {y | (m, y) ∈ K}. Consider (m, n).
Definition 3.1.20. [Initial segment] Let (W, ≤) be well ordered and a ∈ W . The initial
segment of a is defined as I(a) := {x | x ∈ W, x < a}.

Example 3.1.21. Take N with the usual order. Then I(5) = {1, 2, 3, 4} and I(1) = ∅.

Theorem 3.1.22. [Principle of transfinite induction] Let (W, ≤) be a nonempty well ordered
set. Let A ⊆ W which satisfies ‘whenever I(w) ⊆ A then w ∈ A’. Then A = W .
Proof. If A ≠ W , then Ac ≠ ∅, where Ac = W \ A. As W is well ordered, let s be the minimal
element of Ac . So, any element x < s is in A. That is, I(s) ⊆ A. By the hypothesis s ∈ A, a
contradiction.

Fact 3.1.23. The principle of transfinite induction is the principle of mathematical induction
when W = N.
Proof. To see this, let p(n) be a statement which needs to be proved by mathematical induction.
Put A = {n ∈ N | p(n) is true}. Assume that we have been able to show that ‘I(n) ⊆ A ⇒ n ∈
A’. It means, we have shown that 1 ∈ A, as ∅ = I(1) ⊆ A. Also we have shown that for n ≥ 2,
if {p(1), . . . , p(n − 1)} are true then p(n) is true as well, as I(n) = {1, 2, . . . , n − 1}.

Definition 3.1.24. [Product of sets] Recall that the product A1 × A2 = {(x1 , x2 ) | xi ∈ Ai }
may be written as

{(f (1), f (2)) | f : {1, 2} → A1 ∪ A2 is a function with f (1) ∈ A1 , f (2) ∈ A2 }.

Moreover, if A1 and A2 are finite sets then |A1 × A2 | = |A1 | · |A2 |. In general, we define the
product of the sets in {Aα }α∈L , L ≠ ∅, as

∏_{α∈L} Aα = {f | f : L → ∪_{α∈L} Aα is a function with f (α) ∈ Aα , for each α ∈ L}.

Example 3.1.25. 1. Take L = N and An = {0, 1} for each n ∈ N. Then ∏_{α∈L} Aα is the class
of functions f : L → {0, 1}. That is, it is the class of all 0-1-sequences.
2. By definition, the product of a class of sets, one of which is ∅, is empty.

What about product of a class of sets in which no set is empty? Is it nonempty? This could
not be proved using the standard set theory. In fact, it is now proved that this question cannot
be answered using the standard set theory. So, a new axiom, called the axiom of choice, was
introduced.

Axiom 3.1.26. [Axiom of Choice] The product of a nonempty class of nonempty sets is
nonempty.

Proposition 3.1.27. [Injection-Surjection] Let A and B be nonempty sets. Then, there is a
surjection g : A → B if and only if there is an injection f : B → A.

Proof. Let g : A → B be onto. We shall find an injection from B to A. To start with, notice that
for each b ∈ B, the set g^{-1}(b) ≠ ∅. Then, by the axiom of choice, ∏_{b∈B} g^{-1}(b) ≠ ∅. Let f ∈ ∏_{b∈B} g^{-1}(b).
Then, by Definition 3.1.24, f : B → A is a function. As g is a function, the sets g^{-1}(b) are disjoint and
hence f is one-one.
Conversely, let f : B → A be one-one. Fix an element b ∈ B. Define g : A → B as

g(x) = f^{-1}(x), if x ∈ f(B),
g(x) = b, if x ∈ A \ f(B).

Observe that g is onto.
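
On finite sets both directions of the proof can be carried out explicitly, and no axiom is needed because a choice can be made by a rule (here, the smallest preimage). The sketch below is an illustration under that assumption; the function names and the sample map a ↦ a mod 3 are hypothetical.

```python
def injection_from_surjection(g, A, B):
    """Given a surjection g: A -> B on finite sets, pick one preimage for each b."""
    f = {}
    for b in B:
        fibre = [a for a in A if g(a) == b]  # g^{-1}(b), nonempty since g is onto
        f[b] = min(fibre)                    # an explicit choice rule
    return f

def surjection_from_injection(f, A, default):
    """Given an injection f (a dict B -> A), send f(b) back to b and the rest to `default`."""
    inverse = {a: b for b, a in f.items()}
    return {a: inverse.get(a, default) for a in A}

A = range(10)
B = range(3)
g = lambda a: a % 3                          # a sample surjection A -> B

f = injection_from_surjection(g, A, B)       # injection B -> A
g2 = surjection_from_injection(f, A, default=0)  # surjection A -> B rebuilt from f
```

Since the fibres g^{-1}(b) are disjoint, the chosen values f[b] are distinct, which is exactly the one-one argument in the proof.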

Definition 3.1.28. [Family of finite character] A class F of sets is called a family of finite
character if it satisfies: ‘A ∈ F if and only if each finite subset of A is also in F ’.
Example 3.1.29. 1. { } is a family of finite character.
2. Power sets are families of finite character.
3. {∅, {1}, {2}} is a family of finite character.
4. If A ∩ B = ∅, then P(A) ∪ P(B) is a family of finite character.
5. The set {∅} ∪ {{a} | a ≠ 0, a ∈ R} is a family of finite character. This is the class of
linearly independent sets in R.
6. Let V be a non trivial vector space and F be the class of linearly independent subsets of
V. Then F is a family of finite character.
Exercise 3.1.30. 1. Let L = A1 = A2 = A3 = {1, 2, 3}. Is the set ∏_{α∈L} Aα equal to the class
of functions f : {1, 2, 3} → {1, 2, 3}? Give reasons for your answer.
2. Give sets An, n ∈ N such that ∏_{n∈N} An has 6 elements. Give another.¹

¹ When we ask for more than one example, we encourage the reader to get examples of different types, if
possible.
74 CHAPTER 3. PARTIAL ORDERS, LATTICES AND BOOLEAN ALGEBRA

Some statements equivalent to the axiom of choice

[Axiom of choice] Cartesian product of a nonempty collection of nonempty sets is
nonempty.
[Zorn’s lemma] A nonempty partially ordered set in which every chain has an upper bound has
a maximal element.
[Zermelo’s well ordering principle] Every set can be well ordered.
[Hausdorff ’s maximality principle] Every nonempty partially ordered set contains a
maximal chain.
[Tukey’s lemma] Every nonempty family of finite character has a maximal element.

Exercise 3.1.31. 1. Does there exist a poset with exactly 5 maximal chains of size (number
of elements in it) 2, 3, 4, 5, 6, respectively and 2 maximal elements? If yes, draw the Hasse
diagram. If no, argue it.

Ans: Yes.


2. Let (X, f) be a nonempty poset and ∅ ≠ Y ⊆ X. Define fY = {(a, b) ∈ f | a, b ∈ Y}.
Show that fY is a partial order on Y. This is the induced partial order on Y.

3. Apply induction to show that a nonempty finite poset has a maximal element and a minimal
element.

Discussion 3.1.32. [Drawing the Hasse diagram of a finite poset (X, f )] Let x1 , . . . , xk be
the minimal elements of X. Draw k points on the same horizontal line and label them x1 , . . . , xk .
Now consider Y = X \ {x1 , . . . , xk } and fY . By induction, the picture of (Y, fY ) can be drawn.
Put it above those k dots. Let y1 , . . . , ym be the minimal elements of Y . Now, draw the lines
(xi , yj ) if (xi , yj ) ∈ f . This is the Hasse diagram of (X, f ).

Discussion 3.1.33. [Existence of Hamel basis] Let V be a vector space with at least two
elements. Recall that the collection F of linearly independent subsets of V is a family of finite
character. Recall that a basis or a Hamel basis is a maximal linearly independent subset of V.
As V has at least 2 elements, it has a nonzero element, say a. Then {a} ∈ F. Hence, F ≠ ∅.
Thus, by Tukey’s lemma, the set F has a maximal element. This maximal set is the required
basis. Hence, we have proved that every vector space with at least 2 elements has a Hamel basis.

Exercise 3.1.34. 1. Let n ∈ N. Define Pn = {k ∈ N | k divides n}. Define a relation ≤n
on Pn as ≤n = {(a, b) | a divides b}. Show that (Pn, ≤n) is a poset, for each n ∈ N. Give
a necessary and sufficient condition on n so that (Pn, ≤n) is a completely ordered set.
Ans: If p|n, q|n are distinct primes, then neither p|q nor q|p. So, if Pn is a chain then n is a
power of a prime. Conversely, if n = p^m for a prime p and m ≥ 0, then Pn = {1, p, . . . , p^m} is a chain.
2. Take X = {(1, 1), (1, 2), (1, 3), . . .} ∪ {(2, 1), (3, 1), (4, 1), . . .}. The ordering defined is

f = {((1, m), (1, n)) | m, n ∈ N, m ≤ n} ∪ {((m, 1), (n, 1)) | m, n ∈ N, m ≤ n}.

Does X have any maximal or minimal elements? Is X linearly ordered? Is it true that
every nonempty set has a minimal element? Is it true that every nonempty set has a
minimum? What type of nonempty sets always have a minimum?
Ans: Notice that with this ordering X has a minimal element (1, 1) which is also a lower
bound for X. Hence, every subset of X has at least one lower bound, namely (1, 1). The set X
is not a linearly ordered set. Every nonempty subset A of X has at least one minimal element.
Three types of nonempty sets have the minimum: sets which contain (1, 1); sets which contain
points only of the form (1, m); and sets which contain points only of the form (m, 1).
3. Prove or disprove:
(a) There are at least 5 functions f : R → R which are partial orders.
Ans: No, Id is the only one.

(b) Let S be the set of sequences (xn), with xn ∈ {0, 1, . . . , 9}, for each n ∈ N, such that
‘if xk < xk+1, then xk+1 = xk+2 = · · ·’. Then S is countable.
Ans: Yes. If xk < xk+1 for some k, then (xn) is eventually constant by the given condition.
If xk < xk+1 does not happen for any k, then (xn) is a nonincreasing sequence
made with 0, 1, . . . , 9. Hence, it has to be eventually constant. So, S is a subclass
of the class of eventually constant sequences made with 0, 1, . . . , 9. The latter class is
countable.
(c) Take N with usual order. Then the dictionary order on N2 is a well order.
Ans: Yes.
(d) Let S be the set of all non-increasing sequences made with natural numbers. Then S
is countable.
Ans: Yes. The set Sn of all such sequences made with elements from {1, 2, . . . , n} is
countable. Then S = ∪_{n∈N} Sn.
(e) Let S be the set of all nondecreasing sequences made with natural numbers. Then S
is countable.
Ans: No. Corresponding to a sequence (dn) of 0 and 1, we can create a unique nondecreasing
sequence (xn) as xn = 1 + d1 + · · · + dn. Thus, the number of such sequences
is at least the number of 0-1-sequences. The latter set is uncountable.
(f) Take N with usual order and N² with the dictionary order. Then any nonempty subset
of N² which is bounded above has a lub.
Ans: Yes. As N² with the dictionary order is well ordered, the nonempty set of upper bounds
has a minimum, and this minimum is the lub. The lub need not belong to the set: the set
{(1, n) : n ∈ N} has no maximum and its lub is (2, 1).
(g) Every nonempty countable linearly ordered set is well ordered with respect to the same
ordering.
Ans: No. Consider Z with the usual order.
(h) Every nonempty countable chain which is bounded below, in a partially ordered set,
is well ordered with respect to the same ordering.
Ans: No. In R the set of positive rational numbers under usual order is bounded below.
But {1/(2k) : k ∈ N} does not contain a minimal element.
(i) The set Q can be well ordered.
Ans: Yes. If {x1 , x2 , x3 , · · · } is a countable set, define an order as x1 ≤ x2 ≤ x3 ≤ · · · .
This is a well order.
(j) For a fixed n ∈ N, let An and Bn be non-empty sets and let Rn be a one-one relation
from An to Bn. Then, ∩_n Rn is a one-one relation.
Ans: Yes.
(k) Let S be the set of words with length at most 8 using letters from {3, A, a, b, C, c}. We
want to define a lexicographic order on S to make it a dictionary. There are more
than 500 ways to do that.
Ans: Yes. There are 6! ways, as any linear order gives a separate lexicographic order.
(l) An infinite poset in which each nonempty finite set has a minimum, must be linearly
ordered.
Ans: Yes. Consider all two element sets.

(m) A nonempty finite poset in which each nonempty finite set has a minimum, must be
well ordered.
Ans: Yes. It is linearly ordered and a finite chain is well ordered.
(n) An infinite poset in which each nonempty finite set has a minimum, must be well
ordered.
Ans: No. Consider R.

4. Let S = {(x, y) : x² + y² = 1, x ≥ 0}. It is a relation from R to R. Draw a picture of the
inverse of this relation.
Ans: S contains points like (1/2, ±√3/2) and (0, ±1). So, the inverse contains points like
(±√3/2, 1/2) and (±1, 0). The picture is the upper half of the unit circle, that is, the set of
points (x, y) with x² + y² = 1 and y ≥ 0.

5. Construct the Hasse diagram for the ⊆ relation on P({a, b, c}).

Ans:

{a, b, c}

{a, b} {a, c} {b, c}

{a} {b} {c}

∅

Each set is joined by a line to the sets on the next level that contain it.

6. Draw the Hasse diagram for the partial order describing the ‘divides’ relations on the set
{2, 3, 4, 5, 6, 7, 8}.
7. Draw the Hasse diagram of {1, 2, 3, 6, 9, 18} with ‘divides’ relation.
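Ans: The Hasse diagram has the following levels, from top to bottom.

18

9 6

3 2

1

The lines join 1 to 2 and 3; 2 to 6; 3 to 6 and 9; and 6 and 9 to 18.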

(a) What is its height? What is its width?

Ans: Height is 4 and width is 2.
(b) Let A = {2, 3, 6}. What are the maximal elements, minimal elements, maximum,
minimum, lower bounds, upper bounds, glb and lub of A.
Ans: There is only one maximal element of A. It is 6.
Minimal elements of A are 2, 3.
Maximum of A is 6.
Minimum of A does not exist.
There is only one lower bound of A. It is 1.
Upper bounds of A are 6, 18.
The greatest lower bound of A is 1.
The least upper bound of A is 6.
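
The answers above can be cross-checked by brute force. The sketch below (helper names such as covers and upper_bounds are illustrative assumptions, not notation from these notes) computes the lines of the Hasse diagram of {1, 2, 3, 6, 9, 18} under ‘divides’ and the bounds of A = {2, 3, 6}.

```python
def divides(a, b):
    return b % a == 0

def covers(X):
    """Pairs (a, b) with a strictly below b and no element of X strictly between them."""
    return sorted((a, b) for a in X for b in X
                  if a != b and divides(a, b)
                  and not any(c not in (a, b) and divides(a, c) and divides(c, b)
                              for c in X))

def upper_bounds(X, A):
    return sorted(u for u in X if all(divides(a, u) for a in A))

def lower_bounds(X, A):
    return sorted(l for l in X if all(divides(l, a) for a in A))

X = [1, 2, 3, 6, 9, 18]
A = [2, 3, 6]

edges = covers(X)                       # the lines of the Hasse diagram
ub, lb = upper_bounds(X, A), lower_bounds(X, A)
maximal = [a for a in A if not any(a != b and divides(a, b) for b in A)]
minimal = [a for a in A if not any(a != b and divides(b, a) for b in A)]
```

The lub of A is the element of ub that divides every other element of ub, and the glb is the element of lb that every other element of lb divides.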

Exercise 3.1.35. ∗

1. Show that the following three definitions are equivalent.

(a) A set X is finite if either X = ∅ or |X| = |{1, 2, . . . , n}|, for some n ∈ N.
(b) [Tarski] A set X is finite if and only if every nonempty family of subsets of X has
a minimal element.
(c) [Dedekind] A set is infinite if it is equivalent to a proper subset of itself. A set is
finite if it is not infinite.

Ans: We already know the equivalence of items 1a and 1c.

1a ⇒ 1b. Let X ≠ ∅ be finite by 1a. Let ∅ ≠ F ⊆ P(X). Select B ∈ F with the smallest
cardinality (as F has at most 2^n elements). Then B is a minimal element of F.

1b ⇒ 1c. Let X be infinite by 1c. Then |X| = |B|, for some B ⊊ X. Let f : X → B be
a bijection and x ∈ X \ B. Put f^1(x) = f(x) and f^n(x) = f(f^{n−1}(x)) for n ≥ 2. Note
that i ≠ j ⇒ f^i(x) ≠ f^j(x). Put Bk = {f^k(x), f^{k+1}(x), · · · }. Then F = {B1, B2, · · · } is a
nonempty class of subsets of X which does not contain a minimal element (no Bm is minimal
as Bm+1 ⊊ Bm). So, X is infinite by 1b.

2. Let (X, f ) be a nonempty poset. Show that there exists a linear order g on X such that
f ⊆ g.

Ans: Let F be the class of all partial orders h on X such that f ⊆ h. This is nonempty,
as f ∈ F. Note that (F, ⊆) is a nonempty poset. By Hausdorff maximality principle, it has a
maximal chain, say C. Put g = ∪_{h∈C} h.
It is easy to verify that g is a partial order. Suppose that g is not a linear order on X. Then
∃ x, y ∈ X such that neither (x, y) nor (y, x) is in g. (Evidently, x ≠ y.)

Now define Lx = {z : (z, x) ∈ g} and My = {z : (y, z) ∈ g}. Note that if z ∈ Lx ∩ My ,

then (y, x) ∈ g by transitivity. Hence, Lx ∩ My = ∅. Note that x ∈ Lx and y ∈ My .

Put g1 = g ∪ (Lx × My ). This is a partial order. To see that, observe that reflexivity of g1 is
trivial.

Antisymmetry: Let (a, b), (b, a) ∈ g1 . Both of them cannot be in Lx × My , as Lx ∩ My = ∅.

Assume that (a, b) ∈ Lx ×My and (b, a) ∈ g. This means, (a, x) ∈ g, (y, b) ∈ g and (b, a) ∈ g.
But then (y, x) ∈ g, a contradiction. So, both of them are in g and so a = b.

Transitivity: Let (a, b), (b, c) ∈ g1 . Clearly, both of them are not in Lx × My . If both of them
are in g, we have nothing to prove. So, let (a, b) ∈ Lx × My and (b, c) ∈ g. This means,
(a, x) ∈ g, (y, b) ∈ g and (b, c) ∈ g. From the last two, c ∈ My . So, (a, c) ∈ Lx × My ⊆ g1 .
Similar statement holds, if (b, c) ∈ Lx × My and (a, b) ∈ g.

Notice that g1 ∉ C and C ∪ {g1} is a larger chain than C, a contradiction.

3. Let G be a non-Abelian group and H be an Abelian subgroup of G. Show that there is a

maximal Abelian subgroup J of G such that H ⊆ J.

Ans: Let F be the class of Abelian subgroups of G which contain H. Notice that H ∈ F.
So, by Hausdorff’s maximality principle there is a maximal chain C of elements of F. Notice
that H ∈ C, otherwise we could extend C. Put J = ∪_{A∈C} A. It is easy to check that J is an
Abelian subgroup of G. If J0 is any Abelian subgroup that contains J properly, then J0 ∉ C
and J0 ∈ F. Thus, C ∪ {J0} is a larger chain than C, which contradicts the maximality of C.

4. Let F be a family of finite character and B be a chain in F. Show that ∪_{A∈B} A ∈ F.
Ans: Let {p1, . . . , pk} be a finite subset of X := ∪_{A∈B} A. So, there are sets P1, . . . , Pk ∈
B such that pi ∈ Pi. Since B is a chain, one of the Pi’s contains the others, say Pk. So,
{p1, . . . , pk} is a finite subset of Pk. As F is a family of finite character and Pk ∈ F, it follows
that {p1, . . . , pk} ∈ F. Thus, each finite subset of X is in F. As F is a family of finite
character, X ∈ F.
5. Let A ≠ ∅ and F be a field. Let F^A := {f : f is a function from A to F}. Let Γ := {f ∈
F^A : {a ∈ A : f(a) ≠ 0} is finite}. Show that Γ is a vector space over F with respect to
point-wise addition of functions and point-wise scalar multiplication. Also show that every
vector space V is isomorphic to Γ for some suitable choice of A.
Ans: As F^A is a vector space, showing that Γ is a subspace is easy. Let A be a Hamel basis
of V. For b ∈ A, let χb : A → F be the characteristic function of {b}. Now

Γ = {f : f = Σ_{b∈B} αb χb, B ⊆ A, B finite} ≡ {v : v = Σ_{b∈B} αb b, B ⊆ A, B finite} = V,

where αb ∈ F and the last equality follows as each element x ∈ V can be expressed, in a unique
way, as a linear combination of elements of A.
6. Let X be a vector space and A be a nonempty linearly independent subset of X. Let S ⊆ X
satisfy span(S) = X. Show that ∃ a Hamel basis B such that A ⊆ B ⊆ S.
Ans: Let F = {B : B is linearly independent, A ⊆ B ⊆ S}. Notice that F ≠ ∅. It is partially
ordered w.r.t. ⊆. By Hausdorff maximality principle, we have a maximal chain, say C. Consider
the set A0 = ∪_{B∈C} B. It is easy to argue that A0 is linearly independent. If span(A0) ⊊ X,
then, as span(S) = X, we can select x0 ∈ S \ span(A0). Observe that A0 ∪ {x0} is linearly
independent and it is not in C. So, C ∪ {A0 ∪ {x0}} is a larger chain than C, a contradiction.
7. Let (L, ≤) be a nonempty linearly ordered set. Prove that ∃ W ⊆ L such that ≤ well orders
W and such that for each x ∈ L, there is a y ∈ W satisfying x ≤ y. For example, for
L = R, we can take W = N.
Ans: Take a point l ∈ L. Then, {l} is well ordered by ≤. Let X be the class of subsets of L
satisfying that ‘each set in X is well ordered by ≤ with l as its minimum’. Notice that {l} ∈ X.
On X, we define a partial order f as (A, B) ∈ f if A ⊆ B and elements of B \ A are upper
bounds of A.
Then, (X, f ) is a nonempty poset and by Hausdorff maximality principle we have a maximal
chain in X, say C. Clearly, this chain starts with {l}. Put W = ∪_{A∈C} A. Then it is clear that
W ⊆ L.
To show that W is well ordered, let B ⊆ W be a nonempty set. Let b ∈ B. Then there is a
set Cb ∈ C such that b ∈ Cb . Recall that Cb was well ordered. Consider the initial segment
I(b) of b in Cb . Note that (I(b) ∪ {b}) ∩ B is a nonempty subset of Cb , hence has a minimum
in it, say, w. Notice that w ∈ B.
We claim that w is the minimum of B. Assume, if possible that, ∃ y ∈ B such that y < w.
As w ≤ b, we see that y < b. If y ∈ Cb , then y ∈ I(b) and hence y ∈ (I(b) ∪ {b}) ∩ B, which
implies that w ≤ y. So, y ∉ Cb. In that case, y can only belong to a set in C that comes
after Cb (which is a proper superset of Cb ). But then y is an upper bound of Cb , contradicting
y < b.
Thus, W is well ordered and hence in X. Now suppose there is a p ∈ L which is a strict upper
bound of W . Then W0 := W ∪ {p} is well ordered and C ∪ {W0 } is a larger chain than C,
contradicting the maximality of C.
8. Show that R is not a finite dimensional vector space over Q. Hint: Assume that R as
a vector space over Q has dimension k. Argue that R is isomorphic to Qk and so it is
countable, a contradiction.
9. Let A be a nonempty set. Then there is an element a which is not in A.
Ans: We know that |P(A)| > |A|. Hence, there exists a ∈ P(A) \ A. Otherwise, P(A) ⊆ A
which means that |P(A)| ≤ |A|.
10. Let A be a nonempty set. Then there exists B such that A ∩ B = ∅ and |A| = |B|.
Ans: Consider the class F = {X | X ∩ A = ∅, |X| ≤ |A|}. This is a nonempty set as ∅ ∈ F. On
F, take the subset partial order. Then, by Hausdorff maximality principle we have a maximal
chain in F, say L. Put W = ∪_{X∈L} X.
Then it is clear that W ∈ ¸(F ). Let if possible x ∈ W ∩ A. Then, there exists an element, say
X0 in L such that x ∈ X0. A contradiction as X0 ∩ A = ∅. Now, use Proposition 3.1.12 to
conclude that |W| ≤ |A|.
We now show that |W| = |A|. Assume that |W| < |A|. Then take an element x which is not in
A ∪ W. Then W ∪ {x} is in F. So, L ∪ {W ∪ {x}} is a chain that is a super-chain of L. A
contradiction to the maximality of L.
11. Let A and B be two nonempty sets. Show that there is a set C such that C ∩ A = ∅ and
|C| = |B|.
Ans: First obtain a set D which is equivalent to A ∩ B and is disjoint from A (use previous
exercise). Now, put C = (B \ A) ∪ D.
12. Let A and B be nonempty sets. Put a = |A| and b = |B|. Then show that either a ≤ b or
b ≤ a.
Ans: Let F be the class of all one-one functions f for which dom f ⊆ A and rng f ⊆ B.
Since A and B are nonempty sets, we have F is nonempty.
Consider the poset (F, ⊆). By Hausdorff maximality principle, we have a maximal chain C.
Put h = ∪_{f∈C} f.
It is easy to see that h is one-one, dom h = ∪_{f∈C} dom f and rng h = ∪_{f∈C} rng f.

If dom h ⊊ A and rng h ⊊ B, then take x ∈ A \ dom h and y ∈ B \ rng h. Then h0 = h ∪ {(x, y)}
is a one-one function in F and h0 ∉ C. Thus, C ∪ {h0} is a larger chain, a contradiction to
the maximality of C.
So, either dom h = A, in which case we have a ≤ b; or rng h = B, in which case we have
b ≤ a.
3.2. LATTICES 81

13. Let a = |A| and b = |B|, where A ∩ B = ∅. Then we define a + b as |A ∪ B| and ab as |A × B|.

(a) Let a be an infinite cardinal number. Show that a + a = a and aa = a.
(b) Let a, b, c be cardinal numbers. Show that a ≤ b ⇒ {a + c ≤ b + c, ac ≤ bc}.

Ans: (a) We shall show that A ≡ (A × {0}) ∪ (A × {1}) = A × {0, 1}. Let C ⊆ A be a
countably infinite set. We already know that C ≡ C × {0, 1}.
Let F = {f : f is a bijection from S to S × {0, 1}, for some S ⊆ A}. Then (F, ⊆) is a nonempty
poset. Apply Hausdorff maximality principle to get a maximal chain C. Put h = ∪_{f∈C} f. It is
easy to see that h is a bijection from dom h to (dom h) × {0, 1}.
If A \ dom h is finite then |dom h| = a and we are done.
If A \ dom h contains a countably infinite set, say C0, then let f0 be a bijection from C0 to
C0 × {0, 1} and consider h0 = h ∪ f0.
Then h0 ∈ F and h0 ∉ C. Observe that C ∪ {h0} is a larger chain than C, a contradiction.
The proof of the other part is similar.
(b) This part of the proof is routine.
14. Suppose that u ≤ v are two infinite cardinal numbers. Then show that u + v = v and
uv = v.
Ans: Note that v ≤ v + u follows by definition (identity mapping is one-one). Now as u ≤ v,
we have u + v ≤ v + v = v.
Similarly, v ≤ vu follows by definition. As u ≤ v, we have uv ≤ vv = v.

3.2 Lattices
Discussion 3.2.1. In a poset, is it necessary that two elements x, y should have a common
upper bound?
Ans: No. Take {1, 2, . . . , 6} with ‘divides’ partial order. The elements 5 and 3 have no
common upper bound.
In a poset, if a pair {x, y} has at least one upper bound, is it necessary that {x, y} should have
a lub?
Ans: No. Consider the third poset described by its Hasse diagram in Figure 3.2. Then, the
pair {a, b} has c, d as upper bounds, but there is no lub of {a, b}.

[Figure 3.2: Hasse diagrams. From left to right: a distributive lattice; a non-distributive lattice; two posets that are not lattices.]


Definition 3.2.2. [Lattice]

1. A poset (L, ≤) is called a lattice if each pair x, y ∈ L has a lub denoted ‘x ∨ y’ and a glb
denoted ‘x ∧ y’.
2. A lattice is called a distributive lattice if it satisfies the following two distributive laws.
x ∨ (y ∧ z) = (x ∨ y) ∧ (x ∨ z),
x ∧ (y ∨ z) = (x ∧ y) ∨ (x ∧ z).
Example 3.2.3. 1. Let L = {0, 1} ⊆ Z and define a ∨ b = max{a, b} and a ∧ b = min{a, b}.
Then, L is a chain as well as a distributive lattice.
2. The set N with usual order and ∨ := max and ∧ := min is a distributive lattice. We
consider two cases to verify that a ∨ (b ∧ c) = (a ∨ b) ∧ (a ∨ c). The second distributive
identity is left as an exercise for the reader.
(a) Case 1: a ≥ min{b, c}. Then, either a ≥ b or a ≥ c, say a ≥ b. Hence,
a ∨ (b ∧ c) = max{a, min{b, c}}
= a = min{max{a, b} = a, max{a, c} ≥ a} = (a ∨ b) ∧ (a ∨ c).

(b) Case 2: a < min{b, c}. Then, a < b and a < c. Hence,
a ∨ (b ∧ c) = max{a, min{b, c}}
= min{b, c} = min{max{a, b} = b, max{a, c} = c} = (a ∨ b) ∧ (a ∨ c).

3. Prove that the first figure in Figure 3.2 is a distributive lattice.

4. Prove that the second figure in Figure 3.2 is a lattice but not a distributive lattice.
5. Let S = {a, b, c}. On P(S), we define A ∨ B = A ∪ B and A ∧ B = A ∩ B. Then, it can
be easily verified that P(S) is a lattice.
6. Fix a positive integer n and let D(n) denote the set of positive divisors of n, a poset under
the ‘divides’ partial order, with ∨ := lcm and ∧ := gcd. Then, prove that D(n) is a distributive
lattice. For example, for n = 12, 30 and 36, the corresponding lattices are shown below.
Ans: We check one distributive identity below.
I. Let p be a prime such that p^k | lcm{a, gcd{b, c}}. Then, either p^k | a or p^k divides both b and c.
In either case, p^k | lcm{a, b} and p^k | lcm{a, c}. So, p^k | gcd{lcm{a, b}, lcm{a, c}}.
II. Conversely, suppose p^k | gcd{lcm{a, b}, lcm{a, c}}. Then, p^k divides both lcm{a, b} and lcm{a, c}.
Then, either p^k | a or p^k divides both b and c. So, p^k | lcm{a, gcd{b, c}}.

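[Figure: Hasse diagrams of D(12), D(30) and D(36). Levels from top to bottom: D(12): 12; 4, 6; 2, 3; 1. D(30): 30; 6, 10, 15; 2, 3, 5; 1. D(36): 36; 12, 18; 4, 6, 9; 2, 3; 1.]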
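
The distributive laws for D(n) can also be checked exhaustively for small n. This is a brute-force sketch, not a proof; the helper names divisors and is_distributive are illustrative assumptions.

```python
from math import gcd
from itertools import product

def lcm(a, b):
    return a * b // gcd(a, b)

def divisors(n):
    """The underlying set of D(n): all positive divisors of n."""
    return [d for d in range(1, n + 1) if n % d == 0]

def is_distributive(elements, join, meet):
    """Check both distributive laws on every triple (a, b, c)."""
    return all(join(a, meet(b, c)) == meet(join(a, b), join(a, c)) and
               meet(a, join(b, c)) == join(meet(a, b), meet(a, c))
               for a, b, c in product(elements, repeat=3))

# join = lcm and meet = gcd, as in item 6 of Example 3.2.3
results = {n: is_distributive(divisors(n), lcm, gcd) for n in (12, 30, 36)}
```

Every entry of results is True, in agreement with the prime-power argument given in the answer.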

Exercise 3.2.4. 1. Fix a prime p and a positive integer n. Draw the Hasse diagram of
D(p^n). Does this correspond to a chain? Give reasons for your answer.
2. Let n be a positive integer. Then, prove that D(n) is a chain if and only if n = p^m, for
some prime p and a positive integer m.
3. Let (X, f ) be a nonempty chain with ∨ := lub and ∧ := glb. Is it a distributive lattice?
Ans: In a chain each pair of elements are comparable. Thus, {x, y} has a lub, namely
max{x, y} and a glb, namely min{x, y}. Thus, a chain is a lattice.
Suppose that x ∨ y ≤ z. In that case x, y ≤ z and so (x ∨ y) ∧ z = x ∨ y = (x ∧ z) ∨ (y ∧ z).
Suppose that z < x ∨ y and assume, without loss of generality, that x ≤ y. Then z < y, so
(x ∨ y) ∧ z = y ∧ z = z and (x ∧ z) ∨ (y ∧ z) = (x ∧ z) ∨ z = z.
Proof of the other distributive equality is similar.

Proposition 3.2.5. [properties of a lattice] Let (L, ≤) be a lattice. Then, the following
statements are true.

(a) The operations ∨ and ∧ are idempotent, i.e., ‘lub{a, a} = a and glb{a, a} = a’.
(b) ∨ is commutative (so is ∧).
(c) ∨ is associative (so is ∧).
(d) a ∧ (a ∨ b) = a = a ∨ (a ∧ b) [absorption], i.e., ‘glb{a, lub{a, b}} = a = lub{a, glb{a, b}}’.
(e) a ≤ b ⇔ a ∨ b = b ⇔ a ∧ b = a.
(f ) b ≤ c ⇒ {a ∨ b ≤ a ∨ c, a ∧ b ≤ a ∧ c} [isotonicity] .
(f1) {a ≤ b, c ≤ d} ⇒ {a ∨ c ≤ b ∨ d, a ∧ c ≤ b ∧ d}.
(g) a ∨ (b ∧ c) ≤ (a ∨ b) ∧ (a ∨ c), a ∧ (b ∨ c) ≥ (a ∧ b) ∨ (a ∧ c) [distributive inequalities] .
(h) a ≤ c ⇔ a ∨ (b ∧ c) ≤ (a ∨ b) ∧ c [modular inequality] .

Proof. We prove only a few parts. The rest are left for the reader.
(c) Let d = a ∨ (b ∨ c). Then, d is the lub of {a, b ∨ c}. Thus, d is an upper bound of a, b
and c. So, d ≥ a ∨ b and d ≥ c, and hence
d is an upper bound of {a ∨ b, c}. So, d is greater than or equal to the lub of {a ∨ b, c}, i.e.,
d ≥ (a ∨ b) ∨ c. A similar argument gives (a ∨ b) ∨ c ≥ a ∨ (b ∨ c). Thus, the first part of the result follows.
(e) Let a ≤ b. As b is an upper bound of {a, b}, we have a ∨ b = lub{a, b} ≤ b. Also, a ∨ b is an
upper bound of {a, b} and hence a ∨ b ≥ b. So, we get a ∨ b = b. Conversely, let a ∨ b = b.
As a ∨ b is an upper bound of {a, b}, we have a ≤ a ∨ b = b. Thus, the first part of the
result follows.
(f) Let b ≤ c. Note that a ∨ c ≥ a and a ∨ c ≥ c ≥ b. So, a ∨ c is an upper bound for {a, b}.
Thus, a ∨ c ≥ lub{a, b} = a ∨ b and hence the proof of the first part is over.
(f1) Using isotonicity, we have a ∨ c ≤ b ∨ c ≤ b ∨ d. Similarly, using isotonicity again, we have
a ∧ c ≤ b ∧ c ≤ b ∧ d.
(g) Note that a ≤ a ∨ b and a ≤ a ∨ c. Thus, a = a ∧ a ≤ (a ∨ b) ∧ (a ∨ c). As b ≤ a ∨ b and
c ≤ a ∨ c, we get b ∧ c ≤ (a ∨ b) ∧ (a ∨ c). Now using (f1), we obtain the required result,
i.e., a ∨ (b ∧ c) ≤ (a ∨ b) ∧ (a ∨ c).
(h) Let a ≤ c. Then, a ∨ c = c and hence by the ‘distributive inequality’, we have a ∨ (b ∧
c) ≤ (a ∨ b) ∧ (a ∨ c) = (a ∨ b) ∧ c. Conversely, let a ∨ (b ∧ c) ≤ (a ∨ b) ∧ c. Then,
a ≤ a ∨ (b ∧ c) ≤ (a ∨ b) ∧ c ≤ c and the required result follows.

Practice 3.2.6. Show that in a lattice one distributive equality implies the other.

Ans: Suppose that x ∧ (y ∨ z) = (x ∧ y) ∨ (x ∧ z). Then

(x ∨ y) ∧ (x ∨ z) = [(x ∨ y) ∧ x] ∨ [(x ∨ y) ∧ z] (hypothesis)

= x ∨ (x ∧ z) ∨ (y ∧ z) (absorption, hypothesis)
= x ∨ (y ∧ z) (absorption).

Definition 3.2.7. Let (Li, ≤i), i = 1, 2, be lattices with ∨i := lub and ∧i := glb. Then, (L1 × L2, ≤)
is a poset with a = (a1, a2) ≤ (b1, b2) = b if a1 ≤1 b1 and a2 ≤2 b2, that is, if b dominates a
entry-wise. In this case, we see that a ∨ b = (a1 ∨1 b1, a2 ∨2 b2) and a ∧ b = (a1 ∧1 b1, a2 ∧2 b2).
Thus (L1 × L2, ≤) is a lattice, called the direct product of (Li, ≤i), for i = 1, 2.

Example 3.2.8. 1. Consider L = {0, 1} with usual order. The set of all binary strings Ln
of length n is a poset with the order (a1 , . . . , an ) ≤ (b1 , . . . , bn ) if ai ≤ bi , ∀i. This is the
n-fold direct product of L. It is called the lattice of n-tuples of 0 and 1.

2. Consider the lattices {1, 2, 3} and {1, 2, 3, 4} with usual orders. Hasse diagram of the direct
product {1, 2, 3} × {1, 2, 3, 4} is given below.

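[Figure: Hasse diagram of {1, 2, 3} × {1, 2, 3, 4}, with least element (1, 1), greatest element (3, 4), and (1, 4) and (3, 1) at the two side corners.]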

Practice 3.2.9. Consider N with the usual order. The lattice order defined on N2 as a direct
product is different from the lexicographic order on N2 . Draw pictures for all (a, b) ≤ (5, 6) in
both the orders to see the argument.

Proposition 3.2.10. The direct product of two distributive lattices is a distributive lattice.
Proof. The direct product of two lattices is a lattice by definition. Note that

[(a1, b1) ∨ (a2, b2)] ∧ (a3, b3) = (a1 ∨ a2, b1 ∨ b2) ∧ (a3, b3)
= ((a1 ∨ a2) ∧ a3, (b1 ∨ b2) ∧ b3)
= ((a1 ∧ a3) ∨ (a2 ∧ a3), (b1 ∧ b3) ∨ (b2 ∧ b3))
= (a1 ∧ a3, b1 ∧ b3) ∨ (a2 ∧ a3, b2 ∧ b3)
= [(a1, b1) ∧ (a3, b3)] ∨ [(a2, b2) ∧ (a3, b3)].
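
As a concrete instance, the direct product {1, 2, 3} × {1, 2, 3, 4} of Example 3.2.8 can be checked by enumeration. The sketch below assumes the entry-wise order of Definition 3.2.7; the names leq, join, meet and is_lub are illustrative.

```python
from itertools import product

L1, L2 = [1, 2, 3], [1, 2, 3, 4]
P = list(product(L1, L2))            # the underlying set of the direct product

def leq(a, b):
    """Entry-wise order: b dominates a in both coordinates."""
    return a[0] <= b[0] and a[1] <= b[1]

def join(a, b):
    return (max(a[0], b[0]), max(a[1], b[1]))

def meet(a, b):
    return (min(a[0], b[0]), min(a[1], b[1]))

def is_lub(s, a, b):
    """s is an upper bound of {a, b} lying below every other upper bound."""
    uppers = [u for u in P if leq(a, u) and leq(b, u)]
    return s in uppers and all(leq(s, u) for u in uppers)

join_is_lub = all(is_lub(join(a, b), a, b) for a in P for b in P)
distributive = all(meet(join(a, b), c) == join(meet(a, c), meet(b, c))
                   for a in P for b in P for c in P)
incomparable = [(a, b) for a in P for b in P if not leq(a, b) and not leq(b, a)]
```

The list incomparable is nonempty, for example it contains ((1, 4), (3, 1)), so the direct product is not a chain even though both factors are.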

Definition 3.2.11. Let (Li, ≤i), i = 1, 2 be two lattices. A function f : L1 → L2 satisfying
f(a ∨1 b) = f(a) ∨2 f(b) and f(a ∧1 b) = f(a) ∧2 f(b) is called a lattice homomorphism.
Furthermore, if f is a bijection, then it is called a lattice isomorphism.
Example 3.2.12. 1. Let D be the set of all words in our English dictionary with ‘dictionary
ordering’. Then, prove that D is a lattice. Now, consider the set S of all words in D
which are of length at most six or first-part-words of length six. Note that S is a lattice
again. Define f : D → S as f (d) = d if d has length at most six, otherwise f (d) is the
first-part-word of length 6 of d. Then, f is a homomorphism. It is not an isomorphism as
f (stupid) = f (stupidity).
2. Consider the lattice N with usual order. Let S = {0, 1, 2} with usual order. Let f : N → S
be a homomorphism. If f(m) = 0 and f(n) = 1, then m ≤ n, or else, we have

0 = f(m) = f(m ∨ n) ≠ f(m) ∨ f(n) = 0 ∨ 1 = 1.

Thus, the map f must have one of the following forms. Draw pictures to understand this.

(a) f^{-1}(0) = N.
(b) f^{-1}(0) = {1, 2, . . . , k} and f^{-1}(1) = {k + 1, . . .}, for some k ∈ N.
(c) f^{-1}(0) = {1, 2, . . . , k}, f^{-1}(1) = {1, 2, . . . , r} \ {1, 2, . . . , k} and f^{-1}(2) = N \ {1, 2, . . . , r},
for some k, r ∈ N with k < r.

Definition 3.2.13. [Complete lattice] A lattice (L, ≤) is complete if ∨A (lub of A) and ∧A
(glb of A) exist in L, for each nonempty subset A of L.
Example 3.2.14. 1. Verify that the lattices in Figure 3.3 are complete.

[Figure 3.3: Complete lattices. Hasse diagrams of the lattice of 3-tuples of 0 and 1 (from (0, 0, 0) up to (1, 1, 1)), of D(30), and of P({a, b, c}).]


2. Verify that every finite lattice is complete.

3. [Bounded lattice] Every complete lattice has a least element 0 and a greatest element 1.
Any lattice with these two elements is called a bounded lattice.

4. The set [0, 5] with usual order is a bounded and complete lattice. So is the set [0, 1) ∪ [2, 3].

5. The set (0, 5] is a lattice which is neither bounded nor complete.

6. The set [0, 1) ∪ (2, 3] is a bounded lattice, though not complete.

7. The set R with usual order is a lattice. It is not complete in the lattice ‘sense’. It is
‘conditionally complete’, that is, for every bounded nonempty subset glb and lub exist.
Can you think of a reason which implies the importance of the condition ‘non-emptiness’ ?

Ans: Each real number is an upper bound for ∅ and so we do not have a lub of ∅. Similarly,
we do not have a glb of ∅.

8. Fix n ∈ N and let p1 , p2 , . . . , pn be n distinct primes. Prove that the lattice D(N ), for
N = p1 p2 · · · pn is isomorphic to the lattice Ln (the lattice of n-tuples of 0 and 1) and to
the lattice P(S), where S = {1, 2, . . . , n}. The Hasse diagram for n = 3 is shown above.

Definition 3.2.15. [Lattice Complement] Let (L, ≤) be a bounded lattice. Then, a complement
of b ∈ L is an element (if it exists) c ∈ L such that b ∨ c = 1 and b ∧ c = 0. The
lattice is called complemented if every element has at least one complement. We shall use ¬b
to denote a complement of b.

Example 3.2.16. 1. The interval [0, 1] with usual ordering is a distributive lattice but is
not complemented.

2. Verify the captions of the two figures given below. Also, compute ¬0, ¬a, ¬b, ¬c, and ¬1.

[Figure: two Hasse diagrams, captioned ‘Complemented but NOT distributive’ and ‘Distributive but NOT complemented’.]

Discussion 3.2.17. [The comparison table] Let (L, ≤) be a lattice and let a, b, c ∈ L. Then,
the following table lists the properties that hold (make sense) in the specified type of lattices.
Properties                                                        Lattice type

∨, ∧ are idempotent                                               any lattice
∨, ∧ are commutative                                              any lattice
∨, ∧ are associative                                              any lattice
[absorption] a ∧ (a ∨ b) = a = a ∨ (a ∧ b)                        any lattice
a ≤ b ⇔ a ∧ b = a ⇔ a ∨ b = b                                     any lattice
[isotonicity] b ≤ c ⇒ {a ∨ b ≤ a ∨ c, a ∧ b ≤ a ∧ c}              any lattice
[distributive inequalities] a ∨ (b ∧ c) ≤ (a ∨ b) ∧ (a ∨ c),
    a ∧ (b ∨ c) ≥ (a ∧ b) ∨ (a ∧ c)                               any lattice
[modular inequality] a ≤ c ⇔ a ∨ (b ∧ c) ≤ (a ∨ b) ∧ c            any lattice
0 is unique; 1 is unique                                          bounded lattice !!
if a is a complement of b, then b is also a complement of a       bounded lattice !!
¬0 is unique and it is 1; ¬1 is unique and it is 0                bounded lattice !!
an element a has a unique complement                              distributive complemented lattice !!
[cancelation] {a ∨ c = b ∨ c, a ∨ ¬c = b ∨ ¬c} ⇒ a = b,
    {a ∧ c = b ∧ c, a ∧ ¬c = b ∧ ¬c} ⇒ a = b                      distributive complemented lattice
[DeMorgan] ¬(a ∨ b) = ¬a ∧ ¬b,
    ¬(a ∧ b) = ¬a ∨ ¬b                                            distributive complemented lattice
a ∨ ¬b = 1 ⇔ a ∨ b = a,
    a ∧ ¬b = 0 ⇔ a ∧ b = a                                        distributive complemented lattice

Proof. We will only prove the properties that appear in the last three rows. The other properties
are left as an exercise for the reader. To prove the cancelation property, note that

b = b ∨ 0 = b ∨ (c ∧ ¬c) = (b ∨ c) ∧ (b ∨ ¬c) = (a ∨ c) ∧ (a ∨ ¬c) = a ∨ (c ∧ ¬c) = a ∨ 0 = a

and

b = b ∧ 1 = b ∧ (c ∨ ¬c) = (b ∧ c) ∨ (b ∧ ¬c) = (a ∧ c) ∨ (a ∧ ¬c) = a ∧ (c ∨ ¬c) = a ∧ 1 = a.

To prove the DeMorgan’s property, note that

(a ∨ b) ∨ (¬a ∧ ¬b) = (a ∨ b ∨ ¬a) ∧ (a ∨ b ∨ ¬b) = 1 ∧ 1 = 1,

and
(a ∨ b) ∧ (¬a ∧ ¬b) = (a ∧ ¬a ∧ ¬b) ∨ (b ∧ ¬a ∧ ¬b) = 0 ∨ 0 = 0.

Hence, by Definition 3.2.15, we get ¬(a ∨ b) = ¬a ∧ ¬b. Similarly, note that (a ∧ b) ∨ (¬a ∨ ¬b) =
(a ∨ ¬a ∨ ¬b) ∧ (b ∨ ¬a ∨ ¬b) = 1 ∧ 1 = 1 and (a ∧ b) ∧ (¬a ∨ ¬b) = (a ∧ b ∧ ¬a) ∨ (a ∧ b ∧ ¬b) = 0 ∨ 0 = 0.
Thus, by Definition 3.2.15, we again get ¬(a ∧ b) = (¬a ∨ ¬b). To prove the next assertion, note
that if a ∨ ¬b = 1, then

a = a ∨ (b ∧ ¬b) = (a ∨ b) ∧ (a ∨ ¬b) = (a ∨ b) ∧ 1 = a ∨ b.

Conversely, if a = a ∨ b, then a ∨ ¬b = (a ∨ b) ∨ ¬b = a ∨ (b ∨ ¬b) = a ∨ 1 = 1. The second part
is proved along similar lines and is left as an exercise for the reader.

Exercise 3.2.18. 1. Prove that every linearly ordered set is distributive.

2. Draw the Hasse diagrams of {1, 2, 3} × {1, 2, 3, 4} with dictionary order and the lattice
order ((m, n) ≤ (p, q) if m ≤ p and n ≤ q).

3. Give a partial order on N to make it a bounded lattice. You may draw the Hasse diagram
representing it.

4. Does there exist a partial order on N for which each nonempty subset has finitely many (at
least one) upper bounds and finitely many (at least one) lower bounds?

Ans: No. Considering the set N we see that there is a unique minimum, say, k. Then, {k}
has infinitely many upper bounds.

5. Consider the lattice N^2 with lexicographic order. Is it isomorphic to the direct product of
(N, ≤) with itself, where ≤ is the usual order?

6. Show that {0, 1, 2, . . .} is a complete lattice under divisibility relation (allow (0, 0) in the
relation). Characterize those sets A for which ∨A = 0.

7. Is the lattice {1, 2} × {1, 2} × {1, 2} × {1, 2} isomorphic to {1, 2, 3, 4} × {1, 2, 3, 4}?

8. Prove/Disprove: If L is a lattice which is not complete, then |L| ≥ |N|.

Ans: True, as every finite lattice is complete.

9. Draw the Hasse diagram of a finite complemented lattice which is not distributive.

10. How many lattice homomorphisms are there from {1, 2} to {1, 2, . . . , 9}?

Ans: We are looking for a nondecreasing sequence of length two. It can be obtained by
arranging two bars and 9 − 1 = 8 balls (one plus the number of balls to the left of the first bar is
the first element of the sequence). So, the answer is 10!/(8! 2!) = 45.

11. Draw as many Hasse diagrams of non-isomorphic lattices of size 6 as you can.

Ans: Note that it must have a 1 (top) and a 0 (bottom). Considering the poset obtained by
deleting these two elements, we have many cases.

[Figures: Hasse diagrams of the cases, grouped by the height of the poset obtained after deleting 0 and 1: Height 1, Height 2, Height 3 and Height 4.]
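
The count in Exercise 3.2.18.10 can be cross-checked by brute force. The sketch below assumes that both {1, 2} and {1, . . . , 9} carry the usual order, so that join is max and meet is min; the function names are ours and are only illustrative.

```python
from itertools import product


def is_chain_lattice_hom(f, m):
    """f is a tuple with f[i - 1] the image of i, for the chain {1, ..., m}."""
    for a, b in product(range(1, m + 1), repeat=2):
        # In a chain the join of two elements is their max and the meet is their min.
        if f[max(a, b) - 1] != max(f[a - 1], f[b - 1]):
            return False
        if f[min(a, b) - 1] != min(f[a - 1], f[b - 1]):
            return False
    return True


def count_chain_lattice_homs(m, n):
    """Count the lattice homomorphisms from the chain {1..m} to the chain {1..n}."""
    candidates = product(range(1, n + 1), repeat=m)
    return sum(1 for f in candidates if is_chain_lattice_hom(f, m))
```

Calling count_chain_lattice_homs(2, 9) enumerates all 81 maps and keeps exactly the nondecreasing ones.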

3.3 Boolean Algebras

Definition 3.3.1. [Boolean algebra] A Boolean algebra is a set S which is closed under the
binary operations ∨ (called the join) and ∧ (called the meet) and for each x, y, z ∈ S, satisfies
the following properties.
1. x ∨ y = y ∨ x, x ∧ y = y ∧ x [commutative] .
2. x ∨ (y ∧ z) = (x ∨ y) ∧ (x ∨ z), x ∧ (y ∨ z) = (x ∧ y) ∨ (x ∧ z) [distributive] .
3. ∃ 0, 1 ∈ S such that x ∨ 0 = x, x ∧ 1 = x [identity elements] .
4. For each x ∈ S, ∃ y ∈ S such that x ∨ y = 1 and x ∧ y = 0 [inverse] .

Proposition 3.3.2. Let S be a Boolean algebra. Then, the following statements are true.
1. Elements 0 and 1 are unique.
2. For each s ∈ S, ¬s is unique. Therefore, for each x ∈ S, ¬x is called the inverse of x.
3. If y is the inverse of x, then x is the inverse of y. That is, x = ¬(¬x).

Proof.
1. Let 0_1 and 0_2 be two such elements. Then, 0_1 ∨ x = x and x = x ∨ 0_2, for all x ∈ S.
Hence, 0_1 = 0_1 ∨ 0_2 = 0_2. Thus, the required result follows. A similar argument implies
that 1 is unique.

2. Suppose there exists t, r ∈ S such that s ∨ t = 1, s ∧ t = 0, s ∨ r = 1 and s ∧ r = 0. Then,

t = t ∧ 1 = t ∧ (s ∨ r) = (t ∧ s) ∨ (t ∧ r) = 0 ∨ (t ∧ r) = (s ∧ r) ∨ (t ∧ r) = (s ∨ t) ∧ r = 1 ∧ r = r.

3. It directly follows from the definition of ‘inverse’.

Example 3.3.3. 1. Let S ≠ ∅. Then, P(S) is a Boolean algebra with ∨ = ∪, ∧ = ∩,
¬A = A^c, 0 = ∅ and 1 = S. So, we have Boolean algebras of finite size as well as of
uncountable size.
2. Take S = {n ∈ N : n | 30} with a ∨ b = lcm(a, b), a ∧ b = gcd(a, b), ¬a = 30/a, 0 = 1 and
1 = 30. It is a Boolean algebra.
3. Let B = {T, F } with 0 = F , 1 = T and with usual ∨, ∧, ¬. It is a Boolean algebra.
4. Let B be the set of all truth functions involving the variables p1 , . . . , pn , with usual ∨, ∧, ¬.
Take 0 = F and 1 = T. This is the free Boolean algebra on the generators p1 , . . . , pn .
5. The class of finite length formulae involving variables p1, p2, . . . is a countably infinite
Boolean algebra with usual operations.
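
The second example can be checked mechanically. The following sketch tests the rules of Definition 3.3.1 on the divisors of n with ∨ = lcm, ∧ = gcd, 0 = 1 and 1 = n; the helper names are ours, and the inverse is searched for rather than assumed to be n/a.

```python
from math import gcd


def divisors(n):
    return [d for d in range(1, n + 1) if n % d == 0]


def lcm(a, b):
    return a * b // gcd(a, b)


def is_boolean_algebra_of_divisors(n):
    """Check Definition 3.3.1 on the divisors of n (join = lcm, meet = gcd)."""
    S = divisors(n)
    for x in S:
        # identity elements: the zero is the integer 1 and the one is n
        if lcm(x, 1) != x or gcd(x, n) != x:
            return False
        # inverse: some y with x ∨ y = 1 (= n) and x ∧ y = 0 (= 1)
        if not any(lcm(x, y) == n and gcd(x, y) == 1 for y in S):
            return False
        for y in S:
            if lcm(x, y) != lcm(y, x) or gcd(x, y) != gcd(y, x):
                return False
            for z in S:
                # the two distributive laws
                if lcm(x, gcd(y, z)) != gcd(lcm(x, y), lcm(x, z)):
                    return False
                if gcd(x, lcm(y, z)) != lcm(gcd(x, y), gcd(x, z)):
                    return False
    return True
```

For n = 30 every rule holds, whereas for n = 12 the element 2 has no inverse, so the check fails.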

Observation.
The rules of Boolean algebra treat (∨, 0) and (∧, 1) equally. Notice that the second part
of each rule in Definition 3.3.1 can be obtained from the first part by replacing ∨ with ∧ and 0 with 1. Thus,
any statement that one can derive from these rules has a dual version which is derivable
from the rules. This is called the principle of duality.

Theorem 3.3.4. [Rules] Let (S, ∨, ∧, ¬) be a Boolean algebra. Then, the following rules, as
well as their dual, hold true.
1. ¬0 = 1.
2. For each s ∈ S, s ∨ s = s [idempotence] .
3. For each s ∈ S, s ∨ 1 = 1.
4. For each s, t ∈ S, s ∨ (s ∧ t) = s [absorption] .
5. If s ∨ t = r ∨ t and s ∨ ¬t = r ∨ ¬t, then s = r [cancelation] .
6. (s ∨ t) ∨ r = s ∨ (t ∨ r) [associative] .

Proof. We give the proof of the first part of each item and that of its dual is left for the reader.
1. 1 = 0 ∨ (¬0) = ¬0.
2. s = s ∨ 0 = s ∨ (s ∧ ¬s) = (s ∨ s) ∧ (s ∨ ¬s) = (s ∨ s) ∧ 1 = (s ∨ s).
3. 1 = s ∨ ¬s = s ∨ (¬s ∧ 1) = (s ∨ ¬s) ∧ (s ∨ 1) = 1 ∧ (s ∨ 1) = s ∨ 1.
4. s ∨ (s ∧ t) = (s ∧ 1) ∨ (s ∧ t) = s ∧ (1 ∨ t) = s ∧ 1 = s.
5. s = s ∨ 0 = s ∨ (t ∧ ¬t) = (s ∨ t) ∧ (s ∨ ¬t) = (r ∨ t) ∧ (r ∨ ¬t) = r ∨ (t ∧ ¬t) = r ∨ 0 = r.

6. We will prove it using absorption and cancelation. Using absorption, (s ∨ t) ∧ s = s and
s ∨ (r ∧ s) = s. Thus, [(s ∨ t) ∨ r] ∧ s = [(s ∨ t) ∧ s] ∨ (r ∧ s) = s ∨ (r ∧ s) = s. Using
absorption, we also have [s ∨ (t ∨ r)] ∧ s = s and hence

[s ∨ (t ∨ r)] ∧ s = [(s ∨ t) ∨ r] ∧ s.

Now, we see that [s ∨ (t ∨ r)] ∧ ¬s = 0 ∨ [(t ∨ r) ∧ ¬s] = (t ∧ ¬s) ∨ (r ∧ ¬s) and on similar
lines, [(s ∨ t) ∨ r] ∧ ¬s = (t ∧ ¬s) ∨ (r ∧ ¬s). Thus, we again have

[s ∨ (t ∨ r)] ∧ ¬s = [(s ∨ t) ∨ r] ∧ ¬s.

Hence, applying the cancelation property, the required result follows.

Example 3.3.5. Let (L, ≤) be a distributive complemented lattice. Then, by Definition 3.2.2,
L has two binary operations ∨ and ∧ and by Definition 3.2.15, the operation ¬x. It can be
easily verified that (L, ∨, ∧, ¬) is indeed a Boolean algebra.

Now, let (B, ∨, ∧, ¬) be a Boolean algebra. Then, for any two elements a, b ∈ B, we define
a ≤ b if a ∧ b = a. The next result shows that ≤ is a partial order in B. This partial order is
generally called the induced partial order. Thus, we see that the Boolean algebra B, with
the induced partial order, is a distributive complemented lattice.

Theorem 3.3.6. Let (B, ∨, ∧, ¬) be a Boolean algebra. Define a ≤ b if a ∧ b = a. Then, ≤ is
a partial order on B. Furthermore, a ∨ b = lub{a, b} and a ∧ b = glb{a, b}.

Proof. We first verify that (B, ≤) is indeed a partial order.

Reflexive: By idempotence, s ≤ s and hence ≤ is reflexive.
Antisymmetry: Let s ≤ t and t ≤ s. Then, by commutativity, s = s ∧ t = t ∧ s = t.
Transitive: Let s ≤ t and t ≤ r. Then, using associativity, s∧r = (s∧t)∧r = s∧(t∧r) = s∧t = s
and thus, s ≤ r.
Now, we show that a ∨ b = lub{a, b}. Since B is a Boolean algebra, using absorption, we get
(a ∨ b) ∧ a = a and hence a ≤ a ∨ b. Similarly, b ≤ a ∨ b. So, a ∨ b is an upper bound for {a, b}.
Now, let x be any upper bound for {a, b}. Then, by distributive property, (a ∨ b) ∧ x =
(a ∧ x) ∨ (b ∧ x) = a ∨ b. So, a ∨ b ≤ x. Thus, a ∨ b is the lub of {a, b}. The rest of the proof is
similar and hence is left for the reader.
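
As a sanity check of Theorem 3.3.6, one may look at the power set Boolean algebra, where ∧ is intersection. The sketch below (with our own helper names) computes the induced order from a ∧ b = a and searches for least upper bounds directly from the definition.

```python
from itertools import combinations


def powerset(base):
    items = sorted(base)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]


def leq(a, b):
    """Induced order of Theorem 3.3.6: a <= b iff a ∧ b = a (∧ is intersection)."""
    return a & b == a


def least_upper_bound(a, b, elements):
    """Return the least element among the upper bounds of {a, b}, if it exists."""
    uppers = [x for x in elements if leq(a, x) and leq(b, x)]
    least = [u for u in uppers if all(leq(u, v) for v in uppers)]
    return least[0] if len(least) == 1 else None
```

On P({1, 2, 3}) the induced order coincides with ⊆ and the least upper bound of {a, b} is a ∪ b, as the theorem predicts.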

Thus, we observe that there is a one-to-one correspondence between Boolean algebras
and distributive complemented lattices.

Definition 3.3.7. [Atom] Let B be a Boolean algebra. An element b ∈ B, b ≠ 0, is called an atom
if b is a minimal element of B \ {0} with respect to the induced partial order.
Example 3.3.8. 1. In the powerset Boolean algebra, singleton sets are the only atoms.
2. Atoms of the ‘divides 30’ Boolean algebra are 2, 3 and 5.
3. The {F, T } Boolean algebra has only one atom, namely T .

Exercise 3.3.9. 1. Determine the atoms of the free Boolean algebra with generators p1, . . . , pn.
Ans: The 2^n conjunctions l1 ∧ · · · ∧ ln, where each li is either pi or ¬pi.
2. Is it necessary that every Boolean algebra has at least one atom?
Ans: No. The free Boolean algebra on generators p1, p2, . . . has no atoms.

Definition 3.3.10. [Boolean homomorphism] Let B1 and B2 be two Boolean algebras. A

function f : B1 → B2 is a Boolean homomorphism if it preserves 0, 1, ∨, ∧, and ¬. That is,

f(0_1) = 0_2, f(1_1) = 1_2, f(a ∨ b) = f(a) ∨ f(b), f(a ∧ b) = f(a) ∧ f(b), and f(¬a) = ¬f(a).

A Boolean isomorphism is a Boolean homomorphism which is a bijection.

Exercise 3.3.11. Let B1 and B2 be two Boolean algebras and let f : B1 → B2 be a function
that satisfies the four conditions f(0_1) = 0_2, f(1_1) = 1_2, f(a ∨ b) = f(a) ∨ f(b) and f(a ∧ b) =
f(a) ∧ f(b). Then, prove that f also satisfies the fifth condition, namely f(¬a) = ¬f(a).

Example 3.3.12. The function f : P(J4) → P(J3) defined as f(S) = S \ {4} is a Boolean
homomorphism. We check just two properties and the rest is left as an exercise.

f(A ∨ B) = f(A ∪ B) = (A ∪ B) \ {4} = (A \ {4}) ∪ (B \ {4}) = f(A) ∨ f(B).

f(1_1) = f(J4) = J4 \ {4} = J3 = 1_2.
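
The remaining properties can also be checked exhaustively. The sketch below assumes Jn = {1, . . . , n} and represents subsets as frozensets; is_boolean_hom is our own helper, not notation from the text.

```python
from itertools import combinations


def powerset(n):
    """All subsets of J_n = {1, ..., n} as frozensets."""
    base = range(1, n + 1)
    return [frozenset(c) for r in range(n + 1) for c in combinations(base, r)]


def f(S):
    """The map of Example 3.3.12: drop the element 4."""
    return frozenset(S) - {4}


def is_boolean_hom(g, n_src, n_dst):
    """Check that g : P(J_n_src) -> P(J_n_dst) preserves 0, 1, join, meet and negation."""
    src = powerset(n_src)
    top_src = frozenset(range(1, n_src + 1))
    top_dst = frozenset(range(1, n_dst + 1))
    if g(frozenset()) != frozenset() or g(top_src) != top_dst:
        return False
    for A in src:
        if g(top_src - A) != top_dst - g(A):
            return False
        for B in src:
            if g(A | B) != g(A) | g(B) or g(A & B) != g(A) & g(B):
                return False
    return True
```

The same helper rejects a constant map such as S ↦ {1, 2}, which fails to preserve 0 and 1.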

Proposition 3.3.13. Let B be a Boolean algebra and p, q be two distinct atoms. Then, p ∧ q = 0.

Proof. Suppose that p ∧ q ≠ 0. As p ∧ q ≤ p and p is an atom, we must have p ∧ q = p, i.e.,
p ≤ q. As p ≠ q and p ≠ 0, it follows that q cannot be an atom, a contradiction.

Proposition 3.3.14. Let B be a Boolean algebra with three distinct atoms p, q and r. Then,
p ∨ q ≠ p ∨ q ∨ r.

Proof. If possible, let p ∨ q = p ∨ q ∨ r. Then, we have

r = r ∨ 0 = r ∨ [(p ∨ q) ∧ ¬(p ∨ q)] = [r ∨ p ∨ q] ∧ [r ∨ ¬(p ∨ q)] = [p ∨ q] ∧ [r ∨ ¬(p ∨ q)]

= [(p ∨ q) ∧ r] ∨ [(p ∨ q) ∧ ¬(p ∨ q)] = (p ∨ q) ∧ r = (p ∧ r) ∨ (q ∧ r) = 0 ∨ 0 = 0,

a contradiction to r being an atom, i.e., r is nonzero.

Example 3.3.15. Let B be a Boolean algebra having distinct atoms A = {p, q, r}. Then, B
has at least 2^3 elements.
To show this, we define f : P(A) → B by f(∅) = 0 and, for nonempty S ⊆ A, f(S) = ⋁_{x∈S} x, and claim
that f is a one-one function.
Suppose f(S) = f(T). Then, f(S) = f(S) ∨ f(T) = f(S ∪ T). In view of Proposition 3.3.14,
we have S = S ∪ T, i.e., T ⊆ S. Similarly, as f(T) = f(T ∪ S), we have S ⊆ T and hence S = T.
Thus, f is a one-one function. Therefore, f(S) is distinct for each subset S of A and thus B has
at least 2^3 elements.

Theorem 3.3.16. Let B be a Boolean algebra having distinct atoms A = {p, q, r, s}. Let b ∈ B,
b ≠ 0. Suppose that S = {atoms x : x ≤ b} = {p, q, r}. Then, b = p ∨ q ∨ r.

Proof. It is clear that p ∨ q ∨ r ≤ b. Suppose that p ∨ q ∨ r < b. Then,

b = b ∧ [(p ∨ q ∨ r) ∨ ¬(p ∨ q ∨ r)] = [b ∧ (p ∨ q ∨ r)] ∨ [b ∧ ¬(p ∨ q ∨ r)] = (p ∨ q ∨ r) ∨ [b ∧ ¬(p ∨ q ∨ r)].

Therefore, the above equality implies that [b ∧ ¬(p ∨ q ∨ r)] ≠ 0. So, there is an atom, say x,
such that x ≤ b ∧ ¬(p ∨ q ∨ r). Thus, we have x ≤ b and x ≤ ¬(p ∨ q ∨ r).
Notice that if x ≤ (p ∨ q ∨ r), then x ≤ 0, which is not possible. So, x ≠ p, q, r is an atom in
S, a contradiction.

Theorem 3.3.17. [Representation] Let B be a finite Boolean algebra. Then, there exists a
set X such that B is isomorphic to P(X).

Proof. Put X = {atoms of B}. Note that X ≠ ∅. Define f : B → P(X) by f(b) = {atoms ≤ b}.
We show that f is the required Boolean isomorphism.
Injection: Let b1 ≠ b2. Then, either b1 ≰ b2 or b2 ≰ b1. Without loss of generality, let b1 ≰ b2.
[Now imagine the power set Boolean algebra. Saying b1 ≰ b2 is the same as b1 ⊈ b2. In that case,
we have an element in b1 which is not in b2. That is, b1 ∩ b2^c ≠ ∅. That is, there is a singleton
subset of b1 ∩ b2^c. This is exactly what we are aiming for, i.e., to prove that b1 ∧ ¬b2 ≠ 0.] Note
that b1 = b1 ∧ (b2 ∨ ¬b2) = (b1 ∧ b2) ∨ (b1 ∧ ¬b2). Also, the assumption b1 ≰ b2 implies b1 ∧ b2 ≠ b1
and hence b1 ∧ ¬b2 ≠ 0. So, there exists an atom x ≤ (b1 ∧ ¬b2) and hence x = x ∧ b1 ∧ ¬b2.
Therefore,

x ∧ b1 = (x ∧ b1 ∧ ¬b2) ∧ b1 = x ∧ b1 ∧ ¬b2 = x.
Thus, x ≤ b1. Similarly, x ≤ ¬b2. As x ≠ 0, we cannot have x ≤ b2 (the condition x ≤ ¬b2 and
x ≤ b2 implies x ≤ b2 ∧ ¬b2 = 0). Thus, f(b1) ≠ f(b2).
Surjection: Let A = {x1, . . . , xk} ⊆ X and put b = x1 ∨ · · · ∨ xk (if k = 0, then b = 0). Clearly,
A ⊆ f(b). Need to show: A = f(b). So, let y ∈ f(b), i.e., y is an atom in B and

y = y ∧ b = y ∧ (x1 ∨ · · · ∨ xk) = (y ∧ x1) ∨ · · · ∨ (y ∧ xk).

Since y ≠ 0, it follows that y ∧ x_{i0} ≠ 0, for some i0 ∈ {1, 2, . . . , k}. As
x_{i0} and y are atoms, Proposition 3.3.13 gives y = x_{i0} and hence y ∈ A. Thus, f is a surjection.
Preserving 0, 1: Clearly f (0) = ∅ and f (1) = X.
Preserving ∨, ∧: By definition,

x ∈ f (b1 ∧ b2 ) ⇔ x ≤ b1 ∧ b2 ⇔ x ≤ b1 and x ≤ b2
⇔ x ∈ f (b1 ) and x ∈ f (b2 ) ⇔ x ∈ f (b1 ) ∩ f (b2 ).

Now, let x ∈ f(b1 ∨ b2). Then, by definition, x = x ∧ (b1 ∨ b2) = (x ∧ b1) ∨ (x ∧ b2). So, there exists
i such that x ∧ bi ≠ 0 (say, x ∧ b1 ≠ 0). As x is an atom, x ≤ b1 and hence x ∈ f(b1) ⊆ f(b1) ∪ f(b2).
Conversely, let x ∈ f(b1) ∪ f(b2). Without loss of generality, let x ∈ f(b1). Thus, x ≤ b1 and
hence x ≤ b1 ∨ b2 which in turn implies that x ∈ f(b1 ∨ b2).
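
The map f of the proof can be computed explicitly on a small example. The sketch below uses the ‘divides 30’ Boolean algebra of Example 3.3.3 (join = lcm, meet = gcd, ¬a = 30/a); the identifiers are ours and the choice of 30 is only for illustration.

```python
from math import gcd

N = 30  # squarefree, so the divisors of N form a Boolean algebra
B = [d for d in range(1, N + 1) if N % d == 0]


def join(a, b):
    return a * b // gcd(a, b)


def meet(a, b):
    return gcd(a, b)


def neg(a):
    return N // a


def leq(a, b):
    """Induced order: a <= b iff a ∧ b = a, i.e. a divides b."""
    return meet(a, b) == a


# Atoms: elements other than the zero (the integer 1) whose only lower bounds are 1 and themselves.
atoms = [b for b in B if b != 1 and all(x in (1, b) for x in B if leq(x, b))]


def f(b):
    """The representation map b -> {atoms below b}."""
    return frozenset(x for x in atoms if leq(x, b))
```

Here the atoms are 2, 3 and 5, and f sends, for instance, 6 to {2, 3} and 30 to {2, 3, 5}.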

As a direct corollary, we have the following result.


Corollary 3.3.18. Let B be a finite Boolean algebra having exactly k atoms. Then, B is
isomorphic to P({1, 2, . . . , k}) and hence has exactly 2^k elements.
Exercise 3.3.19. 1. Determine the number of elements in a finite Boolean algebra.
2. Supply a Boolean homomorphism f from P(J4) to P(J3) such that the image of P(J4) has
4 elements.
Ans: On the atoms, take {1} → ∅, {2} → ∅, {3} → {3} and {4} → {1, 2}, and extend f by unions. The image is {∅, {3}, {1, 2}, J3}.
3. Prove/Disprove: The number of Boolean homomorphisms from P (J4 ) to P (J3 ) is less than
the number of lattice homomorphisms from P (J4 ) to P (J3 ).
Ans: Yes.
Since a Boolean algebra is a distributive lattice, a Boolean homomorphism preserves lub and
glb for two elements. Hence, it is a lattice homomorphism from P (J4 ) to P (J3 ).
The function f : P (J4 ) → P (J3 ) defined as f (x) = {1, 2} is a lattice homomorphism which
is not a Boolean homomorphism.
4. Show that a lattice homomorphism on a Boolean algebra which preserves 0 and 1 is a
Boolean homomorphism.
Ans: Let f : B → B be a lattice homomorphism which preserves 0 and 1 and let a ∈ B. Then, f(a) ∨ f(¬a) =
f(a ∨ ¬a) = f(1) = 1 and f(a) ∧ f(¬a) = f(a ∧ ¬a) = f(0) = 0. Thus, f(¬a) is a (hence
the) complement of f(a). So, f(¬a) = ¬f(a).
5. Consider the class of all functions f : R → {π, e}. Can we define some operations on this
class to make it a Boolean algebra?

6. Show that a finite Boolean algebra must have at least one atom. Is ‘finite’ necessary?
7. A positive integer is called squarefree if it is not divisible by the square of a prime. Let
Bn = {k ∈ N : k|n}. For a, b ∈ Bn take the operations a ∨ b = lcm(a, b), a ∧ b = gcd(a, b)
and ¬a = n/a. Show that Bn is a Boolean algebra if and only if n > 1 is squarefree.
8. Show that the set of subsets of N which are either finite or have a finite complement is a
countably infinite Boolean algebra. Find the atoms. Is it isomorphic to the Boolean algebra
of all finite length formulae involving variables p1, p2, · · · ?
Ans: Call this set B. Since P(N) is a Boolean algebra, all we need to show is that x ∨ y, x ∧ y
and ¬x are elements of B whenever x, y ∈ B.
Note that x ∨ y has a finite complement if and only if at least one of x, y has a finite complement;
otherwise x ∨ y is finite.
Note that x ∧ y is finite if and only if at least one of x, y is finite; otherwise x ∧ y has a
finite complement.
Note that ¬x has a finite complement if and only if x is finite.
Note that the set A of all finite subsets of N is nothing but ⋃_{n=1}^{∞} P(Jn). Hence, it is countable. So,
the set A′ = {N \ X : X ∈ A} is also countable. Thus, B = A ∪ A′ is countable.
Atoms of B are precisely {1}, {2}, . . ..
No, this is not isomorphic to the finite length formulae Boolean algebra as the latter Boolean
algebra does not have an atom.
9. Let B be a Boolean algebra and xi ∈ B, i = 1, 2, . . .. We know that, for each n ∈ N, the
expression ‘⋁_{i=1}^{n} xi’ is meaningful in each Boolean algebra due to associativity. Is ‘⋁_{i=1}^{∞} xi’
necessarily a meaningful expression?
Ans: No. Consider the Boolean algebra of all finite length formulae written using the variables
p1, p2, . . .. The expression ⋁_{i=1}^{∞} pi means p1 ∨ p2 ∨ p3 ∨ · · · . This is not a finite length formula
and so it is not in the Boolean algebra. Thus, it does not make sense to talk about ⋁_{i=1}^{∞} pi in an
arbitrary Boolean algebra.
10. Prove/Disprove: Let f : B1 → B2 be a Boolean homomorphism and a ∈ B1 be an atom.
Then, f (a) is an atom of B2 .
Ans: No. We know that f : P(J4) → P(J3) defined as f(A) = A \ {4} is a Boolean
homomorphism. Note that {4} is an atom of P(J4) and f({4}) = ∅ = 0_2.
11. Fill in the blank: The number of Boolean homomorphisms from P(J4) to P(J3) is ____.
Ans: Let f : P(J4) → P(J3) be a Boolean homomorphism. Notice that two distinct atoms,
say {1}, {2}, of P(J4) go to elements with intersection zero, that is,

0_2 = f(0_1) = f({1} ∩ {2}) = f({1}) ∩ f({2}).

Define a relation F_f from {{1}, {2}, {3}, {4}} to {{1}, {2}, {3}} as (x, y) ∈ F_f if y ⊆ f(x).
Then, by the previous observation, we see that F_f^{-1} : {{1}, {2}, {3}} → {{1}, {2}, {3}, {4}} is a function.
Conversely, any such function gives a Boolean homomorphism. So, the answer is 4^3 = 64.
12. Fill in the blank: The number of Boolean homomorphisms from P(J4) onto P(J3) is ____.
Ans: Let f : P(J4) → P(J3) be an onto Boolean homomorphism. Notice that two distinct
atoms, say {1}, {2}, of P(J4) go to elements with intersection zero, that is,

0_2 = f(0_1) = f({1} ∩ {2}) = f({1}) ∩ f({2}).

Thus, f({1}) cannot be a two element set, say, {2, 3}, otherwise, we have ∅ ⊊ f^{-1}({2}) ⊆
f^{-1}({2, 3}) = {1} implying that f({1}) = {2} and so f is not a function.
Since f(J4) = J3 it follows that three atoms of P(J4) must go to three atoms of P(J3). The
fourth atom of P(J4) must go to ∅ = 0_2. So, the answer is 4 × 3! = 24.
13. How many atoms does the “divides 30030” Boolean algebra have? How many elements does it
have?
Ans: Six: 2, 3, 5, 7, 11, 13. The total number of elements is 2^6.
14. If B1 and B2 are Boolean algebras of size k (k > 100), then they must be isomorphic and
there must be more than k isomorphisms between them.
Ans: Yes. Take k = 2^m. So, m ≥ 7 and, in particular, m ≥ 4. There are m! > 2^m = k isomorphisms
(considering the atoms in each Boolean algebra).

15. Give examples of two countably infinite non-isomorphic Boolean algebras.

Ans: The ‘finite and finite complement subsets in N’ Boolean algebra and the ‘finite length
formulae on variables p1, p2, . . .’ Boolean algebra are both countably infinite. They are
non-isomorphic (see some earlier exercises).
16. Give examples of two uncountably infinite non-isomorphic Boolean algebras.
Ans: P(R) Boolean algebra and the ‘finite and finite complement subsets in R’ Boolean
algebra. They are non-isomorphic as given any element in the latter one, either there are
infinitely many elements which are greater than it or there are infinitely many elements less
than it, but not both. Whereas, in the former Boolean algebra, we can have an element which
satisfies both.
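
The answers 64 and 24 of Exercises 3.3.19.11 and 3.3.19.12 can be confirmed by enumeration. The sketch below relies on the fact that a Boolean homomorphism on P(J4) is determined by the images of the four atoms, since it preserves ∨ and 0; the function names are ours.

```python
from itertools import combinations, product


def powerset(n):
    base = range(1, n + 1)
    return [frozenset(c) for r in range(n + 1) for c in combinations(base, r)]


def extend(atom_images, S):
    """Image of S under the join-preserving map fixed by the images of the atoms."""
    out = frozenset()
    for i in S:
        out |= atom_images[i - 1]
    return out


def boolean_homs(n_src, n_dst):
    """Return every Boolean homomorphism P(J_n_src) -> P(J_n_dst) as a dict."""
    src, dst = powerset(n_src), powerset(n_dst)
    top_src = frozenset(range(1, n_src + 1))
    top_dst = frozenset(range(1, n_dst + 1))
    homs = []
    for atom_images in product(dst, repeat=n_src):
        f = {S: extend(atom_images, S) for S in src}
        # 0 and joins are preserved by construction; check 1 and meets.
        if f[top_src] == top_dst and all(
                f[A & B] == f[A] & f[B] for A in src for B in src):
            homs.append(f)
    return homs
```

An onto homomorphism is one whose dictionary takes all 8 values of P(J3), so filtering the list returned by boolean_homs(4, 3) on that condition gives the count for the second exercise.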

Chapter 4

Basic Counting

Discussion 4.0.1. In the previous chapters, we learnt that two sets, say A and B, have
the same cardinality if there exists a one-one and onto function f : A → B. We also learnt the
following two rules of counting which play a basic role in the development of this subject.
1. [Multiplication rule] If a task has n compulsory parts, say A1, A2, . . . , An, and the i-th
part can be completed in mi = |Ai| ways, i = 1, . . . , n, then the task can be completed in
m1 m2 · · · mn ways. In mathematical terms,

|A1 × A2 × · · · × An| = |A1| · |A2| · · · · · |An|.

2. [Addition rule] If a task consists of n alternative parts, say A1, A2, . . . , An, and the i-th
part can be done in |Ai| = mi ways, i = 1, . . . , n, then the task can be completed in
m1 + m2 + · · · + mn ways. In mathematical terms,

|A1 ∪ A2 ∪ · · · ∪ An| = |A1| + |A2| + · · · + |An|, whenever Ai ∩ Aj = ∅, 1 ≤ i < j ≤ n.
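
Both rules can be illustrated by direct enumeration. In the sketch below, count_product and count_disjoint_union are our own illustrative helpers; the second one refuses parts that overlap, since the addition rule needs pairwise disjoint parts.

```python
from itertools import product


def count_product(*parts):
    """Multiplication rule: enumerate A1 x A2 x ... x An and count the tuples."""
    return sum(1 for _ in product(*parts))


def count_disjoint_union(*parts):
    """Addition rule: size of A1 ∪ ... ∪ An for pairwise disjoint parts."""
    sets = [set(p) for p in parts]
    for i in range(len(sets)):
        for j in range(i + 1, len(sets)):
            if sets[i] & sets[j]:
                raise ValueError("the addition rule needs pairwise disjoint parts")
    return len(set().union(*sets))
```

For example, count_product("ab", range(3), "xyz") enumerates 2 · 3 · 3 tuples, and count_disjoint_union({1, 2}, {3}, {4, 5, 6}) adds 2 + 1 + 3.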

Example 4.0.2. 1. How many three digit natural numbers can be formed using digits 0, 1, · · · , 9?
Identify the number of parts in the task and the type of the parts (compulsory or alterna-
tive). Which rule applies here?
Ans: The task has three compulsory parts. Part 1: choose a digit for the leftmost place.
Part 2: choose a digit for the middle place. Part 3: choose a digit for the rightmost place.

Multiplication rule applies. Ans: 900.

2. How many three digit natural numbers with distinct digits can be formed using digits
1, · · · , 9 such that each digit is odd or each digit is even? Identify the number of parts in
the task and the type of the parts (compulsory or alternative). Which rule applies here?
Ans: The task has two alternative parts. Part 1: form a three digit number with distinct
digits from {1, 3, 5, 7, 9}. Part 2: form a three digit number with distinct digits from
{2, 4, 6, 8}. Observe that Part 1 is a task having three compulsory subparts. By the
multiplication rule, Part 1 can be done in 5 · 4 · 3 = 60 ways. Part 2 is also a task having
three compulsory subparts and, by the multiplication rule, it can be done in 4 · 3 · 2 = 24
ways. Since our task has alternative parts, addition rule applies.
Ans: 60 + 24 = 84.
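
Both answers of Example 4.0.2 are small enough to verify by listing the numbers. The sketch below is a brute-force check with our own function names.

```python
def three_digit_numbers():
    """All three digit natural numbers (the leftmost digit is nonzero)."""
    return list(range(100, 1000))


def odd_or_even_distinct():
    """Three digit numbers with distinct digits from 1..9, all odd or all even."""
    out = []
    for n in range(100, 1000):
        digits = str(n)
        distinct = len(set(digits)) == 3
        all_odd = all(d in "13579" for d in digits)
        all_even = all(d in "2468" for d in digits)  # the digit 0 is not allowed
        if distinct and (all_odd or all_even):
            out.append(n)
    return out
```

The first list has 9 · 10 · 10 entries and the second splits into the two alternative parts of the example.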

Definition 4.0.3. We use the notation n! = 1 · 2 · · · · · n. By convention, we take 0! = 1.

4.1 Permutations and Combinations

Definition 4.1.1. An r-sequence of elements of X is a sequence of length r with elements
from X. This may be viewed as a word of length r with alphabets from X or as a function
f : {1, 2, . . . , r} → X. We write ‘an r-sequence of X’ to mean ‘an r-sequence of elements of X’.

Theorem 4.1.2. The number of r-sequences of {1, 2, . . . , n} is n^r.

Proof. Here the task has r compulsory parts: choose the first element of the sequence, the
second element and so on. Each part can be done in n ways, so the multiplication rule gives n^r.

Exercise 4.1.3. 1. In how many ways can r distinguishable/distinct balls be put into n
distinguishable/distinct boxes?
Ans: n^r: Number the balls as 1, 2, . . . , r and the boxes as x1, x2, . . . , xn. An allocation is
then an r-sequence of {x1, x2, . . . , xn}.

2. How many distinct ways are there to make a 5 letter word using the ENGLISH alphabet

(a) with no restriction?
Ans: 26^5.
(b) with ONLY consonants?
Ans: 21^5.
(c) with ONLY vowels?
Ans: 5^5.
(d) with a consonant as the first letter and a vowel as the second letter?
Ans: 21 × 5 × 26^3.
(e) if the vowels appear only at odd positions?
Ans: 26^3 · 21^2.

3. Determine the total number of possible outcomes if

(a) two coins are tossed?
Ans: 2^2, f : {1, 2} → {H, T}, where H stands for Head and T for Tail.
(b) a coin and a die are tossed?
Ans: 2 × 6, f : {1} → {H, T} and g : {1} → {1, 2, 3, 4, 5, 6}.
(c) two dice are tossed?
Ans: 6^2, f : {1, 2} → {1, 2, 3, 4, 5, 6}.
(d) three dice are tossed?
Ans: 6^3, f : {1, 2, 3} → {1, 2, 3, 4, 5, 6}.
(e) k dice are tossed, where k ∈ N?
Ans: 6^k, f : {1, 2, . . . , k} → {1, 2, 3, 4, 5, 6}.
(f) five coins are tossed?
Ans: 2^5, f : {1, 2, 3, 4, 5} → {H, T}.

4. How many 5-letter words using only A’s, B’s, C’s, and D’s are there that do not contain
the word “CAD”?
Ans: Total number of 5-letter words using only A’s, B’s, C’s, and D’s equals 4^5 as we need
to consider f : {1, 2, 3, 4, 5} → {A, B, C, D}. The word “CAD” may start at the first place or
the second place or the third place. As we need 5-letter words, there are two places left after
using the places for the word CAD. So, the number of words that contain CAD is 3 · 4^2 (the
word CAD cannot occur twice in a 5-letter word). So, the answer is 4^5 − 3 × 4^2 = 976.
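
The last answer can be confirmed by listing all 4^5 words and discarding those that contain CAD. The helper below is a brute-force sketch with a name of our choosing.

```python
from itertools import product


def count_words_avoiding(alphabet, length, forbidden):
    """Count the words of the given length over the alphabet that do not contain `forbidden`."""
    return sum(1 for w in product(alphabet, repeat=length)
               if forbidden not in "".join(w))
```

Here count_words_avoiding("ABCD", 5, "CAD") examines 1024 words.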

Definition 4.1.4. [r-permutation, n-set] By an n-set, we mean a set containing n elements.

An r-permutation of an n-set S is an arrangement of r distinct elements of S in a row. An
r-permutation may be viewed as a one-one mapping f : {1, 2, . . . , r} → S. An n-permutation of
an n-set is simply called a permutation.

Example 4.1.5. How many one-one maps f : {1, 2, 3, 4} → A = {A, B, . . . , Z} are there?
Ans: The task has 4 compulsory parts: select f(1), select f(2), select f(3) and select f(4).
Note that f(2) cannot be f(1), f(3) cannot be f(1) or f(2) and so on. Now apply the
multiplication rule. Ans: 26 · 25 · 24 · 23 = 26!/22!.

Theorem 4.1.6. [Number of r-permutations] The number of r-permutations of an n-set S is
P(n, r) = n!/(n − r)!.

Proof. Let us view an r-permutation as a one-one map f : {1, 2, . . . , r} → S. Here the
task has r compulsory parts: select f(1), select f(2), . . ., select f(r) with the condition, for
2 ≤ k ≤ r, f(k) ∉ {f(1), f(2), . . . , f(k − 1)}. Multiplication rule applies. Hence, the number of
r-permutations equals n(n − 1) · · · (n − r + 1) = n!/(n − r)!.
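
The formula can be compared with an explicit enumeration for small n. In the sketch below, P follows the formula of Theorem 4.1.6 (returning 0 when r > n, as in the convention used in these notes), while count_r_permutations lists the arrangements; both names are ours.

```python
from itertools import permutations
from math import factorial


def P(n, r):
    """Number of r-permutations of an n-set, n!/(n - r)!, and 0 if r > n."""
    if r < 0 or r > n:
        return 0
    return factorial(n) // factorial(n - r)


def count_r_permutations(n, r):
    """Count the arrangements of r distinct elements of {1, ..., n} in a row."""
    return sum(1 for _ in permutations(range(1, n + 1), r))
```

For instance, P(26, 4) reproduces the count 26 · 25 · 24 · 23 of Example 4.1.5.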

Definition 4.1.7. By P (n, r), we denote the number of r-permutations of {1, 2, . . . , n}. By
convention, P (n, 0) = 1. Some books use the notation n(r) and call it the falling factorial of
n. Thus, if r > n then P (n, r) = n(r) = 0 and if n = r then P (n, r) = n(r) = n!.

Exercise 4.1.8. 1. How many distinct ways are there to make 5 letter words using the ENGLISH
alphabet if the letters must be different?
Ans: 26 · 25 · 24 · 23 · 22 = 26!/21!.

2. How many distinct ways are there to arrange the 5 letters of the word ROYAL?
Ans: 5!

3. Determine the number of ways to place 4 couples in a row if each couple sits together.
Ans: A couple can be thought of as one cohesive group (they are to be seated together).
So, the 4 cohesive groups can be arranged in 4! ways. But a couple can sit either as “wife and
husband” or “husband and wife”. So, the total number of arrangements is 2^4 · 4!.

4. How many distinct ways can 8 persons, including Ram and Shyam, sit in a row, with Ram
and Shyam sitting next to each other?
Ans: Ram and Shyam can be thought of as one person. So, the 7 persons can be arranged
in 7! ways. But Ram and Shyam can sit either as “Ram and Shyam” or “Shyam and Ram”.
So, the total number of arrangements is 2! · 7!.
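
The last two answers can be checked by running through all 8! seatings. The sketch below uses a helper of our own that counts the rows in which every listed pair sits in adjacent seats; the labels of the persons are placeholders.

```python
from itertools import permutations


def count_rows_with_pairs_adjacent(people, pairs):
    """Count the seatings of `people` in a row with each pair in `pairs` adjacent."""
    count = 0
    for row in permutations(people):
        seat = {person: i for i, person in enumerate(row)}
        if all(abs(seat[a] - seat[b]) == 1 for a, b in pairs):
            count += 1
    return count


# Hypothetical labels: w1, h1 form the first couple, and so on.
couples = [("w1", "h1"), ("w2", "h2"), ("w3", "h3"), ("w4", "h4")]
couple_members = [p for pair in couples for p in pair]
eight_persons = ["Ram", "Shyam", "p3", "p4", "p5", "p6", "p7", "p8"]
```

With these inputs the couples problem corresponds to count_rows_with_pairs_adjacent(couple_members, couples) and the Ram and Shyam problem to count_rows_with_pairs_adjacent(eight_persons, [("Ram", "Shyam")]).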

Proposition 4.1.9. [principle of disjoint pre-images of equal size] Let A, B be finite sets and
f : A → B be a function such that for each pair b1, b2 ∈ B we have |f^{-1}(b1)| = k = |f^{-1}(b2)|
(recall that f^{-1}(b1) ∩ f^{-1}(b2) = ∅ whenever b1 ≠ b2). Then, |A| = k|B|.

Discussion 4.1.10. Consider the word AABAB. Give subscripts to the three A’s and the two
B’s and complete the following list of arrangements of A1, A2, A3, B1, B2. Notice that erasing
the subscripts from each of them gives a word with three A’s and two B’s, and that several
arrangements give the same word, for example AABAB.
A1A2B1A3B2    A1A2B1B2A3    A1A2A3B1B2    A1A2A3B2B1    A1A2B2B1A3
A1A2B2A3B1    · · ·         · · ·         · · ·         · · ·
· · ·         · · ·         · · ·         · · ·         · · ·
· · ·         · · ·         · · ·         · · ·         · · ·
· · ·         · · ·         · · ·         B2A3B1A1A2    B2A3B1A2A1

Example 4.1.11. How many words of size 5 are there which use three A’s and two B’s?
Ans: Put A = {arrangements of A1, A2, A3, B1, B2} and B = {words of size 5 which use three
A’s and two B’s}. For each arrangement a ∈ A, define f(a) to be the word in B obtained by
erasing the subscripts. Then, the function f : A → B satisfies:

‘for each b, c ∈ B, b ≠ c, we have |f^{-1}(b)| = |f^{-1}(c)| = 3!2! and f^{-1}(b) ∩ f^{-1}(c) = ∅’.

Thus, by Proposition 4.1.9, |B| = |A|/(3!2!) = 5!/(3!2!) = 10.
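
The pre-image sizes used above can be tabulated directly. The sketch below, with our own helper names, erases the subscripts from every arrangement of the five labelled symbols and records how many arrangements give each word.

```python
from collections import Counter
from itertools import permutations


def erase_subscripts(arrangement):
    """Turn ('A1', 'A2', 'B1', 'A3', 'B2') into 'AABAB'."""
    return "".join(symbol[0] for symbol in arrangement)


def fibre_sizes(labelled):
    """Map each word to the number of labelled arrangements that produce it."""
    return Counter(erase_subscripts(a) for a in permutations(labelled))
```

Calling fibre_sizes(["A1", "A2", "A3", "B1", "B2"]) distributes the 5! arrangements over the words of B, every word receiving the same number of arrangements.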

Remark 4.1.12. Let us fix n, k ∈ N with 0 ≤ k ≤ n and ask the question ‘how many words of
size n are there which use k many A’s and (n − k) many B’s’?
Ans: Put A = {arrangements of A1 A2 . . . Ak B1 B2 . . . B_{n−k}} and B = {words of size n which
use k many A’s and (n − k) many B’s} and proceed as above to get

|B| = |A|/(k!(n − k)!) = n!/(k!(n − k)!)

as the required answer. Observe that the above argument implies n!/(k!(n − k)!) ∈ Z. We denote this
number by P(n; k). Note that P(n; k) = P(n; n − k). Also, as per convention, P(n; k) = 0,
whenever k < 0 or n < k.

The above idea is further generalized below.

Definition 4.1.13. A multiset is a collection of objects where an object can appear more than
once. So, a set is a multiset. Note that {a, a, b, c, d} and {a, b, a, c, d} are the same 5-multisets.

Theorem 4.1.14. [Arrangements] Let us fix n, k ∈ N with 1 ≤ k ≤ n and let S be a multiset
containing ni ∈ N objects of i-th type, for i = 1, . . . , k, with n = Σ_{i=1}^{k} ni. Then, there are
(n1 + · · · + nk)!/(n1! n2! · · · nk!) = n!/(n1! n2! · · · nk!) arrangements of the objects in S.
Proof. Assume that S consists of ni copies of Ai, i = 1, . . . , k. Put
A = {arrangements of A_{1,1}, . . . , A_{1,n1}, A_{2,1}, . . . , A_{2,n2}, . . . , A_{k,1}, . . . , A_{k,nk}} and
B = {words of size ____ made using elements of ____}. For each arrangement a ∈ A,
define f(a) to be the word in B obtained by erasing the right subscripts of the objects of a.
Then, the function f : A → B satisfies:

‘for each b, c ∈ B, b ≠ c, we have |f^{-1}(b)| = |f^{-1}(c)| = ____ and f^{-1}(b) ∩ f^{-1}(c) = ∅’.

Thus, by Proposition 4.1.9, |B| = |A|/(n1! · · · nk!) = (n1 + · · · + nk)!/(n1! · · · nk!) = n!/(n1! n2! · · · nk!).
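
The formula of Theorem 4.1.14 is easy to compare with a direct count on a small multiset. In the sketch below the multiset is given as a string of letters; the function names and the sample word BANANA are ours.

```python
from collections import Counter
from itertools import permutations
from math import factorial


def arrangements(multiset):
    """n!/(n1! n2! ... nk!) for a multiset given as a sequence of symbols."""
    counts = Counter(multiset)
    total = factorial(sum(counts.values()))
    for c in counts.values():
        total //= factorial(c)
    return total


def brute_force_arrangements(multiset):
    """Count the distinct rows obtained by permuting the symbols."""
    return len(set(permutations(multiset)))
```

For BANANA the multiset has three A’s, two N’s and one B, so the formula reads 6!/(3! 2! 1!).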

Theorem 4.1.15. [Allocation I: distinct locations; identical objects (ni of type i); at most
one per place] Fix a positive integer k and for 1 ≤ i ≤ k, let Gi’s be boxes containing ni ∈ N
identical objects. If the objects in distinct boxes are non-identical and n ≥ Σ_{i=1}^{k} ni then, the number
of allocations of the objects in n distinct locations l1, . . . , ln, each location receiving at most one
object, is n!/(n1! · · · nk! (n − Σ ni)!).

Proof. Consider a new group G_{k+1} with n_{k+1} = n − Σ_{i=1}^{k} ni objects of a new type. Notice that
an allocation of objects from G1, . . . , Gk to n distinct places, where each location receives at
most one object, gives a unique arrangement of elements of G1, . . . , G_{k+1}.¹ Thus, the number
of allocations of objects from G1, . . . , Gk to n distinct places, where each location receives at
most one object, is the same as the number of arrangements of elements of G1, . . . , G_{k+1}. By
Theorem 4.1.14, this number is n!/(n1! · · · nk! (n − Σ ni)!).

Definition 4.1.16. Let n, n1, n2, . . . , nk ∈ N. Then, by P(n; n1, . . . , nk), we denote the number

n!/(n1! · · · nk! (n − Σ ni)!).

Thus, P(6; 1, 1, 1) = P(6, 3). As a convention, P(n; n1, . . . , nk) = 0 whenever either ni < 0, for
some i, 1 ≤ i ≤ k, or Σ_{i=1}^{k} ni > n. Many texts use C(n; n1, · · · , nk) to mean P(n; n1, · · · , nk). We
shall use them interchangeably.

Definition 4.1.17. [r-combination] An r-combination of an n-set S is an r-subset of S.

The number of r-subsets of an n-set is denoted by C(n, r). Thus, for any natural number n,
C(n, 0) = C(n, n) = 1.
¹ Take an allocation of objects from G1, . . . , Gk to n distinct places, where each location receives at most one
object. There are n_{k+1} locations which are empty. Supply an object from G_{k+1} to each of these locations.
We have created an arrangement of elements of G1, . . . , G_{k+1}. Conversely, take an arrangement of elements of
G1, . . . , G_{k+1}. View this as an allocation of elements of G1, . . . , G_{k+1} to n distinct places. Empty the places which
have received elements from G_{k+1}. We have created an allocation of elements of G1, . . . , Gk to n distinct places,
where each location receives at most one object.

Theorem 4.1.18. [Combination] C(n, r) = P(n; r) = n!/(r!(n − r)!).

Proof. By Theorem 4.1.15, the number of allocations of r identical objects in n distinct places
(p1, . . . , pn) with each place receiving at most one object is P(n; r). Note that each such allocation A
uniquely corresponds to an r-subset of {1, 2, . . . , n}, namely to {i | pi receives an object by A}.
Thus, C(n, r) = P(n; r) = n!/(r!(n − r)!).

Example 4.1.19. In how many ways can you allocate 3 identical passes to 10 students so that
each student receives at most one? Ans: C(10, 3)

Theorem 4.1.20. [Pascal] C(n, r) + C(n, r + 1) = C(n + 1, r + 1).

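Proof. By Theorem 4.1.18, $C(n, r) = \frac{n!}{r!\,(n-r)!}$. Hence,
$$C(n, r) + C(n, r+1) = \frac{n!}{r!\,(n-r)!} + \frac{n!}{(r+1)!\,(n-r-1)!} = \frac{n!\,\bigl((r+1) + (n-r)\bigr)}{(r+1)!\,(n-r)!} = \frac{(n+1)!}{(r+1)!\,(n-r)!} = C(n+1, r+1).$$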

Experiment

Complete the following list by filling the left list with all 3-subsets of {1, 2, 3, 4, 5} and the right list with the 3-subsets of {1, 2, 3, 4} as well as with the 2-subsets of {1, 2, 3, 4}, as shown below.

3-subsets of {1, 2, 3, 4, 5}      matching subset of {1, 2, 3, 4}
(C(5, 3) rows in all)
{1, 2, 3}                         {1, 2, 3}   \
    ⋮                                 ⋮         |  C(4, 3) rows
{2, 3, 4}                         {2, 3, 4}   /
{1, 2, 5}                         {1, 2}      \
    ⋮                                 ⋮         |  C(4, 2) rows
{3, 4, 5}                         {3, 4}      /


Theorem 4.1.21. [Alternate proof of Pascal’s Theorem 4.1.20] Here we supply a combi-
natorial proof, i.e., ‘by associating the numbers with objects’. Let S = {1, 2, . . . , n, n + 1} and
A be an (r + 1)-subset of S. Then, there are C(n + 1, r + 1) such sets with either n + 1 ∈ A or
n + 1 ∉ A.
Note that n + 1 ∈ A if and only if A \ {n + 1} is an r-subset of {1, 2, . . . , n}. So, the number of
(r + 1)-subsets of {1, 2, . . . , n, n + 1} which contain the element n + 1 is, by definition, C(n, r).
Also, n + 1 ∉ A if and only if A is an (r + 1)-subset of {1, 2, . . . , n}. So, a set A which does not
contain n + 1 can be formed in C(n, r + 1) ways. Hence, an (r + 1)-subset of S can be formed,
by definition, in C(n, r) + C(n, r + 1) ways. Thus, the required result follows.
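
The case split used in this combinatorial proof can be checked mechanically for small values. The following Python sketch (the helper name `split_by_last` is ours and does not appear in the notes) separates the (r + 1)-subsets of {1, . . . , n + 1} according to whether they contain n + 1 and compares the two counts with C(n, r) and C(n, r + 1).

```python
from itertools import combinations
from math import comb

def split_by_last(n, r):
    """Split the (r+1)-subsets of {1,...,n+1} by whether they contain n+1."""
    with_last, without_last = [], []
    for subset in combinations(range(1, n + 2), r + 1):
        if n + 1 in subset:
            with_last.append(subset)      # corresponds to an r-subset of {1,...,n}
        else:
            without_last.append(subset)   # already an (r+1)-subset of {1,...,n}
    return with_last, without_last

with_last, without_last = split_by_last(4, 2)
print(len(with_last), comb(4, 2))
print(len(without_last), comb(4, 3))
```

With n = 4 and r = 2 this is exactly the situation of the experiment on 3-subsets of {1, 2, 3, 4, 5}.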

Experiment
Here we consider subsets of {1, 2, 3, 4}. Complete the following list by using 0’s, 1’s, x’s
and y’s, where x and y are commuting (xy = yx) symbols.
∅               0000    yyyy = $y^4$
{1}             1000    xyyy = $xy^3$
{2}             0100    yxyy = $xy^3$
{3}             0010    yyxy = $xy^3$
{4}             0001    yyyx = $xy^3$
{1, 2}          1100    xxyy = $x^2y^2$
    ⋮
{1, 2, 3, 4}    1111    xxxx = $x^4$
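
A small Python sketch can generate every row of such a table and group the rows by the power of x. The function name `subset_table` is an illustrative choice, not notation from the notes; the grouping is meant to suggest why the counts match the coefficients of $(x + y)^4$.

```python
from collections import Counter
from itertools import product

def subset_table(n):
    """Return (subset, 0/1 string, x/y word) for every subset of {1,...,n}."""
    rows = []
    for bits in product("01", repeat=n):
        subset = {i + 1 for i, b in enumerate(bits) if b == "1"}
        word = "".join("x" if b == "1" else "y" for b in bits)
        rows.append((subset, "".join(bits), word))
    return rows

rows = subset_table(4)
# Number of rows whose monomial is x^k y^(4-k), for k = 0,...,4.
by_degree = Counter(word.count("x") for _, _, word in rows)
for k in range(5):
    print(f"x^{k} y^{4 - k}: {by_degree[k]} subsets")
```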

Practice 4.1.22. Give a combinatorial proof of C(n, r) = C(n, n − r), whenever n, r ∈ N with
0 ≤ r ≤ n.

Theorem 4.1.23. [Allocation II: distinct locations; distinct objects; ni at place i] The
number of ways of allocating objects o1 , . . . , on into pockets p1 , . . . , pk so that pocket pi contains
ni objects, is P (n; n1 , . . . , nk ).

Proof. The task has k compulsory parts: select $n_1$ objects for pocket $p_1$, and so on. So, the answer is
$C(n, n_1)\,C(n - n_1, n_2) \cdots C(n - n_1 - \cdots - n_{k-1}, n_k) = P(n; n_1, \ldots, n_k)$.

Alternate. Take an allocation of $o_1, \ldots, o_n$ into pockets $p_1, \ldots, p_k$ so that the pocket $p_i$ gets $n_i$ objects. This is an allocation of $n_1$ copies of $p_1$, . . . , $n_k$ copies of $p_k$ into locations $o_1, \ldots, o_n$ where each location gets exactly one. Hence, the answer is $P(n; n_1, \ldots, n_k)$.
Exercise 4.1.24. 1. In a class there are 17 girls and 20 boys. A committee of 5 students is
to be formed to represent the class.

(a) Determine the number of ways of forming the committee consisting of 5 students.
Ans: C(37, 5) (use Theorem 4.1.18).
(b) Suppose the committee also needs to choose two different people from among them-
selves, who will act as “spokesperson” and “treasurer”. In this case, determine the
number of ways of forming a committee consisting of 5 students. Note that two com-
mittees are different if
i. either the members are different, or
ii. even if the members are the same, they have different students as spokesperson
and/or treasurer.
Ans: C(37, 5) × C(5, 2) as the spokesperson and treasurer can be chosen in C(5, 2)
ways from the selected committee members. Verify that C(37, 5) × C(5, 2) = C(37, 2) ×
C(35, 3). Can you think of a combinatorial justification?
(c) Due to certain restrictions, it was felt that the committee should have at least 3 girls.
In this case, determine the number of ways of forming the committee consisting of 5
students (no one is to be designated as spokesperson and/or treasurer).

Ans: C(17, 5) + C(17, 4) × C(20, 1) + C(17, 3) × C(20, 2) corresponding to committees having exactly 5, 4, and 3 girls, respectively.

2. Combinatorially prove the following identities:

(a) kC(n, k) = nC(n − 1, k − 1).
Ans: Choose a team consisting of k people from a set of n people (C(n, k) ways). Now,
among the chosen k people there are k = C(k, 1) ways of choosing the leader. This gives
kC(n, k). The other way, we can first choose the leader in n = C(n, 1) ways and then
build a group consisting of k people by adding k − 1 people from the remaining n − 1
people (C(n − 1, k − 1)).
(b) [Newton’s Identity] : C(n, r)C(r, k) = C(n, k)C(n − k, r − k).
Ans: A combinatorial argument to prove this is the following. Select a team of size
r from n students and then from that team select k leaders. This can be done in
C(n, r)C(r, k) ways. Alternately, select the leaders first in C(n, k) ways and out of the
rest select another r − k for the team in C(n − k, r − k) ways. So, the number of ways
of doing this is C(n, k)C(n − k, r − k).
(c) C(n, r) = C(r, r)C(n − r, 0) + C(r, r − 1)C(n − r, 1) + · · · + C(r, 0)C(n − r, r).
Ans: The LHS is the number of r-subsets of {1, 2, . . . , n}. On the other hand, an r-subset of {1, 2, . . . , n} can be chosen by choosing a k-subset $A_1$ of {1, 2, . . . , r} and an (r − k)-subset $A_2$ of {r + 1, r + 2, . . . , n}. So, the number of r-subsets is $\sum_{k=0}^{r} C(r, k)\,C(n - r, r - k)$.

(d) $C(n, 0)^2 + C(n, 1)^2 + \cdots + C(n, n)^2 = C(2n, n)$.

Ans: Let A be a subset of size n from the set {1, 2, . . . , 2n}. Then, A can be chosen by choosing k elements from {1, 2, . . . , n} and by deleting k elements from {n + 1, n + 2, . . . , 2n} (that is, keeping the remaining n − k). For a fixed k this can be done in $C(n, k)^2$ ways. So, the number of ways to choose A is $\sum_{k=0}^{n} C(n, k)^2$.

3. Determine the number of ways of selecting a committee of m people from a group consisting
of n1 women and n2 men, with n1 + n2 ≥ m.
Ans: Since there are n1 + n2 people, there are C(n1 + n2 , m) ways of choosing distinct
committees.
Alternately, either “choose m people only from women” or “m − 1 people only from women and 1 from men” or .... So, the number is $\sum_{k=0}^{m} C(n_1, k)\,C(n_2, m - k)$.
4. Determine the number of ways of arranging the letters of the word

(a) ABRACADABARAARCADA.
Ans: There are 18 letters among which A repeats 9 times, R repeats 3 times and the
others, namely, B, C and D repeat 2 times each. So, the number of arrangements is
C(18; 9, 3, 2, 2, 2).
(b) KAGARTHALAMNAGARTHALAM.
Ans: There are 22 letters among which A repeats 8 times, K and N repeat once
and the others, namely, G, R, T, H, L and M repeat 2 times each. So, the number of
arrangements is C(22; 8, 2, 2, 2, 2, 2, 2, 1, 1).

5. How many anagrams of MISSISSIPPI are there so that no two S are adjacent?
Ans: There are four S in this word. The other 7 letters can be arranged in $\frac{7!}{4!\,2!}$ different ways. Now there are 8 places available for the S’s, two on either side and six in the gaps. This can be done in C(8, 4) ways. So, the total number of arrangements possible is $\frac{7!}{4!\,2!}\,C(8, 4)$. (Do not try bundling up two S’s together. That is incorrect.)
6. How many rectangles are there in an n × n square? How many squares are there?
Ans: A rectangle corresponds uniquely to 2 points out of n + 1 on the leftmost vertical line and two points on the topmost horizontal line. So, the number of rectangles is $C(n + 1, 2)^2$.
For counting the squares, a square is obtained uniquely by selecting two points from the right diagonals. So, there are
$$C(n+1, 2) + 2\sum_{i=2}^{n} C(i, 2) = 1 \cdot 2 + 2 \cdot 3 + \cdots + (n-1) \cdot n + \frac{n(n+1)}{2} = (1^2 + 1) + (2^2 + 2) + \cdots + \bigl((n-1)^2 + (n-1)\bigr) + \frac{n(n+1)}{2} = \sum_{i=1}^{n} i^2$$
squares.

Alternate. We can directly count that there are $n^2$ many squares of size one; $(n - 1)^2$ many of size two; etc.
7. Show that a product of n consecutive natural numbers is always divisible by n!.
Ans: $\frac{(k+1)(k+2)\cdots(k+n)}{n!}$ is nothing but the number of k-subsets of an (n + k)-set.
8. Show that $(m!)^n$ divides (mn)!.
Ans: $\frac{(mn)!}{(m!)^n}$ is the number of words formed using all of: m copies of $x_1$, m copies of $x_2$, . . . , m copies of $x_n$.
9. If n points are placed on the circumference of a circle and all the lines connecting them are joined, what is the largest number of points of intersection of these lines inside the circle that can be obtained?
Ans: Any choice of four points on the circumference can give us one point of intersection.
So, the maximum number of points of intersection we can get is C(n, 4).
10. Prove that C(pn, pn − n) is a multiple of p in two ways. Hint: Newton’s identity.
Ans: $C(pn, pn - n) = \frac{pn(pn-1)(pn-2)\cdots(pn-n+1)}{n!} = p\,\frac{(pn-1)(pn-2)\cdots(pn-n+1)}{(n-1)!} = p\,C(pn - 1, pn - n)$.

Alternate. Take a team (subset) of size n from {1, 2, . . . , pn} and select a team leader. This can be done in C(pn, n) · n ways. Alternately, select the team leader first and then select the remaining n − 1 members: in C(pn, 1)C(pn − 1, n − 1) ways. So, pn C(pn − 1, n − 1) = n C(pn, n) or p C(pn − 1, n − 1) = C(pn, n).
11. How many ways are there to form the word MATHEMATICIAN starting from any side
and moving only in horizontal or vertical directions?
M
M A M
M A T A M
M A T H T A M
M A T H E H T A M
M A T H E M E H T A M
M A T H E M A M E H T A M
M A T H E M A T A M E H T A M
M A T H E M A T I T A M E H T A M
M A T H E M A T I C I T A M E H T A M
M A T H E M A T I C I C I T A M E H T A M
M A T H E M A T I C I A I C I T A M E H T A M
M A T H E M A T I C I A N A I C I T A M E H T A M

Ans: Observe that you have to take 12 steps and there are 13 different rows to start with. If you start with the 5th row leftmost M then you have to take 5 − 1 = 4 steps towards right and 8 steps down to reach N. This can be done in C(12, 4) ways. A similar fact holds if you start from the rightmost M. So, the 5th row contributes 2C(12, 4) to the total we are looking for. Also observe that the first row contributes only 1 (not 2). Hence, the total number of words that can be formed is $C(12, 0) + 2C(12, 1) + 2C(12, 2) + \cdots + 2C(12, 12) = 2^{13} - 1$.
12. (a) In how many ways can one arrange n different books in m different boxes kept in a
row, if books inside the boxes are also kept in a row?
Ans: (a) Let xi be the number of books in box i. Any solution to x1 + · · · + xm = n
gives n! number of ways to keep the books. So, the answer is C(n + m − 1, n)n! =
P (n + m − 1, n).
Alternate: first book can be kept in m ways; second book can be kept in m + 1 ways;
and so on. So, the answer is m(m + 1) · · · (m + n − 1) = P (n + m − 1, n).
(b) What if no box can be empty?
Ans: If no box is to be empty, then the answer is n! × C(n − 1, m − 1).
13. Prove by induction that $2^n \mid (n + 1) \cdots (2n)$.
Ans: Let f(n) = (n + 1) · · · (2n). Then, f(n + 1) = (n + 2) · · · (2n + 2) = 2(2n + 1)f(n). So, if $2^n$ divides f(n), then $2^{n+1}$ divides f(n + 1).
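
Several answers in this exercise can be cross-checked by brute force. The Python sketch below does so for items 5, 6 and 11; all function names are ours. For item 11 the grid is modelled by a row index and a signed column (0 for the centre), and, as in the answer above, the sketch assumes that every step moves one letter further along the word.

```python
from collections import Counter
from functools import lru_cache
from math import comb, factorial

# Item 5: distinct arrangements of `word` with no two copies of `letter` adjacent.
def count_no_adjacent(word, letter):
    remaining = Counter(word)

    def extend(prev, left):
        if left == 0:
            return 1
        total = 0
        for ch in list(remaining):
            if remaining[ch] == 0 or (ch == letter and prev == letter):
                continue
            remaining[ch] -= 1
            total += extend(ch, left - 1)
            remaining[ch] += 1
        return total

    return extend(None, len(word))

# Item 6: axis-parallel rectangles and squares on the grid lines of an n x n square.
def count_rectangles_and_squares(n):
    rectangles = squares = 0
    for left in range(n + 1):
        for right in range(left + 1, n + 1):
            for bottom in range(n + 1):
                for top in range(bottom + 1, n + 1):
                    rectangles += 1
                    squares += (right - left == top - bottom)
    return rectangles, squares

# Item 11: the cell in row `row` (0-based), signed column `col` holds WORD[row - |col|].
WORD = "MATHEMATICIAN"
LAST = len(WORD) - 1

@lru_cache(maxsize=None)
def paths_from(row, col):
    index = row - abs(col)
    if index == LAST:          # reached the final N
        return 1
    total = 0
    for r, c in ((row + 1, col), (row - 1, col), (row, col - 1), (row, col + 1)):
        inside = 0 <= r <= LAST and abs(c) <= r
        if inside and r - abs(c) == index + 1:
            total += paths_from(r, c)
    return total

def count_paths():
    starts = {(r, c) for r in range(LAST + 1) for c in (-r, r)}  # boundary M's
    return sum(paths_from(r, c) for r, c in starts)

print(count_no_adjacent("MISSISSIPPI", "S"))
print(count_rectangles_and_squares(3))
print(count_paths())
```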

4.1.1 Multinomial theorem

Definition 4.1.25. Let x, y and z be commuting symbols. Then, by an algebraic expansion¹ of $(x + y + z)^n$ we mean an expansion where each term is of the form $\alpha x^i y^j z^k$ so that two terms differ in the degree of at least one of x, y, or z. By a word expansion² of $(x + y + z)^n$ we mean an expansion where each term is a word of length n using symbols x, y, z. Expansions for $(x_1 + \cdots + x_r)^n$, whenever the $x_i$’s are commuting symbols, may be defined in a similar way.
Example 4.1.26. 1. $x^3 + 3xy^2 + y^3 + 3yx^2$ is an algebraic expansion of $(x + y)^3$, whereas
xxx + xxy + xyx + xyy + yxx + yxy + yyx + yyy is a word expansion of $(x + y)^3$.
2. Take the word expansion of $(X + Y + Z)^9$. A term with exactly two X’s and exactly three
Y’s is nothing but an arrangement of two X’s, three Y’s and four Z’s. So, the coefficient
of $X^2 Y^3 Z^4$ in the algebraic expansion of $(X + Y + Z)^9$ is P(9; 2, 3, 4).
3. Consider $(x + y + z)^n = \underbrace{(x + y + z) \cdot (x + y + z) \cdots (x + y + z)}_{n \text{ times}}$. Then, in this expression,
we need to choose, say

(a) i places from the n possible places for x (i ≥ 0),

(b) j places from the remaining n − i places for y (j ≥ 0), and
(c) the n − i − j left out places for z (with n − i − j ≥ 0).

Thus, we get
$$(x + y + z)^n = \sum_{i, j \ge 0,\ i + j \le n} C(n, i)\,C(n - i, j)\,x^i y^j z^{n-i-j} = \sum_{i, j \ge 0,\ i + j \le n} P(n; i, j)\,x^i y^j z^{n-i-j}.$$
¹ Nonstandard notion.
² Nonstandard notion.

Theorem 4.1.27. [Multinomial Theorem] Fix a positive integer n and let $x_1, x_2, \ldots, x_k$ be a collection of commuting symbols. Then, for $n = n_1 + \cdots + n_k$, the coefficient of $x_1^{n_1} x_2^{n_2} \cdots x_k^{n_k}$ in the algebraic expansion of $(x_1 + \cdots + x_k)^n$ is $P(n; n_1, \ldots, n_k)$. So

$$(x_1 + \cdots + x_k)^n = \sum_{\substack{n_1, \ldots, n_k \ge 0 \\ n_1 + \cdots + n_k = n}} P(n; n_1, \ldots, n_k)\, x_1^{n_1} \cdots x_k^{n_k}.$$

Proof. The proof is left as an exercise for the reader.
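
For small k and n the statement can be tested directly from the definition of a word expansion. The Python sketch below (function names are ours) lists all $k^n$ words, groups them by exponent vector, and compares each group size with $P(n; n_1, \ldots, n_k)$; it is a numerical check under these assumptions, not a proof.

```python
from collections import Counter
from itertools import product
from math import factorial

def multinomial(n, parts):
    """P(n; n1,...,nk) for nonnegative parts that sum to n."""
    value = factorial(n)
    for p in parts:
        value //= factorial(p)   # each intermediate quotient is an integer
    return value

def word_expansion_counts(k, n):
    """Group the k**n words of length n over k symbols by exponent vector."""
    counts = Counter()
    for word in product(range(k), repeat=n):
        exponents = tuple(word.count(symbol) for symbol in range(k))
        counts[exponents] += 1
    return counts

counts = word_expansion_counts(3, 5)
mismatches = [e for e, c in counts.items() if c != multinomial(5, e)]
print(len(counts), mismatches)
```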

As a special case, we have the famous binomial theorem.

Corollary 4.1.28. [Binomial Theorem] $(x + y)^n = \sum_{k=0}^{n} C(n, k)\,x^{n-k} y^k$.

Example 4.1.29. Form words of size 5 using letters from ‘MATHEMATICIAN’ (including
multiplicity, that is, you may use M at most twice). How many are there?
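Ans: $\sum C(5; k_1, \ldots, k_8)$, where the sum runs over all $(k_1, \ldots, k_8)$ with $k_1 + \cdots + k_8 = 5$ and $k_1 \le 2$, $k_2 \le 3$, $k_3 \le 2$, $k_4 \le 1$, $k_5 \le 1$, $k_6 \le 2$, $k_7 \le 1$, $k_8 \le 1$ (the bounds are the multiplicities of M, A, T, H, E, I, C, N, in this order).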
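
The constrained sum can be evaluated by machine and compared with a direct enumeration. The Python sketch below is only a cross-check of the two counting methods; the function names are illustrative and not part of the notes.

```python
from collections import Counter
from itertools import permutations, product
from math import factorial

LETTERS = Counter("MATHEMATICIAN")   # multiplicities: M2 A3 T2 H1 E1 I2 C1 N1

def count_by_formula(size):
    """Sum of C(size; k1,...,k8) over admissible (k1,...,k8)."""
    caps = list(LETTERS.values())
    total = 0
    for ks in product(*(range(cap + 1) for cap in caps)):
        if sum(ks) == size:
            term = factorial(size)
            for k in ks:
                term //= factorial(k)
            total += term
    return total

def count_by_enumeration(size):
    """Distinct words obtained by picking `size` positions of the 13 letters."""
    return len(set(permutations("MATHEMATICIAN", size)))

print(count_by_formula(5), count_by_enumeration(5))
```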

Exercise 4.1.30. 1. Show that $|P(\{1, 2, \ldots, n\})| = 2^n$ in the following ways.

(a) By using Binomial Theorem.

(b) By using ‘select a subset is a task with n compulsory parts’.

(d) Arguing in the line of ‘a subset of {1, 2, . . . , n, n + 1} either contains n + 1 or not’ and using induction.

2. Let S be a set of size n. Then, prove in two different ways that the number of subsets
of S of odd size is the same as the number of subsets of S of even size, or equivalently
$\sum_{k \ge 0} C(n, 2k) = \sum_{k \ge 0} C(n, 2k + 1) = 2^{n-1}$.

Ans: The number of subsets of odd size is C(n, 1) + C(n, 3) + · · · and the number of subsets
of even size is C(n, 0) + C(n, 2) + C(n, 4) + · · · . Note that $0 = (1 - 1)^n = C(n, 0) - C(n, 1) + C(n, 2) - C(n, 3) + \cdots$.

Alternate. By induction. For n = 1 it is trivial. Assume it is true for k. Let E be the collection of subsets of even size of {1, 2, . . . , k} and O be the collection of subsets of odd size. By induction hypothesis |E| = |O|. Let O′ = {A ∪ {k + 1} : A ∈ O} and E′ = {A ∪ {k + 1} : A ∈ E}. Then, O′ is the collection of subsets of even size of {1, 2, . . . , k, k + 1} that contain k + 1 and E′ is the collection of subsets of odd size that contain k + 1. Hence, the subsets of even size of {1, 2, . . . , k, k + 1} are exactly those in E ∪ O′ and the subsets of odd size are exactly those in O ∪ E′. Since |O′| = |O| = |E| = |E′|, both of these collections have the same cardinality.

3. Prove the following identities on Binomial coefficients.

(a) $\sum_{k=\ell}^{n} C(k, \ell)\,C(n, k) = C(n, \ell)\,2^{n-\ell}$.

Ans: Recall that C(k, ℓ)C(n, k) = C(n, ℓ)C(n − ℓ, k − ℓ). Hence,

$$\sum_{k=\ell}^{n} C(k, \ell)\,C(n, k) = \sum_{k=\ell}^{n} C(n, \ell)\,C(n - \ell, k - \ell) = C(n, \ell) \sum_{k=\ell}^{n} C(n - \ell, k - \ell) = C(n, \ell) \sum_{s=0}^{n-\ell} C(n - \ell, s) = C(n, \ell)\,2^{n-\ell}.$$

(b) $C(m + n, \ell) = \sum_{k=0}^{\ell} C(m, k)\,C(n, \ell - k)$.
Ans: Use the idea that there are two ways of forming a committee of size ℓ from a
group consisting of m men and n women.
(c) $C(n, \ell) = \sum_{k=0}^{t} C(t, k)\,C(n - t, \ell - k) = \sum_{k=0}^{n} C(t, k)\,C(n - t, \ell - k)$, for any t, 0 ≤ t ≤ n.
Ans: Use the idea that there are two ways of forming a committee of size ℓ from a
group consisting of t men and n − t women. Also, the terms C(t, t + 1) = C(t, t + 2) =
· · · = C(t, n) = 0, whenever n > t.
(d) $C(n + r + 1, r) = \sum_{\ell=0}^{r} C(n + \ell, \ell)$.
Ans: Note C(n, 0) = C(n + 1, 0). Thus, by Pascal’s triangle C(n, 0) + C(n + 1, 1) =
C(n + 1, 0) + C(n + 1, 1) = C(n + 2, 1). Now, keep using Pascal’s triangle to finally get
C(n + r, r − 1) + C(n + r, r) = C(n + r + 1, r).
(e) $C(n + 1, r + 1) = \sum_{\ell=r}^{n} C(\ell, r)$.
Ans: Note C(r, r) = C(r + 1, r + 1). Thus, by Pascal’s triangle C(r, r) + C(r + 1, r) = C(r + 1, r + 1) + C(r + 1, r) = C(r + 2, r + 1). Now, keep using Pascal’s triangle to finally get C(n, r + 1) + C(n, r) = C(n + 1, r + 1).
Alternately, using the previous problem,
$$\sum_{\ell=r}^{n} C(\ell, r) = \sum_{\ell=0}^{n-r} C(r + \ell, \ell) = C(r + (n - r) + 1, n - r) = C(n + 1, n - r) = C(n + 1, r + 1).$$
4. Evaluate $\sum_{k=0}^{n} (2k + 1)\,C(n, 2k + 1)$ and $\sum_{k=0}^{n} (5k + 3)\,C(n, 2k + 1)$, whenever n ≥ 3.
Ans: $\sum_{k=0}^{n} (2k + 1)\,C(n, 2k + 1) = \sum_{k=0}^{n} n\,C(n - 1, 2k) = n \sum_{k=0}^{n} C(n - 1, 2k) = n\,2^{n-2}$.
$\sum_{k=0}^{n} (5k + 3)\,C(n, 2k + 1) = \sum_{k=0}^{n} \left[\tfrac{5}{2}(2k + 1) + \tfrac{1}{2}\right] C(n, 2k + 1) = \tfrac{5n}{2}\,2^{n-2} + \tfrac{1}{2}\,2^{n-1}$.
5. [Generalized Pascal] Assume that k1 + · · · + km = n. Show that

C(n; k1 , . . . , km ) = C(n − 1; k1 − 1, . . . , km ) + · · · + C(n − 1; k1 , . . . , km − 1).

Ans: $C(n; k_1, \ldots, k_m)$ is the number of arrangements of n objects in which $k_i$ objects are of type i, i = 1, . . . , m. Such an arrangement has the last object either of type 1 or of type 2 or of type 3 and so on. Suppose an arrangement has the last object of type 1. Then, deletion of the last object gives an arrangement of $k_1 - 1$ objects of type 1 and $k_i$ objects of type i, where i = 2, 3, . . . , m. The other types are handled in the same way. So

C(n; k1 , . . . , km ) = C(n − 1; k1 − 1, . . . , km ) + · · · + C(n − 1; k1 , . . . , km − 1).


6. What is $\sum_{k_1 + \cdots + k_m = n} C(n; k_1, \ldots, k_m)$?

Ans: Prove that $(x_1 + x_2 + \cdots + x_m)^n = \sum_{k_1 + \cdots + k_m = n} C(n; k_1, \ldots, k_m)\,x_1^{k_1} x_2^{k_2} \cdots x_m^{k_m}$. Now, substitute 1 for each $x_i$ to get the answer as $m^n$.
7. Put $l = \lfloor m/2 \rfloor$. What is $\sum_{k_1 + \cdots + k_m = n} (-1)^{k_2 + k_4 + \cdots + k_{2l}}\,C(n; k_1, \ldots, k_m)$?

Ans: Recall $(x_1 + x_2 + \cdots + x_m)^n = \sum_{k_1 + \cdots + k_m = n} C(n; k_1, \ldots, k_m)\,x_1^{k_1} x_2^{k_2} \cdots x_m^{k_m}$. Now, substitute 1 for $x_{2i+1}$ and −1 for $x_{2i}$ to get the answer as 1, if m is odd and 0, if m is even.
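
The closed forms in items 4, 6 and 7 of this exercise can be checked numerically for small parameters. The following Python sketch uses helper names of our own choosing and evaluates each sum straight from its definition.

```python
from itertools import product
from math import comb, factorial

def odd_weighted_sum(n, a, b):
    """Item 4: sum over k >= 0 of (a*k + b) * C(n, 2k+1)."""
    return sum((a * k + b) * comb(n, 2 * k + 1) for k in range(n + 1))

def compositions(n, m):
    """All (k1,...,km) of nonnegative integers with k1 + ... + km = n."""
    return [ks for ks in product(range(n + 1), repeat=m) if sum(ks) == n]

def multinomial(n, ks):
    value = factorial(n)
    for k in ks:
        value //= factorial(k)
    return value

def multinomial_sum(n, m):
    """Item 6: sum of C(n; k1,...,km) over all compositions."""
    return sum(multinomial(n, ks) for ks in compositions(n, m))

def signed_multinomial_sum(n, m):
    """Item 7: each term carries the sign (-1)^(k2 + k4 + ...)."""
    return sum((-1) ** sum(ks[1::2]) * multinomial(n, ks)
               for ks in compositions(n, m))

n = 6
print(odd_weighted_sum(n, 2, 1), n * 2 ** (n - 2))
print(odd_weighted_sum(n, 5, 3), 5 * n * 2 ** (n - 3) + 2 ** (n - 2))
print(multinomial_sum(4, 3), signed_multinomial_sum(4, 3), signed_multinomial_sum(4, 4))
```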

4.2 Circular Permutations

Definition 4.2.1. [Circular permutation/arrangement] A circular permutation is an ar-
rangement of n distinct objects on a circle. Two circular arrangements are the same if each
element has the same ‘clockwise adjacent’ element. When |S| = n, we write ‘a circular arrange-
ment of S’ to mean ‘a circular arrangement of elements of S’. By [x1 , x2 , . . . , xn , x1 ] we shall
denote a circular arrangement keeping the anticlockwise direction in picture.

Example 4.2.2. Exactly two pictures in Figure 4.1 represent the same circular permutation.

[Figure 4.1: Circular permutations. Three circular arrangements of A1, . . . , A5 are drawn; the labels that survive are [A1, A2, A3, A4, A5, A1] and [A1, A5, A4, A3, A2, A1].]

Example 4.2.3. Determine the number of circular permutations of X = {A1 , A2 , A3 , A4 , A5 }?

Ans: 4!. Proof. Let B = {circular permutations of X} and A = {permutations of X}.
Now, define f : A → B as f(a) = b if a is obtained by breaking the cycle b at some gap and
then following in the anticlockwise direction. For example, if we break the leftmost circular
permutation in Figure 4.1 at the gap between $A_1$ and $A_2$, we get $[A_2, A_3, A_4, A_5, A_1]$. Notice that
$|f^{-1}(b)| = 5$, for each b ∈ B. Further, if b, c ∈ B are distinct, then $f^{-1}(b) \cap f^{-1}(c) = \emptyset$ (why?¹). Thus, by
the principle of disjoint pre-images of equal size, the number of circular permutations is 5!/5.

Theorem 4.2.4. [Circular permutations] The number of circular permutations of {1, 2, . . . , n}

is (n − 1)!.

Proof. A proof may be obtained on the line of the previous example. Here we give an alternate proof, illustrated for n = 5. Put A = {circular permutations of {1, 2, 3, 4, 5}}. Put B = {permutations of {1, 2, 3, 4}}. Define f : A → B as $f([5, x_1, x_2, x_3, x_4, 5]) = [x_1, x_2, x_3, x_4]$. Define g : B → A as $g([x_1, x_2, x_3, x_4]) = [5, x_1, x_2, x_3, x_4, 5]$. Then, g ◦ f(a) = a, for each a ∈ A and f ◦ g(b) = b, for each b ∈ B. Hence, by the bijection principle (see Theorem 1.2.30) f is a bijection, so |A| = |B| = 4!. The same argument with n in place of 5 gives (n − 1)!.

¹ Think of creating the circular permutation from a given permutation.
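
The count (n − 1)! can also be confirmed by brute force for small n. In the Python sketch below, each linear arrangement is replaced by its lexicographically smallest rotation, which serves as a representative of the circular arrangement; the names `canonical_rotation` and `count_circular` are ours.

```python
from itertools import permutations
from math import factorial

def canonical_rotation(arrangement):
    """Smallest rotation of a tuple: one representative per circular arrangement."""
    n = len(arrangement)
    return min(arrangement[i:] + arrangement[:i] for i in range(n))

def count_circular(items):
    """Number of circular arrangements of the given items, by enumeration."""
    return len({canonical_rotation(p) for p in permutations(items)})

for n in range(1, 7):
    print(n, count_circular(range(1, n + 1)), factorial(n - 1))
```

Because repeated letters simply produce repeated tuples, the same function also works for small multisets.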

Example 4.2.5. Find the number of circular arrangements of {A, B, B, C, C, D, D, E, E}.

Ans: There is only one A. Cutting A out from a circular arrangement we get a unique
arrangement of {B, B, C, C, D, D, E, E}. So, the required answer is $\frac{8!}{(2!)^4}$.

Definition 4.2.6. [Rotation, Orbit size]

1. Given an arrangement $[X_1, \ldots, X_n]$, by a rotation $R_1([X_1, \ldots, X_n])$, in short $R_1(X_1, \ldots, X_n)$, we mean $[X_2, \ldots, X_n, X_1]$ and by $R_2(X_1, \ldots, X_n)$ we mean $[X_3, \ldots, X_n, X_1, X_2]$. On similar lines, we define $R_i$, i ∈ N and put $R_0(X_1, \ldots, X_n) = [X_1, \ldots, X_n]$. Thus, for each k ∈ N,
$$R_0(X_1, \ldots, X_n) = R_{kn}(X_1, \ldots, X_n) = [X_1, \ldots, X_n].$$

2. The orbit size of an arrangement $[X_1, \ldots, X_n]$ is the smallest positive integer i which satisfies $R_i(X_1, \ldots, X_n) = [X_1, \ldots, X_n]$. In that case, we call
$$\{R_0(X_1, \ldots, X_n), R_1(X_1, \ldots, X_n), \ldots, R_{i-1}(X_1, \ldots, X_n)\}$$
the orbit of $[X_1, \ldots, X_n]$.

Example 4.2.7. 1. We have $R_1(ABCABCABC) = [BCABCABCA]$, $R_2(ABCABCABC) = [CABCABCAB]$ and $R_3(ABCABCABC) = [ABCABCABC]$. Thus, the orbit size of ABCABCABC is 3.

2. An arrangement of S = {A, A, B, B, C, C} with orbit size 6 is [AABCBC]. An arrangement of S with orbit size 3 is [ACBACB].

3. There is no arrangement of {A, A, B, B, C, C} with orbit size 2. In fact, if [X1 X2 · · · X6 ]
is an arrangement with orbit size 2 then, [X1 X2 X3 X4 X5 X6 ] = [X3 X4 X5 X6 X1 X2 ]. Thus,
X1 = X3 = X5 which is not possible.
4. There is no arrangement of {A, A, B, B, C, C} with orbit size 1 or 2 or 4 or 5.
5. There are 3! arrangements of {A, A, B, B, C, C} with orbit size 3.
6. Take an arrangement of {A, A, B, B, C, C} with orbit size 3. Make a circular arrangement
by joining the ends. How many distinct arrangements can we generate by breaking the
circular arrangement at gaps?
Ans: 3. They are the elements of the same orbit.
7. Take an arrangement of {A, A, B, B, C, C} with orbit size 6. Make a circular arrangement
by joining the ends. How many distinct arrangements can we generate by breaking the
circular arrangement at gaps?
Ans: 6. They are the elements of the same orbit.
8. Take an arrangement of n elements with orbit size k. Make a circular arrangement by
joining the ends. How many distinct arrangements can we generate by breaking the circular
arrangement at gaps?
Ans: k. They are the elements of the same orbit.

9. If we take the set of all arrangements of a finite multiset and group them into orbits (notice
that each orbit gives us exactly one circular arrangement), then the number of orbits is
the number of circular arrangements.

Example 4.2.8. Find the number of circular arrangements of S = {A, A, B, B, C, C, D, D, E, E}.

Ans: There are two types of arrangements of S: one of orbit size 10 and the other of orbit size 5. The number of arrangements of S with orbit size 5 is 5!. So, they can generate 4! distinct circular arrangements. The number of arrangements of S with orbit size 10 is $\frac{10!}{2!\,2!\,2!\,2!\,2!} - 5!$. Hence, they can generate $\frac{10!}{2!\,2!\,2!\,2!\,2! \cdot 10} - \frac{5!}{10}$ distinct circular arrangements. Thus, the total number of circular arrangements is $4! + \frac{10!}{2!\,2!\,2!\,2!\,2! \cdot 10} - \frac{5!}{10}$.

Example 4.2.9. Suppose, we are given an arrangement [X1 , . . . , X10 ] of five A’s and five B’s.
Can it have an orbit size 3?
Ans: No. To see this assume that its orbit size is 3. Then,

$$[X_1, \ldots, X_{10}] = R_3(X_1, \ldots, X_{10}) = R_6(X_1, \ldots, X_{10}) = R_9(X_1, \ldots, X_{10}) = R_{12}(X_1, \ldots, X_{10}) = R_2(X_1, \ldots, X_{10}),$$

where the last equality holds because $R_{10}$ fixes every arrangement of length 10. Since 3 was the least positive integer with $R_3(X_1, \ldots, X_{10}) = [X_1, \ldots, X_{10}]$, we arrive at a
contradiction. Hence, the orbit size cannot be 3.

Proposition 4.2.10. The orbit size of an arrangement of an n-multiset is a divisor of n.

Proof. Suppose the orbit size of $[X_1, \ldots, X_n]$ is k and n = kp + r, for some r, 0 < r < k. Then,

$$[X_1, \ldots, X_n] = R_k(X_1, \ldots, X_n) = R_{2k}(X_1, \ldots, X_n) = \cdots = R_{kp}(X_1, \ldots, X_n) = R_{k(p+1)}(X_1, \ldots, X_n).$$

Since $R_n$ fixes every arrangement and k(p + 1) = n + (k − r), we have $R_{k(p+1)}(X_1, \ldots, X_n) = R_{k-r}(X_1, \ldots, X_n)$. Thus, $R_{k-r}(X_1, \ldots, X_n) = [X_1, \ldots, X_n]$ with 0 < k − r < k, contradicting the minimality of k. Therefore r = 0. Or equivalently, k divides n.
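
Rotations and orbit sizes are easy to experiment with on strings. The Python sketch below (with the hypothetical helper names `rotate`, `orbit_size` and `all_orbit_sizes_divide`) computes orbit sizes directly from Definition 4.2.6 and tests the divisibility claim on every word of a given length over a small alphabet.

```python
from itertools import product

def rotate(arrangement, i):
    """R_i: move the first i symbols to the end."""
    i %= len(arrangement)
    return arrangement[i:] + arrangement[:i]

def orbit_size(arrangement):
    """Smallest positive i with R_i(arrangement) == arrangement."""
    n = len(arrangement)
    return next(i for i in range(1, n + 1) if rotate(arrangement, i) == arrangement)

def all_orbit_sizes_divide(n, alphabet):
    """Check that the orbit size divides n for every word of length n."""
    return all(n % orbit_size("".join(w)) == 0 for w in product(alphabet, repeat=n))

print(orbit_size("ABCABCABC"), orbit_size("AABCBC"), orbit_size("ACBACB"))
print(all_orbit_sizes_divide(10, "AB"))
```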

Proposition 4.2.11. Let S1 = {Pi1 , Pi2 , . . . , Pik } and S2 = {Pj1 , Pj2 , . . . , Pjl } be any two orbits
of certain arrangements of an n-multiset. Then, either S1 ∩ S2 = ∅ or S1 = S2 .

Proof. If $S_1 \cap S_2 = \emptyset$, then there is nothing to prove. So, suppose there exists an arrangement $P_t \in S_1 \cap S_2$. Then, by definition, there exist rotations $R_1$ and $R_2$ such that $R_1(P_{i_1}) = P_t$ and $R_2(P_{j_1}) = P_t$. Thus, $R_2^{-1}(P_t) = P_{j_1}$ and hence $R_2^{-1}(R_1(P_{i_1})) = R_2^{-1}(P_t) = P_{j_1}$. Therefore, we see that the arrangement $P_{j_1} \in S_1$ and hence $S_2 \subseteq S_1$. A similar argument implies that $S_1 \subseteq S_2$ and hence $S_1 = S_2$.

Definition 4.2.12. [Binary operation] Let [X1 , . . . , Xn ] and [Y1 , . . . , Yn ] be two arrangements
of an n-multiset. Then, in the remainder of this section,
1. we shall consider expressions like [X1 , . . . , Xn ] + [Y1 , . . . , Yn ].
2. by [Ri + Rj ](X1 , . . . , Xn ), we mean the expression Ri (X1 , . . . , Xn ) + Rj (X1 , . . . , Xn ).
3. by Ri ([X1 , . . . , Xn ]+[Y1 , . . . , Yn ]) we denote the expression Ri (X1 , . . . , Xn )+Ri (Y1 , . . . , Yn ).
Example 4.2.13. Think of all arrangements $P_1, \ldots, P_n$, $n = \frac{6!}{2!\,2!\,2!}$, of two A’s, two B’s and two C’s.
How many copies of [ABCABC] are there in $[R_0 + \cdots + R_5](P_1 + \cdots + P_n)$?
Ans: Of course 6. To see this, note that $R_0$ takes [ABCABC] to itself; $R_1$ will take
[CABCAB] to [ABCABC]; $R_2$ will take [BCABCA] to [ABCABC]; and so on.

Example 4.2.14. Let P = [X1 , . . . , X12 ] be an arrangement of a 12-multiset with orbit size 3.
Since, the orbit size of P is 3, the set S = {P, R1 (P ), R2 (P )} forms the orbit of P . Thus, the
rotations R0 , R3 , R6 and R9 fix each element of S, i.e., Ri (Rj (P )) = Rj (P ) for all i ∈ {0, 3, 6, 9}
and j ∈ {0, 1, 2}. In other words, [R0 + · · · + R11 ](P ) accounts for 4 counts of the same circular
arrangement, where 4 is nothing but the number of rotations fixing P . Thus, we see that

$$\begin{aligned}
[R_0 + R_1 + \cdots + R_{11}](P + R_1(P) + R_2(P)) &= [R_0 + R_1 + \cdots + R_{11}](P) + [R_0 + R_1 + \cdots + R_{11}](R_1(P)) + [R_0 + R_1 + \cdots + R_{11}](R_2(P)) \\
&= 4(P + R_1(P) + R_2(P)) + 4(P + R_1(P) + R_2(P)) + 4(P + R_1(P) + R_2(P)) \\
&= 12(P + R_1(P) + R_2(P)).
\end{aligned}$$

The proof of the next result is similar to the idea in the above example and hence is omitted.

Proposition 4.2.15. Let P1 , . . . , Pn be all the arrangements of an m-multiset. Then,

[R0 + · · · + Rm−1 ](P1 + · · · + Pn ) = m(P1 + · · · + Pn ).

Let P be an arrangement of an m-multiset with orbit size k. Then, by Proposition 4.2.10, k divides m. Now, from the understanding obtained from the above example, we note that $[R_0 + \cdots + R_{m-1}](P)$ accounts for $\frac{m}{k}$ counts of the same circular arrangement, where $\frac{m}{k}$ is nothing but ‘the number of rotations fixing P’. Also, by Proposition 4.2.11, we know that two orbits are either disjoint or the same and hence the next two results are immediate. Therefore, the readers are supposed to provide a proof of the following results.

Discussion 4.2.16. Let $P_1, \ldots, P_n$ be all the arrangements of an m-multiset. Then,

$$\sum_{P_i} (\text{the number of rotations fixing } P_i) = \sum_{P_i} [R_0 + \cdots + R_{m-1}](P_i) = m(P_1 + \cdots + P_n) = m\,(\text{the number of circular arrangements}).$$

Discussion 4.2.17. Let $P_1, \ldots, P_n$ be all the arrangements of an m-multiset and $\{R_0, R_1, \ldots, R_{m-1}\}$ the set of all rotations. Then,
$$\sum_{P_i} (\text{the number of rotations fixing } P_i) = \sum_{P_i} |\{R_j \mid R_j(P_i) = P_i\}| = |\{(P_i, R_j) \mid R_j(P_i) = P_i\}| = \sum_{R_j} |\{P_i \mid R_j(P_i) = P_i\}| = \sum_{R_j} (\text{the number of } P_i\text{’s fixed by } R_j).$$

Hence, using Discussion 4.2.16, the number of circular arrangements is

$$\frac{1}{m} \sum_{R_j \text{ a rotation}} (\text{the number of } P_i\text{’s fixed by } R_j).$$

Example 4.2.18. 1. How many circular arrangements of {A, A, A, B, B, B, C, C, C} are there?

Ans: $R_0$ fixes $\frac{9!}{3!\,3!\,3!}$ arrangements. None of $R_1, R_2, R_4, R_5, R_7$ and $R_8$ fixes any arrangement. Each of $R_3$ and $R_6$ fixes 3! arrangements, namely the 3! arrangements of X, Y, Z, where
X = AAA, Y = BBB and Z = CCC.
Thus, the number of circular arrangements is $\frac{1}{9}\left[\frac{9!}{3!\,3!\,3!} + 3! + 3!\right] = \frac{5 \cdot 6 \cdot 7 \cdot 8 + 12}{9} = \frac{564}{3} = 188$.

2. Determine the number of circular arrangements of size 5 using the alphabets A, B and C.
Ans: In this case, $R_0$ fixes all the $3^5$ arrangements. Each of the rotations $R_1, R_2, R_3$ and $R_4$
fixes exactly the arrangements AAAAA, BBBBB and CCCCC. Hence, the required number is
$\frac{1}{5}\left(3^5 + 4 \cdot 3\right) = 51$.

Verify that the answer will be 8 if we have just two alphabets A and B.
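
The averaging formula of Discussion 4.2.17 is straightforward to evaluate by machine. The Python sketch below relies on a standard fact that is not stated in the notes: an arrangement of length m is fixed by $R_j$ exactly when it consists of m/g repetitions of a block of length g = gcd(j, m). The function names are ours.

```python
from math import factorial, gcd

def fixed_by_rotation(counts, j):
    """Arrangements of a multiset (letter multiplicities `counts`) fixed by R_j."""
    m = sum(counts)
    block = gcd(j, m)              # length of the repeating block (gcd(0, m) = m)
    repeats = m // block
    if any(c % repeats for c in counts):
        return 0                   # some letter cannot be split evenly over the blocks
    value = factorial(block)
    for c in counts:
        value //= factorial(c // repeats)
    return value

def circular_arrangements(counts):
    """Circular arrangements of a multiset: average number of fixed arrangements."""
    m = sum(counts)
    return sum(fixed_by_rotation(counts, j) for j in range(m)) // m

def circular_words(k, m):
    """Circular arrangements of size m over an alphabet of k letters."""
    return sum(k ** gcd(j, m) for j in range(m)) // m

print(circular_arrangements([3, 3, 3]))          # Example 4.2.18, part 1
print(circular_words(3, 5), circular_words(2, 5))  # Example 4.2.18, part 2
```

The same two functions can be used to check other counts in this section, for instance `circular_arrangements([1, 2, 2, 2, 2])` for Example 4.2.5.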
Exercise 4.2.19. 1. If there are n girls and n boys then what is the number of ways of
making them sit around a circular table in such a way that no two girls are adjacent and
no two boys are adjacent?
Ans: Make the seating arrangement for the girls first in alternate chairs. This can be done in
(n − 1)! ways. For each seating arrangement of the girls, the boys can be seated in n! ways.
So, the total number of ways is n!(n − 1)!.
2. Persons $P_1, \ldots, P_{100}$ are sitting in a circle facing the center and talking. If $P_i$ tells a lie,
then the

(a) person to his right tells the truth. So, the minimum number of persons telling the truth is ____.

(b) second person to his right tells the truth. So, the minimum number of persons telling the
truth is ____.
(c) next two persons to his right tell the truth. So, the minimum number of persons telling the
truth is ____.

3. Let us assume that any two garlands are same if one can be obtained from the other by
rotation. Then, determine the number of distinct garlands that can be formed using 6
flowers, if the flowers
(a) are of 2 colors, say ‘red’ and ‘blue’.
Ans: Is the problem same as finding the number of circular arrangements of S =
{R, R, R, R, R, R, B, B, B, B, B, B}? Ans: 14.
(b) are of 3 different colors.
Ans: Is the problem same as finding the number of circular arrangements of S =
{R, R, R, R, R, R, B, B, B, B, B, B, G, G, G, G, G, G}? Ans: 130.
(c) are of k different colors, for some k ∈ N.
Ans: Is the problem same as finding the number of circular arrangements of S =
$\{R_1, R_1, R_1, R_1, R_1, R_1, \ldots, R_k, R_k, R_k, R_k, R_k, R_k\}$? Ans: $\frac{1}{6}\{k^6 + 2k + 2k^2 + k^3\}$.
(d) of ‘red’ color are 2 and that of ‘blue’ color is 4.
Ans: Is the problem same as finding the number of circular arrangements of S =
{R, R, B, B, B, B}? Ans: 3.

4. Find the number of circular permutations of {A, A, B, B, C, C, C, C}.

Ans: There are two types of permutations of S = {A, A, B, B, C, C, C, C}: one of orbit size 8 and the other of orbit size 4. The number of permutations of S with orbit size 4 is $12 = \frac{4!}{1!\,1!\,2!}$. So, they can generate 3 = 12/4 distinct circular permutations. The number of permutations of S with orbit size 8 is $\frac{8!}{2!\,2!\,4!} - 12$. So, they can generate $\left(\frac{8!}{2!\,2!\,4!} - 12\right)/8$ distinct circular permutations. Thus, the total number of circular permutations is $\left(\frac{8!}{2!\,2!\,4!} + 12\right)/8 = 54$.

4.3 Solutions in Non-negative Integers

Definition 4.3.1. [Solution in nonnegative integers] Recall that $\mathbb{N}_0 := \mathbb{N} \cup \{0\}$. A point
$p = (p_1, \ldots, p_k) \in \mathbb{N}_0^k$ with $p_1 + \cdots + p_k = n$ is called a solution of the equation $x_1 + \cdots + x_k = n$
in nonnegative integers or a solution of $x_1 + \cdots + x_k = n$ in $\mathbb{N}_0$. Two solutions $(p_1, \ldots, p_k)$
and $(q_1, \ldots, q_k)$ are said to be the same if $p_i = q_i$, for each i = 1, . . . , k. Thus, (5, 0, 0, 5) and
(0, 0, 5, 5) are two different solutions of x + y + z + t = 10 in $\mathbb{N}_0$.

Example 4.3.2. Determine the number of

1. words which uses 3 A’s and 6 B’s.

2. arrangements of 3 A’s and 6 B’s.

3. distinct strings that can be formed using 3 A’s and 6 B’s.

4. solutions of the equation x1 + x2 + x3 + x4 = 6, where each xi ∈ N0 and 0 ≤ xi ≤ 6.

5. ways of placing 6 indistinguishable balls into 4 distinguishable boxes.

6. 3-subsets of a 9-set.
Ans: Observe that all the problems correspond to forming strings using +’s (or |’s or
bars) and 1’s (or balls or dots) in place of A’s and B’s, respectively.

BBABBBABA    11 + 111 + 1 +      = 2 + 3 + 1 + 0    • • | • • • | • |
ABBBBBAAB    + 11111 + + 1       = 0 + 5 + 0 + 1    | • • • • • | | •
ABBBABABB    + 111 + 1 + 11      = 0 + 3 + 1 + 2    | • • • | • | • •

Figure 4.2: Understanding the three problems

Note that the A’s are indistinguishable among themselves and the same holds for B’s.
Thus, we need to find 3 places, from the 9 = 3 + 6 places, for the A’s. Hence, the answer
is C(9, 3). The answer will remain the same as we just need to replace A’s with +’s (or
|’s) and B’s with 1’s (or balls) in any string of 3 A’s and 6 B’s. See Figure 4.2 or note that
four numbers can be added using 3 +’s or four adjacent boxes can be created by putting
3 vertical lines or |’s.

In general, we have the following result.


Theorem 4.3.3. [solutions in N0 ] The number of solutions of x1 + · · · + xr = n in N0 is

C(n + r − 1, n).

Proof. Each solution (x1 , . . . , xr ) may be viewed as an arrangement of n dots and r − 1 bars:
‘Put x1 many dots; put a bar; put x2 many dots; put another bar; continue; and end by
putting xr many dots.’
For example, (0, 2, 1, 0, 0) is associated to |••|•|| and vice-versa. Thus, the number of solutions equals the
number of arrangements of n dots and r − 1 bars, which is C(n + r − 1, r − 1) = C(n + r − 1, n).
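
The dots-and-bars correspondence can be checked by brute force for small values. The following is a minimal sketch, not part of the original notes; the function names count_solutions and dots_and_bars are illustrative choices.

```python
from itertools import product
from math import comb

def count_solutions(n, r):
    """Brute-force count of (x1, ..., xr) in N0^r with x1 + ... + xr = n."""
    return sum(1 for xs in product(range(n + 1), repeat=r) if sum(xs) == n)

def dots_and_bars(xs):
    """Encode a solution as a dots-and-bars string: x1 dots, a bar, x2 dots, a bar, ..."""
    return "|".join("•" * x for x in xs)

# Compare the brute-force count with C(n + r - 1, n) for a few small cases.
for n, r in [(6, 4), (5, 3), (3, 5)]:
    assert count_solutions(n, r) == comb(n + r - 1, n)

print(dots_and_bars((0, 2, 1, 0, 0)))  # the example from the proof: |••|•||
```

The case (n, r) = (6, 4) is part 4 of Example 4.3.2, where the count is C(9, 3) = 84.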

Theorem 4.3.4. (a) The number of solutions of x1 + · · · + xr ≤ n in nonnegative integers is C(n + r, n).
(b) The number of terms in the algebraic expansion of (x1 + · · · + xr )^n is C(n + r − 1, n).

Proof. (a) Any solution of x1 + · · · + xr ≤ n uniquely corresponds to a solution of x1 + · · · + xr + y = n
in nonnegative integers, where y = n − (x1 + · · · + xr ). By Theorem 4.3.3 applied with r + 1 variables, the
number of such solutions is C(n + r, n).
(b) Note that each term in the algebraic expansion of (x1 + · · · + xr )^n has the form x1^{i1} x2^{i2} · · · xr^{ir} ,
with i1 + i2 + · · · + ir = n. Thus, each term uniquely corresponds to a solution of i1 + i2 + · · · + ir = n
in nonnegative integers.

Theorem 4.3.5. [r-multiset] The number of r-multisets of elements of {1, 2, . . . , n} is C(n + r − 1, n − 1).

Proof. Let A be an r-multiset. Let di be the number of copies of i in A. Then, any solution of
d1 + · · · + dn = r in nonnegative integers gives A uniquely. Hence, the conclusion follows from Theorem 4.3.3.

Alternate. Put A = {arrangements of n − 1 dots and r bars}. Put B = {r-multisets of
{1, 2, . . . , n}}. For a ∈ A, define f (a) to be the multiset

f (a) = {d(i) + 1 | d(i) is the number of dots to the left of the i-th bar, 1 ≤ i ≤ r}.

For example, ||••|•|| gives us {1, 1, 3, 4, 4}. It is easy to define g : B → A so that f (g(b)) = b, for each
b ∈ B and g(f (a)) = a, for each a ∈ A. Thus, by the bijection principle (see
Theorem 1.2.30), |A| = |B|. Also, we know that |A| = C(n + r − 1, n − 1) and hence the required
result follows.
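
The map f of the alternate proof is easy to carry out mechanically. The sketch below is illustrative only (the names bars_to_multiset and count_multisets are not from the notes): it reads a dots-and-bars string from left to right and also compares a direct count of r-multisets with C(n + r − 1, n − 1).

```python
from itertools import combinations_with_replacement
from math import comb

def bars_to_multiset(arrangement):
    """The i-th bar contributes (number of dots to its left) + 1."""
    dots_seen, multiset = 0, []
    for symbol in arrangement:
        if symbol == "•":
            dots_seen += 1
        elif symbol == "|":
            multiset.append(dots_seen + 1)
    return multiset

def count_multisets(n, r):
    """Directly enumerate the r-multisets of {1, ..., n}."""
    return sum(1 for _ in combinations_with_replacement(range(1, n + 1), r))

print(bars_to_multiset("||••|•||"))  # the example above: [1, 1, 3, 4, 4]

for n, r in [(4, 5), (3, 2), (5, 3)]:
    assert count_multisets(n, r) == comb(n + r - 1, n - 1)
```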
Example 4.3.6. 1. There are 5 kinds of ice-creams available in our market complex. In how
many ways can you buy 15 of them for a party?
Ans: Suppose you buy xi ice-creams of the i-th type. Then, the problem is the same as
finding the number of solutions of x1 + · · · + x5 = 15 in nonnegative integers, which is C(19, 15) = C(19, 4).
2. How many solutions in N0 are there to x + y + z = 60 such that x ≥ 3, y ≥ 4, z ≥ 5?
Ans: (x, y, z) is such a solution if and only if (x − 3, y − 4, z − 5) is a solution to x + y + z = 48
in N0 . So, the answer is C(50, 2).
3. How many solutions in N0 are there to x + y + z = 60 such that 20 ≥ x ≥ 3, 30 ≥ y ≥
4, 40 ≥ z ≥ 5?
Ans: We are looking for solutions in N0 of x + y + z = 48 such that x ≤ 17, y ≤ 26 and
z ≤ 35. Let A = {(x, y, z) ∈ N0^3 | x + y + z = 48}, Ax = {(x, y, z) ∈ N0^3 | x + y + z = 48, x ≥ 18},
Ay = {(x, y, z) ∈ N0^3 | x + y + z = 48, y ≥ 27} and Az = {(x, y, z) ∈ N0^3 | x + y + z = 48, z ≥ 36}.
We know that |A| = C(50, 2). Our answer is then C(50, 2) − |Ax ∪ Ay ∪ Az |.
Very soon we will learn to find the value of |Ax ∪ Ay ∪ Az |.
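
Until the general method is available, the bounded count in part 3 can be obtained by direct enumeration. This is only a sketch under the setup above; count_bounded is a hypothetical helper name, and the enumeration is meant as a check, not as the intended solution technique.

```python
from math import comb

def count_bounded(total, upper):
    """Count (x, y, z) in N0^3 with x + y + z = total and x, y, z at most the given upper bounds."""
    ux, uy, uz = upper
    return sum(
        1
        for x in range(ux + 1)
        for y in range(uy + 1)
        if 0 <= total - x - y <= uz  # z is forced once x and y are chosen
    )

unrestricted = comb(50, 2)                   # |A|
bounded = count_bounded(48, (17, 26, 35))    # solutions with x <= 17, y <= 26, z <= 35
print(unrestricted, bounded, unrestricted - bounded)  # 1225 395 830
```

The last printed number is |Ax ∪ Ay ∪ Az|, the quantity that the text postpones.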
Exercise 4.3.7. 1. Determine the number of solutions of x + y + z = 7 with x, y, z ∈ N.
2. Find the number of allocations of n identical objects to r distinct locations so that location
i gets at least pi ≥ 0 elements, i = 1, 2, · · · , r.
Ans: Each such allocation corresponds uniquely to a solution in integers of x1 + x2 + · · · + xr = n,
xi ≥ pi . Put yi = xi − pi and p = p1 + p2 + · · · + pr . Each solution to the above equation
corresponds uniquely to a solution of y1 + y2 + · · · + yr = n − p in nonnegative integers. The
number of such solutions is C(n + r − p − 1, n − p).
3. In how many ways can we pick integers x1 < x2 < x3 < x4 < x5 from {1, 2, . . . , 20} so
that xi − xi−1 ≥ 3, i = 2, 3, 4, 5? Solve in three different ways.
Ans: Take 15 copies of N and 5 copies of Y . An arrangement in which there are at least two N ’s
between any two consecutive Y ’s corresponds uniquely to one solution of our problem (the positions
of the Y ’s are the chosen integers). Place five Y ’s first. Put two N ’s in each of the four gaps. The
remaining seven copies of N can be put in any of the six places (to the left of the first Y , between the first
and second Y , and so on). This is the same as distributing 7 identical objects into 6 boxes. So, the
answer is the number of solutions in nonnegative integers of the equation x1 + · · · + x6 = 7,
which is C(12, 5).

Alternate. We can take any arrangement of 7 N ’s and 5 Y ’s and then put 2 additional N ’s in each of the
four gaps between consecutive Y ’s. The answer is C(12, 5).

Alternate. Note that 1 ≤ x1 < x2 < · · · < x5 ≤ 20. Consider the differences d1 = x1 − 1,
di = xi − xi−1 for i = 2, 3, 4, 5 and d6 = 20 − x5 . Then, d1 + · · · + d6 = 19, where
d2 , d3 , d4 , d5 ≥ 3 and d1 , d6 ≥ 0. Putting d′i = di − 3 for i = 2, 3, 4, 5, solving this is the same as
solving d1 + d′2 + d′3 + d′4 + d′5 + d6 = 7 in nonnegative integers.
4. Find the number of solutions in nonnegative integers of a + b + c + d + e < 11.
Ans: Same as the number of nonnegative integer solutions of a + b + · · · + f = 10.
5. In a room, there are 2 distinct book racks with 5 shelves each. Each shelf is capable of
holding up to 10 books. In how many ways can we place 10 distinct books in two racks?
Ans: We can put k books in rack A and 10 − k books in rack B. To keep k books in rack A,
we have C(4 + k, 4)k! ways. To keep 10 − k books in rack B, we have C(4 + 10 − k, 4)(10 − k)!
ways. So, the answer is Σ_{k=0}^{10} C(10, k) C(4 + k, 4)k! C(4 + 10 − k, 4)(10 − k)!.

Alternate. 10!C(19, 9).

6. How many 4-letter words (with repetition) are there with the letters in alphabetical order?
Ans: Think of the number of terms in (A + B + · · · + Z)^4 . Or equivalently, let A appear
i1 times, B appear i2 times, ..., Z appear i26 times. Then, we have to find the number of
solutions in non-negative integers to the equation i1 + i2 + · · · + i26 = 4. So, the answer is
C(26 + 4 − 1, 4) = C(29, 4).

7. Determine the number of non-decreasing sequences of length r using the numbers 1, 2, . . . , n.
Ans: Think of the number of terms in (x1 + x2 + · · · + xn )^r , with xi representing the number i.
So, the answer is C(n + r − 1, r).
8. In how many ways can m indistinguishable balls be put into n distinguishable boxes with
the restriction that no box is empty?
Ans: Think of the number of terms in (x1 + x2 + · · · + xn )^{m−n} , with the exponent of xi representing the
number of balls in the i-th box beyond the first one. We need to look at m − n as there is at least one
ball in each box. So, the answer is C(n + (m − n) − 1, m − n) = C(m − 1, m − n) = C(m − 1, n − 1).
Alternately, as each box is non-empty, let us put exactly one ball in each box. Then, we are
left with m − n indistinguishable balls and they are to be put into n boxes. So, the
answer is C(m − n + n − 1, n − 1) = C(m − 1, n − 1).
9. How many 26-letter permutations of the ENGLISH alphabet have no 2 vowels together?
Ans: Arrange the vowels A, E, I, O, U in 5! ways. For each such arrangement, say A, E, I, O, U, we need to
place a particular permutation of the 21 consonants in 6 boxes (Box-1 ≡ before A, Box-2 ≡
between A and E, ..., Box-5 ≡ between O and U, and Box-6 ≡ after U). So, having no 2 vowels
together implies that we need to look at the solutions of x1 + x2 + · · · + x6 = 21 with x1 , x6 ≥ 0
and x2 , x3 , x4 , x5 ≥ 1. Note that this equation reduces to x1 + y2 + · · · + y5 + x6 = 17
with x1 , y2 , y3 , y4 , y5 , x6 ≥ 0, with yi = xi − 1, for 2 ≤ i ≤ 5. Hence, the answer is
5! · 21! · C(17 + 6 − 1, 17) = 21! 5! C(22, 5).

Alternately, take a particular arrangement of the 21 consonants and a particular arrangement of the
5 vowels. So, no 2 vowels together implies that we need to choose 5 places among the 22
places that appear ‘before, between and after the 21 consonants’. The number of ways of choosing
5 places among 22 places equals C(22, 5). Hence, we get the same number as earlier.
10. How many 26-letter permutations of the ENGLISH alphabet have at least two consonants
between any two vowels?
Ans: Arrange the vowels A, E, I, O, U in 5! ways. For each such arrangement, say A, E, I, O, U, we need to
place a particular permutation of the 21 consonants in 6 boxes (Box-1 ≡ before A, Box-2 ≡
between A and E, ..., Box-5 ≡ between O and U, and Box-6 ≡ after U). So, having at least two consonants
between any two vowels implies that we need to look at the solutions of x1 + x2 + · · · + x6 = 21 with x1 , x6 ≥ 0
and x2 , x3 , x4 , x5 ≥ 2. Note that this equation reduces to x1 + y2 + · · · + y5 + x6 = 13
with x1 , y2 , y3 , y4 , y5 , x6 ≥ 0, with yi = xi − 2, for 2 ≤ i ≤ 5. Hence, the answer is
5! · 21! · C(13 + 6 − 1, 13) = 21! 5! C(18, 5).
11. How many ways are there to select 10 integers from the set {1, 2, . . . , 100} such that the
positive difference between any two of the 10 integers is at least 3?
Ans: Note that the selected numbers can be put in increasing order. We read the condition as requiring at
least 3 numbers between any two of the selected numbers (so, the difference between any two of them
is at least 4). So, remove 27 = 3 × (10 − 1) numbers from the list. This leaves us with 73 numbers.
So, the answer is C(73, 10).
Alternately, let the numbers be x1 < x2 < · · · < x10 . Then x1 ≥ 1, x10 ≤ 100 and
xi+1 − xi ≥ 4 for i = 1, 2, . . . , 9. Define y1 = x1 − 1 and, for i = 1, 2, . . . , 9, define
yi+1 = xi+1 − xi − 4. Then yi ≥ 0 and
y1 + y2 + · · · + y10 = x10 − 37. As x10 ≤ 100, we need to solve the inequality

y1 + y2 + · · · + y10 ≤ 63.

This is the same as solving in non-negative integers the equation y1 + y2 + · · · + y10 + y11 = 63.
So, the answer is C(63 + 11 − 1, 63) = C(73, 10).
12. How many 10-element subsets of the ENGLISH alphabet do not have a pair of consecutive
letters?
Ans: Proceeding as above, with y1 = x1 − 1 and yi+1 = xi+1 − xi − 2 for 1 ≤ i ≤ 9, we need to solve the
inequality y1 + y2 + · · · + y10 ≤ 7, as x1 ≥ 1, xi+1 − xi ≥ 2 and x10 ≤ 26. So, the answer is
C(7 + 10, 7) = C(17, 7).
13. How many 10-element subsets of the ENGLISH alphabet have a pair of consecutive letters?
Ans: Using the previous exercise, the answer is C(26, 10) − C(17, 7).
14. How many ways are there to distribute 50 balls to 5 persons if Ram and Shyam together
get no more than 30 and Mohan gets at least 10?
Ans: We need to find the number of solutions in non-negative integers of the equation x1 + x2 +
· · · + x5 = 50 with the restriction that x1 + x2 ≤ 30 and x3 ≥ 10. As Mohan gets at least
10, we define x3 = y3 + 10. Also, for any r, 0 ≤ r ≤ 30, we need to solve in non-negative
integers the equation x1 + x2 = r. For this r, we then need to solve in non-negative integers the equation
y3 + x4 + x5 = 40 − r. So, the answer is
Σ_{r=0}^{30} C(r + 2 − 1, r) · C(40 − r + 3 − 1, 40 − r) = Σ_{r=0}^{30} C(r + 1, r) · C(42 − r, 40 − r).
15. How many arrangements of the letters of KAGARTHALAMNAGARTHALAM have no 2
vowels adjacent?
Ans: Note that only the vowel A appears and it appears 8 times. The consonants are 14 in
number and they can be arranged in 14!/(2!2!2!2!2!2!) ways. Also, each arrangement of consonants
gives rise to 15 places. So, for each arrangement of the consonants, we have to choose 8 places
for A among the 15 places. Hence, the required answer is C(15, 8) · 14!/(2!2!2!2!2!2!).
16. How many arrangements of the letters of RECURRENCERELATION have no 2 vowels
adjacent?
Ans: The reasoning is similar to the previous question. Here, the number of arrangements of the vowels is
8!/4!, the number of arrangements of the consonants is 10!/(4!2!2!), and for each arrangement of vowels
and consonants there are C(11, 8) ways to choose the places for the vowels. So, the answer is
10!/(4!2!2!) · 8!/4! · C(11, 8).
17. How many ways are there to arrange the letters in ABRACADABARAARCADA such
that the first

(a) A precedes the first B?

Ans: Note that there are 9 A’s, 2 B’s, 2 C’s, 2 D’s and 3 R’s. Let us just put the
9 A’s. Then B’s can appear at any of the 9 places after the first A has appeared. So,
we need to solve in non-negative integers the equation x1 + x2 + · · · + x9 = 2. This
gives C(2 + 9 − 1, 2) ways. The rest of the letters can be arranged among themselves
in 7!/(2!2!3!) ways. Also, they can be put anywhere among the possible 12 places. So, we
need to solve in non-negative integers the equation x1 + x2 + · · · + x12 = 7. This gives
C(7 + 12 − 1, 7) ways. So, the answer is 7!/(2!2!3!) · C(10, 2) · C(18, 7).
(b) B precedes the first A and the first D precedes the first C?
Ans: Using the idea of the previous question, we need to solve x1 + x2 = 9 in non-negative
integers to get C(10, 9) arrangements in which the first B precedes the first A.
Similarly, there are C(3, 2) arrangements to get the first D preceding the first C. The
D’s and C’s can be put anywhere among the possible 12 places. So, we need to solve in
non-negative integers the equation x1 + x2 + · · · + x12 = 4. This gives C(15, 4) ways.
Now, the R’s can be put anywhere among the possible 16 places. So, we need to solve
in non-negative integers the equation x1 + x2 + · · · + x16 = 3. This gives C(18, 3) ways.
So, the answer is C(10, 9) · C(3, 2) · C(15, 4) · C(18, 3).
(c) B precedes the first A and the first A precedes the first C?
Ans: Arrange the 9 A’s and 2 C’s so that the first A precedes the first C, that is, so that the
arrangement starts with an A. The 2 C’s occupy 2 of the remaining 10 positions, which gives C(10, 2) ways.
As the first B precedes the first A, the full arrangement of A’s, B’s and C’s starts with a B, and the
other B occupies one of the remaining 12 positions, which gives C(12, 1) ways.
Now, the D’s can be put anywhere among the possible 14 places. So, we need to solve in
non-negative integers the equation x1 + x2 + · · · + x14 = 2. This gives C(15, 2) ways.
Finally, the R’s can be put anywhere among the possible 16 places. So, we need to solve
in non-negative integers the equation x1 + x2 + · · · + x16 = 3. This gives C(18, 3) ways.
So, the answer is C(10, 2) · C(12, 1) · C(15, 2) · C(18, 3).

18. How many ways are there to arrange the letters in KAGARTHALAMNAGARTHATAM
such that the first

(a) A precedes the first T ?

Ans:
(b) M precedes the first G and the first H precedes the first A?
Ans:
(c) M precedes the first G and the first T precedes the first G?
Ans:

19. In how many ways can we pick 20 letters from 10 A’s, 15 B’s and 15 C’s?
Ans: Let the number of A’s, B’s and C’s be x1 , x2 and x3 , respectively. Then, we need
to solve in non-negative integers the equation x1 + x2 + x3 = 20 with the condition that
x1 ≤ 10, x2 ≤ 15 and x3 ≤ 15.
20. Determine the number of ways to seat 10 men and 7 women so that no 2 women sit next to
each other.
21. How many ways can 8 persons, including Ram and Shyam, sit in a row with Ram and
Shyam not sitting next to each other?
Ans: There are 6! ways to seat the six persons different from Ram and Shyam. Then, there
are C(7, 2) places for Ram and Shyam. Also, there are 2! ways of arranging Ram and Shyam.
So, the answer is 2! · 6! · C(7, 2) = 6 · 7!.
Alternately, if Ram and Shyam have to sit next to each other then we have seen earlier that
the number of such arrangements is 2 · 7!. Hence, the required number is 8! − 2 · 7! = 6 · 7!.
22. Evaluate Σ_{i1=1}^{n} Σ_{i2=1}^{i1} Σ_{i3=1}^{i2} · · · Σ_{ik=1}^{i_{k−1}} 1.
Ans: Note that
Σ_{i1=1}^{n} Σ_{i2=1}^{i1} Σ_{i3=1}^{i2} · · · Σ_{in=1}^{i_{n−1}} in = Σ_{i1=1}^{n} Σ_{i2=1}^{i1} Σ_{i3=1}^{i2} · · · Σ_{in=1}^{i_{n−1}} Σ_{i_{n+1}=1}^{in} 1.
The latter is equal to
the number of nonincreasing sequences (x1 , x2 , · · · , xn+1 ) of positive integers with x1 ≤ n.
Such a sequence corresponds uniquely to an arrangement of n balls and n + 1 bars such that
the last element is a ball: xi is the number of balls to the right of the i-th bar. The number of
such arrangements is C(2n, n + 1). In the same way, the k-fold sum in the question counts the
nonincreasing sequences of length k with entries from {1, 2, . . . , n}, and so it equals C(n + k − 1, k).
23. Evaluate Σ_{i1=1}^{9} Σ_{i2=1}^{i1} Σ_{i3=1}^{i2} · · · Σ_{i9=1}^{i8} i9^2 .
Ans: Put x = Σ_{i1=1}^{9} Σ_{i2=1}^{i1} · · · Σ_{i9=1}^{i8} i9^2 and put y = Σ_{i1=1}^{9} Σ_{i2=1}^{i1} · · · Σ_{i9=1}^{i8} i9 . Then,
z = (x + y)/2 = Σ_{i1=1}^{9} Σ_{i2=1}^{i1} · · · Σ_{i9=1}^{i8} Σ_{i10=1}^{i9} i10 .
Note that y is the number of nonincreasing sequences of length 10 made from {1, 2, . . . , 9}
and z is the number of nonincreasing sequences of length 11 made from {1, 2, . . . , 9}. Hence,
y = C(18, 8) and z = C(19, 8). So, x = 2z − y = 2C(19, 11) − C(18, 10) = 107406.

Alternate. Take i9 = 1. How many nonincreasing sequences of length 9 are there with
elements from {1, 2, . . . , 9} such that the last element is 1? We have 9 ≥ i1 ≥ · · · ≥ i8 ≥
i9 = 1. This is the number of N0 solutions of d1 + · · · + d9 = 8. So, it is C(16, 8).
Take i9 = 2. How many nonincreasing sequences of length 9 are there with elements from
{1, 2, . . . , 9} such that the last element is 2? We have 9 ≥ i1 ≥ · · · ≥ i8 ≥ i9 = 2. This is the
number of N0 solutions of d1 + · · · + d9 = 7. So, it is C(15, 8). Contribution to the sum is
2^2 C(15, 8).
Proceeding as above, the total sum is C(16, 8) + C(15, 8)2^2 + · · · + C(8, 8)9^2 = Σ_{i=1}^{9} C(17 − i, 8) i^2 = 107406.
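
Two of the answers above can be confirmed numerically. The sketch below is an illustration only; the helper nested_sum is a hypothetical name. It evaluates the nested sums of Exercise 23 by enumerating the nonincreasing sequences directly, and it also checks that the two expressions given for the book-rack problem (Exercise 5) agree.

```python
from itertools import combinations_with_replacement
from math import comb, factorial

def nested_sum(n, depth, weight):
    """Sum weight(i_depth) over all sequences n >= i_1 >= i_2 >= ... >= i_depth >= 1."""
    total = 0
    # Each multiset of size `depth` is one such sequence; the tuples are produced in
    # nondecreasing order, so the smallest entry (the innermost index) is seq[0].
    for seq in combinations_with_replacement(range(1, n + 1), depth):
        total += weight(seq[0])
    return total

x = nested_sum(9, 9, lambda i: i * i)
y = nested_sum(9, 9, lambda i: i)
assert y == comb(18, 8)
assert x == 2 * comb(19, 8) - comb(18, 8)
assert x == sum(comb(17 - i, 8) * i * i for i in range(1, 10))
print(x)  # 107406

# Exercise 5: the sum over k equals the closed form 10! C(19, 9).
books = sum(
    comb(10, k) * comb(4 + k, 4) * factorial(k) * comb(14 - k, 4) * factorial(10 - k)
    for k in range(11)
)
assert books == factorial(10) * comb(19, 9)
```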

4.4 Set Partitions

Recall that a partition of a set S is a collection of pairwise disjoint nonempty subsets whose
union is S. For clarity, let us look at a few examples once again.

Example 4.4.1. (a) {{1, 2}, {3}, {4, 5, 6}}, {{1, 3}, {2}, {4, 5, 6}} and {{1, 2, 3, 4}, {5}, {6}} are
all partitions of {1, 2, 3, 4, 5, 6} into 3 subsets.
(b) There are 2^{n−1} − 1 partitions of {1, 2, . . . , n}, n ≥ 2, into two subsets. To see this, observe
that for each nontrivial subset A ∈ P({1, 2, . . . , n}), the set {A, A^c } is a partition
of {1, 2, . . . , n} into two subsets. Since the total number of nontrivial subsets of
{1, 2, . . . , n} equals 2^n − 2, and the subsets A and A^c give the same partition, the required
number is (2^n − 2)/2 = 2^{n−1} − 1.

(c) Number of allocations of 7 students into 7 different project groups so that each group has
one student, is 7! = C(7; 1, 1, 1, 1, 1, 1, 1) but the number of partitions of a set of 7 students
into 7 subsets is 1.
(d) In how many ways can I write the partition {{1, 2}, {3, 4}, {5, 6}, {7, 8, 9}, {10, 11, 12}} on a piece of
paper, with the condition that the sets have to be written in a row in increasing size?

Ans: Let us write a few first.

{{1, 2}, {3, 4}, {5, 6}, {7, 8, 9}, {10, 11, 12}}    correct
{{2, 1}, {3, 4}, {5, 6}, {7, 8, 9}, {10, 11, 12}}    correct
{{5, 6}, {3, 4}, {1, 2}, {10, 11, 12}, {9, 7, 8}}    correct
{{2, 3}, {1, 4}, {5, 6}, {7, 8, 9}, {10, 11, 12}}    incorrect, not the same partition
{{2, 1}, {3, 4}, {7, 8, 9}, {5, 6}, {10, 11, 12}}    incorrect, not satisfying the condition

There are 3!(2!)^3 × 2!(3!)^2 ways. Notice that from each written partition, if I remove the
brackets I get an arrangement of elements of {1, 2, . . . , 12}.

(e) How many arrangements do I generate from a partition with pi subsets of size ni , n1 <
· · · < nk ?
Ans: p1 !(n1 !)^{p1} · · · pk !(nk !)^{pk} = Π_{i=1}^{k} [pi !(ni !)^{pi} ].

Theorem 4.4.2. [Set partition] The number of partitions of {1, 2, . . . , n} with pi subsets of
size ni , n1 < · · · < nk , where p1 n1 + · · · + pk nk = n, is

n! / ((n1 !)^{p1} p1 ! · · · (nk !)^{pk} pk !).

Proof. Note that each such partition generates Π_{i=1}^{k} [pi !(ni !)^{pi} ] arrangements of elements of
{1, 2, . . . , n}, and that different partitions generate different arrangements.
Conversely, for each arrangement of elements of {1, 2, . . . , n} we can easily construct a partition
of the above type which can generate this arrangement. Thus, the proof is complete.
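
The formula of Theorem 4.4.2 can be compared against an explicit enumeration of set partitions for a small case. The following is a sketch with illustrative names (partitions_of_type, set_partitions and brute_force are not from the notes); the brute-force part is only feasible for small n.

```python
from collections import Counter
from math import factorial

def partitions_of_type(block_sizes):
    """Formula of Theorem 4.4.2 for the given list of block sizes (with repetition)."""
    n = sum(block_sizes)
    denominator = 1
    for size, multiplicity in Counter(block_sizes).items():
        denominator *= factorial(size) ** multiplicity * factorial(multiplicity)
    return factorial(n) // denominator

def set_partitions(elements):
    """Yield every partition of the list `elements` as a list of blocks."""
    if not elements:
        yield []
        return
    first, rest = elements[0], elements[1:]
    for partition in set_partitions(rest):
        # put `first` into one of the existing blocks ...
        for i in range(len(partition)):
            yield partition[:i] + [[first] + partition[i]] + partition[i + 1:]
        # ... or into a new block of its own
        yield [[first]] + partition

def brute_force(block_sizes):
    """Count partitions of {1, ..., n} whose multiset of block sizes matches."""
    target = sorted(block_sizes)
    n = sum(block_sizes)
    return sum(
        1
        for p in set_partitions(list(range(1, n + 1)))
        if sorted(len(block) for block in p) == target
    )

assert partitions_of_type([1, 2, 2]) == brute_force([1, 2, 2]) == 15
print(partitions_of_type([2, 2, 2, 3, 3]))  # type of Example 4.4.1(d): 138600
```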

Definition 4.4.3. The Stirling number of the second kind, denoted S(n, r), is the number of
partitions of {1, 2, . . . , n} into r nonempty subsets (r parts). By convention, S(n, r) = 1, if n = r and 0,
whenever either ‘n > 0 and r = 0’ or ‘n < r’.

Theorem 4.4.4. [recurrence for S(n, r)] S(n + 1, r) = S(n, r − 1) + rS(n, r).

Proof. Write an r-partition of {1, 2, . . . , n, n + 1} and erase n + 1 from it. If {n + 1}
is an element of the r-partition, then erasing it leaves an (r − 1)-partition of {1, 2, . . . , n}, and the
number of such partitions is S(n, r − 1). Otherwise, erasing n + 1 leaves an r-partition of {1, 2, . . . , n};
as n + 1 could have been in any of its r elements, the number of such partitions is
rS(n, r).
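
The recurrence, together with the conventions of Definition 4.4.3, gives a direct way to tabulate S(n, r). A minimal sketch (the function name stirling2 is an illustrative choice):

```python
from functools import lru_cache

@lru_cache(maxsize=None)
def stirling2(n, r):
    """S(n, r) via S(n, r) = S(n - 1, r - 1) + r * S(n - 1, r)."""
    if n == r:
        return 1                      # convention: S(n, n) = 1, including S(0, 0)
    if r == 0 or n < r:
        return 0                      # conventions for the remaining boundary cases
    return stirling2(n - 1, r - 1) + r * stirling2(n - 1, r)

# Partitions into two subsets: 2^(n-1) - 1, as in Example 4.4.1(b).
for n in range(2, 10):
    assert stirling2(n, 2) == 2 ** (n - 1) - 1

print([stirling2(5, r) for r in range(1, 6)])  # [1, 15, 25, 10, 1]
```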

Example 4.4.5. Determine the number of ways of putting n distinguishable/distinct balls into
r indistinguishable boxes with the restriction that no box is empty.
Ans: Let A be the set of n distinct balls and let the balls in i-th box be Bi , 1 ≤ i ≤ r.

1. Since each box is non-empty, each Bi is non-empty.

2. Also, each ball is in exactly one box and hence the Bi ’s are pairwise disjoint and ∪_{i=1}^{r} Bi = A.

3. As the boxes are indistinguishable, we arrange the boxes in non-increasing order, i.e.,
|B1 | ≥ · · · ≥ |Br |.

Thus, B1 , B2 , . . . , Br is a partition of A into r parts. Hence, the required number of ways is
given by S(n, r), the Stirling number of the second kind.

To proceed further, consider the following example.

Example 4.4.6. Let A = {a, b, c, d, e} and S = {1, 2, 3}. Define an onto function f : A → S by
f (a) = f (b) = f (c) = 1, f (d) = 2 and f (e) = 3. Then, f gives a partition B1 = {a, b, c}, B2 =
{d} and B3 = {e} of A into 3-parts. Also, let A1 = {a, d}, A2 = {b, e} and A3 = {c} be a
partition of A into 3-parts. Then, this partition gives 3! onto functions from A into S, each of
them being a one-to-one function from {A1 , A2 , A3 } to S, namely,

f1 (a) = f1 (d) = 1, f1 (b) = f1 (e) = 2, f1 (c) = 3, ⇔ f1 (A1 ) = 1, f1 (A2 ) = 2, f1 (A3 ) = 3

f2 (a) = f2 (d) = 1, f2 (b) = f2 (e) = 3, f2 (c) = 2, ⇔ f2 (A1 ) = 1, f2 (A2 ) = 3, f2 (A3 ) = 2

f3 (a) = f3 (d) = 2, f3 (b) = f3 (e) = 1, f3 (c) = 3, ⇔ f3 (A1 ) = 2, f3 (A2 ) = 1, f3 (A3 ) = 3
f4 (a) = f4 (d) = 2, f4 (b) = f4 (e) = 3, f4 (c) = 1, ⇔ f4 (A1 ) = 2, f4 (A2 ) = 3, f4 (A3 ) = 1
f5 (a) = f5 (d) = 3, f5 (b) = f5 (e) = 1, f5 (c) = 2, ⇔ f5 (A1 ) = 3, f5 (A2 ) = 1, f5 (A3 ) = 2
f6 (a) = f6 (d) = 3, f6 (b) = f6 (e) = 2, f6 (c) = 1, ⇔ f6 (A1 ) = 3, f6 (A2 ) = 2, f6 (A3 ) = 1.

Lemma 4.4.7. The total number of onto functions f : {1, 2, . . . , r} → {1, 2, . . . , n} is n!S(r, n).

Proof. ‘f is onto’ means ‘for all y ∈ {1, 2, . . . , n} there exists x ∈ {1, 2, . . . , r}, such that
f (x) = y’. Therefore, the number of onto functions is 0, whenever r < n. So, we assume that
r ≥ n. Then,

1. for each i ∈ {1, 2, . . . , n}, f^{−1}(i) = {x ∈ {1, 2, . . . , r} | f (x) = i} is a non-empty set (f is
onto).

2. f^{−1}(i) ∩ f^{−1}(j) = ∅, whenever 1 ≤ i ≠ j ≤ n (f is a function).

3. ∪_{i=1}^{n} f^{−1}(i) = {1, 2, . . . , r} (domain of f is {1, 2, . . . , r}).

Therefore, the f^{−1}(i)’s give a partition of {1, 2, . . . , r} into n parts. Also, note that each such
function f gives a one-to-one function from {f^{−1}(1), . . . , f^{−1}(n)} to {1, 2, . . . , n}.
Conversely, for each partition A1 , A2 , . . . , An of {1, 2, . . . , r} into n parts, we get n! one-to-one
functions from {A1 , A2 , . . . , An } to {1, 2, . . . , n}. Hence,

|{f : {1, 2, . . . , r} → {1, 2, . . . , n} | f is onto}|
    = |{g : {A1 , A2 , . . . , An } → {1, 2, . . . , n} | g is one-to-one}| × |{partitions of {1, 2, . . . , r} into n parts}|
    = n! S(r, n).

Thus, the required result follows.

Lemma 4.4.8. Let r, n ∈ N and ℓ = min{r, n}. Then,

n^r = Σ_{k=1}^{ℓ} C(n, k)k!S(r, k).    (4.1)

Proof. Let A = {f | f : {1, 2, . . . , r} → {1, 2, . . . , n}}. We compute |A| by two different methods.
Method 1: By Theorem 4.1.2, |A| = n^r .
Method 2: Let f0 : {1, 2, . . . , r} → {1, 2, . . . , n} be any function. Then, f0 is an onto function
from {1, 2, . . . , r} to Im (f0 ) = f0 ({1, 2, . . . , r}). Moreover, |Im (f0 )| = k, for some k, 1 ≤ k ≤ ℓ = min{r, n}.
Thus, A = ∪_{k=1}^{ℓ} Ak , where Ak = {f : {1, 2, . . . , r} → {1, 2, . . . , n} | |f ({1, 2, . . . , r})| = k} and
Ak ∩ Aj = ∅, whenever 1 ≤ j ≠ k ≤ ℓ. Now, using Theorem 4.1.18, a subset of {1, 2, . . . , n} of
size k can be selected in C(n, k) ways. Thus, by Lemma 4.4.7, for 1 ≤ k ≤ ℓ,

|Ak | = |{K : K ⊆ {1, 2, . . . , n}, |K| = k}| × |{f : {1, 2, . . . , r} → K | f is onto}| = C(n, k)k!S(r, k).

Therefore,

|A| = |∪_{k=1}^{ℓ} Ak | = Σ_{k=1}^{ℓ} |Ak | = Σ_{k=1}^{ℓ} C(n, k)k!S(r, k).

Hence, using the two counting methods, the required result follows.
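
Both Lemma 4.4.7 and Equation (4.1) can be tested on small values. The sketch below uses illustrative names (stirling2, count_onto) and represents a function {1, . . . , r} → {1, . . . , n} as a tuple of r values; it is a numerical check, not a proof.

```python
from functools import lru_cache
from itertools import product
from math import comb, factorial

@lru_cache(maxsize=None)
def stirling2(r, k):
    """S(r, k) from the recurrence of Theorem 4.4.4 and the conventions of Definition 4.4.3."""
    if r == k:
        return 1
    if k == 0 or r < k:
        return 0
    return stirling2(r - 1, k - 1) + k * stirling2(r - 1, k)

def count_onto(r, n):
    """Brute-force number of onto functions from an r-set to an n-set."""
    return sum(1 for f in product(range(n), repeat=r) if len(set(f)) == n)

for r in range(1, 7):
    for n in range(1, 5):
        # Lemma 4.4.7: the number of onto functions is n! S(r, n)
        assert count_onto(r, n) == factorial(n) * stirling2(r, n)
        # Equation (4.1): n^r = sum over k of C(n, k) k! S(r, k)
        ell = min(r, n)
        assert n ** r == sum(
            comb(n, k) * factorial(k) * stirling2(r, k) for k in range(1, ell + 1)
        )
```

For instance, there are 3! S(5, 3) = 150 onto functions from a 5-set to a 3-set.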

Remark 4.4.9. 1. The following two problems are equivalent.

(a) Count the number of onto functions f : {1, 2, . . . , r} → {1, 2, . . . , n}.
(b) Count the number of ways to put r distinguishable/distinct balls into n distinguishable/distinct
boxes so that no box is empty.

2. The numbers S(r, k) can be recursively calculated using Equation (4.1). For example, we
show that S(m, 1) = 1, for all m ≥ 1.
Ans: Take n ≥ 1 and r = 1 in Equation (4.1) to get

n = n^1 = Σ_{k=1}^{1} C(n, k)k!S(1, k) = C(n, 1)1!S(1, 1) = nS(1, 1).

Thus, S(1, 1) = 1. Take n = 1 and r ≥ 2 in Equation (4.1) to get

1 = 1^r = Σ_{k=1}^{1} C(1, k)k!S(r, k) = S(r, 1).

3. As an exercise, verify that S(5, 2) = 15, S(5, 3) = 25, S(5, 4) = 10 and S(5, 5) = 1.
Exercise 4.4.10. 1. Determine the number of ways of

(a) selecting r distinguishable objects from n distinguishable objects, when n ≥ r.

(b) distributing 20 distinct toys among 4 children if each child gets 5 toys.
(c) placing r distinguishable balls into n indistinguishable boxes if no box is empty.
(d) placing r distinguishable balls into n indistinguishable boxes.

2. For n ∈ N, let b(n) denote the number of partitions of the set {1, 2, . . . , n}. Then, b(n) =
Σ_{r=0}^{n} S(n, r) is called the nth Bell number. By definition, b(0) = 1 = b(1). Determine b(n),
for 2 ≤ n ≤ 5.
3. Fix n ∈ N. Then, a composition of n is an expression of n as a sum of positive integers.
For example, if n = 4, then the distinct compositions are

4, 3 + 1, 1 + 3, 2 + 2, 2 + 1 + 1, 1 + 1 + 2, 1 + 2 + 1, 1 + 1 + 1 + 1.

Let Sk (n) denote the number of compositions of n into k parts. Then, S1 (4) = 1, S2 (4) =
3, S3 (4) = 3 and S4 (4) = 1. Determine Sk (n), for 1 ≤ k ≤ n, and Σ_{k≥1} Sk (n).
4. Let S = {f | f : {1, 2, . . . , r} → {1, 2, . . . , n + 1}}. Compute |S| in two ways to prove (n + 1)^r =
Σ_{k=0}^{r} C(r, k)n^k .
5. Suppose 13 people get on the lift at level 0. If all the people get down at some level, say
1, 2, 3, 4 and 5, then calculate the number of ways of getting down if at least one person
gets down at each level.

Definition 4.4.11. [Partition of a number] Let n, k ∈ N. A partition of n into k parts is

a tuple (x1 , · · · , xk ) ∈ N^k written in non-increasing order such that x1 + · · · + xk = n. It may
be viewed as a k-multiset S ⊆ N with sum n. By πn (k), we denote the number of partitions
of n into exactly k parts and by πn , the number of partitions of n. Conventionally π0 = 1 and
πn (k) = 0, whenever k > n.

Remark 4.4.12. π7 (4) = 3 as the partitions of 7 into 4-parts are 4 + 1 + 1 + 1, 3 + 2 + 1 + 1

and 2 + 2 + 2 + 1. Verify that π7 (2) = 3 and π7 (3) = 4.

Example 4.4.13. Determine the number of ways of placing r indistinguishable balls into n
indistinguishable boxes

1. with the restriction that no box is empty.

Ans: As the balls are indistinguishable, we need to count the number of balls in each box.
As the boxes are indistinguishable, arrange them so that the number of balls inside boxes
are in non-increasing order. Also, each box is non-empty and hence the answer is πr (n).

2. with no restriction.
Ans: Let us place one ball in each box. Now ‘placing r indistinguishable balls into n
indistinguishable boxes with no restriction’ is the same as ‘placing r + n indistinguishable
balls into n indistinguishable boxes so that no box is empty.’ Therefore, the required
answer is π_{r+n}(n).
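
The numbers πn (k) can be obtained by listing the partitions explicitly. The sketch below is illustrative (the names partitions_into_parts and pi are not from the notes). It generates partitions by choosing the largest part first, reproduces the values of Remark 4.4.12, and checks the unrestricted case of Example 4.4.13 for small values.

```python
def partitions_into_parts(n, k, largest=None):
    """Yield the partitions of n into exactly k positive parts, in non-increasing order."""
    if largest is None:
        largest = n
    if k == 0:
        if n == 0:
            yield ()
        return
    # The next part is at most `largest` and must leave at least 1 for each remaining part.
    for first in range(min(largest, n - (k - 1)), 0, -1):
        for rest in partitions_into_parts(n - first, k - 1, first):
            yield (first,) + rest

def pi(n, k):
    """Number of partitions of n into exactly k parts."""
    return sum(1 for _ in partitions_into_parts(n, k))

print(list(partitions_into_parts(7, 4)))  # [(4, 1, 1, 1), (3, 2, 1, 1), (2, 2, 2, 1)]
assert (pi(7, 2), pi(7, 3), pi(7, 4)) == (3, 4, 3)

# r balls into n boxes with no restriction: at most n non-empty boxes, so the count is
# pi(r, 0) + pi(r, 1) + ... + pi(r, n), which should agree with pi(r + n, n).
for r in range(0, 9):
    for n in range(1, 6):
        assert sum(pi(r, k) for k in range(n + 1)) == pi(r + n, n)
```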

Exercise 4.4.14. 1. Calculate π(n), for n = 1, 2, 3, . . . , 8.

2. Prove that π2r (r) = π(r), for any r ∈ N.

3. For a fixed n ∈ N determine a recurrence relation for the numbers πn (r)’s for 1 ≤ r ≤ n.

Definition 4.4.15. The Stirling number of the first kind, denoted s(n, k), is the coefficient
of x^k in x^{\underline{n}}, where x^{\underline{n}} is called the falling factorial and equals x(x − 1)(x − 2) · · · (x − n + 1).
The rising factorial x^{\overline{n}} is defined as x(x + 1)(x + 2) · · · (x + n − 1).

Exercise 4.4.16. Prove by induction that

1. s(n, m)(−1)^{n−m} is the coefficient of x^m in x^{\overline{n}} and |s(n, m)| = s(n, m)(−1)^{n−m} .
Ans: Denote by f (n, m) the coefficient of x^m in x^{\overline{n}}. The statements are true for n = 1.
Assume that the first statement is true for n ≤ k. Then, s(k + 1, m) = the coefficient of x^m
in x^{\underline{k}}(x − k), which is s(k, m − 1) − ks(k, m). Similarly, f (k + 1, m) = f (k, m − 1) + kf (k, m).
So, f (k + 1, m) = (−1)^{k−m+1} s(k, m − 1) + k(−1)^{k−m} s(k, m) = (−1)^{k−m+1} [s(k, m − 1) −
ks(k, m)] = (−1)^{k−m+1} s(k + 1, m).

As the f (n, m) are always nonnegative, the second statement follows.

2. Let a(n, k) denote the number of permutations of {1, 2, . . . , n} which have k disjoint cycles.
For example, a(4, 2) = 11 as it corresponds to the permutations (12)(34), (13)(24),
(14)(23), (1)(234), (1)(243), (134)(2), (143)(2), (124)(3), (142)(3), (123)(4) and (132)(4).
By convention, a(0, 0) = 1 and a(n, 0) = 0 = a(0, n), whenever n ≥ 1. Prove
that the numbers a(n, k) satisfy

a(n, k) = (n − 1)a(n − 1, k) + a(n − 1, k − 1).

3. Prove that a(n, m) = |s(n, m)| for all n, m ∈ N0 .

Ans: The statement is obviously true for n = 1. Assume that it is true for n < k. Notice
that given a permutation σ of {1, 2, . . . , k − 1} which is a product of m − 1 cycles, (k)σ is a
permutation of {1, 2, . . . , k} which is a product of m cycles.
Also, given a permutation σ = Γ1 · · · Γm of {1, 2, . . . , k − 1} which is a product of m cycles,
we can get k − 1 permutations of {1, 2, . . . , k} which are product of m cycles by inserting k
in the gaps or to the right of Γi .
Argue that these are the only permutations of {1, 2, . . . , k} that are product of m cycles and
there is no repetition. Thus, the number of permutations of {1, 2, . . . , k} that are product of
m cycles is |s(k − 1, m − 1)| + (k − 1)|s(k − 1, m)| = |s(k, m)|.
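
The three statements of the exercise can be checked numerically for small n. This is a sketch only; the function names are illustrative, and a permutation of {0, . . . , n − 1} is represented by a tuple p with i mapped to p[i] (a 0-based relabelling of {1, . . . , n}).

```python
from functools import lru_cache
from itertools import permutations

def cycle_count(perm):
    """Number of cycles (fixed points included) of the permutation i -> perm[i]."""
    seen, cycles = set(), 0
    for start in range(len(perm)):
        if start not in seen:
            cycles += 1
            j = start
            while j not in seen:
                seen.add(j)
                j = perm[j]
    return cycles

@lru_cache(maxsize=None)
def a(n, k):
    """Number of permutations of an n-set with exactly k cycles, by enumeration."""
    return sum(1 for p in permutations(range(n)) if cycle_count(p) == k)

def falling_factorial_coeffs(n):
    """Coefficients [c_0, ..., c_n] of x(x - 1)...(x - n + 1); c_k is s(n, k)."""
    coeffs = [1]  # the empty product
    for j in range(n):
        shifted = [0] + coeffs                    # x * p(x)
        scaled = [-j * c for c in coeffs] + [0]   # -j * p(x)
        coeffs = [s + t for s, t in zip(shifted, scaled)]
    return coeffs

print(a(4, 2), falling_factorial_coeffs(4))  # 11 [0, -6, 11, -6, 1]

for n in range(1, 7):
    s = falling_factorial_coeffs(n)
    for k in range(1, n + 1):
        assert a(n, k) == abs(s[k]) == (-1) ** (n - k) * s[k]
        assert a(n, k) == (n - 1) * a(n - 1, k) + a(n - 1, k - 1)
```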

4.5 Lattice Paths and Catalan Numbers

Consider a lattice of integer lines in R^2 and let S = {(m, n) | m, n = 0, 1, . . .} be the set of
points on the lattice. For a pair of points, say A = (m1 , n1 ) and B = (m2 , n2 ) with m1 ≤ m2
and n1 ≤ n2 , we define a lattice path from A to B to be a subset {e1 , . . . , ek } of S with e1 = A and
ek = B such that if ei = (x, y) then ei+1 is either (x + 1, y) or (x, y + 1), for 1 ≤ i ≤ k − 1. That is, at each step
we move either one unit right, denoted R, or one unit up, denoted U (see Figure 4.3).

Figure 4.3: A lattice with a lattice path from (2, 3) to (8, 7) (U = up, R = right)

Example 4.5.1. 1. Determine the number of lattice paths from (0, 0) to (m, n).
Ans: As at each step, the unit increase is either R or U , we need to take m many R steps
and n many U steps to reach (m, n) from (0, 0). So, any arrangement of m many R’s and
n many U ’s will give such a path uniquely. Hence, the answer is C(m + n, m).
2. Use the method of lattice paths to prove Σ_{ℓ=0}^{m} C(n + ℓ, ℓ) = C(n + m + 1, m).
Ans: Observe that C(n + m + 1, m) is the number of lattice paths from (0, 0) to (m, n + 1)
and the ℓ-th term on the left hand side is the number of lattice paths from (0, 0) to (ℓ, n), where 0 ≤ ℓ ≤ m.
Fix ℓ, 0 ≤ ℓ ≤ m and let P be a lattice path from (0, 0) to (ℓ, n). Then, the path P followed by Q,
where Q = U RR · · · R with R appearing m − ℓ times, gives a lattice path from (0, 0) to
(m, n + 1), namely

(0, 0) −P→ (ℓ, n) −U→ (ℓ, n + 1) −R···R→ (m, n + 1).

These lattice paths for 0 ≤ ℓ ≤ m are all distinct. Conversely, every lattice path from (0, 0) to
(m, n + 1) arises in this way, with (ℓ, n) → (ℓ, n + 1) being its last U step. Hence the result follows.
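
Both parts of the example can be verified for small values by counting paths according to their last step. A minimal sketch (the function name lattice_paths is an illustrative choice):

```python
from functools import lru_cache
from math import comb

@lru_cache(maxsize=None)
def lattice_paths(m, n):
    """Number of R/U lattice paths from (0, 0) to (m, n), counted by the last step."""
    if m == 0 or n == 0:
        return 1  # only one path along an axis
    return lattice_paths(m - 1, n) + lattice_paths(m, n - 1)

for m in range(0, 8):
    for n in range(0, 8):
        # part 1: the number of paths is C(m + n, m)
        assert lattice_paths(m, n) == comb(m + n, m)
        # part 2: grouping paths to (m, n + 1) by the position of the last U step
        assert sum(lattice_paths(l, n) for l in range(m + 1)) == lattice_paths(m, n + 1)
        assert lattice_paths(m, n + 1) == comb(n + m + 1, m)
```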

Exercise 4.5.2. 1. Give a bijection between ‘the solution set of x0 + x1 + x2 + · · · + xk = n
in non-negative integers’ and ‘the set of lattice paths from (0, 0) to (n, k)’.
2. Use lattice paths to construct a proof of Σ_{k=0}^{n} C(n, k) = 2^n .
3. Use lattice paths to construct a proof of Σ_{k=0}^{n} C(n, k)^2 = C(2n, n). [Hint: C(n, k) is the
number of lattice paths from (0, 0) to (n − k, k) as well as from (n − k, k) to (n, n).]

Discussion 4.5.3. As observed earlier, the number of lattice paths from (0, 0 to (n, n) is
C(2n, n). Suppose, we wish to take paths so that at no step the number of U ’s exceeds the
number of R’s. Then, what is the number of such paths?

1
4.5. LATTICE PATHS AND CATALAN NUMBERS 127

Ans: Call an arrangement of n many U’s and n many R’s a ‘bad path’ if the number of U’s
exceeds the number of R’s at least once. For example, the path RRUUURRU is a ‘bad path’.
To each such arrangement, we correspond another arrangement of n + 1 many U’s and n − 1
many R’s in the following way: spot the first place where the number of U’s exceeds that of
R’s in the ‘bad path’. Then, from the next letter onwards change R to U and U to R. For
example, the bad path RRUUURRU corresponds to the path RRUUUUUR. Notice that this
is a one-one correspondence. Thus, the number of bad paths is C(2n, n − 1). So, the answer to
the question is $C(2n, n) - C(2n, n - 1) = \frac{C(2n, n)}{n + 1}$.
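The count in Discussion 4.5.3 can be checked by brute force for small n. The following is only a sanity-check sketch; the function name `count_good_paths` is an assumption made for illustration and does not come from the notes.

```python
from itertools import combinations
from math import comb


def count_good_paths(n):
    """Count arrangements of n R's and n U's in which, at every step,
    the number of U's so far does not exceed the number of R's so far."""
    good = 0
    # Choose which of the 2n positions hold an R; the rest hold a U.
    for r_positions in combinations(range(2 * n), n):
        r_set = set(r_positions)
        balance = 0  # (number of R's) - (number of U's) seen so far
        ok = True
        for step in range(2 * n):
            balance += 1 if step in r_set else -1
            if balance < 0:  # U's exceed R's: this is a 'bad path'
                ok = False
                break
        if ok:
            good += 1
    return good


for n in range(1, 7):
    total, bad = comb(2 * n, n), comb(2 * n, n - 1)
    print(n, count_good_paths(n), total - bad, total // (n + 1))
```

For each n the three printed counts should coincide, which is what the reflection argument predicts.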
Definition 4.5.4. [Catalan number] The nth Catalan number, denoted Cn , is the number
of different representations of the product A1 · · · An+1 of n + 1 square matrices of the same size
using n pairs of brackets. By convention C0 = 1.
Theorem 4.5.5. [Catalan number] Prove that $C_n = \frac{C(2n, n)}{n + 1}$ for all n ∈ N.

Proof. Claim: After the (n − k)-th ‘(’, there are at least k + 2 many A’s. To see this pick the
substring starting right from the (n − k)-th ‘(’ till we face (k + 1) many ‘)’s. This substring
represents a product of matrices. So, it must contain (k + 2) many Ai ’s.
Given one representation of the product, replace each Ai by A. Drop the right brackets to have
a sequence of n many ‘(’s and n + 1 many A’s. Thus, the number of A’s used till the n − kth ‘(’
is at most n + 1 − (k + 2) = n − k − 1. So, the number of A’s never exceeds the number of ‘(’.
Conversely, given such an arrangement, we can put back the ‘)’s: find two consecutive letters
from the last ‘(’; put a right bracket after them; treat (AA) as a letter; repeat the process. For
example,

((A((AAAA → ((A((AA)AA → ((A((AA)A)A → ((A((AA)A))A → ((A((AA)A))A)

By Discussion 4.5.3, the number of such arrangements is $\frac{C(2n, n)}{n + 1}$.

The readers who are interested in knowing more about Catalan numbers should look at the
book “enumerative combinatorics” by Stanley [12].
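As a numerical companion to Definition 4.5.4 and Theorem 4.5.5, one can count bracketings directly by splitting a product at its outermost multiplication. This is a minimal sketch; the helper names `bracketings` and `catalan` are assumptions introduced here for illustration.

```python
from functools import lru_cache
from math import comb


@lru_cache(maxsize=None)
def bracketings(num_matrices):
    """Number of ways to fully bracket a product of `num_matrices` matrices."""
    if num_matrices == 1:
        return 1  # a single matrix needs no brackets
    # The outermost product splits the chain into a left block of i matrices
    # and a right block of the remaining num_matrices - i matrices.
    return sum(bracketings(i) * bracketings(num_matrices - i)
               for i in range(1, num_matrices))


def catalan(n):
    """C_n: bracketings of a product of n + 1 matrices (so C_0 = 1)."""
    return bracketings(n + 1)


for n in range(0, 11):
    assert catalan(n) == comb(2 * n, n) // (n + 1)
```

The loop only confirms the closed form for n up to 10; it is a check, not a proof.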
Exercise 4.5.6. 1. Give a recurrence relation for Cn ’s (i.e., a formula for Cn involving
C0 , . . . , Cn−1 ). Hence, show that Cn = C(2n, n)/(n + 1).
Ans: Remove the outer brackets and see that $C_n = \sum_{i=1}^{n} C_{i-1} C_{n-i}$.
2. Give an arithmetic proof of the fact that (n + 1) divides C(2n, n).
Ans: As $C(2n, n) = C(2n, n + 1) \cdot \frac{n + 1}{n}$ and gcd(n, n + 1) = 1, it follows that n | C(2n, n + 1),
that is, (n + 1) | C(2n, n).
3. A man is standing on the edge of a swimming pool (facing it) holding a bag containing n
blue and n red balls. He randomly picks up one ball at a time and discards it. If the ball
is blue he takes a step back and if the ball is red, he takes a step forward. What is the
probability of his falling into the swimming pool?
Ans: This is the number of sequences of n blue balls and n red balls such that at some
position the number of red balls (till that position) exceeds the number of blue balls (till that
position). We know it is C(2n, n − 1). The total number of sequences is C(2n, n). Hence,
the probability of his falling into the pool is $\frac{n}{n + 1}$.

4. Consider a regular polygon with vertices 1, 2, · · · , n. In how many ways can we divide the
polygon into triangles using (n − 3) non-crossing diagonals?
Ans: Let f(k + 2) be the number of ways of dividing a convex (k + 2)-gon into triangles. Obviously
f(3) = 1 and f(4) = 2. Let the vertices be 1, 2, · · · , k + 2. Let d(i) denote the degree of the
vertex i. Note that after a division either d(1) = 2 or not.
The number of ways of dividing the convex polygon into triangles such that d(1) = 2 is
f(k + 1). If d(1) > 2, then let i be the smallest positive integer (other than 2) adjacent to 1.
Thus, i can vary from 3 to k + 1 and for each i there are f(i − 1)f(k + 4 − i) ways to divide
the convex (k + 2)-gon into triangles (as in the polygon [1, 2, . . . , i] we must have d(1) = 2).
Thus, using f(2) = 1 and f(k + 2) = g(k),
$$\begin{aligned}
g(k) = f(k + 2) &= f(k + 1) + \sum_{i=3}^{k+1} f(i - 1) f(k + 4 - i) \\
&= f(k + 1)f(2) + f(2)f(k + 1) + f(3)f(k) + \cdots + f(k)f(3) \\
&= \sum_{i=0}^{k-1} g(i)\, g(k - 1 - i).
\end{aligned}$$

As g(0) = g(1) = 1, it follows that g(n) = Cn , the Catalan numbers.

5. How many arrangements of n blue and n red balls are there such that at any position in
the arrangement the number of blue balls (till that position) is at most one more than the
number of red balls (till that position)?

6. We want to write a matrix of size 10 × 2 using numbers 1, . . . , 20 with each number
appearing exactly once. Then, determine the number of such matrices in which the numbers
(a) increase from left to right?
(b) increase from up to down?
(c) increase from left to right and up to down?

Ans: First: $C(20, 2)C(18, 2) \cdots C(4, 2) = \frac{20!}{2^{10}}$. Second: C(20, 10). Third: First select 10 for
the left column. Put them in increasing order. Circle these numbers on a row of 1, . . . , 20. For
example, take

1 2 3 4 5 6 7 8

We want to put the rest in increasing order on the right column. Notice that this selection will
not give us a correct arrangement, because there is a problem at 7: the number of circled
numbers till that point is 3, whereas the number of uncircled numbers till that point is 4 > 3.
That means, we will have 7 to the right of 8, not good. Thus, the answer is the number of
arrangements of 10 circles and 10 dots (representing uncircled numbers) such that the number
of dots does not exceed the number of circles at any point, which is $\frac{C(20, 10)}{11}$.
7. How many lattice paths are there from (0, 0) to (9, 9) which do not cross the dotted line?

[Figure: grid from (0, 0) to (9, 9) with a dotted line.]

Ans: Any such path uniquely corresponds to a similar path in an 11 × 11 grid which cannot
cross the diagonal. So, the answer is C(20, 10)/11.

4.6 Some Generalizations

1. Let n, k ∈ N with 0 ≤ k ≤ n. Then, in Theorem 4.1.18, we saw that $C(n, k) = \frac{n!}{k!(n - k)!}$.
Hence, we can think of $C(n, k) = \frac{n \cdot (n - 1) \cdots (n - k + 1)}{k!}$. With this understanding, we
generalize C(n, k) for any n ∈ R and k ∈ N0 as follows:

$$C(n, k) = \begin{cases}
0, & \text{if } k < 0 \\
0, & \text{if } n = 0,\ n \neq k \\
1, & \text{if } n = k \\
\dfrac{n \cdot (n - 1) \cdots (n - k + 1)}{k!}, & \text{otherwise.}
\end{cases} \tag{4.2}$$

With the notations as above, we give the generalized binomial theorem without proof.

Theorem 4.6.1. [Generalized binomial theorem] Let n be any real number. Then,

$$(1 + x)^n = 1 + C(n, 1)x + C(n, 2)x^2 + \cdots + C(n, r)x^r + \cdots.$$

In particular, $(1 - x)^{-1} = 1 + x + x^2 + x^3 + \cdots$ and if a, b ∈ R with |a| < |b|, then

$$(a + b)^n = b^n \Big(1 + \frac{a}{b}\Big)^n = b^n \sum_{r \ge 0} C(n, r) \Big(\frac{a}{b}\Big)^r = \sum_{r \ge 0} C(n, r)\, a^r b^{n-r}.$$

Let us now understand Theorem 4.6.1 through the following examples.

(a) Let $n = \frac{1}{2}$. In this case, for k ≥ 1, Equation (4.2) gives
$$C\Big(\frac{1}{2}, k\Big) = \frac{\frac{1}{2} \cdot (\frac{1}{2} - 1) \cdots (\frac{1}{2} - k + 1)}{k!} = \frac{1 \cdot (-1) \cdots (3 - 2k)}{2^k\, k!} = \frac{(-1)^{k-1} (2k - 2)!}{2^{2k-1} (k - 1)!\, k!}.$$
Thus,
$$(1 + x)^{1/2} = \sum_{k \ge 0} C\Big(\frac{1}{2}, k\Big) x^k = 1 + \frac{1}{2} x + \frac{-1}{2^3} x^2 + \frac{1}{2^4} x^3 + \sum_{k \ge 4} \frac{(-1)^{k-1} (2k - 2)!}{2^{2k-1} (k - 1)!\, k!}\, x^k.$$

The above expression can also be obtained by using the Taylor series expansion of
$f(x) = (1 + x)^{1/2}$ around x = 0. Recall that the Taylor series expansion of f(x)
around x = 0 equals $f(x) = f(0) + f'(0)x + \frac{f''(0)}{2!} x^2 + \sum_{k \ge 3} \frac{f^{(k)}(0)}{k!} x^k$, where f(0) = 1,
$f'(0) = \frac{1}{2}$, $f''(0) = \frac{-1}{2^2}$ and in general $f^{(k)}(0) = \frac{1}{2} \cdot (\frac{1}{2} - 1) \cdots (\frac{1}{2} - k + 1)$, for k ≥ 3.
(b) Let n = −r, where r ∈ N. Then, for k ≥ 1, Equation (4.2) gives
$$C(-r, k) = \frac{-r \cdot (-r - 1) \cdots (-r - k + 1)}{k!} = (-1)^k\, C(r + k - 1, k).$$
Thus,
$$(1 + x)^n = \frac{1}{(1 + x)^r} = 1 - rx + C(r + 1, 2)x^2 + \sum_{k \ge 3} C(r + k - 1, k)(-x)^k.$$
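The two closed forms in (a) and (b) can be checked with exact rational arithmetic. The sketch below assumes a helper named `gen_binom` (not part of the notes) that evaluates the product formula of Equation (4.2); the product formula already returns the values listed in the special cases n = 0 and n = k.

```python
from fractions import Fraction
from math import comb, factorial


def gen_binom(n, k):
    """Generalized C(n, k) = n(n-1)...(n-k+1)/k! for rational n, integer k."""
    if k < 0:
        return Fraction(0)
    result = Fraction(1)
    for j in range(k):
        # Multiply by (n - j) and divide by (j + 1), one factor at a time.
        result = result * (Fraction(n) - j) / (j + 1)
    return result


half = Fraction(1, 2)
for k in range(1, 12):
    # Closed form from part (a).
    closed = Fraction((-1) ** (k - 1) * factorial(2 * k - 2),
                      2 ** (2 * k - 1) * factorial(k - 1) * factorial(k))
    assert gen_binom(half, k) == closed

for r in range(1, 6):
    for k in range(1, 8):
        # Identity from part (b).
        assert gen_binom(-r, k) == (-1) ** k * comb(r + k - 1, k)
```

Using `Fraction` avoids floating-point error, so the comparisons are exact.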

2. Let n, m ∈ N. Recall the identity $n^m = \sum_{k=0}^{m} C(n, k)\, k!\, S(m, k) = \sum_{k=0}^{n} C(n, k)\, k!\, S(m, k)$ in
Equation (4.1). Note that for each m ∈ N, the above identity equals X = AY, where
$$X = \begin{pmatrix} 0^m \\ 1^m \\ 2^m \\ \vdots \\ n^m \end{pmatrix}, \quad
A = \begin{pmatrix}
C(0, 0) & 0 & 0 & \cdots & 0 \\
C(1, 0) & C(1, 1) & 0 & \cdots & 0 \\
C(2, 0) & C(2, 1) & C(2, 2) & \cdots & 0 \\
\vdots & \vdots & \vdots & \ddots & \vdots \\
C(n, 0) & C(n, 1) & C(n, 2) & \cdots & C(n, n)
\end{pmatrix} \quad \text{and} \quad
Y = \begin{pmatrix} 0!\,S(m, 0) \\ 1!\,S(m, 1) \\ 2!\,S(m, 2) \\ \vdots \\ n!\,S(m, n) \end{pmatrix}.$$

As A is lower triangular with det(A) = 1, it has an inverse and each entry of $A^{-1}$ has a
similar form. So, $Y = A^{-1}X$, where
$$A^{-1} = \begin{pmatrix}
C(0, 0) & 0 & 0 & 0 & \cdots & 0 \\
-C(1, 0) & C(1, 1) & 0 & 0 & \cdots & 0 \\
C(2, 0) & -C(2, 1) & C(2, 2) & 0 & \cdots & 0 \\
-C(3, 0) & C(3, 1) & -C(3, 2) & C(3, 3) & \cdots & 0 \\
\vdots & \vdots & \vdots & \vdots & \ddots & \vdots \\
(-1)^n C(n, 0) & (-1)^{n-1} C(n, 1) & (-1)^{n-2} C(n, 2) & (-1)^{n-3} C(n, 3) & \cdots & C(n, n)
\end{pmatrix}.$$

Hence, for n, m ∈ N, we have

$$S(m, n) = \frac{1}{n!} \sum_{k \ge 0} (-1)^k C(n, k) (n - k)^m. \tag{4.3}$$

3. The above matrix inversion implies that for n ∈ N0, the identity

$$a(n) = \sum_{k \ge 0} C(n, k)\, b(k) \ \text{ holds if and only if } \ b(n) = \sum_{k \ge 0} (-1)^{n-k} C(n, k)\, a(k) \ \text{ holds.}$$
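Equation (4.3) and the inversion in item 3 can be sanity-checked numerically. In the sketch below all function names are assumptions made for illustration. The comparison for S(m, n) uses the standard recurrence S(m, n) = n S(m − 1, n) + S(m − 1, n − 1), which is assumed here and not taken from this section. The inverse transform uses the signs $(-1)^{n-k}$, matching the entries of $A^{-1}$ displayed above.

```python
from math import comb, factorial


def stirling2_formula(m, n):
    """S(m, n) computed from the alternating sum in Equation (4.3)."""
    total = sum((-1) ** k * comb(n, k) * (n - k) ** m for k in range(n + 1))
    return total // factorial(n)  # the sum is always a multiple of n!


def stirling2_recurrence(m, n):
    """S(m, n) from the standard recurrence (assumed, see lead-in)."""
    if m == 0 and n == 0:
        return 1
    if m == 0 or n == 0:
        return 0
    return n * stirling2_recurrence(m - 1, n) + stirling2_recurrence(m - 1, n - 1)


def binomial_transform(b):
    """a(n) = sum_k C(n, k) b(k), i.e. multiplication by the matrix A."""
    return [sum(comb(n, k) * b[k] for k in range(n + 1)) for n in range(len(b))]


def inverse_binomial_transform(a):
    """b(n) = sum_k (-1)^(n-k) C(n, k) a(k), i.e. multiplication by A^{-1}."""
    return [sum((-1) ** (n - k) * comb(n, k) * a[k] for k in range(n + 1))
            for n in range(len(a))]


for m in range(1, 8):
    for n in range(1, 8):
        assert stirling2_formula(m, n) == stirling2_recurrence(m, n)

sample = [3, 1, 4, 1, 5, 9, 2, 6]
assert inverse_binomial_transform(binomial_transform(sample)) == sample
```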

We end this chapter with another set of exercises.

Exercise 4.6.2. 1. Prove that there exists a bijection between any two of the following sets.

(a) The set of words of length n on an alphabet consisting of m letters.

(b) The set of maps of an n-set into an m-set.
(c) The set of distributions of n distinct objects into m distinct boxes.
(d) The set of n-tuples on m letters.

2. Prove that there exists a bijection between any two of the following sets.

(a) The set of n letter words with distinct letters out of an alphabet consisting of m letters.
(b) The set of one-one functions from an n-set into an m-set.
(c) The set of distributions of n distinct objects into m distinct boxes, subject to ‘if an
object is put in a box, no other object can be put in the same box’.
(d) The set of n-tuples on m letters, without repetition.
(e) The set of permutations of m symbols taken n at a time.

3. Prove that there exists a bijection between any two of the following sets.

(a) The set of increasing words of length n on m ordered letters.

(b) The set of distributions on n non-distinct objects into m distinct boxes.
(c) The set of combinations of m symbols taken n at a time with repetitions permitted.
Chapter 5

Advanced Counting Principles

5.1 Pigeonhole Principle

Discussion 5.1.1. [Pigeonhole principle (PHP)]
PHP1. If n + 1 pigeons stay in n holes then there is a hole with at least two pigeons.
PHP2. If kn + 1 pigeons stay in n holes then there is a hole with at least k + 1 pigeons.
PHP3. If p1 + · · · + pn + 1 pigeons stay in n holes then there is a hole i with at least pi + 1
pigeons.
Example 5.1.2. 1. Consider a tournament of n > 1 players, where each pair plays exactly
once and each player wins at least once. Then, there are two players with the same number
of wins.
Ans: The number of wins varies from 1 to n − 1 and there are n players.

2. A bag contains 5 red, 8 blue, 12 green and 7 yellow marbles. The least number of marbles
to be chosen to ensure that there are
(a) at least 4 marbles of the same color is 13,
(b) at least 7 marbles of the same color is 24,
(c) at least 4 red or at least 7 of any other color is 22.

3. In a group of 6 people, prove that there are three mutual friends or three mutual strangers.
Ans: Let a be a person in the group. Let F be the set of friends of a and S the set of
strangers to a. Clearly |S| + |F | = 5. By PHP either |F | ≥ 3 or |S| ≥ 3.
Case 1: |F | ≥ 3. If any two in F are friends then those two along with a are three mutual
friends. Else F is a set of mutual strangers of size at least 3.
Case 2: |S| ≥ 3. If any pair in S are strangers then those two along with a are three
mutual strangers. Else S becomes a set of mutual friends of size at least 3.
4. If 7 points are chosen inside or on the unit circle, then there is a pair of points which are
at a distance at most 1.
Ans: To see this divide the circle into 6 equal sectors, each making an angle of 60° at
the center. By PHP there is a sector containing at least two points. The distance
between these two is at most 1.


5. If n+1 integers are selected from {1, 2, . . . , 2n}, then there is a pair which has the property
that one of them divides the other.
Ans: Each number has the form $2^k s$, where s = 2m + 1 is an odd number. There are n
odd numbers in {1, 2, . . . , 2n}. If we select n + 1 numbers from this set, by PHP some two of
them (say, x, y) have the same odd part, that is, $x = 2^i s$ and $y = 2^j s$. If i ≤ j, then x | y, otherwise y | x.
6. (a) Let $r_1, r_2, \ldots, r_{mn+1}$ be a sequence of mn + 1 distinct real numbers. Then, prove that
there is a subsequence of m + 1 numbers which is increasing or there is a subsequence
of n + 1 numbers which is decreasing.
Ans: Define $l_i$ to be the maximum length of an increasing subsequence starting at
$r_i$. If some $l_i \ge m + 1$ then we have nothing to prove. So, let $1 \le l_i \le m$. Since
$(l_i)$ is a sequence of mn + 1 integers, by PHP, there is one number which repeats at
least n + 1 times. Let $l_{i_1} = l_{i_2} = \cdots = l_{i_{n+1}} = s$, where $i_1 < i_2 < \cdots < i_{n+1}$. Notice
that $r_{i_1} > r_{i_2}$, because if $r_{i_1} < r_{i_2}$, then ‘$r_{i_1}$ together with the increasing sequence
of length s starting with $r_{i_2}$’ gives an increasing sequence of length s + 1. Similarly,
$r_{i_2} > r_{i_3} > \cdots > r_{i_{n+1}}$ and hence the required result holds.
Alternate. Let $S = \{r_1, r_2, \ldots, r_{mn+1}\}$ and define a map f : S → Z × Z by
$f(r_i) = (s, t)$, for 1 ≤ i ≤ mn + 1, where s equals the length of the largest increasing
subsequence starting with $r_i$ and t equals the length of the largest decreasing
subsequence ending at $r_i$. Now, if either s ≥ m + 1 or t ≥ n + 1, we are done. If not,
then note that 1 ≤ s ≤ m and 1 ≤ t ≤ n. So, the number of tuples (s, t) is at most
mn. Thus, the mn + 1 distinct numbers are being mapped to mn tuples and hence
by PHP there are two numbers $r_i \ne r_j$ such that $f(r_i) = f(r_j)$. Now, proceed as in
the previous case to get the required result.

(b) Does the above statement hold for every collection of mn distinct numbers? No.
Consider the sequence:

n, n−1, · · · , 1, 2n, 2n−1, . . . , n+1, 3n, 3n−1, · · · , 2n+1, · · · , mn, mn−1, · · · , mn−n+1.
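Both parts of item 6 can be explored by brute force for small m and n. The sketch below is illustrative only; the helper names `longest_monotone` and `extremal_sequence` are assumptions and do not appear in the notes.

```python
from itertools import permutations


def longest_monotone(seq, increasing=True):
    """Length of the longest strictly increasing (or decreasing) subsequence."""
    best = []  # best[i] = longest monotone subsequence ending at position i
    for i, x in enumerate(seq):
        length = 1
        for j in range(i):
            extends = (seq[j] < x) if increasing else (seq[j] > x)
            if extends:
                length = max(length, best[j] + 1)
        best.append(length)
    return max(best, default=0)


def extremal_sequence(m, n):
    """The sequence from part (b): m decreasing blocks, each of length n."""
    return [block * n - offset
            for block in range(1, m + 1)
            for offset in range(n)]


# Part (a) for m = n = 2: every arrangement of 5 distinct numbers works.
m, n = 2, 2
assert all(longest_monotone(p) >= m + 1 or longest_monotone(p, False) >= n + 1
           for p in permutations(range(m * n + 1)))

# Part (b): the block sequence of length mn has neither long subsequence.
seq = extremal_sequence(3, 4)
assert longest_monotone(seq) == 3 and longest_monotone(seq, False) == 4
```

The exhaustive check in part (a) grows factorially, so it is only practical for very small m and n.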

7. Given any 1010 integers, prove that there is a pair that either differ by, or sum to, a
multiple of 2017. Is this true if we replace 1010 by 1009?
Ans: Let the numbers be $n_1, n_2, \ldots, n_{1010}$ and $S = \{n_1 - n_k,\ n_1 + n_k : k = 2, \ldots, 1010\}$.
Then, |S| = 2018 and hence, at least two of them will have the same remainder when
divided by 2017. Then, consider their difference. For the latter part, consider {0, 1, 2, . . . , 1008}.
8. Let a ∈ R \ Q. Then, there are infinitely many rational numbers $\frac{p}{q}$ such that $\left|a - \frac{p}{q}\right| < \frac{1}{q^2}$.
Ans: Enough to show that there are infinitely many $(p, q) \in Z^2$ with $|qa - p| < \frac{1}{q}$. As a
is irrational, note that for every m ∈ N, $0 < ia - \lfloor ia \rfloor < 1$, for i = 1, . . . , m + 1. Hence, by
PHP there exist i, j with i < j such that
$$|(j - i)a - (\lfloor ja \rfloor - \lfloor ia \rfloor)| < \frac{1}{m} \le \frac{1}{j - i}.$$
Then, the tuple $(p_1, q_1) = (\lfloor ja \rfloor - \lfloor ia \rfloor,\ j - i)$ satisfies the required property. To generate
another tuple, find $m_2$ such that
$$\frac{1}{m_2} < \left|a - \frac{p_1}{q_1}\right|$$
and proceed as before to get $(p_2, q_2)$ such that $|q_2 a - p_2| < \frac{1}{m_2} \le \frac{1}{q_2}$. Since $\left|a - \frac{p_2}{q_2}\right| < \frac{1}{m_2} < \left|a - \frac{p_1}{q_1}\right|$,
we have $\frac{p_1}{q_1} \ne \frac{p_2}{q_2}$. Now use induction to get the required result.
9. Prove that there exist two powers of 3 whose difference is divisible by 2017.
Ans: Let $S = \{1 = 3^0, 3, 3^2, 3^3, \ldots, 3^{2017}\}$. Then, |S| = 2018. As the remainder of any
integer when divided by 2017 is one of 0, 1, 2, . . . , 2016, by PHP, there is a pair which has the
same remainder. Hence, 2017 divides $3^j - 3^i$ for some i, j.
10. Prove that there exists a power of three that ends with 0001.
Ans: Let $S = \{1 = 3^0, 3, 3^2, 3^3, \ldots\}$. Now, divide each element of S by $10^4$. As $|S| > 10^4$,
by PHP, there exist i > j such that the remainders of $3^i$ and $3^j$, when divided by $10^4$, are
equal. So, $10^4$ divides $3^j (3^{\ell} - 1)$, where ℓ = i − j. But $\gcd(10^4, 3) = 1$ and thus, $10^4$ divides
$3^{\ell} - 1$. That is, $3^{\ell} - 1 = s \cdot 10^4$ for some positive integer s. That is, $3^{\ell} = s \cdot 10^4 + 1$ and hence
the result follows.
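The argument in item 10 is constructive: scanning the remainders of successive powers of 3 must produce a repeat within $10^4 + 1$ steps. The following sketch (the function name `repeated_power_of_three` is an assumption for illustration) finds such a repeat and the resulting exponent ℓ.

```python
def repeated_power_of_three(modulus=10 ** 4):
    """Return (j, i) with j < i and 3**i, 3**j leaving equal remainders."""
    seen = {}  # remainder -> first exponent that produced it
    power = 1  # 3**0 reduced modulo `modulus`
    for exponent in range(modulus + 1):
        if power in seen:
            return seen[power], exponent
        seen[power] = exponent
        power = power * 3 % modulus
    # modulus + 1 exponents cannot give modulus + 1 distinct remainders.
    raise AssertionError("unreachable by the pigeonhole principle")


j, i = repeated_power_of_three()
ell = i - j
assert pow(3, ell, 10 ** 4) == 1          # 3**ell = s * 10**4 + 1
assert str(3 ** ell).endswith("0001")
```

The dictionary plays the role of the pigeonholes: each remainder is a hole and each exponent a pigeon.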
Exercise 5.1.3. 1. Consider the poset (X = P({1, 2, 3, 4}), ⊆). Write 6 maximal chains
$P_1, \ldots, P_6$ (need not be disjoint) such that $\bigcup_i P_i = X$. Let $A_1, \ldots, A_7$ be 7 distinct subsets
of {1, 2, 3, 4}. Use PHP, to prove that there exist i, j such that $A_i, A_j \in P_k$, for some k.
That is, $\{A_1, \ldots, A_7\}$ cannot be an anti-chain. Conclude that this holds as the width of
the poset is 6.
2. Let $\{x_1, \ldots, x_9\} \subseteq N$ with $\sum_{i=1}^{9} x_i = 30$. Then, prove that there exist i, j, k ∈ {1, 2, . . . , 9}
with $x_i + x_j + x_k \ge 12$.
Ans: Note that $\frac{1}{9} \sum_{i=1}^{9} x_i = \frac{30}{9} = 3 + \frac{3}{9}$. Now use PHP.

3. Pick any 6 integers from {1, 2, . . . , 10}, then there exists a pair with odd sum.
Ans: Each such set must contain an even number and an odd number.
4. Any 14-subset of {1, 2, . . . , 46} has four elements a, b, c, d such that a + b = c + d.
Ans: We have C(14, 2) = 91 pairs with sum between 3 and 91. By PHP, there are two pairs
{a, b} ≠ {c, d} with the same sum.
5. In a row of 12 chairs 9 are filled. Then, some 3 consecutive chairs are filled. Will 8 work?
Ans: Divide the chairs into 4 blocks [C1, C2, C3], [C4, C5, C6], [C7, C8, C9], [C10, C11, C12]
and apply PHP as $\frac{9}{4} = 2 + \frac{1}{4}$.
6. Every n-sequence of integers has a consecutive subsequence with sum divisible by n.
Ans: Let the sequence be $(l_i)$ and put $r_i \equiv \sum_{j=1}^{i} l_j \pmod{n}$. If some $r_i = 0$, then we have
nothing to show. Otherwise, by PHP, there are i < j with $r_i = r_j$. Then, $n \mid (l_{i+1} + \cdots + l_j)$.
7. Let n > 3 and S ⊆ {1, 2, . . . , n} of size $m = \lfloor \frac{n + 2}{2} \rfloor + 1$. Then, there exist a, b, c ∈ S such
that a + b = c.
Ans: Let s1 < s2 < . . . < sm be the elements of S. Consider A = {s2 , . . . , sm } and
B = {s2 − s1 , . . . , sm − s1 }. Since |A| + |B| = 2m − 2 > n, |A ∩ B| ≥ 1.
8. Let a, b ∈ N, a < b. Given more than half of the integers in the set {1, 2, . . . , a + b}, there
is a pair which differ by either a or b.

Ans: Think of 1, . . . , a + b as pigeons. Define the holes {1, a + 1}, {2, a + 2}, . . . , {b, a +
b}, {b + 1, 1}, . . . , {b + a, a}. Notice that each pigeon has 2 holes. There are a total of a + b
holes. So, if we select more than half of the pigeons, some two will share a hole.
9. Consider a chess board with two of the diagonally opposite corners removed. Is it possible to
cover the board with pieces of rectangular dominos whose size is exactly two board squares?
Ans: Any rectangular domino whose size is exactly 2 board squares covers exactly one black
and exactly one white square which are adjacent. But, the diagonally opposite corners are of
the same color and hence their removal gives unequal number of white and black squares.
10. Mark the centers of all squares of an 8 × 8 chess board. Is it possible to cut the board with
13 straight lines not passing through any center, so that every piece had at most 1 center?
Ans: Suppose that we have a set of lines that cuts the chess board into pieces so that each
piece contains at most one center. Consider the square formed by the four corner middle points
a, b, c, d. Observe that the lines must intersect the segment ab at least at 7 points (not at the
centers). The same is true for the line segments bc, cd, da. Notice that any line that does not
pass through the centers can intersect with at most two of these four line segments. So, to
create 28 points of intersection we need at least 14 lines.
11. Fifteen squirrels have 100 nuts. Then, some two squirrels have equal number of nuts.
Ans: Let H = {n : n nuts are gathered by some squirrel}. As $\frac{14 \times 15}{2} = 105 > 100$, |H| ≤ 14.
There are 15 squirrels. By PHP some two have gathered the same number of nuts.
12. Suppose that f (x) is a polynomial with integer coefficients. If

(a) f (x) = 2 for three distinct integers, then for no integer x, f (x) can be equal to 3.
(b) f (x) = 14 for three distinct integers, then for no integer x, f (x) can be equal to 15.
(c) f (x) = 11 for five distinct integers, then for no integer x, f (x) can be equal to 9.

Ans: Let f (x) = 2, for x ∈ {a, b, c}. If f (d) = 3, for an integer d, then (d−a)|f (d)−f (a) = 1.
So, a = d ± 1. Similarly b, c = d ± 1. By PHP two of a, b, c are the same, a contradiction.
Alternate: If f is an integer polynomial and f (m) = 0 for some integer m, then f (x) =
(x − m)g(x), where g is an integer polynomial (use factor/remainder theorem). For our
problem, we see that f (x) = (x − a)(x − b)(x − c)g(x) + 2, where g is an integer polynomial.
If f (n) = 3, then (n − a), (n − b), (n − c)|1, so that (n − a), (n − b), (n − c) ∈ {1, −1}. By
PHP some two of them are the same, a contradiction.
13. Choose 5 points at random inside an equilateral triangle of side 1 unit, then there exists a
pair which have distance at most 0.5 units.
Ans: Divide the triangle into 4 small equilateral triangles of side $\frac{1}{2}$. By PHP one of them
contains 2 points.
14. Prove that among any 55 integers 1 ≤ x1 < x2 < x3 < · · · < x55 ≤ 100, there is a pair with
difference 9, a pair with difference 10, a pair with difference 12 and a pair with difference
13. Surprisingly, there need not be a pair with difference 11.
Ans: Suppose that no two differ by 9. Consider the remainders $r_i \equiv x_i \pmod 9$. By PHP
some $r_{i_1} = \cdots = r_{i_7}$. As the $x_{i_j}$’s are equivalent modulo 9 and they do not differ by 9 (assumption),
the difference between any two of the $x_{i_j}$’s is at least 18. So, $x_{55} \ge x_{i_7} \ge 18 + x_{i_6} \ge \cdots \ge 109$,
a contradiction. This method works for 10, 13. For 12, first show that $x_{49} \ge 97$. A sequence
of 55 numbers where no two differ by 11 is
1, · · · , 11, 23, 24, · · · , 33, 45, 46, · · · , 55, 67, 68, · · · , 77, 89, 90, 91, · · · , 99.
15. Let {x1 , x2 , . . . , xn } ⊆ Z. Prove that there exist 1 ≤ i ≤ j ≤ n such that
(a) xi + xi+1 + · · · + xj−1 + xj is a multiple of 2017, whenever n ≥ 2017.
Ans: Consider the n numbers x1 , x1 + x2 , x1 + x2 + x3 , . . . , x1 + x2 + · · · + xn . If 2017
divides one of them then we are done. Else, they give n remainders between 1 and 2016.
So, if n ≥ 2017, then by PHP there will be at least 2 remainders which are the same.
(b) xj + xi or xj − xi is a multiple of 2017, whenever n ≥ 1010.

16. Let A and B be two discs, each having 2n equal sectors. On disc A, n sectors are colored
red and n are colored blue. The sectors of disc B are colored arbitrarily with red and blue
colors. Show that there is a way of putting the two discs, one above the other, so that at
least n corresponding sectors have the same colors.
Ans: Put the disc B above disc A so that their sectors match. Now, for 1 ≤ i ≤ 2n, let $\alpha_i$
be the number of sectors of the same color when the disc B is rotated by an angle $\frac{i\pi}{n}$. Then,
$\sum_{i=1}^{2n} \alpha_i = 2n^2$ as after 2n rotations, each sector of B will eventually lie above all the n sectors
with which it shares its color. Hence, $\frac{\alpha_1 + \cdots + \alpha_{2n}}{2n} = n$ and thus by PHP, the required
result follows.

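17. There are 7 distinct real numbers. Is it possible to select two of them, say x and y such
that $0 < \frac{x - y}{1 + xy} < \frac{1}{\sqrt{3}}$?

Ans: Since for any x ∈ R there exists a unique $\theta \in (-\frac{\pi}{2}, \frac{\pi}{2})$ such that tan θ = x, we have 7
numbers in $(-\frac{\pi}{2}, \frac{\pi}{2})$, so there exists a pair for which the difference is positive and less than $\frac{\pi}{6}$.
Now use $\tan(\theta_1 - \theta_2) = \frac{\tan \theta_1 - \tan \theta_2}{1 + \tan \theta_1 \tan \theta_2}$.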
18. If n is odd then for any permutation p of {1, 2, . . . , n} the product $\prod_{i=1}^{n} (i - p(i))$ is even.
Ans: Since n is odd, the number of odd integers is 1 more than the number of even integers.
Hence, all the odd integers cannot be mapped to even integers.
19. Fix a positive α ∈ R \ Q. Then, S = {m + nα : m, n ∈ Z} is dense in R.
Ans: Given any interval (a, b), there exists n ∈ N such that $\frac{1}{n} < b - a$ (Archimedean
property). Note that $0 < r_k = k\alpha - \lfloor k\alpha \rfloor < 1$, k = 1, . . . , n + 1. By PHP, some two satisfy
$0 < r_i - r_j < 1/n$. Then $x = r_i - r_j = (i - j)\alpha + \lfloor j\alpha \rfloor - \lfloor i\alpha \rfloor \in S$. Let p be the smallest
integer so that px > a. If px ≥ b, then $(a, b) \subseteq ((p - 1)x,\ px)$ and so $b - a \le x < \frac{1}{n}$, which
is not possible. So, px ∈ (a, b) and px ∈ S as well.

20. If more than half of the subsets of {1, 2, . . . , n} are selected, then some two of the selected
subsets have the property that one is a subset of the other.

Ans: Notice that $P(\{1, 2, \ldots, n\}) = \bigcup_{A \subseteq \{1, 2, \ldots, n-1\}} \{A,\ A \cup \{n\}\}$. If F ⊆ P({1, 2, . . . , n}) is
a collection of size $2^{n-1} + 1$, then by PHP there is an A ⊆ {1, 2, . . . , n − 1}, such that both A
and A ∪ {n} are in F.

21. Given any ten 4-subsets of {1, 2, . . . , 11}, some two of them have at least 2 elements in
common.
Ans: Let Ai ⊆ {1, 2 . . . , 11} have size 4, i = 1, . . . , 10. Let mi be the number of occurrences
of i in these 10 subsets. So m1 + · · · + m11 = 40. By PHP at least one mi ≥ 4, say m1 . That
is, some four Ai s contain 1, say,

{1, x1 , x2 , x3 }, {1, x4 , x5 , x6 }, {1, x7 , x8 , x9 }, {1, x10 , x11 , x12 }.

Again by PHP some $x_i = x_j$. Note that both $x_i$ and $x_j$ cannot be in the same set.
Alternate. The contribution of element 1 to $\sum |A_i \cap A_j|$ is $C(m_1, 2)$. Note that the
minimum value of $\sum_i m_i^2$ is obtained when each pair, $m_i, m_j$, differs by at most 1. Thus, one
has $\frac{1}{2} \sum m_i^2 - \frac{1}{2} \sum m_i \ge \frac{1}{2}(7 \times 4^2 + 4 \times 3^2) - 20 = 54$. There are C(10, 2) = 45 $A_i \cap A_j$’s.
So, by PHP, some $A_i \cap A_j$ will get contribution from two elements.

22. A person takes at least one aspirin a day for 30 days. If he takes 45 aspirin altogether
then prove that in some sequence of consecutive days he takes exactly 14 aspirins.
Ans: Let di be the number of aspirins taken in day i, put sj = d1 + · · · + dj and rj ≡ sj
(mod 14). By PHP some ri = rj = rk . In that case 14|(sj − si ) and 14|(sk − sj ). If
sk − sj 6= 14 and sj − si 6= 14, then sk ≥ 28 + sj ≥ 56 + si ≥ 57, a contradiction.

Alternate. Consider the numbers $s_1 < s_2 < \cdots < s_{30}$ and $s_1 + 14 < s_2 + 14 < \cdots < s_{30} + 14$.
Then, each of the numbers lies between 1 and 59, whereas we have 60 numbers. So, $s_i + 14 = s_j$,
for some i, j.
23. If 58 entries of a 14 × 14 matrix are 1, then there is a 2 × 2 submatrix with all entries 1.
Ans: Let $n(i, j) = \sum_{k=1}^{14} a_{ki} a_{kj}$ (the number of 1’s common in columns i and j). Put
$N = \sum_{i<j} n(i, j)$. On the other hand if row i has 1 at $r_i$ places, then it will contribute $C(r_i, 2)$ to N.
Thus
$$N = \sum_i C(r_i, 2) = \frac{1}{2} \Big[ \sum_i r_i^2 - \sum_i r_i \Big] = \frac{1}{2} \sum_i r_i^2 - 29 \ge 121 - 29 = 92.$$
In the above we used that the minimum value of $\sum_i r_i^2$ is obtained when each pair differs by
at most 1. There are C(14, 2) = 91 many n(i, j)’s. By PHP, some n(i, j) ≥ 2.
24. Let A and B be two finite non-empty sets with B = {b1 , b2 , . . . , bm }. Let f : A → B be any
function. Then, for any non-negative integers a1 , a2 , . . . , am if |A| = a1 +a2 +· · ·+am −m+1
then prove that there exists an i, 1 ≤ i ≤ m such that $|f^{-1}(b_i)| \ge a_i$.
Ans: Suppose not. Then, for each i, 1 ≤ i ≤ m, $|f^{-1}(b_i)| \le a_i - 1$. As the sets
$f^{-1}(b_i)$ are pairwise-disjoint and $A = \bigcup_{i=1}^{m} f^{-1}(b_i)$, we see that
$$|A| = \Big| \bigcup_{i=1}^{m} f^{-1}(b_i) \Big| = \sum_{i=1}^{m} |f^{-1}(b_i)| \le \sum_{i=1}^{m} (a_i - 1) = a_1 + a_2 + \cdots + a_m - m.$$
This contradicts the given condition
that $|A| = a_1 + a_2 + \cdots + a_m - m + 1$. Thus, the result follows.
25. Each of the given 9 lines cuts a given square into two quadrilaterals whose areas are in the
ratio 2 : 3. Prove that at least three of these lines pass through the same point.

Ans: Each line passes through opposite edges. By PHP, some 5 lines $L_1, \ldots, L_5$ pass through
a pair of opposite sides, say, ab, cd. Consider two new lines mn and m′n′ which divide the
square into two quadrilaterals in the required way.
Imagine ab and cd are vertical sides of a unit square. Then mn is a horizontal segment at height 3/5
and m′n′ is at height 2/5. Argue that each $L_i$ passes through the midpoint of mn or m′n′. By PHP, some
3 lines will pass through one midpoint.
26. Five points are chosen at the nodes of a square lattice (view Z × Z). Why is it certain that
a mid-point of some two of them is a lattice point?
Ans: The points can be of the type (even,even), (even,odd), (odd,even), (odd,odd). Since
there are only four types and there are five points some two of them are of the same type.
Consider the middle point of these two points.
27. Take 25 points on a plane satisfying ‘among any three of them there is a pair at a distance
less than 1’. Then, some circle of unit radius contains at least 13 of the given points.
Ans: Select a point a and consider the circle Γa of radius 1 around a. Let b be a point outside
this circle (if no such points exists, then we have nothing to prove). Consider the circle Γb of
radius 1 around b. Notice that if there is a point c outside these two discs, then a, b, c do not
satisfy the hypothesis. So, the union of these two discs contains all the 25 points. By PHP,
one of them contains at least 13 points.
28. If each point of a circle is colored either red or blue, then show that there exists an isosceles
triangle with vertices of the same color.
Ans: Consider the points $e^{2\pi i k/5}$, k = 0, 1, . . . , 4 on the unit circle (the vertices of a regular
pentagon). By PHP, some 3 of them have the same color. Notice that they form an isosceles triangle.
29. Each point of the plane is colored red or blue, then prove the following.
(a) There exist two points of the same color which are at a distance of 1 unit.
Ans: Take a point, say P . Draw a unit circle with P as the center. If all the points on
the circumference have the same color then we are done. Else, the circumference contains
a point which has the same color as that of P .
(b) There is an equilateral triangle all of whose vertices have the same color.
Ans: Start with a hexagon of unit length. If the vertices have the same color, we are
done. Else, the center of the hexagon has the same color as that of a vertex. Now,
proceed to get a few cases to complete the argument.
(c) There is a rectangle all of whose vertices have the same color.
Ans: Consider a 7 × 7 grid. By PHP some 25 points have the same color, say, blue.
Again by PHP some 3 rows (say, the first three rows) have at least 12 blue points. Let
Ri ⊆ {1, 2, . . . , 7} be the positions where i-th row has blue points. If some |Ri ∩ Rj | ≥ 2,
we are done. So, let all $|R_i \cap R_j| \le 1$. Then,
$$7 \ge |R_1 \cup R_2 \cup R_3| \ge |R_1| + |R_2| + |R_3| - \sum_{i<j} |R_i \cap R_j| \ge 12 - 3 = 9,$$
not possible.

Alternate. Find four points on a line of the same color, say blue. Consider the
corresponding points on a parallel line. What are their colors? Consider another parallel line.
Alternate. Follows from 23.

30. Let S ⊆ {1, 2, . . . , 100} be a 10-set. Then, some two disjoint subsets of S have equal sum.
Ans: For each nonempty subset V of S call $f(V) = \sum_{n \in V} n$ the weight of V. Notice that
1 ≤ f(V) ≤ 1000. There are $2^{10} - 1 = 1023$ nonempty subsets of S. By PHP, some two, say U
and V, have the same weight. Then, the sets U \ V and V \ U are two nonempty disjoint sets with the
same weight.
31. Fix a positive integer n. Prove that there exists an ` ∈ N such that n divides 2` − 1.
32. Does there exist a multiple of 2017 that is formed using only the digits
(a) 2? Justify your answer.
Ans: Consider $a_i \equiv \overbrace{1 \cdots 1}^{i} \pmod{2017}$, i = 1, . . . , 2018. By PHP some two are the
same. Take the difference. Now multiply by 2.
(b) 2 and 3 and the number of 2’s and 3’s are equal? Justify your answer.
Ans: Use aj obtained in the previous part to construct the required number by adding
certain multiple.
33. Each natural number has a multiple of the form 9 · · · 90 · · · 0, with at least one 9.
Ans: Consider $a_i \equiv \overbrace{9 \cdots 9}^{i} \pmod n$, i = 1, . . . , n + 1. By PHP some two are the same. Take
the difference.
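The pigeonhole argument of item 33 translates directly into a search: track the remainders of 9, 99, 999, . . . modulo n until one repeats, then subtract. The sketch below is illustrative; the function name `nines_then_zeros_multiple` is an assumption and not from the notes.

```python
def nines_then_zeros_multiple(n):
    """Return a multiple of n whose digits are some 9's followed by 0's."""
    seen = {}      # remainder -> number of 9's that first produced it
    remainder = 0
    for count in range(1, n + 2):       # n + 1 pigeons, n possible remainders
        remainder = (remainder * 10 + 9) % n
        if remainder in seen:
            shorter = seen[remainder]
            # (count 9's) minus (shorter 9's) is
            # (count - shorter) 9's followed by `shorter` 0's.
            return int("9" * (count - shorter) + "0" * shorter)
        seen[remainder] = count
    raise AssertionError("unreachable by the pigeonhole principle")


for n in range(1, 60):
    multiple = nines_then_zeros_multiple(n)
    assert multiple % n == 0
```

The returned number is not necessarily the smallest multiple of the required form; it is simply the first one the repeat produces.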

5.2 Principle of Inclusion and Exclusion

We start this section with the following example.

Example 5.2.1. How many natural numbers n ≤ 1000 are not divisible by any of 2, 3?
Ans: Let A2 = {n ∈ N | n ≤ 1000, 2|n} and A3 = {n ∈ N | n ≤ 1000, 3|n}. Then,
|A2 ∪ A3 | = |A2 | + |A3 | − |A2 ∩ A3 | = 500 + 333 − 166 = 667. So, the required answer is
1000 − 667 = 333.

We now generalize the above idea whenever we have 3 or more sets.

Theorem 5.2.2. [Principle of inclusion and exclusion] Let $A_1, \ldots, A_n$ be finite subsets of a
set U. Then,
$$\Big| \bigcup_{i=1}^{n} A_i \Big| = \sum_{k=1}^{n} (-1)^{k+1} \sum_{1 \le i_1 < \cdots < i_k \le n} |A_{i_1} \cap \cdots \cap A_{i_k}|. \tag{5.1}$$

Or equivalently, the number of elements of U which are in none of $A_1, A_2, \ldots, A_n$ equals

$$|U| - \Big| \bigcup_{i=1}^{n} A_i \Big| = |U| + \sum_{k=1}^{n} (-1)^{k} \sum_{1 \le i_1 < \cdots < i_k \le n} |A_{i_1} \cap \cdots \cap A_{i_k}|.$$

Proof. An element $x \notin \bigcup_{i=1}^{n} A_i$ contributes 0 to both sides of Equation (5.1). We show that an
element x lying in some $A_i$ contributes (increases the value by) 1 to both sides of Equation (5.1).
So, assume that x is included only in the sets $A_1, \ldots, A_r$.
Then, the contribution of x to $|A_{i_1} \cap \cdots \cap A_{i_k}|$ is 1 if and only if $\{i_1, \ldots, i_k\} \subseteq \{1, 2, \ldots, r\}$.
Hence, the contribution of x to $\sum_{1 \le i_1 < \cdots < i_k \le n} |A_{i_1} \cap \cdots \cap A_{i_k}|$ is C(r, k). Thus, the contribution
of x to the right hand side of Equation (5.1) is

$$C(r, 1) - C(r, 2) + C(r, 3) - \cdots + (-1)^{r+1} C(r, r) = 1.$$

The element x clearly contributes 1 to the left hand side of Equation (5.1) and hence the required
result follows. The proof of the equivalent condition is left for the readers.

Example 5.2.3. How many integers between 1 and 10000 are divisible by none of 2, 3, 5, 7?
Ans: For i ∈ {2, 3, 5, 7}, let Ai = {n ∈ N | n ≤ 10000, i|n}. Therefore, the required answer is
10000 − |A2 ∪ A3 ∪ A5 ∪ A7 | = 2285.
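
The counts in Examples 5.2.1 and 5.2.3 can be checked mechanically. The following is a minimal sketch, assuming the moduli are pairwise coprime (so that the size of an intersection is the floor of the limit divided by the product); the function names are illustrative and not part of the notes.

```python
from itertools import combinations
from math import prod

def count_divisible_by_none(limit, moduli):
    """Count 1 <= n <= limit divisible by none of the pairwise coprime moduli (PIE)."""
    total = 0
    for k in range(len(moduli) + 1):
        for subset in combinations(moduli, k):
            # |A_{i1} ∩ ... ∩ A_{ik}| = floor(limit / product); the empty product is 1.
            total += (-1) ** k * (limit // prod(subset))
    return total

def brute_force(limit, moduli):
    """Direct count, used only as a cross-check."""
    return sum(1 for n in range(1, limit + 1) if all(n % m for m in moduli))

print(count_divisible_by_none(1000, [2, 3]))         # 333, as in Example 5.2.1
print(count_divisible_by_none(10000, [2, 3, 5, 7]))  # 2285, as in Example 5.2.3
```

The alternating sign `(-1) ** k` is exactly the sign in the complementary form of Theorem 5.2.2, with the `k = 0` term playing the role of |U|.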

Definition 5.2.4. [Euler totient function] For a fixed n ∈ N, the Euler’s totient function
is defined as ϕ(n) = |{k ∈ N : k ≤ n, gcd(k, n) = 1}|.
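Theorem 5.2.5. Let $n = \prod_{i=1}^{k} p_i^{\alpha_i}$ be a factorization of $n$ into distinct primes $p_1, \ldots, p_k$. Then,
\[ \varphi(n) = n \Big(1 - \frac{1}{p_1}\Big)\Big(1 - \frac{1}{p_2}\Big) \cdots \Big(1 - \frac{1}{p_k}\Big). \]

Proof. For $1 \le i \le k$, let $A_i = \{m \in \mathbb{N} : m \le n,\ p_i \mid m\}$. Then,
\[ \varphi(n) = n - \Big|\bigcup_{i} A_i\Big| = n \Big[ 1 - \sum_{i=1}^{k} \frac{1}{p_i} + \sum_{1 \le i < j \le k} \frac{1}{p_i p_j} - \cdots + (-1)^{k} \frac{1}{p_1 p_2 \cdots p_k} \Big] = n \Big(1 - \frac{1}{p_1}\Big)\Big(1 - \frac{1}{p_2}\Big) \cdots \Big(1 - \frac{1}{p_k}\Big) \]
as $|A_i| = \frac{n}{p_i}$, $|A_i \cap A_j| = \frac{n}{p_i p_j}$ and so on. Thus, the required result follows.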
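
As a sanity check of Theorem 5.2.5, the sketch below compares the product formula with the defining count of Definition 5.2.4. Trial division is an assumption made here for simplicity; the helper names are hypothetical.

```python
from math import gcd

def prime_factors(n):
    """Distinct prime divisors of n, found by trial division."""
    primes, p = [], 2
    while p * p <= n:
        if n % p == 0:
            primes.append(p)
            while n % p == 0:
                n //= p
        p += 1
    if n > 1:
        primes.append(n)
    return primes

def phi_formula(n):
    """phi(n) = n * prod (1 - 1/p), computed in exact integer arithmetic."""
    result = n
    for p in prime_factors(n):
        result = result // p * (p - 1)
    return result

def phi_definition(n):
    """phi(n) = number of k in {1, ..., n} with gcd(k, n) = 1."""
    return sum(1 for k in range(1, n + 1) if gcd(k, n) == 1)

print(phi_formula(36), phi_definition(36))  # both 12
```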

Definition 5.2.6. A derangement of objects in a finite set $S$ is a permutation/arrangement
$\sigma$ on $S$ such that for all $x$, $\sigma(x) \ne x$.

For example, 2, 1, 4, 3 is a derangement of 1, 2, 3, 4. The number of derangements of $1, 2, \ldots, n$
is denoted by $D_n$. By convention, $D_0 = 1$. Also, we use $a \approx b$ to mean that $b$ is an approximate
value of $a$.

Theorem 5.2.7. For $n \in \mathbb{N}$, $D_n = n! \sum_{k=0}^{n} \frac{(-1)^k}{k!}$. Thus, $\frac{D_n}{n!} \approx \frac{1}{e}$.

Proof. For each $i$, $1 \le i \le n$, let $A_i$ be the set of arrangements $\sigma$ such that $\sigma(i) = i$. Then,
verify that $|A_i| = (n - 1)!$, $|A_i \cap A_j| = (n - 2)!$ and so on. Thus,
\[ \Big|\bigcup_{i} A_i\Big| = n \cdot (n - 1)! - C(n, 2)(n - 2)! + \cdots + (-1)^{n-1} C(n, n)\, 0! = n! \sum_{k=1}^{n} \frac{(-1)^{k-1}}{k!}. \]
So, $D_n = n! - |\bigcup_{i} A_i| = n! \sum_{k=0}^{n} \frac{(-1)^k}{k!}$. Furthermore, $\lim_{n \to \infty} \frac{D_n}{n!} = \frac{1}{e}$.
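
The formula of Theorem 5.2.7 can be compared with a direct enumeration for small $n$. This is only an illustrative sketch; the function names are not from the notes.

```python
from itertools import permutations
from math import factorial, e

def derangements_formula(n):
    """n! * sum_{k=0}^{n} (-1)^k / k!, kept in integers since n!/k! is an integer."""
    return sum((-1) ** k * (factorial(n) // factorial(k)) for k in range(n + 1))

def derangements_brute(n):
    """Count permutations of {0, ..., n-1} without fixed points."""
    return sum(1 for p in permutations(range(n)) if all(p[i] != i for i in range(n)))

for n in range(1, 8):
    # The ratio D_n / n! approaches 1/e quickly.
    print(n, derangements_formula(n), derangements_formula(n) / factorial(n), 1 / e)
```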

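Example 5.2.8. For $n \in \mathbb{N}$, how many squarefree integers do not exceed $n$?

Ans: Let $P = \{p_1, \ldots, p_s\}$ be the set of primes not exceeding $\sqrt{n}$ and for $1 \le i \le s$, let $A_i$
be the set of integers between 1 and $n$ that are multiples of $p_i^2$. It is easy to see that
\[ |A_i| = \Big\lfloor \frac{n}{p_i^2} \Big\rfloor, \qquad |A_i \cap A_j| = \Big\lfloor \frac{n}{p_i^2 p_j^2} \Big\rfloor, \]
and so on. So, the number of squarefree integers not greater than $n$ is
\[ n - \Big|\bigcup_{i=1}^{s} A_i\Big| = n - \sum_{i=1}^{s} \Big\lfloor \frac{n}{p_i^2} \Big\rfloor + \sum_{1 \le i < j \le s} \Big\lfloor \frac{n}{p_i^2 p_j^2} \Big\rfloor - \sum_{1 \le i < j < k \le s} \Big\lfloor \frac{n}{p_i^2 p_j^2 p_k^2} \Big\rfloor + \cdots \]
For $n = 100$, we have $P = \{2, 3, 5, 7\}$. So, the number of squarefree integers not exceeding 100
is
\[ 100 - \Big\lfloor \frac{100}{4} \Big\rfloor - \Big\lfloor \frac{100}{9} \Big\rfloor - \Big\lfloor \frac{100}{25} \Big\rfloor - \Big\lfloor \frac{100}{49} \Big\rfloor + \Big\lfloor \frac{100}{36} \Big\rfloor + \Big\lfloor \frac{100}{100} \Big\rfloor = 61. \]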
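
The inclusion-exclusion count of Example 5.2.8 can be sketched as follows and compared with a direct test of squarefreeness. The helper names and the naive prime sieve are assumptions made for brevity.

```python
from itertools import combinations
from math import isqrt, prod

def primes_up_to(m):
    """Primes not exceeding m, by naive trial division."""
    return [p for p in range(2, m + 1) if all(p % q for q in range(2, isqrt(p) + 1))]

def squarefree_count_pie(n):
    """n - |A_1 ∪ ... ∪ A_s|, where A_i is the set of multiples of p_i^2 up to n."""
    squares = [p * p for p in primes_up_to(isqrt(n))]
    total = 0
    for k in range(len(squares) + 1):
        for subset in combinations(squares, k):
            total += (-1) ** k * (n // prod(subset))
    return total

def squarefree_count_brute(n):
    """Count m <= n that are divisible by no square d^2 with d >= 2."""
    return sum(1 for m in range(1, n + 1)
               if all(m % (d * d) for d in range(2, isqrt(m) + 1)))

print(squarefree_count_pie(100))  # 61, as computed in Example 5.2.8
```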
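Exercise 5.2.9. 1. Let $m, n \in \mathbb{N}$ with $\gcd(m, n) = 1$. Then, $\varphi(mn) = \varphi(m)\varphi(n)$.
Ans: Let $n = p_1^{\alpha_1} p_2^{\alpha_2} \cdots p_k^{\alpha_k}$ and $m = q_1^{\beta_1} q_2^{\beta_2} \cdots q_r^{\beta_r}$, where $p_i, q_j$ are primes and $(p_i, q_j) = 1$,
$\forall\, i, j$. So
\[ \varphi(n)\varphi(m) = n \Big(1 - \frac{1}{p_1}\Big) \cdots \Big(1 - \frac{1}{p_k}\Big)\, m \Big(1 - \frac{1}{q_1}\Big) \cdots \Big(1 - \frac{1}{q_r}\Big) = \varphi(nm). \]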

2. Let $n \in \mathbb{N}$. Then, use inclusion-exclusion to prove $S(n, r) = \frac{1}{r!} \sum_{i=0}^{r-1} (-1)^i C(r, i)(r - i)^n$.

Ans: By Lemma 4.4.7, the number of onto functions $f : \{1, 2, \ldots, n\} \to \{1, 2, \ldots, r\}$ equals
$r!\,S(n, r)$. Now, for $1 \le i \le r$, let $A_i$ be the set of functions $f : \{1, 2, \ldots, n\} \to \{1, 2, \ldots, r\}$ such that
$i \notin f(\{1, 2, \ldots, n\})$. Then,
\[ |A_i| = (r - 1)^n, \qquad |A_i \cap A_j| = (r - 2)^n, \]
and so on. So, the number of onto functions is
\[ r^n - \Big|\bigcup_{i} A_i\Big| = r^n - C(r, 1)(r - 1)^n + C(r, 2)(r - 2)^n - \cdots + (-1)^{r-1} C(r, r - 1)(1)^n = \sum_{i=0}^{r-1} (-1)^i C(r, i)(r - i)^n. \]
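3. Show that
\[ \sum_{k=0}^{m} (-1)^k C(m, k)(m - k)^n = \begin{cases} n! & \text{if } m = n \\ 0 & \text{if } m > n. \end{cases} \]
Ans: This is $m!\,S(n, m)$, the number of onto functions from an $n$-set to an $m$-set. It equals $n!$ when $m = n$ and 0 when $m > n$.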
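4. Determine the number of 10-letter words using ENGLISH alphabets that does not contain
all the vowels.
Ans: Let $A_\alpha$ be the set of 10-letter words using ENGLISH alphabets which does not contain
the vowel $\alpha$. Then, with each sum running over sets of distinct vowels from $\{a, e, i, o, u\}$,
\[ \sum_{\alpha} |A_\alpha| = C(5, 1)\,25^{10}, \qquad \sum_{\alpha, \beta} |A_\alpha \cap A_\beta| = C(5, 2)\,24^{10}, \]
\[ \sum_{\alpha, \beta, \gamma} |A_\alpha \cap A_\beta \cap A_\gamma| = C(5, 3)\,23^{10}, \qquad \sum_{\alpha, \beta, \gamma, \delta} |A_\alpha \cap A_\beta \cap A_\gamma \cap A_\delta| = C(5, 4)\,22^{10} \]
and $|A_a \cap A_e \cap A_i \cap A_o \cap A_u| = 21^{10}$. So, the required answer is $\sum_{k=1}^{5} (-1)^{k-1} C(5, k)(26 - k)^{10}$.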

5. In a school there are 12 students who take an art course A, 20 who take a biology course
B, 20 who take a chemistry course C and 8 who take a dance course D. There are 5
students who take both A and B, 7 students who take both A and C, 4 students who take
both A and D, 16 students who take both B and C, 4 students who take both B and D
and 3 students who take both C and D. There are 3 who take A, B and C; 2
who take A, B and D; 3 who take A, C and D; and 2 who take B, C and D. Finally there
are 2 in all four courses and further 71 students who have not taken any of these courses.
Find the total number of students.

Ans: Total number of students equals 71 + |A ∪ B ∪ C ∪ D|, where |A| = 12, |B| = 20,
|C| = 20, |D| = 8, |A ∩ B| = 5, |A ∩ C| = 7, |A ∩ D| = 4, |B ∩ C| = 16, |B ∩ D| = 4,
|C ∩ D| = 3, |A ∩ B ∩ C| = 3, |A ∩ B ∩ D| = 2, |A ∩ C ∩ D| = 3, |B ∩ C ∩ D| = 2 and
|A ∩ B ∩ C ∩ D| = 2. So, the answer is 100.

6. Determine all integers n satisfying ϕ(n) = 13.

Ans: Notice that $\varphi(n) = \frac{n}{p_1 p_2 \cdots p_k} (p_1 - 1)(p_2 - 1) \cdots (p_k - 1)$ is always a multiple of $p - 1$
if $p$ is a prime divisor of $n$. In particular, if $n$ is a multiple of an odd prime, then $\varphi(n)$ is even.
On the other hand $\varphi(2^k) = 2^{k-1}$, which is odd only when $k = 1$. Thus, there is no natural
number $n$ such that $\varphi(n) = 13$.

7. Determine all integers n satisfying ϕ(n) = 12.

Ans: Notice that $n \ne 2^k$ as $\varphi(2^k) = 2^{k-1} \ne 12$, for any $k$. If $n = p^k$, for an odd prime $p$,
then $\varphi(n) = (p - 1)p^{k-1} = 12$. If $k > 1$, then $p \mid 12$. So, $p = 3$ and $\varphi(n) = 2 \cdot 3^{k-1} = 12$, which
has no solution. So, $k = 1$, $p = 13 = n$. For the remaining case let $n = mr$, $(m, r) = 1$. So,
either $m$ is even and $r$ is odd, or both $m, r$ are odd. In either case, $8 \nmid m$, or else $8 \mid \varphi(n) = 12$.

Case: $m = 2$ and $r = p^k$, where $p$ is an odd prime. In that case $\varphi(n) = p^{k-1}(p - 1) = 12$.
So, $k = 1$, $p = 13$, $n = 26$.

Case: $m = 2$, $r = st$ where $s, t$ are odd and relatively prime. In this case both $\varphi(s)$ and $\varphi(t)$
are even and $\varphi(s)\varphi(t) = 12$. Let $\varphi(s) = 6$, $\varphi(t) = 2$. So, $t = 3$ and $s = 3^2$ or 7. But since
$(s, t) = 1$, we have $s = 7$. Thus, $n = 42$.

Case: $m = 4$, $r = p^k$, where $p$ is an odd prime. So, $\varphi(r) = p^{k-1}(p - 1) = 6$. Thus, $r = 9, 7$
so that $n = 36, 28$.

Case: $m = 4$, $r = st$, where $s, t$ are odd and relatively prime. Then $8 \mid 12$, not possible.

Case: Both $m, r$ are odd. Conclude that $n = 21$.

Thus, the numbers $n$ with $\varphi(n) = 12$ are 13, 21, 26, 28, 36, 42.
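8. For each fixed $n \in \mathbb{N}$, use mathematical induction to prove that $\sum_{d \mid n} \varphi(d) = n$.

Ans: We use induction on the number of distinct prime factors of $n$. For $n = p^m$, where $p$ is
a prime, we have
\[ \sum_{d \mid n} \varphi(d) = 1 + (p - 1) + (p^2 - p) + \cdots + (p^m - p^{m-1}) = p^m = n. \]
Let the statement be true when $n$ has $k$ distinct prime factors. Let $n = p_1^{\alpha_1} \cdots p_{k+1}^{\alpha_{k+1}}$ and
$n_1 = p_1^{\alpha_1} \cdots p_k^{\alpha_k}$. Then
\begin{align*}
\sum_{d \mid n} \varphi(d) &= \sum_{d \mid n_1} \varphi(d) + \sum_{d \mid n_1} \varphi(p_{k+1} d) + \sum_{d \mid n_1} \varphi(p_{k+1}^2 d) + \cdots + \sum_{d \mid n_1} \varphi(p_{k+1}^{\alpha_{k+1}} d) \\
&= \sum_{d \mid n_1} \varphi(d) + \sum_{d \mid n_1} \varphi(d)\varphi(p_{k+1}) + \sum_{d \mid n_1} \varphi(d)\varphi(p_{k+1}^2) + \cdots + \sum_{d \mid n_1} \varphi(d)\varphi(p_{k+1}^{\alpha_{k+1}}) \\
&= n_1 \Big[ 1 + \varphi(p_{k+1}) + \varphi(p_{k+1}^2) + \cdots + \varphi(p_{k+1}^{\alpha_{k+1}}) \Big] = n_1\, p_{k+1}^{\alpha_{k+1}} = n.
\end{align*}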

9. A function $f : \mathbb{N} \to \mathbb{N}$ is said to be multiplicative if $f(nm) = f(n)f(m)$, whenever
$\gcd(n, m) = 1$.
(a) Let $f, g : \mathbb{N} \to \mathbb{N}$ be functions satisfying $f(n) = \sum_{d \mid n} g(d)$ and $f(1) = g(1) = 1$. If $f$ is
multiplicative then use induction to show that $g$ is also multiplicative.
(b) Imagine the fractions $\frac{1}{n}, \frac{2}{n}, \ldots, \frac{n}{n}$. Reduce the fractions to standard form by canceling
the common factors and regroup to show that $n = \sum_{d \mid n} \varphi(d)$. For example,
\[ \frac{1}{6}, \frac{2}{6}, \frac{3}{6}, \frac{4}{6}, \frac{5}{6}, \frac{6}{6} \ \to\ \frac{1}{6}, \frac{1}{3}, \frac{1}{2}, \frac{2}{3}, \frac{5}{6}, 1 \ \to\ 1, \frac{1}{2}, \frac{1}{3}, \frac{2}{3}, \frac{1}{6}, \frac{5}{6}. \]
(c) Use the first part to conclude that $\varphi$ is multiplicative.

Ans: (a) Assume $g$ to be multiplicative up to $k - 1$. Write $k = nm$, $(n, m) = 1$. Then,
$f(n) = \sum_{d_1 \mid n} g(d_1)$ and $f(m) = \sum_{d_2 \mid m} g(d_2)$. So,
\[ f(n)f(m) = \sum_{d_1 \mid n} g(d_1) \sum_{d_2 \mid m} g(d_2) = \sum_{d_1 \mid n} \sum_{d_2 \mid m} g(d_1)g(d_2) = g(n)g(m) + \sum_{\substack{d_1 \mid n,\ d_2 \mid m \\ d_1 d_2 \ne nm}} g(d_1)g(d_2) \]
and
\[ f(nm) = \sum_{d \mid nm} g(d) = \sum_{d_1 \mid n,\ d_2 \mid m} g(d_1 d_2) = g(nm) + \sum_{\substack{d_1 \mid n,\ d_2 \mid m \\ d_1 d_2 \ne nm}} g(d_1)g(d_2). \]
As $f$ is multiplicative, $f(nm) = f(n)f(m)$ and hence $g(nm) = g(n)g(m)$.

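10. Show that for $n \ge 2$, $D_n = \big\lfloor \frac{n!}{e} + \frac{1}{2} \big\rfloor$.
Ans:
\[ \Big| \frac{n!}{e} - D_n \Big| = \Big| n! \sum_{k > n} \frac{(-1)^k}{k!} \Big| \le \frac{1}{n+1} + \frac{1}{(n+1)(n+2)} + \cdots < \frac{1}{n+1} + \frac{1}{(n+1)^2} + \cdots = \frac{1}{n} \le \frac{1}{2}. \]
So that $\frac{n!}{e} + \frac{1}{2} \in (D_n, D_n + 1)$.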
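11. Prove combinatorially: $\sum_{i=0}^{n} C(n, i) D_{n-i} = n!$.
Ans: Each permutation is a product of single cycles (its fixed points) and a derangement of the
remaining points. Choose the $i$ fixed points in $C(n, i)$ ways and derange the other $n - i$ points in $D_{n-i}$ ways.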
12. Find the number of nonnegative integer solutions of a + b + c + d = 27, where 1 ≤ a ≤
5, 2 ≤ b ≤ 7, 3 ≤ c ≤ 9, 4 ≤ d ≤ 11.
Ans: The number is the same as the number of nonnegative integer solutions of a+b+c+d =
17 such that a ≤ 4, b ≤ 5, c ≤ 6, d ≤ 7. Let A be the set of solutions with a > 4. Let B
be the set of solutions with b > 5, C be the set of solutions with c > 6, and D be the set of
solutions with $d > 7$. Our intention is to find out $|[A \cup B \cup C \cup D]'|$, where $'$ denotes the complement
in the set of all $C(20, 3) = 1140$ nonnegative solutions.

We see that, $|A| = C(15, 3)$, $|B| = C(14, 3)$, $|C| = C(13, 3)$, $|D| = C(12, 3)$, $|A \cap B| =
C(9, 3)$, $|A \cap C| = C(8, 3)$, $|A \cap D| = C(7, 3)$, $|B \cap C| = C(7, 3)$, $|B \cap D| = C(6, 3)$,
$|C \cap D| = C(5, 3)$, and every triple intersection is empty, e.g., $|A \cap B \cap C| = 0$. So, the answer is
$1140 - 1325 + 240 = 55$.
13. Let x be a positive integer less than or equal to 9999999.

(a) Find the number of x’s for which the sum of the digits in x equals 30.
Ans: Note that we need to solve the system $x_1 + x_2 + \cdots + x_7 = 30$ in non-negative
integers with $0 \le x_i \le 9$, for $i = 1, \ldots, 7$ (leading zeros are allowed, as $x$ may have fewer
than 7 digits). So, for $1 \le i \le 7$, let $A_i$
denote the set of solutions with $x_i \ge 10$. Now, calculate $|[A_1 \cup \cdots \cup A_7]'|$.
(b) How many of the solutions obtained in the first part consist of 7 digits?
Ans: Note that we need to solve the system $x_1 + x_2 + \cdots + x_7 = 30$ in non-negative
integers with $1 \le x_1 \le 9$ and $0 \le x_i \le 9$, for $i = 2, \ldots, 7$. Or equivalently, $y_1 + x_2 + \cdots + x_7 = 29$
in non-negative integers with $0 \le y_1 \le 8$ and $0 \le x_i \le 9$, for $i = 2, \ldots, 7$.

14. Determine the number of ways to arrange 10 digits 0, 1, . . . , 9, so that the digit i is never
followed immediately by i + 1.
Ans: For $1 \le i \le 9$, let $A_i$ denote the set of arrangements of the digits in which $i$ appears
immediately after $i - 1$. Then, we are interested in $|[A_1 \cup A_2 \cup \cdots \cup A_9]'|$.
15. Determine the number of strings of length 15 consisting of the 10 digits, 0, 1, . . . , 9, so that
no string contains all the 10 digits.
Ans: Note that we are looking for the number of functions from a set having 15 elements to
a set having 10 elements which are not onto.
16. Determine the number of ways of permuting the 26 letters of the ENGLISH alphabets so
that none of the patterns lazy, run, show and pet occurs.
Ans: Let $A_\ell$ denote the set of permutations of the 26 letters of the ENGLISH alphabets
in which the pattern lazy occurs. Similarly, define the sets $A_r$, $A_s$ and $A_p$. Then, we are
interested in $|[A_\ell \cup A_r \cup A_s \cup A_p]'|$.
17. Let $S = \{(n_1, n_2, n_3) \mid n_i \in \mathbb{N},\ \sum n_i = 15\}$. Evaluate $\sum_{(n_1, n_2, n_3) \in S} \frac{15!}{n_1!\, n_2!\, n_3!}$.
Ans: $\frac{15!}{n_1!\, n_2!\, n_3!}$ is the number of words of length 15 made with $n_1$ copies of $A_1$, $n_2$ copies of
$A_2$ and $n_3$ copies of $A_3$. Thus, $\sum_{(n_1, n_2, n_3) \in S} \frac{15!}{n_1!\, n_2!\, n_3!}$ represents the number of words of length
15 using letters from $\{A_1, A_2, A_3\}$, where each of the letters appears at least once. Note that
we are talking of all onto functions from $\{1, 2, \ldots, 15\}$ to $\{A_1, A_2, A_3\}$. You can also use PIE
to get this number as $3^{15} - 3 \times 2^{15} + 3$.
18. Each of the 9 senior students said: ‘the number of junior students I want to help is exactly
one’. There were 4 junior students a, b, c, d, who wanted their help. The allocation was
done randomly. What is the probability that either a has exactly two seniors to help him
or b has exactly 3 seniors to help him or c has no seniors to help him?
Ans: The allocations are functions $f : \{1, 2, \ldots, 9\} \to \{a, b, c, d\}$. Put $A = \{f : |f^{-1}(a)| = 2\}$,
$B = \{f : |f^{-1}(b)| = 3\}$ and $C = \{f : |f^{-1}(c)| = 0\}$. Then, $|A| = C(9, 2)3^7$, $|B| = C(9, 3)3^6$,
$|C| = 3^9$, $|A \cap B| = C(9, 2)C(7, 3)2^4$, $|A \cap C| = C(9, 2)2^7$, $|B \cap C| = C(9, 3)2^6$
and $|A \cap B \cap C| = C(9, 2)C(7, 3)$. Using PIE, the answer is
\[ |A \cup B \cup C| = C(9, 2)3^7 + C(9, 3)3^6 + 3^9 - C(9, 2)C(7, 3)2^4 - C(9, 2)2^7 - C(9, 3)2^6 + C(9, 2)C(7, 3). \]
So, the probability is the above number divided by $4^9$.
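
Two of the answers above are small enough to confirm by exhaustive enumeration. The sketch below does this for Exercises 12 and 18; the encoding of the juniors a, b, c, d as 0, 1, 2, 3 is an assumption made only for this check.

```python
from itertools import product
from math import comb

# Exercise 12: solutions of a + b + c + d = 27 within the stated ranges.
count_12 = sum(
    1
    for a in range(1, 6)
    for b in range(2, 8)
    for c in range(3, 10)
    if 4 <= 27 - a - b - c <= 11   # d is forced once a, b, c are chosen
)

# Exercise 18: allocations f : {1, ..., 9} -> {a, b, c, d}, coded as tuples over 0..3.
favourable = sum(
    1
    for f in product(range(4), repeat=9)
    if f.count(0) == 2 or f.count(1) == 3 or f.count(2) == 0
)
pie_18 = (comb(9, 2) * 3**7 + comb(9, 3) * 3**6 + 3**9
          - comb(9, 2) * comb(7, 3) * 2**4 - comb(9, 2) * 2**7 - comb(9, 3) * 2**6
          + comb(9, 2) * comb(7, 3))

print(count_12)                    # 55
print(favourable == pie_18)        # True
print(favourable / 4**9)           # the required probability
```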

5.3 Generating Functions

Generating functions are one of the strongest tools in combinatorics. We start with the definition of formal power
series over $\mathbb{Q}$ and develop the theory of generating functions. This theory is then used to get closed
form expressions for some known recurrence relations, and further to get some
binomial identities.
Definition 5.3.1. 1. An algebraic expression of the form $f(x) = \sum_{n \ge 0} a_n x^n$, where $a_n \in \mathbb{Q}$
for all $n \ge 0$, is called a formal power series in the indeterminate $x$ over $\mathbb{Q}$. By $P(x)$,
we denote the set of all formal power series in $x$ and by $\mathrm{cf}[x^n, f]$, the coefficient of $x^n$ in
$f$, e.g., $\mathrm{cf}\big[x^n, \sum_{n \ge 0} a_n x^n\big] = a_n$.

2. Two elements $f, g \in P(x)$ are said to be equal if $\mathrm{cf}[x^n, f] = \mathrm{cf}[x^n, g]$ for all $n \ge 0$.

3. Let $f(x) = \sum_{n \ge 0} a_n x^n$, $g(x) = \sum_{n \ge 0} b_n x^n \in P(x)$. Then, their

(a) sum/addition is defined by $\mathrm{cf}[x^n, f + g] = \mathrm{cf}[x^n, f] + \mathrm{cf}[x^n, g]$.

(b) product (called the Cauchy product) is defined by $\mathrm{cf}[x^n, f \cdot g] = c_n = \sum_{k=0}^{n} a_k b_{n-k}$.

Before proceeding further, we consider the following examples.

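Example 5.3.2. 1. How many words of size 8 can be formed with 6 copies of A and 6 copies
of B?
Ans: $\sum_{k=2}^{6} C(8, k)$, as we just need to choose $k$ places for A, where $2 \le k \le 6$.

Alternate. In any such word, we need $m$ many A's and $n$ many B's with $m + n = 8$,
$m \le 6$ and $n \le 6$. Also, the number of words with $m$ many A's and $n$ many B's is $\frac{8!}{m!\, n!}$.
We identify this number with $\frac{8!\, x^m y^n}{m!\, n!}$ and note that this is a term of degree 8 in
\[ 8! \Big[ 1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \frac{x^4}{4!} + \frac{x^5}{5!} + \frac{x^6}{6!} \Big] \Big[ 1 + y + \frac{y^2}{2!} + \frac{y^3}{3!} + \frac{y^4}{4!} + \frac{y^5}{5!} + \frac{y^6}{6!} \Big]. \]
If we replace $y$ by $x$, then our answer is
\begin{align*}
& 8!\, \mathrm{cf}\Big[ x^8, \Big(1 + x + \frac{x^2}{2!} + \cdots + \frac{x^6}{6!}\Big)\Big(1 + x + \frac{x^2}{2!} + \cdots + \frac{x^6}{6!}\Big) \Big] \\
&= 8!\, \mathrm{cf}\Big[ x^8, \Big(\frac{x^2}{2!} + \frac{x^3}{3!} + \cdots + \frac{x^6}{6!}\Big)\Big(\frac{x^2}{2!} + \frac{x^3}{3!} + \cdots + \frac{x^6}{6!}\Big) \Big] \\
&= 8!\, \mathrm{cf}\Big[ x^8, \Big(\frac{x^2}{2!} + \frac{x^3}{3!} + \cdots\Big)\Big(\frac{x^2}{2!} + \frac{x^3}{3!} + \cdots\Big) \Big] \\
&= 8!\, \mathrm{cf}\big[ x^8, (e^x - 1 - x)^2 = e^{2x} + 1 + x^2 - 2xe^x - 2e^x + 2x \big] = 8! \Big( \frac{2^8}{8!} - \frac{2}{7!} - \frac{2}{8!} \Big) = 238.
\end{align*}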


2. How many anagrams are there of the word MISSISSIPPI?

Ans: Using basic counting, the answer is $\frac{11!}{4!\,4!\,2!}$. For another understanding, the readers
should note that
\begin{align*}
\frac{11!}{4!\,4!\,2!} &= 11!\, \mathrm{cf}\Big[ x^{11}, (1 + x)\Big(1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \frac{x^4}{4!}\Big)^2 \Big(1 + x + \frac{x^2}{2!}\Big) \Big] \\
&= 11!\, \mathrm{cf}\Big[ x^{11}, \Big(x + \frac{x^2}{2!} + \cdots\Big)\Big(\frac{x^4}{4!} + \frac{x^5}{5!} + \cdots\Big)^2 \Big(\frac{x^2}{2!} + \frac{x^3}{3!} + \cdots\Big) \Big]
\end{align*}
as we need to have $x$, $\frac{x^4}{4!}$, $\frac{x^4}{4!}$ and $\frac{x^2}{2!}$ for the alphabets M, I, S and P, respectively.
3. Prove that the number of nonnegative integer solutions of $u + v + w + t = 10$ equals
$\mathrm{cf}\big[x^{10}, (1 + x + x^2 + \cdots)^4\big]$.

Ans: Note that $u$ can take any value from 0 to 10 which corresponds to $1 + x + \cdots + x^{10}$.
Hence, using Theorem 4.6.1, the required answer is
\[ \mathrm{cf}\big[ x^{10}, (1 + x + x^2 + \cdots)^4 = (1 - x)^{-4} \big] = C(13, 10) = \frac{4 \cdot 5 \cdots 13}{10!}. \]
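
The three coefficient extractions in Example 5.3.2 can be reproduced with the Cauchy product of Definition 5.3.1 applied to truncated coefficient lists. The sketch below uses exact rational arithmetic; the helper names `multiply` and `exp_segment` are illustrative and not notation from the notes.

```python
from fractions import Fraction
from math import factorial

def multiply(f, g, degree):
    """Cauchy product of two coefficient lists, truncated at x^degree."""
    h = [Fraction(0)] * (degree + 1)
    for i, a in enumerate(f[: degree + 1]):
        for j, b in enumerate(g[: degree + 1 - i]):
            h[i + j] += a * b
    return h

def exp_segment(low, high):
    """Coefficients of x^low/low! + ... + x^high/high!."""
    return [Fraction(1, factorial(k)) if k >= low else Fraction(0) for k in range(high + 1)]

# Part 1: words of length 8 using at most 6 A's and at most 6 B's.
block = exp_segment(0, 6)
words = factorial(8) * multiply(block, block, 8)[8]

# Part 2: anagrams of MISSISSIPPI (M: 1, I: 4, S: 4, P: 2 copies).
series = [Fraction(1)]
for copies in (1, 4, 4, 2):
    series = multiply(series, exp_segment(0, copies), 11)
anagrams = factorial(11) * series[11]

# Part 3: nonnegative solutions of u + v + w + t = 10.
geometric = [Fraction(1)] * 11          # 1 + x + ... + x^10
series = [Fraction(1)]
for _ in range(4):
    series = multiply(series, geometric, 10)
solutions = series[10]

print(words, anagrams, solutions)       # 238 34650 286
```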

Definition 5.3.3. Let $(b_r)_{0}^{\infty}$ be a sequence of integers. Then,

1. the ordinary generating function (ogf) is the formal power series
\[ b_0 + b_1 x + b_2 x^2 + b_3 x^3 + \cdots, \text{ and} \]
2. the exponential generating function (egf) is the formal power series
\[ b_0 + b_1 x + b_2 \frac{x^2}{2!} + b_3 \frac{x^3}{3!} + \cdots. \]
If there exists an $M \in \mathbb{N}$ such that $b_r = 0$ for all $r \ge M$, then the generating functions have
finitely many terms.

Example 5.3.4. What is the number of nonnegative integer solutions of $2a + 3b + 5c = r$,
$r \in \mathbb{N}_0$?
Ans: Note that $a \in \mathbb{N}_0$ and hence $2a$ corresponds to the formal power series $1 + x^2 + x^4 + \cdots$.
Thus, we need to consider the ogf
\[ (1 + x^2 + x^4 + \cdots)(1 + x^3 + x^6 + \cdots)(1 + x^5 + x^{10} + \cdots) = \frac{1}{(1 - x^2)(1 - x^3)(1 - x^5)}. \]
Hence, the required answer is $\mathrm{cf}\Big[ x^r, \frac{1}{(1 - x^2)(1 - x^3)(1 - x^5)} \Big]$.
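
The coefficient in Example 5.3.4 has no simple closed form, but it is easy to compute. A minimal sketch: multiplying a series by $1/(1 - x^w)$ amounts to a running sum with step $w$, so the three factors can be applied one after the other. The function names are assumptions made for this illustration.

```python
def count_solutions(r, weights=(2, 3, 5)):
    """Coefficient of x^r in the product of 1/(1 - x^w) over the given weights."""
    coeff = [1] + [0] * r                 # start from the series 1
    for w in weights:
        # Multiply by 1 + x^w + x^(2w) + ...: coeff[n] gains coeff[n - w].
        for n in range(w, r + 1):
            coeff[n] += coeff[n - w]
    return coeff[r]

def brute_force(r):
    """Directly count (a, b, c) in N_0^3 with 2a + 3b + 5c = r."""
    return sum(1 for a in range(r // 2 + 1) for b in range(r // 3 + 1)
               if r - 2 * a - 3 * b >= 0 and (r - 2 * a - 3 * b) % 5 == 0)

print(count_solutions(10))   # 4: (5,0,0), (2,2,0), (1,1,1), (0,0,2)
```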

Remark 5.3.5. 1. Let $f(x) = \sum_{n \ge 0} a_n \frac{x^n}{n!}$, $g(x) = \sum_{n \ge 0} b_n \frac{x^n}{n!} \in P(x)$. Then, in case of egf,
their product equals $\sum_{n \ge 0} d_n \frac{x^n}{n!}$, where $d_n = \sum_{k=0}^{n} C(n, k)\, a_k b_{n-k}$, for $n \ge 0$.

2. Note that $e^{e^x - 1} \in P(x)$ as $e^y = \sum_{n \ge 0} \frac{y^n}{n!}$ implies that $e^{e^x - 1} = \sum_{n \ge 0} \frac{(e^x - 1)^n}{n!}$ and
\[ \mathrm{cf}\big[ x^m, e^{e^x - 1} \big] = \mathrm{cf}\Big[ x^m, \sum_{n \ge 0} \frac{(e^x - 1)^n}{n!} \Big] = \mathrm{cf}\Big[ x^m, \sum_{n=0}^{m} \frac{(e^x - 1)^n}{n!} \Big]. \tag{5.2} \]
That is, for each $m \ge 0$, $\mathrm{cf}\big[x^m, e^{e^x - 1}\big]$ is a sum of a finite number of rational numbers.
Whereas, the expression $e^{e^x} \notin P(x)$ requires infinitely many computations for $\mathrm{cf}\big[x^m, e^{e^x}\big]$,
for all $m \ge 0$.

With the algebraic operations as defined in Definition 5.3.1.3, it can be checked that $P(x)$
forms a Commutative Ring with identity, where the identity element is given by the formal
power series $f(x) = 1$. In this ring, the element $f(x) = \sum_{n \ge 0} a_n x^n$ is said to have a reciprocal if
there exists another element $g(x) = \sum_{n \ge 0} b_n x^n \in P(x)$ such that $f(x) \cdot g(x) = 1$. So, the question
arises, under what conditions on $\mathrm{cf}[x^n, f]$, can we find $g(x) \in P(x)$ such that $f(x)g(x) = 1$.
The answer to this question is given in the following proposition.

Proposition 5.3.6. The reciprocal of $f \in P(x)$ exists if and only if $\mathrm{cf}[x^0, f] \ne 0$.

Proof. Let $g(x) = \sum_{n \ge 0} b_n x^n \in P(x)$ be the reciprocal of $f(x) = \sum_{n \ge 0} a_n x^n$. Then, $f(x)g(x) = 1$ if
and only if $\mathrm{cf}[x^0, f \cdot g] = 1$ and $\mathrm{cf}[x^n, f \cdot g] = 0$, for all $n \ge 1$.
But, by definition of the Cauchy product, $\mathrm{cf}[x^0, f \cdot g] = a_0 b_0$. Hence, if $a_0 = \mathrm{cf}[x^0, f] = 0$
then $\mathrm{cf}[x^0, f \cdot g] = 0$ and thus, $f$ cannot have a reciprocal. However, if $a_0 \ne 0$, then the
coefficients $\mathrm{cf}[x^n, g] = b_n$'s can be recursively obtained as follows:

$b_0 = 1/a_0$ as $1 = c_0 = a_0 b_0$;
$b_1 = -(a_1 b_0)/a_0$ as $0 = c_1 = a_0 b_1 + a_1 b_0$;
$b_2 = -(a_2 b_0 + a_1 b_1)/a_0$ as $0 = c_2 = a_0 b_2 + a_1 b_1 + a_2 b_0$; and in general, if we have computed $b_k$,
for $k \le r$, then using $0 = c_{r+1} = a_{r+1} b_0 + a_r b_1 + \cdots + a_1 b_r + a_0 b_{r+1}$,
$b_{r+1} = -(a_{r+1} b_0 + a_r b_1 + \cdots + a_1 b_r)/a_0$. Hence, the required result follows.
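
The recursion in the proof of Proposition 5.3.6 translates directly into a procedure for the first few coefficients of a reciprocal. The following is a sketch under the assumption that $f$ is given by a finite list of its leading coefficients (later coefficients are taken to be 0); `reciprocal` is a hypothetical helper name.

```python
from fractions import Fraction

def reciprocal(a, terms):
    """First `terms` coefficients b_0, b_1, ... of 1/f, where f has coefficient list a."""
    if a[0] == 0:
        raise ValueError("no reciprocal: the constant term is 0")
    a = [Fraction(x) for x in a] + [Fraction(0)] * terms   # pad with zeros
    b = [1 / a[0]]                                         # b_0 = 1/a_0
    for r in range(1, terms):
        # 0 = c_r = a_0 b_r + a_1 b_{r-1} + ... + a_r b_0, solved for b_r.
        b.append(-sum(a[k] * b[r - k] for k in range(1, r + 1)) / a[0])
    return b

print(reciprocal([1, -1], 5))       # 1/(1 - x): every coefficient equals 1
print(reciprocal([1, -1, -1], 6))   # 1/(1 - x - x^2): 1, 1, 2, 3, 5, 8
```

Since only division by $a_0$ is used, the coefficients stay rational whenever the $a_n$ are rational.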
Note that, in Proposition 5.3.6, $b_n \in \mathbb{Q}$ as $a_0 \in \mathbb{Q}$. We now look at the composition of formal
power series. Recall that, if $f(x) = \sum_{n \ge 0} a_n x^n$, $g(x) = \sum_{n \ge 0} b_n x^n \in P(x)$ then the composition
\[ (f \circ g)(x) = f(g(x)) = \sum_{n \ge 0} a_n (g(x))^n = \sum_{n \ge 0} a_n \Big( \sum_{m \ge 0} b_m x^m \Big)^n \]
may not be defined (just to compute the constant term of the composition, one may have to
look at an infinite sum of rational numbers). For example, let $f(x) = e^x$ and $g(x) = x + 1$. Note
that $g(0) = 1 \ne 0$. Here, $(f \circ g)(x) = f(g(x)) = f(x + 1) = e^{x+1}$. So, as a function $f \circ g$ is well
defined, but there is no formal procedure to write $e^{x+1}$ as $\sum_{k \ge 0} a_k x^k \in P(x)$ (i.e., with $a_k \in \mathbb{Q}$)
and hence $e^{x+1}$ is not a formal power series over $\mathbb{Q}$. The next result gives the condition under
which the composition $(f \circ g)(x)$ is well defined.

Proposition 5.3.7. Let $f, g \in P(x)$. Then, the composition $(f \circ g)(x) \in P(x)$ if either $f$ is a
polynomial or $\mathrm{cf}[x^0, g(x)] = 0$. Moreover, if $\mathrm{cf}[x^0, f(x)] = 0$ and $\mathrm{cf}[x^1, f(x)] \ne 0$, then there exists $g \in P(x)$, with
$\mathrm{cf}[x^0, g(x)] = 0$, such that $(f \circ g)(x) = x$. Furthermore, $(g \circ f)(x) \in P(x)$ and $(g \circ f)(x) = x$.

Proof. To show that $(f \circ g)(x) \in P(x)$, write $(f \circ g)(x) = \sum_{n \ge 0} c_n x^n$ and suppose that either $f$ is a polynomial
or $\mathrm{cf}[x^0, g(x)] = 0$. Then, to compute $c_k = \mathrm{cf}[x^k, (f \circ g)(x)]$, for $k \ge 0$, one just needs to
consider the finitely many terms $\sum_{n=0}^{k} a_n (g(x))^n$ (or all the terms of $f$, if $f$ is a polynomial), whenever $f(x) = \sum_{n \ge 0} a_n x^n$. Hence, each $c_k \in \mathbb{Q}$ and thus,
$(f \circ g)(x) \in P(x)$. This completes the proof of the first part. We leave the proof of the other
part for the reader.
The proof of the next result is left for the reader.
Proposition 5.3.8. [Basic tricks] Recall the following statements from Binomial theorem and
Theorem 4.6.1.
1. $\mathrm{cf}\big[x^n, (1 - x)^{-r} = (1 + x + x^2 + \cdots)^r\big] = C(n + r - 1, n)$.
2. $(1 - x^m)^n = 1 - C(n, 1)x^m + C(n, 2)x^{2m} - \cdots + (-1)^n x^{nm}$.
3. $(1 + x + x^2 + \cdots + x^{m-1})^n = \Big( \frac{1 - x^m}{1 - x} \Big)^n = (1 - x^m)^n (1 + x + x^2 + \cdots)^n$.
We now define the formal differentiation in $P(x)$ and give some important results. The proof
is left for the reader.
Definition 5.3.9. Let $f(x) = \sum_{n \ge 0} a_n x^n \in P(x)$. Then, the formal differentiation of $f(x)$,
denoted $f'(x)$, is defined by
\[ f'(x) = a_1 + 2a_2 x + \cdots + n a_n x^{n-1} + \cdots = \sum_{n \ge 1} n a_n x^{n-1}. \]
Proposition 5.3.10. [ogf: tricks] Let $g(x), h(x)$ be the ogf's for the sequences $(a_r)_{0}^{\infty}$, $(b_r)_{0}^{\infty}$,
respectively. Then, the following are true.

1. $Ag(x) + Bh(x)$ is the ogf for $(Aa_r + Bb_r)_{0}^{\infty}$.

2. $(1 - x)g(x)$ is the ogf for the sequence $a_0, a_1 - a_0, a_2 - a_1, \cdots$.

3. $(1 + x + x^2 + \cdots)g(x) = (1 - x)^{-1} g(x)$ is the ogf for $(M_r)_{0}^{\infty}$, where $M_r = a_r + a_{r-1} + \cdots + a_0$.

4. $g(x)h(x)$ is the ogf for $(c_r)_{0}^{\infty}$, where $c_r = a_0 b_r + a_1 b_{r-1} + a_2 b_{r-2} + \cdots + a_r b_0$.

5. $x g'(x)$ is the ogf for $(r a_r)_{0}^{\infty}$.

Proof. For example, to prove (3), note that if $g(x) = a_0 + a_1 x + a_2 x^2 + \cdots$, then the coefficient
of $x^2$ in $(1 + x + x^2 + \cdots)(a_0 + a_1 x + a_2 x^2 + \cdots)$ is $a_2 + a_1 + a_0$.
Example 5.3.11. 1. Let $a_r = 1$ for all $r \ge 0$. Then, the ogf of the sequence $(a_r)_{0}^{\infty}$ equals
$1 + x + x^2 + \cdots = (1 - x)^{-1} = f(x)$. So, for $r \ge 0$, the ogf for
(a) $a_r = r$ is $x f'(x)$ and
(b) $a_r = r^2$ is $x \big( f'(x) + x f''(x) \big)$.

2. The number of ways to distribute 50 coins among 30 students so that no student
gets more than 4 coins equals
\begin{align*}
\mathrm{cf}\big[ x^{50}, (1 + x + x^2 + x^3 + x^4)^{30} \big] &= \mathrm{cf}\big[ x^{50}, (1 - x^5)^{30} (1 - x)^{-30} \big] \\
&= C(79, 50) - 30\,C(74, 45) + C(30, 2)\,C(69, 40) - \cdots \\
&= \sum_{i=0}^{10} (-1)^i C(30, i)\, C(79 - 5i, 50 - 5i).
\end{align*}

3. For $n, r \in \mathbb{N}$, determine the number of solutions to $y_1 + \cdots + y_n = r$ with $y_i \in \mathbb{N}_0$, $1 \le i \le n$.

Ans: Recall that this number equals $C(r + n - 1, r)$ (see Theorem 4.3.3).

Alternate. We can think of the problem as follows: the above system can be interpreted
as coming from the monomial $x^r$, where $r = y_1 + \cdots + y_n$. Thus, the problem reduces
to finding the coefficients of $x^{y_k}$ of a formal power series, for $y_k \ge 0$. Now, recall that
$\mathrm{cf}\big[x^{y_k}, (1 - x)^{-1}\big] = 1$. Hence, the question reduces to computing
\[ \mathrm{cf}\Big[ x^r, \frac{1}{(1 - x)(1 - x) \cdots (1 - x)} \Big] = \mathrm{cf}\Big[ x^r, \frac{1}{(1 - x)^n} \Big] = C(r + n - 1, r). \]
4. Evaluate $\sum_{k=0}^{\infty} \frac{k}{2^k}$.
Ans: Put $f(x) = (1 - x)^{-1}$. Then, using Proposition 5.3.10, the required sum
is $\frac{1}{2} f'(1/2) = 2$. Alternately (rearranging terms of an absolutely convergent series) it is
\begin{align*}
& \tfrac{1}{2} + \\
& \tfrac{1}{4} + \tfrac{1}{4} + \\
& \tfrac{1}{8} + \tfrac{1}{8} + \tfrac{1}{8} + \\
& \quad \vdots \\
=\ & 1 + \tfrac{1}{2} + \cdots = 2.
\end{align*}
5. Determine a closed form expression for $\sum_{n \ge 0} n x^n \in P(x)$.
Ans: As $(1 - x)^{-1} = \sum_{n \ge 0} x^n$, one has $(1 - x)^{-2} = \big( (1 - x)^{-1} \big)' = \Big( \sum_{n \ge 0} x^n \Big)' = \sum_{n \ge 0} n x^{n-1}$.
Thus, the closed form expression is $\frac{x}{(1 - x)^2}$.
Alternate. Let $S = \sum_{n \ge 0} n x^n = x + 2x^2 + 3x^3 + \cdots$. Then, $xS = x^2 + 2x^3 + 3x^4 + \cdots$.
Hence, $(1 - x)S = \sum_{k \ge 1} x^k = x \sum_{k \ge 0} x^k = \frac{x}{1 - x}$. Thus, $S = \frac{x}{(1 - x)^2}$.
6. Determine the sum of the first $N$ positive integers.
Ans: Using previous example, note that $k = \mathrm{cf}\big[x^{k-1}, (1 - x)^{-2}\big]$. Therefore, by Proposition 5.3.10,
one has $\sum_{k=1}^{N} k = \mathrm{cf}\big[x^{N-1}, (1 - x)^{-1} \cdot (1 - x)^{-2}\big]$ and hence
\[ \sum_{k=1}^{N} k = \mathrm{cf}\big[ x^{N-1}, (1 - x)^{-3} \big] = C(N + 1, N - 1) = \frac{N(N + 1)}{2}. \]
7. Determine the sum of the squares of the first $N$ positive integers.
Ans: Recall $\sum_{n \ge 0} n x^n = \frac{x}{(1 - x)^2}$. Thus, $\sum_{n \ge 0} n^2 x^n = x \Big( \sum_{n \ge 0} n x^n \Big)' = x \Big( \frac{x}{(1 - x)^2} \Big)' = \frac{x(1 + x)}{(1 - x)^3}$.
Hence,
\begin{align*}
\sum_{k=1}^{N} k^2 &= \mathrm{cf}\Big[ x^N, \frac{1}{1 - x} \cdot \frac{x(1 + x)}{(1 - x)^3} \Big] = \mathrm{cf}\Big[ x^{N-1}, \frac{1}{(1 - x)^4} \Big] + \mathrm{cf}\Big[ x^{N-2}, \frac{1}{(1 - x)^4} \Big] \\
&= C(N + 2, N - 1) + C(N + 1, N - 2) = \frac{N(N + 1)(2N + 1)}{6}.
\end{align*}
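
The closed form obtained for the coin distribution problem in Example 5.3.11 (50 coins, 30 students, at most 4 coins each) can be compared with a direct expansion of $(1 + x + \cdots + x^4)^{30}$. This is a sketch only; both function names and the parameter names are chosen here for illustration.

```python
from math import comb

def bounded_distributions(coins, students, cap):
    """Coefficient of x^coins in (1 + x + ... + x^cap)^students, by repeated multiplication."""
    coeff = [1] + [0] * coins
    for _ in range(students):
        new = [0] * (coins + 1)
        for n in range(coins + 1):
            new[n] = sum(coeff[n - j] for j in range(min(cap, n) + 1))
        coeff = new
    return coeff[coins]

def alternating_formula(coins, students, cap):
    """Coefficient of x^coins in (1 - x^(cap+1))^students (1 - x)^(-students)."""
    step = cap + 1
    return sum((-1) ** i * comb(students, i)
               * comb(coins - step * i + students - 1, students - 1)
               for i in range(min(students, coins // step) + 1))

direct = bounded_distributions(50, 30, 4)
closed = alternating_formula(50, 30, 4)
print(direct == closed)   # True
```

For these parameters `alternating_formula` is the sum $\sum_{i=0}^{10} (-1)^i C(30, i)\,C(79 - 5i, 50 - 5i)$, since $C(79 - 5i, 29) = C(79 - 5i, 50 - 5i)$.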

Exercise 5.3.12. 1. For $n, r \in \mathbb{N}$ and $x_i \in \mathbb{N}_0$ for $1 \le i \le n$, determine the number of
solutions to $x_1 + 2x_2 + \cdots + nx_n = r$.

Ans: $\mathrm{cf}\Big[ x^r, \frac{1}{(1 - x)(1 - x^2) \cdots (1 - x^n)} \Big]$.

2. Determine $\sum_{k=0}^{\infty} \frac{1}{2^k} C(n + k - 1, k)$.

Ans: Note that $\sum_{k=0}^{\infty} \frac{1}{2^k} C(n + k - 1, k)$ equals $\sum_{k=0}^{\infty} x^k C(n + k - 1, k)$ evaluated at $x = 1/2$.
But $\sum_{k=0}^{\infty} x^k C(n + k - 1, k) = \frac{1}{(1 - x)^n}$. Hence, $\sum_{k=0}^{\infty} \frac{1}{2^k} C(n + k - 1, k) = 2^n$.

3. Find the number of nonnegative integer solutions of $a + b + c + d + e = 27$, satisfying

(a) $3 \le a \le 8$,
(b) $3 \le a, b, c, d \le 8$
(c) $c$ is a multiple of 3 and $e$ is a multiple of 4.

Ans: (a)
\begin{align*}
\mathrm{cf}\Big[ x^{27}, \frac{x^3 + \cdots + x^8}{(1 - x)^4} \Big] &= \mathrm{cf}\Big[ x^{24}, \frac{1 + \cdots + x^5}{(1 - x)^4} \Big] = \mathrm{cf}\Big[ x^{24}, \frac{1 - x^6}{(1 - x)^5} \Big] \\
&= \mathrm{cf}\Big[ x^{24}, \frac{1}{(1 - x)^5} \Big] - \mathrm{cf}\Big[ x^{18}, \frac{1}{(1 - x)^5} \Big] = C(28, 4) - C(22, 4).
\end{align*}
(b)
\begin{align*}
\mathrm{cf}\Big[ x^{27}, \frac{(x^3 + \cdots + x^8)^4}{1 - x} \Big] &= \mathrm{cf}\Big[ x^{15}, \frac{(1 - x^6)^4}{(1 - x)^5} \Big] = \mathrm{cf}\Big[ x^{15}, \frac{1 - 4x^6 + 6x^{12} - \cdots}{(1 - x)^5} \Big] \\
&= C(19, 4) - 4C(13, 4) + 6C(7, 4).
\end{align*}
(c) $\mathrm{cf}\Big[ x^{27}, \frac{1}{(1 - x)^3 (1 - x^3)(1 - x^4)} \Big]$.

4. Determine the number of ways in which 100 voters can cast their 100 votes for 10 candidates
such that no candidate gets more than 20 votes.

Ans: Need to find $\mathrm{cf}\big[ x^{100}, (1 + x + \cdots + x^{20})^{10} \big] = \mathrm{cf}\Big[ x^{100}, \frac{(1 - x^{21})^{10}}{(1 - x)^{10}} \Big]$.

5. Determine a closed form expression for $\sum_{k=1}^{N} k^3$.

Ans: Use $\sum_{n \ge 0} n^2 x^n = \frac{x(1 + x)}{(1 - x)^3}$ and Proposition 5.3.10 to get $\frac{(N(N + 1))^2}{4}$.

6. Determine a closed form expression for $\sum_{n \ge 0} \frac{n^2 + n + 6}{n!}$.

Ans: $\sum_{n \ge 0} \frac{n^2 + n + 6}{n!} = \sum_{n \ge 1} \frac{n + 1}{(n - 1)!} + \sum_{n \ge 0} \frac{6}{n!} = \sum_{n \ge 2} \frac{1}{(n - 2)!} + 2e + 6e = 9e$.

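7. Verify the following table of formal power series.

Table of Formal Power Series

$e^x = \sum_{k \ge 0} \frac{x^k}{k!}$ ;  $(1 + x)^n = \sum_{k \ge 0} C(n, k) x^k$, $n \in \mathbb{N}_0$
$\cos(x) = \sum_{r \ge 0} \frac{(-1)^r x^{2r}}{(2r)!}$ ;  $\sin(x) = \sum_{r \ge 0} \frac{(-1)^r x^{2r+1}}{(2r + 1)!}$
$\cosh(x) = \sum_{r \ge 0} \frac{x^{2r}}{(2r)!}$ ;  $\sinh(x) = \sum_{r \ge 0} \frac{x^{2r+1}}{(2r + 1)!}$

Radius of convergence: $|x| < 1$
$\log(1 - x) = -\sum_{k \ge 1} \frac{x^k}{k}$
$\frac{1}{1 - x} = \sum_{k \ge 0} x^k$ ;  $\frac{1}{(1 - x)^n} = \sum_{k \ge 0} C(n + k - 1, k) x^k$, $n \in \mathbb{N}$
$\frac{(1 + x)^n}{x^r} = \sum_{k \ge -r} C(n, r + k) x^k$ ;  $\frac{x^n}{(1 - x)^{n+1}} = \sum_{k \ge 0} C(k, n) x^k$, $n \in \mathbb{N}_0$

Radius of convergence: $|x| < \frac{1}{4}$
$\frac{1}{\sqrt{1 - 4x}} = \sum_{k \ge 0} C(2k, k) x^k$ ;  $\frac{1 - \sqrt{1 - 4x}}{2x} = \sum_{k \ge 0} \frac{1}{k + 1} C(2k, k) x^k$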

Definition 5.3.13. For $n, k \in \mathbb{N}$, let $(n_1, n_2, \ldots, n_k)$ be a partition of $n \in \mathbb{N}$ into $k$ parts. So,
$n_i \ge n_{i+1}$, for $1 \le i \le k - 1$. Then, the Ferrer's Diagram of $(n_1, n_2, \ldots, n_k)$ is a pictorial
representation (pattern) using dots in the following way: place $n_1$ dots in the first row. The $n_2$
dots in the second row are placed in such a way to cover the first $n_2$ dots of the first row and so
on (see Figure 5.1).

Example 5.3.14. 1. (1, 1, 1, 1), (2, 2), (2, 1, 1) are a few partitions of 4.
2. Ferrer's diagram for $\lambda = (5, 3, 3, 2, 1, 1)$ and $\lambda'$ are given below.

[Figure 5.1: Ferrer's diagram of $\lambda = (5, 3, 3, 2, 1, 1)$, of $\lambda' = (6, 4, 3, 1, 1)$, and the hooks of (5, 5, 4, 3, 2), whose lengths are 9 (I-hook), 7 (II-hook) and 3 (III-hook).]

3. Let $\lambda$ be a partition and $\mu$ its Ferrer's diagram. Then, the diagram $\mu'$ obtained by
interchanging the rows and columns of $\mu$ is called the conjugate of $\lambda$, denoted $\lambda'$. Thus,
the conjugate of the partition (5, 3, 3, 2, 1, 1) is (6, 4, 3, 1, 1), another partition of 15.

Definition 5.3.15. A partition $\lambda$ is said to be self conjugate if the Ferrer's diagram of $\lambda$ and
$\lambda'$ is the same.

Example 5.3.16. Find a one-one correspondence between self conjugate partitions and partitions
of $n$ into distinct odd terms.
Ans: Let $\lambda$ be a self conjugate partition with $k$ diagonal dots. For $1 \le i \le k$, define $n_i$ =
number of dots in the $i$-th 'hook' (dotted lines in Figure 5.1). Since $\lambda$ is self-conjugate, each of
the $n_i$'s is odd, and they are distinct as $n_1 > n_2 > \cdots > n_k$.
Conversely, given any partition, say $(x_1, \ldots, x_k)$ with distinct odd terms, we can get a self conjugate
partition by putting $x_1$ dots in the first 'hook', $x_2$ dots in the second 'hook' and so on. Since each
$x_i$ is odd, the hook is symmetric and $x_i + 2 \le x_{i-1}$, for $2 \le i \le k$, implies that the corresponding
diagram of dots is indeed a Ferrer's diagram and hence the result follows.

Theorem 5.3.17. [Euler: partition of n] The generating function for $\pi_n$ is
\[ \varepsilon(x) = (1 + x + x^2 + \cdots)(1 + x^2 + x^4 + \cdots) \cdots (1 + x^n + x^{2n} + \cdots) = \frac{1}{(1 - x)(1 - x^2) \cdots (1 - x^n)}. \]
Proof. Note that any partition $\lambda$ of $n$ has $m_1$ copies of 1, $m_2$ copies of 2 and so on till $m_n$
copies of $n$, where $m_i \in \mathbb{N}_0$ for $1 \le i \le n$ and $\sum_{i=1}^{n} i\, m_i = n$. Hence, $\lambda$ uniquely corresponds to
$(x^1)^{m_1} (x^2)^{m_2} \cdots (x^n)^{m_n}$ in the word-expansion of
\[ (1 + x + x^2 + \cdots)(1 + x^2 + x^4 + \cdots) \cdots (1 + x^n + x^{2n} + \cdots). \]
Thus, $\pi_n = \mathrm{cf}[x^n, \varepsilon(x)]$.
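
Theorem 5.3.17 gives a practical way to tabulate $\pi_n$: multiply out the factors $1/(1 - x^i)$ one at a time, keeping only the coefficients up to the required degree. The following sketch does this and cross-checks the result against a direct recursive count; the function names are illustrative only.

```python
def partition_numbers(limit):
    """Return [pi_0, ..., pi_limit] from the product of 1/(1 - x^i), truncated at x^limit."""
    coeff = [1] + [0] * limit
    for part in range(1, limit + 1):
        # Multiplying by 1 + x^part + x^(2 part) + ... is a running sum with step `part`.
        for n in range(part, limit + 1):
            coeff[n] += coeff[n - part]
    return coeff

def partitions_brute(n, largest=None):
    """Count partitions of n directly, choosing the largest part first."""
    if largest is None:
        largest = n
    if n == 0:
        return 1
    return sum(partitions_brute(n - k, k) for k in range(1, min(n, largest) + 1))

print(partition_numbers(10))   # [1, 1, 2, 3, 5, 7, 11, 15, 22, 30, 42]
```

Factors with $i > n$ do not affect the coefficient of $x^n$, which is why one truncated product serves all $n$ up to `limit`.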

Example 5.3.18. Let $f(n)$ be the number of partitions of $n$ in which no part is 1. Then, note
that the ogf for $f(n)$ is $(1 - x)\varepsilon(x)$. Hence, using Proposition 5.3.10, $f(n) = \pi_n - \pi_{n-1}$.

Alternate. Let $\lambda = (n_1, \ldots, n_k)$ be a partition of $n$ with $n_k = 1$. Then, $\lambda$ gives a partition
of $n - 1$, namely $(n_1, \ldots, n_{k-1})$. Conversely, if $\mu = (t_1, \ldots, t_k)$ is a partition of $n - 1$, then
$(t_1, \ldots, t_k, 1)$ is a partition of $n$ with last part 1. Hence, the required result follows.

The next result is the same idea as Theorem 5.3.17 and hence the proof is omitted.

Theorem 5.3.19. The number of partitions of $n$ with entries at most $r$ is $\mathrm{cf}\Big[ x^n, \prod_{i=1}^{r} \frac{1}{1 - x^i} \Big]$.

Corollary 5.3.20. Fix $n, r \in \mathbb{N}$. Then, the ogf for the number of partitions of $n$ into at most $r$
parts, is $\frac{1}{(1 - x)(1 - x^2) \cdots (1 - x^r)}$.

Proof. Note that by using Ferrer's diagram (taking conjugate) we see that the number of
partitions of $n$ into at most $r$ parts is same as the number of partitions of $n$ with entries at most
$r$. So, by Theorem 5.3.19, this number is $\mathrm{cf}\Big[ x^n, \prod_{i=1}^{r} \frac{1}{1 - x^i} \Big]$.

Theorem 5.3.21. [ogf of $\pi_n(r)$] Fix $n, r \in \mathbb{N}$. Then, the ogf for $\pi_n(r)$, the number of partitions
of $n$ into $r$ parts, is $\frac{x^r}{(1 - x)(1 - x^2) \cdots (1 - x^r)}$.

Proof. Consider a partition $(\lambda_1, \ldots, \lambda_r)$ of $n$. So, $n \ge r$. Assume that $\lambda_1, \ldots, \lambda_k > 1$ and
$\lambda_{k+1} = \cdots = \lambda_r = 1$. Then $(\lambda_1 - 1, \ldots, \lambda_k - 1)$ is a partition of $n - r$ into at most $r$ parts.
Conversely, if $(\mu_1, \ldots, \mu_k)$, $k \le r$, is a partition of $n - r$ into at most $r$ parts, then
$(\mu_1 + 1, \ldots, \mu_k + 1, 1, \ldots, 1)$, where 1 appears $r - k$ times, is a partition of $n$ into $r$ parts.
Thus, the number of partitions of $n$ into $r$ parts is the same as the number of partitions of $n - r$ with
at most $r$ parts. Thus, by Corollary 5.3.20 the required number is $\mathrm{cf}\Big[ x^{n-r}, \frac{1}{(1 - x)(1 - x^2) \cdots (1 - x^r)} \Big]$.
Hence, the ogf for $\pi_n(r)$ is
\[ \frac{x^r}{(1 - x)(1 - x^2) \cdots (1 - x^r)}. \]

Exercise 5.3.22. 1. For n, r ∈ N, prove that πn (r) is the number of partitions of n + C(r, 2)
into r unequal parts.
Ans: Let p be a partition of n with r parts. Add (r − 1, r − 2, . . . , 0) to get a partition of
n + C(r, 2) with r unequal parts. Conversely, given a partition of n + C(r, 2) with r unequal
parts, removing r − 1 dots from the first row, r − 2 dots from the second row and so on will
give a partition of n into r parts.
2. Let P, M ⊆ N and f (n) be the number of partitions of n where parts are from P and
multiplicities are from M . Find the generating function for the numbers f (n).

Ans: $\prod_{p \in P} \sum_{m \in M} x^{mp}$.
p∈P m∈M

Theorem 5.3.23. Suppose there are $k$ types of objects.

1. If there is an unlimited supply of each object, then the egf of the number of $r$-permutations
is $e^{kx}$.

2. If there are $m_i$ copies of $i$-th object, then the egf of the number of $r$-permutations is
\[ \Big( 1 + x + \frac{x^2}{2!} + \cdots + \frac{x^{m_1}}{m_1!} \Big) \cdots \Big( 1 + x + \frac{x^2}{2!} + \cdots + \frac{x^{m_k}}{m_k!} \Big). \]
3. Moreover, $n!\,S(r, n)$ is the coefficient of $\frac{x^r}{r!}$ in $(e^x - 1)^n$.

Proof. Part 1: Since there are unlimited supply of each object, the egf for each object corresponds
to $e^x = 1 + x + \cdots + \frac{x^n}{n!} + \cdots$. Hence, the required result follows.
Part 2: Argument is similar to that of Part 1 and is omitted.
Part 3: Recall that $n!\,S(r, n)$ is the number of surjections from $\{1, 2, \ldots, r\}$ to $X = \{s_1, \ldots, s_n\}$.
Each surjection can be viewed as word of length $r$ of elements of $X$, with each $s_i$ appearing at least
once. Thus, we need a selection of $k_i \in \mathbb{N}$ copies of $s_i$, with $\sum_{i=1}^{n} k_i = r$. Also, by Theorem 4.1.23,
this number equals $C(r; k_1, \ldots, k_n)$. Hence,
\[ n!\,S(r, n) = r!\, \mathrm{cf}\Big[ x^r, \Big( x + \frac{x^2}{2!} + \frac{x^3}{3!} + \cdots \Big)^n \Big] = \mathrm{cf}\Big[ \frac{x^r}{r!}, (e^x - 1)^n \Big]. \]
Example 5.3.24. 1. In how many ways can you get Rs 2007 using denominations 1, 10, 100, 1000 only?
Ans: $\mathrm{cf}\left[x^{2007}, \frac{1}{(1-x)(1-x^{10})(1-x^{100})(1-x^{1000})}\right]$.
2. If we use at most 9 of each denomination in Part 1, then this number is
$$\mathrm{cf}\left[x^{2007}, \left(\sum_{i=0}^{9} x^{i}\right)\left(\sum_{i=0}^{9} x^{10i}\right)\left(\sum_{i=0}^{9} x^{100i}\right)\left(\sum_{i=0}^{9} x^{1000i}\right)\right] = \mathrm{cf}\left[x^{2007}, \frac{1-x^{10000}}{1-x}\right] = 1.$$

3. Every natural number has a unique base-r representation (r ≥ 2). Note that Part 2
corresponds to the case r = 10.
4. Consider $n$ integers $k_1 < k_2 < \cdots < k_n$ with $\gcd(k_1,\ldots,k_n) = 1$. Then, the number of natural numbers not having a partition using $\{k_1,\ldots,k_n\}$ is finite.
Ans: Since $\gcd(k_1,\ldots,k_n) = 1$, there exist $\alpha_i \in \mathbb{Z}$ such that $\sum \alpha_i k_i = 1$. Let $m = \max\{|\alpha_1|,\ldots,|\alpha_n|\}$, $k = \min\{k_i\}$ and $N = km(k_1 + \cdots + k_n)$. Notice that $N, N+k, N+2k, \ldots$ can be represented as $\sum \beta_i k_i$ where $\beta_i \ge km$. For $1 \le r < k$, we have
$$N + r = km(k_1 + \cdots + k_n) + r\sum \alpha_i k_i = \sum (km + r\alpha_i)k_i,$$
where each coefficient $km + r\alpha_i \ge km - rm > 0$. Thus, each integer greater than $N$ can be represented using $k_1,\ldots,k_n$.
Determining the largest integer that cannot be represented (the Frobenius number) is the coin problem (money changing problem). The general problem is NP-hard. No closed form formula is known for $n > 3$.
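For small inputs the largest non-representable integer can be found by direct search. The following is a minimal sketch, not an efficient algorithm; the function name is hypothetical. It relies on the observation that once $\min\{k_i\}$ consecutive integers are representable, every later integer is representable too.

```python
from functools import reduce
from math import gcd

def frobenius_number(parts):
    """Largest integer that is not a nonnegative combination of `parts`.

    Returns -1 when every nonnegative integer is representable.
    """
    if reduce(gcd, parts) != 1:
        raise ValueError("the parts must have gcd 1")
    smallest = min(parts)
    representable = [True]      # representable[0]: the empty sum
    run = 0                     # current run of consecutive representable integers
    largest_gap = -1
    n = 0
    while run < smallest:
        n += 1
        ok = any(n >= p and representable[n - p] for p in parts)
        representable.append(ok)
        if ok:
            run += 1
        else:
            run = 0
            largest_gap = n
    return largest_gap

print(frobenius_number([3, 5]), frobenius_number([6, 9, 20]))
```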

Notice!
Sometimes we can obtain a recurrence relation from the generating function.
This is important, so study the next example carefully.
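Example 5.3.25. 1. Suppose $F = \frac{1}{(1-x)(1-x^{10})(1-x^{100})(1-x^{1000})} = \sum_{n\ge 0} a_n x^n$. Then, taking log and differentiating, we get
$$F' = F\left[\frac{1}{1-x} + \frac{10x^{9}}{1-x^{10}} + \frac{100x^{99}}{1-x^{100}} + \frac{1000x^{999}}{1-x^{1000}}\right].$$
So,
$$n a_n = \mathrm{cf}\left[x^{n-1}, F'\right] = \mathrm{cf}\left[x^{n-1}, F\left(\frac{1}{1-x} + \frac{10x^{9}}{1-x^{10}} + \frac{100x^{99}}{1-x^{100}} + \frac{1000x^{999}}{1-x^{1000}}\right)\right] = \sum_{k=1}^{n} a_{n-k} b_k,$$
where
$$b_k = \mathrm{cf}\left[x^{k-1}, \frac{1}{1-x} + \frac{10x^{9}}{1-x^{10}} + \frac{100x^{99}}{1-x^{100}} + \frac{1000x^{999}}{1-x^{1000}}\right] = \begin{cases} 1 & \text{if } 10 \nmid k \\ 11 & \text{if } 10 \mid k,\ 100 \nmid k \\ 111 & \text{if } 100 \mid k,\ 1000 \nmid k \\ 1111 & \text{else.} \end{cases}$$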
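2. We know that $\lim_{n\to\infty} \sum_{k=1}^{n} \frac{1}{k} = \infty$. What about $\lim_{n\to\infty} \sum_{k=1}^{n} \frac{1}{p_k}$, where $p_k$ is the $k$-th prime?
Ans: For $n > 1$, let $s_n = \sum_{k=1}^{n} \frac{1}{k}$. Then, note that
$$s_n \le \left(1 + \frac12 + \frac14 + \cdots\right)\left(1 + \frac13 + \frac19 + \cdots\right)\cdots\left(1 + \frac{1}{p_n} + \frac{1}{p_n^2} + \cdots\right) = \prod_{k=1}^{n}\left(1 + \frac{1}{p_k - 1}\right).$$
Thus,
$$\log s_n \le \log\left(\prod_{k=1}^{n}\left(1 + \frac{1}{p_k-1}\right)\right) = \sum_{k=1}^{n}\log\left(1 + \frac{1}{p_k-1}\right) \le \sum_{k=1}^{n}\frac{1}{p_k-1} \le 1 + \sum_{k=1}^{n-1}\frac{1}{p_k}.$$
As $n \to \infty$, we see that $\lim_{n\to\infty}\sum_{i=1}^{n}\frac{1}{p_i} = \infty$ as $\lim_{n\to\infty}\log s_n = \infty$.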

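3. Let $X$ be the set of natural numbers with only prime divisors 2, 3, 5, 7. Then,
$$1 + \sum_{n\in X}\frac{1}{n} = \left(1 + \frac12 + \frac14 + \cdots\right)\left(1 + \frac13 + \frac19 + \cdots\right)\cdots\left(1 + \frac17 + \frac{1}{49} + \cdots\right) = \frac{2\cdot 3\cdot 5\cdot 7}{1\cdot 2\cdot 4\cdot 6} = \frac{35}{8}.$$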
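Exercise 5.3.26. 1. Let $\sigma(n) = \sum_{d|n} d$, for $n \in \mathbb{N}$. Then, prove that $n\pi_n = \sum_{k=1}^{n} \pi_{n-k}\,\sigma(k)$.

Ans: Let $F = 1/\big((1-x)(1-x^2)\cdots\big)$. Then, the ogf for $n\pi_n$ is
$$xF'(x) = \frac{x}{1-x}F + \frac{2x^2}{1-x^2}F + \frac{3x^3}{1-x^3}F + \cdots = \left(\sum_{i\ge 1}\frac{i x^i}{1-x^i}\right)F.$$
Hence, $\mathrm{cf}[x^n, xF'(x)]$ has the form $c_1\pi_{n-1} + c_2\pi_{n-2} + \cdots + c_n\pi_0$, where $c_k$ is the coefficient of $x^k$ in $\sum_{i\ge1}\frac{ix^i}{1-x^i}$. Notice that the contribution of $\frac{ix^i}{1-x^i} = i(x^i + x^{2i} + x^{3i} + \cdots)$ to $c_k$ is $\mathrm{cf}\left[x^k, i(x^i + x^{2i} + x^{3i} + \cdots)\right]$, which is $i$ if $i \mid k$ and 0 else. Thus,
$$c_k = \mathrm{cf}\left[x^k, \sum_{i\ge1}\frac{ix^i}{1-x^i}\right] = \sum_{i|k} i = \sigma(k).$$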
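The identity $n\pi_n = \sum_{k=1}^{n}\pi_{n-k}\sigma(k)$ is easy to test numerically. In the sketch below (helper names are hypothetical), the partition numbers are obtained by expanding $\prod_{i\ge1}(1-x^i)^{-1}$ one factor at a time, and $\sigma$ is computed by trial division.

```python
def partition_numbers(limit):
    """Return [pi_0, ..., pi_limit], the numbers of partitions of 0..limit."""
    p = [1] + [0] * limit
    for part in range(1, limit + 1):          # multiply by 1/(1 - x^part)
        for total in range(part, limit + 1):
            p[total] += p[total - part]
    return p

def sigma(n):
    """Sum of the positive divisors of n."""
    return sum(d for d in range(1, n + 1) if n % d == 0)

LIMIT = 30
p = partition_numbers(LIMIT)
for n in range(1, LIMIT + 1):
    assert n * p[n] == sum(p[n - k] * sigma(k) for k in range(1, n + 1))
print(p[:11])
```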
2. A Durfee square is the largest square in a Ferrer’s diagram. Find the generating function for the number of self conjugate partitions of $n$ with a fixed size $k$ of Durfee square. Hence, show that
$$(1+x)(1+x^3)\cdots = 1 + \sum_{k=1}^{\infty}\frac{x^{k^2}}{(1-x^2)(1-x^4)\cdots(1-x^{2k})}.$$
Ans: Note that if $(a_1,\ldots,a_r)$ is a self conjugate partition with size of Durfee square $k$, then $a_{k+1} + a_{k+2} + \cdots + a_r = (a_1 - k) + (a_2 - k) + \cdots + (a_k - k)$. Thus, $(2a_{k+1}, 2a_{k+2}, \ldots, 2a_r)$ is a partition of $n - k^2$ into even parts of size at most $2k$. So, the generating function is
$$\frac{x^{k^2}}{(1-x^2)(1-x^4)\cdots(1-x^{2k})}.$$
But, by Example 5.3.16, the number of self conjugate partitions is the same as the number of partitions into distinct odd terms. Hence, the required equality follows.

3. Show that the number of partitions of $n$ into distinct terms (each term is distinct) is the same as the number of partitions of $n$ into odd terms (each term is odd).
Ans: Every positive integer can be written uniquely as $k\cdot 2^i$ with $k$ odd and $i \ge 0$. So, the ogf for the number of partitions of $n$ into distinct terms is
$$f = (1+x)(1+x^2)(1+x^3)\cdots = \prod_{i\ge0}\left(1+x^{2^i}\right) \times \prod_{i\ge0}\left(1+x^{3\cdot 2^i}\right) \times \prod_{i\ge0}\left(1+x^{5\cdot 2^i}\right) \times \cdots.$$
The ogf for the number of partitions of $n$ into odd terms is
$$g = (1 + x + x^2 + \cdots)(1 + x^3 + x^6 + \cdots)(1 + x^5 + x^{10} + \cdots)\cdots = \frac{1}{(1-x)(1-x^3)(1-x^5)\cdots}.$$
Note that $(1-x^k)\prod_{i\ge0}\left(1+x^{k2^i}\right) = (1-x^k)(1+x^k)(1+x^{2k})(1+x^{4k})\cdots = 1$ (equate coefficients of $x^n$ on both sides). Use this to verify that
$$f = f\cdot\frac{(1-x)(1-x^3)(1-x^5)\cdots}{(1-x)(1-x^3)(1-x^5)\cdots} = \frac{1}{(1-x)(1-x^3)(1-x^5)\cdots} = g.$$
4. Find the number of $r$-digit binary numbers that can be formed using an even number of 0’s and an even number of 1’s.
Ans: The number is 0 for odd $r$ and it is $\sum_{i=0}^{k} C(2k, 2i) = 2^{2k-1}$ if $r = 2k$.
Alternate. It is the coefficient of $\frac{x^r}{r!}$ in $\left(1 + \frac{x^2}{2!} + \frac{x^4}{4!} + \cdots\right)^2 = \left(\frac{e^x + e^{-x}}{2}\right)^2$, which is $\frac14\left[2^r + (-2)^r\right] = 2^{r-1}$ or 0 according as $r$ is even or odd.

5. Find the egf of the number of words of size $r$ using A, B, C, D, E,

(a) if the word has all the letters and the letter A appears an even number of times.
(b) if the word has all the letters and the first letter of the word appears an even number of times.
Ans: (a) $\left(\frac{x^2}{2!} + \frac{x^4}{4!} + \cdots\right)(e^x-1)^4 = \left(\frac{e^x + e^{-x}}{2} - 1\right)(e^x-1)^4$.

(b) The number of words of size $r$ starting with A such that A appears an even number of times in the word and every other letter also appears in the word is the same as the number of words of size $r-1$ such that A appears an odd number of times in the word and every other letter also appears in the word. The latter number is the coefficient of $\frac{x^{r-1}}{(r-1)!}$ in $\frac{e^x - e^{-x}}{2}(e^x-1)^4$ or, equivalently, the coefficient of $\frac{x^r}{r!}$ in $\int_0^x \frac{e^t - e^{-t}}{2}(e^t-1)^4\,dt$. But, the first letter can be any one of the 5 letters and hence the required egf is $5\int_0^x \frac{e^t - e^{-t}}{2}(e^t-1)^4\,dt$.
6. A permutation $\sigma$ of $\{1,2,\ldots,n\}$ is said to be connected if there does not exist $k$, $1 \le k < n$, such that $\sigma$ takes $\{1,2,\ldots,k\}$ to itself. Let $c_n$ denote the number of connected permutations of $\{1,2,\ldots,n\}$ (put $c_0 = 0$). Then show that
$$\sum_{k=1}^{n} c_k\,(n-k)! = n!.$$
Hence, derive the relationship between the generating functions of $(n!)$ and $(c_n)$.
Ans: A permutation $\sigma$ is either connected or there is a smallest $k$, $1 \le k < n$, such that $\sigma = \pi\tau$, where $\pi$ is a connected permutation of $\{1,2,\ldots,k\}$ and $\tau$ is any permutation of $\{k+1, k+2, \ldots, n\}$. So $\sum_{i=1}^{n} c_i\,(n-i)! = n!$. The ogf for $n!$ is $F(x) = 1 + x + 2!x^2 + \cdots$. Let the ogf for $c_n$ be $G(x) = c_1x + c_2x^2 + \cdots$. Then
$$F(x)G(x) = \sum_{n=1}^{\infty}\left(\sum_{i=1}^{n} c_i\,(n-i)!\right)x^n = \sum_{n=1}^{\infty} n!\,x^n = F(x) - 1.$$
Thus, $F(x)(G(x) - 1) = -1$ or $F(x) = \big(1 - G(x)\big)^{-1}$.
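The relation $\sum_{k=1}^{n} c_k (n-k)! = n!$ determines $c_n$ recursively. The sketch below (function names are hypothetical) computes $c_n$ from that relation and compares it with a brute-force count over all permutations, using the fact that $\sigma$ maps $\{1,\ldots,k\}$ to itself exactly when the maximum of its first $k$ values is $k$.

```python
from itertools import permutations
from math import factorial

def connected_counts(limit):
    """Return [c_0, ..., c_limit] from c_n = n! - sum_{k<n} c_k (n-k)!."""
    c = [0] * (limit + 1)
    for n in range(1, limit + 1):
        c[n] = factorial(n) - sum(c[k] * factorial(n - k) for k in range(1, n))
    return c

def is_connected(perm):
    """perm is a tuple containing 1..n; check that no proper prefix {1..k} is preserved."""
    running_max = 0
    for k in range(1, len(perm)):            # k = 1, ..., n-1
        running_max = max(running_max, perm[k - 1])
        if running_max == k:
            return False
    return True

def brute_force(n):
    return sum(1 for p in permutations(range(1, n + 1)) if is_connected(p))

print(connected_counts(6), [brute_force(n) for n in range(1, 7)])
```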
7. Let f (n, r) be the number of partitions of n where each part repeats less than r times.
Let g(n, r) be the number of partition of n where no part is divisible by r. Show that
f (n, r) = g(n, r).
Ans: The ogf for $f(n,r)$ is $(1 + x + x^2 + \cdots + x^{r-1}) \times (1 + x^2 + x^4 + \cdots + x^{2(r-1)}) \times \cdots$, which equals
$$\frac{(1-x^r)(1-x^{2r})(1-x^{3r})\cdots}{(1-x)(1-x^2)(1-x^3)\cdots} = \frac{1}{\prod_{i=1}^{r-1}(1-x^i)\,\prod_{i=1}^{r-1}(1-x^{r+i})\cdots}.$$
But, note that the last expression is the ogf for $g(n,r)$.

Alternate. Let $X$ and $Y$ be the sets of partitions of $n$ corresponding to $f(n,r)$ and $g(n,r)$, respectively. Take $A = X \setminus Y$ and $B = Y \setminus X$. For $p \in A$ define a partition $f(p)$ as follows: replace each part $x$ of $p$ that is divisible by $r$ with $r^k$ copies of $x/r^k$, where $k$ is the largest positive integer for which $r^k$ divides $x$. Example: $n = 22$, $r = 3$, $p = (12, 6, 3, 1)$, $f(p) = (4, 4, 4, 2, 2, 2, 1, 1, 1, 1)$.

Conversely, for $q \in B$, define $g(q)$ as follows: if $y$ appears in $q$ for $s \ge r$ times then find the base-$r$ representation $s_1 s_2 \cdots s_k$ of $s$. Delete all $y$’s from $q$ and put $s_1$ copies of $yr^{k-1}$, $\ldots$, $s_k$ copies of $yr^0$. Note that $f(g(q)) = q$.
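The equality $f(n,r) = g(n,r)$ can be confirmed for small $n$ by listing partitions. This is a brute-force sketch under the definitions above; the generator and the two counting functions are hypothetical names.

```python
from collections import Counter

def partitions(n, largest=None):
    """Yield every partition of n as a non-increasing tuple of parts."""
    if largest is None:
        largest = n
    if n == 0:
        yield ()
        return
    for first in range(min(n, largest), 0, -1):
        for rest in partitions(n - first, first):
            yield (first,) + rest

def f_count(n, r):
    """Partitions of n in which each part repeats fewer than r times."""
    return sum(1 for p in partitions(n) if all(m < r for m in Counter(p).values()))

def g_count(n, r):
    """Partitions of n in which no part is divisible by r."""
    return sum(1 for p in partitions(n) if all(part % r != 0 for part in p))

for r in (2, 3, 4):
    print(r, [(f_count(n, r), g_count(n, r)) for n in range(1, 9)])
```

For $r = 2$ this is the statement of Exercise 5.3.26.3: partitions into distinct terms versus partitions into odd terms.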
8. Find the number of 9-sequences that can be formed using 0, 1, 2, 3 in each case.
(a) The sequence has an even number of 0’s.
(b) The sequence has an odd number of 1’s and an even number of 0’s.
(c) No digit appears exactly twice.
Ans: (a)
$$9!\,\mathrm{cf}\left[x^9, \left(1 + \frac{x^2}{2!} + \frac{x^4}{4!} + \cdots\right)\left(1 + x + \frac{x^2}{2!} + \cdots\right)^3\right] = 9!\,\mathrm{cf}\left[x^9, \frac{e^x + e^{-x}}{2}\,e^{3x}\right] = \frac{9!}{2}\,\mathrm{cf}\left[x^9, e^{4x} + e^{2x}\right] = \frac{4^9 + 2^9}{2}.$$
Alternate: select an even number of places for 0 and fill the rest with 1, 2, 3. So, the answer is
$$3^9 + C(9,2)3^7 + C(9,4)3^5 + C(9,6)3^3 + C(9,8)3 = \frac{(3+1)^9 + (3-1)^9}{2}.$$
(b) $9!\,\mathrm{cf}\left[x^9, \frac{e^x + e^{-x}}{2}\cdot\frac{e^x - e^{-x}}{2}\,e^{2x}\right] = 4^8$.
(c) $9!\,\mathrm{cf}\left[x^9, \left(e^x - \frac{x^2}{2!}\right)^4\right]$.
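Since there are only $4^9 = 262144$ sequences, all three answers can be checked by enumeration. The sketch below is an illustration with hypothetical names; for part (c) it also evaluates the egf coefficient as a sum of multinomial coefficients over digit counts $(k_0, k_1, k_2, k_3)$ with no $k_i = 2$.

```python
from itertools import product
from math import factorial

LENGTH = 9
DIGITS = (0, 1, 2, 3)

count_a = count_b = count_c = 0
for seq in product(DIGITS, repeat=LENGTH):
    zeros, ones = seq.count(0), seq.count(1)
    if zeros % 2 == 0:
        count_a += 1                      # (a) even number of 0's
        if ones % 2 == 1:
            count_b += 1                  # (b) additionally an odd number of 1's
    if all(seq.count(d) != 2 for d in DIGITS):
        count_c += 1                      # (c) no digit appears exactly twice

def egf_count_c():
    """9! * [x^9] (e^x - x^2/2!)^4, written as a sum of multinomial coefficients."""
    total = 0
    for ks in product(range(LENGTH + 1), repeat=len(DIGITS)):
        if sum(ks) == LENGTH and 2 not in ks:
            ways = factorial(LENGTH)
            for k in ks:
                ways //= factorial(k)
            total += ways
    return total

print(count_a, count_b, count_c, egf_count_c())
```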

5.4 Recurrence Relation

Definition 5.4.1. A recurrence relation is a way of recursively defining the terms of a sequence as a function of preceding terms together with certain initial conditions.

Example 5.4.2. $a_n = 3 + 2a_{n-1}$ for $n \ge 1$ with the initial condition $a_0 = 1$ is a recurrence relation. Note that it completely determines the sequence $(a_n) = \{1, 5, 13, 29, 61, \ldots\}$.

Definition 5.4.3. For a sequence $(a_n)$, the first difference $d(a_n)$ is $a_n - a_{n-1}$. The $k$-th difference is $d^k(a_n) = d^{k-1}(a_n) - d^{k-1}(a_{n-1})$. A difference equation is an equation involving $a_n$ and its differences.
Example 5.4.4. 1. $a_n - d^2(a_n) = 5$ is a difference equation. But, note that it doesn’t give a recurrence relation as we don’t have any initial condition(s).
2. Every recurrence relation can be expressed as a difference equation. The difference equation corresponding to the recurrence relation $a_n = 3 + 2a_{n-1}$ is $a_n = 3 + 2(a_n - d(a_n))$.

Definition 5.4.5. A solution of a recurrence relation is a function $u(n)$, generally denoted by $u_n$, satisfying the recurrence relation.
Example 5.4.6. 1. $u_n = 2^{n+2} - 3$ is a solution of $a_n = 3 + 2a_{n-1}$ with $a_0 = 1$.
2. The Fibonacci sequence is given by $a_n = a_{n-1} + a_{n-2}$ for $n \ge 2$ with $a_0 = 0$, $a_1 = 1$. Use $\left(\frac{1+\sqrt5}{2}\right)^2 = \frac{3+\sqrt5}{2}$ and $\left(\frac{1-\sqrt5}{2}\right)^2 = \frac{3-\sqrt5}{2}$ to verify that
$$a_n = \frac{1}{\sqrt5}\left[\left(\frac{1+\sqrt5}{2}\right)^n - \left(\frac{1-\sqrt5}{2}\right)^n\right]$$
is a solution of the recurrence relation that defines the Fibonacci sequence.

Definition 5.4.7. A recurrence relation is a linear nonhomogeneous recurrence relation with constant coefficients (LNHRRCC) of order $r$ if, for a known function $f$,
$$a_n = c_1a_{n-1} + \cdots + c_ra_{n-r} + f(n), \quad\text{where } c_i \in \mathbb{R} \text{ for } 1 \le i \le r,\ c_r \ne 0. \tag{5.3}$$
If $f = 0$, then Equation (5.3) is homogeneous and is called the associated linear homogeneous recurrence relation with constant coefficients (LHRRCC).

Theorem 5.4.8. For $k \in \mathbb{N}$, let $f_i$, $1 \le i \le k$, be known functions. Consider the $k$ LNHRRCC
$$a_n = c_1a_{n-1} + \cdots + c_ra_{n-r} + f_i(n) \quad\text{for } i = 1,\ldots,k, \tag{5.4}$$
with the same set of initial conditions. If $u_i(n)$, for $1 \le i \le k$, is a solution of the $i$-th recurrence, then
$$a_n = c_1a_{n-1} + \cdots + c_ra_{n-r} + \sum_{i=1}^{k}\alpha_i f_i(n) \tag{5.5}$$
under the same set of initial conditions has $\sum_{i=1}^{k}\alpha_i u_i(n)$ as its solution.

Proof. The proof is left as an exercise for the reader.

Definition 5.4.9. Consider an LHRRCC $a_n = c_1a_{n-1} + \cdots + c_ra_{n-r}$ with $c_r \ne 0$. If $a_n = x^n$ is a solution, then either $x = 0$ or $x$ is a root of
$$x^r - c_1x^{r-1} - \cdots - c_r = 0. \tag{5.6}$$
Equation (5.6) is called the characteristic equation of the given LHRRCC. If $x_1,\ldots,x_r$ are the roots of Equation (5.6), then $a_n = x_i^n$ (and hence $a_n = \sum_{i=1}^{r}\alpha_i x_i^n$ for $\alpha_i \in \mathbb{R}$) is a solution of the given LHRRCC.

Theorem 5.4.10. [General solution: distinct roots] If the roots $x_i$, $i = 0,\ldots,r-1$, of Equation (5.6) are distinct, then every solution $u(n)$ is a linear combination of the $x_i^n$. Moreover, the solution is unique if we are given $r$ consecutive initial conditions.

Proof. Let $u(n)$ be any solution. Then, we need to show that there exist $\alpha_i \in \mathbb{R}$, $0 \le i \le r-1$, such that $u(n) = \sum_{i=0}^{r-1}\alpha_i x_i^n$. Substituting $n = 0, 1, \ldots, r-1$, one obtains the linear system
$$\begin{pmatrix} u(0) \\ u(1) \\ \vdots \\ u(r-1) \end{pmatrix} = \begin{pmatrix} 1 & \cdots & 1 \\ x_0 & \cdots & x_{r-1} \\ \vdots & \ddots & \vdots \\ x_0^{r-1} & \cdots & x_{r-1}^{r-1} \end{pmatrix} \begin{pmatrix} \alpha_0 \\ \alpha_1 \\ \vdots \\ \alpha_{r-1} \end{pmatrix}$$
in the unknowns $\alpha_i$, $0 \le i \le r-1$. Since the $x_i$ are distinct, the above $r \times r$ matrix (commonly known as the Vandermonde matrix) is invertible. So, there exist $\alpha_0,\ldots,\alpha_{r-1}$ such that $u(m) = \sum_{i=0}^{r-1}\alpha_i x_i^m$ for $0 \le m \le r-1$. Hence, we have proved the result for the first $r$ values of $u(n)$. So, let us assume that the result is true for $n < k$. Then, by definition
$$u(k) = \sum_{j=1}^{r} c_j u(k-j) = \sum_{j=1}^{r} c_j \sum_{i=0}^{r-1}\alpha_i x_i^{k-j} = \sum_{i=0}^{r-1}\alpha_i \sum_{j=1}^{r} c_j x_i^{k-j} = \sum_{i=0}^{r-1}\alpha_i x_i^k,$$
as each $x_i$ is a root of Equation (5.6) and hence $x_i^k = \sum_{j=1}^{r} c_j x_i^{k-j}$. Thus, by PMI, $u(n) = \sum_{i=0}^{r-1}\alpha_i x_i^n$ for all $n$. The uniqueness is left as an exercise for the reader.

Example 5.4.11. 1. Solve $a_n - 4a_{n-2} = 0$ for $n \ge 2$ with $a_0 = 1$ and $a_1 = 1$.

Ans: Note that $\pm 2$ are the roots of the characteristic equation, $x^2 - 4 = 0$. As the roots are distinct, the general solution is $a_n = \alpha(-2)^n + \beta 2^n$ for $\alpha, \beta \in \mathbb{R}$. The initial conditions give $\alpha + \beta = 1$ and $2\beta - 2\alpha = 1$. Hence, $\alpha = \frac14$, $\beta = \frac34$. Thus, the unique solution is $a_n = 2^{n-2}\left(3 + (-1)^n\right)$.

2. Solve $a_n = 3a_{n-1} + 4a_{n-2}$ for $n \ge 2$ with $a_0 = 1$ and $a_1 = c$, a constant.

Ans: Note that $-1$ and $4$ are the roots of the characteristic equation, $x^2 - 3x - 4 = 0$. As the roots are distinct, the general solution is $a_n = \alpha(-1)^n + \beta 4^n$ for $\alpha, \beta \in \mathbb{R}$. Now, the initial conditions imply $\alpha = \frac{4-c}{5}$ and $\beta = \frac{1+c}{5}$. Thus, the unique solution is
(a) $a_n = \frac{(4-c)(-1)^n}{5} + \frac{(1+c)4^n}{5}$, if $c \ne 4$.
(b) $a_n = 4^n$, if $c = 4$.

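3. Solve the Fibonacci recurrence $a_n = a_{n-1} + a_{n-2}$ with initial conditions $a_0 = 0$, $a_1 = 1$.

Ans: In this case, note that the roots of the characteristic equation, $x^2 - x - 1 = 0$, are $\frac{1 \pm \sqrt5}{2}$. As the roots are distinct, the general solution is $a_n = \alpha\left(\frac{1+\sqrt5}{2}\right)^n + \beta\left(\frac{1-\sqrt5}{2}\right)^n$ for $\alpha, \beta \in \mathbb{R}$. Now, using the initial conditions, we get $\alpha = \frac{1}{\sqrt5}$, $\beta = -\alpha$. Hence, the required solution is
$$a_n = \alpha\left(\frac{1+\sqrt5}{2}\right)^n + \beta\left(\frac{1-\sqrt5}{2}\right)^n = \frac{1}{\sqrt5}\left[\left(\frac{1+\sqrt5}{2}\right)^n - \left(\frac{1-\sqrt5}{2}\right)^n\right]. \tag{5.7}$$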
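The three closed forms of Example 5.4.11 can be compared with the sequences generated directly from the recurrences. The sketch below uses hypothetical helper names, takes $c = 7$ in Part 2 as an arbitrary sample value, and evaluates Equation (5.7) in floating point with rounding, which is reliable only for moderate $n$.

```python
from fractions import Fraction
from math import sqrt

def iterate(coeffs, initial, count):
    """Terms of a_n = coeffs[0]*a_{n-1} + coeffs[1]*a_{n-2} + ... from the initial terms."""
    terms = list(initial)
    while len(terms) < count:
        terms.append(sum(c * terms[-1 - j] for j, c in enumerate(coeffs)))
    return terms[:count]

def closed_part1(n):
    return Fraction(2) ** (n - 2) * (3 + (-1) ** n)

def closed_part2(n, c):
    return Fraction((4 - c) * (-1) ** n + (1 + c) * 4 ** n, 5)

def closed_fibonacci(n):
    phi, psi = (1 + sqrt(5)) / 2, (1 - sqrt(5)) / 2
    return round((phi ** n - psi ** n) / sqrt(5))

COUNT = 25
assert iterate([0, 4], [1, 1], COUNT) == [closed_part1(n) for n in range(COUNT)]
assert iterate([3, 4], [1, 7], COUNT) == [closed_part2(n, 7) for n in range(COUNT)]
assert iterate([1, 1], [0, 1], COUNT) == [closed_fibonacci(n) for n in range(COUNT)]
print(iterate([1, 1], [0, 1], 10))
```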

Theorem 5.4.12. [General solution: multiple roots] Let $t$ be a root of Equation (5.6) of multiplicity $s$. Then, $u(n) = t^n\left(\sum_{i=0}^{s-1}\alpha_i n^i\right)$, for $\alpha_i \in \mathbb{R}$, $0 \le i \le s-1$, is a solution (basic solution). In general, if $t_i$ is a root of Equation (5.6) with multiplicity $s_i$, for $i = 1,\ldots,k$, then every solution is a sum of the $k$ basic solutions.

Proof. It is given that $t$ is a zero of the polynomial $F = x^r - c_1x^{r-1} - \cdots - c_r$ of multiplicity $s$. Put $G_0 = x^{n-r}F = x^n - c_1x^{n-1} - \cdots - c_rx^{n-r}$ and $G_1 = xG_0'$, $G_2 = xG_1'$, $\ldots$, $G_{s-1} = xG_{s-2}'$. Then, each of $G_0, G_1, \ldots, G_{s-1}$ has a zero at $t$. That is, for $i = 0, 1, \ldots, s-1$, we have
$$G_i(t) = t^n n^i - c_1t^{n-1}(n-1)^i - \cdots - c_rt^{n-r}(n-r)^i = 0.$$
Thus, for any choice of $\alpha_i \in \mathbb{R}$, $0 \le i \le s-1$, if one defines $P(k) = \sum_{i=0}^{s-1} k^i\alpha_i$, for $k \ge 0$, then
$$\sum_{i=0}^{s-1}\alpha_i G_i(t) = t^nP(n) - c_1t^{n-1}P(n-1) - \cdots - c_rt^{n-r}P(n-r) = 0.$$
Thus, as $u(n) = t^nP(n)$ by definition, $u(n) - c_1u(n-1) - \cdots - c_ru(n-r) = 0$. Hence, $u(n)$ is a solution of the LHRRCC. The other part of the proof is left for the reader.

Example 5.4.13. Suppose that an LHRRCC has roots 2, 2, 3, 3, 3. Then, the general solution is given by $2^n(\alpha_1 + n\alpha_2) + 3^n(\beta_1 + n\beta_2 + n^2\beta_3)$.

Theorem 5.4.14. [LNHRRCC] Consider the LNHRRCC in Equation (5.3) and let $u_n$ be a general solution to the associated LHRRCC. If $v_n$ is a particular solution of the LNHRRCC, then $a_n = u_n + v_n$ is a general solution of the LNHRRCC.

Proof. The proof is left for the reader.

Notice!
There is no general algorithm to solve an LNHRRCC. If $f(n) = a^n$ or $n^k$ or a linear combination of these, then a particular solution can be easily obtained.

Obtaining a particular solution once the characteristic roots are known.

1. If $f(n) = a^n$ and $a$ is not a root of Equation (5.6), then $v_n = ca^n$.

2. If $f(n) = a^n$ and $a$ is a root of Equation (5.6) of multiplicity $t$, then $v_n = cn^ta^n$.
3. If $f(n) = n^k$ and 1 is not a root of Equation (5.6), then use $v_n = c_0 + c_1n + \cdots + c_kn^k$.
4. If $f(n) = n^k$ and 1 is a root of Equation (5.6) of multiplicity $t$, then $v_n = n^t(c_0 + c_1n + \cdots + c_kn^k)$.

Example 5.4.15. 1. Let $a_n = 3a_{n-1} + 2n$ for $n \ge 1$ with $a_0 = 1$.

Ans: Observe that 3 is the characteristic root of the associated LHRRCC ($a_n = 3a_{n-1}$). Thus, the general solution of the LHRRCC is $u_n = 3^n\alpha$. Note that 1 is not a characteristic root and hence a particular solution is $a + nb$, where $a$ and $b$ are to be computed using $a + nb = 3(a + (n-1)b) + 2n$. This gives $a = -\frac32$ and $b = -1$. Hence, $a_n = 3^n\alpha - n - \frac32$. Using $a_0 = 1$, check that $\alpha = \frac52$.
2. Let $a_n = 3a_{n-1} - 2a_{n-2} + 3(5)^n$ for $n \ge 3$ with $a_1 = 1$, $a_2 = 2$.
Ans: Observe that 1 and 2 are the characteristic roots of the associated LHRRCC ($a_n = 3a_{n-1} - 2a_{n-2}$). Thus, the general solution of the LHRRCC is $u_n = \alpha 1^n + \beta 2^n$. Note that 5 is not a characteristic root and thus, $v_n = c5^n$ is a particular solution of the LNHRRCC if and only if $c5^n = 3c5^{n-1} - 2c5^{n-2} + 3(5)^n$. That is, if and only if $c = 25/4$. Hence, the general solution of the LNHRRCC equals $a_n = \alpha + \beta 2^n + (25/4)5^n$, where $\alpha$ and $\beta$ are computed using the initial conditions.
3. In the above take $f(n) = 3(2^n)$. Then, we see that with $c(2)^n$ as a choice for a particular solution, we will have $4c = 6c - 2c + 12$, an absurd statement. But, with the choice $cn(2)^n$, we have $4nc = 6(n-1)c - 2(n-2)c + 12$, implying $c = 6$. Hence, the general solution of the LNHRRCC is $a_n = \alpha + \beta 2^n + 6n2^n$, where $\alpha$ and $\beta$ are computed using the initial conditions.
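The particular solutions found above can be substituted back into the recurrences. The sketch below (hypothetical names, exact arithmetic with fractions) checks the full solution of Part 1 against iteration and checks that the trial solutions of Parts 2 and 3 leave zero residual.

```python
from fractions import Fraction

# Part 1: a_n = 3 a_{n-1} + 2n with a_0 = 1, claimed solution (5/2) 3^n - n - 3/2.
def part1_closed(n):
    return Fraction(5, 2) * 3 ** n - n - Fraction(3, 2)

a = 1
for n in range(1, 15):
    a = 3 * a + 2 * n
    assert a == part1_closed(n)

# Parts 2 and 3: residual of v_n in a_n = 3 a_{n-1} - 2 a_{n-2} + f(n).
def residual(v, f, n):
    return v(n) - 3 * v(n - 1) + 2 * v(n - 2) - f(n)

def v_part2(n):
    return Fraction(25, 4) * 5 ** n       # c * 5^n with c = 25/4

def f_part2(n):
    return 3 * 5 ** n

def v_part3(n):
    return 6 * n * 2 ** n                 # c * n * 2^n with c = 6

def f_part3(n):
    return 3 * 2 ** n

assert all(residual(v_part2, f_part2, n) == 0 for n in range(2, 15))
assert all(residual(v_part3, f_part3, n) == 0 for n in range(2, 15))
print(part1_closed(1), part1_closed(2))
```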

5.5 Generating Function from Recurrence Relation

Sometimes we can find a solution to the recurrence relation using the generating function of $a_n$.
Example 5.5.1. 1. Consider $a_n = 2a_{n-1} + 1$, $a_0 = 1$.

Ans: Let $F(x) = a_0 + a_1x + \cdots$ be the generating function for $\{a_i\}$. Then,
$$F = 1 + \sum_{i=1}^{\infty} a_ix^i = 1 + \sum_{i=1}^{\infty}(2a_{i-1} + 1)x^i = \sum_{i=0}^{\infty} x^i + 2x\sum_{i=0}^{\infty} a_ix^i = \frac{1}{1-x} + 2xF.$$
Hence, $F = \frac{1}{(1-x)(1-2x)} = \frac{2}{1-2x} - \frac{1}{1-x}$. Thus, $a_n = \mathrm{cf}[x^n, F] = 2^{n+1} - 1$.
2. Find the ogf $F$ for the Fibonacci recurrence relation $a_n = a_{n-1} + a_{n-2}$, $a_0 = 0$, $a_1 = 1$.
Ans: Define $F(x) = \sum_{n\ge0} a_nx^n = \sum_{n\ge1} a_nx^n$. Then using the recurrence relation, we have
$$F(x) = \sum_{n\ge0} a_nx^n = x + \sum_{n\ge2}(a_{n-1} + a_{n-2})x^n = x + (x + x^2)F(x).$$
So, $F(x) = \frac{x}{1-x-x^2}$. Let $\alpha = \frac{1+\sqrt5}{2}$ and $\beta = \frac{1-\sqrt5}{2}$. Then it can be checked that $(1-\alpha x)(1-\beta x) = 1 - x - x^2$ and
$$F(x) = \frac{1}{\sqrt5}\left(\frac{1}{1-\alpha x} - \frac{1}{1-\beta x}\right) = \frac{1}{\sqrt5}\left(\sum_{n\ge0}\alpha^nx^n - \sum_{n\ge0}\beta^nx^n\right).$$
Therefore, $a_n = \mathrm{cf}[x^n, F(x)] = \frac{1}{\sqrt5}(\alpha^n - \beta^n)$, which equals Equation (5.7).
The next result follows using a small calculation and hence the proof is left for the reader.

Theorem 5.5.2. [Obtaining generating function from recurrence relation] Consider the $r$-th order LHRRCC given by
$$a_n = c_1a_{n-1} + \cdots + c_ra_{n-r} \quad\text{with initial conditions } a_i = A_i \text{ for } i = 0, 1, \ldots, r-1. \tag{5.8}$$
Then, the generating function of Equation (5.8) equals
$$\frac{\sum_{i=0}^{r-1} A_ix^i - c_1x\sum_{i=0}^{r-2} A_ix^i - c_2x^2\sum_{i=0}^{r-3} A_ix^i - \cdots - c_{r-1}x^{r-1}A_0}{1 - c_1x - \cdots - c_rx^r}.$$
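Theorem 5.5.2 can be tried out on small recurrences. The sketch below uses hypothetical function names: it builds the numerator coefficients from the formula and expands the rational function as a power series, so that the result can be compared with the terms produced by the recurrence itself.

```python
def gf_numerator(c, A):
    """Numerator coefficients of Theorem 5.5.2 for c = [c_1..c_r], A = [A_0..A_{r-1}]."""
    r = len(c)
    num = list(A)
    for j in range(1, r):                 # subtract c_j x^j * sum_{i=0}^{r-1-j} A_i x^i
        for i in range(r - j):
            num[i + j] -= c[j - 1] * A[i]
    return num

def expand(num, c, count):
    """First `count` coefficients of num(x) / (1 - c_1 x - ... - c_r x^r)."""
    out = []
    for n in range(count):
        term = num[n] if n < len(num) else 0
        term += sum(c[j - 1] * out[n - j] for j in range(1, len(c) + 1) if n - j >= 0)
        out.append(term)
    return out

def by_recurrence(c, A, count):
    terms = list(A)
    while len(terms) < count:
        terms.append(sum(cj * terms[-1 - j] for j, cj in enumerate(c)))
    return terms[:count]

# Fibonacci: numerator x, denominator 1 - x - x^2.
print(gf_numerator([1, 1], [0, 1]), expand(gf_numerator([1, 1], [0, 1]), [1, 1], 10))
```

For the Fibonacci recurrence the numerator is $x$, in agreement with Example 5.5.1.2.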
Example 5.5.3. 1. Find the ogf for the Catalan numbers $C_n$’s.
Ans: Let $g(x) = 1 + \sum_{n\ge1} C_nx^n$, where $C_n = \frac{C(2n,n)}{n+1} = \frac{2(2n-1)}{n+1}C_{n-1}$ with $C_0 = 1$. Then,
$$g(x) - 1 = \sum_{n\ge1} C_nx^n = \sum_{n\ge1}\frac{2(2n-1)}{n+1}C_{n-1}x^n = \sum_{n=1}^{\infty}\frac{4n+4}{n+1}C_{n-1}x^n + \sum_{n=1}^{\infty}\frac{-6}{n+1}C_{n-1}x^n = 4xg(x) + \frac{-6}{x}\int_0^x tg(t)\,dt.$$
So, $[g(x) - 1 - 4xg(x)]x = -6\int_0^x tg(t)\,dt$. Now, we differentiate with respect to $x$ to get $x(1-4x)g' + (1-2x)g = 1$. To solve the ode, we first observe that
$$\int\frac{1-2x}{x(1-4x)} = \int\left(\frac1x + \frac{2}{1-4x}\right) = \ln\frac{x}{\sqrt{1-4x}}.$$
Thus, the integrating factor of the given ode is $\frac{x}{\sqrt{1-4x}}$. Hence, the ode can be re-written as
$$g'(x)\frac{x}{\sqrt{1-4x}} + g(x)\frac{1-2x}{(1-4x)^{3/2}} = \frac{1}{(1-4x)^{3/2}} \iff \frac{d}{dx}\left[g(x)\frac{x}{\sqrt{1-4x}}\right] = \frac{1}{(1-4x)^{3/2}}.$$
Hence, $g(x)\frac{x}{\sqrt{1-4x}} = \frac{1}{2\sqrt{1-4x}} + C$, where $C \in \mathbb{R}$. Or, equivalently $2xg(x) = 1 + 2C\sqrt{1-4x}$. Note that $C = -\frac12$ as $C_0 = \lim_{x\to0} g(x) = 1$. Thus, the ogf of the Catalan numbers is
$$g(x) = \frac{1 - \sqrt{1-4x}}{2x}.$$
Alternate. Recall that $C_n$ is the number of representations of the product of $n+1$ square matrices of the same size, using $n$ pairs of brackets. From such a representation, remove the leftmost and the rightmost brackets to obtain the product of two representations of the form:
$$A_1(A_2\cdots A_{n+1}),\ (A_1A_2)(A_3\cdots A_{n+1}),\ \cdots,\ (A_1\cdots A_k)(A_{k+1}\cdots A_{n+1}),\ \cdots,\ (A_1\cdots A_n)A_{n+1}.$$
Hence, we see that
$$C_n = C_0C_{n-1} + C_1C_{n-2} + \cdots + C_{n-1}C_0. \tag{5.9}$$
Thus, if we define $g(x) = \sum_{n=0}^{\infty} C_nx^n$, then for $n \ge 1$,
$$\mathrm{cf}\left[x^{n-1}, g(x)^2\right] = \mathrm{cf}\left[x^{n-1}, \left(\sum_{n=0}^{\infty} C_nx^n\right)^2\right] = \sum_{i=0}^{n-1} C_iC_{n-1-i} = C_n \quad\text{using Equation (5.9).}$$
That is, $\mathrm{cf}\left[x^n, xg(x)^2\right] = C_n$. Hence, $g(x) = 1 + xg(x)^2$. Solving for $g(x)$, we get
$$g(x) = \frac12\left(\frac1x \pm \sqrt{\frac{1}{x^2} - \frac4x}\right) = \frac{1 \pm \sqrt{1-4x}}{2x}.$$
As the function $g$ is continuous (being a power series in the domain of convergence) and $\lim_{x\to0} g(x) = C_0 = 1$, it follows that
$$g(x) = \frac{1 - \sqrt{1-4x}}{2x}.$$
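The two recurrences used above, the ratio $C_n = \frac{2(2n-1)}{n+1}C_{n-1}$ and the convolution in Equation (5.9), can be compared with the closed formula $C(2n,n)/(n+1)$. A minimal sketch with hypothetical function names:

```python
from math import comb

def catalan_by_convolution(count):
    """C_n = C_0 C_{n-1} + C_1 C_{n-2} + ... + C_{n-1} C_0, Equation (5.9)."""
    c = [1]
    for n in range(1, count):
        c.append(sum(c[i] * c[n - 1 - i] for i in range(n)))
    return c

def catalan_by_ratio(count):
    """C_n = 2(2n-1)/(n+1) * C_{n-1}; the division is always exact."""
    c = [1]
    for n in range(1, count):
        c.append(c[-1] * 2 * (2 * n - 1) // (n + 1))
    return c

def catalan_closed(n):
    return comb(2 * n, n) // (n + 1)

COUNT = 15
assert catalan_by_convolution(COUNT) == catalan_by_ratio(COUNT)
assert catalan_by_ratio(COUNT) == [catalan_closed(n) for n in range(COUNT)]
print(catalan_by_convolution(8))
```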
2. Fix $r \in \mathbb{N}$ and let $(a_n)$ be a sequence with $a_0 = 1$ and $\sum_{k=0}^{n} a_ka_{n-k} = C(n+r, r)$, for all $n \ge 1$. Determine $a_n$.
Ans: Let $g(x) = \sum_{n\ge0} a_nx^n$. Then, note that $C(n+r, r) = C(n + (r+1) - 1, n)$. Hence,
$$g(x)^2 = \sum_{n\ge0}\left(\sum_{k=0}^{n} a_ka_{n-k}\right)x^n = \sum_{n\ge0} C(n+r, r)x^n = \sum_{n\ge0} C(n+r, n)x^n = \frac{1}{(1-x)^{r+1}}.$$
Hence, $a_n = \mathrm{cf}\left[x^n, \frac{1}{(1-x)^{(r+1)/2}}\right]$. For example, for $r = 2$ (see Equation (4.2)),
$$a_n = (-1)^nC(-3/2, n) = \frac{3\cdot5\cdot7\cdots(2n+1)}{2^n\,n!} = \frac{(2n+1)!}{2^{2n}\,n!\,n!}.$$

3. Determine the sequence $\{f(n,m) \mid n, m \in \mathbb{N}_0\}$ which satisfies $f(n,0) = 1$ for all $n \ge 0$, $f(0,m) = 0$ for all $m > 0$ and
$$f(n,m) = f(n-1,m) + f(n-1,m-1) \quad\text{for } (n,m) \ne (0,0). \tag{5.10}$$
Ans: For $n \ge 1$, define $F_n(x) = \sum_{m\ge0} f(n,m)x^m = 1 + \sum_{m\ge1} f(n,m)x^m$. Then $F_1(x) = 1 + x$ and for $n \ge 2$,
$$\begin{aligned} F_n(x) &= \sum_{m\ge0} f(n,m)x^m = 1 + \sum_{m\ge1}\big(f(n-1,m) + f(n-1,m-1)\big)x^m \\ &= 1 + \sum_{m\ge1} f(n-1,m)x^m + \sum_{m\ge1} f(n-1,m-1)x^m \\ &= F_{n-1}(x) + xF_{n-1}(x) = (1+x)F_{n-1}(x) = \cdots = (1+x)^n \end{aligned}$$
as $F_1(x) = 1 + x$. Thus,
$$f(n,m) = \mathrm{cf}[x^m, (1+x)^n] = \begin{cases} C(n,m) & \text{if } 0 \le m \le n \\ 0 & \text{if } m > n. \end{cases}$$
Alternate. For $m \ge 1$, define $G_m(y) = \sum_{n\ge0} f(n,m)y^n = \sum_{n\ge1} f(n,m)y^n$. Then, $G_1(y) = \frac{y}{(1-y)^2}$ and for $m \ge 2$, Equation (5.10) gives
$$\begin{aligned} G_m(y) &= \sum_{n\ge1} f(n,m)y^n = \sum_{n\ge1}\big(f(n-1,m) + f(n-1,m-1)\big)y^n \\ &= \sum_{n\ge1} f(n-1,m)y^n + \sum_{n\ge1} f(n-1,m-1)y^n \\ &= yG_m(y) + yG_{m-1}(y). \end{aligned}$$
Therefore, $G_m(y) = \frac{y}{1-y}G_{m-1}(y)$. As $G_1(y) = \frac{y}{(1-y)^2}$, one has $G_m(y) = \frac{y^m}{(1-y)^{m+1}}$. Thus,
$$f(n,m) = \mathrm{cf}\left[y^n, \frac{y^m}{(1-y)^{m+1}}\right] = \mathrm{cf}\left[y^{n-m}, \frac{1}{(1-y)^{m+1}}\right] = \begin{cases} C(n,m) & \text{if } 0 \le m \le n \\ 0 & \text{if } m > n. \end{cases}$$

4. Determine the sequence $\{S(n,m) \mid n, m \in \mathbb{N}_0\}$ which satisfies $S(0,0) = 1$, $S(n,m) = 0$ if either $m = 0$ or $n = 0$ but not both and
$$S(n,m) = mS(n-1,m) + S(n-1,m-1), \quad (n,m) \ne (0,0). \tag{5.11}$$
Ans: For $m \ge 1$, define $G_m(y) = \sum_{n\ge0} S(n,m)y^n = \sum_{n\ge1} S(n,m)y^n$. Then, $G_1(y) = \frac{y}{1-y}$ and for $m \ge 2$, Equation (5.11) gives
$$\begin{aligned} G_m(y) &= \sum_{n\ge0} S(n,m)y^n = \sum_{n\ge1}\big(mS(n-1,m) + S(n-1,m-1)\big)y^n \\ &= m\sum_{n\ge1} S(n-1,m)y^n + \sum_{n\ge1} S(n-1,m-1)y^n \\ &= myG_m(y) + yG_{m-1}(y). \end{aligned}$$
Therefore, $G_m(y) = \frac{y}{1-my}G_{m-1}(y)$. As $G_1(y) = \frac{y}{1-y}$, one has
$$G_m(y) = \frac{y^m}{(1-y)(1-2y)\cdots(1-my)} = y^m\sum_{k=1}^{m}\frac{\alpha_k}{1-ky}, \tag{5.12}$$
where $\alpha_k = \frac{(-1)^{m-k}k^m}{k!\,(m-k)!}$, for $1 \le k \le m$. Thus,
$$\begin{aligned} S(n,m) &= \mathrm{cf}\left[y^n, y^m\sum_{k=1}^{m}\frac{\alpha_k}{1-ky}\right] = \mathrm{cf}\left[y^{n-m}, \sum_{k=1}^{m}\frac{\alpha_k}{1-ky}\right] \\ &= \sum_{k=1}^{m}\alpha_kk^{n-m} = \sum_{k=1}^{m}\frac{(-1)^{m-k}k^n}{k!\,(m-k)!} \qquad (5.13) \\ &= \frac{1}{m!}\sum_{k=1}^{m}(-1)^{m-k}k^nC(m,k) = \frac{1}{m!}\sum_{k=0}^{m-1}(-1)^k(m-k)^nC(m,k). \end{aligned}$$
Therefore, $S(n,m) = \frac{1}{m!}\sum_{k=0}^{m-1}(-1)^k(m-k)^nC(m,k)$.
This identity is generally known as the Stirling’s Identity.
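The closed sum for $S(n,m)$ can be compared with the values produced by the recurrence in Equation (5.11). In the sketch below (hypothetical function names) the sum runs over $k = 0, \ldots, m-1$, so that the term $m^n$ is included.

```python
from math import comb, factorial

def stirling_table(limit):
    """S[n][m] for 0 <= n, m <= limit, from S(n,m) = m S(n-1,m) + S(n-1,m-1)."""
    S = [[0] * (limit + 1) for _ in range(limit + 1)]
    S[0][0] = 1
    for n in range(1, limit + 1):
        for m in range(1, limit + 1):
            S[n][m] = m * S[n - 1][m] + S[n - 1][m - 1]
    return S

def stirling_formula(n, m):
    """(1/m!) * sum_{k=0}^{m-1} (-1)^k (m-k)^n C(m,k), for n, m >= 1."""
    total = sum((-1) ** k * (m - k) ** n * comb(m, k) for k in range(m))
    return total // factorial(m)

LIMIT = 10
S = stirling_table(LIMIT)
for n in range(1, LIMIT + 1):
    for m in range(1, LIMIT + 1):
        assert S[n][m] == stirling_formula(n, m)   # includes the case n < m, where both are 0
print(S[5][1:6])
```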

Observation.

(a) Let us consider $H_n(x) = \sum_{m\ge0} S(n,m)x^m$. Then, verify that $H_n(x) = (x + xD)^n \cdot 1$ as $H_0(x) = 1$. Therefore, $H_1(x) = x$, $H_2(x) = x + x^2$, $\cdots$. Thus, we don’t have a single expression for $H_n(x)$ which gives the value of $S(n,m)$’s. But, it helps in showing that $S(n,m)$, for fixed $n \in \mathbb{N}$, first increase and then decrease (commonly called unimodal). The same holds for the sequence of binomial coefficients $\{C(n,m),\ m = 0, 1, \ldots, n\}$.
(b) As there is no restriction on $n, m \in \mathbb{N}_0$, Equation (5.13) is also valid for $n < m$. But, we know that $S(n,m) = 0$, whenever $n < m$. Hence, we get the following identity,
$$\sum_{k=1}^{m}\frac{(-1)^{m-k}k^{n-1}}{(k-1)!\,(m-k)!} = 0 \quad\text{whenever } n < m.$$

5. For $n \in \mathbb{N}$, the $n$-th Bell number, denoted $b(n)$, is the number of partitions of $\{1, 2, \ldots, n\}$. Thus, $b(n) = \sum_{m=1}^{n} S(n,m)$, for $n \ge 1$ and $b(0) = 1$. Hence, for $n \ge 1$,
$$\begin{aligned} b(n) &= \sum_{m=1}^{n} S(n,m) = \sum_{m\ge1} S(n,m) = \sum_{m\ge1}\sum_{k=1}^{m}\frac{(-1)^{m-k}k^{n-1}}{(k-1)!\,(m-k)!} \\ &= \sum_{k\ge1}\frac{k^n}{k!}\sum_{m\ge k}\frac{(-1)^{m-k}}{(m-k)!} = \frac1e\sum_{k\ge1}\frac{k^n}{k!} = \frac1e\sum_{k\ge0}\frac{k^n}{k!} \quad\text{as } 0^n = 0 \text{ for } n \ne 0. \qquad (5.14) \end{aligned}$$
Thus, Equation (5.14) is valid even for $n = 0$. As $b(n)$ has terms of the form $\frac{k^n}{k!}$, we compute its egf. Thus, if $B(x) = \sum_{n\ge0} b(n)\frac{x^n}{n!}$ then,
$$\begin{aligned} B(x) &= 1 + \sum_{n\ge1} b(n)\frac{x^n}{n!} = 1 + \sum_{n\ge1}\left(\frac1e\sum_{k\ge1}\frac{k^n}{k!}\right)\frac{x^n}{n!} \\ &= 1 + \frac1e\sum_{k\ge1}\frac{1}{k!}\sum_{n\ge1}k^n\frac{x^n}{n!} = 1 + \frac1e\sum_{k\ge1}\frac{1}{k!}\sum_{n\ge1}\frac{(kx)^n}{n!} \\ &= 1 + \frac1e\sum_{k\ge1}\frac{1}{k!}\left(e^{kx} - 1\right) = 1 + \frac1e\sum_{k\ge1}\left(\frac{(e^x)^k}{k!} - \frac{1}{k!}\right) \\ &= 1 + \frac1e\left(e^{e^x} - 1 - (e - 1)\right) = e^{e^x - 1}. \qquad (5.15) \end{aligned}$$
Recall that $e^{e^x-1}$ is a valid formal power series (see Remark 5.3.5). Taking logarithm of Equation (5.15), we get $\log B(x) = e^x - 1$. Hence, $B'(x) = e^xB(x)$, or equivalently
$$\sum_{n\ge1}\frac{b(n)x^{n-1}}{(n-1)!} = e^x\sum_{n\ge0} b(n)\frac{x^n}{n!} = \sum_{m\ge0}\frac{x^m}{m!}\cdot\sum_{n\ge0} b(n)\frac{x^n}{n!}.$$
Thus,
$$\frac{b(n)}{(n-1)!} = \mathrm{cf}\left[x^{n-1}, B'(x)\right] = \mathrm{cf}\left[x^{n-1}, \sum_{m\ge0}\frac{x^m}{m!}\cdot\sum_{n\ge0} b(n)\frac{x^n}{n!}\right] = \sum_{m=0}^{n-1}\frac{1}{(n-1-m)!}\cdot\frac{b(m)}{m!}.$$
Hence, we get $b(n) = \sum_{m=0}^{n-1} C(n-1, m)\,b(m)$, for $n \ge 1$, with $b(0) = 1$.
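The recurrence $b(n) = \sum_{m=0}^{n-1} C(n-1,m)b(m)$ and the series in Equation (5.14) give the same numbers. The sketch below (hypothetical names) computes the Bell numbers exactly from the recurrence and approximates Equation (5.14) in floating point with a truncated sum; the truncation at 60 terms is an assumption that is adequate only for small $n$.

```python
from math import comb, e, factorial

def bell_numbers(count):
    """b(0), ..., b(count-1) from b(n) = sum_{m=0}^{n-1} C(n-1, m) b(m)."""
    b = [1]
    for n in range(1, count):
        b.append(sum(comb(n - 1, m) * b[m] for m in range(n)))
    return b

def bell_by_series(n, terms=60):
    """Approximate (1/e) * sum_{k>=0} k^n / k! by its first `terms` terms."""
    return round(sum(k ** n / factorial(k) for k in range(terms)) / e)

b = bell_numbers(10)
assert b == [bell_by_series(n) for n in range(10)]
print(b)
```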
Exercise 5.5.4. 1. Find the number of binary words having neither 00 nor 111 as a subword.
Ans: Let $f(n,i)$ be the number of such words of length $n$ ending with digit $i$, $i = 0, 1$, and let $f(n) = f(n,0) + f(n,1)$. Notice that

$f(1,1) = 1$, $f(1,0) = 1$, $f(2,1) = 2$, $f(2,0) = 1$ and $f(3,1) = 2$, $f(3,0) = 2$.

For $n \ge 4$, we have $f(n,0) = f(n-1,1)$ and, as a word ending with 1 must end with either 01 or 011,

$$f(n,1) = f(n-1,0) + f(n-2,0) = f(n-2,1) + f(n-3,1).$$

Thus, we have a system of recurrence relations:

$$f(n) = f(n-1,1) + f(n-2,1) + f(n-3,1)$$
$$f(n,1) = f(n-2,1) + f(n-3,1)$$
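The system of recurrences can be compared with a direct enumeration of binary words. A small sketch (function names are hypothetical):

```python
from itertools import product

def brute_force(n):
    """Number of binary words of length n containing neither 00 nor 111."""
    count = 0
    for bits in product("01", repeat=n):
        word = "".join(bits)
        if "00" not in word and "111" not in word:
            count += 1
    return count

def by_recurrence(limit):
    """Return {n: f(n)} for 1 <= n <= limit, using the system above (limit >= 3)."""
    ends_in_one = {1: 1, 2: 2, 3: 2}            # f(n, 1)
    total = {1: 2, 2: 3, 3: 4}                  # f(n) = f(n, 0) + f(n, 1)
    for n in range(4, limit + 1):
        ends_in_one[n] = ends_in_one[n - 2] + ends_in_one[n - 3]
        total[n] = ends_in_one[n - 1] + ends_in_one[n - 2] + ends_in_one[n - 3]
    return total

totals = by_recurrence(14)
assert all(totals[n] == brute_force(n) for n in range(1, 15))
print([totals[n] for n in range(1, 11)])
```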

2. Find the number of subsets of {1, . . . , n} not containing consecutive integers.

3. Let $F_n$ be the $n$th Fibonacci number. Then, prove that $F_n$ divides $F_{nm}$ where $n, m$ are positive integers.
Ans: Recall that $F_n = \frac{1}{\sqrt5}(\alpha^n - \beta^n)$, where $\alpha = \frac{1+\sqrt5}{2}$ and $\beta = \frac{1-\sqrt5}{2}$. Since $a^n - b^n$ divides $a^{mn} - b^{mn}$, for all $m \ge 1$, the result follows.

Alternate. Notice that $F_n = F_{n-1} + F_{n-2} = F_2F_{n-1} + F_1F_{n-2} = F_3F_{n-2} + F_2F_{n-3}$. It can be easily proved by induction that
$$F_n = F_kF_{n+1-k} + F_{k-1}F_{n-k} \quad \forall k \in \{1, 2, \cdots, n\}. \tag{5.16}$$
Thus, $F_{nm} = F_nF_{nm+1-n} + F_{n-1}F_{nm-n} = F_nF_{nm+1-n} + F_{n-1}F_{n(m-1)}$ is divisible by $F_n$ if and only if $F_{n(m-1)}$ is divisible by $F_n$. The proof is over by induction.

Objects (n) distinct? | Places (r) distinct? | Places nonempty? | Relate | Number
Y | Y | Y | Onto functions | $r!\,S(n,r) = \sum_{i=0}^{r-1}(-1)^iC(r,i)(r-i)^n$
Y | Y | N | All functions | $r^n$
Y | N | Y | r-partition of a set | $S(n,r)$
Y | N | N | All partitions of a set | $b(n) = \sum_{i=1}^{r} S(n,i)$
N | Y | Y | Positive integer solutions | $C(n-1, r-1)$
N | Y | N | Nonnegative integer solutions | $C(n+r-1, r-1)$
N | N | Y | r-partition of n | $\pi_n(r) = \mathrm{cf}\left[x^{n-r}, \frac{1}{(1-x)(1-x^2)\cdots(1-x^r)}\right]$
N | N | N | Partitions of n of length ≤ r | $\sum_{i=1}^{r}\pi_n(i)$

Exercise 5.5.5. 1. In a particular semester 6 students took admission in our PhD program.
There were 9 professors who were willing to supervise these students. As a rule ‘a student
can have either one or two supervisors’. In how many ways can we allocate supervisors
to these students if all the ‘willing professors’ are to be allocated? What if we have an
additional condition that exactly one supervisor gets to supervise two students?

Ans: Assume that $\{1, 2, \ldots, 9\}$ is the set of willing professors. Since a student can have at most two supervisors, all we need is a six partition of $\{1, 2, \ldots, 9\}$ into parts of size 2, 2, 2, 1, 1, 1. So, the answer is $\frac{9!}{(2!)^3\,3!\,(1!)^3\,3!} = 4\cdot5\cdot7\cdot9$. For the other part, apply PHP to see that the answer is 0.
2. (a) Prove combinatorially that, for $n \ge 2$, we have $D_n = (n-1)(D_{n-1} + D_{n-2})$.
(b) Use Part (a) to show that the exponential generating function of $D_n$ is $\frac{e^{-x}}{1-x}$.
Ans: (a) Given a derangement $\sigma$ of $\{1, 2, \ldots, n-1\}$ we create a derangement $\mu$ of $\{1, 2, \ldots, n\}$ by selecting a $k \in \{1, 2, \ldots, n-1\}$ and by defining
$$\mu(i) = \begin{cases} \sigma(i) & \text{if } i \in \{1, 2, \ldots, n-1\},\ i \ne k, \\ n & \text{if } i = k, \\ \sigma(k) & \text{if } i = n. \end{cases}$$
This can be done in $(n-1)D_{n-1}$ ways. Note here that $k \in \mu(\{1, 2, \ldots, n-1\} \setminus \{k\})$.
Given a permutation of $\{1, 2, \ldots, n-1\}$ which fixes just one element, we can create a derangement in a similar way. This can be done in $(n-1)D_{n-2}$ ways. These derangements are different from the earlier ones as $k \notin \mu(\{1, 2, \ldots, n-1\} \setminus \{k\})$.
All the derangements of $\{1, 2, \ldots, n\}$ can be created in this way. Thus, the conclusion.
(b) Note that
$$\begin{aligned} D(x) &= \sum_{n\ge2} D_n\frac{x^n}{n!} = \sum_{n\ge2}(n-1)(D_{n-1} + D_{n-2})\frac{x^n}{n!} \\ &= \sum_{n\ge2} nD_n\frac{x^{n+1}}{(n+1)!} + \sum_{n\ge2}(n+1)D_n\frac{x^{n+2}}{(n+2)!} \\ &= \sum_{n\ge2} D_n\frac{x^{n+1}}{n!} - \sum_{n\ge2} D_n\frac{x^{n+1}}{(n+1)!} + \sum_{n\ge2} D_n\frac{x^{n+2}}{(n+1)!} - \sum_{n\ge2} D_n\frac{x^{n+2}}{(n+2)!}. \qquad (5.17) \end{aligned}$$
Verify that $\sum_{n\ge2} D_n\frac{x^{n+1}}{(n+1)!} = \int_0^x D(t)\,dt$ and hence use Equation (5.17) to form an integral equation. Now, differentiate it to get $(1-x)D' = xD$.
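Both parts can be checked numerically. The sketch below (hypothetical names) takes $D_0 = 1$ and $D_1 = 0$, generates $D_n$ from Part (a), and compares $D_n/n!$ with the coefficient of $x^n$ in $\frac{e^{-x}}{1-x}$, which is $\sum_{k=0}^{n}\frac{(-1)^k}{k!}$ because dividing a series by $1-x$ takes partial sums of its coefficients.

```python
from fractions import Fraction
from itertools import permutations
from math import factorial

def derangements(count):
    """D_0, ..., D_{count-1} from D_n = (n-1)(D_{n-1} + D_{n-2}), D_0 = 1, D_1 = 0."""
    d = [1, 0]
    for n in range(2, count):
        d.append((n - 1) * (d[n - 1] + d[n - 2]))
    return d[:count]

def egf_coefficient(n):
    """[x^n] e^{-x} / (1 - x) = sum_{k=0}^{n} (-1)^k / k!."""
    return sum(Fraction((-1) ** k, factorial(k)) for k in range(n + 1))

def brute_force(n):
    """Count permutations of 0..n-1 without fixed points."""
    return sum(1 for p in permutations(range(n)) if all(p[i] != i for i in range(n)))

d = derangements(12)
assert all(d[n] == factorial(n) * egf_coefficient(n) for n in range(12))
assert all(d[n] == brute_force(n) for n in range(8))
print(d[:8])
```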

3. My friend says that he has $n \ge 2$ subsets of $\{1, 2, \ldots, 14\}$ each of which has size 6. Give a value of $n$ so that we can guarantee ‘some two of his subsets have 3 elements in common’, without seeing his collection. What is the smallest possible value of $n$?
Ans: Let $n_i$ be the number of appearances of $i$, $1 \le i \le 14$. So, $\sum_{i=1}^{14} n_i = 6n$. Further, the contribution of $i$ to the intersections is $C(n_i, 2)$. So, the total contribution is $\sum_{i=1}^{14} C(n_i, 2) = \frac12\left[\sum_{i=1}^{14} n_i^2 - 6n\right] = \frac12\sum_{i=1}^{14} n_i^2 - 3n$. Since $\sum_{i=1}^{14} n_i = 6n$, we know that $\sum_{i=1}^{14} n_i^2 \ge 14\left(\frac{6n}{14}\right)^2$. Thus,
$$\sum_{i=1}^{14} C(n_i, 2) = \frac12\sum_{i=1}^{14} n_i^2 - 3n \ge \frac12\cdot14\left(\frac{6n}{14}\right)^2 - 3n = \frac97n^2 - 3n.$$
Since we want at least 3 intersections for one of the $i$’s, we need $\frac97n^2 - 3n > 42$. Thus, $n > 7$. Verify that $n = 8$ indeed holds. One needs to use “the minimum value of $\sum_i n_i^2$ is obtained when each pair differs by at most 1”. By PHP some intersection will get at least 3.
4. Find the number of words of size 12 made using letters from $\{A, B, C\}$ in which ‘BCA’ does not appear (as a consecutive subword). For example: ABCABCCCCCBA has an appearance of ‘BCA’ but BCCABCCABCCA does not.
Ans: Let $S_i$ be the set of words in which ‘BCA’ appears in positions $i, i+1, i+2$. Note that $\left|\bigcup_{i=1}^{10} S_i\right| = 10\cdot3^9 - f\cdot3^6 + g\cdot3^3 - 1$, where $f$ is the number of $(i_1, i_2)$ such that $i_2 \ge i_1 + 3$, $i_1, i_2 \in \{1, 2, \ldots, 10\}$ and $g$ is the number of $(i_1, i_2, i_3)$ such that $i_2 \ge i_1 + 3$, $i_3 \ge i_2 + 3$, $i_1, i_2, i_3 \in \{1, 2, \ldots, 10\}$.
Finding $f$ is as good as finding 2 elements from $\{1, 2, \ldots, 8\}$. So, it is $C(8, 2)$. Finding $g$ is as good as finding 3 elements from $\{1, 2, 3, 4, 5, 6\}$. So, it is $C(6, 3)$.
Thus, our answer to the original question is $3^{12} - 10\cdot3^9 + C(8,2)3^6 - C(6,3)3^3 + 1 = 354484$.

Alternate. Let $s_n$ be the number of such words of length $n$. To create a word of length $n+1$, we can add any letter to the end, except that adding A is not allowed when the $n$-th letter is C and the $(n-1)$-th letter is B. Thus, $s_{n+1} = 3s_n - s_{n-2}$. We have $s_1 = 3$, $s_2 = 3^2$ and $s_3 = 3^3 - 1$. Now calculate $s_{12}$. Is $S(x) = \sum_{n\ge1} s_nx^n = -1 + \frac{1}{1 - x(3 - x^2)}$?
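The recurrence $s_{n+1} = 3s_n - s_{n-2}$ can be run up to $n = 12$ and compared with direct enumeration and with the inclusion-exclusion value. A small sketch with hypothetical names, taking $s_0 = 1$ for the empty word:

```python
from itertools import product
from math import comb

def by_recurrence(length):
    """s_n = 3 s_{n-1} - s_{n-3} with s_0 = 1, s_1 = 3, s_2 = 9."""
    s = [1, 3, 9]
    for n in range(3, length + 1):
        s.append(3 * s[n - 1] - s[n - 3])
    return s[length]

def brute_force(length):
    """Count words over {A, B, C} of the given length that avoid 'BCA'."""
    return sum(1 for w in product("ABC", repeat=length) if "BCA" not in "".join(w))

inclusion_exclusion = 3**12 - 10 * 3**9 + comb(8, 2) * 3**6 - comb(6, 3) * 3**3 + 1
print(by_recurrence(12), inclusion_exclusion)
```

Enumerating all $3^{12} = 531441$ words with `brute_force(12)` is also feasible and gives the same count.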

5. Find the number of 8 letter words made using alphabets from {A, B, C, D} in which 3
consecutive letters are not allowed to be the same.
Ans: Let sn be the number of words of size n in which no three consecutive letters are the
same. Note that such a word either has different letters at n-th place and (n − 1)-th place
or has the same letter at n-th and (n − 1)-th place but a different letter at (n − 2)-th place.
Thus, sn = 3(sn−1 + sn−2 ). As s1 = 4 and s2 = 16, we have s3 = 60, s4 = 228 and so on.
We have s8 = 47088.
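Extra: if you want to go further, then put $f = s_0 + s_1 x + s_2 x^2 + s_3 x^3 + \cdots$, where $s_0 = 0$. Then $f = s_1 x + s_2 x^2 + 3x(f - s_1 x) + 3x^2 f = f(3x^2 + 3x) + 4x + 4x^2$. So, $f = \frac{4(x + x^2)}{1 - 3x^2 - 3x}$.
Put $3x^2 + 3x - 1 = 3(x - \alpha)(x - \beta)$. Note that $\frac{1}{x - \alpha} - \frac{1}{x - \beta} = \frac{\alpha - \beta}{(x - \alpha)(x - \beta)}$. Taking $\alpha = \frac{-3 - \sqrt{21}}{6}$ and $\beta = \frac{-3 + \sqrt{21}}{6}$, we have $\alpha - \beta = -\frac{\sqrt{21}}{3}$. Thus
$$f = \frac{4(x + x^2)}{1 - 3x^2 - 3x} = \frac{4(x + x^2)}{\sqrt{21}}\cdot\frac{-\sqrt{21}}{3x^2 + 3x - 1} = \frac{4(x + x^2)}{\sqrt{21}}\left(\frac{1}{x - \alpha} - \frac{1}{x - \beta}\right) = \frac{4(x + x^2)}{\sqrt{21}}\left(\frac{-1}{\alpha}\cdot\frac{1}{1 - x/\alpha} + \frac{1}{\beta}\cdot\frac{1}{1 - x/\beta}\right).$$
So, the coefficient of $x^n$ can be computed now.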

Alternate. Let $s_i$ be the number of words of length $i$ in which no three consecutive letters are the same. Put $s_0 = 1$. We count the (wrong) words with three consecutive letters the same. Let $W_i$ be the set of words in which repetition occurs for the first time at places $i, i + 1, i + 2$. Then, $|W_1| = 4^6$ and for $i \ge 2$ we have $|W_i| = s_{i-1} \cdot 3 \cdot 4^{8-i-2}$. So, the total number of wrong words is
$$w_8 = \sum_{i=1}^{6} |W_i| = 4^6 + s_1 \cdot 3 \cdot 4^4 + s_2 \cdot 3 \cdot 4^3 + s_3 \cdot 3 \cdot 4^2 + s_4 \cdot 3 \cdot 4 + s_5 \cdot 3.$$
Now $s_5 = 4^5 - 4^3 - 6 \cdot 4^2$ and $s_4 = 4^4 - 4^2 - 4 \cdot 3$. So, $s_8 = 4^8 - w_8 = 47088$.
6. We have 3 blue bags, 4 red bags and 5 green bags. We have many balls of each of the colors
blue, red and green. Fill in the blank with the smallest positive integer.
If we distribute ____ balls (without seeing the colors) into these bags, then one of the following must happen:
(a) a blue bag contains 3 blue balls or 4 red balls or 5 green balls
(b) a red bag contains 3 blue balls or 5 red balls or 7 green balls

Ans: 151. By PHP, either a blue bag contains 10 balls or a red bag contains 13 balls or a green bag contains 16 balls. If a blue bag contains 10 balls, then by PHP, it must contain either 3 blue balls or 4 red balls or 5 green balls. Other cases are similar. It is easy to see that 150 balls do not suffice, since $3 \cdot 9 + 4 \cdot 12 + 5 \cdot 15 = 150$.
7. We have an integer polynomial $f(x)$. Fill in the blank with the smallest positive integer.
If $f(x) = 2009$ has ____ many distinct integer roots, then $f(x) = 9002$ cannot have an integer root.
Ans: Let $a_1, \ldots, a_n$ be those integers for which $f(a_i) = 2009$. Suppose that we have an integer $b$ for which $f(b) = 9002$. Then, $(a_i - b)$ divides $f(a_i) - f(b) = -6993$. Thus, $a_i - b \in \pm\{1, 2, \ldots, 6993\}$. Thus, the number of $a_i$’s can at most be $2 \cdot 6993 = 13986$. Our answer is 13987. (Ok type).

Alternate. Let $a_1, \ldots, a_n$ be those integers for which $f(a_i) = 2009$. Suppose that we have an integer $b$ for which $f(b) = 9002$. Then, $(a_i - b) \mid f(a_i) - f(b) = -6993 = -3^3 \cdot 7 \cdot 37$. Thus, $a_i - b = \pm 3^{\alpha} 7^{\beta} 37^{\gamma}$, where $0 \le \alpha \le 3$ and $0 \le \beta, \gamma \le 1$. So, we can have at most 32 distinct $a_i$’s. So, the answer is $n = 33 > 32$. (Good).

Alternate. Let $a_1, \ldots, a_n$ be those integers for which $f(a_i) = 2009$. Suppose that we have an integer $b$ for which $f(b) = 9002$. Note that $f(x) - 2009$ has the $a_i$ as zeros. So, $f(x) - 2009 = g(x) \prod_i (x - a_i)$. Thus, $f(b) - 2009 = g(b) \prod_i (b - a_i)$. That is, the $b - a_i$ are distinct divisors of $6993 = 3 \cdot 3 \cdot 3 \cdot 7 \cdot 37$. The maximum number of distinct divisors of 6993 with product 6993 could be 6, for example $-37, -7, -3, -1, 1, 9$. So, we can have at most 6 distinct $a_i$’s. So, the answer is 7. (Best).
We cannot have the number 6. The function $f(x) = (x - 37)(x - 7)(x - 3)(x - 1)(x + 1)(x + 9) + 2009$ takes the value 2009 at 6 distinct places and it also takes the value 9002 at $x = 0$.
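The example polynomial can be evaluated directly. The sketch below simply transcribes $f$ from the answer; the search window of $-100$ to $100$ is an arbitrary choice that is wide enough to contain all six roots.

```python
def f(x):
    """The example polynomial from the answer above."""
    return (x - 37) * (x - 7) * (x - 3) * (x - 1) * (x + 1) * (x + 9) + 2009

# Integers in a small window where f takes the value 2009.
roots_of_2009 = [a for a in range(-100, 101) if f(a) == 2009]

# Positive divisors of 6993, as used in the 'Good' bound.
positive_divisors = [d for d in range(1, 6994) if 6993 % d == 0]

print(roots_of_2009, f(0), 2 * len(positive_divisors))
```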
8. (a) In how many ways can one distribute 10 identical chocolates among 10 students?
Ans: $C(19, 9) = \mathrm{cf}[x^{10}, (1 - x)^{-10}]$. Corresponds to a solution of $x_1 + \cdots + x_{10} = 10$.

(b) In how many ways can one distribute 10 distinct chocolates among 10 students?
Ans: $10^{10} = \mathrm{cf}[x^{10}, 10!(e^x)^{10}]$. Corresponds to a function from {1, 2, . . . , 10} (chocolates) to {1, 2, . . . , 10} (students).

(c) In how many ways can one distribute 10 distinct chocolates among 10 students so that each receives one?
Ans: $10! = \mathrm{cf}[x^{10}, 10!(e^x - 1)^{10}]$. Corresponds to all one-one functions.

(d) In how many ways can one distribute 15 distinct chocolates among 10 students so that each receives at least one?
Ans: $\mathrm{cf}[x^{15}, 15!(e^x - 1)^{10}] = \mathrm{cf}\big[x^{15}, 15!\big(x + \frac{x^2}{2!} + \cdots + \frac{x^6}{6!}\big)^{10}\big] = 10!\,S(15, 10) = \sum_{i=0}^{10} (-1)^i C(10, i)(10 - i)^{15}$. Corresponds to all onto functions from {1, 2, . . . , 15} (chocolates) to {1, 2, . . . , 10} (students).

(e) In how many ways can one distribute 10 out of 15 distinct chocolates among 10 students so that each receives one?
Ans: $P(15, 10)$. Corresponds to all one-one functions from {1, 2, . . . , 10} (students) to {1, 2, . . . , 15} (chocolates).

(f) In how many ways can one distribute 15 distinct chocolates among 10 students so that each receives at most three?
Ans: $15!\,\mathrm{cf}\big[x^{15}, \big(1 + x + \frac{x^2}{2!} + \frac{x^3}{3!}\big)^{10}\big]$. Each such distribution corresponds to a word of length 15 using at most three copies of each element of {1, 2, . . . , 10}.

(g) In how many ways can one distribute 15 distinct chocolates among 10 students so that each receives at least one and at most three?
Ans: $15!\,\mathrm{cf}\big[x^{15}, \big(x + \frac{x^2}{2!} + \frac{x^3}{3!}\big)^{10}\big]$.

(h) In how many ways can one distribute 15 identical chocolates among 10 students so that each receives at most three?
Ans: $\mathrm{cf}[x^{15}, (1 + x + x^2 + x^3)^{10}]$.
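The identity in part (d), $10!\,S(15, 10) = \sum_i (-1)^i C(10, i)(10 - i)^{15}$, can be confirmed numerically. The sketch below is illustrative only: `stirling2` uses the standard recurrence $S(n, r) = rS(n-1, r) + S(n-1, r-1)$, and both function names are assumptions made here.

```python
from math import comb, factorial

def stirling2(n, r):
    """Stirling number of the second kind via S(n, r) = r S(n-1, r) + S(n-1, r-1)."""
    table = [[0] * (r + 1) for _ in range(n + 1)]
    table[0][0] = 1
    for i in range(1, n + 1):
        for j in range(1, min(i, r) + 1):
            table[i][j] = j * table[i - 1][j] + table[i - 1][j - 1]
    return table[n][r]

def onto_count(n, r):
    """Number of onto functions from an n-set to an r-set, by inclusion-exclusion."""
    return sum((-1) ** i * comb(r, i) * (r - i) ** n for i in range(r + 1))

print(onto_count(15, 10) == factorial(10) * stirling2(15, 10))
```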

9. (a) In how many ways can one carry 15 distinct objects in 10 identical bags? Answer using $S(n, r)$.
Ans: $\sum_{i=1}^{10} S(15, i)$. Each partition of {1, 2, . . . , 15} into $i$ nonempty subsets corresponds to one way of carrying.
(b) In how many ways can one carry 15 distinct objects in 10 identical bags with no empty bag? Answer using $S(n, r)$.
Ans: $S(15, 10)$. Each partition of {1, 2, . . . , 15} into 10 nonempty subsets corresponds to one way of carrying.
(c) In how many ways can one carry 15 distinct objects in 10 identical bags with each bag containing at most three objects?
Ans: $15!\,\mathrm{cf}\big[x^{15}, \big(1 + x + \frac{x^2}{2!} + \frac{x^3}{3!}\big)^{10}\big]$.
(d) In how many ways can one carry 15 identical objects in 10 identical bags?
Ans: $\sum_{i=1}^{10} \pi_{15}(i)$.
(e) In how many ways can one carry 15 identical objects in 10 identical bags with no empty bag?
Ans: $\pi_{15}(10)$.
(f) In how many ways can one carry 15 identical objects in 20 identical bags?
Ans: $\pi_{15} = \mathrm{cf}\big[x^{15}, \frac{1}{(1 - x)(1 - x^2) \cdots (1 - x^{20})}\big]$ or $\pi_{15} = \mathrm{cf}\big[x^{15}, \frac{1}{(1 - x)(1 - x^2) \cdots (1 - x^{15})}\big]$.

10. What is the number of integer solutions of x + y + z = 10, with x ≥ −1, y ≥ −2 and
z ≥ −3?

Ans: C(18, 2).

11. Is the number of solutions of x + y + z = 10 in nonnegative multiples of 1/2 (x, y, z are allowed to be 0, 1/2, 1, 3/2, . . .) at most four times the number of nonnegative integer solutions of x + y + z = 10?

Ans: Yes. Compare C(22, 2) with 4C(12, 2).

12. How many words of length 8 can be formed using the letters of the English alphabet, where each letter can appear at most twice? Give the answer using a generating function.
Ans: $\mathrm{cf}\big[x^8, 8!\big(1 + x + \frac{x^2}{2}\big)^{26}\big]$.
13. Let $p_1, \ldots, p_n$, $n \ge 2$, be distinct prime numbers. Consider the set $\{p_1, \ldots, p_n, p_1^2, \ldots, p_n^2\}$. In how many ways can we partition the set into subsets of size two such that no prime is in the same subset as its square?
Ans: $D_n = n! \sum_{k=0}^{n} \frac{(-1)^k}{k!}$.
14. What is the value of $\sum_{k=0}^{15} (-1)^k C(15, k)(15 - k)^5$?

Ans: $15!\,S(5, 15) = 0 = 5!\,\mathrm{cf}[x^5, (e^x - 1)^{15}]$.

15. Give your answers using generating function.

(a) What is the number of partitions of $n$ with entries at most $r$?
Ans: $\mathrm{cf}\big[x^n, \prod_{i=1}^{r} \frac{1}{1 - x^i}\big]$.
(b) What is the number of partitions of $n$ with at most $r$ parts?
Ans: $\mathrm{cf}\big[x^n, \prod_{i=1}^{r} \frac{1}{1 - x^i}\big]$.
(c) What is the number of partitions of $n$ with exactly $r$ parts ($\pi_n(r)$)?
Ans: $\mathrm{cf}\big[x^n, x^r / \prod_{i=1}^{r} (1 - x^i)\big] = \mathrm{cf}\big[x^{n-r}, \prod_{i=1}^{r} \frac{1}{1 - x^i}\big]$.
(d) What is the number of partitions of $n + C(r, 2)$ with $r$ distinct parts?
Ans: $\mathrm{cf}\big[x^n, x^r / \prod_{i=1}^{r} (1 - x^i)\big] = \mathrm{cf}\big[x^{n-r}, \prod_{i=1}^{r} \frac{1}{1 - x^i}\big]$.
(e) What is the number of partitions of $n$ with distinct entries?
Ans: $\mathrm{cf}\big[x^n, \prod_{i=1}^{\infty} (1 + x^i)\big] = \mathrm{cf}\big[x^n, \frac{1}{(1 - x)(1 - x^3)(1 - x^5) \cdots}\big]$.
(f) What is the number of partitions of $n$ with entries odd?
Ans: $\mathrm{cf}\big[x^n, \frac{1}{(1 - x)(1 - x^3)(1 - x^5) \cdots}\big] = \mathrm{cf}\big[x^n, \prod_{i=1}^{\infty} (1 + x^i)\big]$.
(g) What is the number of partitions of $n$ with distinct odd entries?
Ans: $\mathrm{cf}[x^n, (1 + x)(1 + x^3)(1 + x^5) \cdots]$.

(h) What is the number of partitions of $n$ which are self conjugate?

Ans: $\mathrm{cf}[x^n, (1 + x)(1 + x^3)(1 + x^5) \cdots]$.
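Parts (e) and (f) assert that the two products have the same coefficients. The sketch below expands both as truncated power series and compares them; the function names and the truncation bound are illustrative choices, not part of the notes.

```python
def distinct_parts(n):
    """Coefficients up to x^n of prod_{i>=1} (1 + x^i): partitions into distinct parts."""
    coeffs = [1] + [0] * n
    for i in range(1, n + 1):
        # Descending k ensures that the part i is used at most once.
        for k in range(n, i - 1, -1):
            coeffs[k] += coeffs[k - i]
    return coeffs

def odd_parts(n):
    """Coefficients up to x^n of prod_{i odd} 1 / (1 - x^i): partitions into odd parts."""
    coeffs = [1] + [0] * n
    for i in range(1, n + 1, 2):
        # Ascending k allows the part i to repeat any number of times.
        for k in range(i, n + 1):
            coeffs[k] += coeffs[k - i]
    return coeffs

print(distinct_parts(10))
print(odd_parts(10))
```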

16. How many words of length 15 are there using the letters A,B,C,D,E such that each letter must appear in the word and A appears an even number of times? Give your answers using generating function.
Ans: $15!\,\mathrm{cf}\big[x^{15}, \big(\frac{x^2}{2!} + \frac{x^4}{4!} + \cdots\big)(e^x - 1)^4\big]$.

17. The characteristic roots of a LHRRCC are 2, 2, 2, 3, 3. What is the form of the general solution?
Ans: $2^n(a_0 + a_1 n + a_2 n^2) + 3^n(b_0 + b_1 n)$.

18. Consider the LNHRRCC $a_n = c_1 a_{n-1} + \cdots + c_r a_{n-r} + 5^n$. Give a particular solution.

Ans: $v_n = c\,n^t 5^n$, where $t$ is the multiplicity of 5 as a root of the characteristic equation of the LHRRCC.
19. Obtain the ogf for $a_n$, where $a_n = 2a_{n-1} - a_{n-2} + 2^n$, $a_0 = 0$, $a_1 = 1$.
Ans: Let $F = a_0 + a_1 x + \cdots$. So,
$$F = x + (2a_1 - a_0 + 2^2)x^2 + (2a_2 - a_1 + 2^3)x^3 + \cdots = x + 2x \sum_{i=1}^{\infty} a_i x^i - x^2 \sum_{i=0}^{\infty} a_i x^i + \sum_{i=2}^{\infty} (2x)^i = x + 2xF - x^2 F + \frac{1}{1 - 2x} - 1 - 2x.$$
So $F(1 - x)^2 = \frac{x(1 + 2x)}{1 - 2x}$ and $F = \frac{x(1 + 2x)}{(1 - x)^2 (1 - 2x)}$.
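One way to test the closed form of $F$ is to multiply the series $\sum a_n x^n$ by the denominator $(1 - x)^2(1 - 2x) = 1 - 4x + 5x^2 - 2x^3$ and check that only the numerator $x + 2x^2$ survives. The sketch below does this for the first few terms; the function names are illustrative.

```python
def sequence(n_terms):
    """First n_terms values of a_n = 2 a_{n-1} - a_{n-2} + 2^n with a_0 = 0, a_1 = 1."""
    a = [0, 1]
    for n in range(2, n_terms):
        a.append(2 * a[n - 1] - a[n - 2] + 2 ** n)
    return a[:n_terms]

def times_denominator(a):
    """Coefficients of (1 - 4x + 5x^2 - 2x^3) * sum(a_n x^n), truncated to len(a) terms."""
    d = [1, -4, 5, -2]  # expansion of (1 - x)^2 (1 - 2x)
    return [
        sum(d[j] * a[k - j] for j in range(len(d)) if k - j >= 0)
        for k in range(len(a))
    ]

print(times_denominator(sequence(10)))
```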

20. Solve the recurrence relation $a_n = 2a_{n-1} - a_{n-2} + 2^n + 5$, $a_0 = 0$, $a_1 = 1$.

Ans: Ch roots: 1, 1. GS for LHRRCC: $u_n = 1^n(a + bn)$. PS for $f(n) = 2^n$ is $v_n = c2^n$. Substituting $v_n$ in the RR with $f(n) = 2^n$, we get $c = 4$. PS for $f(n) = 1$ is $w_n = dn^2$. Substituting $w_n$ in the RR with $f(n) = 1$, we get $d = \frac{1}{2}$. So, GS for LNHRRCC is $a + bn + 2^{n+2} + \frac{5}{2}n^2$. Using initial conditions, we get $a = -4$ and $b = -\frac{11}{2}$. So, the solution to the above LNHRRCC is $a_n = -4 - \frac{11}{2}n + \frac{5}{2}n^2 + 2^{n+2}$.
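The closed form can be compared against the recurrence term by term. This is a small sketch with illustrative function names; exact fractions are used so that the halves in the formula do not introduce rounding.

```python
from fractions import Fraction

def closed_form(n):
    """a_n = -4 - (11/2) n + (5/2) n^2 + 2^(n+2), evaluated exactly."""
    return -4 - Fraction(11, 2) * n + Fraction(5, 2) * n * n + 2 ** (n + 2)

def by_recurrence(n_terms):
    """First n_terms values of a_n = 2 a_{n-1} - a_{n-2} + 2^n + 5 with a_0 = 0, a_1 = 1."""
    a = [0, 1]
    for n in range(2, n_terms):
        a.append(2 * a[n - 1] - a[n - 2] + 2 ** n + 5)
    return a[:n_terms]

print(by_recurrence(6))
print([closed_form(n) for n in range(6)])
```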
21. My class has $n$ CSE, $m$ MSC and $r$ MC students. Suppose that $t$ copies of the same book are to be distributed so that each branch gets at least $s$. In how many ways can this be done, if each student gets at most one? In how many ways can this be done, without the previous restriction? Answer only using generating function.
Ans: If we assume that each student gets at most one book, then a typical distribution can be obtained in the following way:

$k_1$ copies for CSE | $k_2$ copies for MSC | $k_3$ copies for MC
choose $k_1$ students of CSE in $C(n, k_1)$ ways | choose $k_2$ students of MSC in $C(m, k_2)$ ways | choose $k_3$ students of MC in $C(r, k_3)$ ways

So, the answer is
$$\mathrm{cf}\Big[x^t, \big(C(n, s)x^s + C(n, s+1)x^{s+1} + \cdots\big) \times \big(C(m, s)x^s + C(m, s+1)x^{s+1} + \cdots\big) \times \big(C(r, s)x^s + C(r, s+1)x^{s+1} + \cdots\big)\Big].$$

If we relax this assumption then a typical distribution can be obtained in the following way:

$k_1$ copies for CSE | $k_2$ copies for MSC | $k_3$ copies for MC
distribute them in $C(n + k_1 - 1, n - 1)$ ways | distribute them in $C(m + k_2 - 1, m - 1)$ ways | distribute them in $C(r + k_3 - 1, r - 1)$ ways

So, our answer is
$$\mathrm{cf}\Big[x^t, \big(C(s + n - 1, n - 1)x^s + C(s + n, n - 1)x^{s+1} + \cdots\big) \times \big(C(s + m - 1, m - 1)x^s + C(s + m, m - 1)x^{s+1} + \cdots\big) \times \big(C(s + r - 1, r - 1)x^s + C(s + r, r - 1)x^{s+1} + \cdots\big)\Big].$$

Exercise 5.5.6. 1. My class has $n$ CSE, $m$ MSC and $r$ MC students. Suppose that $t$ distinct books are to be distributed so that each branch gets at least $s$. In how many ways can this be done, if each student gets at most one? In how many ways can this be done, without the previous restriction? Answer only using generating function.
Ans: If we assume that each student gets at most one book then a typical distribution can be obtained in the following way:

Choose $k_1$ books for CSE in $C(t, k_1)$ ways | Choose $k_2$ books for MSC in $C(t - k_1, k_2)$ ways | distribute the remaining $k_3$ in $P(r, k_3)$ ways
distribute them in $P(n, k_1)$ ways | distribute them in $P(m, k_2)$ ways |

A corresponding term looks like $C(t, k_1)P(n, k_1)C(t - k_1, k_2)P(m, k_2)P(r, k_3)x^{k_1}x^{k_2}x^{k_3} = t!\,C(n, k_1)C(m, k_2)C(r, k_3)x^{k_1}x^{k_2}x^{k_3}$.
So, our answer is
$$\mathrm{cf}\Big[x^t, t!\big(C(n, s)x^s + C(n, s+1)x^{s+1} + \cdots\big) \times \big(C(m, s)x^s + C(m, s+1)x^{s+1} + \cdots\big) \times \big(C(r, s)x^s + C(r, s+1)x^{s+1} + \cdots\big)\Big].$$

If we relax this assumption then a typical distribution can be obtained in the following way:

Choose $k_1$ books for CSE in $C(t, k_1)$ ways | Choose $k_2$ books for MSC in $C(t - k_1, k_2)$ ways | distribute the remaining $k_3$ in $r^{k_3}$ ways
distribute them in $n^{k_1}$ ways | distribute them in $m^{k_2}$ ways |

A corresponding term looks like $C(t, k_1)n^{k_1}C(t - k_1, k_2)m^{k_2}r^{k_3}x^{k_1}x^{k_2}x^{k_3} = t!\,\frac{n^{k_1} m^{k_2} r^{k_3}}{k_1!\,k_2!\,k_3!}x^{k_1}x^{k_2}x^{k_3}$.
So, our answer is
$$\mathrm{cf}\Big[x^t, t!\Big(n^s\frac{x^s}{s!} + n^{s+1}\frac{x^{s+1}}{(s+1)!} + \cdots\Big) \times \Big(m^s\frac{x^s}{s!} + m^{s+1}\frac{x^{s+1}}{(s+1)!} + \cdots\Big) \times \Big(r^s\frac{x^s}{s!} + r^{s+1}\frac{x^{s+1}}{(s+1)!} + \cdots\Big)\Big].$$
2. My class has $N$ students. Assume that, to conduct an exam, we have $M$ identical answer scripts. In how many ways can we distribute the answer scripts so that each student gets at least 2? Answer only using generating function.

Ans: $\mathrm{cf}\big[x^M, (x^2 + x^3 + \cdots)^N\big] = C(M - N - 1, N - 1)$.
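For small values the closed form $C(M - N - 1, N - 1)$ can be compared with a direct enumeration. The sketch below is illustrative; the guard `M >= 2 * N` is an assumption added here, since no distribution exists when there are fewer than $2N$ scripts.

```python
from itertools import product
from math import comb

def brute_force(N, M):
    """Count tuples (x_1, ..., x_N) with every x_i >= 2 and x_1 + ... + x_N = M."""
    return sum(1 for xs in product(range(2, M + 1), repeat=N) if sum(xs) == M)

def formula(N, M):
    """Closed form C(M - N - 1, N - 1); taken to be 0 when M < 2N."""
    return comb(M - N - 1, N - 1) if M >= 2 * N else 0

print(brute_force(3, 10), formula(3, 10))
```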
3. My class has $N$ students. Assume that, for an exam, we have $M$ questions; each student answers all the questions in an order decided by him/her (for example one can follow $1, 2, \ldots, M$ and another can follow $M, M - 1, \ldots, 1$). In how many ways can it happen that some three or more students have followed the same order? Answer only using generating function.
Ans: Assume that there are $n$ students and $k$ types of objects (each type unlimited in number). Then the number of ways of allocating one object to each student is $\mathrm{cf}[x^n, n!e^{kx}]$ (in generating function way) $= k^n$ (direct answer). Suppose we want that no type of object is repeated more than twice. Then the number of ways is $\mathrm{cf}[x^n, n!(1 + x + x^2/2!)^k]$. So, if we want at least one type to repeat at least thrice, then the answer is $\mathrm{cf}[x^n, n!e^{kx}] - \mathrm{cf}[x^n, n!(1 + x + x^2/2!)^k]$. In our question $n = N$ and $k = M!$, and hence the required answer is $N!\,\mathrm{cf}\big[x^N, e^{M!x}\big] - N!\,\mathrm{cf}\big[x^N, \big(1 + x + \frac{x^2}{2!}\big)^{M!}\big]$.
4. When ‘Freshers Welcome’ was organized, 11 teachers went to attend. There were 4 types of soft drinks available. In how many ways can a total of 18 glasses of soft drinks be served to them, in general? Answer only using generating function.
Ans: $\mathrm{cf}[x^{18}, (1 - x)^{-44}] = C(44 + 18 - 1, 18)$.

Chapter 6

Introduction to Logic

6.1 Propositional Logic

We study logic to differentiate between valid and invalid arguments. An argument is a set of
statements which has two parts: premise and conclusion. There can be many statements in the
premise. Conclusion is just one statement. An argument has the structure
premise: Statement1 , . . ., Statementk ; therefore conclusion: Statementc .
Consider the following examples.

• Statement1 : If today is Monday, then Mr. X gets Rs. 5.
Statement2 : Today is Monday.
Statementc : Therefore, Mr. X gets Rs. 5.

• Statement1 : If today is Monday, then Mr. X gets Rs. 5.
Statement2 : Mr. X gets Rs. 5.
Statementc : Therefore, today is Monday.

• Statement1 : If today is Monday, then Mr. X gets Rs. 5.
Statement2 : Today is Tuesday.
Statementc : Therefore, Mr. X gets Rs. 5.

• Statement1 : If today is Monday, then Mr. X gets Rs. 5.
Statement2 : Today is Tuesday.
Statementc : Therefore, Mr. X does not get Rs. 5.

We understand that the first one is a valid argument, whereas the next three are not. In order to differentiate between valid and invalid arguments, we need to analyze an argument. And in order to do that, we first have to understand what a statement is. A simple statement is an expression which is either false or true but not both. We create complex statements from simple ones by using ‘and’, ‘or’ and ‘not’.
For example, ‘today is Monday’ is a statement. ‘Today is Tuesday’ is also a statement. ‘Today
is Monday and today is Tuesday’ is also a statement. ‘Today is not Monday’ is also a statement.
One way to analyze an argument is by writing it using symbols. The following definition
captures the notion of a ‘statement’.


Definition 6.1.1. 1. [Atomic formulae and truth values] Consider a nonempty finite set of symbols F. We shall call an element of F an atomic formula (also called an atomic variable). (These are our simple statements.) The truth value of each element in F is exactly one of T (for TRUE) and F (for FALSE). Normally, we use symbols p, q, p1 , p2 , . . . for atomic formulae.
2. [Operations to create new formulae] We use three symbols ‘∨’ (called disjunction/or), ‘∧’ (called conjunction/and), and ‘¬’ (called negation) to create new formulae. The way they are used and the way we attribute the truth value to such a new formula is described below.

If p and q are formulae, then p ∧ q, p ∨ q, and ¬p are formulae. The truth value of p ∧ q is defined to be T when the truth values of both p and q are T . Its truth value is defined to be F in all other cases. The truth value of p ∨ q is defined to be T when the truth value of at least one of p and q is T . Its truth value is defined to be F when the truth values of both p and q are F . The truth value of ¬p is defined to be T if the truth value of p is F . The truth value of ¬p is defined to be F if the truth value of p is T .

Understanding ∨, ∧ and ¬
The following tables describe how we attribute the truth values to p ∨ q, p ∧ q and ¬p.

p  q  p∧q        p  q  p∨q        p  ¬p
T  T   T         T  T   T         T   F
T  F   F         T  F   T         F   T
F  T   F         F  T   T
F  F   F         F  F   F

How do we read these tables? Look at row 3 of the leftmost table (exclude the header). It
tells that the formula p ∧ q takes the truth value F if p takes truth value F and q takes T .

Remark 6.1.2. We use brackets while creating new formulae to make the meaning unambiguous. For example, the expression p ∨ q ∧ r is ambiguous, whereas p ∨ (q ∧ r) is unambiguous.
Definition 6.1.3. 1. Sometimes we write ‘f (p1 , . . . , pk ) is a formula’ to mean that ‘f is a
formula involving the atomic formulae p1 , . . . , pk ’.
2. Let f (p1 , . . . , pk ) be a formula. Then, the truth value of f is determined based on the truth values of the atomic formulae p1 , . . . , pk . Since there are 2 choices of truth value for each pi , 1 ≤ i ≤ k, there are 2^k ways of assigning truth values to these atomic formulae. An assignment of truth values to these atomic formulae is nothing but a function A : {p1 , . . . , pk } → {T, F }.
3. By saying ‘T F T is an assignment to the atomic variables p, q, r’, we mean that the truth
value of p is T , that of q is F and that of r is T . Keeping this in mind, all possible
assignments to p, q, r are listed below. (Notice that, it is in the dictionary order, that is,
‘F F F appears before F F T in the list as if they are words in a dictionary’. The reader will
notice that in the table given above, we have followed the reverse dictionary order while
writing a truth table, which is natural to us. This should not create any confusion.)
p q r
F F F
F F T
F T F
F T T
T F F
T F T
T T F
T T T

4. A truth table for a formula f (p1 , . . . , pk ) is a table which systematically lists the truth values of f under every possible assignment of truth values to the involved atomic formulae. The following is a truth table for the formula p ∨ (q ∧ r).

p  q  r  q∧r  p∨(q∧r)
F  F  F   F      F
F  F  T   F      F
F  T  F   F      F
F  T  T   T      T
T  F  F   F      T
T  F  T   F      T
T  T  F   F      T
T  T  T   T      T

5. In the previous table, if we fill the last column arbitrarily using T ’s and F ’s, will it be a truth table of some formula involving p, q and r? We shall talk about it later.
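A truth table such as the one for p ∨ (q ∧ r) can be generated mechanically by running over all assignments in dictionary order. The following is a minimal sketch; representing a formula as a Python function of an assignment dictionary is an assumption made here for illustration, not a convention of the notes.

```python
from itertools import product

def truth_table(variables, formula):
    """List (assignment, value) pairs in dictionary order, with F before T."""
    rows = []
    for values in product([False, True], repeat=len(variables)):
        assignment = dict(zip(variables, values))
        rows.append((values, formula(assignment)))
    return rows

def show(value):
    """Render a boolean as T or F."""
    return 'T' if value else 'F'

# The formula p ∨ (q ∧ r) from item 4.
table = truth_table(['p', 'q', 'r'], lambda a: a['p'] or (a['q'] and a['r']))
for values, result in table:
    print(' '.join(show(v) for v in values), show(result))
```

With k variables the loop visits 2^k assignments, which matches the count in item 2.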

We have already noted that we use ∨, ∧ and ¬ to create new formulae from old ones. Some of
them will indeed be very important.

Definition 6.1.4. [Conditional formulae]

1. [p implies q] If p and q are formulae, then the formula (¬p) ∨ q is denoted by p → q (read
as p implies q). Its truth table is

p q (¬p) ∨ q
T T T
T F F
F T T
F F T
Observe
a) p → q takes the truth value F if and only if p takes the truth value T and q takes
the truth value F .
b) If under some assignment ‘p → q takes the truth value T ’ and that ‘in this assignment
p is T ’, then it follows that in this assignment q must be T . This is why p → q is called
‘if p then q’.
c) Other phrases used for ‘if p then q’ are ‘p is sufficient for q’ or ‘p only if q’ or ‘q is a
necessary condition for p’.
d) We sometimes use p ← q to mean q → p.

2. [p if and only if q] The formula (p ↔ q) (called ‘p if and only if q’) means (p → q)∧(q → p).
Note that (p ↔ q) takes the truth value ‘T whenever p and q take the same truth values’
and takes the truth value ‘F whenever p and q take different truth values’. Its truth table
is
p q p↔q
T T T
T F F
F T F
F F T

3. [Converse/Contrapositive] The formula q → p is called the converse of p → q and the

formula ¬q → ¬p is called the contrapositive of p → q.

Discussion 6.1.5. [Understanding a conditional formula] When we assign different ‘English

statements’ to the involved atomic formulae, we get an English statement corresponding to those
formulae. For example, for the formula p → q, consider the following statements:
p: you attend the class.
q: you understand the subject.
Then, p → q is the statement ‘if you attend the class, then you understand the subject’. The
formula p → q is true under the following three cases.
1. p is true and q is true: this means ‘you attend the class and understand the subject’.
2. p is false and q is false: this means ‘you do not attend the class and do not understand
the subject’.
3. p is false and q is true: this means ‘you do not attend the class but understand the subject’.

The formula p → q is false if ‘p is true and q is false’, which means ‘you attend the class and do
not understand the subject’.

Definition 6.1.6. [Connectives] The symbols ∨, ∧, ¬, → and ↔ are called connectives. The set of well formed formulae (wff) is defined inductively. Each atomic variable is a wff. If f and g are two wff, then f ∨ g, f ∧ g, ¬f , f → g, and f ↔ g are wff. Brackets are used to avoid ambiguity.
Example 6.1.7. 1. p ∧ ∨q, ∨q, p ∨ q∧ are not wff, as they do not make sense.

2. p ∨ q ∧ r is not a wff as it is not clear what it means. We use brackets to get p ∨ (q ∧ r) or (p ∨ q) ∧ r, which are wff.

3. (p → q) → r, (p ∨ ¬q) → ¬r, ¬(p → q) are wff.

Did you notice?

The connectives ∨, ∧, →, and ↔ always connect two old formulae to create a new one. This
is why they are called ‘binary connectives’. The connective ¬ is used on a single old formula
to give a new one. So, it is called a ‘unary connective’.

Definition 6.1.8. [Truth function] Let A be the set of assignments to the variables p1 , . . . , pk . A function f : A → {T, F } is called a truth function. Since |A| = 2^k , there are 2^(2^k) such truth functions.

Example 6.1.9. The table on the left describes a truth function f and that on the right describes the truth table for a particular formula.

p  q  f        p  q  (p ∧ q) ∨ (p ∧ (¬q))
T  T  F        T  T           T
T  F  T        T  F           T
F  T  T        F  T           F
F  F  F        F  F           F

Exercise 6.1.10. 1. Draw a truth table for the formula p ∧ (¬p → (p ∨ ¬q)).

Ans:

p  q  ¬p  ¬q  p ∨ (¬q)  ¬p → (p ∨ (¬q))  p ∧ (¬p → (p ∨ (¬q)))
T  T  F   F      T             T                    T
T  F  F   T      T             T                    T
F  T  T   F      F             F                    F
F  F  T   T      T             T                    F
2. Can both the formulae p → q and q → p be false for some assignment on p and q?

Ans: No, as p → q is false means that p is assigned True and q is assigned False, whereas q → p is false means that q is assigned True and p is assigned False.

Definition 6.1.11. 1. [Contradiction and tautology] A contradiction (F ) is a formula

which takes truth value F under each assignment. For example, p ∧ ¬p. A tautology (T )
is a formula which takes truth value T under each assignment. For example, p ∨ ¬p.

2. [Equivalence of formulae] Two formulae f and g are said to be equivalent, denoted

f ≡ g, if they have the same truth table involving all the atomic variables of both f and g.
That is, if both f and g carry the same truth values under each assignment to the involved
atomic variables.
Example 6.1.12. 1. Is p → q ≡ ¬q → ¬p? Yes, because they have the same truth tables.

p q f = p → q g = ¬q → ¬p
T T T T
T F F F
F T T T
F F T T

2. Is p ≡ p ∧ (q ∨ (¬q))? Yes, because they have the same truth tables.

p q f = p g = p ∧ (q ∨ (¬q))
T T T T
T F T T
F T F F
F F F F
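Comparing truth tables row by row is easy to automate. The sketch below checks equivalence by evaluating both formulae under every assignment; the helper names `implies` and `equivalent` are illustrative, and formulae are modelled as Python functions of booleans.

```python
from itertools import product

def implies(a, b):
    """Truth value of a → b, defined as (¬a) ∨ b."""
    return (not a) or b

def equivalent(f, g, num_vars):
    """True when f and g agree under every assignment to num_vars variables."""
    return all(
        f(*values) == g(*values)
        for values in product([True, False], repeat=num_vars)
    )

# Example 6.1.12, part 1: p → q versus its contrapositive ¬q → ¬p.
print(equivalent(lambda p, q: implies(p, q), lambda p, q: implies(not q, not p), 2))
# Example 6.1.12, part 2: p versus p ∧ (q ∨ ¬q).
print(equivalent(lambda p, q: p, lambda p, q: p and (q or not q), 2))
```

The same helper shows that a conditional is not equivalent to its converse, since p → q and q → p differ on the assignments T F and F T.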

Remark 6.1.13. 1. There is another way to establish equivalence of two formulae f and g. We show that f has a truth value T (or F ) if and only if g has the same truth value. For example, to show that p → q ≡ ¬q → ¬p, proceed in the following way.

Step 1: Suppose that p → q has the truth value F for an assignment a. Then a(p) = T and a(q) = F . But then, under that assignment, ¬q is T and ¬p is F . That is, under a, ¬q → ¬p is F .

Step 2: Suppose that p → q has the truth value T for an assignment a. Then a ∈ {T T, F T, F F }. Under T T , we have ¬p is F and ¬q is F , so that ¬q → ¬p is T . Under F T , we have ¬p is T and ¬q is F , so that ¬q → ¬p is T . Under F F , we have ¬p is T and ¬q is T , so that ¬q → ¬p is T .

Thus, both are equivalent.

2. Let f (p1 , . . . , pk ) be a formula and q1 , . . . , qr be some new atomic variables. Then f ≡ f ∧ (q1 ∨ (¬q1 )) ∧ · · · ∧ (qr ∨ (¬qr )). This can be argued using induction. Thus f can be viewed as a formula involving atomic variables p1 , . . . , pk , q1 , . . . , qr .

3. We have seen that

(a) p → q ≡ ¬p ∨ q, and
(b) p ↔ q ≡ (p → q) ∧ (q → p).

Thus, the connectives ∨, ∧ and ¬ are enough for writing a formula in place of the 5 connectives ∨, ∧, ¬, → and ↔.

4. Recall that a formula on variables p, q and r determines a truth function. So there are exactly 2^(2^3) = 2^8 nonequivalent formulae on variables p, q and r.

Exercise 6.1.14. Is p ∨ ¬p ≡ q ∨ ¬q?

Ans: Yes, as
p q p ∨ ¬p q ∨ ¬q
T T T T
T F T T
F T T T
F F T T

Definition 6.1.15. [Substitution instance] Suppose B is a formula which involves some vari-
ables including p. Then, substituting a formula A for each appearance of the variable p in
B, gives us a new formula. This new formula is called a substitution instance of B. We
may substitute for more than one variable simultaneously. Note that A may involve old and new variables.

Example 6.1.16. Let B: (p → q) → p. We substitute p → ¬q for p, and p for q, in B to obtain the following substitution instance of B.

((p → ¬q) → p) → (p → ¬q)

The following result is one of the most fundamental results of the subject.

Theorem 6.1.17. Any substitution instance of a tautology is a tautology.

Proof. Let P (p1 , . . . , pk ) be a tautology. Suppose that we replace each occurrence of p1 by a formula f to obtain the formula R. Consider all the atomic variables involved in P and f . View P and R as formulae involving all these atomic variables. Let a be an assignment to these atomic variables.
If f takes the value T on a, then the value of R on a is nothing but the value of P (T, p2 , . . . , pk ) on a, which is T as P is a tautology.
If f takes the value F on a, then the value of R on a is nothing but the value of P (F, p2 , . . . , pk ) on a, which is T as P is a tautology. Thus, R takes the value T under each assignment.

Exercise 6.1.18. Show that any substitution instance of a contradiction is a contradiction.

Definition 6.1.19. [Functionally complete] A subset S of connectives is called function-

ally complete/adequate, if each formula has an equivalent formula written only using the
connectives in S.

Example 6.1.20. We already know that S = {∨, ∧, ¬} is adequate.

Exercise 6.1.21. 1. Determine which are adequate. (i) {¬, ∨} (ii) {→, ¬}.
2. Fill in the blanks to prove that ‘f ≡ g’ if and only if ‘f ↔ g is a tautology’.
Proof. Assume that f ≡ g. Let b be an assignment. Then, the values of f and g are ____ under b. Thus, the value of f ↔ g is ____ under b. As b is an ____ assignment, we see that f ↔ g is a ____.
Therefore, if f is T under b, then g is T under b. That is, f → g and g → f are both T under b. Thus, f ↔ g is T under the assignment b.

Conversely, suppose that f ↔ g is a ____. Assume that f ≢ g. Then, there is ____ under which ____ and ____ take different ____.

So, suppose that f takes T and g takes F under b. Then ____ is F under b and hence f ↔ g takes F under b, a contradiction. A similar contradiction is obtained if f takes F and g takes T under b.

The proof of the next result is left as an exercise for the readers.

Proposition 6.1.22. [Rules] If p, q, r are formulae, then

1. p ∨ q ≡ q ∨ p, p ∧ q ≡ q ∧ p (commutative)

2. p ∨ (q ∨ r) ≡ (p ∨ q) ∨ r, p ∧ (q ∧ r) ≡ (p ∧ q) ∧ r (associative)

3. p ∧ (q ∨ r) ≡ (p ∧ q) ∨ (p ∧ r), p ∨ (q ∧ r) ≡ (p ∨ q) ∧ (p ∨ r) (distributive)

4. ¬(p ∨ q) ≡ ¬p ∧ ¬q, ¬(p ∧ q) ≡ ¬p ∨ ¬q (De Morgan’s law)

5. p ∨ p ≡ p, p ∧ p ≡ p (idempotence)

6. F ∨ p ≡ p, F ∧ p ≡ F

7. T ∨ p ≡ T, T ∧ p ≡ p

8. ¬(¬p) ≡ p

9. p ∨ (p ∧ q) ≡ p, p ∧ (p ∨ q) ≡ p (absorption law)

Proof. The first six may be proved using direct arguments and the rest by using the first six.

Exercise 6.1.23. Does the absorption law imply p ∨ (p ∧ (¬q)) ≡ p and p ∧ (p ∨ (¬q)) ≡ p?

Discussion 6.1.24. The above rules can be used to simplify a formula or to show equivalence of formulae. For example,

p → (q → r) ≡ ¬p ∨ (¬q ∨ r)    as p → q ≡ (¬p) ∨ q
            ≡ ¬p ∨ ¬q ∨ r      Associativity
            ≡ ¬(p ∧ q) ∨ r     De Morgan’s law
            ≡ (p ∧ q) → r      as p → q ≡ (¬p) ∨ q

Did you notice?

There are 3 ways to prove f ≡ g.
1. Using truth table.
2. Arguing that f is false under an assignment (of the variables involved in both) if and
only if g is false under the same assignment.
3. Using some of the above rules and by reducing f to g or g to f .
Experiment
Consider the variables p, q, r.
Give a formula which takes value T only on the assignment T T T .
Give a formula which takes value T only on the assignment T T F . (p ∧ q ∧ (¬r))
Give a formula which takes value T only on the assignment F T F .
Give a formula which takes value T only on the assignments T T F and F T F .
Give a formula which takes value T only on the assignments T F T , T T F and T F F .
Give a formula f which takes value T only on the assignments F T F and F F F or whose
truth table is the following
p q r f
T T T F
T T F F
T F T F
T F F F
F T T F
F T F T
F F T F
F F F T
Lemma 6.1.25. Let f be a truth function involving the variables p1 , . . . , pk . Then, there is a formula g involving p1 , . . . , pk , whose truth table is described by f .

Proof. If T ∉ rng f , then write q = p1 ∧ ¬p1 ∧ p2 ∧ · · · ∧ pk . Otherwise, collect all those assignments b such that f (b) = T . Call this set A1 . For each b ∈ A1 , define a formula q = r1 ∧ r2 ∧ · · · ∧ rk , where for 1 ≤ j ≤ k,

rj = pj if b(pj ) = T , and rj = ¬pj otherwise.

Then, the formula q takes the value T only on the assignment b. Thus, taking the disjunction of all such q’s related to each b ∈ A1 , we get the required result.
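The construction in the proof is effectively an algorithm, and can be sketched as follows. The representation is an assumption made for illustration: a truth function is a Python function on tuples of booleans, and the resulting formula is returned as a string.

```python
from itertools import product

def dnf_from_truth_function(variables, f):
    """Build a DNF string whose truth table is the truth function f."""
    terms = []
    for b in product([True, False], repeat=len(variables)):
        if f(b):
            # r_j = p_j if b(p_j) = T, and ¬p_j otherwise.
            literals = [v if value else '¬' + v for v, value in zip(variables, b)]
            terms.append('(' + ' ∧ '.join(literals) + ')')
    if not terms:
        # T is not in the range of f: return the contradiction p1 ∧ ¬p1 ∧ p2 ∧ ... ∧ pk.
        first = variables[0]
        return ' ∧ '.join([first, '¬' + first] + list(variables[1:]))
    return ' ∨ '.join(terms)

# The truth function that is T exactly on the assignments TT and TF.
print(dnf_from_truth_function(['p', 'q'], lambda b: b[0]))
```

Each term is true on exactly one assignment, so the disjunction is true exactly where f is.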

Exercise 6.1.26. Illustrate Lemma 6.1.25 with the truth function f

p q f
T T T
T F T
F T F
F F F

Ans: We have A1 = {a1 = T T, a2 = T F }. For a1 = T T , we have q1 = p ∧ q and for a2 = T F

we have q2 = p ∧ ¬q. So the formula with the above truth table is (p ∧ q) ∨ (p ∧ ¬q).

Definition 6.1.27. [Normal forms] An atomic formula or its negation is called a literal. We say that a formula f is in disjunctive normal form (in short, DNF) if it is expressed as a disjunction of conjunctions of literals. We say that a formula f is in conjunctive normal form (in short, CNF) if it is expressed as a conjunction of disjunctions of literals.

Example 6.1.28. p, p ∨ q, p ∨ ¬q, (p ∧ ¬q) ∨ ¬r, (p ∧ ¬q) ∨ (q ∧ ¬r) ∨ (r ∧ s) are in DNF. Write
5 formulae in CNF involving p, q, r.

Theorem 6.1.29. Any formula is equivalent to a formula in DNF. Similarly, any formula is equivalent to a formula in CNF.

Proof. The proof of the first assertion follows from Lemma 6.1.25. For the second assertion, we can write a proof in a similar way.
An alternate proof: take f , consider ¬f , get a DNF P for ¬f , and consider ¬P . By De Morgan’s laws, ¬P is equivalent to a formula in CNF, and ¬P ≡ f .
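The alternate proof can be turned into a short procedure: each assignment on which f is F gives one conjunction in a DNF of ¬f, and negating that conjunction gives one clause of a CNF of f. The sketch below follows this idea; as before, the string output and the tuple-based truth function are illustrative assumptions.

```python
from itertools import product

def cnf_from_truth_function(variables, f):
    """Build a CNF string for f by negating a DNF of ¬f, clause by clause."""
    clauses = []
    for b in product([True, False], repeat=len(variables)):
        if not f(b):
            # The DNF term of ¬f for b uses p_j when b(p_j) = T and ¬p_j otherwise;
            # by De Morgan's law its negation is the disjunction of the opposite literals.
            literals = ['¬' + v if value else v for v, value in zip(variables, b)]
            clauses.append('(' + ' ∨ '.join(literals) + ')')
    if not clauses:
        # f is a tautology: a single clause p1 ∨ ¬p1 suffices.
        first = variables[0]
        return first + ' ∨ ¬' + first
    return ' ∧ '.join(clauses)

print(cnf_from_truth_function(['p', 'q'], lambda b: b[0]))
```

For the truth function of Exercise 6.1.26 this yields (p ∨ ¬q) ∧ (p ∨ q), which is equivalent to p.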

Exercise 6.1.30. Write all the truth functions on two variables and write formulae for them.

Definition 6.1.31. [Principal connectives] Let h be a formula. A principal connective in

h is defined in the following way.
1. If h is expressed in a format ¬f , then ¬ is the principal connective of h.
2. If h is expressed in a format f ∨ g, then ∨ is the principal connective of h.
3. If h is expressed in a format f ∧ g, then ∧ is the principal connective of h.

Exercise 6.1.32. Use induction on the number of connectives to show that any formula is
equivalent to a formula in DNF and a formula in CNF.

Definition 6.1.33. [Dual] The dual P ∗ of a formula P involving the connectives ∨, ∧, ¬ is

obtained by interchanging ∨ with ∧ and the special variable T with the special variable F.

Example 6.1.34. Note that the dual of ¬(p ∨ q) ∧ r is ¬(p ∧ q) ∨ r.

Lemma 6.1.35. Let A(p1 , . . . , pk ) be a formula in the atomic variables pi involving connectives
∨, ∧ and ¬. If A(¬p1 , . . . , ¬pk ) is obtained by replacing pi with ¬pi in A, then A(¬p1 , . . . , ¬pk ) ≡
¬A∗ (p1 , . . . , pk ).

Proof. Use induction on the number of connectives. If A = B ∨ C, then

A∗ = B ∗ ∧ C ∗ ≡ ¬B(¬p1 , . . . , ¬pk ) ∧ ¬C(¬p1 , . . . , ¬pk )

≡ ¬(B ∨ C)(¬p1 , . . . , ¬pk ) = ¬A(¬p1 , . . . , ¬pk ).

The remaining parts are similar and hence left for the reader.

Theorem 6.1.36. Let f, g be formulae using connectives ∨, ∧ and ¬. If f ≡ g, then f ∗ ≡ g ∗ .

Proof. By Lemma 6.1.35, we note that

f ∗ (¬b) = ¬f (b) = ¬g(b) = g ∗ (¬b) for any assignment b.

Thus, f ∗ ≡ g ∗ .
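
Lemma 6.1.35 can be checked mechanically on small formulae. The sketch below is only an illustration under stated assumptions: formulae are encoded as nested tuples such as `("or", A, B)`, and the helper names `dual`, `evaluate` and `lemma_holds` are hypothetical, introduced just for this example.

```python
from itertools import product

def dual(a):
    """Interchange ∨ with ∧ and T with F (Definition 6.1.33)."""
    op = a[0]
    if op == "var":
        return a
    if op == "const":
        return ("const", not a[1])
    if op == "not":
        return ("not", dual(a[1]))
    swapped = "and" if op == "or" else "or"
    return (swapped, dual(a[1]), dual(a[2]))

def evaluate(a, b):
    """Truth value of formula a under the assignment b (a dict from names to bools)."""
    op = a[0]
    if op == "var":
        return b[a[1]]
    if op == "const":
        return a[1]
    if op == "not":
        return not evaluate(a[1], b)
    left, right = evaluate(a[1], b), evaluate(a[2], b)
    return (left and right) if op == "and" else (left or right)

def lemma_holds(a, names):
    """Check A(¬p1, ..., ¬pk) ≡ ¬A*(p1, ..., pk) on every assignment."""
    for values in product([True, False], repeat=len(names)):
        b = dict(zip(names, values))
        negated = {name: not value for name, value in b.items()}
        if evaluate(a, negated) != (not evaluate(dual(a), b)):
            return False
    return True

p, q, r = ("var", "p"), ("var", "q"), ("var", "r")
A = ("and", ("not", ("or", p, q)), r)            # ¬(p ∨ q) ∧ r, as in Example 6.1.34
print(dual(A) == ("or", ("not", ("and", p, q)), r))  # True
print(lemma_holds(A, ["p", "q", "r"]))               # True
```

A brute-force check of this kind is not a proof, but it is a quick way to catch a wrongly computed dual.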
Discussion 6.1.37. [Tree representation] A formula can be represented by a tree. For exam-
ple, (r ∨ q) → (¬q ∧ p) has the following representation.

            →
          /   \
        ∨       ∧
       / \     / \
      r   q   ¬   p
              |
              q

Definition 6.1.38. [Polish notation] A formula may be expressed using Polish notation. It
is defined inductively as follows.

‘Let P (f ) denote the Polish notation of f . Then P (f ∨ g) is ∨P (f )P (g), P (f ∧ g) is ∧P (f )P (g),
and P (¬f ) is ¬P (f ).’ The connectives → and ↔ are treated in the same way as ∨ and ∧.

This notation does not use brackets. Here the connectives are written in front of the expressions
they connect. Advantage: it takes less space for storage. Disadvantage: it looks complicated.

Example 6.1.39. In Polish notation (r ∨ q) → (¬q ∧ p) becomes → ∨rq ∧ ¬qp.
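
The inductive definition of Polish notation translates directly into a recursive function on the tree of a formula. In this sketch the tuple encoding of formulae and the names `SYMBOL` and `polish` are assumptions made for illustration only.

```python
# A formula is either a variable name (a string) or a tuple (connective, subformula, ...).
SYMBOL = {"not": "¬", "and": "∧", "or": "∨", "imp": "→"}

def polish(f):
    """Write the connective first, followed by the Polish notation of its operands."""
    if isinstance(f, str):
        return f
    op, *args = f
    return SYMBOL[op] + "".join(polish(a) for a in args)

# (r ∨ q) → (¬q ∧ p), the formula of Example 6.1.39
formula = ("imp", ("or", "r", "q"), ("and", ("not", "q"), "p"))
print(polish(formula))  # →∨rq∧¬qp
```

The recursion visits the root of the tree first and then its subtrees from left to right, which is why no brackets are needed.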

Exercise 6.1.40. Write a formula involving 8 connectives and the variables p, q, r. Draw its
tree. Write its Polish notation.

Definition 6.1.41. 1. [Satisfiable] A formula is satisfiable if it is not a contradiction.

2. [Order of operations] To reduce the use of brackets, we fix the order of operations: ¬,
∧, ∨, →, ↔.

Discussion 6.1.42. There is another way of making a truth table for a formula. Consider
(p ∨ q) ∨ ¬r. Draw a table like the following and give the truth values to the atomic formulae.
Evaluate the connectives for the subformulae one by one. In this example, the sequence of
column operations is: 5, 2, 4.

Table with only the atomic formulae filled in:

(p  ∨  q)  ∨  ¬  r
 T     T         T
 T     T         F
 T     F         T
 T     F         F
 F     T         T
 F     T         F
 F     F         T
 F     F         F

Table after evaluating columns 5, 2 and 4:

(p  ∨  q)  ∨  ¬  r
 T  T  T   T  F  T
 T  T  T   T  T  F
 T  T  F   T  F  T
 T  T  F   T  T  F
 F  T  T   T  F  T
 F  T  T   T  T  F
 F  F  F   F  F  T
 F  F  F   T  T  F

Definition 6.1.43. [Inference] We say g is a logical conclusion of {f1 , · · · , fn } if (f1 ∧
f2 ∧ · · · ∧ fn ) → g is a tautology. We denote this by {f1 , . . . , fn } ⇒ g. At times, we write
f1 , . . . , fn ⇒ g to mean {f1 , . . . , fn } ⇒ g. Here, g is called the conclusion and {f1 , . . . , fn } is

called the hypothesis/premise.
Example 6.1.44. 1. Consider the following three statements.
A : if x = 4, then discrete math is bad;
B : discrete math is bad;
C : x = 4.

Does C logically follow from A, B?

Ans: No. Denote ‘x = 4’ by p and ‘discrete mathematics is bad’ by q. Then, the
above question is the same as asking whether {p → q, q} ⇒ p is true. That is, whether
P (p, q) := ((p → q) ∧ q) → p is a tautology.
To find that, suppose that there is an assignment for which P takes the value F . So, for
that assignment, p must be F and (p → q) ∧ q must be true.
As (p → q) ∧ q is true, q must be T . So, the assignment must be F T . Notice that p → q
has a value T with this assignment. Thus, P (p, q) takes F under F T . Hence, it is not a
tautology. So, C does not logically follow from A, B.
2. Consider the following three statements.
A : ‘if discrete math is bad, then x = 4’;
B : ‘discrete math is bad’;
C : ‘x = 4’.
Does C logically follow from A, B?

Ans: Yes. Denote ‘x = 4’ by p and ‘discrete mathematics is bad’ by q. Then, the

above question is the same as asking whether {q → p, q} ⇒ p is true. That is, whether
P (p, q) := ((q → p) ∧ q) → p is a tautology.
To find that, suppose that there is an assignment for which P takes the value F . So, for
that assignment, p must be F and (q → p) ∧ q must be true.
As (q → p) ∧ q is true, q must be T and q → p must be T . So, the assignment must be
F T . But we see that q → p has a value F with this assignment. This is a contradiction.
Thus, there is no assignment for which P (p, q) takes F . Hence, it is a tautology. So C
logically follows from A, B.
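
Both parts of Example 6.1.44 can also be settled by listing all assignments. The following sketch assumes the helper names `implies`, `is_tautology` and `counterexamples`, which are not from the notes; the variable order is (p, q).

```python
from itertools import product

def implies(a, b):
    """Truth value of a → b."""
    return (not a) or b

def is_tautology(formula, n):
    """True if the n-variable formula is T under every assignment."""
    return all(formula(*values) for values in product([True, False], repeat=n))

def counterexamples(formula, n):
    """Assignments under which the formula takes the value F."""
    return [v for v in product([True, False], repeat=n) if not formula(*v)]

first = lambda p, q: implies(implies(p, q) and q, p)    # ((p → q) ∧ q) → p
second = lambda p, q: implies(implies(q, p) and q, p)   # ((q → p) ∧ q) → p

print(is_tautology(first, 2), counterexamples(first, 2))  # False [(False, True)]
print(is_tautology(second, 2))                            # True
```

The single counterexample for the first formula is the assignment F T found in the example.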
Definition 6.1.45. We write f ⇔ g to mean ‘f ⇒ g and g ⇒ f ’.

Did you notice?

Let f, g, h be some formulae. Then f, g ⇒ h means that ‘whenever f and g are T , h is also
T ’. That is, ‘if f and g are T under an assignment, then h is T under that assignment’.
Thus ‘f ⇔ g’ is the same as ‘f ≡ g’.
Example 6.1.46. 1. Show that {α → β, β → γ, γ → δ} ⇒ α → δ.
Ans: Suppose α → δ is F . Then α is T and δ is F . Assume that all the propositions in
the hypothesis are true. As δ is F and γ → δ is T , γ must be F . Then, as β → γ is T , β must
be F , and as α → β is T , α must be F , a contradiction.

2. Determine validity of the argument.

The meeting can take place if all members are informed in advance and there is quorum
(a minimum number of members are present). There is a quorum if at least 15 members
are present. Members would have been informed in advance if there was no postal strike.
Therefore, if the meeting was canceled, then either there were fewer than 15 members
present or there was a postal strike.
A: Let us denote the different statements with symbols, say
m: the meeting takes place;
a: all members are informed;
f : at least fifteen members are present;
q: the meeting had quorum;
p: there was a postal strike.

So, we reformulate the problem: is {(q ∧ a) → m, f → q, ¬p → a} ⇒ ¬m → (¬f ∨ p)?
From the first two statements, we get (f ∧ a) → m. Using the third statement, we get
(f ∧ ¬p) → m. The conclusion is the contrapositive of this statement.

Alternate. Suppose that the argument is invalid. Then, under some assignment, ¬m → (¬f ∨ p)
takes the value F while (q ∧ a) → m, f → q and ¬p → a all take the value T .
The first condition implies that ¬f ∨ p takes the value F and ¬m takes the value T . Hence, we
see that the variables m, f and p take values F, T and F , respectively.
The second condition says that all the three expressions (q ∧ a) → m, f → q, and ¬p → a
take the value T . Since f → q takes the value T and f has the value T , we see that q has to
take the value T . Similarly, since ¬p → a takes the value T and p has the value F , we see that a
has to take the value T . Now, (q ∧ a) → m takes the value T with both q and a being T . So, m
must have the value T , contradicting the value F taken by m in the previous paragraph.
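
The validity of the meeting argument can be confirmed by checking all 32 assignments. This is a sketch under assumptions: the helper `follows` and the variable order (m, a, f, q, p) are chosen for this example only.

```python
from itertools import product

def implies(a, b):
    """Truth value of a → b."""
    return (not a) or b

def follows(premises, conclusion, n):
    """True if the conclusion is T under every assignment that makes all premises T."""
    for v in product([True, False], repeat=n):
        if all(h(*v) for h in premises) and not conclusion(*v):
            return False
    return True

# Variable order: m, a, f, q, p
premises = [
    lambda m, a, f, q, p: implies(q and a, m),   # (q ∧ a) → m
    lambda m, a, f, q, p: implies(f, q),         # f → q
    lambda m, a, f, q, p: implies(not p, a),     # ¬p → a
]
conclusion = lambda m, a, f, q, p: implies(not m, (not f) or p)  # ¬m → (¬f ∨ p)

print(follows(premises, conclusion, 5))  # True
```

Dropping a premise makes the check fail; for instance, the conclusion does not follow from the first premise alone.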
Exercise 6.1.47. 1. List all the nonequivalent formulae involving variables p and q which
take truth value T on exactly half of the assignments.
Ans: p, ¬p, q, ¬q, (p ∧ q) ∨ (¬p ∧ ¬q), ¬((p ∧ q) ∨ (¬p ∧ ¬q)).
2. We assume F ≤ T . Let f and g be two truth functions on the variables p1 , . . . , p9 . Suppose
that for each assignment a, we have f (a) ≤ g(a). Does this imply ‘f → g is a tautology’ ?
Ans: Yes. Let a be any assignment. We want to show that f → g takes value T under
a. Suppose it does not. Then f must be T under a and g must be F under a. But then
f (a) ≰ g(a), a contradiction.
3. Let f and g be two formulae involving the variables p1 , . . . , pk . Prove that ‘f ≡ g’ (the
same truth table) if and only if ‘f ↔ g is a tautology’.
Ans: ⇒. Assume that f ≡ g. Let a be an assignment. Then, the value of f and g are the
same under a. Thus, the value of f → g is T under a and the value of g → f is T under a.
That is, the value of f ↔ g is T under a. As a is an arbitrary assignment, we see that f ↔ g
is a tautology.
⇐. Suppose that f ↔ g is a tautology. Assume that f ≢ g. Then, there is an assignment a
under which f and g take different truth values.
Suppose that f takes T and g takes F under a. Then f → g is F under a and hence f ↔ g
takes F under a, a contradiction. A similar contradiction is obtained if f takes F and g takes
T under a.

4. Without using →, write an equivalent simplified statement of (p → q) → (p → (q → r)).

Ans: (p → q) → (p → (q → r)) ≡ ¬(p → q) ∨ (p → (q → r)) ≡ ¬(¬p ∨ q) ∨ (¬p ∨ (¬q ∨ r)) ≡
(p ∧ ¬q) ∨ (¬p ∨ ¬q ∨ r) ≡ (p ∨ (¬p ∨ ¬q ∨ r)) ∧ (¬q ∨ (¬p ∨ ¬q ∨ r)) ≡ (¬q ∨ ¬p ∨ ¬q ∨ r) ≡
¬p ∨ ¬q ∨ r.
5. Determine which of the following are logically equivalent.

(a) (p → (r ∨ s)) ∧ ((q ∧ r) → s).
(b) ((p ∨ r) ∨ (s → p)) ∧ (p → (s → r)).
(c) q → s.
(d) (s → (q ∨ r)) ∧ ((q ∧ s) → r).
(e) ((p ∨ s) ∨ (q → p)) ∧ (p → (q → s)).

6. Let p be a formula written only using connectives ∧, ∨ and → and involving the atomic
variables p1 , · · · , pk , for some k. Show that the truth value of p is T under the assignment
f (pi ) = T , for all i.
Ans: Follows by induction on the number n of connectives used. For n = 0, the formula is an
atomic variable and there is nothing to prove. Assume that the statement holds for all formulae
with fewer than n connectives. Let p be a formula
involving n connectives. Then p can be viewed as q[∗]r, where q, r are formulae and [∗] is a
principal connective in p. By the induction hypothesis, q, r are true under f . Thus q ∧ r, q ∨ r, and
q → r are all true under f .
7. Is {→, ∨, ∧} adequate?
Ans: No. Any formula r written only using ∨, →, ∧ takes truth value T under the assignment
where all atomic formulae are true. Thus we cannot get an equivalent formula for F only using
these three connectives.
8. Verify the following assertions.
(a) P ∧ Q ⇒ P
(b) P ⇒ P ∨ Q
(c) ¬P ⇒ P → Q
(d) ¬(P → Q) ⇒ P
(e) ¬P, P ∨ Q ⇒ Q
(f ) P, P → Q ⇒ Q
(g) ¬Q, P → Q ⇒ ¬P
(h) P → Q, Q → R ⇒ P → R
(i) P ∨ Q, P → R, Q → R ⇒ R
(j) P ↔ Q ⇔ (P ∧ Q) ∨ (¬P ∧ ¬Q)
(k) {p ∧ q, p ∨ q} ⇒ q → r
(l) {p → q, ¬p} ⇒ ¬q
(m) {p0 → p1 , p1 → p2 , . . . , p9 → p10 } ⇒ p0 ∨ p5 .

(n) (¬p ∨ q) → r, s ∨ ¬q, ¬t, p → t, (¬p ∧ r) → ¬s ⇒ ¬q.

(o) p → q, r ∨ s, ¬s → ¬t, ¬q ∨ s, ¬s, (¬p ∧ r) → u, w ∨ t ⇒ u ∧ w.

9. If H is a set of formulae, then H ⇒ α → β if and only if H ∪ {α} ⇒ β.

10. Prove the equivalence of the following in three different ways (truth table, simplification,
each is a logical consequence of the other): p → (q ∨ r) ≡ (p ∧ ¬q) → r.

11. Determine which of the following conclusions are correct.

(a) If the lecture proceeds, then either black board is used or the slides are shown or the
tablet pc is used. If the black board is used, then students at the back bench are not
comfortable in reading the black board. If the slides are shown, then students are
not comfortable with the speed. If the tablet pc is used, then it causes lots of small
irritating disturbances to the instructor. The lecture proceeds and the students are
comfortable. So, it is deduced that the instructor faces disturbances.
Ans: Let the variables be L: lecture proceeds, B: black board used, S: slides shown,
T : tablet pc used, C: students are comfortable, and D: disturbances occur.
Then, we need to answer whether {L → (B ∨ S ∨ T ), B → ¬C, S → ¬C, T → D} ⇒
(L ∧ C) → D.
Suppose the assertion is FALSE. Then, under some assignment, all the premises are TRUE,
D is FALSE and L ∧ C is TRUE. Hence, L and C are both TRUE.
As the formulae B → ¬C and S → ¬C are TRUE and C is TRUE, that is, ¬C is FALSE,
we see that both B and S are FALSE. Thus, the formula
L → (B ∨ S ∨ T ) being TRUE and L being TRUE imply that T must be
TRUE. Now, as T is TRUE and the formula T → D is TRUE, we get that D is TRUE,
contradicting that D is FALSE. Hence, the argument is valid.
(b) There are three persons Mr X, Mr Y and Mr Z making statements. If Mr X is wrong,
then Mr Y is right. If Mr Y is wrong, then Mr Z is right. If Mr Z is wrong, then Mr
X is right. Therefore, some two of them are always right.
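Ans: Let x, y, z be the variables denoting the events Mr X is right, Mr Y is right, and
Mr Z is right, respectively. Formal expression: whether {¬x → y, ¬y → z, ¬z → x} ⇒
(x ∧ y) ∨ (y ∧ z) ∨ (z ∧ x)? Notice that, if any two of x, y, z are false, say, x and y, then the formula
¬x → y is false. Similarly, ¬y → z is false if y and z are false, and ¬z → x is false if z and x
are false. So, whenever all the premises are true, at most one of x, y, z is false. Hence the
argument is valid.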

12. Consider the set S of all nonequivalent formulae written using two atomic variables p and
q. For f, g ∈ S, define f ≤ g if f ⇒ g. Prove that this is a partial order on S. Draw its
Hasse diagram.
13. Consider the set S of all nonequivalent formulae written using three atomic variables p, q, r.
For f, g ∈ S define f ≤ g if f ⇒ g. Let f1 and g1 be two formulae having the truth tables

p  q  r  f1  g1
T  T  T  T   T
T  T  F  F   F
T  F  T  T   T
T  F  F  T   F
F  T  T  F   T
F  T  F  T   F
F  F  T  F   T
F  F  F  F   F

How many nonequivalent formulae h are there such that {f1 , g1 } ⇒ h?

Ans: f1 ∧ g1 is true only on T T T and T F T . Any formula which is true on these two
assignments is implied by f1 and g1 , and its values on the remaining 6 assignments are
arbitrary. So there are 2^6 = 64 such formulae.
14. How many assignments of truth values to p, q, r and w are there for which ((p → q) → r) → w
is true? Guess a formula in terms of the number of variables.
Ans: 11 (the formula is false on exactly 5 of the 16 assignments). For n variables, a formula
is (2^(n+1) − (−1)^(n+1))/3.
15. Check the validity of the argument. If discrete math is bad, then computer programming
is bad. If linear algebra is good, then discrete math is good. If complex analysis is good,
then discrete math is bad. If computer programming is good, then linear algebra is bad.
Complex analysis is bad and hence, at least one more subject is bad.

Ans: Let DM mean ‘discrete math is good’, LA mean ‘linear algebra is good’, CA mean
‘complex analysis is good’, CP mean ‘computer programming is good’.

Formulation:

{¬DM → ¬CP, LA → DM, CA → ¬DM, CP → ¬LA, ¬CA} ⇒ (¬DM ∨ ¬LA ∨ ¬CP ).

Suppose that the Right hand side takes F under some assignment a. Hence, under a, DM
must be T , LA must be T and CP must be T . But then, the fourth statement of the premise
‘CP → ¬LA’ becomes F . Hence, this is a valid conclusion.
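
The counting question in item 14 can be explored by brute force. The sketch below assumes the left-nested reading ((p1 → p2) → p3) → · · · → pn of the formula, which is an interpretation of the lost brackets; the names `count_true` and `closed_form` are introduced only for this example.

```python
from itertools import product

def implies(a, b):
    """Truth value of a → b."""
    return (not a) or b

def count_true(n):
    """Number of assignments making ((p1 → p2) → p3) → ... → pn true."""
    count = 0
    for values in product([True, False], repeat=n):
        result = values[0]
        for v in values[1:]:
            result = implies(result, v)
        count += result
    return count

def closed_form(n):
    """The guessed formula (2^(n+1) − (−1)^(n+1)) / 3."""
    return (2 ** (n + 1) - (-1) ** (n + 1)) // 3

for n in range(1, 8):
    print(n, count_true(n), closed_form(n))
```

Under this reading, four variables give 11 true assignments and 5 false ones, and the two columns printed by the loop agree for every n tried.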

6.2 Predicate Logic∗

Definition 6.2.1. [Predicate] A k-place predicate or propositional function p(x1 , . . . , xk )
is a statement involving the variables x1 , . . . , xk . A truth value can be assigned to a predicate
p(x1 , · · · , xk ) for each assignment of x1 , . . . , xk from their respective universe of discourses
(in short, UD) (the set of values that xi ’s can take is the i-th UD).

Example 6.2.2. Let p(x) mean ‘x > 0’. Then p(x) is a 1-place predicate on some UD. Let
p(x, y) mean ‘x² + y² = 1’. Then p(x, y) is a 2-place predicate on some UD.

Definition 6.2.3. [Quantifiers] We call the symbols ∀ and ∃, the quantifiers. Formulae
involving them are called quantified formulae. The statement ∀x p(x) is true if for each x (in
the UD) the property p(x) is T . The statement ∃x p(x) is T if p(x) is T for some x in the UD.

Example 6.2.4. Let UD be the set of all human beings. Consider the 2-place predicate F (x, y):
‘x runs faster than y’. Then
1. ∀x ∀y F (x, y) means ‘each human being runs faster than every human being’.
2. ∀x ∃y F (x, y) means ‘for each human being there is a human being who runs slower’.
3. ∃x ∃y F (x, y) means ‘there is a human being who runs faster than some human being’.
4. ∃x ∀y F (x, y) means ‘there is a human being who runs faster than every human being’.
Definition 6.2.5. 1. [Scope of quantifier] In the quantified formulae ∀x p(x) or ∃x p(x)
the formula p(x) is called the scope of the quantifier (extent to which that quantification
applies).
2. An x-bound part in a formula is a part of the form ∃x p(x) or ∀x q(x). Any occurrence
of x in an x-bound part of the formula is a bound occurrence of x. Any other occurrence
of x is a free occurrence of x.

Example 6.2.6. In ∃x p(x, y) the occurrence of y is free and both the occurrences of x are
bound. In ∀y ∃x p(x, y) all the occurrences of x and y are bound.
Definition 6.2.7. 1. A quantified formula is well formed (in short, a wff) if it is created using
the following rules.
(a) Any atomic formula (of the form P , P (x, y), P (x, b, y)) is a wff.
(b) If A and B are wffs, then A ∨ B, A ∧ B, A → B, A ↔ B, and ¬A are wffs.
(c) If A is a wff and x is any variable, then ∀x A and ∃x A are wffs.

2. Let f be a formula. An interpretation (for f ) means the process of specifying the UD,
specifications of the predicates, and assigning values to the free variables from the UD. By
an interpretation of f , we mean the formula f under a given interpretation.

Example 6.2.8. Consider the wff ∀x p(x, y).

1. Take N as UD. Let p(x, y) specify ‘x > y’. Let us assign 1 to the free variable y. Then, we
get the interpretation ‘each natural number is greater than 1’ which has the truth value
F.
2. Take N as UD. Let p(x, y) mean ‘x + y is an integer’, and take y = 2. Then, we get an
interpretation ‘when we add 2 to each natural number we get an integer’ which has a truth
value T .

Discussion 6.2.9. [Translation] We expect to see that ‘our developments on logic’ help us
in drawing appropriate conclusions. In order to do that, we must know how to translate an
‘English statement’ into a ‘formal logical statement’ that involves no English words. We may
have to introduce appropriate variables and required predicates. We may have to specify the
UD, but normally we use the most general UD.
Example 6.2.10. 1. Translate: ‘each person in this class room is either a BTech student or
an MSc student’.
A: Does the statement guarantee that there is a person in the room?
No. All it says is, if there is a person, then that person has certain properties. Let P (x) mean ‘x is
a person in this class room’; B(x) mean ‘x is a BTech student’; and M (x) mean ‘x is an
MSc student’. Then, the formal expression is ∀x (P (x) → B(x) ∨ M (x)).
2. Translate: ‘there is a student in this class room who speaks Hindi or English’.
A: Does the statement guarantee that there is a student in the room?
Yes. Let S(x) mean ‘x is a student in this class room’; H(x) mean ‘x speaks Hindi’; and
E(x) mean ‘x speaks English’. Then, the formal expression is ∃x (S(x) ∧ (H(x) ∨ E(x))).
Note that ∃x (S(x) → (H(x) ∨ E(x))) is not the correct expression. Why?

Remember
∃x (S(x) → T (x)) never asserts S(x) BUT ∃x (S(x) ∧ T (x)) asserts both S(x) and T (x).

Practice 6.2.11. Translate into formal logic.

1. Every natural number is either the square of a natural number or its square root is irrational.
Ans: Let N (x) mean ‘x is a natural number’; S(x) mean ‘x is the square of a natural
number’; I(x) mean ‘square root of x is irrational’. Then our formal expression is
∀x (N (x) → S(x) ∨ I(x)).

2. For every real number x there is a real number y such that x + y = 0.
Ans: Let r(x) mean ‘x is a real number’ and p(x, y) mean ‘x + y = 0’. Then our formal
expression is ∀x (r(x) → ∃y (r(y) ∧ p(x, y))) or ∀x∃y (r(x) → r(y) ∧ p(x, y)).
3. A subset S ⊆ R^n is called compact, if ‘—write the formal statement here—’.
4. A function f : R → R is called continuous at a point a, if ‘—write the formal statement
here—’.
5. A function f : R → R is called continuous, if ‘—write the formal statement here—’.
6. A function f : R → R is called uniformly continuous, if ‘—write the formal statement
here—’.
7. A subset S ⊆ R^n is called connected, if ‘—write the formal statement here—’.
8. A set S is called a group, if ‘—write the formal statement here—’.
9. A subset S ⊆ R^n is called a subspace, if ‘—write the formal statement here—’.
10. A function f : S → T is called a bijection, if ‘—write the formal statement here—’.
11. A function f : R^n → R^k is called a linear transformation, if ‘—write the formal statement
here—’.
12. A function f : (S, ◦) → (T, +) is called a group isomorphism, if ‘—write the formal
statement here—’.
13. A function f : V → W is called a vector space isomorphism, if ‘—write the formal
statement here—’.

Definition 6.2.12. A quantified formula is called valid if every interpretation of it has truth
value T . Two quantified formulae A and B are called equivalent (A ≡ B) if A ↔ B is valid.
Example 6.2.13. 1. ∀x P (x) ∨ ∃x ¬P (x) is valid.
2. Is ∃x ∃y p(x, y) ≡ ∃y ∃x p(x, y)?
A: Yes. Denote ∃x ∃y p(x, y) by L and ∃y ∃x p(x, y) by R. Suppose that L → R is F . This
means, we have an interpretation in which L is T and R is F . As R is F , we see that
p(x, y) is F , for each x, y in the UD. In that case, L is F , a contradiction. So, L → R is
T . Similarly, R → L is T .
3. ∀x ∀y p(x, y) ≡ ∀y ∀x p(x, y). !!
4. ∃x ∀y p(x, y) ≢ ∀y ∃x p(x, y). To see this, take UD = N and p(x, y): x > y. Then the
right-hand side is T but the left-hand side is F .

Did you notice?

Two quantified formulae A and B are equivalent if and only if their interpretations under
‘the same UD, the same specification of predicates, and the same values to the free variables’
have the same truth value.

5. Is ∀x (r(x) → ∃y (r(y) ∧ p(x, y))) ≡ ∀x∃y (r(x) → r(y) ∧ p(x, y))?
A: We want to know if

[∀x (r(x) → ∃y (r(y) ∧ p(x, y)))] ↔ [∀x∃y (r(x) → r(y) ∧ p(x, y))]

is valid. Let us see whether

[∀x (r(x) → ∃y (r(y) ∧ p(x, y)))] → [∀x∃y (r(x) → r(y) ∧ p(x, y))]

is valid. Suppose that this is invalid. So there is an interpretation such that the right-hand
side is F and the left-hand side is T. As the right-hand side is F, there is an x, say x0 , for
which ∃y (r(x0 ) → r(y) ∧ p(x0 , y)) is F. That is, for each y the formula r(x0 ) → r(y) ∧ p(x0 , y)
is F. That is, r(x0 ) is T and for each y we see that r(y) ∧ p(x0 , y) is F. That is, r(x0 ) is T
and ∃y (r(y) ∧ p(x0 , y)) is F. That is, the formula r(x0 ) → ∃y (r(y) ∧ p(x0 , y)) is F. That is,
∀x (r(x) → ∃y (r(y) ∧ p(x, y))) is F, a contradiction. The other part is an exercise.

Alternate. Take A := r(x) → ∃y (r(y) ∧ p(x, y)) and B := ∃y (r(x) → r(y) ∧ p(x, y)).
Consider an x0 in the UD. If r(x0 ) is F, then A and B both have value T. If r(x0 ) is T,
then notice that r(x0 ) → ∃y (r(y) ∧ p(x0 , y)) and ∃y (r(x0 ) → r(y) ∧ p(x0 , y)) have the
same truth value.
Thus A ≡ B. Hence ∀xA ≡ ∀xB.
6. Any student who appears in the exam and gets a score below 30, gets an F grade. Mr x0
is a student who has not written the exam. Therefore, x0 should get an F grade. Do you
agree?
A: Let S(x) mean ‘x is a student’, E(x) mean ‘x writes the exam’, B(x) mean ‘x gets a
score below 30’, and F (x) mean ‘x gets F grade’.
We want to see whether {∀x [S(x) ∧ E(x) ∧ B(x) → F (x)], S(x0 ) ∧ ¬E(x0 )} ⇒ F (x0 )?
Take the following interpretation: S(x) is ‘x is a positive real number’, E(x) is ‘x is a
rational number’, B(x) is ‘x is an integer’, F (x) is ‘x is a natural number’, and x0 = √2.
In this interpretation, statements in the premise mean ‘every positive integer is a natural
number’ and ‘√2 is a positive real number which is not rational’. They both are true.
Whereas the conclusion means ‘√2 is a natural number’ which is false. So, the argument
is incorrect.
7. Translate the following into formal statements.
‘All scientists are human beings. Therefore, all children of scientists are children of human
beings.’
A: Let Sx : ‘x is a scientist’; Hx : ‘x is a human being’ and Cxy : ‘x is a child of y’.
Let the hypothesis be ∀x(Sx → Hx). Then, the possible translations of the conclusion are
the following.
(a) ∀x(∃y(Sy ∧ Cxy) → ∃z(Hz ∧ Cxz)). It means ‘for each x, if x has a scientist father,
then x has a human father’.
(b) ∀x[∀y(Sy ∧ Cxy) → ∀z(Hz ∧ Cxz)]. This is wrong, as the statement means ‘for all x,
if x is a common child of all scientists, then x is a common child of all human beings’.
(c) . . . each child of x has a human father’.
(d) ∀x∀y(Sx ∧ Cyx) → ∀x∀y(Hx ∧ Cyx). What? This means ‘if each x is a scientist and
each y is a child of x (including x itself!), then each x is a human being and each y
is a child of x’.
Exercise 6.2.14. 1. Write a formal definition of lim_{x→a} f (x) ≠ l.
2. Is ∃x [p(x) ∧ q(x)] → ∃x p(x) ∧ ∃x q(x) valid? Is its converse valid?
3. [common ones] If r does not involve x, then establish the following assertions.
(a) ¬∀x p(x) ≡ ∃x ¬p(x); ¬∃x p(x) ≡ ∀x ¬p(x).
(b) ∃x (p(x) ∨ q(x)) ≡ ∃x p(x) ∨ ∃x q(x); ∃x (p(x) ∧ q(x)) ⇒ ∃x p(x) ∧ ∃x q(x).
(c) ∀x (p(x) ∧ q(x)) ≡ ∀x p(x) ∧ ∀x q(x); ∀x (p(x) ∨ q(x)) ⇐ ∀x p(x) ∨ ∀x q(x).
(d) ∀x (r ∨ q(x)) ≡ r ∨ ∀x q(x); ∀x (r → q(x)) ≡ r → ∀x q(x).
(e) ∃x (r ∧ q(x)) ≡ r ∧ ∃x q(x); ∃x (r → q(x)) ≡ r → ∃x q(x).
(f ) ∀x (p(x) → r) ≡ (∃x p(x)) → r; ∃x (p(x) → r) ≡ (∀x p(x)) → r.

4. Translate and check for validity of the following arguments.

(a) Recall that the decimal representation of a rational number either terminates or begins
to repeat the same finite sequence of digits, whereas that of an irrational number
neither terminates nor repeats. The square root of a natural number either has a
decimal representation which is terminating or has a decimal representation which
is non-terminating and non-repeating. The square roots of all natural numbers which
are squares have terminating decimal representations. Therefore, the square root of a
natural number which is not a square is an irrational number.
(b) For any two algebraic numbers a and b, a ≠ 0, 1 and b irrational, we have that a^b
is transcendental. The number i (imaginary unit) is irrational and algebraic. The
number i is not equal to 0 or 1. Therefore, the number i^i is transcendental.

5. (a) Give an interpretation to show that ∀x (r(x) → ∃y (r(y) ∧ p(x, y))) is not valid.
Ans: Let UD = {2, 3, 5, 7, 11}, r(x) : x is a prime number and p(x, y) : x > y. Then,
for all x ∈ UD, r(x) is T . But, for x0 = 2, there does not exist y ∈ UD such that y is
a prime and y < 2.

(b) Give an interpretation to show the incorrectness of ∀x (p(x) → q(x)) ⇒ ∃x (¬p(x) →
¬q(x)).
Ans: Choose {4, 6} as UD and suppose that p(x): x is a prime and q(x): x > 1. Then,
the first statement is true. Notice that, for each x in the UD, ¬q(x) is false and ¬p(x) is
true. So, the RHS is false.
6. Write a formal statement taking UD:= all students in all IIT’s in India, for the following.
‘For each student in IITG there is a student in IITG with more CPI.’
Ans: Let s(x): x is a student of IITG, and p(x, y): y has more CPI than x. Then the formal
expression is ∀x (s(x) → ∃y (s(y) ∧ p(x, y))).

7. Let UD= R, p(x): x is an integer, and q(x): x is a rational number. Translate the
following statements into English.

(a) ∀x (p(x) → q(x))
(b) ∃x (¬p(x) ∧ q(x))
(c) ∀x (p(x) ∧ (x > 2)) → ∀x (q(x) ∧ (x < 2))
(d) ∃ε > 0 ∀δ > 0 (0 < |x − a| < δ → |f (x) − l| < ε)

Ans:
(a) Every integer is a rational number.
(b) There is a rational number which is not an integer.
(c) If each object is an integer greater than 2, then each object is a rational number less than
2.
(d) The set {|f (x) − l| : x ∈ R, 0 < |x − a| < δ} has an upper bound ε.

8. Take the most general UD. Check whether the following conclusion is valid or not.
Each student writes the exam using blue ink or black ink. A student who writes the exam
using black ink and does not write his/her roll number gets an F grade. A student who
writes the exam using blue ink and does not have his/her ID card gets an F grade. A
student who has his/her ID card has written the exam with black ink. Therefore, a student
who passes the exam must have written his roll number.
Ans: Let S(x) : x is a student, B(x) : x writes the exam using blue ink, Bl(x) : x writes
the exam using black ink, R(x) : x writes his/her roll number, I(x) : x has his/her ID card,
F (x) : x gets an F grade. We have to determine whether the following conclusion is valid.

{∀x (S(x) → B(x) ∨ Bl(x)), ∀x (S(x) ∧ Bl(x) ∧ ¬R(x) → F (x)),
∀x (S(x) ∧ B(x) ∧ ¬I(x) → F (x)), ∀x (S(x) ∧ I(x) → Bl(x))}
⇒ ∀x (S(x) ∧ ¬F (x) → R(x)).

Assume that under some interpretation the premises are T and the conclusion is F . So, there
is an x0 ∈ UD such that S(x0 ) is T , F (x0 ) is F and R(x0 ) is F .
From the second formula, it follows that Bl(x0 ) is F . Now, from the first formula, it follows
that B(x0 ) is T . From the third formula, it now follows that I(x0 ) is T . In that case, the
fourth formula is F , a contradiction. Hence, the conclusion is valid.
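
When the UD is finite, an interpretation of a quantified formula can be evaluated directly: ∀ becomes a check over all elements and ∃ a search for one element. The sketch below does this for the two interpretations given in item 5 of Exercise 6.2.14. The helper names `is_prime` and `implies` and the variable names are assumptions made for this illustration.

```python
def is_prime(n):
    """True if n is a prime number."""
    return n > 1 and all(n % d for d in range(2, int(n ** 0.5) + 1))

def implies(a, b):
    """Truth value of a → b."""
    return (not a) or b

# Item 5(a): UD = {2, 3, 5, 7, 11}, r(x): x is prime, p(x, y): x > y.
ud_a = [2, 3, 5, 7, 11]
formula_a = all(
    implies(is_prime(x), any(is_prime(y) and x > y for y in ud_a))
    for x in ud_a
)

# Item 5(b): UD = {4, 6}, p(x): x is prime, q(x): x > 1.
ud_b = [4, 6]
premise_b = all(implies(is_prime(x), x > 1) for x in ud_b)
conclusion_b = any(implies(not is_prime(x), not (x > 1)) for x in ud_b)

print(formula_a, premise_b, conclusion_b)  # False True False
```

Python's `all` and `any` play the roles of ∀ and ∃ here; this only works because the UD is a finite list.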

Chapter 7

Graphs

7.1 Basic Concepts

Experiment
‘Start from a dot. Move through each line exactly once. Draw it.’ Which of the following
pictures can be drawn? What if we want the ‘starting dot to be the finishing dot’ ?

Later, we shall see a theorem by Euler addressing this question.

Definition 7.1.1. A pseudograph or a general graph G is a pair (V, E) where V is a nonempty

set and E is a multiset of unordered pairs of points of V . The set V is called the vertex set and
its elements are called vertices. The set E is called the edge set and its elements are called
edges.

Example 7.1.2. G = ({1, 2, 3, 4}, {{1, 1}, {1, 2}, {2, 2}, {3, 4}, {3, 4}}) is a pseudograph.

Discussion 7.1.3. A pseudograph can be represented in picture in the following way.

1. Put different points on the paper for vertices and label them.

2. If {u, v} appears in E some k times, draw k distinct lines joining the points u and v.

3. A loop at u is drawn if {u, u} ∈ E.

Example 7.1.4. A picture for the pseudograph in Example 7.1.2 is given in Figure 7.1.

Figure 7.1: A pseudograph

Definition 7.1.5. 1. An edge {u, v} is sometimes denoted uv. An edge uu is called a loop.
The vertices u and v are called the end vertices of the edge uv. Let e be an edge. We
say ‘e is incident on u’ to mean that ‘u is an end vertex of e’.

2. A multigraph is a pseudograph without loops. A multigraph is a simple graph if no

edge appears twice.

3. Henceforth, all graphs in this book are simple with a finite vertex set, unless stated otherwise.

4. We use V (G) (or simply V ) and E(G) (or simply E) to denote the vertex set and the edge
set of G, respectively. The number |V (G)| is the order of the graph G. Sometimes it is
denoted |G|. By ‖G‖ we denote the number of edges of G. A graph with n vertices and
m edges is called an (n, m) graph. The (1, 0) graph is the trivial graph.

5. If uv is an edge in G, then we say ‘u and v are adjacent in G’ or ‘u is a neighbor of v’.
We write u ∼ v to denote that ‘u is adjacent to v’. Two edges e1 and e2 are adjacent if
they have a common end vertex.

6. A set of vertices or edges is said to be independent if no two of them are adjacent. The
maximum size of an independent vertex set is called the independence number, denoted
α(G), of G.

7. If v ∈ V (G), by N (v) or NG (v), we denote the set of neighbors of v in G and |N (v)| is

called the degree of v. It is usually denoted by dG (v) or d(v). A vertex of degree 0 is
called isolated. A vertex of degree one is called a pendant vertex.

Discussion 7.1.6. Note that a graph is an algebraic structure, namely, a pair of sets satisfying
some conditions. However, it is easy to describe and carry out the arguments with a pictorial
representation of a graph. Henceforth, the pictorial representations are used to describe graphs
and to provide our arguments, whenever required. There is no loss of generality in doing this.

Example 7.1.7. Consider the graph G in Figure 7.2. The vertex 12 is an isolated vertex. We
have N (1) = {2, 4, 7}, d(1) = 3. The vertices 1 and 6 are not adjacent. The set {9, 10, 11, 2, 4, 7}
is an independent vertex set. The set {{1, 2}, {8, 10}, {4, 5}} is an independent edge set.

Figure 7.2: A graph G.

Definition 7.1.8. Let G = (V, E) be a graph on n vertices, say V = {v1 , . . . , vn }. Then, G is

said to be a

1. Complete graph, denoted Kn , if each pair of vertices in G are adjacent.

2. Path graph, denoted Pn , if E = {vi vi+1 | 1 ≤ i ≤ n − 1}.

3. Cycle graph, denoted Cn , if E = {vi vi+1 | 1 ≤ i ≤ n − 1} ∪ {vn v1 }.

4. Bipartite graph if V = V1 ∪ V2 such that |V1 |, |V2 | ≥ 1, V1 ∩ V2 = ∅ and each edge
{u, v} ∈ E has one end vertex in V1 and the other in V2 .

5. Complete bipartite graph, denoted Kr,s , if E = {vi vj | 1 ≤ i ≤ r, r + 1 ≤ j ≤ n} with
r + s = n.

The importance of the labels of the vertices depends on the context. At this point of time,
even if we interchange the labels of the vertices, we still call them a complete graph or a path
graph or a cycle or a complete bipartite graph.

Figure 7.3: Pn and Cn .

Quiz 7.1.9. What is the maximum number of edges possible in a simple graph of order n?¹

Lemma 7.1.10. [Hand shaking lemma] In any graph G, $\sum_{v \in V} d(v) = 2|E|$. Thus, the number
of vertices of odd degree is even.

Proof. Each edge contributes 2 to the sum $\sum_{v \in V} d(v)$. Hence, $\sum_{v \in V} d(v) = 2|E|$. Note that

$$2|E| = \sum_{v \in V} d(v) = \sum_{v:\, d(v) \text{ is odd}} d(v) + \sum_{v:\, d(v) \text{ is even}} d(v)$$

is even. So, $\sum_{v:\, d(v) \text{ is odd}} d(v)$ is even. Hence, the number of vertices of odd degree is even.

¹ C(n, 2).
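
The hand shaking lemma can be checked numerically on a small graph. The following is a minimal sketch; the graph `V`, `E` is a hypothetical example chosen only for illustration and does not come from the notes.

```python
def degrees(vertices, edges):
    """Return a dict mapping each vertex to its degree."""
    deg = {v: 0 for v in vertices}
    for u, v in edges:
        deg[u] += 1
        deg[v] += 1
    return deg

# Hypothetical example: a 4-cycle with one chord and an isolated vertex 5.
V = [1, 2, 3, 4, 5]
E = [(1, 2), (2, 3), (3, 4), (4, 1), (1, 3)]

deg = degrees(V, E)
degree_sum = sum(deg.values())                    # should equal 2|E|
odd_vertices = [v for v in V if deg[v] % 2 == 1]  # should be even in number

print(degree_sum, 2 * len(E), odd_vertices)
```

Each edge is counted once at each of its two end vertices, which is exactly the argument used in the proof.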
Figure 7.4: Some well known families of graphs (K1,1 , K1,2 , K2,2 , K2,3 ; K1 , . . . , K5 ; C3 , . . . , C6 ; P1 , . . . , P5 )

Example 7.1.11. The graph in Figure 7.5 is called the Petersen graph. We shall use it as
an example in many places.

Figure 7.5: The Petersen graph

Quiz 7.1.12. In a party of 27 persons, prove that someone must have an even number of friends
(friendship is mutual).¹

¹ Otherwise $\sum_{v} d(v)$ is odd.

Proposition 7.1.13. In a graph G with n = |G| ≥ 2, there are two vertices of equal degree.

Proof. If G has two or more isolated vertices, we are done. So, suppose G has exactly one
isolated vertex. Then, the remaining n − 1 vertices have degrees between 1 and n − 2 and hence
by PHP, the result follows. If G has no isolated vertex, then G has n vertices whose degrees lie
between 1 and n − 1. Now, again apply PHP to get the required result.
Exercise 7.1.14. 1. Let X = (V, E) be a graph with a vertex v ∈ V of odd degree. Then,
prove that there exists a vertex u ∈ V such that there is a path from v to u and d(u) is
also odd.
2. Let X = (V, E) be a graph having exactly two vertices, say u and v, of odd degree. Then,
prove that there is a path in X connecting u and v.
Definition 7.1.15. 1. The minimum degree of a vertex in G is denoted δ(G) and the
maximum degree of a vertex in G is denoted ∆(G).
2. A graph G is called k-regular if d(v) = k for all v ∈ V (G).
3. A 3-regular graph is called cubic.
Example 7.1.16. 1. The graph Kn is (n − 1)-regular.
2. The graph K4 is cubic.
3. The graph C4 is 2-regular.
4. The graph P4 is not regular.

5. The Petersen graph is cubic.

6. Consider the graph G in Figure 7.2. We have δ(G) = 0 and ∆(G) = 3.

Quiz 7.1.17. Can we have a cubic graph on 5 vertices?¹

Definition 7.1.18. A graph H is a subgraph of G if V (H) ⊆ V (G) and E(H) ⊆ E(G). If

U ⊆ V (G), then the subgraph of G induced by U is denoted by ⟨U⟩ = (U, E), where the edge
set E = {uv ∈ E(G) | u, v ∈ U }. A subgraph H of G is a spanning subgraph if V (G) = V (H).
A k-regular spanning subgraph is called a k-factor.
Example 7.1.19. 1. Consider the graph G in Figure 7.2.

(a) Let H1 be the graph with V (H1 ) = {6, 7, 8, 9, 10, 12} and E(H1 ) = {{6, 7}, {9, 10}}.
Then, H1 is not a subgraph of G.

(b) Let H2 be the graph with V (H2 ) = {6, 7, 8, 9, 10, 12} and E(H2 ) = {{6, 7}, {8, 10}}.
Then, H2 is a subgraph but not an induced subgraph of G.
(c) Let H3 be the induced subgraph of G on the vertex set {6, 7, 8, 9, 10, 12}. Then, verify
that E(H3 ) = {{6, 7}, {8, 9}, {8, 10}}.
(d) The graph G does not have a 1-factor.

2. A complete graph has a 1-factor if and only if it has an even order.

3. The Petersen graph has many 1-factors. One of them is obtained by selecting the edges
{1, 6}, {2, 7}, {3, 8}, {4, 9}, and {5, 10}.

¹ No, as $\sum_{v} d(v) = 15$ is not even.

Quiz 7.1.20. Consider K8 on the vertex set {1, 2, . . . , 8}. How many 1-factors does it have?¹
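
A brute-force count can be compared with the closed form 8!/(4!(2!)^4) given in the footnote. This is only a sketch; the function name `count_perfect_matchings` is an illustrative choice, not notation from the notes.

```python
from math import factorial

def count_perfect_matchings(vertices):
    """Count the 1-factors of the complete graph on `vertices`:
    match the first vertex with each other vertex, then recurse."""
    if not vertices:
        return 1
    rest = vertices[1:]
    total = 0
    for i in range(len(rest)):
        remaining = rest[:i] + rest[i + 1:]   # drop the partner of the first vertex
        total += count_perfect_matchings(remaining)
    return total

brute = count_perfect_matchings(tuple(range(1, 9)))
formula = factorial(8) // (factorial(4) * factorial(2) ** 4)
print(brute, formula)
```

For an odd number of vertices the recursion returns 0, in agreement with the fact that a complete graph has a 1-factor only when its order is even.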

Definition 7.1.21. Let G be a graph and v be a vertex. Then, the graph G − v, called the
vertex deleted subgraph, is obtained by deleting v and all the edges that are incident with
v. If e ∈ E(G), then the graph G − e = (V, E(G) \ {e}) is called the edge deleted subgraph.
If u, v ∈ V (G) such that u ≁ v, then G + uv = (V, E(G) ∪ {uv}) is called the graph obtained
by edge addition.

Example 7.1.22. Consider the graph G in Figure 7.2. Let H2 be the graph with V (H2 ) =

{6, 7, 8, 9, 10, 12} and E(H2 ) = {{6, 7}, {8, 10}}. Consider the edge e = {8, 9}. Then, H2 + e is
the induced subgraph ⟨{6, 7, 8, 9, 10, 12}⟩ and H2 − 8 = ⟨{6, 7, 9, 10, 12}⟩.

Definition 7.1.23. [Complement graph] The complement $\overline{G}$ of a graph G is defined as
$(V(G), \overline{E})$, where $\overline{E} = \{uv \mid u \neq v,\ uv \notin E(G)\}$.

Example 7.1.24. 1. See Figure 7.6 for two examples of complement graphs.

Figure 7.6: Complement graphs (C4 and its complement; C5 and its complement, with $\overline{C_5} = C_5$)

2. The complement of K3 contains 3 isolated points/vertices.

3. For any graph G, $\|G\| + \|\overline{G}\| = C(|G|, 2)$.

4. In any graph G of order n, $d_G(v) + d_{\overline{G}}(v) = n - 1$. Thus, $\Delta(G) + \Delta(\overline{G}) \ge n - 1$.

Quiz 7.1.25. 1. Characterize graphs G such that $\Delta(G) + \Delta(\overline{G}) = n - 1$.²

2. Can we have a graph G such that $\Delta(G) + \Delta(\overline{G}) = n$?

3. Show that a k-regular simple graph on n vertices exists if and only if kn is even and
n ≥ k + 1.

Ans: Put n dots in a circular design on the paper. If k = 2r, put edges between i and the r
points that we meet first while moving from i clockwise. If k = 2r + 1 (so that n is even), also
join each i to the diametrically opposite point.
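
The circular construction can be sketched as follows. The function names and the vertex labels 0, . . . , n − 1 are illustrative assumptions. For odd k the sketch joins each point to the diametrically opposite one, which is possible because kn even forces n to be even.

```python
def regular_graph(n, k):
    """Edge set of a k-regular simple graph on vertices 0..n-1,
    or None when no such graph exists (kn odd or n < k + 1)."""
    if (k * n) % 2 == 1 or n < k + 1:
        return None
    edges = set()
    r = k // 2
    for i in range(n):
        for step in range(1, r + 1):            # the r nearest points clockwise
            edges.add(frozenset((i, (i + step) % n)))
        if k % 2 == 1:                          # odd k: n is even, add the diameter
            edges.add(frozenset((i, (i + n // 2) % n)))
    return edges

def degree_sequence(n, edges):
    """Degrees of the vertices 0..n-1."""
    deg = [0] * n
    for e in edges:
        for v in e:
            deg[v] += 1
    return deg

print(degree_sequence(6, regular_graph(6, 3)))  # a cubic graph on 6 vertices
```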

Definition 7.1.26. The intersection of two graphs G and H, denoted G ∩ H, is defined as

(V (G) ∩ V (H), E(G) ∩ E(H)). The union of two graphs G and H, denoted G ∪ H, is defined
as (V (G) ∪ V (H), E(G) ∪ E(H)). A disjoint union of two graphs is the union while treating
the vertex sets as disjoint sets.

¹ $8!/(4!\,(2!)^4)$.
² If $d_G(u) < d_G(v)$, then $d_{\overline{G}}(u) = n - 1 - d_G(u)$. Hence, $\Delta(G) + \Delta(\overline{G}) \ge d_G(v) + n - 1 - d_G(u) > d_G(v) + n - 1 - d_G(v) = n - 1$. Thus, the answer is regular graphs.

Example 7.1.27. Two graphs G and H are shown below. The graphs G ∪ H and G ∩ H are
also shown below.

Figure 7.7: Union and intersection of graphs (panels: G, H, G ∪ H and G ∩ H)

The disjoint union of G and G ∪ H is G1 in Figure 7.8.

Figure 7.8: Disjoint union and join of graphs

Definition 7.1.28. If V (G) ∩ V (G′ ) = ∅, then the join G + G′ (here ‘+’ represents join of
two graphs) is defined as G ∪ G′ + {vv′ : v ∈ V, v′ ∈ V′ } (here ‘+’ means adding a set of edges
to a given graph).

Example 7.1.29. (a) K2 + K3 = K5 .

(b) $\overline{K_2} + \overline{K_2} = C_4$.
Quiz 7.1.30. 1. What is the complement of the disjoint union of G and H?¹
2. Is $K_{m,n} = \overline{K_m} + \overline{K_n}$?
Ans: Yes. Put all possible edges between $\overline{K_m}$ and $\overline{K_n}$.

¹ $\overline{G} + \overline{H}$.

Definition 7.1.31. Let G = (V, E) and G′ = (V′ , E′ ) be two graphs. Then, the Cartesian
product of G and G′ , denoted G × G′ = (V1 , E1 ), is a graph having V1 = V × V′ and whose
edge set consists of all elements {(u1 , u2 ), (v1 , v2 )}, where either u1 = v1 and {u2 , v2 } ∈ E′ or
u2 = v2 and {u1 , v1 } ∈ E.

Example 7.1.32. See the graphs in Figure 7.9.

Figure 7.9: Cartesian product of graphs (panels: X, Y , X × Y and Y × Y )

7.2 Connectedness
Definition 7.2.1. A u-v walk in G is a finite sequence of vertices [u = v1 , v2 , · · · , vk = v] such
that vi vi+1 ∈ E, for all i = 1, · · · , k − 1. The length of a walk is the number of edges on it. A
walk is called a trail if edges on the walk are not repeated. A u-v walk is called a path
if the vertices involved are all distinct, except that u and v may be the same. A path can have
length 0. A walk (trail, path) is called closed if u = v. A closed path is called a cycle/circuit.
Thus, in a simple graph a cycle has length at least 3. A cycle (walk, path) of length k is also
written as a k-cycle (k-walk, k-path). If P is a u-v path with u ≠ v, then we sometimes call u
and v the end vertices of P and the remaining vertices on P the internal vertices.

Example 7.2.2.

(a) Take G = K5 with vertex set {1, 2, 3, 4, 5}.

• Then, [1, 2, 3, 2, 1, 2, 5, 4, 3] is an 8-walk in G and [1, 2, 2, 1] is not a walk.
• The walk [1, 2, 3, 4, 5, 2, 4, 1] is a closed trail.
• The walk [1, 2, 3, 5, 4, 1] is a closed path, that is, it is a 5-cycle.
• The maximum length of a cycle in G is 5 and the minimum length of a cycle in G is 3.
• There are 10 = C(5, 3) many 3-cycles in G.
• Verify that the number of 4-cycles in G is not C(5, 4). Can it be 3 × C(5, 4)?

(b) Let G be the Petersen graph.

• There is a 9-cycle in G, namely, [6, 8, 10, 5, 4, 3, 2, 7, 9, 6].
• There are no 10-cycles in G. We shall see this when we discuss the Eulerian graphs.
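
The claims about K5 in part (a) can be checked mechanically. The sketch below encodes the definitions of walk, trail and path from Definition 7.2.1; all function names are illustrative, and a closed path is required to be a trail so that, as in the text, a cycle in a simple graph has length at least 3.

```python
from itertools import combinations, permutations

def is_walk(seq, edges):
    """Every pair of consecutive vertices must be an edge."""
    return all(frozenset((a, b)) in edges for a, b in zip(seq, seq[1:]))

def is_trail(seq, edges):
    """A walk in which no edge is repeated."""
    used = [frozenset((a, b)) for a, b in zip(seq, seq[1:])]
    return is_walk(seq, edges) and len(used) == len(set(used))

def is_path(seq, edges):
    """A trail with distinct vertices, except that the two ends may coincide."""
    inner = seq[:-1] if len(seq) > 1 and seq[0] == seq[-1] else seq
    return is_trail(seq, edges) and len(inner) == len(set(inner))

def count_cycles(vertices, edges, k):
    """Count k-cycles (k >= 3), each counted once via its edge set."""
    seen = set()
    for p in permutations(vertices, k):
        seq = list(p) + [p[0]]
        if is_path(seq, edges):
            seen.add(frozenset(frozenset(e) for e in zip(seq, seq[1:])))
    return len(seen)

# K5 on the vertex set {1, ..., 5}
K5 = {frozenset(e) for e in combinations(range(1, 6), 2)}
print(count_cycles(range(1, 6), K5, 3), count_cycles(range(1, 6), K5, 4))
```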

Proposition 7.2.3 (Technique). Let G be a graph and u, v ∈ V (G), u ≠ v. Let W = [u =
u1 , . . . , uk = v] be a walk. Then, W contains a u-v path.

Proof. If no vertex on W repeats, then W is itself a path. So, let ui = uj for some i < j. Now,
consider the walk W1 = [u1 , . . . , ui−1 , uj , uj+1 , . . . uk ]. This is also an u-v walk but of shorter
length. Thus, using induction on the length of the walk, the desired result follows.
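
The shortening step of the proof can be carried out in a single pass over the walk. This is a sketch of one way to do it; `walk_to_path` is a hypothetical helper name, and the input is assumed to be a valid walk given as a list of vertices.

```python
def walk_to_path(walk):
    """Shorten a u-v walk to a u-v path by cutting out the segment
    between two occurrences of a repeated vertex."""
    path = []
    position = {}                      # vertex -> its index in `path`
    for vertex in walk:
        if vertex in position:
            # Drop everything after the earlier occurrence of `vertex`.
            cut = position[vertex]
            for dropped in path[cut + 1:]:
                del position[dropped]
            path = path[:cut + 1]
        else:
            position[vertex] = len(path)
            path.append(vertex)
    return path

print(walk_to_path([1, 2, 3, 2, 1, 2, 5, 4, 3]))
```

After each step the last vertex of `path` is the current vertex of the walk, so consecutive vertices of the output are always joined by an edge of the walk.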

Definition 7.2.4. The distance d(u, v) between two vertices u and v in G is the shortest
length of an u-v path in G. If no such path exists, the distance is taken to be ∞. The greatest
distance between any two vertices in a graph G is called the diameter of G. We shall use
diam(G) to denote the diameter of G. Let $\mathrm{dist}_v = \max_{u \in G} d(v, u)$. The radius is $\min_{v \in G} \mathrm{dist}_v$ and
the center consists of all vertices v for which $\mathrm{dist}_v$ is the radius. The girth, denoted g(G), of
a graph G is the minimum length of a cycle contained in G. If G has no cycle, then we put
g(G) = ∞.

Example 7.2.5. Let G be the Petersen graph. It has diameter 2. The radius is 2. Each vertex
is in the center. Its girth is 5.
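
These values can be recomputed with breadth-first search. The sketch below assumes the usual labelling of the Petersen graph, which is consistent with the 1-factor and the 9-cycle quoted elsewhere in this chapter: outer cycle 1-2-3-4-5, spokes {i, i + 5}, and inner cycle 6-8-10-7-9. The function names are illustrative.

```python
from collections import deque

def bfs_distances(adj, source):
    """Distances from `source` to every reachable vertex."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        x = queue.popleft()
        for y in adj[x]:
            if y not in dist:
                dist[y] = dist[x] + 1
                queue.append(y)
    return dist

def eccentricities(adj):
    """dist_v = max_u d(v, u); infinite if some vertex is unreachable."""
    ecc = {}
    for v in adj:
        dist = bfs_distances(adj, v)
        ecc[v] = max(dist.values()) if len(dist) == len(adj) else float("inf")
    return ecc

def girth(adj):
    """Length of a shortest cycle, using a BFS from every vertex."""
    best = float("inf")
    for s in adj:
        dist, parent = {s: 0}, {s: None}
        queue = deque([s])
        while queue:
            x = queue.popleft()
            for y in adj[x]:
                if y not in dist:
                    dist[y], parent[y] = dist[x] + 1, x
                    queue.append(y)
                elif parent[x] != y:          # a non-tree edge closes a cycle
                    best = min(best, dist[x] + dist[y] + 1)
    return best

# Assumed labelling of the Petersen graph (see the lead-in).
edges = [(1, 2), (2, 3), (3, 4), (4, 5), (5, 1),
         (1, 6), (2, 7), (3, 8), (4, 9), (5, 10),
         (6, 8), (8, 10), (10, 7), (7, 9), (9, 6)]
adj = {v: set() for v in range(1, 11)}
for u, v in edges:
    adj[u].add(v)
    adj[v].add(u)

ecc = eccentricities(adj)
diameter, radius = max(ecc.values()), min(ecc.values())
center = {v for v in adj if ecc[v] == radius}
print(diameter, radius, sorted(center), girth(adj))
```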

Practice 7.2.6. Determine the diameter, radius, center and girth of the following graphs: Pn ,
Cn , Kn and Kn,m .

Exercise 7.2.7. Let G be a graph. Then, show that the distance function d(u, v) is a metric
on V (G). That is, it satisfies
1. d(u, v) ≥ 0 for all u, v ∈ V (G) and d(u, v) = 0 if and only if u = v,
2. d(u, v) = d(v, u) for all u, v ∈ V (G) and
3. d(u, v) ≤ d(u, w) + d(w, v) for all u, v, w ∈ V (G).

Proposition 7.2.8 (Technique). Let G be a graph with ‖G‖ ≥ 1 and d(v) ≥ 2, for each vertex
except one, say v1 . Then, G has a cycle.

Proof. Consider a longest path [u1 , . . . , uk ] in G (as V (G) is finite, such a path exists). As
‖G‖ ≥ 1, we have k ≥ 2 and, reversing the path if necessary, we may assume that uk ≠ v1 . As
d(uk ) ≥ 2, uk must be adjacent to some vertex from u1 , . . . , uk−2 , otherwise, we can extend the
path to a longer one. Let i be the smallest index such that ui is adjacent to uk . Then,
[ui , ui+1 , . . . , uk , ui ] is a cycle.

Proposition 7.2.9 (Technique). Let P and Q be two different u-v paths in G. Then, P ∪ Q
contains a cycle.

Proof. Imagine a signal was sent from u to v via P and was returned back from v to u via Q.
Call an edge ‘dead’ if the signal has passed through it twice. Notice that each vertex receives the
signal as many times as it sends the signal.
Is E(P ) = E(Q)? No, otherwise both P and Q are the same paths.
So, there are some ‘alive’ edges. Get an alive edge $\overrightarrow{v_1 v_2}$. There must be an alive edge
$\overrightarrow{v_2 v_3}$.¹ Similarly get $\overrightarrow{v_3 v_4}$ and so on. Stop at the first instance of repetition of a vertex:
[v1 , v2 , · · · , vi , vi+1 , · · · , vj = vi ]. Then, [vi , vi+1 , · · · , vj = vi ] is a cycle.

Alternate. Consider the graph H = (V (P ) ∪ V (Q), E(P )∆E(Q)), where ∆ is the symmetric
difference. Notice that E(H) ≠ ∅, otherwise P = Q. As the degree of each vertex in the
multigraph P ∪ Q is even and H is obtained after deleting pairs of multiple edges, each vertex
in H has even degree. Hence, after discarding the isolated vertices of H, Proposition 7.2.8
shows that H has a cycle.

¹ Otherwise, v2 is incident to just one alive edge and some dead edges. This means v2 has received more signal
than it has sent.

Proposition 7.2.10. Every graph G containing a cycle satisfies g(G) ≤ 2 diam(G) + 1.

Proof. Let C = [v1 , v2 , . . . , vk , v1 ] be the shortest cycle and diam(G) = r. If k ≥ 2r + 2, then

consider the path P = [v1 , v2 , . . . , vr+2 ]. Since the length of P is r + 1 and diam(G) = r, there
is a vr+2 -v1 path R of length at most r. Note that P and R are different v1 -vr+2 paths. By
Proposition 7.2.9, the closed walk P ∪ R of length at most 2r + 1 contains a cycle. Hence, the
length of this cycle is at most 2r + 1, a contradiction to C having the smallest length k ≥ 2r + 2.

Definition 7.2.11. Let C = [v1 , . . . , vk = v1 ] be a cycle. An edge vi vj is called a chord of C if

it is not an edge of C. A graph is called chordal if each cycle of length at least 4 has a chord.
A graph is acyclic if it has no cycles.

Example 7.2.12. Complete graphs are chordal, so are the acyclic graphs. The Petersen graph
is not chordal.
Quiz 7.2.13. 1. How many acyclic graphs are there on the vertex set {1, 2, 3}?¹
2. How many chordal graphs are there on the vertex set {1, 2, 3, 4}?²
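
For vertex sets this small, both counts can be obtained by enumerating all graphs. The following brute-force sketch applies the definitions directly; the function names are illustrative, and the chordality test is far too slow for anything but tiny graphs.

```python
from itertools import combinations, permutations

def has_cycle(n, edges):
    """Union-find test: an edge joining two already connected vertices closes a cycle."""
    parent = list(range(n + 1))
    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x
    for u, v in edges:
        ru, rv = find(u), find(v)
        if ru == rv:
            return True
        parent[ru] = rv
    return False

def is_chordal(n, edges):
    """Brute force: every cycle of length at least 4 must have a chord."""
    E = {frozenset(e) for e in edges}
    for k in range(4, n + 1):
        for cyc in permutations(range(1, n + 1), k):
            cycle_edges = {frozenset((cyc[i], cyc[(i + 1) % k])) for i in range(k)}
            if cycle_edges <= E:
                chords = {frozenset(p) for p in combinations(cyc, 2)} - cycle_edges
                if not (chords & E):
                    return False
    return True

def all_graphs(n):
    """All edge sets on the vertex set {1, ..., n}."""
    pairs = list(combinations(range(1, n + 1), 2))
    for r in range(len(pairs) + 1):
        yield from combinations(pairs, r)

acyclic_on_3 = sum(not has_cycle(3, g) for g in all_graphs(3))
chordal_on_4 = sum(is_chordal(4, g) for g in all_graphs(4))
print(acyclic_on_3, chordal_on_4)
```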
Definition 7.2.14. 1. A graph G is said to be maximal with respect to a property P if G
has property P and no proper supergraph of G has the property P . We similarly define
the term minimal.

Notice! The class of all graphs with that property is the POSET here. So, the maximality and
the minimality are defined naturally.

2. A complete subgraph of G is called a clique. The maximum order of a clique is called the
clique number of G. It is denoted ω(G).
3. A graph G is called connected if there is an u-v path, for each u, v ∈ V (G).
4. A graph which is not connected is called disconnected. If G is a disconnected graph,
then a maximal connected subgraph is called a component or sometimes a connected
component.

Example 7.2.15. Consider the graph G shown in Figure 7.2. Then,

1. some cliques in G are ⟨{8, 10}⟩, ⟨{2}⟩. The first is a maximal clique. Notice that every
vertex is a clique. Similarly each edge is a clique. Here ω(G) = 2.
2. the graph G is not connected. It has four connected components, namely, ⟨{8, 9, 10, 11}⟩,
⟨{1, 2, 3, 4, 5, 6, 7}⟩, ⟨{12}⟩ and ⟨{13}⟩.

Quiz 7.2.16. What is ω(G) for the Petersen graph?³

¹ 7: 3 edges can be put in $2^3$ ways. One of them is a cycle.
² 61: 6 edges can be put in $2^6$ ways. There are three 4-cycles.
³ 2.

Proposition 7.2.17. If δ(G) ≥ 2, then G has a path of length δ(G) and a cycle of length at
least δ(G) + 1.

Proof. Let [v1 , · · · , vk ] be a longest path in G. As d(vk ) ≥ 2, vk is adjacent to some vertex
v ≠ vk−1 . If v is not on the path, then we have a path that is longer than [v1 , · · · , vk ], a
contradiction. Hence, every neighbor of vk lies on the path. Let i be the smallest positive
integer such that vi is adjacent to vk . Thus,

δ(G) ≤ d(vk ) ≤ |{vi , vi+1 , · · · , vk−1 }|.

Hence, the cycle C = [vi , vi+1 , · · · , vk , vi ] has length at least δ(G) + 1 and the length of the path
P = [vi , vi+1 , · · · , vk ] is at least δ(G).

Definition 7.2.18. The edge density, denoted ε(G), is defined to be the number |E(G)|/|V (G)|.
Observe that ε(G) is also a graph invariant.

Quiz 7.2.19. 1. When does ‘deletion of a vertex’ reduce edge density?¹

2. Is δ(G)/2 a lower bound for ε(G)?²

3. Suppose that ε(G) ≥ δ(G). Should we have a vertex v with ε(G) ≥ d(v)?³

Proposition 7.2.20. Let G be a graph with ‖G‖ ≥ 1. Then, G has a subgraph H with δ(H) >
ε(H) ≥ ε(G).

Proof. If ε(G) < δ(G), then we take H = G. Otherwise, there is a vertex v with ε(G) ≥ d(v).
Put G1 = G − v. Then, it can be easily verified that ε(G1 ) ≥ ε(G).
If ε(G1 ) < δ(G1 ), then we take H = G1 . Otherwise, there is a vertex v ∈ G1 with ε(G1 ) ≥ d(v).
Put G2 = G1 − v. Then, we again have ε(G2 ) ≥ ε(G1 ) ≥ ε(G).
Continuing as above, we note that “Initially ε(G) > 0. At the i-th stage, we obtained the
subgraph Gi satisfying |V (Gi )| = |G| − i, ε(Gi ) ≥ ε(Gi−1 ). That is, we have been reducing the
number of vertices and the corresponding edge densities have been nondecreasing.” Hence, this
process must stop before we reach a single vertex, as its edge density is 0.
So, let us assume that the process stops at H. Then, ‘ε(H) < δ(H)’ must be true, or else, the
process would not stop at H and hence the required result follows.
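
The deletion process in the proof is an algorithm and can be run directly. The sketch below uses exact fractions for the edge density; the example graph (a K4 with a pendant path attached) and the function names are hypothetical, and the input is assumed to have at least one edge, as in the proposition.

```python
from fractions import Fraction

def edge_density(adj):
    """eps(G) = |E(G)| / |V(G)| as an exact fraction."""
    m = sum(len(nbrs) for nbrs in adj.values()) // 2
    return Fraction(m, len(adj))

def dense_subgraph(adj):
    """Repeatedly delete a vertex v with d(v) <= eps of the current graph.
    Assumes the graph has at least one edge. Returns the adjacency dict
    of the subgraph H at which the process stops."""
    adj = {v: set(nbrs) for v, nbrs in adj.items()}   # work on a copy
    while True:
        eps = edge_density(adj)
        low = [v for v in adj if len(adj[v]) <= eps]
        if not low:
            return adj                                # now delta(H) > eps(H)
        v = low[0]
        for u in adj.pop(v):
            adj[u].discard(v)

# Hypothetical example: K4 on {1, 2, 3, 4} with the path 4-5-6 attached.
G = {1: {2, 3, 4}, 2: {1, 3, 4}, 3: {1, 2, 4}, 4: {1, 2, 3, 5}, 5: {4, 6}, 6: {5}}
H = dense_subgraph(G)
print(sorted(H), edge_density(G), edge_density(H))
```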

7.3 Isomorphism in Graphs

Definition 7.3.1. Two graphs G = (V, E) and G′ = (V′ , E′ ) are said to be isomorphic if
there is a bijection f : V → V′ such that u ∼ v in G if and only if f (u) ∼ f (v) in G′ , for each
u, v ∈ V . In other words, an isomorphism is a bijection between the vertex sets which preserves
adjacency. We write G ≅ G′ to mean that G is isomorphic to G′ .

¹ Put H = G − v. Then, ‖H‖ = ε(G)n − d(v), so that $\varepsilon(H) = \frac{\varepsilon(G)n - d(v)}{n-1} = \varepsilon(G) + \frac{\varepsilon(G) - d(v)}{n-1}$. So, we should
choose a vertex v with degree more than ε(G).
² Yes.
³ Yes. Otherwise, we have ε(G) < d(v), for each v. In particular ε(G) < δ(G), a contradiction.

Figure 7.10: F is isomorphic to G but F is not isomorphic to H

Example 7.3.2. Consider the graphs in Figure 7.10. Then, note that

1. the graph F is not isomorphic to H as α(F ), the independence number of F is 3 whereas

α(H) = 2. Alternately, H has a 3-cycle, whereas F does not.

2. the graph F is isomorphic to G as the map f : V (F ) → V (G) defined by f (1) = 1,

f (2) = 5, f (3) = 3, f (4) = 4, f (5) = 2 and f (6) = 6 gives an isomorphism.

Check the adjacency

F                G
1 → 2, 4, 6      f (1) = 1 → f (2) = 5, f (4) = 4, f (6) = 6
3 → 2, 4, 6      f (3) = 3 → f (2) = 5, f (4) = 4, f (6) = 6
5 → 2, 4, 6      f (5) = 2 → f (2) = 5, f (4) = 4, f (6) = 6

All edges are covered, no need to check any further.

Discussion 7.3.3. [Isomorphism] Let F and G be isomorphic under f : V (F ) → V (G). Take

F . Relabel each vertex v ∈ F as f (v). Call the new graph F′ . Then, F′ = G. This is so, as
V (F′ ) = V (G) and E(F′ ) = E(G) due to the isomorphic nature of the function f .

Practice 7.3.4. Take the graphs F and G of Figure 7.10. Take the isomorphism f (1) = 1,
f (2) = 5, f (3) = 3, f (4) = 4, f (5) = 2 and f (6) = 6. Obtain the F′ as described in Discussion
7.3.3. List V (F′ ) and E(F′ ). List V (G) and E(G). Notice that they are the same.

Definition 7.3.5. A graph G is called self-complementary if G ≅ $\overline{G}$.

Example 7.3.6. Let G be a self-complementary graph on n vertices. Then ‖G‖ = n(n − 1)/4.
Thus, either n = 4k or n = 4k + 1. Verify that

1. the path P4 = [1, 2, 3, 4] is self-complementary. An isomorphism from G to $\overline{G}$ is described
by f (i) = 2i (mod 5).

2. the cycle C5 = [0, 1, 2, 3, 4, 0] is self-complementary. An isomorphism from G to $\overline{G}$ is
described by f (i) = 2i (mod 5).
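
Both verifications can be done by comparing the image of the edge set with the edge set of the complement. In this sketch P4 is taken on the vertices 1, 2, 3, 4, an assumption made so that i ↦ 2i (mod 5) is a bijection of the vertex set; the helper names are illustrative.

```python
from itertools import combinations

def complement(vertices, edges):
    """Edge set of the complement graph."""
    E = {frozenset(e) for e in edges}
    return {frozenset(p) for p in combinations(vertices, 2)} - E

def maps_onto_complement(vertices, edges, f):
    """True if f permutes the vertices and sends E(G) exactly onto the
    edge set of the complement of G."""
    if sorted(f(v) for v in vertices) != sorted(vertices):
        return False
    image = {frozenset((f(u), f(v))) for u, v in edges}
    return image == complement(vertices, edges)

def double_mod_5(i):
    return (2 * i) % 5

P4 = ([1, 2, 3, 4], [(1, 2), (2, 3), (3, 4)])
C5 = ([0, 1, 2, 3, 4], [(0, 1), (1, 2), (2, 3), (3, 4), (4, 0)])

print(maps_onto_complement(*P4, double_mod_5), maps_onto_complement(*C5, double_mod_5))
```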

Exercise 7.3.7. 1. Construct a self-complementary graph of order 4k.

Ans: See Figure 7.11.

Figure 7.11: Self-complementary graphs (four blocks, on the vertex sets {1, . . . , k}, {k + 1, . . . , 2k}, {2k + 1, . . . , 3k} and {3k + 1, . . . , 4k}, each a copy of Kk or of its complement)

An isomorphism from G to $\overline{G}$ is described by

v:      1 · · · k          k + 1 · · · 2k      2k + 1 · · · 3k     3k + 1 · · · 4k
f (v):  3k + 1 · · · 4k    2k + 1 · · · 3k     1 · · · k           k + 1 · · · 2k

2. Construct a self-complementary graph of order 4k + 1.

Ans: Use the construction of self-complementary graphs of order 4k and introduce a new
vertex 4k + 1 and make it adjacent to each element of {1, 2, . . . , k, 2k + 1, . . . , 3k}.

Definition 7.3.8. A graph invariant is a function which assigns the same value (output) to
isomorphic graphs.

Example 7.3.9. Observe that some of the graph invariants are: |G|, ‖G‖, ∆(G), δ(G), ω(G),
α(G) and the multiset {d(v) : v ∈ V (G)}.

Exercise 7.3.10. How many graphs are there with vertex set {1, 2, . . . , n}? Do you find it easy
if we ask for nonisomorphic graphs (try for n = 4)?
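
For small n both questions can be explored by brute force. The sketch below counts labelled graphs with a formula and nonisomorphic graphs by reducing every graph to a canonical form, namely the smallest edge list over all relabellings. This is feasible only for very small n, and the function names are illustrative.

```python
from itertools import combinations, permutations

def count_labeled(n):
    """Each of the C(n, 2) possible edges is either present or absent."""
    return 2 ** (n * (n - 1) // 2)

def count_nonisomorphic(n):
    """Count isomorphism classes of graphs on n vertices by brute force."""
    vertices = range(n)
    pairs = list(combinations(vertices, 2))
    perms = list(permutations(vertices))
    classes = set()
    for mask in range(2 ** len(pairs)):
        edges = [p for i, p in enumerate(pairs) if (mask >> i) & 1]
        # Canonical form: lexicographically smallest relabelled edge list.
        canon = min(
            tuple(sorted(tuple(sorted((s[u], s[v]))) for u, v in edges))
            for s in perms
        )
        classes.add(canon)
    return len(classes)

print(count_labeled(4), count_nonisomorphic(4))
```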

Proposition 7.3.11 (Technique). Let f : G → H be an isomorphism and v ∈ V (G). Then,
G − v ≅ H − f (v).

Proof. Consider the bijection g : V (G − v) → V (H − f (v)) described by $g = f|_{V(G-v)}$.

Definition 7.3.12. An isomorphism of G to G is called an automorphism.

Example 7.3.13. 1. Identity map is always an automorphism on any graph.
2. Any permutation in Sn is an automorphism of Kn .
3. There are only two automorphisms of a path P8 . Is it true for Pn , for n ≥ 3?

Proposition 7.3.14. Let G be a graph and let Γ(G) denote the set of all automorphisms of G.
Then, Γ(G) forms a group under composition of functions.

Proof. Let V (G) = {1, 2, . . . , n} and σ, µ ∈ Γ(G) be two automorphisms. Then,

ij ∈ E(G) ⇔ µ(i)µ(j) ∈ E(G) ⇔ (σ ◦ µ)(i)(σ ◦ µ)(j) ∈ E(G).

Thus, σ ◦ µ is an automorphism. Moreover, µ−1 , σ −1 are indeed automorphisms.


Example 7.3.15. Determine Γ(C5 ).

Ans: Consider C5 = [1, . . . , 5, 1]. Note that σ = (2, 3, 4, 5, 1) is an automorphism. Hence,
{e, σ, σ^2 , . . . , σ^4 } ⊆ Γ(C5 ) as σ^5 = e.
Now, let µ be an automorphism with µ(1) = i. Put τ = σ^{6−i} µ. Then, τ is an automorphism
with τ (1) = 1. If τ (2) = 2, then the adjacency structure implies that τ (j) = j for j = 3, 4, 5.
Hence, in this case, σ^{6−i} µ = e and thus, µ = σ^{i−6} = σ^{i−1} .
If τ (2) ≠ 2, then τ (2) = 5, τ (3) = 4 and so τ = (2, 5)(3, 4) is the reflection which fixes 1. Let
us denote the permutation (2, 5)(3, 4) by ρ. Then, Γ(C5 ) is the group generated by σ and ρ and
hence Γ(C5 ) has 10 elements.
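
The order of Γ(C5) can be confirmed by testing all 5! permutations of the vertices. This brute-force sketch is practical only for small graphs; `automorphisms` is an illustrative name.

```python
from itertools import permutations

def automorphisms(vertices, edges):
    """All vertex permutations that map the edge set onto itself."""
    E = {frozenset(e) for e in edges}
    result = []
    for image in permutations(vertices):
        s = dict(zip(vertices, image))
        if {frozenset((s[u], s[v])) for u, v in E} == E:
            result.append(s)
    return result

C5 = [(1, 2), (2, 3), (3, 4), (4, 5), (5, 1)]
P4 = [(1, 2), (2, 3), (3, 4)]
K4 = [(1, 2), (1, 3), (1, 4), (2, 3), (2, 4), (3, 4)]

print(len(automorphisms([1, 2, 3, 4, 5], C5)))
```

Since a permutation is a bijection, mapping the edge set onto itself also maps non-edges to non-edges, so the test above is equivalent to preserving adjacency in both directions.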

Example 7.3.16. Notice that Γ(C5 ) has a subgroup Γ1 = {e, σ, σ^2 , σ^3 , σ^4 }, with σ^5 = e, of
order 5. Let G be a subgraph of C5 obtained by deleting some (zero allowed) edges. If ‖G‖ = 5,
then |Γ(G)| = 10. If ‖G‖ = 0, then |Γ(G)| = |S5 | = 5!. If ‖G‖ = 4, then |Γ(G)| = 2. If ‖G‖ = 3,
then |Γ(G)| = 2 or 4. If ‖G‖ = 2, then |Γ(G)| = 4 or 8. If ‖G‖ = 1, then |Γ(G)| = 2 × 3!. Thus,
there is no such subgraph of C5 whose automorphism group is Γ1 .
Exercise 7.3.17. 1. Determine the graphs G for which Γ(G) = Sn , the group of all
permutations of 1, . . . , n.
Ans: We know that Γ(Kn ) = Sn . Conversely, suppose Γ(G) = Sn , G ≠ Kn . Suppose
{1, 2} ∉ E(G). Then, for any i ≠ j, note that (1, i)(2, j) ∈ Sn = Γ(G) and hence observe
that ij ∉ E(G). So, G = $\overline{K_n}$.

2. Compute Γ(G) for some graphs of small order.

3. Let G be a subgraph of H of the same order. Explore more about the relationship between
Γ(G) and Γ(H).

7.4 Trees
Definition 7.4.1. A connected acyclic graph is called a tree. A forest is a graph whose
components are trees.

Proposition 7.4.2. Let T be a tree and u, v ∈ V (T ). Then, there is a unique u-v-path in T .

Proof. On the contrary, assume that there are two u-v-paths in T . Then, by Proposition 7.2.9,
T has a cycle, a contradiction.

Proposition 7.4.3. Let G be a graph with the property that ‘between each pair of vertices there
is a unique path’. Then, G is a tree.

Proof. Clearly, G is connected. If G has a cycle [v1 , v2 , · · · , vk = v1 ], then [v1 , v2 , . . . , vk−1 ] and
[v1 , vk−1 ] are two v1 -vk−1 paths. A contradiction.

Definition 7.4.4. Let G be a connected graph. A vertex v of G is called a cut vertex if G − v

is disconnected. Thus, G − v is connected if and only if v is not a cut vertex.

Proposition 7.4.5. Let G be a connected graph with |G| ≥ 2. If v ∈ V (G) with d(v) = 1, then
G − v is connected. That is, a vertex of degree 1 is never a cut vertex.

Proof. Let u, w ∈ V (G − v), u ≠ w. As G is connected, there is a u-w path P in G. The vertex
v cannot be an internal vertex of P , as each internal vertex has degree at least 2. Hence, the
path P is available in G − v. So, G − v is connected.

Proposition 7.4.6 (Technique). Let G be a connected graph with |G| ≥ 2 and let v ∈ V (G). If
G − v is connected, then either d(v) = 1 or v is on a cycle.

Proof. Assume that G − v is connected. If dG (v) = 1, then there is nothing to show. So, assume
that d(v) ≥ 2. We need to show that v is on a cycle in G.
Let u and w be two distinct neighbors of v in G. As G − v is connected there is a path, say
[u = u1 , . . . , uk = w], in G − v. Then, [u = u1 , . . . , uk = w, v, u] is a cycle in G containing v.

Quiz 7.4.7. Let G be a graph and v be a vertex on a cycle. Can G − v be disconnected?¹

Definition 7.4.8. Let G be a graph. An edge e in G is called a cut edge or a bridge if G − e

has more connected components than that of G.

Proposition 7.4.9 (Technique). Let G be connected and e = [u, v] be a cut edge. Then, G − e
has two components, one containing u and the other containing v.

Proof. If G − e is not disconnected, then by definition, e cannot be a cut edge. So, G − e has
at least two components. Let Gu (respectively, Gv ) be the component containing the vertex u
(respectively, v). We claim that these are the only components.
Let w ∈ V (G). Then, G is a connected graph and hence there is a path, say P , from w to
u. Moreover, either P contains v as its internal vertex or P doesn’t contain v. In the first case,
w ∈ V (Gv ) and in the latter case, w ∈ V (Gu ). Thus, every vertex of G is either in V (Gv ) or in
V (Gu ) and hence the required result follows.

Proposition 7.4.10 (Technique). Let G be a graph and e be an edge. Then, e is a cut edge if
and only if e is not on a cycle.

Proof. Suppose that e = [u, v] is a cut edge of G. Let F be the component of G that contains
e. Then, by Proposition 7.4.9, F − e has two components, namely, Fu that contains u and Fv
that contains v.
Let if possible, C = [u, v = v1 , . . . , vk = u] be a cycle containing e = [u, v]. Then, [v =
v1 , . . . , vk = u] is an u-v path in F − e. Hence, F − e is still connected. A contradiction. Hence,
e cannot be on any cycle.
Conversely, let e = [u, v] be an edge which is not on any cycle. Now, suppose that F is the
component of G that contains e. We need to show that F − e is disconnected.
Let if possible, there is an u-v-path, say [u = u1 , . . . , uk = v], in F − e. Then, [v, u =
u1 , . . . , uk = v] is a cycle containing e. A contradiction to e not lying on any cycle.
Hence, e is a cut edge of F . Consequently, e is a cut edge of G.
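
The definition of a cut edge can be applied directly: delete each edge in turn and count components. The sketch below does this on the graph used in the answer to Quiz 7.4.7, where only the edge {1, 2} lies on no cycle. The function names are illustrative, and faster depth-first methods exist for large graphs.

```python
def connected_components(vertices, edges):
    """Number of connected components, by depth-first search."""
    adj = {v: set() for v in vertices}
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    seen, count = set(), 0
    for s in vertices:
        if s not in seen:
            count += 1
            seen.add(s)
            stack = [s]
            while stack:
                x = stack.pop()
                for y in adj[x] - seen:
                    seen.add(y)
                    stack.append(y)
    return count

def cut_edges(vertices, edges):
    """Edges whose deletion increases the number of components."""
    base = connected_components(vertices, edges)
    return [e for e in edges
            if connected_components(vertices, [f for f in edges if f != e]) > base]

V = [1, 2, 3, 4]
E = [(1, 2), (1, 3), (1, 4), (3, 4)]
print(cut_edges(V, E))
```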

¹ Yes. Take G = ({1, 2, 3, 4}, {{1, 2}, {1, 3}, {1, 4}, {3, 4}}) and v = 1.

Proposition 7.4.11. Let T be a tree on n vertices. Then, T has n − 1 edges.

Proof. We proceed by induction on n. The result holds for n = 1. Take a tree T on n ≥ 2
vertices and delete an edge e. Then, by Propositions 7.4.9 and 7.4.10, we get two subtrees T1 , T2
of orders n1 , n2 , respectively, where n1 + n2 = n. So, E(T ) = E(T1 ) ∪ E(T2 ) ∪ {e}. By induction
hypothesis ‖T ‖ = ‖T1 ‖ + ‖T2 ‖ + 1 = n1 − 1 + n2 − 1 + 1 = n1 + n2 − 1 = n − 1.

Proposition 7.4.12. Let G be a connected graph with n vertices and n − 1 edges. Then, G is
acyclic.

Proof. On the contrary, assume that G has a cycle, say Γ. Now, select an edge e ∈ Γ and note
that G − e is connected. We go on selecting edges from G that lie on cycles and keep removing
them, until we get an acyclic graph H. Since the edges that are being removed lie on some
cycle, the graph H is still connected. So, by definition, H is a tree on n vertices. Thus, by
Proposition 7.4.11, |E(H)| = n − 1. But, in the above argument, we have deleted at least one
edge and hence, |E(G)| ≥ n. This gives a contradiction to |E(G)| = n − 1.

Proposition 7.4.13. Let G be an acyclic graph with n vertices and n − 1 edges. Then, G is
connected.

Proof. Let if possible, G be disconnected with components G1 , . . . , Gk , k ≥ 2. As G is acyclic, by
definition, each Gi is a tree on, say ni ≥ 1 vertices, with $\sum_{i=1}^{k} n_i = n$. Thus, $\|G\| = \sum_{i=1}^{k} (n_i - 1) = n - k < n - 1 = \|G\|$, as k ≥ 2. A contradiction.

Theorem 7.4.14. Let G be a graph with V (G) = {1, 2, . . . , n}. Then, the following are equivalent.

(a) G is a tree.

(b) G is a minimal connected graph on n vertices.

(c) G is a maximal acyclic graph on n vertices.

Proof. (a)⇒(b). Suppose that G is a tree. If it is not a minimal connected graph on n vertices,
then there is an edge [u, v] such that G − [u, v] is connected. But then, by Proposition 7.4.10, [u, v]
is on a cycle in G. A contradiction.
(b)⇒(c). Suppose G is a minimal connected graph on n vertices. If G has a cycle, say Γ, then
select an edge e ∈ Γ. Thus, by Proposition 7.4.10, G − e is still a connected graph on n vertices, a
contradiction to the fact that G is a minimal connected graph on n vertices. Hence, G is acyclic.
Since G is connected, for any new edge e, the graph G + e contains a cycle and hence, G is a
maximal acyclic graph.
(c)⇒(a). Suppose G is a maximal acyclic graph on n vertices. If G is not connected, let G1 and
G2 be two components of G. Select v1 ∈ G1 and v2 ∈ G2 and note that G + [v1 , v2 ] is an acyclic
graph on n vertices. This contradicts that G is a maximal acyclic graph on n vertices. Thus, G
is connected and acyclic and hence is a tree.

Theorem 7.4.15. The following are equivalent for a graph G of order n.

(a) G is a tree.
(b) G is minimal connected.
(c) G is maximal acyclic.
(d) G is acyclic with ‖G‖ = n − 1.
(e) G is connected with ‖G‖ = n − 1.

Proof. Left as an exercise.

Proposition 7.4.16. The center of a tree always consists of a set of at most two vertices.

Proof. Let T be a tree of radius k. Since the center contains at least one vertex, let u be a vertex
in the center of T . Now, let v be another vertex in the center. We claim that u is adjacent to v.
Suppose u ≁ v. Then, there exists a path from u to v, denoted P (u, v), with at least one
internal vertex, say w. Let x be any pendant (d(x) = 1) vertex of T . Then, either v ∈ P (x, w)
or v ∉ P (x, w). In the latter case, check that ‖P (x, w)‖ < ‖P (x, v)‖ ≤ k.
If v ∈ P (x, w), then u ∉ P (x, w) and ‖P (x, w)‖ < ‖P (x, u)‖ ≤ k. That is, the distance from w
to any pendant vertex is less than k. Hence, k is not the radius, a contradiction. Thus, uv ∈ E(T ).
We cannot have another vertex in the center, or else, we will have a C3 in T , a contradiction.
T

Exercise 7.4.17. 1. Show that a graph G is a tree if and only if between each pair of vertices
of G there is a unique path.

2. Draw a tree on 8 vertices. Label V (T ) as 1, . . . , 8 so that each vertex i ≥ 2 is adjacent to
exactly one element of {1, 2, . . . , i − 1}.

Proposition 7.4.18. Let T be a tree. Then, any graph G with δ(G) ≥ |T | − 1 has a subgraph
H ≅ T .

Proof. We prove the result by induction on n = |T |. The result is trivially true if n = 1 or 2.
So, let the result be true for every tree on n − 1 vertices and take a tree T on n vertices. Also,
suppose that G is any graph with δ(G) ≥ |T | − 1.
Let v ∈ V (T ) with d(v) = 1. Take u ∈ V (T ) such that uv ∈ E(T ). Now, consider the tree
T1 = T − v. Then, δ(G) ≥ |T | − 1 = n − 1 > n − 2. Hence, by induction hypothesis, G has a
subgraph H such that H ≅ T1 under a map, say φ. Let h ∈ V (H) such that φ(h) = u. Since
δ(G) ≥ n − 1 and H has only n − 2 vertices other than h, h has a neighbor, say h1 , such that
h1 is not a vertex in H but is a vertex in G. Now, map this vertex to v to get the required result.

Exercise 7.4.19. Let G be a graph on n > 2 vertices. If ‖G‖ > C(n − 1, 2), is G necessarily
connected? Give an ‘if and only if ’ condition for the connectedness of a graph with exactly
C(n − 1, 2) edges.

Ans: Think of components.

Proposition 7.4.20. A tree on n ≥ 2 vertices has at least two pendant vertices.

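Proof. Let T be any tree on n vertices. Then, $\sum_{v \in V(T)} d(v) = 2|E(T)| = 2(n - 1) = 2n - 2$.
As T is connected and n ≥ 2, each d(v) ≥ 1. If at most one vertex had degree 1, then the sum
would be at least 2(n − 1) + 1. Hence, by PHP, T has at least two vertices of degree 1.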

Definition 7.4.21. Let T be a tree on n > 2 vertices and labeled by n integers, say {1, 2, . . . , n}.
The Prüfer code PT of T is a sequence X of size n − 2 created in the following way.
1. Find the largest pendant vertex, say v1 . Let u1 be the neighbor of v1 . Put X(1) = u1 .
2. Let T1 = T − v1 and find X(2).
3. Repeat the procedure to obtain X(3), . . . , X(n − 2).

Example 7.4.22. Consider the tree T in Figure 7.12.

Figure 7.12: A tree T on 6 vertices (with edges {1, 6}, {6, 2}, {2, 3}, {2, 4} and {2, 5})

Then, the above process proceeds as follows.

Step   Pendant vi   Neighbor ui   PT = X(1), X(2), . . .   Vertices of Ti
1      5            2             2                        1, 2, 3, 4, 6
2      4            2             2, 2                     1, 2, 3, 6
3      3            2             2, 2, 2                  1, 2, 6
4      2            6             2, 2, 2, 6               1, 6

Figure 7.13: A tree T on 6 vertices
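
The procedure of Definition 7.4.21 translates into a few lines of code. The sketch below follows this section's convention of deleting the largest pendant vertex at each step (many texts delete the smallest instead); `prufer_code` is an illustrative name and the input is assumed to be a tree on {1, . . . , n} with n > 2.

```python
def prufer_code(n, edges):
    """Prüfer code of a tree on {1, ..., n}, deleting the LARGEST
    pendant vertex at each step, as in Definition 7.4.21."""
    adj = {v: set() for v in range(1, n + 1)}
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    code = []
    for _ in range(n - 2):
        leaf = max(v for v in adj if len(adj[v]) == 1)   # largest pendant vertex
        (neighbor,) = adj.pop(leaf)                      # its unique neighbor
        adj[neighbor].discard(leaf)
        code.append(neighbor)
    return code

tree_edges = [(1, 6), (6, 2), (2, 3), (2, 4), (2, 5)]
code = prufer_code(6, tree_edges)
print(code)
```

On this tree each vertex v occurs d(v) − 1 times in the code, which is the statement of Exercise 7.4.26.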

Exercise 7.4.23. In the above process, prove that uj = i, for some j, if and only if d(i) ≥ 2.

Ans: If uj = i then i is not a pendant vertex and hence d(i) ≥ 2. Conversely, let d(i) ≥ 2.
As the process of obtaining X(j)’s ends with K2 , an edge, the degree of the vertex i will reduce at
some stage for the first time, say at the k-th stage. Then, uk = i.

Example 7.4.24. Can I get back the original tree T from the sequence 2, 2, 2, 6? Ans: Yes.
The process of getting back the original tree is as follows.
1. Plot points 1, 2, . . . , 6.
2. Since ui is either 2 or 6, it implies that 2 and 6 are not the pendant vertices. Hence, the
pendant vertices in T must be {1, 3, 4, 5}. Thus, the algorithm implies that the largest
pendant 5 must be adjacent to (the first element of the sequence) 2.
3. At step 1, the vertex 5 was deleted. Hence, V (T1 ) = {1, 2, 3, 4, 6} with the given sequence
2, 2, 6. So, the pendants in T1 are {1, 3, 4} and the vertex 4 (largest pendant) is adjacent
to 2.
4. Now, V (T2 ) = {1, 2, 3, 6} with the sequence as 2, 6. So, 3 is adjacent to 2.
5. Now, V (T3 ) = {1, 2, 6} with the sequence as 6. So, the pendants in the current T are
{1, 2} and 2 is adjacent to 6.
6. Lastly, V (T4 ) = {1, 6}. As the process ends with K2 and we have only two vertices left,
they must be adjacent.
The corresponding figures show the points 1, 2, . . . , 6 after each of the six steps, with the edges
{5, 2}, {4, 2}, {3, 2}, {2, 6} and {1, 6} added in turn.
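
The reconstruction in Example 7.4.24 can be sketched in the same way. The function name `tree_from_prufer` is hypothetical. At each step the current pendants are the remaining vertices that do not occur in the unread part of the sequence, and the largest of them is joined to the next entry.

```python
def tree_from_prufer(vertices, code):
    """Edge list of the tree on `vertices` whose Prüfer code is `code`
    (largest-pendant convention of Definition 7.4.21)."""
    remaining = set(vertices)
    code = list(code)
    edges = []
    for i, u in enumerate(code):
        unread = code[i:]  # entries of the sequence not yet used
        # Pendants of the current tree: remaining vertices absent from `unread`.
        v = max(x for x in remaining if x not in unread)
        edges.append((v, u))
        remaining.remove(v)
    # The process ends with K2, so the last two vertices are adjacent.
    a, b = sorted(remaining)
    edges.append((a, b))
    return edges


print(tree_from_prufer(range(1, 7), [2, 2, 2, 6]))
# [(5, 2), (4, 2), (3, 2), (2, 6), (1, 6)]
```

The edges appear in the order in which steps 2 to 6 of the example add them.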

Proposition 7.4.25. Let T be a tree on the vertex set {1, 2, . . . , n}. Then, d(v) ≥ 2 if and only
if v appears in the Prüfer code PT . Thus, {v : v ∉ PT } are precisely the pendant vertices in T .

Proof. Let d(v) ≥ 2. Since the process ends with an edge, there is a stage, say i, where d(v)
decreases strictly. Thus, till the (i − 1)-th stage, v was adjacent to a pendant vertex w and at
the i-th stage w was deleted and thus, v appears in the sequence.
Conversely, let v appear in the sequence at k-th stage for the first time. Then, the tree Tk had
a pendant vertex w of highest label that was adjacent to v. Note that Tk − w is a tree with at
least two vertices. Thus, d(v) ≥ dTk (v) ≥ 2.

Exercise 7.4.26. Prove that in the Prüfer code of T a vertex v appears exactly d(v) − 1 times.
[Hint: Use induction and if v is the largest pendant adjacent to w and T′ = T − v then PT =
w, PT′ .]

Proposition 7.4.27. Let T and T′ be two trees on the same vertex set of integers. If PT = PT′ ,
then T = T′.

Proof. The statement is trivially true for |T | = 3. Assume that the statement holds for all trees
with 3 ≤ |T | < n. Now, let T and T′ be two trees with vertex set {1, 2, . . . , n} and PT = PT′ .
As PT = PT′ , by Proposition 7.4.25, T and T′ have the same set of pendants. Further, the largest
labeled pendant w is adjacent to the vertex X(1) in both the trees. Thus, P_{T−w} = P_{T′−w} and
hence, by induction hypothesis, T − w = T′ − w. Thus, by PMI, T = T′.

Proposition 7.4.28. Let S be a set of n ≥ 3 integers and X be a sequence of length n − 2 of
elements from S. Then, there is a tree T with V (T ) = S and PT = X.

Proof. Verify the statement for |S| = 3. Now, let the statement hold for all sets of n ≥ 3
integers and consider a set S of n + 1 integers and a sequence X of length n − 1 of elements
of S.
Let v = max{x ∈ S : x ∉ X}, S′ = S − {v} and X′ = X(2), . . . , X(n − 1). By definition, note
that v ≠ X(i), for 2 ≤ i ≤ n − 1. Thus, X′ is a sequence of elements of S′ of length n − 2. As
|S′| = n, by induction hypothesis, there is a tree T′ with V (T′) = S′ and PT′ = X′.
Let T be the tree obtained by adding a new pendant v at the vertex X(1) of T′. In T′, the
vertices X(i), for i ≥ 2, were not available as pendants and now in T the vertex X(1) is also
not available as a pendant (here some X(i)’s may be the same). Let R′ = {x ∈ S′ : x ∉ X′}
be the pendants in T′. Then, the set of pendants in T is (R′ ∪ {v}) \ {X(1)} which equals
{x ∈ S : x ∉ X}. Thus, v is the pendant of T of maximum label. Hence, PT = X.

Theorem 7.4.29. [A. Cayley, 1889, Quart. J. Math] Let n ≥ 3. Then, there are n^{n−2}
different trees with vertex set {1, 2, . . . , n}.

Proof. Let F be the class of trees on the vertex set {1, 2, . . . , n} and let G be the class of
(n − 2)-sequences of {1, 2, . . . , n}. Note that the function f : F → G defined as f (T ) = PT , the
Prüfer code, is one-one by Proposition 7.4.27 and onto by Proposition 7.4.28. As |G| = n^{n−2},
the required result follows.
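
For small n the count can be checked by brute force. The sketch below (function names are hypothetical) decodes every (n − 2)-sequence with the largest-pendant rule of Definition 7.4.21 and counts the distinct edge sets obtained. It only illustrates the bijection and does not replace the proof.

```python
from itertools import product


def tree_from_prufer(n, code):
    """Edge set of the tree on {1, ..., n} with Prüfer code `code`
    (largest-pendant convention)."""
    remaining = set(range(1, n + 1))
    edges = set()
    for i, u in enumerate(code):
        # Largest remaining vertex not occurring in the unread part of the code.
        v = max(x for x in remaining if x not in code[i:])
        edges.add(frozenset((v, u)))
        remaining.remove(v)
    edges.add(frozenset(remaining))  # the final K2
    return frozenset(edges)


def count_trees(n):
    """Number of distinct trees obtained from all (n-2)-sequences over {1..n}."""
    codes = product(range(1, n + 1), repeat=n - 2)
    return len({tree_from_prufer(n, c) for c in codes})


for n in range(3, 7):
    print(n, count_trees(n), n ** (n - 2))
```

For each n from 3 to 6 the two printed counts coincide (3, 16, 125 and 1296).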
Exercise 7.4.30. 1. Find out all nonisomorphic trees of order 7 or less.

2. Show that every automorphism of a tree fixes a vertex or an edge.

Ans: Under an automorphism a vertex in the center must be mapped to a vertex in the center.
We know that a tree has at most two vertices in the center and in case there are two vertices
they are adjacent. The conclusion now follows easily.
3. Give a class of trees T with |Γ(T )| = 6.
Ans:

···

4. Let T be a tree, σ ∈ Γ(T ), u ∈ V (T ) such that σ^2(u) ≠ u. Can we have an edge [u, v] ∈ T
such that σ(u) = v?
Ans: No. Suppose that u ∼ σ(u). Then, [u, σ(u), σ^2(u), . . .] is a walk. Assume that σ^k(u) =
σ^{k+r}(u). But, as σ is an automorphism, we have u = σ^r(u). That is, [u, σ(u), . . . , σ^r(u) = u]
is a cycle of length r ≥ 3. A contradiction.
5. Let T be a tree with center {u} and radius r. Let v satisfy d(u, v) = r. Show that v is a
pendant.
Ans: Assume that v is not a pendant. Consider the unique path [u = v1 , . . . , vr+1 = v]. As v
is not a pendant, there exists w ∼ v, w ≠ vr . The path [u = v1 , . . . , vr+1 = v, w] has length
r + 1. Since u is the center, there is a u-w path of length at most r. These two paths are
distinct. Hence, we must have a cycle. A contradiction.

6. Let T be a tree with |T | > 2. Let T′ be obtained from T by deleting all the pendant vertices
of T . Show that the center of T is the same as the center of T′.

7. Let T be a tree with center {u} and σ ∈ Γ(T ). Show that σ(u) = u.

Ans: Let T′ be obtained from T by deleting all the pendant vertices of T . Show that
σ|T′ ∈ Γ(T′). Apply induction.

8. Is it possible to have a tree such that |Γ(T )| = 7?

Ans: No, as every group of order 7 is cyclic and it has no non-trivial proper subgroup.

9. Construct a tree T on vertices S = {1, 2, 3, 6, 7, 8, 9} for which PT = 6, 3, 7, 1, 2.

Ans: Stage I. S = {1, 2, 3, 6, 7, 8, 9}, P = {8, 9}, v1 = 9 and edge added v1 X(1) = 96.
Stage II. S = {1, 2, 3, 6, 7, 8}, X = 3, 7, 1, 2; P = {8, 6}, v2 = 8 and edge added v2 X(1) = 83.
Stage III. S = {1, 2, 3, 6, 7}, X = 7, 1, 2; P = {3, 6}, v3 = 6 and edge added v3 X(1) = 67.
Stage IV. S = {1, 2, 3, 7}, X = 1, 2; P = {3, 7}, v4 = 7 and edge added v4 X(1) = 71.
Stage V. S = {1, 2, 3}, X = 2; P = {1, 3}, v5 = 3 and edge added v5 X(1) = 32.
Stage VI. S = {1, 2} and edge added 12.

10. Practice with examples: get the Prüfer code from a tree; get the tree from a given code and
a vertex set.

11. How many trees of the following forms are there on the vertex set {1, 2, . . . , 100}?
[Figures: a star, a double star and a path]

Ans: 100 for the star, as the central vertex has 100 choices.
For the double star, choose the central edge in C(100, 2) ways. For the rest, select a nonempty proper
subset of the remaining 98 to be adjacent to the smaller one of these two. The remaining will
be adjacent to the larger one. So, the answer should be C(100, 2)(2^98 − 2).
For the path there are 100!/2 choices.

12. Show that any tree has at least ∆(T ) leaves (pendant edges).

Ans: Use induction on ∆(T ).

13. Let T be a tree and T1 , T2 , T3 be subtrees of T such that T1 ∩ T3 ≠ ∅, T2 ∩ T3 ≠ ∅ and
T1 ∩ T2 ∩ T3 = ∅. Show that T1 ∩ T2 = ∅.

Ans: Suppose u ∈ T1 ∩ T3 , v ∈ T2 ∩ T3 and T1 ∩ T2 ∩ T3 = ∅. The path Puv = [u =
v0 , v1 , . . . , vk = v] ⊆ T3 . Let i be the largest and j be the smallest such that vi ∈ T1 and
vj ∈ T2 . Obviously i < j as T1 ∩ T2 ∩ T3 = ∅. Thus, the edge e = v_i v_{i+1} ∉ T1 , T2 , that is,
T1 , T2 ⊆ T − e. Note that T − e = Tu ∪ Tv , where Tu and Tv are the components containing
u and v, respectively. It follows that T1 ⊆ Tu and T2 ⊆ Tv . Thus, T1 ∩ T2 ⊆ Tu ∩ Tv = ∅.

14. Let $\mathcal{T}$ be a set of subtrees of a tree T . Assume that the trees in $\mathcal{T}$ have nonempty pairwise
intersection. Show that their overall intersection is nonempty. Is this true, if we replace
T by a graph G?
Ans: Notice that $\mathcal{T}$ is a finite set, say T1 , T2 , . . . , Tn . Argue that Ti ∩ Tj is also a subtree of T .
By Exercise 7.4.30.13, Ti ∩ Tj ∩ Tk ≠ ∅ for all i < j < k. We proceed by induction. Suppose that
$T_{i_1} \cap \cdots \cap T_{i_k} \neq \emptyset$ for all $i_1 < \cdots < i_k$. Consider $T_{i_1} \cap \cdots \cap T_{i_k} \cap T_{i_{k+1}}$ and assume it to be empty.
Put $T' = T_{i_1} \cap \cdots \cap T_{i_{k-1}}$ and $T'' = T_{i_1} \cap \cdots \cap T_{i_{k-2}} \cap T_{i_k}$.
As $T' \cap T_{i_{k+1}} = T_{i_1} \cap \cdots \cap T_{i_{k-1}} \cap T_{i_{k+1}} \neq \emptyset$ and $T'' \cap T_{i_{k+1}} = T_{i_1} \cap \cdots \cap T_{i_{k-2}} \cap T_{i_k} \cap T_{i_{k+1}} \neq \emptyset$,
it follows by Exercise 7.4.30.13 that $T' \cap T'' = \emptyset$. But then $T_{i_1} \cap \cdots \cap T_{i_k} = \emptyset$, a contradiction
to our hypothesis.
15. Recall that a connected graph G is said to be unicyclic if G has exactly one cycle as its
subgraph. Prove that if G is connected and |G| = ‖G‖, then G is a unicyclic graph.
Ans: Since |G| = ‖G‖, by Theorem 7.4.15, G is not a tree. So, G has a cycle, say C. Let
e be an edge in C. Then, C − e is connected and hence the graph G − e itself is connected.
As G − e is connected and |G − e| = ‖G − e‖ + 1, by Theorem 7.4.15, G − e is a tree. Thus,
G has a unique cycle.

7.5 Connectivity
Proposition 7.5.1. Let G be a connected graph on vertex set {1, 2, . . . , n}. Then, its vertices
can be labeled in such a way that the induced subgraph on the set {1, 2, . . . , i} is connected for
1 ≤ i ≤ n.

Proof. If n = 1, there is nothing to prove. Assume that the statement is true if n < k and let
G be a connected graph on the vertex set {1, 2, . . . , k}. If G is a tree, pick any pendant vertex
and label it k. If G has a cycle, take a spanning tree of G, pick any pendant vertex of this
spanning tree and label it k. In both cases, G − k is connected. Now, use the induction
hypothesis to get the required result.
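
A concrete way to produce such a labeling, different from the inductive argument above, is to number the vertices in breadth-first order: every vertex after the first is adjacent to an earlier one, so each initial segment induces a connected subgraph. The sketch below assumes an adjacency-dictionary representation. The function names and the example graph are illustrative only.

```python
from collections import deque


def connected_prefix_order(adj, start):
    """Vertices of a connected graph in BFS order from `start`.
    Every vertex after the first has a neighbor earlier in the order."""
    order, seen, queue = [], {start}, deque([start])
    while queue:
        u = queue.popleft()
        order.append(u)
        for w in sorted(adj[u]):
            if w not in seen:
                seen.add(w)
                queue.append(w)
    return order


def induces_connected(adj, vertices):
    """True if the subgraph induced on the nonempty set `vertices` is connected."""
    vertices = set(vertices)
    start = next(iter(vertices))
    seen, stack = {start}, [start]
    while stack:
        for w in adj[stack.pop()]:
            if w in vertices and w not in seen:
                seen.add(w)
                stack.append(w)
    return seen == vertices


# Example graph (hypothetical): a triangle a-b-c with a path c-d-e attached.
adj = {"a": {"b", "c"}, "b": {"a", "c"}, "c": {"a", "b", "d"},
       "d": {"c", "e"}, "e": {"d"}}
order = connected_prefix_order(adj, "e")
print(order)  # ['e', 'd', 'c', 'a', 'b']
print(all(induces_connected(adj, order[:i]) for i in range(1, len(order) + 1)))  # True
```

Relabeling the i-th vertex of `order` as i gives a labeling of the kind asserted in Proposition 7.5.1.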

Definition 7.5.2. Let G be a graph. Then, a set X ⊆ V (G) ∪ E(G) is called a separating set
if G − X has more connected components than G.

Let X be a separating set of G. Then, there exist u, v ∈ V (G) that lie in the same component
of G but lie in different components of G − X. If {u} ⊆ V (G) is a separating set of G, then u
is a cut vertex. If {e} ⊆ E(G) is a separating set of G, then e is a bridge/cut edge.
Example 7.5.3. 1. In a tree, each edge is a bridge and each non-pendant vertex is a cut
vertex. Is it true for a forest?
2. The graph K7 does not have a separating set of vertices. In K7 , a separating set of edges
must contain at least 6 edges.

Definition 7.5.4. A graph G is said to be k-connected if |G| > k and G is connected even
after deletion of any k − 1 vertices. The vertex connectivity, denoted by κ(G), of a non trivial
graph G is the largest k such that G is k-connected. Convention: κ(K1 ) = 0.
Example 7.5.5. 1. Each connected graph of order more than one is 1-connected.
2. A 2-connected graph is also a 1-connected graph.

3. For a disconnected graph, κ(G) = 0 and for n > 1, κ(Kn ) = n − 1.

4. The graph G in Figure 7.14 is 2-connected but not 3-connected. Thus, κ(G) = 2.

Figure 7.14: graph with vertex connectivity 2

5. The Petersen graph is 3-connected.

Definition 7.5.6. A graph G is called l-edge connected if |G| > 1 and G − F is connected
for every F ⊆ E(G) with |F | < l. The greatest integer l such that G is l-edge connected is the
edge connectivity of G, denoted λ(G). Convention: λ(K1 ) = 0.
Example 7.5.7. 1. Note that λ(Pn ) = 1, λ(Cn ) = 2 and λ(Kn ) = n − 1, whenever n > 1.
2. Let T be a tree on n ≥ 2 vertices. Then, λ(T ) = 1.
3. For the graph G in Figure 7.14, λ(G) = 3.
4. For the Petersen graph G, λ(G) = 3.

Exercise 7.5.8. Let |G| > 1. Show that κ(G) = |G| − 1 if and only if G = Kn . Can we say
the same for λ(G)?

Ans: If deletion of any |G| − 2 vertices keeps the graph connected, it means, the remaining two
vertices are adjacent. So, G is a complete graph on |G| vertices, i.e., G = K|G| . Conversely, if
G = Kn , then κ(G) = n − 1, by definition.
Suppose that λ(G) = |G| − 1. This implies that δ(G) = |G| − 1. Hence, G = K|G| is complete.
Conversely, λ(Kn ) = n − 1.

Theorem 7.5.9. [H. Whitney, 1932] For any graph G, κ(G) ≤ λ(G) ≤ δ(G).

Proof. If G is disconnected or |G| = 1, then we have nothing to prove. So, let G be a connected
graph with |G| ≥ 2. Then, there is a vertex v with d(v) = δ(G). If we delete all edges incident
on v, then the graph is disconnected. Thus, δ(G) ≥ λ(G).
Suppose that λ(G) = 1 and G − uv is disconnected with components Cu and Cv . If |Cu | =
|Cv | = 1, then G = K2 and κ(G) = 1. If |Cu | > 1, then we delete u to see that κ(G) = 1.
If λ(G) = k ≥ 2, then there is a set of edges, say e1 , . . . , ek , whose removal disconnects G.
Notice that G − {e1 , . . . , ek−1 } is a connected graph with a bridge, say ek = uv. For each of
e1 , . . . , ek−1 select an end vertex other than u or v. Deletion of these vertices from G results
in a graph H with uv as a bridge of a connected component. Note that κ(H) ≤ 1. Hence,
κ(G) ≤ λ(G).
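
The three parameters in Theorem 7.5.9 can be computed by brute force on a small simple graph. The sketch below is exponential and meant for illustration only. The function names and the example (two triangles sharing the vertex 3) are assumptions and do not come from the notes.

```python
from itertools import combinations


def is_connected(vertices, edges):
    """True if the graph on the nonempty set `vertices` with `edges` is connected."""
    vertices = set(vertices)
    adj = {v: set() for v in vertices}
    for a, b in edges:
        adj[a].add(b)
        adj[b].add(a)
    start = next(iter(vertices))
    seen, stack = {start}, [start]
    while stack:
        for w in adj[stack.pop()]:
            if w not in seen:
                seen.add(w)
                stack.append(w)
    return seen == vertices


def kappa(vertices, edges):
    """Least number of vertices whose deletion leaves a disconnected
    graph or a single vertex (vertex connectivity)."""
    vertices = set(vertices)
    for k in range(len(vertices)):
        for cut in combinations(sorted(vertices), k):
            rest = vertices - set(cut)
            kept = [(a, b) for a, b in edges if a in rest and b in rest]
            if len(rest) == 1 or not is_connected(rest, kept):
                return k


def lam(vertices, edges):
    """Least number of edges whose deletion disconnects the graph
    (0 for a single vertex, by convention)."""
    if len(set(vertices)) == 1:
        return 0
    for k in range(len(edges) + 1):
        for cut in combinations(range(len(edges)), k):
            kept = [e for i, e in enumerate(edges) if i not in cut]
            if not is_connected(vertices, kept):
                return k


def delta(vertices, edges):
    """Minimum degree of a simple graph."""
    return min(sum(v in e for e in edges) for v in vertices)


# Two triangles sharing the vertex 3.
V = [1, 2, 3, 4, 5]
E = [(1, 2), (2, 3), (1, 3), (3, 4), (4, 5), (3, 5)]
print(kappa(V, E), lam(V, E), delta(V, E))  # 1 2 2
```

Here κ < λ = δ, so the first inequality of the theorem can be strict.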

Exercise 7.5.10. Give a lower bound on the number of edges of a graph G on n vertices with
vertex connectivity κ(G) = k.

Ans: As δ(G) ≥ κ(G) = k by Theorem 7.5.9, we have ‖G‖ ≥ |G|k/2. When does equality happen?


Theorem 7.5.11. [Chartrand and Harary, 1968] For all integers a, b, c such that 0 < a ≤ b ≤ c,
there exists a graph with κ(G) = a, λ(G) = b and δ(G) = c.

Proof. Omitted, as it is out of the scope of this book.

Theorem 7.5.12. [Mader, 1972] Every graph G of average degree at least 4k has a k-connected
subgraph.

Proof. For k = 1, the assertion is trivial. So, let k ≥ 2. Note that

$$n = |G| \ge \Delta(G) \ge 4k \ge 2k - 1 \quad \text{and} \tag{7.1}$$

$$m = \|G\| \ge \tfrac{1}{2}(\text{average degree} \times n) \ge 2kn \ge (2k-3)(n-k+1) + 1. \tag{7.2}$$

We shall use induction to show that if G satisfies Equations (7.1) and (7.2), then G has a k-
connected subgraph. If n = 2k − 1, then $m \ge (2k-3)(n-k+1) + 1 = (n-2)\frac{n+1}{2} + 1 = \frac{n(n-1)}{2}$. So,
G is a graph on n vertices with at least $\frac{n(n-1)}{2}$ edges and hence G = Kn . Thus, K_{k+1} ⊆ Kn = G.
Assume n ≥ 2k. If v is a vertex with d(v) ≤ 2k − 3, then we apply induction hypothesis to
G − v to get the result. So, let d(v) ≥ 2k − 2, for each vertex v. If G is k-connected then,
we have nothing to prove. Assume, if possible that G is not k-connected. Then, G = G1 ∪ G2
with |G1 ∩ G2 | < k and |G1 |, |G2 | < n. Thus, both G1 − V (G2 ) and G2 − V (G1 ) have at least
one vertex and there is no edge between them as G is not k-connected. As the degree of these
vertices is at least 2k − 2, we have |G1 |, |G2 | ≥ 2k − 1. Further,

|G1 | + |G2 | = |G1 ∪ G2 | + |G1 ∩ G2 | ≤ n + (k − 1) = n + k − 1. (7.3)

If G1 or G2 satisfies Equation (7.2), using induction hypothesis, the result follows. Otherwise,
‖Gi‖ ≤ (2k − 3)(|Gi | − k + 1), for i = 1, 2 and hence, using Equation (7.3)

m = ‖G‖ ≤ ‖G1‖ + ‖G2‖ ≤ (2k − 3)(|G1 | + |G2 | − 2k + 2) ≤ (2k − 3)(n − k + 1),

a contradiction to Equation (7.2) and hence the required result follows.

Theorem 7.5.13. [Menger] A graph is k-edge-connected if and only if there are k edge disjoint
paths between each pair of vertices. A graph is k-connected if and only if there are k internally
vertex disjoint paths between each pair of vertices.

Proof. Omitted.

7.6 Eulerian Graphs

Definition 7.6.1. Let G be a graph. Then, G is said to have an Eulerian tour if there is a
closed walk, say [v0 , v1 , . . . , vk , v0 ], such that each edge of the graph appears exactly once in the
walk. The graph G is said to be Eulerian if it has an Eulerian tour.

Note that by definition, a disconnected graph is not Eulerian. In this section, the graphs can
have loops and multiple edges. The graphs that have a closed walk traversing each edge exactly
once have been named “Eulerian graphs” due to the solution of the famous Königsberg bridge
problem by Euler in 1736. The problem is as follows: The city Königsberg (the present day
Kaliningrad) is divided into 4 land masses by the river Pregel. These land masses are joined by
7 bridges (see Figure 7.15). The question was: “is there a way to start from a land mass, pass
through each of the seven bridges in Figure 7.15 exactly once and return to the starting land
mass”? Euler rephrased the problem along the following lines: Let the four land
masses be denoted by the vertices A, B, C and D of a graph and let the 7 bridges correspond to
7 edges of the graph. Then, he asked “does this graph have a closed walk that traverses each
edge exactly once”? He gave a necessary and sufficient condition for a graph to have such a
closed walk and thus gave a negative answer to the Königsberg bridge problem.
One can also relate the above problem to the problem of “starting from a certain point, draw a
given figure with pencil such that neither the pencil is lifted from the paper nor a line is repeated
such that the drawing ends at the initial point”.

Figure 7.15: Königsberg bridge problem

Theorem 7.6.2. [Euler, 1736] A connected graph G is Eulerian if and only if d(v) is even, for
each v ∈ V (G).

Proof. Let G have an Eulerian tour, say W = [v0 , v1 , . . . , vk , v0 ]. Observe that whenever we
arrive at a vertex v using an edge, say e, in W then we leave that vertex using an edge, say e′ in
W with e ≠ e′. As each edge appears exactly once in W and each edge is traversed, d(v) = 2r,
if v ≠ v0 and v appears r times in the tour. Also, d(v0 ) = 2(r − 1), if v0 appears r times in the
tour. Hence, d(v) is always even.
Conversely, let G be a connected graph with each vertex having even degree. Let W =
v0 v1 · · · vk be a longest walk in G without repeating any edge in it. As vk has an even degree
it follows that vk = v0 , otherwise W can be extended. If W is not an Eulerian tour then, as G is
connected, there exists an edge outside W , say e′ = vi w, with w ≠ v_{i−1} , v_{i+1} . In this case,
W′ = w vi · · · vk (= v0 ) v1 · · · v_{i−1} vi
is a longer walk compared to W , a contradiction. Thus, there is no edge lying outside W and
hence W is an Eulerian tour.
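
The criterion is easy to test on the Königsberg multigraph. In the sketch below the assignment of the labels A, B, C, D to the land masses is an assumption (A is taken to be the island joined to B and C by two bridges each), because the figure is not reproduced here. The function names are illustrative.

```python
from collections import Counter


def odd_vertices(edges):
    """Sorted list of the odd-degree vertices of a multigraph given by its edges."""
    deg = Counter()
    for a, b in edges:
        deg[a] += 1
        deg[b] += 1  # a loop (a == b) contributes 2 to the degree
    return sorted(v for v, d in deg.items() if d % 2 == 1)


def is_connected(edges):
    """True if the multigraph spanned by the nonempty list `edges` is connected."""
    verts = {v for e in edges for v in e}
    start = next(iter(verts))
    seen, stack = {start}, [start]
    while stack:
        x = stack.pop()
        for a, b in edges:
            if x in (a, b):
                y = b if a == x else a
                if y not in seen:
                    seen.add(y)
                    stack.append(y)
    return seen == verts


def is_eulerian(edges):
    """Criterion of Theorem 7.6.2 for a graph without isolated vertices."""
    return is_connected(edges) and not odd_vertices(edges)


# Assumed labeling of the seven bridges.
konigsberg = [("A", "B"), ("A", "B"), ("A", "C"), ("A", "C"),
              ("A", "D"), ("B", "D"), ("C", "D")]
print(odd_vertices(konigsberg))  # ['A', 'B', 'C', 'D']
print(is_eulerian(konigsberg))   # False
```

All four vertices have odd degree (5, 3, 3, 3), which gives the negative answer mentioned above.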

Proposition 7.6.3. Let G be a connected graph with exactly two vertices of odd degree. Then,
there is an Eulerian walk starting at one of those vertices and ending at the other.

Proof. Let x and y be the two vertices of odd degree and let v be a symbol such that v ∉ V (G).
Then, the graph H with V (H) = V (G) ∪ {v} and E(H) = E(G) ∪ {xv, yv} has each vertex of
even degree and hence by Theorem 7.6.2, H is Eulerian. Let Γ = [v, v1 = x, . . . , vk = y, v] be an
Eulerian tour. Then, Γ − v is an Eulerian walk with the required properties.

Exercise 7.6.4. Let G be a connected Eulerian graph and e be any edge. Show that G − e is
connected.

Ans: Think of an Eulerian tour.

How to find an Eulerian tour (algorithm)?

Start from a vertex v0 , move via an edge that has not been taken and go on deleting the edges as
they are traversed. Do not take an edge whose deletion creates a non trivial component not containing v0 .
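
The rule above can be sketched directly in Python. The representation (a list of edge pairs) and the names `fleury_tour` and `all_edges_reach_start` are assumptions made for illustration. An edge is accepted only if, after deleting it, every remaining edge still lies in the component of v0.

```python
def fleury_tour(edges, start):
    """Eulerian tour of a connected multigraph with all degrees even,
    found by the rule stated above. Edges are pairs (a, b)."""
    remaining = list(edges)  # edges not yet traversed (not yet deleted)
    tour, u = [start], start

    def all_edges_reach_start(rest):
        """True if no edge of `rest` lies in a component not containing `start`."""
        seen, stack = {start}, [start]
        while stack:
            x = stack.pop()
            for a, b in rest:
                if x in (a, b):
                    y = b if a == x else a
                    if y not in seen:
                        seen.add(y)
                        stack.append(y)
        return all(a in seen for a, _ in rest)

    while remaining:
        for i, (a, b) in enumerate(remaining):
            if u not in (a, b):
                continue  # this edge is not incident on the current vertex
            rest = remaining[:i] + remaining[i + 1:]
            if all_edges_reach_start(rest):
                u = b if a == u else a
                tour.append(u)
                remaining = rest
                break
        else:
            raise ValueError("stuck: the input is not an Eulerian graph")
    return tour


# Two triangles 1-2-3 and 3-4-5 sharing the vertex 3.
print(fleury_tour([(1, 2), (2, 3), (3, 1), (3, 4), (4, 5), (5, 3)], 1))
# [1, 2, 3, 4, 5, 3, 1]
```

On reaching the vertex 3 for the first time, the edge {3, 1} is rejected because deleting it would leave the triangle 3-4-5 in a component without the vertex 1.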

Exercise 7.6.5. Find Eulerian tours for the following graphs.

[Figures: two graphs, one on the vertices 1, . . . , 13 and one on the vertices 1, . . . , 16]

Theorem 7.6.6. [Finding Eulerian tour] The previous algorithm correctly gives an Eulerian
tour whenever the given graph is Eulerian.

Proof. Let the algorithm start at a vertex, say v0 . Now, assume that we are at u with H as the
current graph and C as the only non trivial component of H. Thus, dH (u) > 0. Assume that
the deletion of the edge e = uv creates a non trivial component not containing v0 . Let Cu and Cv
be the components of C − uv, containing u and v, respectively.
We first claim that u ≠ v0 . In fact, if u = v0 , then H must have all vertices of even degree
and dH (v0 ) ≥ 2. So, C is Eulerian. Hence, C − uv cannot be disconnected, a contradiction to
C − uv having two components Cu and Cv . Thus, u ≠ v0 . Moreover, note that the only vertices
of odd degree in C are u and v0 .
Now, we claim that Cu is a non trivial component. Suppose Cu is trivial. Then, v0 ∈ Cv , a
contradiction to the assumption that the deletion of the edge uv creates a non trivial component
not containing v0 . So, Cu is non trivial.
Finally, we claim that v0 ∈ Cv . If possible, let v0 ∈ Cu . Then, the only vertices in C − uv of
odd degree are v ∈ Cv and v0 ∈ Cu . Hence, C − uv + v0 v is a connected graph with each vertex
of even degree. So, by Theorem 7.6.2, the graph C − uv + v0 v is Eulerian. But, this cannot be
true as vv0 is a bridge. Thus, v0 ∈ Cv .
Hence, Cu is the newly created non trivial component not containing v0 . Also, each vertex of
Cu has even degree and hence by Theorem 7.6.2, Cu is Eulerian. This means, we can take an
edge e′ incident on u and complete an Eulerian tour in Cu . So, at u if we take the edge e′ in
place of the edge e, then we will not create a non trivial component not containing v0 .
Thus, at each stage of the algorithm either u = v0 or there is a path from u to v0 . Moreover,
this is the only non trivial connected component. When the algorithm ends, we must have
u = v0 . Because, as seen above, the condition u ≠ v0 gives the existence of an edge that is
incident on u and that can be traversed (as dH (u) is odd). Hence, if u ≠ v0 , the algorithm
cannot stop. Thus, when the algorithm stops, u = v0 and all components are trivial.

Exercise 7.6.7. Apply the algorithm to the graphs of Exercise 7.6.5. Also, create connected graphs
in which each vertex has even degree and apply the above algorithm.
Exercise 7.6.8. 1. Give a necessary and sufficient condition on m and n so that Km,n is
Eulerian.
2. Each of the 8 persons in a room has to shake hands with every other person as per the
following rules:
(a) The handshakes should take place sequentially.
(b) Each handshake (except the first) should involve someone from the previous hand-
shake.
(c) No person should be involved in 3 consecutive handshakes.

Is there a way to sequence the handshakes so that these conditions are all met?
Ans: Suppose that ab is the first handshake and bc is the second. Thus, the third handshake
will be between c and someone other than b and so on. Thus, the sequence of handshakes
represents an Eulerian walk in K8 . This is possible if and only if the graph is connected with at most
2 vertices of odd degree. But K8 has 8 vertices of odd degree. Thus, such a sequence is not
possible.

3. Let G be a connected graph. Then, G is an Eulerian graph if and only if the edge set of G
can be partitioned into cycles.

7.7 Hamiltonian Graphs

Definition 7.7.1. A cycle in G is said to be Hamiltonian if it contains all vertices of G. If G
has a Hamiltonian cycle, then G is called a Hamiltonian graph. Finding a nice characterization
of a Hamiltonian graph is an unsolved problem.
Example 7.7.2. 1. For each positive integer n ≥ 3, the cycle Cn is Hamiltonian.

Figure 7.16: A Hamiltonian graph (the dodecahedron graph) and a non-Hamiltonian graph (the Petersen graph)

2. The graphs corresponding to all platonic solids are Hamiltonian.

3. The Petersen graph is a non-Hamiltonian Graph (the proof appears below).
Proposition 7.7.3. The Petersen graph is not Hamiltonian.
Proof. Suppose that the Petersen graph, say G, is Hamiltonian. So, G contains C10 =
[1, 2, 3, . . . , 10, 1] as a subgraph. As each vertex of G has degree 3, G = C10 + M , where M
is a set of 5 chords in which each vertex appears as an endpoint. Now, consider the vertices 1,
2 and 3.

[Figure: the cycle C10 = [1, 2, . . . , 10, 1]]

Since g(G) = 5, the vertex 1 can be adjacent to only one of the vertices 5, 6 or 7. Hence, if 1
is adjacent to 5, then the possible third vertex that is adjacent to 10 will create cycles of length
3 or 4. Similarly, if 1 is adjacent to 7, then there is no choice for the possible third vertex that
can be adjacent to 2. So, let 1 be adjacent to 6. Then, 2 must be adjacent to 8. In this case,
note that there is no choice for the third vertex that can be adjacent to 3. Thus, the Petersen
graph is non-Hamiltonian.
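
The conclusion can also be confirmed by an exhaustive search, which is feasible because the graph has only 10 vertices. The sketch below is a plain backtracking search and not the argument of the proof. The vertex numbering 0, . . . , 9 (outer cycle 0 to 4, inner pentagram 5 to 9) is an assumption and differs from the labels used in the proof.

```python
def has_hamiltonian_cycle(adj):
    """Backtracking search for a Hamiltonian cycle in a simple graph
    given as a dictionary vertex -> set of neighbors."""
    verts = list(adj)
    n, start = len(verts), verts[0]

    def extend(path, used):
        if len(path) == n:
            # All vertices used: the path closes up iff its end sees `start`.
            return start in adj[path[-1]]
        return any(extend(path + [w], used | {w})
                   for w in adj[path[-1]] if w not in used)

    return n >= 3 and extend([start], {start})


# Petersen graph: outer 5-cycle, five spokes, inner pentagram.
petersen = {i: set() for i in range(10)}
for i in range(5):
    for a, b in ((i, (i + 1) % 5), (i, i + 5), (i + 5, (i + 2) % 5 + 5)):
        petersen[a].add(b)
        petersen[b].add(a)

cycle5 = {i: {(i - 1) % 5, (i + 1) % 5} for i in range(5)}

print(has_hamiltonian_cycle(petersen))  # False
print(has_hamiltonian_cycle(cycle5))    # True
```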
Theorem 7.7.4. Let G be a Hamiltonian graph. Then, for S ⊆ V (G) with S ≠ ∅, the graph
G − S has at most |S| components.
Proof. Note that by removing k vertices from a cycle, one can create at most k connected
components. Hence, the required result follows.
Theorem 7.7.5. [Dirac, 1952] Let G be a graph with |G| = n ≥ 3 and d(v) ≥ n/2, for each
v ∈ V (G). Then, G is Hamiltonian.
Proof. If possible, let G be disconnected. Then, G has a component, say H, with |V (H)| = k ≤
n/2. Hence, d(v) ≤ k − 1 < n/2, for each v ∈ V (H). A contradiction to d(v) ≥ n/2, for each
v ∈ V (G). Now, let P = [v1 , v2 , · · · , vk ] be a longest path in G. Since P is the longest path, all
neighbors of v1 and vk are in P and k ≤ n.
We claim that there exists an i such that v1 ∼ vi and v_{i−1} ∼ vk . Otherwise, for each vi ∼ v1 ,
we must have v_{i−1} ≁ vk . Then, |N (vk )| ≤ k − 1 − |N (v1 )|. Hence, |N (v1 )| + |N (vk )| ≤ k − 1 < n,
a contradiction to d(v) ≥ n/2, for each v ∈ V (G). So, the claim is valid and hence, we have a
cycle P̃ := v1 vi v_{i+1} · · · vk v_{i−1} · · · v1 of length k.
We now prove that P̃ gives a Hamiltonian cycle. Suppose not. Then, as G is connected, there
exists v ∈ V (G) such that v is outside P and v is adjacent to some vj . Now, use P̃ , v and vj to
create a path whose length is larger than the length of P . Hence, P cannot be the path of longest
length, a contradiction. Thus, the required result follows.

Theorem 7.7.6. [Ore, 1960] Let G be a graph on n ≥ 3 vertices such that d(u) + d(v) ≥ n, for
every pair of nonadjacent vertices u and v. Then, G is Hamiltonian.

Proof. Exercise.

Exercise 7.7.7. Let u and v be two vertices such that d(u) + d(v) ≥ |G|, whenever uv ∉ E(G).
Prove that G is Hamiltonian if and only if G + uv is Hamiltonian.

Ans: If G + uv has a Hamiltonian cycle not using uv, then G is Hamiltonian. Otherwise, apply
Ore’s theorem.

Definition 7.7.8. The closure of a graph G on n vertices, denoted C(G), is obtained by repeatedly
choosing pairs of nonadjacent vertices u, v such that d(u) + d(v) ≥ n and adding edges between them,
until no such pair remains.

Proposition 7.7.9. The closure of G is unique.

Proof. Let K be a closure obtained by adding edges e1 = u1 v1 , . . . , ek = uk vk sequentially and F
be a closure obtained by adding edges f1 = x1 y1 , . . . , fr = xr yr sequentially. Let ei be the first
edge in the e-sequence which does not appear in the f -sequence. Put H = G + e1 + · · · + e_{i−1} .
Then, ei = ui vi implies that ei ∉ E(H) and dH (ui ) + dH (vi ) ≥ n. Also, H is a subgraph of F and
hence, dF (ui ) + dF (vi ) ≥ n. Moreover, ei = ui vi ∉ F as ei does not appear in the f -sequence.
Thus, F cannot be a closure and therefore the required result follows.
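
Definition 7.7.8 translates into a short loop. The sketch below assumes a simple graph on the vertices 0, . . . , n − 1 given by an edge list. The function name `closure` and both examples are illustrative and not taken from the notes.

```python
from itertools import combinations


def closure(n, edges):
    """Edge set of the closure C(G) of a simple graph on vertices 0..n-1."""
    adj = {v: set() for v in range(n)}
    for a, b in edges:
        adj[a].add(b)
        adj[b].add(a)
    changed = True
    while changed:  # repeat until no admissible nonadjacent pair remains
        changed = False
        for u, v in combinations(range(n), 2):
            if v not in adj[u] and len(adj[u]) + len(adj[v]) >= n:
                adj[u].add(v)
                adj[v].add(u)
                changed = True
    return {frozenset((u, v)) for u in adj for v in adj[u]}


# K4 on {0, 1, 2, 3} together with a vertex 4 joined to 0 and 1.
g = [(0, 1), (0, 2), (0, 3), (1, 2), (1, 3), (2, 3), (4, 0), (4, 1)]
c5 = [(i, (i + 1) % 5) for i in range(5)]

print(len(closure(5, g)))   # 10
print(len(closure(5, c5)))  # 5
```

The first closure is K5 (10 edges). The cycle C5 is its own closure, since every degree sum equals 4 < 5.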

Exercise 7.7.10. Let G be a graph on n ≥ 3 vertices.

1. If G has a cut vertex, then prove that C(G) ≠ Kn .

2. Now prove a generalization of Dirac’s theorem: If the closure C(G) ≅ Kn , then G is
Hamiltonian.

Theorem 7.7.11. Let d1 ≤ · · · ≤ dn be the vertex degrees of G. Suppose that, for each k < n/2
with dk ≤ k, the condition dn−k ≥ n − k holds. Then, G is Hamiltonian.

Proof. We show that under the above condition H = C(G) ≅ Kn . On the contrary, assume that
there exists a pair of vertices u, v ∈ V (G) such that uv ∉ E(H) and dH (u) + dH (v) ≤ n − 1.
Among all such pairs, choose a pair u, v ∈ V (G) such that uv ∉ E(H) and dH (u) + dH (v) is
maximum. Assume that dH (v) ≥ dH (u) = k (say). As dH (u) + dH (v) ≤ n − 1, k < n/2.
Now, let Sv = {x ∈ V (H) | x ≠ v, xv ∉ E(H)} and Su = {w ∈ V (H) | w ≠ u, wu ∉ E(H)}.
Therefore, the assumption that dH (u) + dH (v) is the maximum among each pair of vertices u, v
with uv ∉ E(H) and dH (u) + dH (v) ≤ n − 1 implies that |Sv | = n − 1 − dH (v) ≥ dH (u) = k
and dH (x) ≤ dH (u) = k, for each x ∈ Sv . So, there are at least k vertices in H (elements of Sv )
with degrees at most k.
Also, for any w ∈ Su , note that the choice of the pair u, v implies that dH (w) ≤ dH (v) ≤
n − 1 − dH (u) = n − 1 − k < n − k. As dH (u) = k, |Su | = n − 1 − k. Further, the condition
dH (u) + dH (v) ≤ n − 1, dH (v) ≥ dH (u) = k and u ∉ Su implies that dH (u) ≤ n − 1 − dH (v) ≤
n − 1 − k < n − k. So, there are n − k vertices in H with degrees less than n − k.
Therefore, if d′1 ≤ · · · ≤ d′n are the vertex degrees of H, then we observe that there exists a
k < n/2 for which d′k ≤ k and d′n−k < n − k. As k < n/2 and di ≤ d′i , we get a contradiction to
the given hypothesis.
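
The hypothesis of Theorem 7.7.11 depends only on the degree sequence, so it is easy to test. The sketch below uses a hypothetical function name. Note that the condition is sufficient but not necessary: the cycle C5 is Hamiltonian and yet fails it.

```python
def satisfies_degree_condition(degrees):
    """Check the hypothesis of Theorem 7.7.11: for each k < n/2 with
    d_k <= k we need d_{n-k} >= n - k (degrees sorted, 1-indexed)."""
    d = sorted(degrees)
    n = len(d)
    for k in range(1, n):
        if 2 * k < n and d[k - 1] <= k and d[n - k - 1] < n - k:
            return False
    return True


print(satisfies_degree_condition([2, 3, 3, 4, 4]))  # True
print(satisfies_degree_condition([2, 2, 2, 2, 2]))  # False (the cycle C5)
```

The first sequence belongs, for instance, to K4 with an extra vertex joined to two of its vertices, which is Hamiltonian as the theorem predicts.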

Exercise 7.7.12. Complete an alternate proof of Theorem 7.7.11. Let R denote the property:
R : ‘If dk ≤ k then dn−k ≥ n − k, for each k < n/2’.
We know that G has this property.

1. Let e be an edge not in G. Show that G + e also has the property. What about the closure
H := C(G) of G?
Ans: Suppose that we have added an edge e. Let d1 ≤ d2 ≤ · · · ≤ dn be the old degree
sequence and d′1 ≤ d′2 ≤ · · · ≤ d′n be the new degree sequence.
Suppose d′k ≤ k, i.e., in G + e there are at least k vertices whose degrees are less than or
equal to k. So, in G there have to be at least k vertices with degrees ≤ k.
Thus, dk ≤ k and hence by assumption dn−k ≥ n − k. So, in G there are at least k + 1 =
n − (n − k) + 1 vertices with degrees ≥ n − k. Thus, in G + e there are at least k + 1 vertices
with degrees ≥ n − k. That is, d′n−k ≥ n − k. So, G + e has that property.
As C(G) is obtained by adding edges only, it also has the property.
2. Assume that max{d(u) + d(v) : u, v ∈ H are not adjacent} ≤ n − 2. Let e be an edge not
in H. Does H + e have property R? Is C(H + e) = H + e?
Ans: Since H has property R, by first part, H + e also has property R.
It is given that for any two nonadjacent vertices u, v ∈ V (H), d(u) + d(v) ≤ n − 2.
So in H + e, for any two nonadjacent vertices, we will have d(u) + d(v) ≤ n − 1. So
C(H + e) = H + e.
3. In view of the previous observations assume that G is an edge maximal graph with property
R which is not Hamiltonian. Do you have C(G) = G? Show that there are some k vertices
having degree at most k and some n − k vertices having degree less than n − k. Does that
contradict R?
Ans: Let G be an edge maximal graph with property R which
is not Hamiltonian. If C(G) ≠ G then, there are nonadjacent vertices u, v ∈ V (G) with
d(u) + d(v) ≥ n. But, in that case G + uv also has property R. We claim that G + uv is not
Hamiltonian.
If possible, let there be a Hamiltonian cycle in G + uv. It must contain the edge uv as G was
not Hamiltonian. Let the cycle be [v1 = v, v2 , . . . , vn−1 , vn = u, v]. Observe that the path
[v1 , . . . , vn ] belongs to G. In G, d(v1 ) + d(vn ) ≥ n. If for some vk , we have v1 ∼ vk and
vn ∼ vk−1 , then we have a Hamiltonian cycle in G, namely, [v1 , . . . , vk−1 , vn , vn−1 , . . . , vk , v1 ].
Thus, if v1 is adjacent to r vertices (locate them on that path), then vn will not be adjacent
to the vertices that are immediately on the left of them. So, d(vn ) ≤ n − 1 − r. So
d(v1 ) + d(vn ) ≤ n − 1, a contradiction.
So, G + uv is a non-Hamiltonian graph with property R, and hence G could not be an edge
maximal graph. Thus, we must have C(G) = G. Now proceed as in the proof of Theorem 7.7.11.

Definition 7.7.13. The line graph H of a graph G is a graph with V (H) = E(G) and
e1 , e2 ∈ V (H) are adjacent in H if e1 and e2 share a common vertex/endpoint.

Example 7.7.14. Verify the following:

1. Line graph of C5 is C5 .
2. Line graph of P5 is P4 .
3. Line graph of any graph G contains a complete subgraph of size ∆(G).
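
The three statements can be checked numerically on small cases. The sketch below builds the line graph of a simple graph from its edge list. The function name and the representation are assumptions, and comparing vertex and edge counts is only a sanity check, not a proof of isomorphism.

```python
from itertools import combinations


def line_graph(edges):
    """Line graph of a simple graph: its vertices are the edges of G and
    two of them are adjacent iff they share an endpoint."""
    verts = [frozenset(e) for e in edges]
    adj = [(e, f) for e, f in combinations(verts, 2) if e & f]
    return verts, adj


c5 = [(i, (i + 1) % 5) for i in range(5)]  # the cycle C5
p5 = [(i, i + 1) for i in range(4)]        # the path P5 on 5 vertices
star = [(0, 1), (0, 2), (0, 3)]            # K_{1,3}, maximum degree 3

for name, g in (("C5", c5), ("P5", p5), ("K1,3", star)):
    verts, adj = line_graph(g)
    print(name, len(verts), len(adj))
```

The output lines are `C5 5 5`, `P5 4 3` and `K1,3 3 3`: the line graph of C5 has 5 vertices and 5 edges, that of P5 has the 4 vertices and 3 edges of P4, and the three edges at the center of the star form a triangle, a complete subgraph of size ∆ = 3.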
Exercise 7.7.15. 1. Let G be a connected Eulerian graph. Then, show that the line graph
of G is Hamiltonian. Is the converse true?
Ans: Converse is not true, consider K4 − 12.
2. What can you say about the clique number of a line graph?
Ans: ∆(G).

Theorem 7.7.16. A connected graph G is isomorphic to its line graph if and only if G = Cn ,
for some n ≥ 3.

Proof. If G is isomorphic to its line graph, then |G| = ‖G‖. Thus, G is a unicyclic graph.
Let [v1 , v2 , . . . , vk , vk+1 = v1 ] form the cycle in G. Then, the line graph of G contains a cycle
P = [v1 v2 , v2 v3 , . . . , vk v1 ]. We now claim that dG (vi ) = 2.
Suppose not and let dG (v1 ) ≥ 3. So, there exists a vertex u ∉ {v2 , . . . , vk } such that u ∼ v1 .
In that case, the line graph of G contains the triangle T = [v1 v2 , v1 vk , v1 u] and P ≠ T . Thus,
the line graph is not unicyclic, a contradiction.

Exercise 7.7.17. Consider the graphs shown below.


1. Determine the closure of G.

Ans: The closure of G is the complete graph.

2. Show that H is not Hamiltonian.

G H

Ans: In H, let u1 , u2 and u3 be the vertices of degree 4. Let u′i be that vertex on the outer
circle which is not adjacent to ui , for i = 1, 2, 3. Notice that H − {u1 , u2 , u3 , u′1 , u′2 , u′3 } is
disconnected with 7 components. As deleting 6 vertices from a Hamiltonian graph leaves at
most 6 components, H cannot be Hamiltonian.
3. Give a necessary and sufficient condition on m, n ∈ N so that Km,n is Hamiltonian.
Ans: Since Km,n is bipartite, if we have a Hamiltonian cycle, its length is even and at least
4. Thus, half of its vertices will be in one part and the other half will be in the other part. So,
m = n. Conversely, if m = n ≥ 2, then taking the parts {1, 3, . . . , 2n − 1} and {2, 4, . . . , 2n}
we see that [1, 2, . . . , 2n − 1, 2n, 1] is a Hamiltonian cycle.

4. Show that any graph G with |G| = n ≥ 3 and ‖G‖ ≥ C(n − 1, 2) + 2 is Hamiltonian.
Ans: We claim that the degree sequence of G satisfies ‘for each k < n/2 with dk ≤ k we
have dn−k ≥ n − k’. In that case G is Hamiltonian by Theorem 7.7.11.
Suppose that our claim does not hold. Thus, there is a k ≤ (n − 1)/2 such that dk ≤ k and
dn−k < n − k, that is, dn−k ≤ n − k − 1. Thus, d1 ≤ · · · ≤ dk ≤ k and
dk+1 ≤ · · · ≤ dn−k ≤ n − k − 1, and hence the total sum of the degrees is at most

k² + (n − 2k)(n − k − 1) + k(n − 1) = n² + 3k² − 2nk − n + k.

Note that the sum of degrees is at least 2 (C(n − 1, 2) + 2) = n² − 3n + 6. We now show that
n² + 3k² − 2nk − n + k − (n² − 3n + 6) is negative and hence obtain a contradiction. As
k ≤ (n − 1)/2, that is, n ≥ 2k + 1, and k ≥ 1, we have

n² + 3k² − 2nk − n + k − (n² − 3n + 6) = 3k² − 2n(k − 1) + k − 6
≤ 3k² − (4k + 2)(k − 1) + k − 6 = −(k² − 3k + 4) < 0.

5. Show that for any n ≥ 3 there is a graph H with |H| = n and ‖H‖ = C(n − 1, 2) + 1 that is
not Hamiltonian. But, prove that all such graphs H admit a Hamiltonian path (a path
containing all vertices of H).
Ans: Consider Kn−1 with a new pendant vertex added at some vertex. This graph is not
Hamiltonian as it has a pendant vertex.
Any graph H with |H| = n ≥ 3 and ‖H‖ = C(n − 1, 2) + 1 has a Hamiltonian path: if e is a
new edge, then H + e is Hamiltonian by the previous exercise, and deleting an edge (the edge e,
if it is used) from a Hamiltonian cycle of H + e leaves a Hamiltonian path in H.

7.8 Bipartite Graphs

Definition 7.8.1. A graph is said to be 2-colorable if its vertices can be colored with two
colors in a way that adjacent vertices get different colors.

Lemma 7.8.2. Let P and Q be two v-w-paths in G such that length of P is odd and length of
Q is even. Then, G contains an odd cycle.

Proof. If P, Q have no inner vertex (a vertex other than v, w) in common then P ∪ Q is an odd
cycle in G.
So, suppose P, Q have an inner vertex in common. Let x be the first common inner vertex
when we walk from v to w. Then, one of P (v, x), P (x, w) has odd length and the other is even.
Let P (v, x) be odd. If length of Q(v, x) is even then P (v, x) ∪ Q(v, x) is an odd cycle in G. If
length of Q(v, x) is odd then the length of Q(x, w) is also odd and hence we can consider the
x-w-paths P (x, w) and Q(x, w) and proceed as above to get the required result.

Theorem 7.8.3. Let G be a connected graph with at least two vertices. Then, the following
statements are equivalent.
1. G is 2-colorable.
2. G is bipartite.

3. G does not have an odd cycle.

Proof. Part 1 ⇒ Part 2. Let G be 2-colorable. Let V1 be the set of red vertices and V2 be the
set of blue vertices. Clearly, G is bipartite with partition V1 , V2 .
Part 2 ⇒ Part 1. Color the vertices in V1 with red color and that of V2 with blue color to get
the required 2 colorability of G.
Part 2 ⇒ Part 3. Let G be bipartite with partition V1 , V2 . Let v0 ∈ V1 and suppose Γ =
v0 v1 v2 · · · vk = v0 is a cycle. It follows that v1 , v3 , v5 , . . . ∈ V2 . Since vk = v0 ∈ V1 , we see that k is
even. Thus, Γ has an even length.
Part 3 ⇒ Part 2. Suppose that G does not have an odd cycle. Pick any vertex v. Define
V1 = {w | there is a walk of even length from v to w} and V2 = {w | there is a walk of odd length
from v to w}. Clearly, v ∈ V1 . Also, G does not have an odd cycle implies that V1 ∩ V2 = ∅ (use
Lemma 7.8.2). As G is connected each w is either in V1 or in V2 .
Let x ∈ V1 . Then, there is an even path P (v, x) from v to x. If xy ∈ E(G), then we have
a v-y-walk of odd length. Deleting all cycles from this walk, we have an odd v-y-path. Thus,
y ∈ V2 . Similarly, if x ∈ V2 and xy ∈ E, then y ∈ V1 . Thus, G is bipartite with parts V1 , V2 .
Exercise 7.8.4. 1. There are 15 women and some men in a room. Each man shook hands
with exactly 6 women and each woman shook hands with exactly 8 men. How many men
are there in the room?
Ans: Think of women as the set V1 and men as the set V2 . Then, total number of handshakes
between men and women (each vertex in V1 has degree 8) is 15 × 8 = 120. Since each man
shook hands with exactly 6 women (each vertex in V2 has degree 6), the number of men is
120/6 = 20.

2. How do you test whether a graph is bipartite or not?

Ans: First verify whether the graph is connected or not. If connected, proceed. Else, do the
following for each connected component. Start from a vertex v. For 1 ≤ i ≤ n, let Di (v) be
the set of vertices which are at a distance i from v.
Then, G is not bipartite if and only if G has an odd cycle. Or equivalently, two vertices in
some Di (v) are adjacent.
Or, put V1 = {v} ∪ D2 (v) ∪ D4 (v) ∪ · · · and V2 = D1 (v) ∪ D3 (v) ∪ · · · . If no edge of G has
both of its end vertices in V1 or both of them in V2 , then G is bipartite;
else, it is not bipartite.
3. Prove that every tree is a bipartite graph.
Ans: A tree has no cycle and hence no odd cycle.
4. Prove that the Petersen graph is not bipartite.
Ans: It has a cycle of length 5, an odd number.
5. Let G and H be two bipartite graphs. Prove that G × H is also a bipartite graph.
Ans: As G and H are bipartite, let V1 , V2 and V3 , V4 be the bipartition of V (G) and
V (H), respectively. Then, (V1 × V3 ) ∪ (V2 × V4 ) and (V1 × V4 ) ∪ (V2 × V3 ) is a bipartition of
V (G × H).
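
The test described in the answer to part 2 can be sketched in code. The following is a minimal sketch, assuming the graph is given as a dictionary `adj` that maps each vertex to the set of its neighbours (this input format and the helper `cycle` are our own choices for illustration). It colors each vertex by the parity of its distance from the starting vertex of its component and reports failure as soon as an edge joins two vertices of the same parity.

```python
from collections import deque


def bipartition(adj):
    """Return (V1, V2) if the graph is bipartite, else None.

    adj maps each vertex to the set of its neighbours (assumed input format).
    """
    color = {}
    for start in adj:                      # treat each connected component
        if start in color:
            continue
        color[start] = 0
        queue = deque([start])
        while queue:
            x = queue.popleft()
            for y in adj[x]:
                if y not in color:
                    color[y] = 1 - color[x]    # parity of the distance from start
                    queue.append(y)
                elif color[y] == color[x]:
                    return None                # edge inside one part: odd cycle
    v1 = {v for v, c in color.items() if c == 0}
    v2 = {v for v, c in color.items() if c == 1}
    return v1, v2


def cycle(n):
    """Adjacency dictionary of the cycle C_n on the vertices 0, ..., n-1."""
    return {i: {(i - 1) % n, (i + 1) % n} for i in range(n)}


print(bipartition(cycle(6)))   # a bipartition of C6
print(bipartition(cycle(5)))   # None, as C5 is an odd cycle
```

Each vertex and each edge is examined a bounded number of times, so the test is linear in |G| + ‖G‖.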

7.9 Matching in Graphs

Definition 7.9.1. A matching in a graph G is an independent set of edges. A maximum
matching is a matching with maximum number of edges. A vertex v is saturated by a
matching M if there is an edge e ∈ M incident on v. A matching is a perfect matching if
every vertex is saturated.
Example 7.9.2. 1. In Figure 7.17, M1 = {u1 u2 } is a matching. So is M2 = {e}, where e is
any edge. The set M3 = {u3 u2 , u4 u7 } is also a matching. The set M4 = {u1 u2 , u4 u5 , u6 u7 }
is also a matching and it is maximum (why?). Can you give another maximum matching?

Figure 7.17: A graph (vertex labels u1 , . . . , u7 )

2. Any non trivial graph G has a maximum matching.

3. Vertices that are saturated for M3 are {u2 , u3 , u4 , u7 }.

4. Any graph with a perfect matching must have even order as each edge saturates two
vertices. The graph in Figure 7.17 cannot have a perfect matching.

Definition 7.9.3. Let M be a matching in G. A path P is called M -alternating if its edges

are alternately from M and from G − M . An M -alternating path with two unmatched vertices
as end points (of the alternating path) is called M -augmenting. Convention: Each path of
length 1 in M is M -alternating.

Example 7.9.4. Consider Figure 7.17 and Example 7.9.2.

1. The path [u1 , u2 ] is M1 -alternating. The only path of length 2 which is M1 -alternating is
[u1 , u2 , u3 ].
2. The path [u1 , u2 , u4 , u7 ] is not M3 -alternating. But, [u2 , u3 , u4 , u7 ] is M3 -alternating.
3. The path P = [u1 , u2 , u3 , u4 , u7 , u6 ] is M3 -alternating and M3 -augmenting. This gives us
a way to get a larger (in size) matching M5 using M3 : throw away the even edges of P
from M3 and add the odd edges; i.e., M5 = M3 − {u2 u3 , u4 u7 } + {u1 u2 , u3 u4 , u7 u6 }.
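
The augmentation step in part 3 is just a symmetric difference of edge sets. A minimal sketch, in which the helper names `edge` and `augment` are our own and edges are stored as unordered pairs:

```python
def edge(u, v):
    """An undirected edge, stored as an unordered pair."""
    return frozenset((u, v))


def augment(matching, path):
    """Symmetric difference of a matching with the edge set of a path."""
    path_edges = {edge(a, b) for a, b in zip(path, path[1:])}
    return matching ^ path_edges


M3 = {edge("u3", "u2"), edge("u4", "u7")}
P = ["u1", "u2", "u3", "u4", "u7", "u6"]      # the M3-augmenting path of part 3
M5 = augment(M3, P)
print(sorted(sorted(e) for e in M5))
```

The result has one edge more than M3, as an augmenting path always has one more edge outside the matching than inside it.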

Theorem 7.9.5. [Berge, 1957] A matching M is maximum if and only if there is no
M -augmenting path in G.

Proof. Let M = {u1 v1 , . . . , uk vk } be a maximum matching. If there is an M -augmenting path
P , then (P \ M ) ∪ (M \ P ) is a larger matching, a contradiction. Conversely, suppose that there
is no M -augmenting path in G but M is not maximum. Let M ∗ be a maximum matching.
Consider the graph H = (V, M ∪ M ∗ ). Note
that dH (v) ≤ 2, for each vertex in H. Thus, H is a collection of isolated vertices, paths and
cycles. Since a cycle contains equal number of edges of M and M ∗ , and |M ∗ | > |M |, there is a
path P which contains more edges of M ∗ than of M . Then, P is an M -augmenting path, a
contradiction.

Exercise 7.9.6. How do we find a maximum matching in a graph G.
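
For bipartite graphs, Theorem 7.9.5 already suggests one answer: keep searching for an augmenting path and augment until none is left. The following is a minimal sketch of this idea under our own assumptions: the graph is bipartite, `adj` maps each left vertex to its right neighbours, and the function names are hypothetical. For general graphs, odd cycles make the search harder (this is handled by Edmonds' blossom algorithm), and the sketch below does not cover that case.

```python
def max_bipartite_matching(adj):
    """Maximum matching of a bipartite graph by repeated augmentation.

    adj maps each left vertex to an iterable of its right neighbours
    (assumed input format). Returns a dict left vertex -> matched right vertex.
    """
    match_of = {}                          # right vertex -> left vertex

    def try_augment(x, seen):
        # Look for an alternating path from x to an unmatched right vertex.
        for y in adj[x]:
            if y in seen:
                continue
            seen.add(y)
            if y not in match_of or try_augment(match_of[y], seen):
                match_of[y] = x            # flip the edges along the path
                return True
        return False

    for x in adj:
        try_augment(x, set())
    return {x: y for y, x in match_of.items()}


G1 = {1: ["a", "b"], 2: ["a"], 3: ["b", "c"]}
G2 = {1: ["a"], 2: ["a"], 3: ["a", "b"]}
print(max_bipartite_matching(G1))          # saturates all of 1, 2, 3
print(max_bipartite_matching(G2))          # only two left vertices can be matched
```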

Example 7.9.7. Can we find a matching that saturates all vertices in the graph given below?

Ans: No. Let X be the given graph and take S = {1, 2, 3}. If there is a matching that
saturates S then |N (S)| ≥ |S|. But this is not the case with this graph.

Question: What if |N (S)| were at least |S|, for each S ⊆ X?

Theorem 7.9.8. [Hall, 1935] Let G = (X ∪ Y, E) be a bipartite graph. Then, there is a matching
that saturates all vertices in X if and only if for all S ⊆ X, |N (S)| ≥ |S|.

Proof. If there is such a matching, then obviously |S| ≤ |N (S)|, for each subset S of X.
Conversely, suppose that |N (S)| ≥ |S|, for each S ⊆ X. Let if possible, M ∗ be a maximum
matching that does not saturate x ∈ X.
As |N ({x})| ≥ |{x}|, there is a y ∈ Y such that xy ∉ M ∗ . Since M ∗ cannot be extended, y
must have been matched to some x1 ∈ X.
Now consider N ({x, x1 }). It has a vertex y1 which is adjacent to either x or x1 or both by an
edge not in M ∗ . Again the condition that M ∗ cannot be extended implies that y1 must have
been matched to some x2 ∈ X. Continuing as above, we see that this process never stops and
thus, G has infinitely many vertices, which is not true. Hence, M ∗ saturates each x ∈ X.
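
For a small bipartite graph, Hall's condition can be checked directly by running over all subsets S of X. The following is a brute-force sketch (exponential in |X|, so meant only for experiments); the input format, a dictionary `adj` from the vertices of X to their neighbour sets in Y, and the function name are our own assumptions.

```python
from itertools import combinations


def hall_violation(adj):
    """Return a set S of left vertices with |N(S)| < |S|, or None if none exists.

    adj maps each vertex of X to the collection of its neighbours in Y.
    """
    xs = list(adj)
    for r in range(1, len(xs) + 1):
        for S in combinations(xs, r):
            neighbours = set().union(*(adj[x] for x in S))   # N(S)
            if len(neighbours) < len(S):
                return set(S)
    return None


print(hall_violation({1: {"a"}, 2: {"a"}, 3: {"a", "b"}}))        # a violating set
print(hall_violation({1: {"a", "b"}, 2: {"a"}, 3: {"b", "c"}}))   # None
```

By Theorem 7.9.8, the second graph has a matching saturating X, whereas the first does not.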

Corollary 7.9.9. Let G be a k-regular (k ≥ 1) bipartite graph. Then, G has a perfect matching.

Proof. Let X and Y be the two parts. Since G is k-regular, |X| = |Y |. Let S ⊆ X and E be the
set of edges with an end vertex in S. Then k|S| = |E| ≤ Σ_{v∈N (S)} d(v) = k|N (S)|. Hence, we see
that for each S ⊆ X, |S| ≤ |N (S)| and thus, by Hall’s theorem the required result follows.

Definition 7.9.10. Let G be a graph. Then, S ⊆ V (G) is called a covering of G if each

edge has at least one end vertex in S. A minimum covering of G is a covering of G that has
minimum cardinality.

Exercise 7.9.11. 1. Show that for any graph G the size of a minimum covering is n−α(G).

Ans: If M is a minimum covering, then G − M has no edges, so n − |M | ≤ α(G). If S is a

maximum independent set, then V (G) − S is a covering, so n − α(G) ≥ |M |.

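2. Characterize G in terms of its girth if the size of a minimum covering is |G| − 2.

Ans: By the previous part, α(G) = 2. That is, G is not complete and the complement of G
has no triangle (the girth of the complement of G is at least 4).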

Proposition 7.9.12. Let G be a graph. If M is a matching and K is a covering of G, then

|M | ≤ |K|. If |M | = |K|, then M is a maximum matching and K is a minimum covering.

Proof. By definition, the proof of the first statement is trivial. To prove the second statement,
suppose that |M | = |K| and M is not a maximum matching. Let M ∗ be a matching of G with
|M ∗ | > |M |. Then, using the first statement, we have |K| ≥ |M ∗ |. Hence, |K| ≥ |M ∗ | > |M | =
|K|, a contradiction. Thus, M is maximum. As each covering must have at least |M | elements, we see that K
is a minimum covering.

Exercise 7.9.13. Let G = Kn , n ≥ 3. Then, determine

1. the cardinality of a maximum matching?

Ans: ⌊n/2⌋

2. the cardinality of a minimum covering?

Ans: n − 1 (by Exercise 7.9.11, since α(Kn ) = 1)

Is the converse of Proposition 7.9.12 necessarily true? Can you guess the class of graphs for
which the converse of Proposition 7.9.12 is true?
Ans: See the next theorem.

Theorem 7.9.14. [König, 1931] Let M be a maximum matching in a bipartite graph G and
let K be a minimum covering. Then, |M | = |K|.

Proof. Let V = X ∪ Y be the bipartition of V and let M be a maximum matching. Let U be

the vertices in X that are not saturated by M and let Z be the set of vertices reachable from U
by an M -alternating path.
Put S = Z ∩ X, T = Z ∩ Y and K = T ∪ (X \ S). Then, U ⊆ Z ⊆ X ∪ Y and every element of
X \ S is saturated. Also, every vertex in T is saturated by M (as M is a maximum matching)
and N (S) = T (else there will be M -augmenting path starting from u ∈ U ). Further, a vertex
v ∈ X \ S is matched to some vertex y ∉ T . Thus, |K| = |T ∪ (X \ S)| ≤ |M |. If K is not a
covering, then there is an edge xy ∈ G with x ∈ S and y ∉ T , a contradiction to N (S) = T .
Thus, K is a covering and hence, using |K| ≤ |M | and Proposition 7.9.12, we get |K| = |M |.
Furthermore, by Proposition 7.9.12, we also see that K is a minimum covering.

Alternate proof: Let (L, R) (L for left and R for right) be the bipartition of V and let M
be a maximum matching. Let U be the set of unmatched vertices on the left.

[Figure: the bipartition (L, R) with the sets U , U′_L on the left and U′_R on the right]

Let U′ be the set of vertices reachable from U by alternating paths (with respect to M ). Then
U′ has two parts: one on the left, say U′_L , and the other on the right, say U′_R . Note that
the vertices of U are reachable from themselves. Hence, we have U ⊆ U′_L . We have a few
observations.
a) If v ∈ L is a left vertex not in U′_L , then it is not in U , and so it must be matched to some
right vertex, say w. Can w ∈ U′_R ? No. Because, if w ∈ U′_R , then we have an alternating path
from some u ∈ U to w and as [w, v] is a matching edge, we see that v is reachable from u by an
alternating path. Then v should have been in U′_L , a contradiction. Thus every vertex from
L \ U′_L is matched to a vertex in R \ U′_R .
b) Is every vertex in U′_R matched (saturated)? Yes. To see it, suppose that w ∈ U′_R is not
matched. As w ∈ U′_R , it must be reachable from a vertex u ∈ U via an alternating path. But,
this alternating path is an augmenting path. This means M is not a maximum matching, a
contradiction.
c) The above two points imply that |M | = |L \ U′_L | + |U′_R |.
d) Is there any edge from a vertex in U′_L to a vertex in R \ U′_R ? No. To see this note that,
each vertex in U′_L \ U is reached from some vertex of U via an alternating path and the last edge
of this path must be a matching edge. Thus, each vertex in U′_L \ U is matched to some vertex
in U′_R . This means, if there is an edge from a vertex in U′_L to a vertex w ∈ R \ U′_R , it must be
a nonmatching edge. But then, this makes w reachable from U via an alternating path. So w
should have been in U′_R , a contradiction.
e) The previous point means that (L \ U′_L ) ∪ U′_R is a covering. This is a minimum covering, as
any covering must contain at least |M | many vertices by Proposition 7.9.12.
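
The alternate proof is constructive, and the covering (L \ U′_L ) ∪ U′_R can be computed once a maximum matching is known. The following is a minimal sketch under our own assumptions: `adj` maps each left vertex to its right neighbours, the maximum matching is found by repeated augmentation, and vertices of the covering are tagged with "L" or "R" so that labels on the two sides cannot clash. All names are hypothetical.

```python
def konig_cover(adj):
    """Return (matching, cover) for a bipartite graph.

    adj maps each left vertex to a list of right neighbours (assumed format).
    The cover is (L \\ U'_L) together with U'_R, as in the alternate proof.
    """
    match_r = {}                               # right vertex -> left vertex

    def try_augment(x, seen):
        for y in adj[x]:
            if y not in seen:
                seen.add(y)
                if y not in match_r or try_augment(match_r[y], seen):
                    match_r[y] = x
                    return True
        return False

    for x in adj:                              # build a maximum matching
        try_augment(x, set())
    match_l = {x: y for y, x in match_r.items()}

    reach_l = {x for x in adj if x not in match_l}   # U, the start of U'_L
    reach_r = set()                                  # U'_R
    stack = list(reach_l)
    while stack:                               # follow alternating paths
        x = stack.pop()
        for y in adj[x]:                       # left to right: any new edge
            if y not in reach_r:
                reach_r.add(y)
                x2 = match_r.get(y)            # right to left: matching edge
                if x2 is not None and x2 not in reach_l:
                    reach_l.add(x2)
                    stack.append(x2)

    cover = {("L", x) for x in adj if x not in reach_l}
    cover |= {("R", y) for y in reach_r}
    return match_l, cover


G = {1: ["a"], 2: ["a"], 3: ["a", "b"]}
matching, cover = konig_cover(G)
print(matching, cover)
```

In this small example the matching and the covering both have two elements, as Theorem 7.9.14 predicts.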

Exercise 7.9.15. How many perfect matchings are there in a labeled K2n ?
Ans: This is the number of partitions of {1, 2, . . . , 2n} into n subsets of size two: (2n)!/(n! 2^n ).
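
The formula can be checked for small n by direct enumeration. In the sketch below (the function names are our own), the smallest remaining vertex is paired with each possible partner in turn and the rest is counted recursively.

```python
from math import factorial


def count_perfect_matchings(vertices):
    """Number of perfect matchings of the complete graph on the given list."""
    if not vertices:
        return 1
    rest = vertices[1:]                 # the first vertex must be matched
    total = 0
    for i in range(len(rest)):          # choose its partner rest[i]
        total += count_perfect_matchings(rest[:i] + rest[i + 1:])
    return total


def formula(n):
    return factorial(2 * n) // (factorial(n) * 2 ** n)


for n in range(1, 5):
    print(n, count_perfect_matchings(list(range(2 * n))), formula(n))
```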

7.10 Ramsey Numbers

Recall that in any group of 6 or more persons either we see 3 mutual friends or we see 3 mutual
strangers. Expressed using graphs it reads ‘let G = (V, E) be a graph with |V | ≥ 6. Then, either
K3 ⊆ G or K̄3 ⊆ G.’

Definition 7.10.1. The Ramsey number r(m, n) is the smallest natural number k such that
any graph G on k vertices either has a Km or a K̄n as its subgraph.


Example 7.10.2. As C5 does not have K3 or K̄3 as its subgraph, r(3, 3) > 5. But, using
the first paragraph of this section, we get r(3, 3) ≤ 6 and hence, r(3, 3) = 6. It is known that
r(3, 4) = 9 (see the text by Harary for a table).
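
The value r(3, 3) = 6 is small enough to confirm by exhaustive search over all graphs on 5 and on 6 labeled vertices. The following brute-force sketch uses our own function names; it looks for three vertices that are either pairwise adjacent or pairwise non-adjacent.

```python
from itertools import combinations


def has_k3_or_independent_3(n, edges):
    """True if the graph on range(n) has a triangle or 3 independent vertices."""
    for a, b, c in combinations(range(n), 3):
        present = ((a, b) in edges) + ((a, c) in edges) + ((b, c) in edges)
        if present in (0, 3):
            return True
    return False


def every_graph_has(n):
    """Check all 2^C(n,2) labeled graphs on n vertices."""
    pairs = list(combinations(range(n), 2))
    for mask in range(1 << len(pairs)):
        edges = {p for i, p in enumerate(pairs) if (mask >> i) & 1}
        if not has_k3_or_independent_3(n, edges):
            return False
    return True


print(every_graph_has(5))   # False: C5 is a counterexample
print(every_graph_has(6))   # True
```

The same approach is hopeless for larger Ramsey numbers, as the number of graphs on k vertices is 2^C(k,2).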

Proposition 7.10.3. Let G be a graph on 9 vertices. Then, either K4 ⊆ G or K̄3 ⊆ G.

Proof. Assume that |V | = 9. Then, we need to consider three cases.

Case I. There is a vertex a with d(a) ≤ 4. Then, |N (a)′ | = |V \ (N (a) ∪ {a})| ≥ 4. If all vertices in
N (a)′ are pairwise adjacent, then K4 ⊆ G. Otherwise, there are two nonadjacent vertices, say
b, c ∈ N (a)′ . In that case a, b, c induces the graph K̄3 .
Case II. There is a vertex a with d(a) ≥ 6. If ⟨N (a)⟩ has a K̄3 , we are done. Otherwise,
r(3, 3) = 6 implies that ⟨N (a)⟩ has a K3 with vertices, say, b, c, d. In that case a, b, c, d induces
the graph K4 .
Case III. Each vertex has degree 5. This case is not possible as Σ d(v) should be even.

Exercise 7.10.4. Can you draw a graph on 8 vertices

1. which does not have K3 , K̄4 in it?
Ans: Think of C8 with all main diagonals.
2. which does not have K4 , K̄3 in it?
Ans: Take complement of the graph in the previous exercise.
3. Consider the graph C8 = [1, 2, . . . , 8, 1] with 10 extra edges 13, 14, 17, 26, 27, 35, 36, 48, 57, 58.
Does this graph have a K4 or a K̄3 (the complement of C3 )?

Ans: It is visible that the complement of this graph does not have a triangle, that is, the graph
has no K̄3 . To see that it has no K4 either, consider the complement of the given graph.

[Figure: the complement of the given graph, with outer cycle [8, 3, 7, 4, 6, 8] and the vertices 1, 2, 5 inside]

Suppose that we could select 4 mutually non-adjacent vertices in it. Then at most two of them
can be from the outer circle [8, 3, 7, 4, 6, 8]. It follows that, we must have selected at least two
from {1, 2, 5}. As 5 is adjacent to both 1, 2, it follows that we must have selected 1 and 2.
But, then we could not have selected their neighbors 4, 5, 6, 8. So, we must have selected 3, 7.
But they are adjacent. This is a contradiction.

Theorem 7.10.5. [Erdős & Szekeres, 1935] Let m, n ∈ N. Then,

r(m, n) ≤ r(m − 1, n) + r(m, n − 1).

Proof. Let p = r(m − 1, n) and q = r(m, n − 1). Now, take any graph G on p + q vertices and
take a vertex a. If d(a) ≥ p, then ⟨N (a)⟩ has either a subgraph Km−1 (and Km−1 together
with a gives Km ) or a subgraph K̄n . Otherwise, |N (a)′ | ≥ q. In this case, ⟨N (a)′ ⟩ has either a
subgraph Km or a subgraph K̄n−1 (K̄n−1 together with a gives K̄n ).
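
Iterating the inequality of Theorem 7.10.5 gives an explicit upper bound on r(m, n). The sketch below assumes the boundary values r(1, n) = r(m, 1) = 1 (a single vertex is both a K1 and a K̄1); this convention and the function name are our own. With this convention the recursion reproduces the binomial coefficient C(m + n − 2, m − 1).

```python
from functools import lru_cache
from math import comb


@lru_cache(maxsize=None)
def ramsey_upper(m, n):
    """Upper bound on r(m, n) obtained by iterating Theorem 7.10.5."""
    if m == 1 or n == 1:
        return 1                    # assumed boundary value r(1, n) = r(m, 1) = 1
    return ramsey_upper(m - 1, n) + ramsey_upper(m, n - 1)


print(ramsey_upper(3, 3))           # 6, which equals r(3, 3)
print(ramsey_upper(3, 4))           # 10, whereas r(3, 4) = 9
```

So the bound is attained for r(3, 3) but it is not sharp in general.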

7.11 Degree Sequence

Definition 7.11.1. The degree sequence of a graph of order n is the tuple (d1 , . . . , dn ) where
d1 ≤ · · · ≤ dn . A nondecreasing sequence d = (d1 , . . . , dn ) of nonnegative integers is graphic if
there is a graph whose degree sequence is d.

Exercise 7.11.2. Show that (1, 1, 3, 3) is not graphic.

Ans: Let the vertices be {a, b, c, d}. If deg(a) = deg(b) = 3 then a ∼ b, c, d and b ∼ a, c, d.
Thus, deg(c), deg(d) ≥ 2, so no vertex can have degree 1, a contradiction.

Theorem 7.11.3. Fix n ≥ 1 and the natural numbers d1 ≤ · · · ≤ dn . Then, d = (d1 , . . . , dn ) is
the degree sequence of a tree on n vertices if and only if Σ di = 2n − 2.

Proof. If d = (d1 , . . . , dn ) is the degree sequence of a tree T on n vertices then Σ di = 2|E(T )| =
2(n − 1) = 2n − 2.
Conversely, let d1 ≤ · · · ≤ dn be a sequence of natural numbers with Σ di = 2n − 2. We
use induction to show that d = (d1 , . . . , dn ) is the degree sequence of a tree on n vertices. For
n = 1, 2, the result is trivial. Let the result be true for all n < k and let d1 ≤ · · · ≤ dk , k > 2, be
natural numbers with Σ di = 2k − 2. Since Σ di = 2k − 2, we must have d1 = 1 and dk > 1.
Then, we note that d′2 = d2 , . . . , d′k−1 = dk−1 and d′k = dk − 1 are natural numbers such that
Σ d′i = 2(k − 1) − 2. Hence, by induction hypothesis, there is a tree T′ on vertices 2, . . . , k − 1, k
with degrees d′i ’s. Now, introduce a new vertex 1 and add the edge {1, k} to get a tree T that
has the required degree sequence.

Theorem 7.11.4. [Havel-Hakimi, 1962] The degree sequence d = (d1 , . . . , dn ) is graphic if and
only if the sequence d1 , d2 , . . . , d_{n−dn −1} , d_{n−dn} − 1, . . . , d_{n−1} − 1 is graphic.

Proof. If the latter sequence is graphic then we introduce a new vertex and make it adjacent to
the vertices whose degrees are d_{n−dn} − 1, . . . , d_{n−1} − 1. Hence, the sequence d = (d1 , . . . , dn ) is
graphic.
Now, assume that d is graphic and G is a graph with degree sequence d. Let dn = k and let
NG (n) = {i1 , i2 , . . . , ik } with d_{i1} ≤ d_{i2} ≤ · · · ≤ d_{ik} . If d_{i1} ≥ dv for all v ≠ n in V (G) \ NG (n) then
{d_{i1} , d_{i2} , . . . , d_{ik} } = {d_{n−dn} , d_{n−dn +1} , . . . , d_{n−1} } and hence G − n is the required graph.
If d_{i1} < d_{v0} for some v0 ≠ n in V (G) \ NG (n) then, we construct another graph, say G′ , such that
G and G′ have the same degree sequence but

Σ_{v∈NG′ (n)} dv > Σ_{u∈NG (n)} du . (7.4)

As v0 ≁ n, i1 ∼ n and d_{v0} > d_{i1} , the vertex v0 has a neighbor v ≠ i1 with v ≁ i1 . Now, consider the graph
G′ = G − {v0 , v} + {n, v0 } + {i1 , v} − {i1 , n}. Then, G′ also has d as its degree sequence with
NG′ (n) = {v0 , i2 , . . . , ik }. Thus, we see that Equation (7.4) holds. This process will end after
a finite number of steps by producing a graph in which the vertex n has degree dn and has
neighbors with degrees d_{n−dn} , d_{n−dn +1} , . . . , d_{n−1} and hence the required result follows.
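
Theorem 7.11.4 gives a simple recursive test for a sequence to be graphic: delete the largest entry dn , subtract 1 from the dn largest remaining entries and repeat. A minimal sketch, assuming the input consists of nonnegative integers (the function name is our own):

```python
def is_graphic(degrees):
    """Havel-Hakimi test for a sequence of nonnegative integers."""
    seq = sorted(degrees)
    while seq:
        d = seq.pop()                       # the largest remaining degree d_n
        if d > len(seq):
            return False                    # not enough vertices to join to
        for i in range(len(seq) - d, len(seq)):
            seq[i] -= 1                     # the d largest entries lose 1
            if seq[i] < 0:
                return False
        seq.sort()                          # restore nondecreasing order
    return True


print(is_graphic([1, 1, 3, 3]))             # False, see Exercise 7.11.2
print(is_graphic([2, 2, 3, 4, 4, 5]))       # True
```

Recording, at each step, which vertices lose a degree also produces a graph with the given degree sequence, exactly as in the first part of the proof.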
Exercise 7.11.5. 1. How many different degree sequences are possible on a graph with 5
vertices? List all the degree sequences and draw a graph for each one. (Include connected
and disconnected graphs.)

2. Which of the sequences below are graphic? Draw the graph or supply an argument.
(a) (2, 2, 3, 4, 4, 5)
(b) (1, 2, 2, 3, 3, 4)
(c) (2², 3⁶, 4²) = (2, 2, 3, 3, 3, 3, 3, 3, 4, 4)

3. If two graphs have the same degree sequence, are they necessarily isomorphic?
Ans: No. Consider C6 and disjoint union of two C3 s.
4. If two graphs are isomorphic, is it necessary that they have the same degree sequence?
Ans: Yes. Let G and H be two isomorphic graphs with vertex sets {v1 , . . . , vn } and
{u1 , . . . , un }, respectively. Let f be an isomorphism such that f (vi ) = ui . Thus, the degree
of vi in G is the same as the degree of ui in H. So, the multiset {d(vi ) : i = 1, . . . , n} is the
same as the multiset {d(ui ) : i = 1, . . . n}. As the degree sequences are obtained by sorting
these sets, we see that these graphs have the same degree sequence.

7.12 Planar Graphs

Definition 7.12.1. A graph is said to be embedded on a surface S when it is drawn on S so
that no two edges intersect. A graph is said to be planar if it can be embedded on the plane.
A plane graph is a planar graph together with a fixed embedding of it on the plane.

Figure 7.18: Planar and non-planar graphs (panels: K5 , non-planar; K3,3 , non-planar; K4 ; a planar embedding of K4 )

Example 7.12.2. 1. A tree is embeddable on a plane and when it is embedded we have only
one face, the exterior face.
2. Any cycle Cn , n ≥ 3 is planar and any plane representation of Cn has two faces.
3. The planar embedding of K4 is given in Figure 7.18.
4. Draw a planar embedding of K2,3 .
5. Draw a planar embedding of the three dimensional cube.
6. Draw a planar embedding of K5 − e, where e is any edge.
7. Draw a planar embedding of K3,3 − e, where e is any edge.

Definition 7.12.3. Consider a planar embedding of a graph G. The regions on the plane
defined by this embedding are called faces/regions of G. The unbounded face/region is called
the exterior face (see Figure 7.19).

Example 7.12.4. Consider the following planar embedding of the graphs X1 and X2 .

Figure 7.19: Planar graphs X1 and X2 with labeled faces to understand the Euler’s theorem

1. The faces of the planar graph X1 and their corresponding edges are listed below.

Face   Corresponding Edges
f1     {9, 8}, {8, 9}, {8, 2}, {2, 1}, {1, 2}, {2, 7}, {7, 2}, {2, 3}, {3, 4}, {4, 6}, {6, 4}, {4, 5},
       {5, 4}, {4, 12}, {12, 4}, {4, 11}, {11, 10}, {10, 13}, {13, 14}, {14, 10}, {10, 8}, {8, 9}
f2     {10, 13}, {13, 14}, {14, 10}
f3     {4, 11}, {11, 10}, {10, 4}
f4     {2, 3}, {3, 4}, {4, 10}, {10, 8}, {8, 2}, {2, 15}, {15, 2}

2. Determine the faces of the planar graph X2 and their corresponding edges.

From the table, we observe that each edge of X1 appears in two faces. This can be easily
observed for the faces that don’t have pendant vertices (see the faces f2 and f3 ). In faces f1
and f4 , there are a few edges which are incident with a pendant vertex. Observe that the edges
that are incident with a pendant vertex, e.g., the edges {2, 15}, {8, 9} and {1, 2} etc., appear
twice when traversing a particular face. This observation leads to the proof of Euler’s theorem
for planar graphs which is the next result.

Theorem 7.12.5. [Euler formula] Let G be a connected plane graph with f as the number of
faces. Then,
|G| − ‖G‖ + f = 2. (7.5)

Proof. We use induction on f . Let f = 1. Then, G cannot have a subgraph isomorphic to a
cycle. For if G has a subgraph isomorphic to a cycle then in any planar embedding of G, f ≥ 2.
Therefore, G is a tree and hence, with |G| = n, we have |G| − ‖G‖ + f = n − (n − 1) + 1 = 2.
So, assume that Equation (7.5) is true for all connected plane graphs with fewer than n faces,
where n ≥ 2, and let G be a connected plane graph with f = n. As f ≥ 2, G has a cycle. Now,
choose an edge e on a cycle, so that e is not a cut-edge.
Then, G − e is still a connected graph. Also, the edge e is incident with two separate faces
and hence its removal will combine the two faces and thus G − e has only n − 1 faces. Thus,

|G| − ‖G‖ + f = |G − e| − (‖G − e‖ + 1) + n = |G − e| − ‖G − e‖ + (n − 1) = 2

using the induction hypothesis. Hence, the required result follows.


Lemma 7.12.6. Let G be a plane bridgeless graph with ‖G‖ ≥ 2. Then, 2‖G‖ ≥ 3f . Further,
if G has no cycle of length 3 then, 2‖G‖ ≥ 4f .

Proof. For each edge put a dot on either side of the edge, so that the total number of dots is 2‖G‖.
As G is bridgeless, the boundary of each face contains a cycle and so each face has at least three
edges, each contributing a dot to it. So, the total number of dots is at least
3f . Further, if G does not have a cycle of length 3, then each face has at least four edges and
hence 2‖G‖ ≥ 4f .

Theorem 7.12.7. The complete graph K5 and the complete bipartite graph K3,3 are not planar.

Proof. If K5 is planar, then consider a plane representation of it. By Equation (7.5),
f = 2 − 5 + 10 = 7. But, by Lemma 7.12.6, one has 20 = 2‖K5 ‖ ≥ 3f = 21, a contradiction.
If K3,3 is planar, then consider a plane representation of it. Note that it does not have a C3 .
Also, by Euler’s formula, f = 2 − 6 + 9 = 5. Hence, by Lemma 7.12.6, one has
18 = 2‖K3,3 ‖ ≥ 4f = 20, a contradiction.

Definition 7.12.8. Let G be a graph. Then, a subdivision of an edge uv in G is obtained by

replacing the edge by two edges uw and wv, where w is a new vertex. Two graphs are said to
be homeomorphic if they can be obtained from the same graph by a sequence of subdivisions.

For example, for each m, n ∈ N, the paths Pn and Pm are homeomorphic. Similarly, all the
cyclic graphs are homeomorphic to the cycle C3 if our study is over simple graphs. In general,
one can say that all cyclic graphs are homeomorphic to the graph G = (V, E), where V = {v}
and E = {e, e} (i.e., a graph having exactly one vertex and a loop). Also, note that if two graphs
are isomorphic then they are also homeomorphic. Figure 7.20 gives examples of homeomorphic
graphs that are different from a path or a cycle.

Figure 7.20: Homeomorphic graphs

Theorem 7.12.9. [Kuratowski, 1930] A graph is planar if and only if it has no subgraph
homeomorphic to K5 or K3,3 .

Proof. Omitted.
We have the following observations that directly follow from Kuratowski theorem.

Remark 7.12.10. 1. Among all simple connected non-planar graphs

(a) the complete graph K5 has minimum number of vertices.

(b) the complete bipartite graph K3,3 has minimum number of edges.

2. If Y is a non-planar subgraph of a graph X then X is also non-planar.


Definition 7.12.11. Let G be a graph. Define a relation on the edges of G by e1 ∼ e2 if either

e1 = e2 or there is a cycle containing both these edges. Note that this is an equivalence relation.
Let Ei be the equivalence class containing the edge ei . Also, let Vi denote the endpoints of the
edges in Ei . Then, the induced subgraphs hVi i are called the blocks of G.

Proposition 7.12.12. A graph G is planar if and only if each of its blocks is planar.

Proof. Omitted.

Definition 7.12.13. A graph is called maximal planar if it is planar and addition of any
more edges results in a non-planar graph. A maximal plane graph is necessarily connected.

Proposition 7.12.14. If G is a maximal planar graph with |G| ≥ 3 vertices, then every face is
a triangle and ‖G‖ = 3|G| − 6.

Proof. Suppose there is a face, say f , described by the cycle [u1 , . . . , uk , u1 ], k ≥ 4. Then, we
can take a curve joining the vertices u1 and u3 lying totally inside the region f , so that G + u1 u3
is planar. This contradicts the fact that G is maximal planar. Thus, each face is a triangle. It
follows that 2‖G‖ = 3f . As |G| − ‖G‖ + f = 2, we have 2‖G‖ = 3f = 3(2 − |G| + ‖G‖) or
‖G‖ = 3|G| − 6.
Exercise 7.12.15. 1. Suppose that G is a plane graph such that each face is a 4-cycle. What
is the number of edges in G?
Ans: 2|G| − 4

2. Show that the Petersen graph has a subgraph homeomorphic to K3,3 .

Ans:

Figure 7.21: Homeomorphic graphs

3. Show that a plane graph on ≥ 3 vertices can have at most 2|G| − 5 bounded faces.
Ans: The graph has a maximum number of faces when it is maximal planar. By Euler’s
formula |G| − (3|G| − 6) + f = 2. Thus, f = 2|G| − 4. That is, the number of bounded faces
is at most 2|G| − 5.
4. Let G be a plane graph with f faces and k components. Prove that |G| − kGk + f = k + 1
(use induction).
5. If G is a plane graph without 3-cycles, then show that δ(G) ≤ 3.
Ans: As G doesn’t have a 3-cycle, by Lemma 7.12.6, ‖G‖ ≥ 2f . So, by Euler’s formula
‖G‖ = |G| + f − 2 ≤ |G| − 2 + ‖G‖/2. Thus, ‖G‖ ≤ 2|G| − 4. Thus, nδ(G) ≤ Σ d(v) =
2‖G‖ ≤ 4|G| − 8. Hence, by PHP, there is a vertex of degree ≤ 3.

6. Is it necessary that a plane graph G should contain a vertex of degree less than 5?
Ans: No. For example, the graph of the icosahedron is planar and 5-regular.

7. Show that any plane graph on ≥ 4 vertices has a vertex of degree at most five.
Ans: Using Proposition 7.12.14, nδ(G) ≤ Σ d(v) = 2‖G‖ ≤ 6|G| − 12. Hence, δ(G) ≤ 5.
8. Show that any plane graph on ≥ 4 vertices has at least four vertices of degree at most five.
Ans: Note that using Proposition 7.12.14, Σ d(v) = 2‖G‖ ≤ 6|G| − 12. Let d1 ≤ d2 ≤ · · ·
be the degrees of the vertices. We need to show that d1 ≤ · · · ≤ d4 ≤ 5.
Let if possible d4 ≥ 6. Then 6|G| − 12 ≥ Σ d(v) ≥ d1 + d2 + d3 + 6(|G| − 3) and hence
d1 + d2 + d3 ≤ 18 − 12 = 6. Thus, d1 ≤ 2. If d1 = 2, then deleting that vertex and using
Proposition 7.12.14, we have ‖G‖ − 2 ≤ 3(|G| − 1) − 6, so that ‖G‖ ≤ 3|G| − 7. Thus,
Σ d(v) ≤ 6|G| − 14. Repeating the argument, we have d1 + d2 + d3 ≤ 4 so that d1 ≤ 1, a
contradiction. Similarly, d1 ≠ 1, 0.

9. Produce a planar embedding of the graph G that appears in Figure 7.22.

Figure 7.22: A graph on 8 vertices (vertex labels 1, . . . , 8)

7.13 Vertex Coloring

Definition 7.13.1. A graph G is said to be k-colorable if the vertices can be assigned k colors
in such a way that adjacent vertices get different colors. The chromatic number of G, denoted
χ(G), is the minimum k such that G is k-colorable.

Exercise 7.13.2. Every connected bipartite graph on ≥ 2 vertices has chromatic number 2.

Theorem 7.13.3. For every graph G, χ(G) ≤ ∆(G) + 1.

Proof. If |G| = 1, the statement is trivial. Assume that the result is true for |G| = n and let G
be a graph on n + 1 vertices. Let H = G − 1. As H is (∆(G) + 1)-colorable and d(1) ≤ ∆(G),
the vertex 1 can be given a color other than its neighbors.
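
The induction in the proof is exactly the greedy coloring procedure: color the vertices one at a time and give each vertex the smallest color not used on its already colored neighbors. A minimal sketch, assuming the graph is given as a dictionary `adj` from vertices to neighbor sets and colors are the integers 0, 1, 2, . . . (these conventions and the function name are our own):

```python
def greedy_coloring(adj):
    """Color vertices in dictionary order with the smallest available color."""
    color = {}
    for v in adj:
        used = {color[w] for w in adj[v] if w in color}
        # Among the colors 0, ..., d(v), at least one is unused by the neighbors.
        color[v] = next(c for c in range(len(adj[v]) + 1) if c not in used)
    return color


C5 = {i: {(i - 1) % 5, (i + 1) % 5} for i in range(5)}
K4 = {i: {j for j in range(4) if j != i} for i in range(4)}
print(greedy_coloring(C5))    # uses 3 = Delta(C5) + 1 colors
print(greedy_coloring(K4))    # uses 4 = Delta(K4) + 1 colors
```

Both examples attain the bound ∆(G) + 1. The number of colors used by the greedy procedure depends on the order of the vertices, so in general it only gives an upper bound on χ(G).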

Theorem 7.13.4. [Brooks, 1941] Every connected non-complete graph G which is not an odd
cycle has χ(G) ≤ ∆(G).

Theorem 7.13.5. [5-color Theorem] Every planar graph is 5-colorable.

Proof. Suppose the result is false and let G be a planar graph with the least number of vertices,
say n, that is not 5-colorable. Then, n ≥ 6. Let G have m edges. Then, by Proposition 7.12.14,
m ≤ 3n − 6. So, nδ(G) ≤ 2m ≤ 6n − 12 and hence, δ(G) ≤ 2m/n < 6, that is, δ(G) ≤ 5. Let v be a
vertex of degree at most 5. Note that by the minimality of G, G − v is 5-colorable. Fix a
5-coloring of G − v. If the neighbors of v use at most 4 colors, then v can be colored with a
remaining color to get a 5-coloring of G. Else, v has exactly 5 neighbors and they get 5 different
colors. Take a planar embedding in which the neighbors v1 , . . . , v5 of v appear in clockwise
order and name the colors so that vi has color i.
For i ≠ j, let Vi be the set of vertices of G − v colored i and let H = G[Vi ∪ Vj ] be the
subgraph induced by the vertices colored i or j. If vi and vj are in different connected
components of H, then we can swap colors i and j in the component that contains vi , so that the
vertices v1 , . . . , v5 use only 4 colors. Thus, as above, in this case the graph G is 5-colorable.
Otherwise, there is a 1, 3-colored path between v1 and v3 and similarly, a 2, 4-colored path
between v2 and v4 . These two paths have no common vertex, and the first path together with v
forms a cycle that separates v2 from v4 . But this is not possible as the graph G is planar. Hence,
every planar graph is 5-colorable.


7.14 Representing graphs with Matrices

Definition 7.14.1. Let G = (V, E) be a simple (undirected) graph on vertices 1, . . . , n. Then,
the adjacency matrix A(G) of G (or simply A) is the n × n matrix [aij ] described by
\[
a_{ij} = \begin{cases} 1 & \text{if } \{i, j\} \in E, \\ 0 & \text{otherwise.} \end{cases}
\]

Let H be the graph obtained by relabeling the vertices of G. Then, note that
$A(H) = S^{-1} A(G) S$, for some permutation matrix S (recall that for a permutation matrix
$S^t = S^{-1}$). Hence, we talk of the adjacency matrix of a graph and do not worry about the
labeling of the vertices of G.

Example 7.14.2. The adjacency matrices of the 4-cycle C4 and the path P4 on 4 vertices are
given below.
\[
A(C_4) = \begin{bmatrix} 0 & 1 & 0 & 1 \\ 1 & 0 & 1 & 0 \\ 0 & 1 & 0 & 1 \\ 1 & 0 & 1 & 0 \end{bmatrix}, \qquad
A(P_4) = \begin{bmatrix} 0 & 1 & 0 & 0 \\ 1 & 0 & 1 & 0 \\ 0 & 1 & 0 & 1 \\ 0 & 0 & 1 & 0 \end{bmatrix}.
\]

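
The adjacency matrices of C4 and P4 and the effect of relabeling can be reproduced with a short sketch. The helper names adjacency_matrix and relabel are hypothetical, and the swap of the labels 2 and 3 is only an illustrative choice. The matrix returned by relabel equals $S^t A S$ for the permutation matrix S having a 1 in position (i, perm[i]).

```python
def adjacency_matrix(n, edges):
    """Adjacency matrix (list of rows) of a simple graph on the vertices 1, ..., n."""
    a = [[0] * n for _ in range(n)]
    for i, j in edges:
        a[i - 1][j - 1] = 1
        a[j - 1][i - 1] = 1
    return a


def relabel(a, perm):
    """Adjacency matrix after the (0-based) vertex i is given the new label perm[i]."""
    n = len(a)
    b = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            b[perm[i]][perm[j]] = a[i][j]
    return b


A_C4 = adjacency_matrix(4, [(1, 2), (2, 3), (3, 4), (4, 1)])
A_P4 = adjacency_matrix(4, [(1, 2), (2, 3), (3, 4)])

# Swap the labels 2 and 3 of P4 (0-based positions 1 and 2).
A_H = relabel(A_P4, [0, 2, 1, 3])
```

The two matrices A_P4 and A_H differ entry by entry, yet they have the same multiset of row sums (the degree sequence), which is one reason the labeling can be ignored.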
Exercise 7.14.3. 1. A graph G is not connected if and only if there exists a permutation
matrix P such that
\[
P^t A(G) P = \begin{bmatrix} A_{11} & 0 \\ 0 & A_{22} \end{bmatrix},
\]
for some square matrices A11 and A22 .

2. Two graphs G and H are isomorphic if and only if $A(G) = P^t A(H) P$, for some permutation
matrix P .

Theorem 7.14.4. The (i, j) entry of $B = A(G)^k$ is the number of i-j-walks of length k.

Proof. Note that by the definition of matrix product
\[
b_{ij} = \sum_{i_1, \ldots, i_{k-1}} a_{i i_1} a_{i_1 i_2} \cdots a_{i_{k-1} j}.
\]
Each summand is either 0 or 1. Thus, bij = r if and only if we have r sequences i1 , . . . , ik−1
with $a_{i i_1} = a_{i_1 i_2} = \cdots = a_{i_{k-1} j} = 1$. That is, bij = r if and only if we
have r walks of length k between i and j.
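
The statement of Theorem 7.14.4 can be checked on the 4-cycle. This is a minimal sketch with hypothetical helper names mat_mul and mat_pow, using plain lists of rows and vertices numbered from 0:

```python
def mat_mul(x, y):
    """Product of two matrices given as lists of rows."""
    return [[sum(x[i][t] * y[t][j] for t in range(len(y)))
             for j in range(len(y[0]))] for i in range(len(x))]


def mat_pow(a, k):
    """k-th power of a square matrix, by repeated multiplication."""
    n = len(a)
    result = [[int(i == j) for j in range(n)] for i in range(n)]
    for _ in range(k):
        result = mat_mul(result, a)
    return result


A_C4 = [[0, 1, 0, 1], [1, 0, 1, 0], [0, 1, 0, 1], [1, 0, 1, 0]]
walks2 = mat_pow(A_C4, 2)  # entry (i, j) counts the i-j-walks of length 2
walks3 = mat_pow(A_C4, 3)  # entry (i, j) counts the i-j-walks of length 3
```

For instance, walks3[0][1] is 4, matching the four walks [1, 2, 1, 2], [1, 2, 3, 2], [1, 4, 1, 2] and [1, 4, 3, 2] of length 3 from the vertex 1 to the vertex 2 in C4.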

Theorem 7.14.5. Let G be a graph of order n. Then, G is connected if and only if
$(I + A(G))^{n-1}$ is entrywise positive.

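Proof. Put B = I + A and let G be connected. As bii = 1 for each i, the diagonal entries of
$B^{n-1}$ are at least 1. So, let i ≠ j. If there is an i-j-path of length n − 1, then
$(B^{n-1})_{ij} \ge (A^{n-1})_{ij} \ge 1$. If P = [i, i1 , . . . , ik = j] is an i-j-path of
length k < n − 1, then $b_{ii} \cdots b_{ii}\, b_{i i_1} \cdots b_{i_{k-1} j} = 1$, where bii is
used n − 1 − k times. This product is a summand of $(B^{n-1})_{ij}$ and every summand is
nonnegative. Thus, $(B^{n-1})_{ij} > 0$.
Conversely, let $(B^{n-1})_{ij} > 0$, for some i ≠ j. Then, some summand
$b_{i i_1} b_{i_1 i_2} \cdots b_{i_{n-2} j}$ is positive. By throwing out the entries of the form
$b_{ll}$, for 1 ≤ l ≤ n, from this expression, we have an expression which corresponds to an
i-j-walk of length at most n − 1. As $B^{n-1}$ is entrywise positive, this holds for every pair
i ≠ j and it follows that G is connected.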
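
Theorem 7.14.5 gives a direct, though not the most efficient, test for connectedness. A minimal sketch, assuming the adjacency matrix is given as a list of rows (the function name is_connected and the two test graphs are illustrative):

```python
def is_connected(a):
    """Test whether (I + A)^(n-1) is entrywise positive."""
    n = len(a)
    b = [[a[i][j] + int(i == j) for j in range(n)] for i in range(n)]
    power = [[int(i == j) for j in range(n)] for i in range(n)]
    for _ in range(n - 1):
        power = [[sum(power[i][t] * b[t][j] for t in range(n))
                  for j in range(n)] for i in range(n)]
    return all(entry > 0 for row in power for entry in row)


# The path P4, and the disjoint union of two edges {1, 2} and {3, 4}.
path_p4 = [[0, 1, 0, 0], [1, 0, 1, 0], [0, 1, 0, 1], [0, 0, 1, 0]]
two_edges = [[0, 1, 0, 0], [1, 0, 0, 0], [0, 0, 0, 1], [0, 0, 1, 0]]
p4_connected = is_connected(path_p4)
two_edges_connected = is_connected(two_edges)
```

A breadth first search answers the same question with far less arithmetic; the matrix version is shown only because it mirrors the theorem.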

Exercise 7.14.6. Let G be a simple, undirected graph with adjacency matrix A.

1. Then the eigenvalues of A are all real.

2. The eigenvectors can be chosen to form an orthonormal basis of Rn .
3. If A has a rational eigenvalue then it has to be an integer.
4. If G is the complete graph Kn then A = J − I, where J is the matrix with each entry 1.
Further, in this case n − 1 is an eigenvalue with multiplicity 1 and −1 is an eigenvalue
with multiplicity n − 1.
5. Let $\overline{G}$ be the complement graph of G. Then, $A(\overline{G}) = J − I − A$.
6. If G is k-regular then k is an eigenvalue of A with the vector of all 1’s as an eigenvector.
Further,
(a) n − k − 1 is an eigenvalue of $A(\overline{G})$.
(b) if λ is an eigenvalue of A with an eigenvector orthogonal to the vector of all 1’s then
−1 − λ is an eigenvalue of $A(\overline{G})$.

7. If G is bipartite then there exists a permutation matrix P such that
\[
B = P^t A P = \begin{bmatrix} 0 & B_1 \\ B_1^t & 0 \end{bmatrix}.
\]
Further, prove that λ is an eigenvalue of A if and only if −λ is an eigenvalue of A.
Ans: $\begin{bmatrix} X \\ Y \end{bmatrix}$ is an eigenvector of B for λ if and only if
$\begin{bmatrix} X \\ -Y \end{bmatrix}$ is an eigenvector of B for −λ.
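
The answer to the last item can be verified by a short computation. The following is a sketch that writes the block form of a bipartite adjacency matrix as B, with the eigenvector split into blocks X and Y conforming to the two parts:

```latex
\[
\begin{bmatrix} 0 & B_1 \\ B_1^t & 0 \end{bmatrix}
\begin{bmatrix} X \\ Y \end{bmatrix}
= \lambda \begin{bmatrix} X \\ Y \end{bmatrix}
\iff B_1 Y = \lambda X \ \text{ and } \ B_1^t X = \lambda Y,
\]
\[
\begin{bmatrix} 0 & B_1 \\ B_1^t & 0 \end{bmatrix}
\begin{bmatrix} X \\ -Y \end{bmatrix}
= \begin{bmatrix} -B_1 Y \\ B_1^t X \end{bmatrix}
= \begin{bmatrix} -\lambda X \\ \lambda Y \end{bmatrix}
= -\lambda \begin{bmatrix} X \\ -Y \end{bmatrix}.
\]
```

As B and A are similar, they have the same eigenvalues, so the symmetry of the spectrum of B about 0 carries over to A.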

Definition 7.14.7. Let G be a graph with V (G) = {1, 2, . . . , n} and E(G) = {e1 , e2 , . . . , em }.
Let us arbitrarily give an orientation to each edge of G. For this fixed orientation, the
vertex-edge incidence matrix or in short, incidence matrix, Q(G) = [qij ] of G is an n × m
matrix whose (i, ej )-entry is described by
\[
q_{ij} = \begin{cases}
1 & \text{if edge } e_j \text{ originates at } i, \\
-1 & \text{if edge } e_j \text{ terminates at } i, \\
0 & \text{if edge } e_j \text{ is not incident with } i.
\end{cases}
\]


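Example 7.14.8. Consider the graph given below. It has V (G) = {1, 2, 3, 4, 5} and E(G) =
{e1 , e2 , . . . , e7 }. Thus, its incidence matrix is
\[
Q = \begin{bmatrix}
1 & 0 & 0 & 0 & 0 & 1 & 1 \\
-1 & -1 & 0 & 0 & 1 & 0 & 0 \\
0 & 1 & 1 & 0 & 0 & 0 & 0 \\
0 & 0 & -1 & -1 & -1 & 0 & -1 \\
0 & 0 & 0 & 1 & 0 & -1 & 0
\end{bmatrix}.
\]

[Figure: the graph on the vertices 1, . . . , 5 with the oriented edges e1 : 1 → 2, e2 : 3 → 2,
e3 : 3 → 4, e4 : 5 → 4, e5 : 2 → 4, e6 : 1 → 5 and e7 : 1 → 4.]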

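Exercise 7.14.9. Let G be a graph on n vertices and m edges, with incidence matrix Q.

1. Prove that $QQ^t$ = diag(d1 , d2 , . . . , dn ) − A, where diag(d1 , d2 , . . . , dn ) is the
diagonal matrix whose i-th diagonal entry is the degree di of the vertex i.
2. Prove that $Q^tQ$ is an m × m matrix with each diagonal entry 2 whose (j, k)-entry, for j ≠ k,
is ±1 if ej and ek have a common end vertex and 0 otherwise. Thus, up to the signs of its
off-diagonal entries, $Q^tQ$ equals 2I + A(L(G)), where A(L(G)) is the adjacency matrix of the
line graph, L(G) of G.
3. Let e be a vector of all 1’s. Then $e^t Q = 0^t$.
4. If G is connected then rank(Q) = n − 1.
5. Prove that the determinant of any square submatrix of Q lies in {−1, 0, 1} (that is, Q is
totally unimodular).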
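
Two of the identities in the exercise above can be checked numerically on a small oriented graph. This is a minimal sketch: the helper name incidence_matrix is hypothetical, and the list of oriented edges (tail, head) is an assumed orientation on 5 vertices and 7 edges chosen for illustration.

```python
def incidence_matrix(n, arcs):
    """n x m matrix with +1 where an edge originates and -1 where it terminates."""
    q = [[0] * len(arcs) for _ in range(n)]
    for j, (tail, head) in enumerate(arcs):
        q[tail - 1][j] = 1
        q[head - 1][j] = -1
    return q


# Assumed orientation: e1, ..., e7 listed as (tail, head).
arcs = [(1, 2), (3, 2), (3, 4), (5, 4), (2, 4), (1, 5), (1, 4)]
n, m = 5, len(arcs)
Q = incidence_matrix(n, arcs)

# The n x n product Q Q^t.
QQt = [[sum(Q[i][k] * Q[j][k] for k in range(m)) for j in range(n)] for i in range(n)]

# diag(d1, ..., dn) - A, built from the underlying undirected graph.
A = [[0] * n for _ in range(n)]
for u, v in arcs:
    A[u - 1][v - 1] = A[v - 1][u - 1] = 1
D_minus_A = [[(sum(A[i]) if i == j else 0) - A[i][j] for j in range(n)] for i in range(n)]

# e^t Q, that is, the vector of column sums of Q.
column_sums = [sum(Q[i][j] for i in range(n)) for j in range(m)]
```

Each column of Q has exactly one +1 and one −1, which is why every column sum vanishes, and the product $QQ^t$ does not depend on the orientation that was chosen.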

7.14.1 More Exercises

Exercise 7.14.10. 1. Can there be a graph in which the size of a minimum covering is |G|?
Ans: No.
2. Characterize G if the size of a minimum covering is |G| − 1.
Ans: Note that Kn has the property. If uv ∉ E(G), then V − u − v is a covering.
3. What relationship is there between the size of a minimum covering and α(G)?
Ans: The size of a minimum covering is n − α(G), as a set of vertices is a covering if and
only if its complement is an independent set.

4. Is it necessary that a plane graph G should contain a vertex of degree at most 5?
Ans: Yes. Any plane graph with n ≥ 3 vertices has at most 3n − 6 edges. Thus,
$\sum_{v} d(v) \le 6n - 12 < 6n$. By PHP we get the conclusion.
5. Is K5 − e planar, where e is any edge?
Ans: Yes.
6. Is K3,3 − e planar, where e is any edge?
Ans: Yes.
7. Is it true that in any group of 7 persons there are 3 mutual friends or 4 mutual strangers?
Ans: No. Consider C7 .
8. Prove/disprove: A two colorable graph is necessarily planar.
9. Draw the tree on the vertex set {1, 2, . . . , 12} whose Prüfer code is 9954449795.
10. How many chordal graphs are there on the vertex set {1, 2, 3, 4}?
Ans: 61: the 6 possible edges can be chosen in $2^6 = 64$ ways and the only graphs that are
not chordal are the 3 four-cycles.
11. Counting by diameter, how many nonisomorphic trees are there of order 7?
Ans: 11.
12. List the automorphisms of the following graph.
[Figure: a graph on the vertex set {1, 2, . . . , 6}.]

Ans: I, (2, 6)(3, 5), (1, 4)(2, 3)(5, 6), (1, 4)(2, 5)(3, 6)
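
The tree asked for in item 9 can be obtained mechanically. The sketch below assumes the usual convention for the Prüfer code (repeatedly delete the leaf with the smallest label and record its neighbor); the function names prufer_decode and prufer_encode are hypothetical, and the encoder is included only as a round-trip check.

```python
import heapq


def prufer_decode(code):
    """Edges of the tree on {1, ..., len(code) + 2} with the given Prüfer code."""
    n = len(code) + 2
    degree = [1] * (n + 1)  # index 0 is unused
    for x in code:
        degree[x] += 1
    leaves = [v for v in range(1, n + 1) if degree[v] == 1]
    heapq.heapify(leaves)
    edges = []
    for x in code:
        leaf = heapq.heappop(leaves)  # smallest current leaf
        edges.append((leaf, x))
        degree[x] -= 1
        if degree[x] == 1:
            heapq.heappush(leaves, x)
    # Exactly two vertices remain; they form the last edge.
    edges.append((heapq.heappop(leaves), heapq.heappop(leaves)))
    return edges


def prufer_encode(n, edges):
    """Prüfer code of a tree on {1, ..., n} given by its edge list."""
    nbrs = {v: set() for v in range(1, n + 1)}
    for u, v in edges:
        nbrs[u].add(v)
        nbrs[v].add(u)
    code = []
    for _ in range(n - 2):
        leaf = min(v for v in nbrs if len(nbrs[v]) == 1)
        parent = nbrs[leaf].pop()
        nbrs[parent].discard(leaf)
        del nbrs[leaf]
        code.append(parent)
    return code


code = [9, 9, 5, 4, 4, 4, 9, 7, 9, 5]
tree_edges = prufer_decode(code)
```

Under the stated convention, the vertices 9, 4, 5 and 7 are the only internal vertices of the tree, since a vertex appears in the code one time fewer than its degree.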
Bibliography

[1] G. Agnarson and R. Greenlaw, Graph Theory: Modelling, Applications and Algorithm,
Pearson Education.

[2] R. B. Bapat, Graphs and Matrices, Hindustan Book Agency, New Delhi, 2010.

[3] J. Cofman, “Catalan Numbers for the Classroom?”, Elem. Math., 52 (1997), 108 - 117.

[4] D. M. Cvetkovic, Michael Doob and Horst Sachs, Spectra of Graphs: theory and applications,
Academic Press, New York, 1980.

[5] D. I. A. Cohen, Basic Techniques of Combinatorial Theory, John Wiley and Sons, New
York, 1978.

[6] William Dunham, Euler: The Master of Us All, Published and Distributed by The Math-
T
AF

ematical Association of America, 1999.

[7] F. Harary, Graph Theory, Addison-Wesley Publishing Company, 1969.

[8] Victor J Katz, A history of mathematics, an intro, Harper Collins College Publishers, New
York, 1993.

[9] G. E. Martin, Counting: The Art of Enumerative Combinatorics, Undergraduate Texts in

Mathematics, Springer, 2001.

[10] R. Merris, Combinatorics, 2th edition, Wiley-Interscience, 2003.

[11] J. Riordan, Introduction to Combinatorial Analysis, John Wiley and Sons, New York, 1958.

[12] R. P. Stanley, Enumerative Combinatorics, vol. 2, Cambridge University Press, 1999.

[13] H. S. Wilf, Generatingfunctionology, Academic Press, 1990.

245
