Introduction and Logic Notes - 2
Introduction and Logic Notes - 2
One of the hurdles for a student encountering a rigorous calculus course for the rst time, is what level of detail is expected in a proof. If every statement is completely justied, the proof that 1 + 1 = 2 takes over 200 pages of Principia Mathematics (Russell & Whitehead). So obviously, some things must be taken for granted. Just how much? Knowing the answer requires some sophistication...for instructor and student to agree on what is to be assumed and what is to be justied. Often, precisely what a proof is is stated explicitly. For the record
Denition 1.1 A proof is a nite sequence of statements, each of which is an axiom, one of the hypotheses of the theorem, or follows from the preceding statements of the proof by elementary rules of inference, and the last statement of the proof is the conclusion of the theorem.
At the end of this chapter is an example of detailed mathematical reasoning, beyond what we shall require in this text, but detailed enough to indicate to the reader what truths rest upon what assumptions. 1
1.2
1.2.1
We shall often need to prove sentences of the form p = q where p and q are propositions. A proposition is a statement, like 2 is an integer or 4 is a prime. Of course, the rst of these propositions is true, and the second is false. The meaning of Equation 1.1 is taken as if p is true, then q is also true. We shall understand that this sentence is to be regarded as true if p and q are both true, or if p is false, regardless of the truth value of q . This has the same logical value as if q is false, then p also must be false. (See Exercise 1.) That is, q = p, where q stands for not q , the proposition which is true when q is false, and which is false when q is true. To say two sentences (involving propositions p, q, . . .) have the same logical value (or are equivalent) is to say that they are simultaneously either both true or both false, regardless of the truth value of p, q, . . .. The form in Equation 1.2 is called the contrapositive form of the sentence p = q . You should take a few minutes to convince yourself that the two forms have the same logical value. In many places in the book, we shall prove p = q by proving q = p. In summary, then, p = q if and only if q = p. We represent p and q by p q, (1.2) (1.1)
1.2. PROPOSITIONAL LOGIC AND THE PREDICATE CALCULUS which will be true if and only if both p and q are true. We represent p or q by p q, which will be true if and only if (1) p is true or (2) q is true or (3) both p and q are true.
In the exercises, you will be asked to verify that p = q is logically equivalent to the expression p q. Exercise 1 a. Make a truth table for p = q , by enumerating each of the four cases
(true,true), (true, false), (false, true), and (false, false) of values for (p, q ). Then decide for each of the cases whether p = q is true or false, and enter it in the second column in the table. p q p = q q = p p q
T T T F F F T F
b. Using the same truth table as in Part (a), decide for each of the four cases whether q = p is true or false. Enter your values in the third column of the table. Compare with p = q. Recall that two expressions are equivalent if they have the same truth value for all possible truth values of the variables (in this case p and q .) c. Using the same truth table as in Part (a), decide for each of the four cases whether p q is true or false. Compare with p = q. Another form of proof we shall employ is proof by contradiction: If the assumption of the proposition p leads to a contradiction (e.g. 0 = 1), then we may conclude that p is false, i.e. that p is true. Example 1 The familiar proof, due to Euclid, that there are innitely many primes, proceeds as follows: Suppose there were only nitely many primes, and we list ALL of them: p1 , p 2 , . . . , p k
CHAPTER 1. AXIOMS OF THE REAL NUMBER SYSTEM Now consider the integer n = 1 + p1 p2 . . . pk .
We claim that n is also prime, because for any i, 1 i k , if pi divides n, since pi divides p1 p2 . . . pk , it would divide their dierence, i.e. pi divides 1, impossible. Hence the assumption that pi divides n is false (since it led to contradiction), and hence pi does not divide n, for any i. Hence n is prime. But we listed ALL the primes above, as p1 , . . . pk , and n is not among them (since it is larger than each of them). This contradicts the assumption that we listed all the primes, and hence our assumption that there were only nitely many primes is false. Hence there are innitely many primes1 . In the above proof, note that we actually used the principle of proof by contradiction twice. The form of proof by contradiction written in propositional logic is the following: (p = FALSE) = p. If p = q and q = p then p and q are logically equivalent, and we write pq or p q.
1.2.2
Predicate Calculus
We shall often make and prove statements of the form for every x, P (x) which is written: xP (x) or there exists an x such that P (x) which is written: xP (x). Here we understand that P is a proposition (see above) with a variable x in it, which becomes either true or false when we substitute a number in for x. For example P (x) could be the statement
1
2000 years later, Euler gave a totally dierent proof. Eulers proof is much more sophisticated, and
was the rst proof which used Analysis to prove a result in number theory, and thus introduced the eld of Analytic Number Theory. See Chapter ** for his proof.
P (x) x is divisible by 2, a statement which is true when you substitute 4 for x, so P (4) is true, and P (122) as well. But P (3) is false. So, for the P (x) we just dened, the statement xP (x) is false, while the statement xP (x) is true. We understand the universe over which the quantiers range to be understood from the context. For our purposes, typically the universe will be the real numbers, or sometimes the natural numbers. Other universes are possible. Furthermore, the statement xP (x) will be regarded as true if there are NO xs in the universe, regardless of what x is! Thus, we are going to regard the sentence All unicorns have 5 feet. as true. But the statement xP (x) will be taken as false if there are no xs in the universe. Thus, the statement Some unicorns have 5 feet. will be taken as false. To say that it is false that for all x, P (x) is to say that there must be some x such that P (x) does not hold. The negation of xP (x)
6 is the sentence
xP (x), and the negation of xP (x) is the sentence xP (x). Quantiers are read left-to-right, like English, so we have the possibility of statements like xyP (x, y ). Supppose, for example, that P (x, y ) is the proposition x < y , where the variables x and y are understood to range over the real numbers. Then the statement xy (x < y ) is true, since for every real number x there is another real number (e.g. x + 1) which is greater than x. But the statement xy (x y ) is false, since it asserts there is a number (x) which is the smallest real number. Note also that as the quantiers are read left-to-right, in xy (x < y ), y depends upon x, but in xy (x y ), x does not depend upon y . Later on, we shall need to be familiar with reasoning such as, Above, we just concluded that in the domain of real numbers it is false that there is a number (x) which is less than every real number (y ). Hence xy (x y ), which is equivalent to xy (x y ), i.e., xy (x > y ),
which is the statement that for every real number (x) there is another real number y smaller than it (e.g. y = x 1.) Also when the existence of an x with a certain property is asserted, we shall frequently name one, say x0 , as in the following: ...f (x) is not continuous on all of [0, 1] and hence there exists an x [0, 1] such that f is not continuous at x. Let x0 be such an x. Exercise 2 Construct the negation of p = q. Exercise 3 Construct the negation of x[P (x) = Q(x)]. Exercise 4 Construct the negation of xy [P (x, y ) = Q(x, y )]. Exercise 5 Construct the negation of xy z [P (x, y, z ) = Q(x, y, z )]. Exercise 6 The denition of f is continuous at x = a is the following: for every > 0 there exists a > 0 such that for all x, if |x a| < then |f (x) f (a)| < . a. Write this as a logical expression in the Predicate Calculus, identifying the component pieces. (Assume that the universe of and is all positive numbers.)
1.3
1.3.1
The real numbers R = (, ) form a set which is also a eld, as follows: There are two binary operations on R, addition and multiplication, which satisfy a set of axioms which make the set R a commutative group under addition: (all quantiers in what follows are assumed to be over the universe of real numbers, R.) 1. For every x and y , x + y = y + x. (Commutativity)
CHAPTER 1. AXIOMS OF THE REAL NUMBER SYSTEM 2. For every x, y and z , x + (y + z ) = (x + y ) + z. (Associativity) 3. There exists an element x such that for every y , x + y = y . (We call this element the additive identity, and after proving that it is unique, we label it 0. See Theorem 1.1 for a proof of uniqueness.) Restated: x[0 + x = x]. (Identity) 4. For every x there exists a y such that x + y = 0. (Additive inverse). Note that from the order of the quantiers, y depends upon x. We usually denote this y by x. The non-zero elements of R form a commutative group under multiplication: 1. For every x and y x y = y x. (Commutativity) 2. For every x, y and z , x (y z ) = (x y ) z. (Associativity) 3. There exists an x such that for every y, x y = y . (We call this element the multiplicative identity, and after proving that it is unique, we label it 1. See Theorem 1.1 below for a proof of uniqueness.) Restated: x[1 x = x]. (Identity) 4. For every x there exists a y such that x y = 1. (Multiplicative inverse). Note that from the order of the quantiers, y depends upon x. We usually denote y by x1 , or by 1/x. To relate the additive group structure to the multiplicative structure, we require that multiplication distribute over addition: 5. For every x, y and z , x (y + z ) = (x y ) + (x z ). We need one additional axiom to ensure the eld contains at least two elements: 1. There exists at least one element a = 0. This guarantees us two elements, since by the axioms of the additive group, there exists
an additive identity (0.) We shall also omit the multiplication symbol, and write ab for a b when the context makes this clear.
1.3. PROPERTIES OF R, THE REAL NUMBERS: Theorem 1.1 In R the additive identity and multiplicative identity are unique.
Proof: Suppose that there are two elements, 0 and 0 which both satisfy the axiom of identity: For every x R, 0 + x = x and For every x R, 0 + x = x. Then 0 = 0 + 0 = 2 0 + 0 = 0, i.e. 0 = 0 . The proof that the multiplicative identity is unique is similar3 . Theorem 1.2 For every x R, x 0 = 0. Proof: x 0 = x(0 + 0) = x0 + x0 so 0 = x0 + (x0) = (x0 + x0) + (x0) = x0 + (x0 x0) = x0. (We are writing a b for a + (b), of course.) (1.3) (1.4)
Theorem 1.3 x = (1)x Proof: See Exercise 11. Exercise 7 Prove that if a + x = x for some x, then a = 0. Note that with this result we could conclude immediately from Equation 1.3 that x0 = 0. Exercise 8 Prove that if 1 and 1 are both multiplicative identities in R, then 1 = 1 . Exercise 9 Justify the two steps in equation 1.3 and the four steps in equation 1.4.
2 3
10
Exercise 10 Prove: (1)(1) = 1. Justify every step. Hint: what does (1) stand for? Exercise 11 Prove: x = (1)x. [Hint: show that (1)x is the additive inverse to x.] Exercise 12 Prove (x) = x. [Hint: what does (x) denote?]
1.3.2
The above axioms, A1-A4, and M1-M5, together with the requirement that there are at least two elements (i.e. that 0 = 1), make R a eld. R is actually a complete ordered eld. We will discuss completeness in Section 1.5. To obtain an ordered eld, we need to require an additional set of axioms, about the order relations among the elements, as follows: There is a binary relation4 , on R with the following properties: 1. For every x and y, x y and y x implies x = y . 2. For every x and y, either x y or y x. These rst two axioms together are called the Axiom of Trichotomy (because there are three possible relationships between x and y .) 3. For every x, y , and z, x y and y z implies x z . 4. For every x, y , and z, y z implies x + y x + z. 5. For every x, y , and z, y z and 0 x implies xy xz . We shall often used a strengthened version of Order Axiom 4, which we now prove. Theorem 1.4 For all x, y and z, if y < z then x + y < x + z.
4
The reader who wishes a precise denition of relation will nd one in Section 1.7
11
Proof: From Order Axiom 4, under the hypotheses of the theorem, x + y x + z. If x + y = x + z , by adding the additive inverse of x to both sides, using Associativity, Commutivity and Identity, we obtain5 y = z. But this contradicts the hypothesis of the theorem. Hence our supposition that x + y = x + z was false, and that together with x + y x + z implies that x + y < x + z. QED
The corresponding multiplicative theorem is the following Theorem 1.5 For all x, y and z, if y < z and x > 0 then xy < xz. Proof: The hypotheses of the theorem and Order Axiom 5 imply that xy xz. But if xy = xz , since x = 0, we can multiply both sides by 1/x and obtain y = z, a contradiction. Hence xy = xz , and the theorem is proved. Theorem 1.6 For all x, x < 0 x > 0. Proof: x<0 implies x + (x) < 0 + (x) implies 0 < x, by Additive Inverse, Theorem 1.4, and Additive Identity.
5
12
CHAPTER 1. AXIOMS OF THE REAL NUMBER SYSTEM Conversely, if 0 < x, then 0 + x < x + x
which implies x < 0, again by Theorem 1.4, Additive Identity, and Additive Inverse. Exercise 13 Prove that under the asssumption that a eld has at least two elements, then 0 = 1. Exercise 14 Verify that the set {0} satises all the axioms for an ordered eld, except for 0 = 1. Note: just exactly what plays the role of 1, the multiplicative identity? Exercise 15 Prove: 0 < 1. [In light of the preceding exercise, note that you MUST use 0 = 1. Why?] Exercise 16 Prove: a < b b < a. [Hint: Make use of Theorem 1.4.] Exercise 17 Prove: if a > 0 and b > 0 then a < b a2 < b2 . Exercise 18 Prove that for any x, x2 0. Exercise 19 Prove that a eld has no zero-divisors, i.e. if ab = 0 then either a = 0 or b = 0.
1.3.3
Denition 1.2 If a and b are real numbers, and a < b, we write [a, b] to denote the set of xs for which a x b. We write (a, b) to denote the set of xs for which a < x < b. Intervals of the form [a, b] are called closed intervals and intervals of the form (a, b) are called open intervals for reasons which will become clear when we study the topology of the real numbers in Chapter 9. It is also possible to have half-closed, half-open intervals. Note that closed intervals contain their endpoints, while open intervals do not. We shall also write (a, ) to denote the set of xs for which a < x and similarly (, a] to denote the set of xs for which x a.
1.3. PROPERTIES OF R, THE REAL NUMBERS: Denition 1.3 By the absolute value of x, denoted |x|, we mean |x| =
13
x, x,
if x 0. if x < 0.
Theorem 1.7 The following are properties of absolute value: 1. For all x, if x = 0 then |x| > 0. If x = 0 then |x| = 0. 2. |xy | = |x||y | for all real numbers x and y . 3. |x| x |x|, for all x. Proof: (1) By cases: if x > 0, then |x| = x > 0. If x < 0, |x| = x > 0. From the denition, |0| = 0. (2) also proved by cases. See Exercise 23. (3) By cases: if x > 0, |x| = x < 0 < x = |x|. If x < 0, |x| = x < 0 < x = |x|. If x = 0, |x| = x = |x|. In all three cases, then, |x| x |x|. The following will be needed literally dozens of times throughout the book. Needless to say, it is very important! Theorem 1.8 For all real numbers x and M , |x| M M x M. Proof: Suppose |x| M . Then by Exercise 16 above, M |x|, and by Theorem 1.7, M |x| x |x| M. Conversely, suppose M x M. Then M x, which implies that x M . But x M by assumption. Then since |x| = x or |x| = x, and in either case, x M , it follows6 that |x| M . Because the theorem above is so important, we give an example, and then have several homework problems which involve it.
6
The interested reader may wish to note that the form of this reasoning, so called on the horns of a
dilemma, is: [p r, q r, p q ] r.
14
Example 2 Find the set of y s for which |y L| . This is used heavily in our study of limits, in Chapter 2. By the theorem above, yL , which is equivalent to L y L+ . So, the set of y s is the interval [L , L+ ]. This reasoning will be used frequently throughout the book. A similar theorem holds for strict inequalities: Theorem 1.9 For all real numbers x and M , |x| < M M < x < M. The proof is similar to the proof above.
Exercise 20 Prove Theorem 1.9. Exercise 21 Find the set of xs for which |x 3| 4. Exercise 22 Find the set of y s for which |y L| < . Exercise 23 Prove |xy | = |x||y |, for all x, y R. Theorem 1.10 (Triangle Inequality) For all real numbers x, y , |x + y | |x| + |y |. Proof: [We call this triangle inequality because of the geometric interpretation when x and y are vectors in R2 . This will have nothing to do with our proof, however.] |x| x |x|
1.4. SUPREMA AND INFIMA: SUPS AND INFS and |y | y |y |. Adding these two double inequalities yields (|x| + |y |) = |x| |y | x + y |x| + |y | But then |x + y | |x| + |y | QED from Theorem 1.8
15
1.4
Denition 1.4 A set is non-empty if it possesses at least one element. The empty set, denoted , is the set consisting of no elements. More on this in Section 1.7 Denition 1.5 A non-empty set E is said to be bounded above if there exists an M such that for every x E, x M. Denition 1.6 A non-empty set E is said to be bounded below if there exists an N such that for every x E, x N. Denition 1.7 A non-empty set E is said to be bounded if there exists M, N such that for every x E, N x M. It is obvious from the denitions that a set E is bounded if and only if E is bounded above and E is bounded below. Example 3 [0, 1] is bounded above, by 10, for example. [0, 1] is also bounded above by 1. It is bounded below by 10. Therefore, [0, 1] is bounded as well. Example 4 [0, ) is bounded below (by 0), but not bounded above. For suppose [0, ) were bounded above by M , say. Then M + 1 [0, ) implies M + 1 M, a contradiction.7
7
To what, exactly?
16
Example 5 (, ) is neither bounded below nor bounded above. Example 6 {q Q : q 2 < 2} is bounded, as it is contained in the interval [100, 100]. Denition 1.8 M is an upper bound for the non-empty set E if x M for all x E. Denition 1.9 L is the least upper bound for the non-empty set E if 1. L is an upper bound for E . 2. If M is any upper bound for E , L M . That is, of all the upper bounds for E , L is the smallest. We denote L is the least upper bound for E by L = l.u.b. E or L = sup E, where sup is an abbreviation for the latin: supremum.
Example 7 10 is an upper bound for {3, 6, 9}. Example 8 23 is also an upper bound for {3, 6, 9}. Example 9 sup{3, 6, 9} = 9. In the above examples, things are simple, because for a nite set E , sup E = max E.
Example 10 10 is an upper bound for (0, 1). Example 11 23 is also an upper bound for (0, 1).
1.4. SUPREMA AND INFIMA: SUPS AND INFS Example 12 1 is the least upper bound for (0, 1). That is, sup(0, 1) = 1.
17
This last example is the rst non-obvious one. Certainly, 1 is an upper bound for (0, 1), since (0, 1) = {x : 0 < x < 1}.8 So property (1) of the denition of least upper bound is satised. To show property (2), we must show that if M is any upper bound for (0, 1), then 1 M . We claim that no number less than 1 can be an upper bound for (0, 1). Clearly if x 0 then x cannot be an upper bound for (0, 1). If 0 < r < 1 then 0 < r <
1+r 2
< 1, which
implies that r is not an upper bound for (0, 1). (Precisely why?) Since every number less than 1 is not an upper bound for (0, 1), then if M is an upper bound for (0, 1), then M 1. This establishes property (2). It follows that 1 is the least upper bound for (0, 1), i.e., sup(0, 1) = 1. More Examples Example 13 sup(0, ) does not exist, because (0, ) is not bounded above. Example 14 sup{0, 1/2, 2/3, 3/4, . . .} = 1. The proof for this will have to wait until after we introduce the Axiom of Completeness. Denition 1.10 M is an lower bound for the non-empty set E if x M for all x E. Denition 1.11 G is the greatest lower bound for the non-empty set E if 1. G is a lower bound for E . 2. If M is any lower bound for E , G M , that is, of all the lower bounds for E , G is the largest. We denote G is the greatest lower bound for E by G = g.l.b. E
8
18 or
Example 15 1 is an lower bound for {3, 6, 9}. Example 16 -23 is also an lower bound for {3, 6, 9}. Example 17 inf {3, 6, 9} = 3. In the above examples, things are simple, again because for a nite set E , inf E = min E. Theorem 1.11 (Approximation Property for Suprema) Let E be a non-empty set bounded above, and suppose there exists s = sup E . Then for every > 0 there is an element of E in the interval (s , s]. Proof: If s E , the theorem is trivially true. If s E , suppose to the contrary that in the interval (s , s) there is no element of E . Since s E , s bound for E , contradicting the hypothesis that s was the least. Corollary 1.12 (Approximation Property for Inma) Let E be a non-empty set bounded below, and suppose there exists t = inf E . Then for every the interval [t, t + ). Proof: See Exercise 31. > 0 there is an element of E in is therefore also an upper
Denition 1.12 Let E = . Then E = {e : e E }. Exercise 24 Determine sup{.3, .33, .333, .3333, . . .}. No need for a proof at this time. Exercise 25 A set E is bounded if and only if there exists an M so that |x| M for all x E.
19
Exercise 26 M is an upper bound for the set E if and only if M is a lower bound for the set E . Exercise 27 E is bounded above if and only if E is bounded below. Exercise 28 E is bounded if and only if E is bounded. Exercise 29 Is the empty set bounded? Proof? Exercise 30 sup E = inf {E }. This shows that any result we obtain about sups can be translanted to a result about infs, and conversely. Exercise 31 Prove the Approximation Property for Inma. Exercise 32 Prove: A B = inf B inf A sup A sup B , for A and B bounded, non-empty sets. Exercise 33 a. sup[a, b] = sup(a, b) = b.
b. inf[a, b] = inf(a, b) = a. Exercise 34 Let E be a non-empty set bounded above. Let F = {x : x is an upper bound for the set E }. Suppose inf F and sup E exist. Prove: inf F = sup E.
1.5
Axiom of Completeness
We have one nal axiom which we need to assume about R, which will distinguish R from Q, the ordered eld of rational numbers. This property is called completeness. We shall see that it is an essential property of the real number system, one which allows us to conclude that Cauchy sequences converge, that the Intermediate Value Theorem is true, the Extremal properties for continuous functions, etc., a great variety of properties and theorems all of which are consequences of the completeness of the real number system. Axiom of Completeness: Every non-empty set of real numbers E which is bounded above has a supremum.
20
The Axiom of Completeness cannot be proved, in the context of the real numbers. It must be assumed, which is what we do with all axioms. The Completeness Axiom is what distinguishes the real number eld from the eld of rational numbers, which is a subeld of the real numbers. The rational numbers presented some trouble to the ancient Greek mathematicians. They came to understand that there were lengths that could be constructed from rational numbers, but which themselves were not rational: They knew that the diagonal of a square of side length 1 was of a length which was not a rational number.9 The Greeks tried to operate within the eld of rational numbers, but were led outside it (as we saw, with the construction of 2.) The Axiom of Completeness is just what we need to complete the real numbers. There will be no operations which lead us outside the real numbers. We shall see, later, that sequences which ought to converge, do converge. This is a property which is not shared by the rational numbers, as we shall also see. We shall see some of the consequences of the Axiom of Completeness shortly, after we introduce the Natural Numbers. Theorem 1.13 (Completeness Axiom: Version II) Every non-empty set bounded below has an inmum. Proof: Let E be a non-empty set, bounded below, say by M . Then (see Exercise 30) E =
9
2 = p/q , for some integers p, q. After dividing out any common factors, we may assume that p and q contain no common factors. Then squaring both sides, p2 = 2 q 2 .
Then p2 is an even number (because it is a multiple of 2), which implies that p is an even number (since the square of an odd number is odd), so p = 2k for some integer k , so 2q 2 = p2 = (2k )2 = 4k 2 , and after dividing both sides by 2, q 2 = 2k 2 which implies that q must be even (reasoning as we did, above, with p). But that is a contradiction, because then p and q have a common factor, 2. So, there cannot be such a p and q , i.e. 2 is not rational.
21
{e : e E } is non-empty, and bounded above by M . By the Axiom of Completeness, E has a least upper bound, say L. It is routine to verify then that L is the greatest lower bound for E . Exercise 35 Give an alternate proof of the Completeness Axiom for infs, based on the following ideas: Let E be a non-empty set bounded below by L. a. Let F = {x R : x is a lower bound for E }. Prove sup F exists. Let f = sup F. b. Using the Approximation Property for sups, prove that if e E and e < f , then there exists f F, e < f , a contradiction. Conclude that f is a lower bound for E . c. Prove: for any > 0, f + is not a lower bound for E . d. Prove f = sup F is the greatest lower bound for E , that is, inf E = sup F.
1.5.1
Now we have enumerated the axioms of the real numbers: they form a Complete, Ordered Field. These properties characterize the real numbers, in that it can be proved (usually in a more advanced course on this subject) that any complete ordered eld is equivalent to the real numbers, in the sense that there will be a one-to-one correspondence between the elements of the real numbers and those of the other eld, and that this correspondence will preserve the properties of + and , as well as the order properties.
1.6
We start with the Peano axioms, from which all the properties of N can be derived. We are going to axiomatize the set of positive integers, which we also call the Natural Numbers. We denote this set by N . Let N be a non-empty set, with a 1-1 function10 s(n) (which we shall be thinking of as s(n) = n + 1 and called the successor of n) dened on it, with the following properties:
10
This is actually dened in the next section, as Denition 1.21. Since we need it here, you may take the
22
CHAPTER 1. AXIOMS OF THE REAL NUMBER SYSTEM 1) There is an element (which we call 1) in N which is not in the range of s(n). 2) For every subset M N , if M has the following two properties: (a) 1 M. (b) For every n N , n M s(n) M (a set M with this property (b) is called inductive.) then M = N .
Examples of Inductive Sets Example 18 M =N is an inductive set, because s(n) N for every n. Example 19 M = is an inductive, set, because no n is in M , so the hypothesis in (b) above is trivially satised. Example 20 M = {3, 4, 5, 6, . . .} is an inductive set. (2) above is usually called the Axiom of Induction, and we will make use of it thoughout the text. Here is an example of its use: Theorem 1.14 For any real number a 1, and all positive integers n: (1 + a)n 1 + na. Proof: Let a 1 be xed. Let M be the set of natural numbers for which the above formula holds, i.e. M = {n N : (1 + a)n 1 + na}. Note that since (1 + a)1 = 1 + a, 1 M.
1.6. PROPERTIES OF N , THE NATURAL NUMBERS: Now suppose n M (this is often called the induction hypothesis). Then (1 + a)n+1 = (1 + a)(1 + a)n (1 + a)(1 + na) by the Induction hypothesis = 1 + na + a + na2 1 + (n + 1)a since na2 is positive
23
i.e. n + 1 M . Then M is inductive. But then M satises properties 2(a) and 2(b), and therefore M = N . That is, the formula holds for all positive integers n.11 In the above proof, note what we did: to prove that a certain property holds for every n, we form a set (M ) of natural numbers for which the property holds, and then show two things: 1 M , and M is inductive. Then from Peano Axiom 2 it follows that M = N , i.e. that the property holds for all n. Theorem 1.15 (Corollary to the Axiom of Induction: Well- ordering Principle for N ) Every non-empty set E of natural numbers has a smallest element. Proof: If 1 E , then 1 is the smallest element of E . We proceed by assuming that 1 E , i.e., 1 E c .12 Now let S be the set dened by the following property: n S {1, 2, . . . , n} E c . Clearly, 1 S. Suppose S were inductive, i.e. suppose for every n, n S implies n + 1 S . Then by the Peano axioms, S = N , i.e. every n is in E c . But then E would be empty, a contradiction. So S cannot be inductive. That means that there is an n S , but n + 1 S . Then {1, 2, . . . , n} E c , but n + 1 E c . I.e., k E , for 1 k n, but n + 1 E . Then n + 1 is the smallest element of E . QED Exercise 36 Prove by induction, that 1 + 2 +...+ n =
11 12
n(n + 1) . 2
Where exactly did we use the hypothesis that a 1? E c , called E -complement, is the set of all elements (in the universe, in this case N ,) which are NOT
in the set E .
24
Exercise 37 Prove by induction, that 13 + 23 + . . . + n3 = (1 + 2 + . . . + n)2 . Hint: you will need to use the result from the preceding exercise. Exercise 38 The following exercise illustrates a method for showing that a property holds for all suciently large integers. a. Which natural numbers are in the set A = {k N inductive? Explain carefully. b. It is true, however, that A contains all natural numbers larger than a certain N0 . What is N0 ? c. Prove that A is almost inductive, i.e. that A has the property that (k N0 )[k A = k + 1 A]. [Hint: you will need to show that a certain other set is inductive to do this.] Exercise 39 Prove that k 2 > 2k + 1 if k 3. Exercise 40 Prove that I = {n : n2 + n + 1 is even} is an inductive set. Is it true that I = N ? Explain. Exercise 41 Prove the following Principle of Strong Induction: Suppose M is a subset of N , with the following properties: 1) 1 M. 2) For every n N , {1, 2, . . . , n} M = n + 1 M . Then M = N . (Hint: If M = N , then E = M c is not empty. Apply the Well-Ordering principle to E to get a contradiction to 2).) Theorem 1.16 N is not bounded above. (1.5) : k 2 < 2k }? Why is A not
1.6. PROPERTIES OF N , THE NATURAL NUMBERS: Proof: Suppose not13 . Then there exists an M such that if n N , then n M.
25
By the Completeness Axiom, since N is bounded above, s = sup N exists. Since s is the least upper bound for N , s 1/2 is not an upper bound for N . That is, there is an n N such that s 1 < n. But then s < n + 1, which contradicts the fact that s was an upper bound for N . QED. Theorem 1.17 (Archimedean Property for R) For any that 1/n < . Proof: > 0 implies 1/ > 0. Since N is not bounded above, there exists an n N such > 0 there exists n N such
that 1/ < n. Then 1/n < . Theorem 1.18 (Second Archimedean Property for R) For any real numbers a and b such that 0 < a < b there exists n N such that na > b. (Homely version: an arbitrarily large bathtub can be bailed out by an arbitarily small teaspooon.14 ) Proof: Let = a/b. By the rst Archimedean property, there is an n so that 1/n < a/b, i.e. na > b. QED Theorem 1.19 (The Rational Numbers are Dense in the Real Numbers) For any real numbers a, b, if a < b then there exists a q Q such that a < q < b.
13 14
What kind of proof, i.e. what form, do you expect this to be? Identify which of a, b, n is the teaspoon, and which is the bathtub. What role does n play?
26
CHAPTER 1. AXIOMS OF THE REAL NUMBER SYSTEM Proof: Case I: 0 a < b. Apply the Archimedean Property to = b a. Then there
exists an n N such that 0 < 1/n < b a. Then 1 < nb na, [At this point, the idea is that since nb and na are more than 1 apart, then there is an integer strictly between them. The remainder of the proof establishes this.] Since N is unbounded above, na is not an upper bound for N , and hence {m N : m > na} is not empty. It therefore has a least element by the well-ordering property, call it m0 . Then m0 > na. If m0 nb, then m0 1 nb 1 > na, contradicting the fact that m0 was the least such. Hence na < m0 < nb. Then a < m0 /n < b and m0 /n is our desired rational number. Case II: a < 0 < b. Then 0 is the desired rational. Case III: a < b 0. Then consider 0 b < a, apply Case I, nd the rational number q such that b < q < a. Then q is rational, and a < q < b. QED Examples Using the Axiom of Completeness Example 21 Consider the set E = {q Q : q 2 < 2}. This set is bounded above, because 1.5 is an upper bound for E . (proof?) The set is not empty (why?). Then by the Axiom of Completeness, sup E exists. Actually, we will show that sup E = 2. To see this we need to show two things:
27
To establish (1), note15 that 0 < q 2 < 2 = 0 < q 2 < 2, so that q E = q < 2. To establish (2), let M < 2. Then by the density of the rational numbers, there is a q Q so that M <q< or q2 < 2 which implies q E. But then M cannot be an upper bound for E since M < q . This establishes (2).16 Example 22 Let E = { 1 , 2 , 3 , . . .}. Then 2 3 4 sup E = 1. To see this, we reason as we did above: Clearly 1 is an upper bound for E . If r < 1, by the Archimedean property there exists an n N such that 1/n < 1 r. Then r < 1 1/n = n1 E, n
2,
i.e. r is not an upper bound for E . Therefore, 1 is the least upper bound for E , i.e. sup E = 1. Exercise 42 Prove the density of the irrationals: For all real numbers a, b where a < b, there exists an irrational c such that a < c < b. (Hint: the product of a rational and an irrational is irrational).
15 16
is equivalent to q = p.
28
Exercise 43 Prove: if E is bounded and non-empty, and s = sup E E , then for every > 0, there exist x, y E such that 0 < |x y | < . Exercise 44 Give an example of a bounded, non-empty set E such that the property in Problem 43 above does not hold.
1.7
Denition 1.13 A set is a collection of elements, usually here, real numbers, or natural numbers. We occasionally use the set-builder notation, where a set is dened by some property P (x),
A = {x R : x2 1} is way of dening the set A as the collection of all real numbers whose square is less than or equal to 1, i.e. A = [1, 1]. Why would we use the set-builder notation when we have a simpler way of dening the set A? It is because the property P (x) which characterizes the elements of A may not be so simple, as, for example, A = {x R : x3 + x2 3x 1}. We say A is a subset of B , and write AB if every element of A is also an element of B , i.e. x[x A = x B ].
29
A = B if and only if A B and B A, that is, two sets are equal if and only if they have precisely the same elements: x[x A x B ]. will be the symbol for the empty set, the set which consists of no elements.17 Denition 1.14 The cartesian product of two sets A and B , denoted A B , is the set of all ordered pairs of the form (a, b), where a is an element of A, and B is an element of B: A B = {(a, b) : a A, b B }. Denition 1.15 A relation R on a set A is a subset of A A. Example 23 The relation L = < on R is the subset of R R: L = {(x, y ) : x < y }, which is the set in the cartesian plane of all points which lie above the principal diagonal: y = x. Thus, for example, (1, 3) L and (2, 5) L but (6, 4) L. Denition 1.16 An equivalence relation R on a set A is a relation R on a set A with three properties: a. For every a A, aRa. [Reexivity] b. For every a, b A, aRb = bRa. [Symmetry] c. For every a, b, and c A, aRb and bRc imply aRc. [Transitivity] Exercise 45 Prove that the relation aRb if and only if b a is a multiple of 9 is an equivalence relation on N .
17
The careful reader will have noted that by use of the word the when attached to empty set, we are
presuming that there are not possibly two dierent empty sets. This follows from the property above that two sets are the same (equal) if they have exactly the same elements. Thus, two empty sets are the same, because they have exactly the same elements.
30
Denition 1.17 A function f : A B is a subset of A B with the following properties: a. For every a A there exists a b B such that (a, b) f. b. If (a, b1 ) f and (a, b2 ) f then b1 = b2 . (This expresses the single-valuedness of the function.) We shall also use the word mapping interchangeably for the name function, especially as in f is a mapping from A to B . Denition 1.18 The range of f is the collection of elements of B that are images of elements of A: range of f = {b B : (a)[(a, b) f ]}. Denition 1.19 The domain of f is the collection of elements of A. domain of f = {a A : (b)[(a, b) f ]}. Denition 1.20 A function f : A B is called onto or onto B (and denoted f : A B ) if every element of B is in the range of f : (b B )(a A)(a, b) f. Denition 1.21 A function f is called one-to-one, denoted 1 1, if f (x) = f (y ) = x = y, i.e. two dierent elements in the domain of f do not correspond to the same element in the range. Denition 1.22 Let f be a function, f : A B . The inverse of f, denoted f 1 , is the set f 1 = {(b, a) : (a, b) f }. Theorem 1.20 If f is a 1-1 function from A onto B , then the inverse of f is a 1-1 function from B onto A.
onto
1.8. CARDINALITY: COUNTABLE AND UNCOUNTABLE SETS Proof: See Exercise 48. Exercise 46 Let A = {a, b, c},
31
function f : A B . Can you construct a function from A into B which is 1-1? Why (or why not)? Exercise 47 The function f (x) = x2 does not have an inverse as yet, as we have dened it, because x2 is not one-to-one, at least on the domain which is understood, namely (, ). However, if we restrict the domain of f to [0, ), it is easy to see that in this case x2 is a one-to-one function, so an inverse is possible to dene. The inverse of x2 : f 1 = {(x2 , x) : x [0, )} = {(y, y ) : y [0, )} from which it is seen that the inverse to x2 is the function function on [0, ). Exercise 48 Prove Theorem 1.20. Exercise 49 Dene the relation between sets: A B if and only if there exists a function f : A B which is 1-1 and onto. Then prove that is an equivalence relation on the class of all sets. x. Verify that x is a 1-1
1.8
Denition 1.23 Two sets A and B are called cardinally equivalent if there exists a function f : A B which is both 1-1 and onto. Informally, we think of A and B being of the same size when they are cardinally equivalent.18
18
Cardinal equivalence is an equivalence relation on the class of all sets. The Fundamental Theorem of
Equivalence Relations states that an equivalence relation on a set A partitions A into disjoint subsets called equivalence classes. Bertrand Russell dened cardinal number as the equivalence class of all sets which are equivalent under this equivalence relation. The cardinal number 3 was then the equivalence class consisting of all sets with exactly three elements. The equivalence class 3 then embodied the property of three-ness.
32
Denition 1.24 A set A is called nite if there exists an n N and a 1-1 function f whose domain is A and whose range is contained in the set {1, 2, . . . , n}, for some n N . Note: this allows the empty set to be nite. (Equivalently: the nite sets are the ones which are either empty, or cardinally equivalent to the initial segments {1, 2, . . . , n} of the natural numbers.) Denition 1.25 A set A which is not nite is called innite. There are two dierent kinds of innite sets: Denition 1.26 An innite set A which is cardinally equivalent to N is called countably innite or denumerable. A set which is either denumerable or nite will be called countable. Denition 1.27 An innite set which is not countably innite is called uncountable. Theorem 1.21 (Cantors Diagonalization Theorem) The real numbers in (0, 1) are uncountable. Proof: Let A = (0, 1). Suppose instead that A were countable. Then there would exist a 1-1 function g : N A. We list the range of g , as follows: g (1) = 0.a11 a12 a13 . . . g (2) = 0.a21 a22 a23 . . . g (3) = 0.a31 a32 a33 . . . ... where the as are the decimal expansions of the elements of A which are so enumerated by the function g . We now construct a number b (0, 1) such that b range(g ), a contradiction. Dene a sequence {b1 , b2 , . . .} by
onto
bk =
5 if akk = 6 6 if akk = 6
1.8. CARDINALITY: COUNTABLE AND UNCOUNTABLE SETS Note that bk = akk , for each k = 1, 2, . . . . Now consider the number b = 0.b1 b2 b3 . . . = b2 b1 b3 + 2 + 3 + ... . 10 10 10
33
Clearly, 0.5 < b < 0.7, so b (0, 1). But b was constructed so that b disagrees with each element in the range of g : If b = g (k ) for some k , g (k ) = 0.ak1 ak2 ak3 . . . then bk = akk , so b = g (k ) after all! This contradicts the assumption that A is a countably innite set, hence A is uncountable. The Cantor Diagonal Argument has been applied in areas of mathematics as diverse from Real Analysis as to prove that The Halting Problem is recursively unsolvable19 , and to prove Godels Incompleteness Theorem, that no system of logic suciently powerful to express arithmetic of natural numbers (e.g. what it is to be a prime number, multiplication, etc.) can express in an algorithmic way all the provable sentences (i.e. the Theorems) and only the provable sentences. (More precisely, the set of provable sentences in formal logic is a recursively enumerable, not recursive20 .) set.) Theorem 1.22 A subset of a countably innite set is either countably innite or nite. Proof: Suppose A B , B countably innite, and A is not nite. We prove A is countably innite by induction: Since B was assumed countably innite, there exists a 1-1 function g : N B . Let bk = g (k ), k = 1, 2, . . . and we construct the mapping: N A by choosing in order, k = 1, 2, . . . , those bk which are in A as well. Here are the details: Consider the set E = {k N : bk A}
19
onto
There is no computer program that can decide, for every computer program, and each possible input to
the program, whether or not that program with that input will ever terminate.
20
Cf, Martin Davis, Computability and Unsolvability, McGraw-Hill, New York, 1958.
34
which is a non-empty set of natural numbers (since A is assumed not nite.) By the WellOrdering Principle, the set has a least element, call it j1 . Now suppose that j1 < j2 < . . . < jn have been chosen so that bjk A, k = 1, 2, . . . , n and for any m, 1 m jn , if m {j1 , j2 , . . . , jn }, then bm A. Now consider {k > jn : bk A} which is non-empty, because A was innite, and which also has a least element by the Well-Ordering Principle. Call it jn+1 . Then bjn+1 A. Thus we have constructed a sequence j1 < j2 < . . . such that bjk A for every k , and bm A if m {j1 , j2 , . . .} . Let f : N A be dened by f (n) = bjn . It follows21 that f is 1-1 and onto A, proving that A is a countable set. Corollary 1.23 : The real numbers, R, are an uncountable set. Proof: (0, 1) R. Theorem 1.24 The union of countably innitely many countably innite sets is countably innite. (Colloquially: the denumerable union of denumerable sets is denumerable.) Proof: Suppose A1 , A2 , . . . are the countably innite sets. In light of the previous theorem, we may assume that all the Ai s are disjoint, for if not, extending them to be disjoint sets only increases the number of elements in their union.22
21 22
Provide the details. See Exercise 54. This extension could be accomplished, for example, by replacing the set Ai by the new set Ai = Ai i = {(a, i) : a Ai }.
35
is countably innite, it suces to nd an enumerating function f : N B which is 1-1 and onto. This will be accomplished if we nd a way of listing all the elements of all the Ai s. What we need is a function which provides a mapping from N to N N which is itself 1-1 and onto. Once we have this, the remainder of the construction is quite easy. For the moment, however, let us enumerate each of the Ai , as follows: A1 = {a11 , a12 , a13 , . . .} A2 = {a21 , a22 , a23 , . . .} A3 = {a31 , a32 , a33 , . . .} ... To be more specic, since each Ak was assumed to be countably innite, there exists a gk : N Ak which is 1-1 and onto. Then let ak,j = gk (j ), j = 1, 2, . . . . Lemma 1.25 : There is a function h : N N N which is 1-1 and onto. Proof: Let m be any positive integer. Then m can be written as a power of 2 times an odd integer, and in only one way: m = 2r1 (2s 1), where r 1 and s 1. Now dene h(m) = (r, s), and observe that h1 (r, s) = 2r1 (2s 1) is a 1-1 onto function: N N N . It follows that h is the desired mapping. To complete the proof of the theorem, dene f : N B as follows: f (m) = gr (s) where h(m) = (r, s). Since gr (s) = gr (s ) if and only if r = r and s = s it follows that f is 1-1 because h is.
Then Ai Aj = if i = j since (a, i) = (a, j ) if i = j.
36
CHAPTER 1. AXIOMS OF THE REAL NUMBER SYSTEM Now, let x Ai . Then x Ak for some k . Then x = gk (l) for some l, and hence
x = f (2k1(2l 1)), which shows that f is onto. Corollary 1.26 The nite union of countably innite sets is countably innite. Proof: This follows from Theorem 1.22, when we recognize that the nite union of countably innite sets can be identied with a subset of the countably innite union of countably innite sets. Corollary 1.27 The countably innite or nite union of countably innite or nite sets is countably innite or nite. Proof: Suppose the sets are A1 , A2 , . . . . If the number of Ai is nite, say i = 1, 2, . . . , n, then let Ak = {(1, k )} for k > n. In light of Theorem 1.22 we may now assume that the number of Ak is countably innite. If any Ak is a nite set, say Ak = {ak1 , . . . , akn }, then extend Ak to an innite set Ak = {ak1 , . . . , akn , (k, n + 1), (k, n + 2), . . .} and again in light of Theorem 1.22 we may assume that Ak is a countably innite set. Since the countably innite union of countably innite sets is countably innite, the result follows. Corollary 1.28 The set Z of all integers, positive or negative, or 0, is a countably innite set. Proof: Z can be identied as the union of three sets, a copy of the positive integers, a copy of the negative integers, and the set consisting of 0. It follows from the previous corollary that Z is countably innite. Corollary 1.29 The set Q of all rational numbers is countably innite. Proof: The set Q can be written as the union of three sets, the positive rationals, the negative rationals, and the set consisting of 0. Q+ , the positive rationals, can be identied as a subset of the set of all ordered pairs of natural numbers: (m, n). Identify (m, n) with the rational number m/n, and observe that (2m, 2n) is also identied with the same rational
37
number. Thus, Q+ can be identied with a subset of N N , and hence by the previous remarks in the previous corollaries, is a countably innite set. The rest of the proof follows as in the case of Z . Exercise 50 Without using Corollary 1.28, prove directly from the denition that Z is a countable set. Exercise 51 Construct a function f : R R\23 {0} which is 1-1 and onto. Exercise 52 Construct a function f : R R\N which is 1-1 and onto. Exercise 53 Construct a function f : (0, 1) R which is 1-1 and onto. Exercise 54 Complete the proof of Theorem 1.22 by performing the following steps: a. Prove that jk = j (k ), k = 1, 2, . . . is a 1-1 function. b. Conclude that f is therefore 1-1. c. Prove that f is onto A.
23