An Introduction To The Theory of Surreal Numbers
An Introduction To The Theory of Surreal Numbers
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
If; w
in
1534.
since 1584.
CONTENTS
Page
Preface
Acknowledgements
Chapter 1
Introduction
Chapter 2
A.
B.
C.
Chapter 3
A.
B.
C.
D.
13
13
17
21
24
Chapter 4
A.
B.
C.
D.
27
27
28
32
41
Chapter 5
A.
B.
C.
D.
E.
Normal Form
Combinatorial Lemma on Semigroups
The a) Map
Normal Form
Application to Real Closure
Sign Sequence
52
52
54
58
73
76
Chapter 6
95
Chapter 7
3
3
4
8
104
Chapter 8
A.
B.
Number Theory
Basic Results
Partial Results and Unsolved Problems
page
111
111
114
Chapter 9
A.
B.
C.
D.
E.
121
121
124
129
135
138
Chapter 10 Exponentiation
A.
General Theory
B.
Specialization to Purely Infinite Numbers
C.
Reduction to the Function g
D.
Properties of g and Explicit Results
143
143
156
167
175
References
191
Index
192
PREFACE
INTRODUCTION
DEFINITION
If a and
In fact, this is
If F and
Case 2. G
a e F
5
3 < 0,
one of these sets might contain the empty sequence as its only element.]
Since F1 < G 1 , it follows that a(0) < b(0) for all a e F 1 , b e G 1 .
Now suppose d e F and d e G. This means that neither F1
1
nor G contains the empty sequence, i.e. a(0) and b(0) are never
undefined. Since a(0) < b(0), we obtain: a(0) = - and b(0) = +. It
is then clear that d works.
Since F and G are disjoint, d belongs to at most one of
F and G. Suppose that d e G. A similar argument will apply if
d e F. Then every a in F1 satisfies a(0) = -. Let F" be the set
of tails of F1 with respect to this -. (Such an iterated tail is,
clearly, the tail with respect to the sequence (d-.) Apply case 2 to
F" and <j> to obtain d 1 . Then the juxtaposition c = d-d1 works. We
already know that c satisfies the required inequality with respect to
those elements that do not begin with d. Since no b e G1 has
b(0) = -, this takes care of all of G. The choice of d1 takes care
of all elements in F beginning with d (the next term of which is
necessarily - ) . On the other hand, any element e satisfying F < e < G
must begin with d. Since d e G, the next term must be -. By choice
of d1, it must be an initial segment of the tail of e with respect to
d-, i.e. e must begin with d-d1.
This completes the proof.
Definition.
F < c < G.
F|G
is the unique
ORDER PROPERTIES
Theorem 2.2.
If G = <j> then
F|G
If F = <f> then
F|G
(F|G) _< the least a such that Va[a e F u G => (a) < a ] .
This is trivial because of the lexicographical order, since
otherwise the initial segment b of F|G of length a would also
satisfy F < b < G contradicting the minimality of F|G.
Note that < cannot be replaced by _< . For example, if
F = {(+)} and G = {(++)}, then F|G = (++-). The result also follows
from the construction in the proof of theorem 2.1. In fact, the
construction gives the more detailed information that every proper
initial segment of F|G is an initial segment of an element of F U G .
(An initial segment b of a is proper if b * a ) .
Theorem 2.3 has a kind of converse.
9
F|G
Proof. Let F = {b: Jt(b) < a and b < a} and G = {b: i(b) < a and
b > a}. Then F < a < G and every element of length less than a is,
by definition, in F or G so that a satisfies the minimum length
condition. Note that the argument is valid even if a is the empty
sequence.
The last result is a step in the way of showing the
connection between what is done here and the spirit of [1], since the
result says that every element can be expressed in terms of elements of
smaller length, thus every element can be obtained inductively by the
methods of [1]. The next theorem shows that the ordering in [1] is
equivalent to the one used here.
Theorem 2.5. Suppose
and F < d.
F|G = c and
F'|G' = d. Then
c _< d iff
c < G1
Proof. We know that F < c < G and F1 < d < G 1 . Suppose c _< d; then
c _< d < G' and F < c _< d. For the converse, assume c < G1 and F < d.
We show that d < c leads to a contradiction. This assumption yields
F < d < c < G. Hence c is an initial segment of d. Also
F1 < d < c < G1 so d is an initial segment of c. Hence c = d which
contradicts d < c.
This last result is of minor interest for our purpose. Its
main interest is that together with theorems 2.1 and 2.4 it shows that we
are dealing with essentially the same objects as in [1] although here
they are concretely defined. Since the present work is self-contained
this is not of urgent importance, although it is worthy of noting.
Of fundamental importance here will be what I call the
"cofinality theorems." They are analogous to classical results such as:
In the e,6 definition of a limit, it suffices to consider rational e;
and in the definition of a direct limit of objects with respect to a
directed set, a cofinal subset gives rise to an isomorphic object.
Definition. (F',G') is cofinal in
(VaeF)(3beF')(b>a) A (VaeG)(3beG'
(F,G) if
10
that
implies
(F'.G1)
are
Theorem 2.6 (the cofinality theorem). Suppose F|G = a, F1 < a < G1, and
(F'.G1 ) is cofinal in (F,G); then F'|G' = a.
Proof. Suppose (b) < (a) and F1 < b < G 1. It follows immediately
from cofinality that F < b < G, contradicting the minimality of (a).
Hence a = F'|G\
Theorem 2.7 (cofinality theorem b). Suppose (F,G) and (F',G')
are mutually cofinal in each other. Then F|G = F'|G'.
Note that it is enough to assume that F|G has meaning since
F < G => F1 < G 1 .
Proof. {x:F<x<G} = {x:F'<x<G'}. Hence the element of minimal length is
the same.
Although the two above theorems are closely related they are
not quite the same. Theorem 2.6 will be especially useful in the sequel.
I emphasize that in spite of the simplicity of the proof it is more
convenient to quote the term "cofinality1' than to repeat the trivial
argument every time it is used. Also it is convenient often with abuse
of notation to say that H1 is cofinal in H. However, this is
unambiguous only if it is clearly understood whether H and H1 appear
on the left or right, i.e. we must consider whether we are comparing
(H,G) with (H'.G1 ) or (F,H) with (F'.H 1 ). This is usually clear
from the context.
Cofinality will be used to sharpen theorem 2.4 to obtain the
canonical representation of a as F|G. Of course, the representation
in theorem 2.4 itself may be regarded as the "canonical" representation.
The choice is simply a matter of taste.
Theorem 2.8. Let a be a surreal number.
11
12
13
3
A ADDITION
We define addition by induction on the natural sum of the
lengths of the addends.
a, then
is a typical element of
G.
If
a = F|G
is a typical element of
is
F
We are now
Definition,
F i r s t , since the
a'+b
in the d e f i n i t i o n .
Secondly, no further d e f i n i t i o n
f.
F < G.
F|G = u
G used in the d e f i n i t i o n of
if
if
F < G
a+b do
F and
F is always
u is never
obtained as a value.
[1] is followed somewhat closely in building up the algebraic
14
d i f f e r e n t foundations.
1.
Now
(+) = {0}|(|>.
( )
by
Note that
(+)
G is
Similarly
Then
0+1 = 1.
"2".
1 , 2, etc.
definitions.
Theorem 3 . 1 .
a+b
b>c=>b>a+c
and
u) and furthermore
b>c=>b+a>c+a.
Remark 1. Although the first part is what is most urgent, we need the
second part to carry through the induction.
Remark 2. As a matter of style, one can prove commutativity first (which
is trivial) and then simplify the statement of the above theorem and its
proof.
a+b exists as a
(a) + (b)
is less than
In other
(a,b) of surreal
F < G. Since
that
< a + b"
and
a + b
< a" + b
and
a + b1 < a + b".
Similarly
1
By d e f i n i t i o n
of
15
< a" + b.
b > d > c.
Hence
a+b
is defined.
and
be the common i n i t i a l
Hence
is an i n i t i a l
a 1 + b < a 1 + b11
Also
U a ) + lib)
segment of
a + b > a
+ d>a
nor
is an i n i t i a l
_< a
and
c.
+ c
Now assume
and b + a > d
a > b
and
segment of
<_ a.
Let
b > c.
Hence
+ a > c + a .
c > d =>
a + c > b + d.
Theorem 3 . 2 .
{g+b, a+k}
Suppose
where
d e f i n i t i o n of
of
Remark.
and
a = F|G
and
b = H|K; then
f e F, g e G, h e H, k e K.
a+b
I.e.
a + b = {f+b,a+h}|
although the
representation
Proof.
Let
a = A'|A\
a = F|G, b = H|K.
b = B'|B".
By the inverse c o f i n a l i t y theorem (theorem 2 . 9 ) , F
cofinal in
a + b
f + b < a + b.
is cofinal in
Also suppose
The betweenness
Consider
is
( 3 f e F ) ( f _> a ) .
as in the d e f i n i t i o n .
By theorem 3 . 1 ,
f + b^ a
Since
1
+ b.
F
A
Hence the
a+b.
In p a r t i c u l a r ,
16
11
(a+b) .
a+b1
and similarly
We thus
obtain
(a+b) + c = {(a'+b)+c, (a+b')+c, (a+b)+c'}|{(a"+b)+c, (a+b")+c, (a+b)+c"}.
A similar result is obtained for
a + (b+c).
Associativity follows by
induction.
3) The i d e n t i t y :
Q',0",
Then
0 = <|>|<j>. We
There are no
so this simplifies to
and
a"
{a'+0}|{a"+0}
We thus get
We use induction.
F|G
0.
Let
which is
-a
be obtained from
be typical elements of
F and
b is an i n i t i a l segment of
an i n i t i a l segment of
-c
and
representation of
may be expressed as
a + (-a) = {a'+(-a),
a by
{a'}|{a"}
a + 0 = a.
G respectively.
c
then
-b
is
-a"|-a'.
a+(-a"}|{a"+(-a), a+(-a')}.
a.
Therefore
Since
is clear from the lexicographical order that -a" < -a < -a 1 . Using
induction and the fact that addition preserves order, we obtain
a1 + (-a) < a1 + (-a1) = 0. a + (-a") < a" + (-a11) = 0.
a" + (-a) > a" + (-a11) = 0, a + (-a1) > a1 + (-a1) = 0. Hence in the
representation of a + (-a), as H|K, H < 0 < K. Since 0 vacuously
satisfies any minimality property, a + (-a) = H|K = 0.
17
h e u r i s t i c a l l y natural.
B MULTIPLICATION
The d e f i n i t i o n of m u l t i p l i c a t i o n is more complicated than
that of a d d i t i o n ; in f a c t , i t took some time before the standard
d e f i n i t i o n for m u l t i p l i c a t i o n was discovered.
Definition,
a,b,a',b'
are ordinary
inequalities i f either
a1
i s replaced by
or
b'
replaced by
b".
ab
is always defined.
Furthermore
a > b and
factors.
P(a,b,c,d)
proper i n i t i a l segments of
that
a0
i s of the form a 1
and
or
ac - be > ad - bd as
ab + ab - abo
b
a".
where
respectively as
a,b
f(a,bo).
are
Note
a0 a
i s equivalent to
ac - ad > be - bd.
consider
b o and
f(ao,b2<>) - f t a o . b ^ ) .
be i n i t i a l segments of
b and
This is
and
b2
> b ,
expression is p o s i t i v e , i . e .
increasing function of
If
P(a,a,b ,b o ) , so
if
a<> < a.
If
f(a,bo)
is an
b ^ > b 2 ,
18
19
by the above.
used instead of
and
(ac)o + be
we use
By
ac + be + ac + be 0 - ac - be 0
A similar r e s u l t is obtained i f
a + b
is
+ b.
by theorem 3 . 2 .
(a+b)o
For
20
(aOc+acO-aOcO) + be.
ac+bc
is
ac + (bc)o
(a+b)c = ac+bc.
ab - (ab+abo-aObo)
as
(a-ao)(b-t>o).
Associative law.
We use induction on
(ab)c
A typical
By
Of
(ab)c
by putting a superscript on a t
there
I.e.
a(bc)
upper or lower, and which addends in a term have a minus is the same as
before.
ao(boc)
- ao(bco) + a o ( b o c o ) .
The i d e n t i t y .
F i r s t note that
d i s t r i b u t i v e law.
Since
a*0 = 0.
(Note that
21
DIVISION
We w i l l define a reciprocal for a l l
a > 0.
As usual,
Let
a = A'|A"
be the canonical
representation.
where
= { 0 aT * } H Ta I }
since
a > 0.)
a" e G and
a* e F - { 0 } .
We try
(Note that
Although
0 e A1
has some
22
23
F|G has
Let F|G = c.
Finally we compute
axc + acx- a c
1
0 e A
where
0c + a0 - 0*0 = 0
Suppose
is a lower element.
product reduce to ac
c
a x = c = 0.
and ac
e G.
However, we know that
c e F > ac
acx
< 1. Hence i f
is a lower element
Now suppose
in
FtjG,
ac
c x e G - a c x > 1 and
is an upper element
ax * 0.
c1oa1
Then
a^ + ac^ a c
> 1 and i f
and
is defined, is contained
(a-a 1 )c + a ^ = 1. Now
ac
acx < 1.
ac
iff
respectively i f f
c a
and
are on
e G iff
c a
> c.
l^l
s a t i s f i e s the equation
o
the inequality
c1 a1 > c
is equivalent to
cla1 * c
ac
1.
">"
axc + a c ^
a^.
e F(JG, i t follows
may be taken to be
"<"
in the proof.)
We have shown that in the expression for
the betweenness property but
1 = (+)
ac, 1
Hence
property.
Therefore
ac = 1.
Although we don't
satisfies
ba
varies
implies that
b and
24
D. SQUARE ROOT
Frankly, i t is not of extreme importance to obtain the
existence of square roots at this time, since that can be obtained from
the theory of i n f i n i t e series which w i l l be developed l a t e r .
However, in
view of the elegance of the theory, i t is worth seeing how square roots
can be obtained d i r e c t l y without further machinery.
In the bottom of
a l l i n i t i a l segments of
square roots.
elements in
Let
A'UA"
a = A'|A"
Let
0, then
b e A'UA11
then
Then a l l
A'UA".
We
f(b) = / b .
(In analogy with this case it was possible to use the concept of free
semigroup to deal with division, but we preferred to be more concrete.
Here we are stuck with this formalism since we are dealing with nonassociative juxtaposition.) By induction (Yx)[f(x)>0]. Furthermore
f(x) > 0 unless x = 0.
F and G are now defined inductively as follows.
If
f(bc) e G
then
25
F and G
Theorem 3.8.
f(bc) e F. Since
Proof.
f(b) e G.
x e F =* x 2 < a
and
In order to
(y2-a)(xrx2)
.
x ^ y - x 2 y = (x + y ) ( x + y ) Hence, for fixed
iff y 2 > a iff y e G
function of x
.
.
.
y, xy is an increasing
using induction.
Also
y eG
have previously, i.e. preserving sides is equivalent to being an increasing function if one variable is fixed.
x 1 o x 2 e G. First, if * 1 x 2 e F, let
Now assume
x = max(x 1 > x 2 ).
x ,x e G we take
V OV
a+x
2X
x 1 o x 2 _> xx.
x e G, then
fortiori
>
is symmetric in
< a
and
Therefore
x e F,
x2 < a
then
(a-x 2 )
x * a. Hence
(^i)
^ 2x J
and i f
> 0.
a2+2ax
*+xl* > a.
4x2
xy e F.
x
and
y) suppose
x e F
and
(because
y e G.
Then
> a.
Assume f i r s t that
xy = a.
Then
xv =
xy
and
( x oy) 2 =
Now assume
xy
Thus
Similarly, if
4xva
. Clearly x 4= y since
(x+y) 2
Therefore (xy) 2 < a as desired.
x+y
x+y
x 2 < a < y 2 ; hence
4xy
x <
or x > .
26
If x < , we
x < ~,
we have
then y >
a 2
(x-) < a.
a+CiCo
c < c1oc2 = c
+Q
iff c x c + c 2 c - c ^ < a.
c c is
0 which is
elements
for c 2 .
in the
/a"1" /a"1" = a 1 .
27
INTEGERS
The main task of this chapter is to show that the surreal
(The d i s t i n c t i o n
is the sequence
n.
(+).
n times
(1+1+1 )
Theorem 4 . 1 .
is
n times
(+++
).
Proof.
(+)
(+++
is c l e a r l y
)|<j> = {n-l}|<j>
{0}|<|>.
by the inductive
Hence by d e f i n i t i o n
which is
(++++...imeS)
28
senses, once for addition, and once as one of the symbols used in the
ordinal sequences which we consider.
algebra where, for example, in
meanings.
+(a+b)
have different
+ and
-n
is
n times
).
Since
that
proper subset of the rationals, they are dense in the reals. Thus they
can be used just as well as the rationals as building blocks later in
developing the reals.
Lemma 4.2.
Proof.
Let
addition.
a < c < b.
property.
If
{a}|{b} = c.
Then
then {a}|{b}
Hence
Now
a+b
a + c < a + b < b + c
{2a}|{2b} = a+b.
2c = c+c = {a+c}|{b+c}
2a < a + c and
2c = a+b;
{2a}|{2b} = a+b
by definition of
by cofinality.
First,
therefore
{a}|{b} = c =
that
So
a+b
The above is the key lemma for dealing with dyadic fractions.
29
Theorem 4 . 2 .
fractions.
d(m) * d ( 0 ) .
Define
b(i)
as follows.
b(i) = 1
b(i) = -1
b(D =
if
i < m and
if
i < m and
1f
2i-m+i
d(i) = + .
d(i) = - .
i i l m and
d(i)= + .
and
m+n-1
Then
d =
\
1=0
b(i).
j.|.2|.
The case
n = 0,
30
Here we have
followed by a minus.
with
n = 1
d(m) = - .
defined.
m pluses
is not
m> 1 [d(m)*d(0)L
Now
2m-l = {2m-2}|{2m}.
obtain
{m}|{m-l} = m - j
m pluses.
m-1
consists of
as an i n i t i a l
a = m-1
and
b = m to
d e f i n i t i o n t h a t {m}|{m-l} = d.
of
By c o f i n a l i t y we
segment.
(m-1)
m-1
and
pluses and m
m must by the
d = m- j ,
i.e.
n <_ r
and l e t
n.
n = r+1.
s.
Let
1
d -
and
d1
c+1.
Hence
{^-} | f | ^ }
be the i n i t i a l
segment of
since
has length
s i m i l a r argument applies i f
= - + d
of length
m+n = m+r+1.
d = d'-.)
positive
t i v e integers
d = d'+ or
= " ^ 7 for a l l
r _> 1 , i . e .
m+r.
Assume
that
n = 1
d'
Then
d = d'+.
(A
has been
begins with
followed by a minus.
Now l e t
d = FIG
are f i n i t e , so by c o f i n a l i t y
l a r g e s t element in
d = d'+, clearly
and
x = d1.
a
y
{x}|{y}
We cannot be as e x p l i c i t with
where
G.
F
x
and G
is the
Since
y , since in
31
G.
m pluses; hence
= x =
c.
and
y.
y m.
Hence
y =
d = { x } | {y} = +
then we
which is
Let
greater than
m+r
c a r d i n a l i t y of
every element of
is
H
1 + 2 + 2 ...2r-!.
is of the form
2^-1
e H
unless
= m.
- > = d1.
Since
that
Now
r
2
Since
fact,
r = d
2
$y-
m-1
and
GCHUW.
1
< d
and
H.
i[
In p a r t i c u l a r ,
1 < m+r.
Now
r
2
segment of d
and
which is a contradiction.
is a lower bound to
segment of
d.
G.
In
Otherwise, by consider-
we obtain an element of
Since
earlier,
m.
and by the
is a c t u a l l y an i n i t i a l
-
2T
and
m is in
In e i t h e r case
d = d' +
m-1
s t r i c t l y between
- > d.
The
G,
-
> d, i t follows that
2r
i.e.
y = -~-
As we said
H e u r i s t i c a l l y speaking, we always
2g-.
2g- < 2y
Since
2 < 2g < 3,
F i n a l l y , +++--+
with
Now
+++--
+++-.
is
we begin with
For example,
This is
2j
2j
so we need a
m+n.
Consider
b+1
32
is odd.
This is, of course, what we expect from the beginning since the numbers
are sequences and not equivalence classes of sequences. Anyway,
whatever appearance there may be of choice, it is clearly deceptive.
The whole idea of a shift from ordinary counting to a binary
decimal computation at the first change in sign may seem unnatural at
first. However, such phenomena seem inevitable in a sufficiently rich
system.
It is an amusing exercise in arithmetic to add numbers in
this form. Carrying exists as usual but since we deal with pluses and
minuses and do not have zeros, an adjustment is necessary if we would
otherwise obtain 0 in a place. Specifically, 0+ must be replaced by
+-. Also, one must be aware of the dividing line where the shift from
ordinary counting to the binary decimal computation occurs.
Finally, a suitable succinct way of expressing the result of
arithmetical operations on dyadic fractions given in the above form may
be useful in studying certain problems in the theory of surreal numbers.
Although the dyadic fractions look like a drop in the ocean of surreal
numbers, they form an important building block. As we shall see, for
example in chapter seven, it is possible to ask questions which are
non-trivial even for a set such as the dyadic fractions.
C
REAL NUMBERS
33
f i n i t e length or is of length
which is either of
GO and s a t i s f i e s
(VnoJGniMHnzHCni^noDACna^noMCaCni) = + ] A [ a ( n 2 ) = - ] } .
In other words, the d e f i n i t i o n simply requires that the terms
of the sequence
a(n)
constant signs.
where one might rule out an eventual sequence of nines to ensure that
each number has a unique representation. In our case a sequence consisting eventually of pluses w i l l be a surreal number outside the set of
reals.
To show that the set of real numbers forms a f i e l d ,
suffices to check the closure properties.
it
However, i t is convenient to
w which are real and those which are not can be expressed in
a=F|G
plus, then
If
is the canonical
F and
G are non-
FUG
dyadic f r a c t i o n s .
Lemma 4.3.
that
Let
F and
F < G, F
G has no minimum.
Then
F|G
is a
real number.
Proof.
F|G
By theorem 2.3
(a) = u) and
sign is
+.
&(a) _< w.
a
Let F|G = a.
-.)
The p o s s i b i l i t y
34
a > c > b
Also
2m
a < d < b -
2m
2m
Thus we have t h e
desired contradiction.
Lemma 4.4. Let a = F|G. Suppose that (VxeF)(3a positive dyadic
fraction r)(3y)(y>x+rAyeF) and (VxeG)(3a positive dyadic fraction r)
(3y)(y_<x-rAyeG).
Also let F1 < a < G'. Suppose that (V positive dyadic r)
Then
a = F'|G'.
F 1 . The case
such that r
r choose
we have
Since w e F1
of the sets to
Lemma 4.5. There are an infinite number of dyadic fractions between any
two distinct real numbers a and b.
Proof. It clearly suffices to obtain one dyadic fraction. If neither a
nor b is dyadic then the common segment works. More generally, if
35
36
{a+b11}
0 bx< b}
and
~~
in that case.)
such that
b 2 -b x _< r
and a 2 ~ a i r #
r
^
we can choose
e
usua
a ,a ,b ,b
that the limit of a product is the product of the limits shows that the
other conditions of lemma 4.4 are satisfied.
37
such that
both
F and
G are non-empty.
number of length
as
break down.)
Hence by lemma 4.4
F1
addition,
G1
and
ab = F ' | G ' .
ab
is r e a l .
Note that during the proofs of closure we obtained nice
representations of sums and products of reals with the help of lemma 4 . 4 .
Since such i n t u i t i v e - l o o k i n g representations f a i l in general, i t is
i n t e r e s t i n g that they work in the special case of real numbers.
Reciprocals.
reciprocals.
larger than
G = (d: d
0.
Let
Hence
0 e F.
Clearly F < G, F
fraction satisfying
da < 1 .
has no maximum.
Then
1 - da
1 - da > 0.
a <
Suppose
| i< 1 " d a -
Thus
is a dyadic
iSPn
m so that
2m
Hence
and
and G
2ma > 1 .
so that
is a real number
is a dyadic f r a c t i o n A da < 1}
i s a dyadic f r a c t i o n A d a > 1 ) } .
F = {d: d
( d +
n
2 ,
iSTn)a<1-
is bounded
n
i.e. a < 2 .
Similarly
has no minimum.
By lemma 4.3 b = F|G is a real number. By closure of
multiplication ba is a real number. Now let n be arbitrary. We
would like to show that there exists an element c in G such that
ca < 1 + Choose m such that a < 2m.
- zx\
Hence
2n
2 n *~
~"
~~
2n
c1 e F
such that
38
|ba1| < r for any positive dyadic fraction r. It follows from lemma
4.5 that ba = 1. This completes the proof.
Note that reciprocals of dyadic fractions are not necessarily
dyadic fractions. Thus a naive attempt consisting of expressing a as
F|G and using something like -j?\j to obtain the reciprocal would cause
dfficulties. Our proof circumvents that problem.
Now we know that the real numbers form a field containing
the dyadic fractions. They therefore contain all the rational numbers.
Finally, we would like to prove the l.u.b. property, i.e.
that every bounded non-empty set of reals has an l.u.b. within the set
of reals. It's important to note that in our development the real
numbers form a proper subset of the system we are studying so that care
is needed in stating the l.u.b. property. In fact, as a special case of
theorem 2.1 we see that any set which has no maximum has no_ l.u.b. in the
class of surreal numbers.
Incidentally, it is immediate from lemma 4.5 that every real
number is the l.u.b. of all dyadic fractions below it but this is not
what we really need. Since the "ordinary" real numbers can be
characterized as an ordered field with the l.u.b. property, the latter
is all that remains to be proved.
Suppose H is a non-empty set of real numbers bounded above.
Let G be the set of upper bounds which are dyadic fractions and let F
be the set of all other dyadic fractions. F and G are clearly nonempty. By lemma 4.5, F has no maximum. If G has a minimum b, then
b is already an l.u.b. to H and we are done. (Again, lemma 4.5 is
used since it guarantees that a least upper bound among the set of dyadic
fractions is automatically a least upper bound in the set of all real
numbers. So suppose G has no minimum. Then by lemma 4.3 r = F|G is
a real number. We now show that l.u.b. H = r. First, r is an upper
bound to H. Otherwise, (3aeH)(r<a). Let d be a dyadic fraction
satisfying r < d < a. (We are getting our money's worth out of lemma
4.5!) Since d < a e H, d e F. However r < d, contradicting the fact
that F < r. Finally, suppose s < r and s is an upper bound to H.
Let s < d < r for some dyadic d. Then d is also an upper bound to
H. Hence d e G. But d < r. This contradicts the fact that r < G.
We have finally shown the following.
39
Theorem 4.3. The real numbers form an ordered field with the l.u.b.
property (i.e they are essentially the same as the "reals" defined in
more traditional ways.)
We close this section with several remarks. Let a be a
real number which is not a dyadic fraction, thus a has length u>. If
a = F|G is the canonical representation, we know that F < a < G and
that the elements of F form an increasing sequence and that the
elements of G form a decreasing sequence. Since F has no maximum and
G no minimum, both sequences are infinite. It is clear from this and
from lemma 4.6 that if a n is the initial segment of A of length n
then lim a n = a. Of course, in the definition of a limit it makes no
n+
difference whether e is taken to be real, rational, or dyadic; however,
we certainly cannot use general surreal numbers.
Finally, we compare our definition with the one used in [1]
which makes no reference to sequences. It is convenient for motivation
to consider a third definition which is, roughly speaking, intermediate
in spirit between the two definitions.
Theorem 4.4. The following three conditions are equivalent.
(a) a is a real number (by our definition);
(b) For some integer n, -n < x < n and a has no initial segment
a a such that |a-aa| is infinitesimal;
(c) For some integer n, -n < x < n; and
a = {a-1, a - j, a - j - ' H ( a + l , a + j9 a + |} (the definition in [1]).
Remarks. An element a is infinitesimal if it is nonzero but for every
positive rational r, |a| < r. It is clear, for example, from what we
already know that the element a of length u which begins with + and
then after that consists only of -'s is an infinitesimal. The heuristic idea in definition (c) is that of the possibility of writing a in
the form F|G without forcing either F or G to be "too close" to a.
Of course, by the cofinality theorem, if a = F|G then one still gets a
if F and G are both enlarged to include elements closer to a. The
challenge lies in the opposite direction. How far away can F and G
be from a and still have a = F|G? As a rough rule of thumb, the
larger the length of a, the closer F and G must be to a.
40
Proof,
this is
Not (a)
even if
(a) => (b). Since all initial segments are dyadic fractions,
clear.
= not (b). (This contra-positive notation seems natural here
it may be unusual.)
Suppose a is not real. Then (a) _> u>. If for all n
less than u>, a has a fixed sign then the condition -n < a < n fails,
as is clear from the ordering, so that case is clear. Now let a w be
the initial segment of a of length a>. Assume first that a^ consists
eventually of pluses only. (A similar argument will apply if o> consists
eventually of minuses only. ) Then we can use an argument which is essentially the same as the one used in the proof of lemma 4.3. We let a n
be the initial segment of a^ obtained by stopping just before the last
minus. (We already ruled out the case where a w contains no minuses.)
Then a n > a. For all positive rational r we can find m sufficiently
high such that the initial segment of a m of a of length m satisfies
a m < a and a n - a m < r. As in the proof referred to above, we use a
computation of the form - + -^- + r+j + ... Hence a n - a < r
for all r. Since n is fixed, we have |an-a| is infinitesimal.
Note that this part of the argument is independent of whether a is the
same as a^ or not.
Now assume that a^ does not eventually have constant sign.
Since a is not real, a * a w ., i.e. a w is a proper initial segment of
a. Let r be an arbitrary positive rational. Suppose a w = F|G. Then
by lemma 4.6 there exist b e F, c e G such that c-b _< r. Now
b < a^ < c. It is also clear that b < a < c by the lexicographical
ordering. (In fact, b and c occur in the canonical representation of
a as well as a^.) Hence la^-a) < r. Therefore la^-al is
infinitesimal.
(b) =* (c). Express a canonically as F|G. Then the conclusion is
immediate by the cofinality theorem. Given a1 e F, then a - a'
is not infinitesimal. Choose n so that a - a1 > - Then a -- > a 1 .
A similar argument applies to a1 e G.
(c) = (b). Let a = {a-1, a - |, a - |,... }|{a+l, a + |, a +j,...} and
let a = F|G canonically. Then by the inverse cofinality theorem the set
{a-1, a - j9 a --=-...} is cofinal in F. Let a1 e F. Then for some
n, a " 2 . a ' 1"-e- a"a> 2.* Therefore
A similar argument applies to G.
a -a1
is not infinitesimal.
41
ORDINALS
42
i and all
k
r.
also be w r i t t e n in the form
I a> i with r. > r. . by breaking up a l l
1
r.
Ta) Tn. for which n. > 1 . The sum is also a natural sum and therefore
r
s.
by theorem 4.5 a surreal sum. Suppose a = GO 1 and 3 = o> i . By
r.. s.
the d i s t r i b u t i v e law for surreal numbers we obtain 01x3 = J w TXID J ,
I f at least one of
a and
3 is not a power
of
i,j
a>, then we can use the
r. s.
ax3 = I a) l0o) J = a3
I.j
by the distributive law for natural multiplication over natural addition.
Now suppose that both a and b are powers of u>. Let
r
s
a = w and b = oj . Then by definition
ax3 = {((/x6) + (Y><W S ) - (6xY)}|(j) where 6 < OJS and y < </. Note that
unlike in the case of addition, we are stuck with the surreal operation
- which does not correspond to an operation on ordinals.) First,
since there are no upper elements, by theorem 2.2 ctx3 is an ordinal.
43
r*
(u> x6)
r
which by the inductive hypothesis and theorem 4.5 is (w Q6) +
s
r
v^s
Since 6 < w
and y < w . This is clearly less than u>
.
rS&
a)
satisfies the betweenness property in the definition of
Hence
o3.
Hence
Ji(axg) a / ^
and since
1
Now if
< r
and
have
this is
</ ^ n .
ax$ is an ordinal,
n
ax$ w
Similarly if
*
s
+ (yxw )
s
(>Qw ).
< s
we have
ax$ > J^
Furthermore
n.
We have
already noted during our proof of theorem 4.5 that seq(r'@s, rs') = rs.
By the basic properties of the expansion of ordinals in powers of
seq((/ S n, a)r+S|n) = < / + s .
follows that
ax3 = J5
Hence
o>, it
So finally
= o@e.
In view of theorems 4.5 and 4.6 we no longer need symbols
such as
and
and
x.
places as well.
w-1
and
ju
have meaning.
Incidentally, our
w-1
has nothing to
This is
w-1
a)+(-l).
is
{n}|<j>
where
- 1 = 4>| -CO>.
44
{n};
hence we have
{n}|{oj}
w+1
which consists of
pluses
followed by a minus.
We make several remarks before continuing with other
examples.
F|G.
but r e f e r r i n g only to
and
themselves.
This helps to
where
m and
)
{n + y}|{oj
-}
length
w+1
for a l l
satisfy-
r e f e r r e d to above.
{n}|{oj}
and
a = F|G
G
where
is the set of a l l
If
we may
dyadic
a.
is a dyadic
i s a dyadic f r a c t i o n .
I.e.
if
45
As obvious as these above remarks may be, they are worthwhile to state and get out of the way once and for a l l , since i t would be
clumsy to make them in the middle of a proof.
i.e.
using con-
stating
the obvious j u s t i f i c a t i o n .
Now consider
w-2.
This is
w+2
integer
n.
consisting of
u)-(m+l)
for a positive
m is fixed although
varies.)
(To avoid
This is
sequence of length
oo+m+l
consisting of
pluses followed by
m+1
minuses.
I t is natural now to ask what
minuses represents.
infinite, i.e.
cal order.
co-w
still
the e f f e c t of the f i r s t
co pluses.
shortly.
The computation for
as
co.
For example
w-m
(co +6u)) - 3
followed by 3 minuses.
is the sequence of
w2 + 6u>
pluses
(oo+6) - 1
is simply
w+6
w+5.
{3: 3<a}.
For
{3-1:3<a}
For a l i m i t ordinal
by replacing i t by
is a
46
r.
r.
where
w+r
F|G
{Go+F,n+r}|{oj+G}.
is the
Since
0 e F,
{GO+F}|{GO+G}.
r.
Incidentally,
is negative as long
is empty.
The l a t t e r case
GO+F
GO is replaced by other
l i m i t ordinals.
We now consider
|w.
This is
({0}|{l})x({n}|<j>).
By the
= { r O l I w --gn} = {n}|{u)-n}.
GO minuses.
GO. 2
Using the e a r l i e r
beginning with GO
cofinality.
Hence
GO pluses followed by
w minuses.
Since
be the
a = {n}|{Go-n}.
Let
Then
Since
c o f i n a l i t y condition is s a t i s f i e d ; hence
n,
GO < Go-n+a.
GO = {n}|<|>
u> by
and
n+a ^ n, the
2a = {n+a} |{u)-n+a} = w.
The second method requires a good guess at the answer and being
Go+r.
be handled similarly to
u>-l
ya
and
-^o+r
a.
r
can
yu+r
Such simple
47
i s positive i n f i n i t e )
cofinality.
M
and integers
n,
a = A1 |A"
consider
ll
b = Bl|B".
and
Then
ab
a'^+ab'-aV}.
is
{a'b+ab'-aV,
I t is often convenient to
a'b + ( a - a ' ) b 1
and
or
a"b - (a"-a)b"
ab" - a ' ( b " - b ) .
-rw.
-r = (+-+) = { ^ I d }
by our work on
Hence
| u = ( { | } | { l } ) x ( { n } | c j ) ) = {u + ( | - | ) n } | { l . a ) - ( l - f ) n } .
the above remarks.
be w r i t t e n as
j n + -2(w-n),
magnitude more c l e a r l y .
ju = {-^o+n}|{u)-n}.
In f a c t , i t is immediate by c o f i n a l i t y
u).3
consisting of
that
-^to+n we
pluses
pluses.
times.
in
r
wrn
r.
I t turns out
u) -a)
consists of
u>2-a).
co2-n
pluses followed by
u)2-u)
so we obtain
w minuses.
The key
0)
with
48
Our primary
Let
e be the
In f a c t , i t is
co;
Heuristically,
w.
canonical representation
ew
= (0|^}) (W|<J0
of
Now
0|{}.
1 = {0}|<j>.
Hence
is
{em}|{em + p j -pjiri}.
c o f i n a l i t y theorem.
Regardless of
is
The
m, em >^ 0.
Hence
1 as required.
Note that in spite of the existence of i n f i n i t e s i m a l s there
played a r o l e .
Let us now consider
First, let
r + e where
be a dyadic f r a c t i o n .
and
Then
is a real number.
for
Then
r + e = { s } | { t } + { 0 } | { ~ } = { r + 0 , s + e } | { r + J , t + e } = { r } | { r + ^ } = {r}|G
where
r.
I t is immediate
or
t.
our discussion of the real numbers, since we see from the above that a l l
sequences whose terms are eventually minus are obtained this way.
a l l sequences of length
dyadic
r, or of the form
Thus
r +
w
for
49
be non-dyadic.
Let
r = R'|R".
r.
I t makes no d i f f e r e n c e . )
I t is immediate from
counter-example to naive j u x t a p o s i t i o n .
is worth only
e!
is dyadic or not.
This is
{ 0 } | { ^ } + { 0 } | { ^ } = {0+e}|{|+e} = {e}\{h-e}
Hence
2e
= {e}|{-J}
by mutual
followed by a plus.
2u>.
Note
e.
Consider
followed by a
Next, we compute
value only
cofinality.
Then
r"+e} = { r } | { r + | } = r|G
|e.
=
This i s ( { 0 } | { l } ) x ( { 0 } \ { } )
{ 0 }
{ e }
Th1s
1s
the
se( uence
which is
of
length
w+1
note the contrast between t h i s case and that of -*& where u> minus
1
signs took care of the y .
Next we consider re f o r a t y p i c a l positive real number
which is not an integer.
M
R'|R x{0}|{J} = { e r \ e r
c o f i n a l i t y t h i s s i m p l i f i e s to
pattern by i n d u c t i o n .
er',er"}.
+-J(r-r')}.
By mutual
followed by the
To
In that case
{er'}|{}.
R"
e 2 = ?
In
{!})
= {0> +
. 1_ } ( { }
rw
{ O } ! ^ }^.
It
50
n
b 2 = {bn, + bn o - n,n o ,
1
mi
} I {nb + ~ } '
m2
^1^2
nb + S - ^ >
2m 2m
A
2m
51
52
NORMAL FORM
biJ
n.
> bi
n+i - Jn
Since aiJ
for all
n+i
+ bi
Vi
n
n
Then ai
> ai
and bi
> bi
J
J
J
J
n+i n
n+i ~ n
< aiJ + bi , we obtain that
n.
NORMAL FORM
53
a-j
= ai
and bi
= bi , contradicting the hypothesis that the
J
J
J
n+i
n
n+i
V
representations are distinct.
Case 2. Suppose that we have only sums of k terms for fixed k. Then
the proof is essentially the same, but the notation is minutely more
complicated. If s n has the form a n i + an2 sink "then we simply
iterate the process of taking subsequences for all fixed j _< k.
Case 3. If only a finite number of k's occur, then we can reduce to
case 2 since at least one of these k's must appear infinitely often.
Case 4. Suppose an infinite number of k's occur, i.e. the value of kis unbounded. Then we can choose a subsequence {sn 1 } of {Sn> as
follows: S J 1 = aii + a-f2 ... + ain- where ni is a strictly increasing
function of i. In particular, nj >^ i. Also, we express the sums in nonincreasing order, i.e. a-ji >^ ai2 2. 1. ain- We now choose a subsequence of {s n '} such that a-ji is monotonic increasing, a subsequence
such that ai2 is monotonic increasing, etc. By the usual diagonal
method we obtain a subsequence s n " such that a-jj for fixed j is
monotonic increasing as a function of i for i >_ j. [Since ni >_ i,
a-f j is defined for i _> j.] First we show that necessarily ni > i
for all i. Suppose ni = i. Then consider
si" = aii + ai2 ... aii
and
Since ai+i^j >_ ai 5 j for all j _< i, and si+i" contains an extra
term a-j+i^-f+i which has no analogue in si", it follows that
s
n."
= a
Since a n .j >^ aij for j _< i, since s n " contains more terms than si"
and since s n " is monotonic decreasing, there must exist k such that
aj k > a ^ .
Hence
54
By
NORMAL FORM
Suppose
x ~ a ~ y.
property
and
Let
z - a.
55
l(z)
< (x)
By the convexity
Remark. The same argument shows that the element of minimal length is an
initial segment of every other element equivalent to a.
Similarly to the above one can define additive orders of
magnitude using addition rather than multiplication.
Notation,
Definition.
k11
k '
u> = {0,ro>
}|{sw
where
b1
and
and
b"
b.
By c o n f i n a l i t y we can, of course, l i m i t
s to dyadic f r a c t i o n s with numerator
Theorem 5 . 2 .
to integers and
1.
0.
Furthermore,
56
, i
we have
,n
u>
<< o>
p o s i t i v e reals
defined.
Since
and
0
s, rw
< sw
Hence, for a l l
Also, 0 < sw
Hence
to
is
or
Suppose
c
and
is the common i n i t i a l
a>
If
u)C.
a> ,
i.e.
segment.
Otherwise we have
Corollary.
b < c
= {u)X: xeF}
(/
if
for
i.e.
and s i m i l a r l y for
b = F|G
G.)
b < c - w < w
~.
Proof.
since
i s an i n i t i a l
and
segment of
are a r b i t r a r y positive r e a l s .
x, so
i(u
) <_
Hence
i{x).
Let
a = A'|A"
every element in
Let
F = {y:
x < a < z
A'UA"
F < G, F|G
0 e A .
u> + a 1
has meaning.
i t follows that
x > y.
We use
y e FUG.
Now suppose
x e F, y e G and
x
v
x > y + w
Then
(3xeA') U~J)}
FOG t ft and l e t
Since
u> .
a".
~ x e A1
and
u>y ~ z e A".
a ~ </.
FOG = <j>.
Then
We claim that
wX ~ a 1 e A'
and
F < G.
For suppose
wy ~ a" e A".
Let
w .
Suppose
z = F|G.
Then
Hence
Since
is a complete set of
NORMAL FORM
57
three cases.
ra) _>. a
Case 1 .
fr
1
~~~~~""
""'~~~'
1
a1
e A'
a e
A' s a t i s f y
hence a
some
positive real
~ oo .
Then
ru> _< a
satisfy
a" ~ < / .
but
Let
a' ~ u ~ ru> ;
and some
Then
x e F.
XX
a _< a <_ rw
a" e A"
but
x e G.
Let
a ~ a> .
Case 3.
rJ
and some
X
Case 2.
hence
< a
< sa) .
particular,
e A
(3xeG)(a"~a) X ).
a = A'|A M
Now l e t
r, ru
- {0}.
1
>_ a .
Then
S i m i l a r l y for
a" e A"
In
we have
Since
{0,ru) }|{SUJ } .
Hence
a = {0,rw }|{su) } = w .
has minimal
since, as we have
(a)
w = 1 , (b) w w = w
By d e f i n i t i o n
usual.
Let
and
our u>
oo0 = {0}|<|> = 1 .
a = A'lA"
b = B'|B".
we obtain that
S i m i l a r l y for m u l t i p l i c a t i o n
we obtain that
ab
a'b
b'a
a'b
b'a
a'b1
a"b
b" a
a) a) = {O,roj (o 9r^
a) ,rw co + r.oo a> - rr.u
a) ,sa) w + s.co a)
a" b1' T a" b
b" a
a1 b
b" a
a 1 b11
a" b
-SS.u)
0)
J-|iSo)
+ rui
a)
so)
a) ,S.o)
a)
0) ,rco
co
+ S.U)
rs.o)
a)
, So)
a)
}.
+ a
u>
58
and
u> w
wau) .
Since
u)C
b > c o>
the other
tude.
First
Next,
Next, soua
+ s^
- ss 1 o a
II , r II
rwa
+ s^
- rs^
s2u>
s2 < s
u>
the element
and
are
Then
u>
Let
F(c)
a = {a'}|<|>.
instead of
For the
a>
for
The l a s t equality
This completes
the proof.
Corollary.
u> o>
=1.
remark applies to
sw .
NORMAL FORM
I W ">r-f where
i
NORMAL FORM
59
for a l l
i.
a and r-j
Case 1 .
is a non-limit ordinal.
Case 2.
is a l i m i t ordinal.
form
i s a real number d i s t i n c t
This i s done i n d u c t i v e l y on a.
Let a = 3 + 1 .
We obtain
Then
I OJ ir^
i<a
in the
F|G.
A typical element of
such that
s-j = r-j
for
i < 3, and
S3 = r$ - e
real.
S i m i l a r l y a typical element of
where
e is positive r e a l .
I w ir-j
s-j = r-j
as an a l t e r n a t i v e for
0
i
If
for
w
l
1#
oa 1# Sj,
i<3
S3 = r3+e where
notation
Sj.)
<|>|<j> = 0.
ordinary f i n i t e sum.
For
For
i n f i n i t e , a proof that
F < G is needed in
In f a c t , we shall show
a's
and
r's
x =
Let
I a) "ir-i
i<a
and
y =
y < min(a,3)
then
If
Y = 3, then
x > y
x > y
iff
iff
rY > 0.
a Y > bY
If
I a> i r - j .
i<a
][ u> "isj.
i<3
If
First
or
Y = a
(aY,rY) * (bY,sY).
a Y = bY
then
and
x > y
rY > s Y .
iff
sy < 0.
Note that this is consistent with the situation for the
normal form for o r d i n a l s .
Theorem 5.5.
The expression
decreasing sequences
(a-j)
Furthermore for a l l
3 < a,
60
a.
b.
To begin with, let x = J u irj and y = I w 1 s^ and
i <a
i <a
suppose x > y in the lexicographical order. We must show that x > y
as surreal numbers. If (Vi<B)C(a-f,r-f) = (bj,si)] then either a3 > bg
or a3 = b3 and r3 > S3. In either case u> ^ 3 > u> &S3 by elementary
reasoning with orders of magnitude. Since addition preserves order, it
follows that x > y.
Next assume (a Y ,r Y ) * (b Y ,s Y ) for some y < 3 but
(a6>r,s) = (b6,ss) for 6 < y. Then either a Y > b Y or a Y = b Y and
r Y > s Y . In either case u> Y r Y - u Y s Y _> Y t for some positive real t.
Hence
I u> 'r-,- -
3y
by
= o) T r Y - a)
Ts
I I a) *rj
i
I
i l
ay
si
y ^ a) T t .
I <*> ^ j and
i<3
property by the inductive hypothesis. Hence
1#
rj| a) Y
and
less than
t.)
1s
nave
tne
Again, since
1#
bo
b
a
a> p m Y co Y,
.
(We can
we obtain
x-y =
1#
Xw
i<3
\ I u isj
i
Therefore
use any
a)
However
1#
a.
ae
x and y
have a
x gains in the
Yth
stage is necessarily
NORMAL FORM
not present.
61
Then we use
bY
u> '
| I co i r j - I to ""r-f | .
i <a
i <y
| w 3 r 3 + ( I to 1r-f -
i<3
I a) ">r-f) | .
instead.)
property.
Let
y < a
and
j < y.
This i s
If
y = 3
t h i s reduces t o
|to &ra|
i<y
a ;
co J .
Otherwise
|to ^r^|
is
to , and so is
a is a l i m i t ordinal.
By the inductive hypothesis the ordering for elements in
FUG
We must consider
| I co i r j - co " r-f | .
i<a
i<3
F in the representation of
J u Vj
i<a
F < G.
3 < a and
j < 3.
a.
of
Let
a.
is
a-;
This is
a.
aj
and
y =
J u isj
and suppose
x > y
i n the l e x i c o g r a p h i c a l
62
order.
i<a
(a$,r$) = (b,s,ss)
Suppose
a.
I a) "'r-j
i<a
Therefore
ay
I a) "> m-f |
a)
and
I I to ""s-j -
J ^ S J I
by
u
av
<u
i<y
i<a
i<y
~wa l. r
b.
atY f o r s o m e ~~
w l s
w
).
i " I
i 2.
p o s i t i v e real t .
i <a
i <a
I f only one of x and y has a representation of length
a,
Proof.
|x| ~ cu
such that
i s bounded above.
Let
r t 0.
r = .u.b. S.
e;
\ a) r 1 *.
i<a
ordinals.
hence
Since
Then
|x-w a r| wa.
> x and
Since
(r-e)co
|x| ~ ( /
<x
it
x * 0.
for some a.
su>a <_ x.
We know
We define a sequence
Suppose
A [ x - I a) 1 ^ 1
i<a
(a-j,r-j)
(a-j,r-j)
where
i s defined f o r a l l
runs through a l l
i < a.
the
Then
= a) <*ra.
NORMAL FORM
63
a
a
A[x- I u> ir-j] = u Prp.
i<3
a
a.
a.
a
a
o> a r a = Afx- X a) ""r-jl = Af (x- I u> 1 r 1 # ) - a) ^ 3 ] a> & by the inductive
i<a
i<3
d e f i n i t i o n of (agjrg). Hence a a < a$.
Now l e t a be a l i m i t ordinal and l e t 3 < a. We already
know that
a.
a < 3.
J o> ""r-f
i<a
F and
G used in the
a.
I a) ir-j. Hence the former is an initial segment of the latter and thus
i<3
it has smaller length. The uniqueness of representations as sums guarantees that the length is strictly smaller. This is enough to verify the
inequality since every strictly increasing function f from ordinals to
ordinals necessarily satisfies f(x) >_ x for all x.
If a is a limit ordinal we already know that
a.
a
|x - I a) ir-f I a> & for any 3 < a. This shows that x satisfies
i<a
F < x < G for the F and G used in the representation of J u ir-j.
i<a
a.
Hence a( I u> ">r-,-) < (x). This is true for all a. In view of the
i<a
earlier inequality this implies that i[x) is above every ordinal, which
is absurd. This contradiction completes the proof.
We have now established the normal form for surreal numbers.
The usual representation of ordinals in terms of powers of u> is a
special case of this, since finite sums correspond to ordinary addition
64
and this agrees with ordinal addition i f terms are arranged so that there
is no absorption.
Next we shall show the fundamental fact that the basic
operations can be performed on elements in the normal form analogously to
usual operations on polynomials.
abbreviated
We shall study
n(x)
in more detail
x,.
later.
We now need some lemmas which will help us deal with the
normal form.
Lemma 5.2.
where
is an arbitrary positive
real number.
We know that
a>a = {0,su>a }|{ta) a }
positive r e a l .
Hence
r = {r-e}|{r+e}
by definition where
u*r
r.
Also,
are arbitrary
Let
OJ
a)
a) ( r - e ^
a)
el
,
e.
Since
w (r+e^.
tractability.
a.
a.
a
a.
a
I w ir-j = { I u Vi+w a ( r a - e ) } | { I w Vi+ca a ( r a + e ) } .
i
i
i
addend.
and
i
NORMAL FORM
65
lexicographical order. Since the argument is similar for the upper terms
terms, the result follows in this case.
In general, every ordinal a has the form a'+n where a'
is a limit ordinal. We can now use induction on n. For convenience of
notation we can assume the result for
I
is that the typical lower terms which are discarded because of cofinality
a.
a
now have the form
J u V j - w a e by the inductive hypothesis; the
i<a
upper terms are similar.
It is convenient to extend the definition of I u ir-f to
i<a
the case where r-j may take the value 0. In fact, we use exactly the
same definition, but we of course no longer have unique representation.
Lemma 5.4. Let r-j be a sequence of length a, and let {nj} for i < 3
be the subsequence of i's such that r-\ * 0. Furthermore, suppose
t>i = a n 1.
Proof.
and
si = r n ..
!
Then
a.
b.
i u ] H = I ul s 1 i<a
i<3
Essentially we
We do
I a) "ir-,*, the
r$ = 0, since i t
i<a
leads to elements in
FUG
such as
J m V j + ew P. However, by the
i<3
cofinality theorem we s t i l l obtain the same element. The trickiest case
which occurs is the following:
integers
n.
i
side is defined in terms of an
an ordinary sum.
F and
a> 1#rj = J to i r j .
The left-hand
i
G whereas the right-hand side is
u> ir-j
66
a.
Y
2. w l r i " w e W 1 ' t n y < &+a)> a n d there is a similar expression for a
i<T
typical upper term. By lemma 5.3 the right-hand side is
is
{ J w
i<3
1#
Proof.
We use induction on
3.
If
I o>
j<3
ra+j.
If
is a l i m i t
If
a+J
I u>
r a + j = F|G
j<3
c o f i n a l i t y the right-hand side may be expressed as
then by
{ I a) * r j + F } | { I to fr-f+G}.
(We invoke the lexicographical order and
i <a
i <a
reason as in the proof of lemma 5 . 3 . Thus a typical lower term has the
form
I u) ir-j + ( I u
i<
j<y
a+Jr
a+j
- ew a + Y )
for
inductive hypothesis is
i
NORMAL FORM
67
a.
Proof.
a.
a.
Theorem 5.7.
If
As usual we
In fact,
a.
a
a
a.
I a) Mr-j+sj) + a) &r$ + ca &S3 = I w "(r-f+s-j) using the ordinary
i<3
i<a
distributive law and the definition of summation.
Now suppose a is a limit ordinal. One typical lower
element of the sum is ( J u ir-j - u &e) + ( I w isj)
i<3
i<a
= ( J u i'rj - to ^ e ) + ( J u ">s-f +
I a) is-,*)
i<3
i<3
3<i<a
a.
(
I w
3<i<a
1s
by lemma 5 . 5 .
a
i
I
u> 3+1+1S3+1+-J.)
i<a-(3+l)
By the
I u> i(r-j+s-|) - (w e e ) +
I a> ""SJ. By
i<3
3<i<a
the lexicographical order this is mutually cofinal with
I a) Mr-j+s-j) - a) Be#
i<3
I to "r-j + ( J u
i
1#
s-f - w &e)9
and
a.
I u> Mr-i+s-j)
by d e f i n i t i o n , or by lemma 5 . 4 , i f
r-j+si = 0
i
We now turn to multiplication, and prove the remarkable fact
b
a.
b+a.
o> [ I 00 " r- "| = J u
">r-j.
i <a
i <a
68
a.
a.
Since
II
oo >> oo
b
_> u [ d " - d ' ] .
Hence
Similarly
h"
h'
h
oo t ( d - d ' ) + oo s(d"-d) _> oo [ d " - d ' ] ; hence
oobd"-o)b s(d"-d) oo b d' + o)b t ( d - d ' ) .
Therefore by c o f i n a l i t y
wbd
can
d/
be
Let
dx'
and
K'
s(d-d').
Similarly
K'
Hence
b+a.
b+a
{ I oo ir-j+oo
B e } , which by d e f i n i t i o n i s
b+a.
I oo i r - j .
NORMAL FORM
69
xy
aj+bj
aj+bj
occurs only
is well-ordered.
Hence
avoid trouble.)
I t is well-known and easy to verify that with respect to
formal polynomial multiplication and addition one gets a ring.
In fact,
Theorem 5.8
Proof.
i . e . the product
a = Y+1.
a or
e is a non-limit ordinal.
8 = Y+1.
b
Then
a
+ u
KB
a-y
KY
i<8
1<Y
i<6
addend to obtain
a +b
i
n aY+b.
f r
lr s
[ i ID i r-jsi + ]. w
Y i ) T ^ e argument is now completed by
i<Y
i<3
j<6
theorem 5.7 which tells us that formal addition works.
Now suppose that a and 3 are both limit ordinals. We
y = J i c V j , z = \ u> 1#SJ and
i<a
i<6
are lower or upper elements in the representation of y and z
yz
yz
70
+ y z - y z .
+ y xz - y xz
= (yxz) - (y-y ) x ( z - z )
by ordinary algebra.
(Recall that we already know that surreal addition agrees with formal
addition.)
if
and
yz
Now
y-y
u> Y e x + cx
for
some
av
p o s i t i v e real
6
a) e
+ c
|c |
e1
and some
for some
a)
|c 1 1
6 < 3, some
(y-y)x(z-z0)
Therefore
i f and only
co
Similarly
positive real e
y < a, some
6e
e
1
z-z
and some |c |
+
S*
wnere
we have a plus in f r o n t of
u>
yz = (yxz-w
< r
u> r.
+ c
series for
yxz.
v a l i d even i f
form
aY+a<5
yxz
ay
< to r
$e).
| c 1 1 , | c 2 | u>
+ c .)
yxz.
Now
e)|(yxz+w
By
aY+a
u> 5 .
yxz
has the
possibi 1 i t y of c a n c e l l a t i o n ) .
Hence by c o f i n a l i t y
yz, i.e.
yxz
yxz = yz.
This
completes the
proof.
Remark.
Note
s a t i s f i e d , we don't need i t .
NORMAL FORM
71
Theorems 5.7 and 5.8 give us a powerful tool for dealing with
surreal numbers. In fact, for many purposes we can simply work with
these generalized power series and ignore what surreal numbers are in the
first place. This is an example of the whole spirit of abstraction in
mathematics. However, there are limits to what can be accomplished by
general power series methods since the surreal numbers are somewhat
special. Note, for example, that the class of exponents is precisely the
class of all surreal numbers, which in itself is unusual.
Let us see what the power series methods accomplish. First,
we have an alternative way of dealing with inverses and square roots
which is much easier than the direct method used in chapter three. Let
us consider, for example, the inverse. The essential idea is as follows
follows.
Let
Theorem 5 . 9 .
positive integer
Proof.
72
n.
J. u
Let
form
GO r o [ l +
GO / n
a.l r
&e
surreal number.
I u> i s j " | .
0<i<a
Now GO r 0
has an
r0
is positive.
Hence
form
by determining
a = 3 = GO and the
terms
aj's
inductively.
and x j ' s
(For
are simply
might be d i f f e r e n t from
GO
where
For
a.)
(1+ I GO i y j ) n
Suppose that
all
XJ and y j
z _> XJ f o r some
agrees with
1 +
I GO i r j
for
i < Y , but
(1+ I GO i y j )
* 1 + I GO i r j . (Recall that in our generalized power
i <Y
i <<*
series the exponents are decreasing.) Then we claim that there exists
x
z >_ x.
aj
Furthermore, i f a l l
f o r a l l terms
GO
where
with i n t e g r a l c o e f f i c i e n t s , then so i s
and that
aj
x.
(The f a c t that
xj
is
ordinary power s e r i e s . )
In f a c t , l e t
X
(1+ I GO i y j ) n and 1+ I GO i r j
d i f f e r . Then x < XJ
i <Y
1<a
i , and the respective coefficients, s and t , satisfy s * t .
ficients of
for a l l
in
z >^ Xj
for some
sible disagreement is
s * t
G
O
i.
The earliest term for which there is posand, in fact, its coefficient is
satisfying
s+ny = t .
and
s+ny.
Since
(Uniqueness
y
the claim is
NORMAL FORM
73
with
r-f
real and
d-j
Let
infinitesimal.
i=l
n
Po
and Q o . Then
x + I hjx "
il
74
r-1
and s-1 respectively for i > 0, but ( J u " P-f)( I u " Q-f) * f(x).
i<3
i<3
We shall find b$ such that a$ < a-f for all i < 3 and
polynomials P3 and Q3 of degrees at most r-1 and s-1 respectively
b
b
b.
b
( I a) iPj+w 3p B )( I a) TQi+o) % )
i<3
i<3
exponents y such that y _> b^.
so that
Let
of
all
agrees with
bg < bf for
I w ip-j+o) ^G and
I w iQ-f+w ^H
1 <3
i <3
where G and H are polynomials to be determined l a t e r . Then
b
b
b.
b
( I a) Pi+w G G ) ( I w TQ^+U e H) agrees with f ( x ) for a l l terms up to
i
i
NORMAL FORM
75
n c
n
i
x . Let c = max .
1=2
stitution
x = yoo0.
f i c i e n t of
c
(yw C ) n
(u>
yn +
r-j)o)
n
+ Y a-jfyu)0)11""1 = 0,
i=2
I ajaT^y 11 "" 1 = 0.
i=2
.
By choice of
The coefc
we have
i
<_ c with equality for at least one i, i.e. ci - ic 0. Thus all
coefficients begin with terms with non-positive exponents and at least
one term begins with exponent 0.
If an odd degree polynomial is factored into irreducible
factors at least one of its factors must have odd degree. Hence to prove
the theorem it is enough to show that an irreducible polynomial of odd
degree must have degree one. If we apply the above construction to an
irreducible polynomial the polynomial remains irreducible. Hence by the
76
contrapositive of lemma 5.7 the real part of the polynomial does not have
two relatively prime factors, i.e. has the form (x-a) or (x +bx+c) .
Since the degree is odd the latter possibility is ruled out. Hence the
real part of the polynomial has the form (x-a) n . Since the coefficient
of x11"1 is 0, it follows that a = 0. Therefore the real part of the
polynomial has the form x . This contradicts the fact that at least one
term besides x11 begins with exponent 0.
Since the construction leads to a contradiction, the polynomial itself must be x11. (We are not talking about the real part.)
Since the polynomial is irreducible n must be 1.
The last part of the theorem follows from the same proof.
For this purpose we restrict ourselves to surreal numbers whose exponents
are of the form referred to in the statement of the theorem.
E SIGN SEQUENCE
Our aim in this section is to obtain a formula which
expresses the sign sequence for I u> ">r-f in terms of the sign sequences
for a-j and r-j.
It is natural to look first for the sign sequence for w .
However, in order to carry through an induction we need to know the sign
sequence for certain special finite sums along the way. Thus caution is
required with the induction in order to avoid circular reasoning.
Specifically, we deal with finite sums of the form
w 1#r-f,
i<n
where aj+i is an initial segment of aj for all i and rj is either
an integer or a dyadic fraction with numerator 1. (It is understood
that the aj's are strictly decreasing, since we are working with normal
forms.)
We first need some lemmas which are roughly variations on
lemmas 5.2 and 5.3. In proving these lemmas we used the fact that
r = {r-e}|{r+e}, which in the case where r is dyadic involves throwing
out information. (This representation is cofinal but not mutually
cofinal with the standard representation of r.) We now see what happens
if we do not throw out information. In order to cut down on duplication
later we shall deal with a general dyadic although for our immediate
purpose we need only integers and dyadic fractions with numerator one.
NORMAL FORM
77
we obtained
integer.
{r-l}|<j>
i n the form { s } | { t }
if
r = with
if
i s a positive
n > 0 we have
Lemma 5 . 8 . ( a ) I f
r = {r'}|{r"}
is non-empty, then
a 1 as usual
i s an a r b i t r a r y positive i n t e g e r .
A = {a)V}|{u)V}.
b) If n is a positive integer greater than one then
a
a1
a"
a) n = {u> ( n - 1 ) + a ) m}|{a)
empty, where
m i s an a r b i t r a r y positive i n t e g e r ,
o>an
o> r
Again
is
u = {0,sw
a"
} | { t w } . Hence
u> r i s
Mr-r1)}.
)(rll-r)9
among the upper and lower elements and replace terms such as
a
a'
a
a'
w r 1 + (soo )(r-ri)
by
w r 1 + w m.
a1
i s non-
desired form,
(b)
that now we need the t h i r d term of the upper elements since the other
a"
a
terms are not present. Since u> >> o> , by c o f i n a l i t y this term may be
a"
replaced by u> e.
Note that negative integers can be handled by sign r e v e r s a l .
Also, r
= 0
formulas s i m p l i f y .
element is
u> n.
I f the set of
Otherwise
a's
A convenient
n
a-,* can be expressed as
il
{a1+a2...+a .'...+an>|{ax+a2...+ak"...an>
78
In fact,
1 k n.
where
1 j <_ n and
1#
I w 1#
rj referred
i<n
I t is understood that r-\ * 0
form
i.
a.
2. w 1r1- be a surreal number where for all i , a . , , is
n l
i<n
an i n i t i a l segment of aj less than a-j and r-,- is a dyadic fraction.
a.
Then
I w ir-j can be expressed in the form F|G where a typical element
i<n
x in G is obtained as follows:
Lemma 5.9.
(a)
If
Let
"
be the minimum
,.
Then
I to ir-j + a) n - 1 r
" - a) T"1 m where a
' is a typical
n i
n i
1<n-l
"
"
lower element in the canonical representation of a ,
and m is an
x =
( I f there is no a
'
omitted.)
(b)
If
i , then let
rj
Then
a ;
x = I oo r1- + u r j - u>
and
(c)
If
is
m where
rj
a ;'
rj
rj,
aj
then
r-j
representation of
a0
and
numerator one.
(d)
If
then
r-j
G is empty.
A typical element of
Proof.
and a 0
is an ordinal
is clearly an ordinal.)
is obtained similarly.
Specifically, in the
NORMAL FORM
79
consider
G.
F can be handled s i m i l a r l y .
In p a r t (a) the s i t u a t i o n i s analogous to that of lemma 5 . 3 ,
x.
If
I ou i r i
i<n
i s replaced by
a "
a"
replaced by
introduced.
u k term.
a
Now i f
k = j,
For
term is the f i r s t
is replaced by
is an i n i t i a l segment of
i n i t i a l segment of
a
< a
we a
aj
ak".
k > j.
Since
a^
a^11 > a j .
Furthermore
Recall that
is an
Since
of i n i t i a l segments.
au(a) = - .
aj(a) = - .
Now suppose
apply transitivity.
have of
a^
so is
n.)
Hence
Suppose
Since
a^"
a^
has length
a.
Since
is an i n i t i a l segment of
a-j
Thus the terms obtained by altering u> r j
a,
the terms obtained by altering u> krk for
a^" > a j .
"
2, a) 'r-j +o) u r j - a)
i<j
we need a l t e r is the
j'
m + I w "'r-j,
j<i<n
j ^ 1 term.
80
a.
a -i
a -j'
a
k " > ao Hence the above expression i s mutually cofinal with co k 6.
a "
When k = 0 t h i s is simply co 6. F i n a l l y , since every element of the
ii
form
a|<
k = 0,
a0
a0".)
co .
Let
of length
aa
a,
and l e t
a+
a.
Theorem 5.11.
with a plus.
u>a i s as follows.
a we have a s t r i n g of
co n, where
a * i
co
We begin
pluses i f
than
co n minuses.
co
and
since
is
NORMAL FORM
81
a.
I w 1 r 1 # , where
r-j
i s e i t h e r an integer or a
i , a-j+i
i s an i n i t i a l
segment of
from
a-j
by ignoring a l l minuses.
Before proving the theorem we i l l u s t r a t e with several
examples.
First let
f i r s t term in
a = (+-).
a, which is
Then
+, gives rise to
u>
- , gives rise to
= u>
2
gether we have
minuses.
minuses.
a =j
this is
Then the
1+1
So a l t o -
(In juxtapos-
Incidentally,
compute
w2
w3
pluses.
By rule (b)
b 1
u) ^j and a) ^ j . )
Finally we have
was +00 3.
rather than
However,
g(x)
obtained from
x = I <*> * r-f
82
a.
x = I GO ""r-j,
g is one-one. Suppose
1<m
b.
1
y
I " S j , and g(x) = g(y). Assume first that a 0 = b 0 . Then g(x)
i<n
and g(y) have the same tail after discarding the initial segments
a
b
corresponding to <*> o = w o. Now a + ,a +, ..., a + is a decreasing
=
a.
J ID i n-j.
i<m
Thus the length of the tail determines a-j+ and n-,*
uniquely. Furthermore, a-j+ determines a-j uniquely. This is because
a-j is obtained from ao by stopping at a plus. Finally, the signs of
the various strings determine whether r-j is an integer or a dyadic with
numerator 1 and the value of n-j determines r-j. Thus x = y.
Now we rule out the possibility that a 0 t b 0 . If neither
a 0 nor b 0 is an initial segment of the other, then clearly a discrepancy between g(x) and g(y) arises at the point where a 0 and b 0
differ. Suppose without loss of generality that a 0 is an initial
segment of b 0 , and consider the tail following the sequence for GO .
a. +
The length of the tail of g(x) has the form
J u 1 n-j which is less
than
a ++i
u)
.
i<m
The t a i l of
identical signs.
g(x)
So certainly
I t is clear that
form
a + +i
to
g(x) * g(y).
g(x)
has the
x = GO .
An element of
g(x).
is at the beginning so
F = {0}.
a at some place
the i n i t i a l segment of
of length
a.
Otherwise a plus in
a where
a.
a(a) = +.
There are
GO
pluses where
c < to
selves to values of
.
c
Let
b be
pluses in
is
b with
GO n
n.
GO (n+1).
But
NORMAL FORM
83
a where
one.
jO,u) nf Nco
IJf - co = x.
2"
Next l e t x = co n, where n is a positive integer larger than
Since
g(x)
is obtained from
g(co )
by c o f i n a l i t y .
f i n a l i t y we assume that
a+
c
and
m is a positive integer.
a+,
so
less than
Now l e t
runs
runs through
w
But
m#
a'
this reduces to
g(x)
In
co (n-1).)
g(x) = co n = x.
x = u>a(-).
g(u ) = co .
co () - co m.
a'
l+
a,
Now as
coa (n-2) +
elements as
Thus we add
= {0}|()
g(x).
Let
g(x) = F|G
Suppose f i r s t that
i > j .
r-j
in
g(x)
G1
cofinal
which is
t n
term with a
g(co r j ) , where
84
statement of lemma 5.9(b). This follows by the earlier part of the proof
dealing with monomials. By the inductive hypothesis this is
a.
ai
a i'
j. u) irj + a) J r j " - w J m.
If r-j is a positive integer for all i, then g(x) is
a
obtained from g(o> o) by adding on pluses only. Hence g(x) and
a
a "
g(w o) have the same upper elements, namely u o e,
A similar argument applies to the lower elements. In all
cases the upper and lower elements obtained for g(x) are just what we
need by lemma 5.9 to deduce that g(x) = x. (For convenience we unified
various cases. For example, in lemma 5.9 part (a) may be regarded as a
special case of part (b) and part (d) of part (c). There is a pedagogical advantage in separating cases at the beginning for the sake of
concreteness, but at a later stage it is repetitious and tedious.)
We are now ready to determine the sign sequence of a general
sum
I u) i*rj. First, we define what we mean by a reduced sequence a$'
of
a$, where
3 < a.
a$ of
a3
if
II
if
a$(6) = -
is obtained from
a$ by
ae:
an i n i t i a l segment, and r
has
followed by a minus as
discarded.
a^ 0 = (++-),
NORMAL FORM
Theorem 5 . 1 2 .
(a)
85
o b t a i n e d by j u x t a p o s i n g the sequence f o r
from
o>ar
u
f o r p o s i t i v e real
is
r,
times.
(b)
If
(c)
I w irj
i<a
is obtained by juxtaposition of
0
a.
o
u> " rT- where a-j
is the
Consider
u {^-).
Here
We already saw
Since -x = ( +-)
2
a> pluses
a>22 minuses.
followed by
Proof.
L
I
u>2 + w8.
We f i r s t consider the case where the surreal number has the form
uniquely.
r-\
Recall also that lemmas 5.8 and 5.9 deal with general dyadic
coefficients.
theorem 5.11.
The subcase where x = war with r a positive dyadic
f r a c t i o n but neither an integer nor a dyadic fraction with numerator one
i s similar to the cases
proof of theorem 5.11.
x = wan
and
In f a c t , l e t
x = ^("fj")
r = {r'}|{r }.
Note that
r'
is
the i n i t i a l segment of
and
r"
86
By hypothesis
begins with a plus following which there is at least one plus and one
minus.
g(x)
we obtain co-
"r".
obtained by adding on
m minuses to
g(w r")
element is
g(x) = w r = x.
r-j
x = co r
g(w2)
For example,
by the formula.
where
is not dyadic.
g(oj/7)
can be extended to the present case by noting that even i f the length of
a + +i
the t a i l of g(x) is u)
the signs are necessarily not a l l alike
so
g(x) * g(y).
5.2
oj r
r = R1|R"
is one-one.)
may be expressed as
aj R'|to R"
since
By lemma
is not dyadic.
(Note
g(x).
Since
Uar").
Hence
so f a r .
and s t i l l have
g(w r ' )
to be an i n i t i a l segment
n(x)
which a p r i o r i seems
n(x)
NORMAL FORM
87
(a)
(b)
(c)
for
(a)
(b)
i _> a, a-j+i
dyadic.
I.e.
I u> ir-j
i <a+n
is an initial segment of a-,- and r-j is
a.
where
is a f i n i t e sum
a.
rn(x) _< rn( I a> ">r-j) + 1.
i<a
a.
Let x have the normal form
I co ir-j and express x as
i<a+n
I u> "r-j + y
i<a
Note that
a.
I a) Tr-j + y. Suppose that y is a surreal number which satisfies the
i<a
hypothesis of lemma 5.9. Then x can be expressed in the form F|G
If y
88
then
x
(3)
If y
a a " exists or if y
satisfies case (d) (in which case there certainly does not exist an
a a ")
then
a
I a) ir-j
as given by the d e f i n i t i o n i f a
by lemma 5.3 i f
A typical element of
Proof of lemma.
where
z"
is a l i m i t ordinal and
is a non-limit ordinal.
F is obtained s i m i l a r l y .
Typical elements
x"
are
and
z"+y
terms of the former type are clearly cofinal by the lexicographical order
thus proving the lemma in these cases.
a "
I OJ 1 r-f + w a e.
Since
a-j's are decreasing and since the a a " are initial segments
a a " or has a a M
followed by a
NORMAL FORM
89
aa"
sponding
aj
t h e r e i s no a j
satisfies
j < aa", j
satisfying
aj = aa".
aj < aa".
Then the c o r r e -
Since there is no
aj
a,
satisfying
i.e
a = j+1.
This
aa"
ra-i.
aa.
rn(x)
is a non-limit ordinal.
where f(Vi^ot)(a-f+i
is an i n i t i a l segment of
aj
Let
and
rn(x).
I OJ n r-j
i <a+n
is dyadic) or
x =
r-j
and
r-j
is non-dyadic)l.
Then
a.
a.
00
we may assume the result for J u V j . We now use induction on Jig(y ))
i
a o
where y=
I
toa i r-j. The argument is similar to the one used in
y= a<i<a+n
I
i
the proof of theorem 5.11; however, there is a complication because of
the need to consider reduced sequences.
F and
SA = SF|SG.
g as a function of
I a i"rj.
of
aa
A = F|G
is one-one.
aj
for
(Yi<a)(aj>a a )
so that the
*aU) * a(J)
but
aa(i) = ba(i)
a a * ba
for
i < j .
aa.
Hence
but
a a = b a .
Without loss of
(33<a)(Vk<j)[a B (k) = a o ( k ) ] .
ba < bg = a$,
Assume
ba(j)
By the
is necessarily minus
90
is the juxtaposi-
juxtaposition.
Suppose
is negative.
by
aa.
is positive.
Suppose f i r s t that
This corresponds to
g(y )
as in lemma 5.10(1).
( I
y = { y ' 0 } | { y " 0 }
O H g ]
g(y).
We can obtain
g(x)
Now i f
by considering
y = {y'}|{y"}
and furthermore
i<ct
i<a
i<a
by t r i v i a l reasoning with juxtaposition.
By the inductive hypothesis this is
a.
a.
a.
{ 1 ^ i + y'}|{ I " l r i + y"} which is I u ^ + y as desired by
i <a
i <a
i <a
lemma 5.10(1). It is worth remarking that juxtaposition works trivially
in the argument here since the minuses ignored in y depend only on the
nature of a-,- for i < a so they are the same for all y 1 and y".
Now suppose all minuses in g(y) are contributed by a a ,
i.e. by minuses in a a which are not ignored. Such a minus corresponds
to an a a " . Assume first that case 2 of lemma 5.10 holds. Then by
cofinality we may limit ourselves to such a a " and the juxtaposition
argument is identical to that of the earlier case.
The most subtle case occurs when case 2 is not satisfied. By
the remarks following the proof of lemma 5.10 this can happen only when
a has the form j+1 and the minus corresponding to a a " = aj is the
last minus in a a . Since we are dealing with a minus in a which is not
ignored, r a -i is dyadic. (This case is the most delicate with regard to
the issue of ignoring of minuses.) We obtain a subset cofinal in the
upper elements of the canonical representation of g(x) by taking
+
a
sequences of the form g( I <*> i>j) followed by </a pluses and
a) a n minuses for some integer n. This follows because first all
minuses preceding the one corresponding to a a " are ignored in y .
NORMAL FORM
91
g(y)
begins
a) a
minuses contributed by the minus corresponding to a a . Now
.,
a.
a.
a ;
a.
dyadic
to
[g(x)]n
upper element
i n the canonical r e p r e s e n t a t i o n o f g ( x )
a.
g( OJ 1 r 1 - ) , i . e i s an upper element in the
i<a
corresponds to a minus in
{ I w
i <y
l r
i "
w Ye
lll
I u
i <y
i r
+ w Ye
"}
Now g( J u V j ) = I OJ ir-j
i <a
i <a
where
Y<a
and e
such that
I u i r f + w e = g[ I OJ V J + OJ Y e ]
i<Y
i<Y
hypothesis.
[g(x)]" >
(g(x)) M
a.
9( 1
i
w 1 r
i).
a.
Y
gf I OJ V-j + OJ e]
i s greater than
= g(x).
g| ), u i r j + OJ e] = ]i u i r j + OJ e > I OJ Tr-j
i^Y
i^Y
i<a
Thus by the lexicographical order the only d i f f i c u l t y
92
II w V j + wYYee. In
i<Y
a.
aY
order for this to occur rY is necessarily dyadic.
I u> " r-f + 03 'e is
i
the sequence
I u> 1#rj followed by u>Y
pluses and w Y n minuses
i<y
a.
for some integer n. Therefore the contribution of
I u V-j to
a.
a) V j consists only of pluses. Necessarily y < i + r-j is a
i<a
positive integer and y _< i - ai+i is an initial segment of a-,*. The
latter follows since a-j+i < a-j as, otherwise, a-j+i would have a minus
not occurring in aj and, a fortiori, not in any aj for j < i, and
a.
would thus contribute a minus to J u V j . It follows that a = y+n
i<a
a.
_ a. +
for some integer n. So I w "*r^ contributes
I w " r-j pluses to
i
y<i<a
I to V j where y <_ i - a-|+ > a . j + i + . Furthermore g(y) contains
i<a
only pluses. Since r a - i i s dyadic a l l minuses in a a are contained in
arises when
I a> iir j
i<a
is
i an iinii tii all segment off
aa-i;
f i n a l l y shows that
consists of
the sequence
I u> V j
i<Y
g(x)
(In general, i f
rn&(x)
Now suppose
rnl(x)
rn(x)
is a l i m i t ordinal.
is
Then
u>a + m, then
are a l l f i n i t e . )
NORMAL FORM
93
Since
z=
g(z)
segment.
Hence
Now l e t
has the form
z < y
g( I o> 1 # rj) as
i<3
followed by a minus as an
which contains
y
by the lexicographical
order.
be an a r b i t r a r y upper element of
I u> "ir-j +
l<3
3e
x.
Then
is a
g(x)
is cofinal in
a.
i.
However, since
a-j+i
con-
J u ir-j
Y<i<ct
to
g( J u ir-j)
i<a
for which u Y r Y
is an i n i t i a l
segment of
is a nona-,*
for
contradiction.
I t follows that
since otherwise a l l
g(x)
cannot be an i n i t i a l
contributions of
u> ir-j
for
segment of
3 < i
to
g(x)
y
would
and
g(x)
differ.
Suppose
(g(x))(j)
= +.
[g(x)](j)
for which
imply t h a t
false.
6 > y
y < g( I u ir-j) = ( J w V j )
where 6 = max(3+1,Y) which is
i<6
i<<5
Hence ( g ( x ) ) ( j ) = - .
By what was said e a r l i e r there exists
such that
a) 6 r $
of
g(x).
g(z)
g(x).
contains
This
g( J u i'rj)
94
95
6
a(ab)
(a)
3
(b)#
[We
are
using
ordinary ordinal
exponentiation.]
Proof. Using induction as in the proof of theorem 6.1 we must consider
elements of the form ab + ab - ab.
96
U(~)| XoU(a)|
[i.e. unless
97
Lemma 6 . 1 .
Proof.
If
|&Ua)| = |i(a)|.
(a) (u )
a, l e t
by pluses.
be obtained from a
co
hence
certainly
that
98
operations
different.]
S i n c e t h e c o n s t r u c t i o n o f t h e s i g n sequence f o r
c o u n t i n g t h e number o f p l u s e s a t v a r i o u s s t a g e s , c l e a r l y
|*(a)b)| = |it(b)| = | U a ) | .
However,
the f a c t t h a t
Lemma 6.2.
Proof.
I u)
\i{^
Since
a+1
r)\
Hence
\i{>/)\
i{u
involves
) <_ (co ) .
<_ | * ( a ) | .
(Recall
= <A)
= \i[u> )|
i s an i n i t i a l
segment of
u> r
i f a * 0.
i t follows
that
u> r
i s obtained by f o l l o w i n g
(w r) U ( o ) ) ] [ u ) ] .
a
has i n f i n i t e
Lemma 6.3.
If
Therefore
length.
w r
by ignoring minuses.
a
| i ( w r ) | X0\i{^)\.
Hence
If
a * 0
In general the d i f f i c u l t y
segment of
then
then
I f OJ r
so the r e s u l t is
a> s t r i n g s
of
by at most
Note f i r s t
immediate.
ID
followed by
w+1
pluses and
followed by
u>.2
minuses.
-uT^uT2
-w" 1 + uT 2 .
u> minuses,
Thus
uT2
w"
consists of a plus
cannot be a subsequence of
If a
99
a
wrrt
as well as
w r.
d
a) .
is
has
signs.
pluses and
ooa
signs
contributed by a non-dyadic
contributing
Recall that i f
to
wr
since
has
a
Let a = I u> <*ra. Then
100
Remark. It turns out that this lemma is not needed for the proof of the
theorem. However, I feel that it is of interest in itself and fits in
naturally in our present discussion of lengths. A similar remark applies
to lemma 6.5. What is really needed is lemma 6.5a which strengthens both
the hypothesis and conclusion of lemma 6.5. Lemma 6.5 is of interest
because of the surprising fact that the bound is sharp in spite of the
lack of any hypothesis on the length of 3.
Proof. If (a a ) is finite for all a then there can be at most w
terms so we need only consider the case where at least one a a has
infinite length.
Let d = |i.u.b. (a a )|. Then all a a terminate at d or
earlier. By a simple cardinality argument there are at most 2^ such
sequences. This is not good enough for our purpose, and in fact we shall
show that this can be strengthened using the fact that the a's are
well-ordered.
In fact, assume that there exists a well-ordered decreasing
sequence of surreal numbers (a a ) containing d + > d members where d +
is the successor of d. We show that this leads to a contradiction.
First consider the sequence a a (0). Since this can take on
only the values plus, minus, and 0 by the lexicographical order there
exists an a a such that a a (0) is fixed for a > a 0 . [E.g. if a a (0)
is minus for any a it must be minus for all larger a.] If the fixed
value is 0 we have an immediate contradiction since this says in
particular that a a = a a +x = 0. (Recall that 0 stands for
"undefined.")
We now construct a monotonic increasing sequence (aj)
defined for all i _< fc.u.b. (a a ) satisfying a a (i) = aai.(i) for
all a > aj.
Suppose otj is defined for i _< j. Since aj
is monotonic increasing we know that a a (i) = a a .{i) for all a > aj and all
i _< j. Consider the subsequence a a (j+l) where a > aj. By the lexicographical order this is monotonic decreasing. Hence the same argument
as in the case a a (0) shows that there exists an aj+ x such that
a(j+l) = a a # (j+1) for all a 2 . a - + i Note "tnat the subsequence
101
expressed as
Lemma 6.5.
| 3 | _< . u . b .
|(aa)|
a
If a = \ o> a r a
a<3
then
Xo.
Lemma 6 . 5 a .
then
If a =
|t(a)| < d.
a
I co r a a n d i . u . b . ( | a ( B ) | , U ( a a ) | , X ) < d
a<6
102
Proof. This is the same as the last part of the proof of lemma 6.5.
Note that the bound here is slightly sharper than the one in lemma 6.5.
This bound we actually have to "pay for" although the early bound came
free thanks to lemma 6.4.
Lemma 6.6. If a x ,a 2 ... a n are arbitrary surreal numbers and
i> r 2 r n are rational then U(Ir-|aj)| _< |max (aj)|y 0 .
Proof. We know that for any real r i(r) _< u so |(r-j) X o for all
i. The lemma then follows from theorems 6.1 and 6.2. (Of course, all
that is used is a weakened form of these theorems which refer only to
cardinalities of lengths.)
We now have all we need to prove theorem 6.4. This may seem
strange since none of the lemmas have anything to do with polynomials of
odd degree! In fact, the proof of the theorem will not make direct
reference to such polynomials. The aspect of theorem 5.10 which is
crucial is that the exponents are rational linear combinations of the
given exponents, so that the same proof works for reciprocals.
Proof of Theorem 6.4. This follows easily from the lemmas by a kind of
back and forth argument.
Let a be an ordinal of infinite cardinal d which is an
upper bound to the lengths of all the coefficients. By lemmas 6.3 and
6.1 (used in that order) a is also an upper bound to the lengths of the
exponents occurring in the normal forms of the coefficients. Since
n(a) <_ (a) for any surreal number, a is also an upper bound to the
normal lengths of all the coefficients. By lemma 6.6 d is an upper
bound to the cardinalities of the lengths of the exponents occurring in
the normal form of the constructed root, d is also an upper bound to
the cardinality of the normal length of the constructed root. (By
elementary cardinal arithmetic, the cardinality of the set of all finite
rational linear combinations of an infinite set S has the same
cardinality as S.) By lemma 6.5a the cardinality of the length of the
constructed root is bounded above by d.
Remark. A proof using lemma 6.5 is also possible although our approach
seems simpler. For such a proof we need a strengthened form of lemma 6.6
103
which makes fuller use of theorems 6.1 and 6.2. This will enable us to
obtain an ordinal upper bound of cardinality d to the lengths of the
exponents occurring in the normal form of the constructed root. In fact
the least e number larger than a works.
Finally, as a culmination of the results of this chapter we
have shown that the subset of surreal numbers a such that |(a)| d
for any fixed infinite cardinal d is a real closed field. Since all
operations concerned depend on only finitely many elements the condition
(a) _< d may be replaced by (a) < d. [The latter formulation gives
more fields.] These are all "honest" fields since their carriers are
sets.
Not all subfields have the above form. In fact, the two most
well-known fields found in nature, the rationals and reals, both consist
of all surreals of finite length together with some but not all surreals
of length w.
The field of all surreals of countable length should be a
worthwhile object for further study.
104
7
This chapter i s s t i l l
a t a pioneering l e v e l .
I have a strong
A sequence
<a
>
and
sequences
and
<
if
is a s h u f f l e of the sequences
there e x i s t s t r i c t l y
JiJ2*##J6>
sucn
increasing
t n a t
l a r g e r than
j's
i s one of the
i's
or
not
A subshuffle
For o r d i n a l s i t i s known t h a t
a + 3, where
r e f e r s to
and
8.
shuffles
However, the s h u f f l e
the
3)
u> from
gives
w2+8
For example, i f
ct+3
and
3+a
o)+u)+5+3
a = OJ+5
give
w2+3
3 = w+3
u2+5
( i . e . we f i r s t take the
and
and
then the
respectively.
co from
then
followed by the r e s t of
a+3.
Theorem 7 . 1 .
of
and
and
b,
a+b
i s a subshuffle
b.
Proof,
(a)
finite,
using i n d u c t i o n .
&(a)
and
Jt(b)
are
105
106
107
108
as in the previous case we can now use the u minuses which a> contributes to a) s. (Since s is negative these w minuses are present
by sign reversal because of the u> pluses in to .)
(d) We now prove the result in general. If we allow the use of zero
a.
a.
coefficients, we may assume that x = I w "*r-j, y = ^ w 1s1- and
i <a
i <a
x+y = I co i(r-j+s-j). Now if no minuses were ignored in the sign
sequence formulas the result would be trivial. Juxtaposition of all the
subshuffles as we run through all i would lead to a subshuffle of x
and y. Furthermore, minuses "missing" in x+y cause no trouble; in
fact, this makes things easier. The difficulties occur only when minuses
in x or y are "missing." As in the proof of part (c) we must show
that other sources for these minuses exist so that x+y can still be
obtained as a subshuffle. (It is ironic that the main cases of difficulty occur when r-j+si = 0 for some i! Since the corresponding term
in x+y vanishes, the subshuffle requirement appears to be vacuous.
However, the issue of ignored minuses arises in later terms when the
above equality holds.)
We shall obtain x+y as a subshuffle of x and y by
a.
induction on a. We assume that I u> Mr-j+s-j) has been obtained
as a subshuffle of
a.
I w ir-j and
a.
I u> ">Sj and
show that
I w ">r-f and
I GO
1#
SJ
109
P -1
and
p-1
are non-dyadic
p-1
and
r.
p-1
+ s. .
P- I
is dyadic (in
p-1
the normal form of x+y. In any case the inductively defined subshuffle
makes use of only a finite number of pluses and minuses in the
contributions of r 0 . and s o
to x and y respectively. (If one
p-i
p-i
p-i
110
Ill
8
NUMBER THEORY
A BASIC RESULTS
This chapter overlaps chapter five in [1] to some extent. We
study a subring of the class of surreal numbers for which there are
results analogous to those in number theory. By the results of chapter
six the theory is essentially unchanged if one restricts oneself to
certain suitable subsets of the class of surreal numbers so that we deal
with "honest" rings.
Definition. A surreal number a is an integer if the exponents in the
normal form of a are all non-negative, and if a zero exponent occurs
then the real coefficient is an ordinary integer.
For example,
this) whereas
M2-^-!
is not an integer.
(3)
have opposite s i g n s .
a = {a-l}|{a+l}.
such t h a t
a(a)
and
112
Proof (1) =*> (2). This is immediate from the sign sequence formula. If
b > 0, to begins with an infinite number of pluses, and in <o r all
signs are repeated infinitely often. Fortunately, the ignoring of
minuses in sums does not cause any extra complications. For b = 0 we
know that an integer corresponds to a finite sequence of signs which are
all alike.
Not (1) = not (2). For b = 0, we know that if r is not an integer,
then r = (o r has length at most w and there is a change in sign
somewhere, i.e. we have not (2). If b < 0, then u> begins with a plus
followed by a minus. This almost proves not (2). What remains is to
consider the possibility that the minus may be ignored. Let b be the
first negative exponent occurring in the normal form of a. If the first
minus sign in b is ignored then there is a previous term with exponent
zero and non-dyadic coefficient so again we have not (2).
(1) =o (3). First, assume that all exponents are positive. Then the
result follows immediately by cofinality, where we use the definition if
there is no last term in the normal form and lemma 5.3 if there is a last
term. If a zero exponent does occur then it necessarily occurs in the
last term with an integer coefficient. The result then follows by
cofinality from lemma 5.10. (Actually, since 0 has no proper initial
segments we need only a simpler version of the lemma.)
Not (1) => not (3). Let the normal form consist of terms with negative
exponent and express a as b+c where c is the tail in the normal
form consisting of all terms with negative exponent. Then by the
lexicographical order a-1 < b < a+1. Clearly i[a) > lib) by the sign
sequence formula (in fact, b is a proper initial segment of a ) , so
a * {a-1}|{a+1}.
Now let us consider the other possibility, i.e a has the
,. b.
form
I a) Trj + r a where b-j > 0 and r a is not an integer. Then
i<a
if r a is replaced by either the largest integer less than r a or the
smallest integer greater than r a we obtain a number c satisfying
a-1 < c < a+1. One of these necessarily has smaller length than a, so
again a * {a-l}|{a+l}.
Each of the equivalent definitions has its own intrinsic
interest. (2) requires nothing more than the definition of surreal
numbers and thus can even be used in the beginning of chapter two. (3)
NUMBER THEORY
113
Theorem 8.3.
Proof.
a) c
-c
co a
a.
Let a = i w V j .
i<a
Suppose c = <j>|{O,a-j}. Then c < 0 and (V i) (c < a-j). Then
is an integer.
is also an integer.
Of course
a =
uTca
co"c
C.
Let
114
a divides
GO, i.e.
a
is an integer.
NUMBER THEORY
115
a in the form
1
begins with
w Or o [l+ w "is-j] it is
"b 1
a>
(as in dealings
a
r0
with ordinary power series). Therefore the normal form of 7- begins
a
with
a)
.
r0
that
b0 2. 1
contain any
Since
we
^ve
is an integer
b0 = 1.
b0 < 1.
further terms so =
where
r0
can't
is the reciprocal of an
a VQ
a has the form ~ for some integer
integer, i.e.
n. But then a
divides
. b.
Let a = 2.w l r i be an integer. Suppose that there exists
i
such that
Proof.
b 0 >> bj and
bj * 0. Then
116
is composite.
Let
a of the form
I u> irj
where
i<ct
bj is finite for all i. It is well known and easy to prove that every
finite number a can be expressed uniquely in the form r+e where r
is real and e is infinitesimal. (In the language of nonstandard
analysis r is called the standard part of a.) We now group together
all terms which have the same standard part. Specifically, corresponding
to every real number r we obtain an expression of the form
I w isj
i
NUMBER THEORY
117
Is wvT+w+l
a prime?
/2 but in
fact will use only the fact that /2 is irrational. Moreover, the proof
would probably be extendable to more general polynomials. The following
unsolved problem is found in [1].
l
1 i
Open Problem.
Is
OJ+W
+w
u> +1 a prime?
(Do not let an earlier remark confuse you. We had stated that if all the
exponents in a polynomial are rational multiples of a fixed number then
the polynomial is composite. The discussion there concerned itself with
polynomials of finite length.)
It is of some interest to classify the factors of u>. Among
the obvious factors are finite integers and numbers of the form w
where r < 1. It is clear from the proof of theorem 8.3 that any number
such as
I w ir-j where
i
a0
is infinitesimal (e.g., uP + 1) is a
118
factor. With caution one can take certain products of these such as
1
j_
(o)"2"-7)(u)s+l). O f course one cannot take all possible products; e.g.,
l-wi)(OJ^+1)
I
(u)
kind.
aQ-a!
Although we have no
theorems to the effect that an answer to the latter question will imply
an answer to any of our questions on primes, I believe that the
techniques of the proof will supply insights to study the classification
of primes.
Furthermore, the above question leads to algebraic questions
of interest which are independent of the theory of surreal numbers. I
believe that generalized power series in which exponents may be
irrational and lengths may be larger than
their own sake.
valuation theory, but as a whole such objects have been neglected in the
1iterature.
We close this chapter by proving a partial result concerning
possible factorizations of GO.
Theorem 8.7.
b.
d = I to is-j
i<oo
numbers.
and
cd where
a.
c = I to ir-j
i<oo
are integers, and all the a's and b's are real
Remark.
proof.
Also we are tacitly assuming that the r's and s's are distinct
from
Proof.
Suppose
a.
b.
w = ( I w 1r-,-)( I to 1 s 1 -).
i <to
By suitable normalization
i <to
t h i s may be e x p r e s s e d i n t h e f o r m
a.
b
1 = (1+ a i r - j ) ( i + u i S j )
i <w
o f course, not n e c e s s a r i l y
the same
a's, b's,
r's
and
with,
<OJ
s's.
NUMBER THEORY
replace
co
by
x .
Also,
exponents are p o s i t i v e ; ,
bounded above by
1.
where
(bj)
(a-,-)
length
and
119
since
and
are
integers
hence i n the n o r m a l i z a t i o n
Thus we f i n a l l y
have
are s t r i c t l y
w) sequences of r e a l
all
the
original
the exponents
1 = (l+x i r - j X l + ^ x
is-,-)
i n c r e a s i n g bounded o r d i n a r y
numbers.
We would l i k e
(i.e.
to show t h a t
are
( I t i s , of course,
of
this
and
be bounded.)
Let
write r
sup(aj) = l
m
a +m
= (x -*+Ix i
and
sup(bj) = m.
-ViKl+IxbiSi).
If
Then
I u irj
and
I u> ">s-f
with
( C J ) and
(dj)
sup I
r 0 = s 0 = 1.
6 = min[cj-Cj-1,(ci-dj:dj<c-|)].
product.
For convenience
both increasing
such that a l l
We have
[Since
CJ. Let
sup(cj) = sup(dj),
(dj:dj<cj)
6 > 0.
j .
m.
Hence
j > i.
is increasing.
l _> ( S l+ d m ,
i.e.
occur is f i n i t e .
Furthermore, i f
i.e.
CJ <^ A - 6 .
Hence
dm _> CJ .
6' = m i n [ c j + 1 - c j , ( d j - c j : d j > c j ) ] .
Thus i f we vary
Similarly, i f
n the set of
Hence
m's
which
dm > CJ we obtain
such that
dj
(i,n)
for a l l
d i s t i n c t from
Therefore
and any
(j,m)
j's
Hence
l _> 6' + c j ,
which occur is f i n i t e .
120
pairs (j,m) such that Cj + d m = c-j + d n for any n _> n 0 . Hence there
exists an n x such that for n j> n x the only possible pair (j,m) which
satisfies Cj + d m = c-j + d n must have d m = Cj and hence Cj = d n .
Since i was arbitrary to begin with and also since the same argument
can be applied to dj this shows that CJ = d-} for all i. Furthermore, because of cancellation we must have r-,*sn = -r n sj for n_> n l t
If we begin with i = 0, since r 0 = s 0 = 1 this tells us
that for n 2. "i> r n = ~ s n Now let i = ni Tnen f r m 2. n 2 we nave
r s
m n = " r n % - Since r n = - s n * 0 this gives r m = s m .
Hence if n i ^ m a x ( n 1 , n 2 ) then r m = s m and r m = - s m ; i.e.
s m = 0 which is a contradiction.
Unfortunately, the proof breaks down for lengths other than
u). The limited interest of the theorem is due to the fact that it is
the only result we have which gives some restriction to factorizations,
thus giving us minimal hope for the existence of non-trivial primes.
All our other results give circumstantial evidence against the existence
of such primes.
121
9
""V
122
on1
number. We abbreviate
n
e as e.
Then
0)
e1
= {a) ( l ) } | { o ) n ( e - l ) >
Which
i s
( u ) , ^ , ^
,"'l||e-l,ai
-1
, u)W
,}.
Note that the sequence which comprises the upper set is decreasing
although this might run counter to intuition. This is so because
a)c
<< w e = e whereas e-1 ~ e. Moreover, once we know that
OJ
< e-1 the decreasing nature of the sequence follows by induction.
e.
positive integers
Proof.
n.
Since
Now
e. , + 1
u) b
e. i
u) b =
~ e, , + 1 .
b
Hence
to
e
k
b
'+1
> e, , + 1 .
b
123
Similarly uT b "
^b" = e ,, ~ e ,,-l, hence GO b"
implies that w U b ' + 1 ) is an increasing function of n
a) (e ,,-1) is a decreasing function of n. Hence if m
n b
arbitrary positive integers we have
u ( +1) < (
( +1)
1
1
Sit1Ce
<o (1) 1s
n
certainly an increasing function of n we also have w (1) < w (e. ,,-1).
We now have exactly what we need to conclude that s is defined.
The fact that w (1) < e is immediate since w (1) occurs
among the lower elements. It is also immediate from the definition that
e < ^ ( e ,,-1) < e.,,-1 < e. . Similarly, e., < e. . The usual argument
using common initial segments shows that e. is strictly increasing.
Since the set {w n (1)}, {w n (e,b ,+1)} contains no maximum and
the set {OJ (e, ,,-1)} contains no minimum we know that
n b
mV
Vn V
by cofinality.
Corollary 9.1.
e, .
b = F|G will
Proof. This follows from the usual argument using the inverse cofinality
theorem and the cofinality theorem since e. is an increasing function.
We generalize the usual definition of epsilon numbers to
surreal numbers by defining an epsilon number to be any surreal number
such that a) = a. We thus have a class of epsilon numbers parametrized
by the surreal numbers.
Theorem 9.2. Any epsilon number between
initial segment.
Proof.
Suppose
e. , and
e. has
e.
as an
powers of
Similarly
But
But
0) (1 )
e = a) > a) n
that
= a) + 1 ( 1 ) .
a)
a) (e)
(e) =
= e
e
since
124
by
by induction.
induction.
Hence
Hence
e.
e.
ii s
an i n i t i a l
s ar
segment of
s a t i s f i e s the required i n e q u a l i t i e s .
Corollary 9 . 2 .
If
initial
segment of
Proof.
Let
is an i n i t i a l
segment of
then
is an
e .
b = B'|B"
lexicographical order
Hence
e. , < e
By the
< e.,,
so that the
Remark.
Note that the proof of theorem 9.2 makes no use of the f a c t that
the representation of
Theorem 9 . 3 .
Proof.
is canonical.
Let
e = F|G
i s of the form
a = C|D.
Now
e.
G.
e~ < e < e~
Since
and
for some
e.
Also l e t
and
the set of
by choice of
b.
D.
C < D.
Hence, by
theorem 9.2
e
is an i n i t i a l segment of e. On the other hand any
a
epsilon number of the form e
which is a proper i n i t i a l segment of e
must be contained in the set
erUen,
u
e
so i t cannot equal
e .
Hence
epsilon numbers.
Note. Induction is not used in the above proof. We do not need the fact
that every epsilon number in F U G has the form e a for some a.
B HIGHER ORDER FIXED POINTS
We now prove a general fixed point theorem.
Theorem 9.4. Let f be a function from surreal numbers to surreal
numbers satisfying the following:
125
126
[Condition (c) i s a p p l i -
Similarly
d D 2g(b ) < d.
Now i f
i t is a power of
a = f(a)
f(ja)
a
w.
f(2a).
a < 2a.
Therefore
f(a) = a.
C < range
is a fixed point of
Hence
Hence
then
f ( x ) < x.
an increasing function of
function of
if
function of
x,
then
is positive since
2a ~ a < f(2a).
x e DUygCB")
Therefore
Similarly,
x e CU2g(B')
then
f ( x ) > x and i f
x CU2g(B')
x D yg(B").
Since
fn(x)
is
and a decreasing
f (x)
is an increasing
we obtain
f(b)
<
VDu|g(B")].
This
is defined.
I t is immediate that
using common i n i t i a l segments.
g(b)
the lower terms have no maximum and the upper terms no minimum.
Hence
which is
g(b)
cofinality.
g(a)
point of
is clearly a power of
is a f o r t i o r i in the range of
So i f
Now suppose
x
G has no maximum.
g(G).
is a fixed point of
since a fixed
We claim that
then
y G such that
f g(y) > f [ 2 g ( x ) L
g(y) > f [2g(x)3.
Applying
Since
f [2g(G)]
is
f.
b = G|H
fn[2g(G)]}|{fn(D),fn[|g(H)]}.
w for a l l
g(y)
u>,
On the
where
x G.
y > x.
Then
g(y)
g(x),
A similar argument
g(b) = { f
127
Therefore by the c o f i n a l i t y
(C),g(G)}|{f
(D),g(H)}.
f (C)
Thus
and
theorem we
satisfies
f (D).
(Inciden-
n
tally, note that it is not necessary for n to run through the set of
all positive integers. By cofinality any subset containing arbitrarily
large integers will work as well.)
To complete the proof of the theorem we still must show that
every fixed point of f is in the image of g. We first show that the
analogue of theorem 9.2 is valid. In fact, let x be a fixed point
between g(B') and g(B"). Then c < x. Since g(B') < x it follows
that g(B') x, hence 2g(B') < x. Therefore f n [2g(B')] < f p (x) = x
and similarly f (C) < x. Since the same reasoning applies to the upper
elements, it follows from the definition that g(b) is an initial segment
of x. Also the analogue of corollary 9.2 as well as the remark
following the corollary remain valid. We now verify the analogue of
theorem 9.3 which is what we need. As in the earlier argument we let x
be a fixed point and let x = F|G be the canonical representation. Let
A be the set of all a such that g(a) e F and B the set of all b
such that g(b) e G. Then A < B. Let c = A|B. Then g(c) is an
initial segment of x. However, g(c) ^ g(A)Ug(B), hence g(c) f F(jG.
Thus g(c) is not a proper initial segment of x so g(c) = x.
Theorem 9.4 allows us to construct higher order fixed points
by induction. We begin with any f satisfying the hypothesis, e.g. w .
Suppose we call the fixed point function f . By induction there exist
fixed point functions {f } for all positive integers n where f +1
is obtained from f by the construction in the proof of the theorem.
We now indicate how the sequence can be extended to functions with
transfinite indices. For this purpose we extend theorem 9.4 to certain
ordinal sequences of functions.
Theorem 9.4a. Let f 0 be a function satisfying the hypotheses of
theorem 9.4. Then there exist functions f a for every ordinal a
satisfying the hypotheses and such that for a > 0, f a is onto the set
of all common fixed points of fg for 3 < a and satisfies condition
(c) with respect to the sets g(C) and g(D) where g runs through all
finite compositions of f$ for $ < a.
128
31m M * )
for all
sitions
> f
3i n h < x )
if
m > n
f 3 (x) < x
3. These give us the basic inequalities among all the compog for a fixed x.
Now if x e C or x has the form 2y where y is a common
fixed point of all f 3 then f 3 (x) > x for all 3 by our earlier
proof. Similarly if x e D or x has the form ^y where y is a
common fixed point then f 3 (x) < x for all 3. The above inequalities
now allow us to imitate the proof of theorem 9.4.
Incidentally, we must be careful in our reasoning with
inequalities. For example, assuming f 3 (x) > x for all 3, we easily
obtain f 3 f T (x) > f 3 (x). However, we do not obtain f 3 f T (x) > f Y (x). In
fact, if 3 < y we have equality.
We now define f a (b) inductively on b as follows.
fa(b) = {(gCCj^CZfafB'JlHlgfDl^E^-falB 11 )]}. In the case where
a = 6+1 the above inequalities show that we obtain cofinal subsets if we
129
consider only g's of the form f n . Thus the formula for f a (b) is
consistent with the one obtained in theorem 9.4 starting with the
function f$. For general a the proof proceeds as before. The first
problem that arises is that we need a g bearing the same relation to an
arbitrary pair (g x ,g 2 ) which f m + n bears to the pair (f m fn) ln the
proof of theorem 9.4. If gl = f 0 m hx and g 2 = f$2n h 2 , then
9
f m "'/o
is
iiiaxvpiP2'
130
03
(2)
a)
a
= aa
for a l l
such that
aa * 0
and
OJ = a 0 .
c a +a a +i
Note:
b a = ba
for a l l
a.
a0.
Since
<D
= aa.
w _> x
for ordinals
131
An ordinal
ca
(a a +l)w = a a .
replacing
(If
immediate evaluation of
e-i
132
constant from the beginning. For the sequence of a a 's we can use the
same argument we used for ordinals using the fact that the earlier a a 's
are constant. The same argument works for the b a 's. To see this note
that the above argument for ordinals works for any function f which
satisfies f(x) ^> x for all x (the argument makes no use of any
special properties of u> ) so that it applies in particular to
functions such as u
x.
We next obtain a formula for the sign sequence of e a . The
work is facilitated by the next lemma.
Lemma 9.1. Let f and g be strictly increasing maps from the surreal
numbers onto the same class S which preserve the initial segment
property. Then f = g.
Remark. A function f preserves the initial segment property if for all
a and b such that a is an initial segment of b, f(a) is an initial
segment of f(b). For example, corollary 9.2 says that the function e x
has this property.
Proof. As usual we use induction. Let b = B'|B" and suppose
f(x) = g(x) for all x e B'LJB11. Now g(b) e S. Therefore g(b) = f(c)
for some c. g(B') < g(b) < g(B"). Therefore f(B') < f(c) < f(B").
Since f is an increasing function B1 < c < B". Therefore b is an
initial segment of c thus f(b) is an initial segment of f(c) = g(b).
Similarly g(b) is an initial segment of f(b). Therefore f(b) = g(b).
Since e. is an increasing function which preserves the
initial segment property by corollary 9.2 it suffices because of theorem
9.5 to find a sign sequence formula using "good judgment."
Theorem 9.6.
e.
atn
Proof. Let f(a) be the function given by the above sign sequence rule.
It is immediate from theorem 9.5 that the above is an epsilon number for
all a. (Note that in general a>ea) = {if) = e w .) It is also clear
133
a a = d a - l.u.b. dg
where
is obtained as follows:
ath
block
of pluses.
(2)
ba
a"th
It is clear that
segment property.
(e^J^x = c where
block of minuses.
f
Hence
f maps larger
f(a) = e a .
Suppose that
and
powers of
0.
Let
and
is given
as follows:
(1)
The
a^n
string of pluses of
(2)
The
atn
string of minuses of
f(a)
f(a)
has length
has length
Proof.
Then
ba
g(d a ).
h(d a )b a
f(a) = a
where
iff
is a multiple of
In fact, it is
134
easier in the sense that we are given a "handicap," i.e. the analogue of
some of the results obtained on the way in the above proof are contained
here in the hypothesis.
f(a) = a if and only if the following equations are
satisfied:
(1) g(d a ) = a a .
(2) [h(d a )]b a = b a .
As before, equation (1) can be expressed as a conjunction of
two equations g(a a ) = a a and c a + a a = a a . The first condition says
that a a is a fixed point. This implies in particular that a a is a
power of a). As before, with the help of the first condition the second
condition can be replaced by a a > l.u.b. a$. (In the earlier proof we
3<a
did not need the full multiplicative absorbing property. It was enough
to absorb 2 multiplicatively.)
The equation [h(d a )]b a = b a is equivalent to the condition
[h(d a )] divides b a by the theory of ordinals. This completes the
proof.
Remarks. We do not require that the range of h consists only of powers
of OJ although this does occur in the main examples. Strictly speaking,
continuity of g is not used in the proof. Continuity guarantees fixed
points for g, hence fixed points for f.
Other variations of the theorem are possible and in fact one
will be considered later. At this time we are concerned specifically
with higher order epsilon numbers. We now state an analogue of theorem
9.6.
Theorem 9.8. Let f satisfy the hypotheses of theorems 9.4 and 9.7.
Then the formula for the sign sequence of the fixed point function f1
given by theorem 9.4 is as follows.
The a t n block of pluses in f'(a) has length g'(d a )
1
where g is the fixed point function of g in the theory of ordinals
and the a t n block of minuses has length [hg'(d a )] b a .
The proof is identical to that of the proof of theorem 9.6.
Note now that the conclusion says that f satisfies the hypothesis of
theorem 9.7 to make an induction possible. We let g' play the role of
(hg')w
and
the role of
h.
135
satisfies
the hypothesis of theorem 9.8 so we can use the theorem to determine the
sign sequence of
en
for f i n i t e
n.
corresponds to
fixed points of
{ei,[(ee j"]^}.
Then
e0
corresponds to
by d e f i n i t i o n , this simplifies to
ex
[e,e W ] so
consists of
[ e ^ U j ) " * 1.
u>
for a l l
necessarily a multiple of
8 < a
8 < a where
is a l i m i t ordinal is
01
w , hence a multiple of
is necessarily a multiple of
x* w
for a l l
oTx = x.
In f a c t , we l e t
a = 0 and
o
a , = u>
n+i
Then heuristically
such that
eo
"X
=y
and w
< a2
) is a strictly decreasing
given by
136
{a2r)}|{a2n+1}.
~"X
Proof.
Since
the equation
is a decreasing function of
= x
2n+3
"
a2
2n)
f(a
<
2n)
decreasing sequence.
g
is a s t r i c t l y
ao
< ao , .
2n
2n+i
a
2m -
is
x, hence
= a
2 n +
< a
2m+2n
f(x) = w
is a
is a strictly increasing
t h e r e f o r e
i s
s t r 1 c t 1 y
2n+1
Again, &l
i s a power of
co so
< a .
Since
I t follows t h a t for a r b i t r a r y
J
it is immediate that
Now
certainly a power of
> 0 = a . So by induction
a
From ao 1O > ao
we obtain
2n+2
2n
f(a
2m+2n+i -
ao
2m
and
ao ,
2n+i
we have
2n+r
2n}'{a2n+i}
-a = { - a 2
has
mean1n
+1>U"a2
}.
L e t
9-
{a
2n}'{a2n+i}*
Then
= (0,f(a2n+1)}|{f(a2n)}
g(a)
i s between
"convergence."
a
and
f(a)
and s t i l l
such
theorem 9.4.
that
a = G|H
-a = -H|-G
with
s a t i s f i e s the hypothesis of
hence
= {0,a>~ } | { u T } .
Finally,
Now suppose
no minimum.
Then
Therefore
c = {0}
D = {1}.
Recall t h a t for the e a r l i e r f i x e d points
f(a) = [C,f(G)]|[f(H)]
137
general form was used in the statement of theorem 9.4 with the present
example in mind.
At any rate, we now know that we have a fixed point function
1
g for g. Note that since g(x) = ff(x), any pair of the form (x,y)
where x is a fixed point of g and y = f(x) satisfies u> x = y
y
and
00=
x.
If
t h a t the i n d i c e s of
Theorem 9 . 1 0 .
Proof.
g'(-b)
i s a f i x e d p o i n t of
x
and
f(x)
so i s
f(x).
We now show
= f[g'(b)].
f(2x) f(x) so f(2x) < ~f(x). Also |f(x) < f(x); therefore
138
(Note the
reversal of sides since f(x) = u" x .) Since g(x) = ff(x) and f(0) = 1
the right-hand side is
{g n (O),fg n [|g'(b")]H{g n (l),fg n [2g'(b l )3}. Also -b = {-b"}|{-b-'} hence
g'(-b) = {g n (o),g n C2g 1 (-b")]}|{g n (i),g n [|g'(-b 1 )]}
= {gn(0),gn[2fg1(b")J}|{gn(l),gn[-|fg'(b1)]}
by the inductive hypothesis.
The inequalities we obtained earlier are just what are needed
to check mutual cofinality. First we have 2f(x) < f(jx) < ff[2f(x)]
for any fixed point, in particular for x of the form g'(b"). Since
g is an increasing function this gives
g n [2f(x)] < 9 n f(|x) < g n + 1 [2f(x)]. Similarly from the
inequality
g'(-b) = f[g'(b)].
fixed points.
In fact,
a =0,
ax = 1, a 2 =
and
a 3 = oo . Thus
139
by
minuses.
pluses,
consists of a plus
a^ consists of a 3
followed
a x = (+), a 2
an
with
c pluses
minuses, etc.
Theorem 9.11.
a
P
in
with
n _< 4
Hence
= u>
is obtained from
= GO n"1 by adding a
Because of the
minus in the exponent the signs are opposite to those in the last string
of a . We now separate the even and odd case to determine the length of
n
n-i
i=o di
the e x t r a s t r i n g .
d
= (c w)c .
a), i t
is clear that a l l
powers of
increasing.
fore
to.
cp = ^
and
d's
and
i s also a power of
w i t h the exception of
to
are
is
strictly
if
n _> 1 .
Thered
n x
c = to ~
n
and
140
= c 2.
There are now two ways to conclude, each of which is immediate. First {a 2 } and {a 2 + 1 > are cofinal sets in the canonical
representation of the proposed sign sequence. Thus the sequence is
{a 2 }|{a2
} = a. (It is possibly easier to go back to the definition
of F|G.) Second, one can ignore the individual a and note that the
above computation shows directly that the proposed sequence satisfies
-x
= x.
It may seem at first that this method can produce other
""X
141
( 3<a
c a = (w
Similarly
and
>
Jc
and. d.a
a
= ca
3<a
)da\
a" 2. w
il w
Similarly
Furthermore
Hence
c a = c a " >_ w
da
d a = d a " _> d a ' >_ a>
hence
thus
da
ca
is an epsilon number.
142
143
10 EXPONENTIATION
A GENERAL THEORY
As was mentioned in the introductory chapter, Kruskal
discovered that a theory of exponentiation for the surreal numbers is
possible. Taking advantage of his hints I discovered that an elegant
natural theory does exist, i.e. exp x can be defined in a uniform way
for all surreal numbers x and it has the properties that are expected
of an exponential function. Note that the function w x is not suitable
as an exponential function even though the theory in chapter five makes
this notation convenient. For example, it is certainly not onto since no
two numbers in the range have the same order of magnitude. (The word
"exponent" used in the past is a convenient abuse of language.)
Although we begin with a unified definition of exp x the
subject breaks up naturally into three cases.
(a) x is real,
(b) x is infinitesimal,
(c) x is purely infinite (i.e. all "exponents" in the normal form
of x are positive).
The unified form is somewhat complicated to deal with, whereas
the theory simplifies in each of the above cases for different reasons.
(Note that any surreal number is uniquely a sum of three numbers each of
which satisfies one of these cases.) Case (c) is the only one which is
worthy of a substantial discussion. In case (a) it suffices to show that
the unified definition is consistent with the usual one and in case (b)
that the unified definition is consistent with the result by formal
expansion in the spirit of chapter five.
Although the notation 'u)x" will continue to be used with
its original meaning there should be no danger of confusion since the
144
{l+u> + JJ } and
EXPONENTIATION
145
possible to split the subject into the three parts according to the
different cases. On the other hand, although the theory can be developed
by considering each case separately from the beginning, we prefer not to
do so since the subject then takes on an ad hoc form with definitions
that should really be theorems. In fact, the whole beauty of the subject
of surreal numbers from the very beginning lies in the possibility of
making uniform definitions which reduce to what is desired in various
special cases.
As in the past we use the canonical representation of an
element x to define exp x inductively as follows:
X
n
LX
xj2n+1
' r2n+i
>
Necessarily
X"f"V
= e . Since
146
X V
= e e , it is a
Since
for
and
[ x ] 2 n + 1 < ex,
clearly
[-x]2n+1[x]2n+1 < 1
x > 0.
In both inequalities
2n+2
terms occurring in the l e f t and not on the r i g h t , and a l l such terms have
positive coefficients.
Also, [-e]
+It
^ 2 +1
<
"^ ^ o r
e >
^'
~^1S
follows from the fact that on the left-hand side a l l terms involving
beyond
e = 1 for
c o e f f i c i e n t of
i 2n+l
is negative.
Furthermore
s1nce
2n+i
is positive so is [-(r+e)]
for
n 2. n
so is [-(r+e)]
2n+i
. Similarly, if [-r]
is increasing
fore
r2n+i
[-(r+e)]
2n+i
be an no for which
"
EXPONENTIATION
147
for a l l
and y
x,
Since
[ x ] 2 n + 1 , y] 2 n + 1 >
[x]
and
[x+yl2m+1
is a s t r i c t l y increasing function of
i f necessary).
r+e
larger
then follow
[ - ( r + e ) ] 2 n + 1 [ r + e ] 2 n + 1 < 1.
for negative x.
+1
For the sake of symmetry between the cases for positive and negative x
and for the sake of uniformity, a l l of our results relating
and
[x+y]
[x] , [y]
numbers, i . e .
given in
n.
Proof.
v r
is defined for a l l
tive integers
exp x
n.
Also
x.
and ex
exp y [ x - y ]
Furthermore, i f
< exp x
x <y
for a l l posi-
exp x > 0.
Ayr.
\/
LX - x j n LX - x J 2 n + 1
that the lower terms are below the upper terms, so that exp x
The inequalities involving
is defined.
m = n.
Choose
so that
[x] n
i.e.
[x"-x']
< exp x".
>_ [ x " - x ] [ x - x 1 ] .
Therefore
< T^Tr~-4-.
L X XJ
Since
m
is an increasing function of n, the result for general m and n
148
In fact,
, < Lx "-xJ P *
. 13^x7"
max(m,n)
n
Exactly the same argument shows that
exp x"[x-x"L m+1 < r ? x p n x ' (In fact, we deliberately ignored the
Lx ~xj
fact that
m and
< r ?xpx
Lx
[x-x1]
[x'-x]
We need
"XJ2m+i
where
x ' and
' J
x ' are two arbitrary lower elements not necessarily equal. It turns
out that we can use a device which is similar to what we used for
different subscripts m and n which will also be important later; in
fact, it is the same cofinality idea which we used, for example, in our
development of multiplication in chapter three.
Specifically, we desire to prove the following: Let
x
i'
< X
choose
2"
Then
Vm
n so that
^n!exP
X X
1 'C
Similarly, we
exp x 2 '
exp x x '
have Vm 3n [r , , < , , ,
1. Hence, if x ' = max(x \ x ' ) ,
Lx 2 -xj - L X X " x J 2 m + 1
3
1 2
then for a sufficiently high p we have
exp x 3 '
exp x 2 '
exp x '[x-x ' ] m <_ exp x '[x-x '] < r .
<_ r . -i
LX
3 "XJ2p+i
LX
2 "XJ2n+i
1"t
and
Vm 3n r
exp x
""x1ll]2
+1
ex
EXPONENTIATION
so that we obtain
149
<
exp x 2 "
H
x 2 ' in
exp x.
Since
certainly
exp x > 0.
The inequality
exp x [ y - x ]
< exp y
or
x < z <y
where
for
exp y [ x - y ]
and
y.
Now
Hence
,, < exp x.
Corollary 1 0 . 1 .
x <y
Vn 3 p [ z - x ] p [ y - z ] p > [ y - x ] R .
exp y > exp z [ y - z ]
i s an i n i t i a l
Proof.
is
c o f i n a l i t y r e s u l t s ; e.g. i f
Let
exp x
> exp x [ y - x ] .
Similarly
For
exponential f u n c t i o n .
Theorem 10.2.
Proof.
exp r = e
Note f i r s t that
for real
r.
Hence we have
, , -^r^
1n
2n+i
e .
{e - e } | { e +e}
positive real.
We
use
Recall that
e , like
where
For example,
e
e
is
[r-r1]
< e
= er.
If
r > 0
then
r1,
" rJ 2n+i
n>0
+~ L " rJ 2n+i
150
as a possible value of
r".
e .
Similarly, if
proof.
Remark.
upper terms in
exp r
0.
contribute to
needed to compute
1
use of
= 0
exp x.
For infinitesimal
this case the formal sequence
of chapter five.
spirit for real
exp(oo2).]
1+x + |y + ...
In
ex,
this time
exp x = e x
r+e
where
for infinitesimal
r
is real and
x.
More generally, if
is infinitesimal, then
exp(r+e) = e e .
Proof.
the form
r+e
where
is real and
is infinitesimal.
[r+e]
e * 0.
Since we
r+6
exp x
where
(Note that
p
upper
segments of the above form; e.g., 1+
r+e
might have no
as upper segments, so that this does not follow immediately from our
earlier cofinality results.)
segment of
r+e
is an initial
is positive or negative,
EXPONENTIATION
151
have real part larger than e . On the other hand, the contributions of
r
r
r such as e [e] all have real part e . Since the same reasoning
applies to upper terms this proves the desired cofinality.
Since all the upper and lower terms remaining in the reprer6
sentation of exp x now have essentially the form e e x[6 ] it remains
to study expressions such as e l[& ] n .
a^ = min(ai .)
J
then
b 2.
na
a.
Now y = I u> V-f
i<3
j=1
'
is an i n i t i a l segment
152
a.
I m Tr-j and hence participates in obtaining upper and lower terms
i<a
to exp x. Let z = x-y = I w "irj.
i>3
Suppose first that z > 0. Then exp y[z] n is a lower term
in the representation of exp x. Now
of
(exp y)e z - exp y[z] n = (exp y ) | " ^ + ^ , +"!. The first term in exp y
has exponent 0. Hence the first exponent in the latter expression is
(n+Da^ which is smaller than na$. (As a technicality, in case all
a-j . are zero, we can choose 3 to be 1 so that a^ < 0.) Hence
j
(exp y)e and exp y[z] n agree in all terms up to and including the
terms with exponent na$, a fortiori the terms with exponent b.
For z < 0 the argument is identical except for the fact
that we consider only odd n.
b.
Also note that in general if two expressions I w is-j agree
i<B
in all terms up to and including exponent c, then so do their reciprocals
since it is clear that by formally taking inverses terms with given
exponents cannot give rise to terms with earlier exponents. Therefore
the above argument applies equally well to expressions such as
~
and LZJ
fSU.
n
If y = r+6 then by the inductive hypothesis
exp y = (exp r)e . Hence
7
V~r
J
V~Y*~^'Z.
J
EXPONENTIATION
153
exp n(l+x) = 1+x. This identity is valid for formal series, hence it is
applicable to infinitesimal surreal numbers, i.e. it is possible to
define a log function in the natural way. For any infinitesimal
exp n(l+e) = 1+e. This shows in particular that the function x maps
the infinitesimals onto the class of all surreal numbers of the form 1+e
where e is infinitesimal. Combining this with the behaviour of exp
for ordinary real numbers, we see that exp maps the finite numbers into
the class of all positive finite non-infinitesimal numbers.
This essentially completes the theory for the finite case.
There were no surprises. On the contrary, the fact that the unified
definition agrees with what is expected gives some philosophical
justification for using it for the infinite case where we lack an a
priori alternative. This is the realm where exotic results occur and, as
mentioned earlier, the case of main interest.
From now on we would like to limit our study to the case
where x is purely infinite, i.e. where all exponents in the normal form
of x are positive. However, before doing so we need a "piecing
together" result.
Theorem 10.4. If x = y+z where y
finite, then exp x = exp y exp z.
Proof. We again use induction.
z is
I u> irj,
i<a
then y necessarily has the form
I u ir-j and z the form I u> irj.
i<3
i>0
Hence y is an initial segment of x. We use an argument similar to the
one used in the proof of theorem 10.3 to show that in the representation
of exp x we can restrict ourselves to initial segments of the form
y+u where u is finite.
Regardless of whether z is positive or negative, terms containing exp y contribute to both upper and lower terms in exp(y+z).
Now suppose x-x
is positive infinite. Consider a term exp x [x-x ] .
Note that [x-x ] x [x-x ] by elementary reasoning with orders of
magnitude. Therefore exp x > exp x 1 [x-x 1 ] +1 exp x ^ x - x ^ . A
similar inequality applies if x-x
is negative infinite. On the other
hand, y contributes terms such as exp y[z] or exp y[z]
if z
is negative. Now [z] p _> 1 if z is positive. If z is negative we
154
EXPONENTIATION
F i r s t , since
and
155
0
i s a lower element of
exp y
then
exp y F
exp y G appear among the terms in the formula for the product
exp y exp z.
For convenience, l e t
used in the d e f i n i t i o n .
segment
of
Since
satisfies
H|K
y
be the representation of
|y-z|
is i n f i n i t e .
exp y [ y - z ] .
exp z [ y - z ]
exp y
< exp y.
If
z <y
we have a
As we already noted,
Similarly i f
z >y
we have
yj
and k e K ^ exp y k.
- [z-u]
nor -r-T
rT
is infinitesimal. Also,
L
J
2n+i
2n+3
exp(z-u) - [z-u] n _> [ z - u ] n + 1 - [z-u] n hence exp[z-u] - [z-u] n is
not infinitesimal.
Similarly
r
L
-i
2n+i
exp[z-u]
is not infinitesimal.
Since all the expressions considered are f i n i t e , we obtain by multiplication with exp u that
exp u ~ exp u[z-u] +1 - exp u[z-u] ~ exp u[exp(z-u)] - exp u[z-u] ~
exp u
]
[u z]
" 2n + i
expu
[
exp u exp(z-u) =
expu
course
, _ v = exp z.
z-u ~ oj ,
z-u.
Then
If
z-u
i.e.
[z-u]
Suppose
~ u>
By formal multiplicaion
" T^
156
^ -
exp z.
EXPONENTIATION
157
}.
\X ~X/
158
exp y ( x - y )
n+1
1 11
< exp y ( x - y )
n+1
A similar argument
We state the f i n a l s i m p l i f i c a t i o n as
theorem 10.5.
Theorem 10.5.
For purely i n f i n i t e x
is,
x1
x.
As an example consider
i n f i n i t e lower segment.
exp x
exp u>.
Therefore
of course, d i f f e r e n t from
Since w
to
as defined
exp (x in w)
A similar r e s u l t applies i f
n.
is expressed in a form
y e FUG=^|x-y|
of an element in
an element
y-z
is i n f i n i t e .
FUG
need not be in
FUG.
is f i n i t e . ]
s t a r t with using
F and
such that
exp x we can
F|G
Suppose
FUG
at
F for the
In future
x, y, and
w i l l refer to purely i n f i n i t e
Assume
and y
are p o s i t i v e ;
,
2n
n n
(x+y)X
> x y
x
n+i n+i v /
n
y
> (x+y)N .
("jxV"""
( )x y is one of the
satisfies
EXPONENTIATION
159
("jx 1 ^"" 1 x n + 1 y n + 1 . Although the latter inequality may be strengthened there is not much point to i t for our purpose. ( I t is much like
showing that one can use -JTT instead of TT in analysis!)
Recall that by theorem 10.1, i f x < y, then
exp x[y-x] n < exp y. Hence exp x(y-x) < exp x[y-x] n + 1 < exp y. Thus
we may replace the term [y-x]
by (y-x) n in the inequality stated as
part of the above theorem. Since y-x is infinite, this shows
immediately that x < y + exp x exp y. As we recall, the function OJX
has this property, but there is a crucial distinction, namely for </
this applies to the whole domain, but to exp x this applies only to the
subclass of purely infinite numbers. In fact, i f y-x is finite then
x
y
e ~ e . At any rate, because of the above property the proof of the
addition formula for exp x resembles the proof of the corresponding
formula for
u .
Theorem 10.6.
and s i m i l a r l y for
y.
(x"-x) n
hypothesis, a typical lower term simplifies to exp x1 exp y(x-x')
exp X
and
160
e xp x.
(y"-y) m (x"-x) n
(y"-y)
Similarly
exp y.
(x"-x) r
(y"-y) m (x"-x) n
Therefore the above expression is negative, so that for cofinality it
suffices to take the term zero.
Finally we must consider a term such as
exp x"
exp x exp y'(y-y') m + exp y exp x n - exp y'(y-y')
Now
M
n
(x -x)
(x"-x) n
exp y exp x
Therefore certainly
exp y,
l)m
(x"-x) n
(x n -x) n
exp x exp y'fy-y1)111 + exp y
(xH-x)n
L > \ exp y
^
(x M -x) n ~~ *
(x"-x)
(xM-x) n+i
y. This
Proof. This follows immediately from theorem 10.6 with the help of
theorem 10.4 and the corresponding result for finite x and y.
EXPONENTIATION
161
GO.
Proof. We already know that every lower term such as exp x'(x-x)
satisfies exp x'(x-x') exp x and similarly every upper term such
satisfies exp x exD
^ x" Hence every number y of the
(x"-x) n
same order of magnitude as x also satisfies
exp x'(x-x') n < y < e x P x Therefore x is an initial segment of y.
(x"-x) n
So by theorem 5.3 x is a power of w.
The above result is especially striking if one recalls that
purely infinite elements are canonical elements in an additive order of
magnitude. The theorem says that such elements are mapped into canonical
elements in a multiplicative order of magnitude. This is further
heuristic evidence that exp behaves the way an exponential should.
We now define in x. This definition will be made only for
x of the form co .
b"-b
Definition.
n(w ) = {in{m
)+n, n(w
) - OJ
}|{n(w
)-n,
b-b1
n(w
) + a)
where
n(a) ).
b contribute to upper
This is essential.
For
example, l e t
b = 1.
Then
in x
Since
exp w = w
with
exp x.
i_
l
in to = {n}|{u> } = w .
With no
this
On the
This
i_
i
= {O}|{-}.
w^ = {0,n}|{u) n }.
Then
Hence by r e s t r i c t i n g ourselves to
7F
exp (to") = {(exp O)a) u) }H ex P (a) 1 ) }.
{ n *]m
i n f i n i t e parts
uo
to .
_L
< (u)n)m = /
i -
infinite i f
(in f a c t - ).
n
H
> a)"
> a).
Now
a) = {n}|t|).
Since
to03 < oi
and
= (to )
(J -^)"
exp(gj n )
11 A 1
Also
-to J
n
_
simply
162
= to .
So
is
s a t i s f i e d and
exp(co ) = to.
We prefer to
know t h a t
in x
Theorem 10.8.
is
We
defined!
in x
is defined.
Furthermore, i f
a > b then
a-b
&n(to ) - n(to )
a l l positive integers n.
In particular, i f
a > 0
then
for
in{u )
is
that the lower terms are really less than the upper terms. Since
h"
h1
h1
h"
n(to ) - Jtn(to ) is i n f i n i t e , then certainly n(to )+n < n(io )-m.
b-b1
Since
to
is i n f i n i t e then certainly
an(u>
b-b'
) + to m
However, such an inequality is not adequate for our purpose since we must
consider two arbitrary i n i t i a l segments of
(The use of the same
incomplete p r o o f . )
b not necessarily a l i k e .
b-d
b ' , which is a natural mistake, leads to an
So we must show t h a t
m
0)
EXPONENTIATION
where
and
163
segments of
b.
This is
c-d
if
d 2. c.
If
c > d
we have
trivial
b-d
< to
-n.
Hence
b-d
&n(ioc)+n < ir\{ud)
+ to
segments
and
c-b
d
of
n(toC) - a>
As before this is
c-d
trivial
if
c < d.
If
c > d
we have
c-b
< u>
-n
"~
n(<oC) - u> m
hence
< n(to ) - n .
b"-b
bb
n(to
in
b"-b
Now to 2 n
n.
(o n
(w
) - n(w
n(co
b"-b b-b' 1
= [(o
b"-b
<i[u>
bb-b'
n(<o
< n(<o
) - to
b"-b'
ii
) << w
+ to
Since
to
by the arithmetico-
b -b
r = max(m,n)
/ b \
n (to
we have
/bv
)
in (to
to
b"-b
n
)
b"-b
_< to
+ ui
then
/ b \
in (to
Hence
b"-b
n
so
b-b'
+ to
b-b'
m
^ (0
Hence
is a decreasing function of
for
for a l l
b-b'
+ a) n
) + to
b-b'
m
/ b \
to
^ &n(to
+ to
b-c
*,n(to b )-n(to a ) < a)
and
c-a
+ to
b-c
.
Now
to
Then
b-a
to
c-a
and
to
b-a
to
Therefore
c-a b-a
+ to n < w n .
b-a
infinite numbers.
Theorem 10.8.
Proof.
Let
n(co )
)+n
be the representation of
We claim that
we get
) - oo
terms.
given by the
c e F =*> (3deF)(d>x+l)
d = n(oj
) + (n+1)
If
and i f
and
c has the form
has the form
b"-b
second case
b.
Jin(u) )
b"-b
n(u>
is the inverse
c e G =* (3deG)(d<c-l).
n(u)
Jln(x)
exp
F|G
definition.
164
is i n f i n i t e . )
By induction
d = n(u>
) - oo n
(In the
r.
n(oo )+r
Hence
n(a> )
n(a> )
satisfies
is an i n i t i a l segment
We are now ready for the theorem which completes the basic
theory.
Theorem 10.9.
Proof.
exp
nn(oo ) = to .
,
n(oo ) = F|G
G = {iin(u>
b-b
n
h1
h"
)-n, n(to
c e F = n(to )-c
and
)+n,
Jtn(oob ) - to
"~b
n
and
) + to
c e F = (3deF)(d>c+l)
F = {n(oo b
where
}.
c e G => (ideG)(d<c-l).
is infinite
and
d e G=^>d-iin(to )
Thus
by our earlier remarks we may use the infinite parts of the elements in
F
and
G to compute
positive power of
exp n(to ).
.i
n(oo
and
n(to
) - to
, and s i m i l a r l y those
EXPONENTIATION
165
b-b1
in
n(aj
and &n(u>
0
Therefore the
exp[fcn(o) ) ]
have
expUn(u>b ) ] U n ( a j b ) - n U b ) ] n and
the form
bM-b
exp[<w(u
) + w
)-u
b"-b
K
l{nn(u) )-[itn(co
K"
)-u> n
^ " ^ " ^
and
xp[tn(^)t. "
u>
which
{0,A}|{a)b"s}.
LM
LII
)]
J2
b\
* bA1n
>
2
b -b
b
w
(i
b"-b
)" \
iim
bM-b
- [*n(u)b ) - a) n
expU
)
b"-b
11
)m
s1nCe
b"-b
,n+m
""
b.
^=
^ b -
= l b
Similarly
b-b'
]
explwi(co ':
to
b<
b-b
b-t
[Un(to b Vto
b-b1
tob'(o)
b-b
la)
*n(.b)] m
'
(0
^t n ( t o b ' ] l+co
b
b-b'
=
(0
0)
This
b-b'
to
- ^r l ( o ) D ]
n ~\
explco
b-b1
r n >
verifies
166
k im
o) r
expUn(to
)]Un((o -*,n(to
)] .
So we may take
is
The f i r s t factor is
n(to )
<o .
Hence ,n(to ) - n(w )
L I
larger than
o> r
to .
L II
since
i s , of course, f i n i t e .
to .
i$
Similarly
o> s is
Here we consider
(Again
1nfinl-tes1mal#
Un(co D )-*n(coD ) ] n
h"
we may use
complete.
n = 1.)
w i'bj with
i<a
a
o> ( b o +
b0 > 0
c.
"'b)
l
This is in turn the class of all positive surreals. Thus we have
Corollary 10.3. exp x is onto the class of all positive numbers.
We now know that exp x behaves as it should with respect to
the main properties expected of an exponential function. Thus we have
good philosophical grounds in designating our function as "the
EXPONENTIATION
exponential f u n c t i o n . "
167
x =
non-trivial,
i.e.
i s real or i n f i n i t e s i m a l .
Proof.
If
a > 0
exp(w a )
then
We do not use i n d u c t i o n ! !
as {0,a>a r}|{u)
exp(w ).
<*> .
So
b
ww .
a > 0
results.
i s understood.
infinite
Hence in
all
s}
all
Thus i f
is
a"
{0,(exp Q)(a) ) ,[exp(a) 'r)](u) -a) r) H{ /, a, ^ s)
a \}.
n We now use mutual
(a) S-U) )
a n
a>
exp ( aj
exp((oa)
simplifies
to
to
possible value of a 1
a
"!-) =
^
exp(a)
,,s) <
( a s ~\
a)
exp (to
is a
> to ".
-
s).
Thus
Hence t h e u p p e r
So
t e r m s may be
limited
exp(u) ) s i m p l i f i e s f u r t h e r
to
}|{exp(o)
na
s )
,,
na
exp[o) j)
r)]o)
Since in general
n+1
exp(a)
Since
na
168
Let
exp (to
s)}.
) = a>
and
exp(u)
) = w
Then
exp((o )
can be expressed as
b'r
na , , b"s
the above.
b'r
/n
{0,<O
na.., b'V
0)
{b'r+na}
} |{0)
b = {b'r+na}|{b s}.
,n
b'r+na,,,
} = {0,0)
exp x
and
to
b"s,
}|{U)
M
theorem f o r both
We may j u s t as well w r i t e t h i s as
specifically
}.
where
since we began with something which was defined and used mutual
c o f i n a l i t y at every step, so we already know that to
Consider the representation
b'r+na < c < b"s
i s a power of
hence
w.
Let
b = {0,b'r+na}|{b s}.
< to
If
b is an i n i t i a l segment of
b = toC. Then
b~ c
c.
then
Therefore
a = ww .
ind a as follows:
ind a = b 0 .
If
g(a).
For this
EXPONENTIATION
surreal
such t h a t
169
a ~ u)X.
exp(uj ) = /
( x )
f ( 0 ) = 0.
for
x > 0
so t h a t
f ( a ) = { r f ( a ' ) + n a } | { s f (a 1 1 )}.
case
f(a')
= 0,
by c o f i n a l i t y .
0
f(a) = u>g(a).
x = 0
we set
notation
a1
is
in which
f ( a ) = { 0 , r f (a )+na} | { s f (a 11 )}
we may c e r t a i n l y w r i t e
Now among a l l
If
a1
we single out
and w r i t e t h i s as
f(a) = {0,na,rf(a')+na}|{sf(a11)}.
a1
j u s t as there may be no
a"
altogether.
We may now replace
x > 0.
i.e.
So we obtain
a ~ u)C.
Then
equicofinal with
ru
f(a)
Since
f ( a ) = w9
f(x)
by
f(a) = {0,na,ruj9(a
na
or
}
+na}| {su) 9 ( a
is equicofinal with
+ nw
which i s v a l i d for
nwC
}.
and
Let
rou9
which i s in turn e q u i c o f i n a l
is
with
g(a).
In f a c t ,
{c,g(a )}|{g(a")}
by c o f i n a l i t y .
simple-looking
gives r i s e to t r i c k y phenomena.
prevents
+na
={0)nuc,numax[9(a')'c]}|{soo9(a")}.
formula.
c = ind a,
In
x.
fact,
among the
g(x) as a theorem.
a
u> g ( a )
Theorem 10.11.
I f a i s p o s i t i v e exp(w ) has the form w
where
g(a) = { c , g ( a ' ) } | { g ( a " ) }
and c i s such t h a t a ~ u> . [ I t i s under-
stood t h a t
a1.]
L e t us a p p l y theorem 1 0 . 1 1 t o o b t a i n
evaluate
exp a>.
Since
w = w
c = 0.
I n t h i s case we r e g a r d
and
i s separated o u t .
we have
A
Hence
and
170
a = 1.
A"
as
exp(u) W ).
Since
F i r s t we
w = 1
<>
j since
g ( l ) = {0}|<}> = 1 .
we have
1 = {0}|<j>
Thus
g(l) = 1.
Therefore
exp u = w
=w
g().
Assume
Then
= q f { 0 } | { ( 4 : ) } l = lind ( I - ) } | { g ( i - n = {0} I { - U = 4 r r r .
fore
g(^) = g [ { 0 } | { - ~ } ] = { i n d ^ K g ^ ) }
= {-1>|{|^> = 0.
There-
Hence
o
expdo^) = OJW =w
power of
g.
For this
we need a lemma which states that the uniformity theorem is valid for the
function
n x.
we have no lemmas in this chapter so far and since we are thus already
top heavy with theorems we leave this as a lemma.
interested primarily in the function
Besides, we are
n x
exp x, e.g.
representation of
in the form
a = F|G
n x.
n(a) )
by using any
1
x
/b"
) + n , n(co
If
b"-b
n 1
) - oo
then
i r /
b " x
b-b1
n i
}|{n(a)
) - n , n(w )+io
x
b1
n(w )+n > n(w )+n and
}.
EXPONENTIATION
b-x
n ,
/ xx
icn v a) ; + 03
j + 03
y-b
n
/ Vv
b-b 1
n
^ ion 103
If
n(co
, b\
171
>
n(o3
)+n
and
b"-b
n
/ b"
03
Jtn(o3y)+n <_ Z n U
then
- 03
03 n
First
- 03 n
> 03
since
b-x
03 n
or 03 n
x-b'
03
> n(oj )
- n(o3
b"-b
Similarly
O 3
).
- o >
>
Jcnio3
Hence
> o 3
n(o3 ) + a>
b-b'
< nn(o3
) + 03
n(o3b ) - Jtn(o3 y ).
>
Hence
b"-b
n
- 03
For a l l
a,
a
>m(o3W )
is a power of
03.
the r e p r e s e n t a t i o n
{0,w
}|{u>
Hence
e
} o f 03 .
a"
a'
a
^ )
11
c 5\
03
j-n,
/ 0 \
x,n{03 j+03
an integer we have
n(o3W
03
We now simplify.
0, thus getting
)+n,
- U
a"
a
r
z' w
x,n[o3
v >
' j + 03
- co
a
-03
As a s t a r t we can get r i d of
Jin(03 ) , which
Also
b"-y
/ bMx
03
Theorem 10.12.
Proof.
y-b
y-b
n
/ Y\
is
has a
b-x
>inio3
03
x-b'
in x
i n f i n i t e so
rn(o3
a'
)
is
exp x
that i f
rJm(o3W
)+n.
Note
172
a" _ ^a
which occurs as an exponent.
a"
a" a
a
dearly - ^ < " *"" - Since i t n ^ s) < u n
this shows that all
the lower terms in the third group are negative and hence can be discarded because of the presence of n. So the set of lower terms have
a
been simplified to {n,rn(a)a) )}. Similarly for the upper terms we
may assume that s is dyadic and drop n so that the first group of
a1
terms simplify to sln^u* ). The second group of terms have the form
a
a a
a a
^n~
- 'r
'r -a
a) . Now consider -^
which occurs as an exponent. -p- >
a1
W r
Since n(a)
) > 0 we may discard the third troup of terms by
cofinality because of the second group. Thus the set of upper terms
a
a
a
II
GJ__
0)
0)
a
n(u)W ) as a surreal
Note that h(a) > 0 for all a. This shows that the range
of g consists of the class of all surreal numbers. Thus exp x
induces a map from the class of positive surreals onto the class of all
surreals! This sounds like the opposite of what an exponential function
does but recall that the map goes from exponents to iterated exponents so
that there is nothing unreasonable about that, although the existence of
such a g is a surprising phenomenon.
The uniformity theorem is valid for g and h. This is
clear since x _< y * ind x _< ind y. We are primarily interested in g
and it is thus convenient to know that we may use any representation of
b to compute g(b) from the formula given by theorem 10.11.
We next prove that exp x has a kind of generalized
linearity property. This will enable us to reduce the study of exp x
for purely infinite x entirely to the study of the function g.
EXPONENTIATION
Theorem 10.13.
exp
If
173
y = a>
then
i r-j.
i
Proof.
Since
and u>
r-j.
= w O3J the r e s u l t is
rj
r e s u l t for a r b i t r a r y sums.
war
Consider
expressed as
where
is real.
toar
Then
may be
r1
and
r"
u> r 1
and
0.
wV.
Since
Then
( A V r
= {0,03
na11 r a)
} | {{o3
o
03
0
3
r"
}|{o3
-nai
03
g(a)
naJU^
r"9(a)
Now
general.
tt9(a)
i.e.
> (J-.ja,
i.e.
g(a)
r
O3W
{or
/ v
o ) 9 ( a j > na
thus
in
r'}|{o3
hence
This proves
representation of
1 a
na}.
> O3na
Now 039
exp(o3 r ) .
r"}.
can be expressed as
Certainly
g(a)
u
g(a)
Y
= {0,
03
g(a)
ry ' " | r O 3
g(a)
'
{03
and
g(a)
u
v l
so
a.
For non-limit
174
exp x
ordinary a d d i t i o n .
and
03
where
y < a
and
is positive.
Hence
)(o3 V ' ) ~ n }
2,0) T r j
Y<i<a
e1
is such that
and s i m i l a r l y f o r
are i n f i n i t e s i m a l .
g ( a
where
. . a .
03=o)I+
Ie-e"I
g ( a
i .
e".
|e-e'|
and
Also
g(a
Y}
9(ai)
9(a Y )
Since the lower terms have no maximum and the upper terms no minimum we
have
03
= {0,03 } | {03 }
terms and
where
exp( I 03 "ir-j)
does give w
> na
Y > na
n. Since
Y
03
u
where
terms of
exp( I 03 1 r 1 -)
is
d *Y
exp( [ u ^ j - m
9( an -)
9(aY )
1 CJ rj-03
e + naY
ii d *\/
eju
y =
exp x.
EXPONENTIATION
175
then
g(a.)
g(a Y )
g(a Y )e
), a) ' r^-a)
e + a)
y
v
.
which i s less than
g(a.)
\ w ! rj
by
A similar
form
to
w.
>_ 1.
g.
exp x
{ind(a),g(a')}|{g(a")}
where
c such that
a ~ </
Thus, as has been promised e a r l i e r , the subject has in one respect become
greatly s i m p l i f i e d .
g behaves in t r i c k y i n t e r e s t i n g
that
g ( l ) = 1.
g for ordinals.
We know already
g(x) = x
At least in
e.
By d e f i n i t i o n
</ = e hence
ind e = e.
g(e) > e.
exp x.
F i r s t , we have
for positive
e
g.
much more e x p l i c i t ,
176
in some of the
sequence of
g(x)
though in a
g(a)
i s an o r d i n a l .
Theorem 10.14.
If
the i n e q u a l i t y
e _< a < e + w
i s an o r d i n a l , then
g(a) = a
unless
satisfies
e, in which case
g(a) = a+1.
Proof.
a"
If
i s an ordinal
i s empty.
Hence
g(a)
g(a)
F i r s t suppose t h a t
number.
Then
ind a < a.
{ i n d a, a'}|<j>.
Since
Now l e t
{ i n d a, g(a')}|<j>
this is
since
We now use i n d u c t i o n .
ind a < a
a
i s an o r d i n a l .
epsilon
g(a) =
by the c o f i n a l i t y
theorem.
Then g(a) =
g(a+n)
Hence
we obtain
Next we determine
have so f a r .
a _< e a
where
ea
i s the
is a
Now
In f a c t , g(a+w) = {ind(a+w),
g(x) = x
where
by c o f i n a l i t y .
g(a+w).
i n d u c t i v e l y on
g(a+n+l) = { i n d ( a + n + l ) , g(a+n)}|<|>
such t h a t
that
and
g(ea+o>) = { e a , e a + n + l } <
| J> = ea+u>.
then we
have
ea+x
If
ea+x < e a + 1
Then g ( e a + 1 ) = { i n d ( e a + 1 ) , ( e a + 1 ) ' } |
must consider
this
is
= ea+i lea
where
In f a c t , by c o f i n a l i t y
ea = {ea'}|<J>, hence
EXPONENTIATION
177
g(x) = x is broken at in
to
co
g ( 2 ~ V b ) = -b + 2~ n .
b.
178
179
So the above description of g(b) applies if the number of
minuses after e is less than eco. If there are exactly ew minuses
after e, then ind b is e followed by a minus. The set of lower
elements now have the form {e-l,x}. The upper elements all consist of
e followed by a plus and a sequence of minuses. It is clear by
cofinality that g(b) = e, i.e. g(we l) = e. We now have a setup for a
double induction.
We lead the reader by the hand for a short distance. If b
has ew+1 minuses after e then we may express g(b) as {e-l}|{e}
by cofinality. Hence g(b) consists of e followed by a minus and a
plus, e-1 remains as a lower element until we obtain e w 2 minuses;
hence as we noted earlier g(b) continues the same way as b. Hence if
b has ew+r minuses after e where r is finite, then g(b) is e
followed by a minus and plus followed by r-1 minuses. If r is
infinite then g(b) also has a tail of r minuses since the shortage
by one gets wiped out at ew + w.
When b has eu>-2 minuses after e, e-1 no longer occurs
as a lower element and we thus easily obtain g(b) = e-1. The situation
for eu>-2 + r is similar to what we had before, the only difference being
that g(b) has two minuses after e rather than just one. This
continues in a similar way up to any b which consists of e followed
by ewn + r minuses. g(b) consists of n minuses after e followed
by a plus and the contribution of r.
We now write 3 = ewa + r. The induction on a works the
same way for all non-limit ordinals. In fact, if 3 = ewa then g(b)
consists of e followed by a-1 minuses. If 3 = ewa + r for finite
r then g(b) consists of e followed by a minuses, a plus, and r-1
minuses. If 3 = ewa + r for infinite r then g(b) consists of e
followed by a minuses, a plus, and r minuses. A slight difference
occurs if a is a limit ordinal. The case 3 = ew 2 is typical. In
this case ind b which is e followed by u> minuses is cofinal in the
set of lower elements. For the upper elements we have the cofinal set
consisting of the elements of the form e followed by n minuses.
Hence g(b) consists of e followed by w minuses and a plus. If
3 = ew 2 + r then g(b) consists of e followed by GO minuses, a plus,
and r minuses. Note that for finite r this is different from the
case where a is a non-limit ordinal because of the start at r = 0. In
180
If b >_ 1 then
g(b) _> b.
EXPONENTIATION
Remarks.
181
It
g(b').
lower element of
b.
If
b1 = 0
then there
g ( l ) , which is legal,since
1 is a
The requirement
b.
We shall
b.
Theorem 10.17. If ea+cu <_ b _< a < ea+i for some epsilon number
some ordinal a then g(b) = b. The same conclusion applies if
1 _< b a < e 0 .
e a and
182
e a +w into the set of all b' so that the lower set {ind b,b'} has its
usual meaning.)
Now 3 is not an epsilon number since it is strictly between
e a and eOL+l. Hence ind 3 is an ordinal less than 3. Therefore
ind b _< ind 3 < b. Therefore {ind b, b'}|{b"} = b by cofinality. This
completes the proof.
The same argument applies to the second part. As we noted
earlier, as a technical point we need the element one as a lower element
so that we can reintroduce zero by cofinality. Formally speaking, if one
is picayune and use the notation B' for the set of all b 1 , the set of
lower elements should be expressed first as {ind b},(B'-{0}) which
simplifies to {ind b},{B'} by cofinality. We prefer a slight abuse of
notation for the sake of readability.
The above theorem shows that the equality g(b) = b is par
for the course, i.e in a rough sense "most" numbers are covered by the
theorem. We now consider numbers which are "close" to epsilon numbers.
Theorem 10.18. If e _< b _< e+n for some epsilon number e and integer
n then g(b) is obtained from b as follows. If the sign sequence of
b is e followed by the sequence S then g(b) is the juxtaposition
e followed by a plus and then S.
Note. By a careful look at the sign sequence formula one can show that
g(b) = b+1, as was stated as part of theorem 10.14 in the special case
where b is an ordinal. We prefer to deemphasize this point of view
since in general a simple operation such as inserting a plus into a
sequence can have a complicated effect on the normal form. This is
because the existence of the plus affects the contribution of the
subsequent signs. Although the general problem concerning the effect of
insertion of signs on the algebraic value of a surreal number is of
intrinsic interest, it is far removed from the theory of exponentiation.
For our purpose it suffices to attempt to express the function g in
what appears to be the most tractable way.
Proof. The result is known for ordinals. We may thus assume that for
some integer n>0 we have b = e a +n+c where - K c < 0 , i.e. b begins
with e+n pluses followed by a minus. Hence a cofinal lower set in the
EXPONENTIATION
183
canonical representation of
which begin w i t h
satisfy
a lower element of
e+n
then
g(b')
and
lexicographical
i t s e l f continue w i t h a minus.
g(b")
order
but
Also, since
Thus
e+n
b1
i s superfluous by c o f i n a l i t y .
also has t h i s
and
Since
g [ e + n - l ] = e+n
is
Hence
the elements of
g(b)
e.
By the
property.
Theorem 10.19.
ind b = e.
We now f i l l
Note.
g ( b ) , ind b = e
g(b) = { g ( b * ) } { g ( b " ) } .
e+n _< b
ind e = ind(e+w) = e
the form
e + ( n - l ) pluses.
and
slightly.
and a l l
integers
n,
g(b) = b.
10.18.
Proof.
Such a
begins with
e+co
followed by a minus.
ind b = e
i s superfluous.
and since
Hence
Assume f i r s t t h a t a f t e r the
g(e+n) = e+n+1
no f u r t h e r changes in s i g n .
e+w
g(b) = { g ( b ' ) } | { g ( b M ) }
is a
If
there
pluses then
An easy
which s a t i s f y
= b1,
so
= { b ' } | { b n } = b.
begins w i t h
by a minus, i . e .
b _< 3.
b < e
Namely, t h i s is the
$ < e
does
followed by a minus.
So using
satisfying
i n t o t h i s case, i . e .
e
b
followed
satisfy
We include
184
EXPONENTIATION
185
186
EXPONENTIATION
187
an extra plus.
This is similar to what happens in the simple case for
ordinals at the first epsilon number. In fact, for any x such that
c
Ji x JS d$ the same reasoning we used so far including the splitting up
into various cases can be applied to the tail of c in x in order to
determine the sign sequence of the rest of g(x). Again we are left with
only one case to consider further. Specifically, the case consists of an
element beginning with c followed by e pluses for some epsilon number
e greater than e and then followed by e^ft minuses for some limit
ordinal 3.
We now have a set up for a "grand" induction. At every stage
we are left with one case to consider further so it appears that we will
never be done. The pattern is as follows. We have strings of pluses and
minuses, the ith string of pluses has length e. where (e.) is a
strictly increasing sequence of epsilon numbers, and the ith string of
minuses has length e.uxx. where a. is a limit ordinal for all i.
What we do next is to consider a sequence b such as the one
described above where i runs through the set of all positive integers.
By the induction described above we know the value of g for every
proper initial segment of b. Now g(b) = {ind b, g(b')}|{g(b")}. By
cofinality we may limit ourselves to those initial segments b of b
which consist of full strings. Hence by the inductive hypothesis g(b)
has the following form: the ith string of pluses has length e., the
ith string of minuses has length a. and a plus is added at the end.
(This is valid whether the last string in b consists of pluses or
minuses.) Also, ind b has the same form but, of course, with no plus at
the end. By mutual cofinality we can ignore the plus at the end in
g(b). This is easy to see since the lower element obtained by stopping
after the ith string of minuses then followed by a plus is less than
the element obtained by stopping after the (i+l)st string of minuses,
and the upper element obtained by stopping after the ith string of
pluses is greater than the element obtained by stopping after the
(i+l)st string of minuses then followed by a plus.
Hence g(b) simplifies to {ind b, ind b 1 }|{Ind b"}. If we
apply the remark concerning restricting oneself to full strings by
cofinality to ind b, it follows that {ind b'}|{ind b'1} is precisely
ind b.
188
We have shown that g(b) has the form {ind b, F}|{G} where
F|G = ind b and F and G are initial segments of ind b. It follows
easily that g(b) is ind b followed by a plus. We see this directly
as follows. If we denote ind b followed by a plus by c then clearly
F < ind b < c. Since G is an initial segment of ind b and
ind b < G, c < G by the lexicographical order. Hence F < c < G. Since
ind b < c, we see that c satisfies the betweenness condition. On the
other hand, suppose F < x < G and ind b < x. Since F|G = ind b it
follows that ind b is an initial segment of x. Since ind b < x it
follows that x must begin with ind b followed by a plus, i.e. x has
c as an initial segment.
It is now easy to see that the situation after the co
strings of pluses and minuses behaves the same way as the situation after
e discussed earlier. Furthermore, the same argument used above for to
strings of pluses and minuses applies to a strings of pluses and
minuses where a is any limit ordinal.
Thus our discussion takes care of the value of g for an
arbitrary sequence of pluses and minuses. Although the final result
could be expressed as a formal theorem we feel that this would make the
pattern look more complicated than it is, i.e. we feel that it is most
lucid to express the procedure in the somewhat informal manner which we
used.
However, we shall end this book by a formal theorem which
b
describes precisely when g(b) = b, i.e. exp w b = ^ . This result
should be of interest since it is a natural culmination of earlier
theorems.
Theorem 10.20.
two forms.
e0
is the first
(2) b begins with at least e 0 pluses and the first string in the
sequence for b such that the initial segment of b which terminates at
the end of the string is not a generalized epsilon number is a string of
pluses. Furthermore, if a is the number of pluses then choose e to
be the largest number of pluses such that e < a and the sequence
EXPONENTIATION
189
n.
Now suppose that for no ordinal a < e 0 does b satisfy
b _< a. This is equivalent to the condition that b begins with at least
e 0 pluses.
Now assume that b is not a generalized epsilon number but
b consists of c followed by a string of minuses and c is a generalized epsilon number. Then ind b consists of c followed by a
string of minuses of length smaller than the corresponding last string in
c. Hence ind b > b. Since g(b) > ind b it follows that g(b) * b.
190
191
REFERENCES
[1]
[2]
192
INDEX
Abelian group
addition
, natural
17
13
41
betweenness property
15
canonical representation
cofinal
c o f i n a l i t y theorem
, inverse
commutative ring
convex
11
9, 10
10
12
19
54
Dedekind cut
diagonalization
dyadic f r a c t i o n
32
121
28a, 29, 34ff
element, lower
, upper
epsilon numbers
exp y F
exponentiation
17
17
121, 175
154ff
143, 145
FIG
field
fixed points
7, 10, 11
22, 39
124ff
groupoid
24
Hensel's lemma
73
, natural
non-standard analysis
normal form
, common
integer
length
logarithm
1
39, 48
3
7
111
3
161
48, 116
52 , 58, 62
order
order of magnitude
, higher
ordinals
3
54, 55
54
41
71
156
33
21, 37
52
41
104
76 , 85, 94
24
116
104
1, 3
3
60
59
tail
tai1 property
transfinite sequence
uniformity theorems
impartial games
infinitesimal
i n i t i a l segment
17
41
multiplication
15.
18
valuation theory
118
[x]n
n
144
55