Lect 1
Vector Spaces
In this course we work with ‘scalars’ and ‘vectors’. ‘Scalars’ are going to be elements of a chosen
(associative) ring K with identity, and ‘vectors’ are going to be [...]. The ring K is going to be
in general noncommutative and will be equipped with a chosen involution, but many significant
results are best obtained in a setting where further restrictions are imposed on K. We provide here
a collection of definitions and examples of the kind of scalars that we are going to need, or think
the user may need [...]
1. Z, Q, R, C with the usual addition and multiplication are commutative rings. With the
so this is transitive.
n’. Write [a] := {b ∈ Z | a ∼ b} and define Zn := {[a] | a ∈ Z}, [a] + [b] := [a + b], [a][b] := [ab].
(i) If a′ ∈ [a], b′ ∈ [b] then n divides a − a′ as well as b − b′; therefore n divides (a + b) − (a′ + b′).
Thus [a + b] = [a′ + b′] if [a] = [a′] and [b] = [b′]. This means addition above is
well defined. Similarly, n divides ab − a′b′ = (a − a′)b + a′(b − b′), so that
[ab] = [a′b′] if [a] = [a′] and [b] = [b′] which means multiplication is well defined.
(ii) [a] + [0] = [a] = [0] + [a]. Further, a = np + a′ gives −a = −np − a′, and
[a] + [−a] = [a − a] = [0], which means [−a] = −[a]. Therefore, under addition as defined,
Zn is an abelian group [it is clear that
[a] + ([b] + [c]) = [a] + [b + c] = [a + (b + c)] = [(a + b) + c] = [a + b] + [c] = ([a] + [b]) + [c]
and that [a] + [b] = [a + b] = [b + a] = [b] + [a]].
(iii) If [a] = [1] then a = nq + 1 so that ba = bnq + b hence [ba] = [b] and ab = nqb + b
hence [ab] = [b]. Thus [a] = [a][1] = [1][a] for each [a] ∈ Zn. Clearly [a]([b][c]) =
[a][bc] = [a(bc)] = [(ab)c] = [ab][c] = ([a][b])[c] for all [a], [b], [c] ∈ Zn. Therefore, under
multiplication as defined, Zn has the identity [1] and the multiplication is associative.
(iv) We have [a]([b] + [c]) = [a][b + c] = [a(b + c)] = [ab + ac] = [ab] + [ac] = [a][b] + [a][c]
and ([a] + [b])[c] = [a + b][c] = [(a + b)c] = [ac + bc] = [ac] + [bc] = [a][c] + [b][c]. Thus
multiplication distributes over addition on both sides.
(v) These calculations show that Zn is a ring under the defined operations.
(vi) Since [a][b] = [ab] = [ba] for each [a], [b] ∈ Zn, we see that Zn is a commutative ring.
and thus [a][r] = [ar] = [1]. Thus [a] ∈ Zn is invertible iff a is relatively prime to n. The
inverse of [a] is then given by [c], and we have [c] = [a]^(−1) iff there exists an integer d
such that ac − nd = 1.
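The criterion ac − nd = 1 can be turned into a computation by the extended Euclidean algorithm. The following is a minimal sketch, not part of the course text; the function names `extended_gcd` and `inverse_mod` are illustrative assumptions. It returns a representative c of [a]^(−1) when a is relatively prime to n, and `None` otherwise.

```python
def extended_gcd(a, b):
    """Return (g, x, y) with g = gcd(a, b) and a*x + b*y == g (a, b >= 0)."""
    old_r, r = a, b
    old_x, x = 1, 0
    old_y, y = 0, 1
    while r != 0:
        q = old_r // r
        old_r, r = r, old_r - q * r
        old_x, x = x, old_x - q * x
        old_y, y = y, old_y - q * y
    return old_r, old_x, old_y


def inverse_mod(a, n):
    """Return c in range(n) with [a][c] = [1] in Z_n, or None if [a] is not invertible."""
    g, x, _ = extended_gcd(a % n, n)
    if g != 1:
        return None  # a and n share a common factor, so no c and d with a*c - n*d == 1 exist
    return x % n
```

For instance, `inverse_mod(3, 7)` produces a c with 3c − 7d = 1 for a suitable integer d, while `inverse_mod(4, 6)` reports that [4] has no inverse in Z6.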
3. The ring Z occupies a very special position: f : Z → K defined by f(n) := n1 = 1 + ... + 1 [1 ∈ K]
is the only ring morphism from Z to K, for any ring K. [Indeed, f(1) = 1K is a must and so is [...]
supplies a ring morphism Z → K for any ring K; here we wrote the identity of K as 1K for
extra clarity because 1 ∈ Z is also there in the calculation, but one usually writes 1K simply
as 1].
We say “Z is an initial object in the category of rings” but this language will not be used
in this course.
(i) The least integer k > 0 such that f(k) = 0 is called the characteristic of K, and the
characteristic is taken to be 0 if no such k exists; clearly ker f = {km | m ∈ Z}.
(d) K has characteristic 1 iff f(1) = 0 i.e. 1 = 0 in K. Since we have agreed that we
have at least two distinct scalars 0 ≠ 1, no ring of scalars for this course will
have characteristic 1, but it should be noted that not every textbook on linear algebra [...]
4. (i) In connection with 3(ii)(d) above, this is perhaps the place to note that some textbooks
on linear algebra accept rings without identity as scalars. A ring without identity can be
enlarged to a ring with identity, so in the good old days one always did it. With modern
developments, there are situations [like ‘K-theory’] in which rings, or rather algebras,
without identity [...]
(ii) On the other hand, many textbooks on linear algebra take only commutative rings
as scalars and most of them in fact take only fields. Some accept only division rings.
The basic trouble with these is as follows: even if one works only with fields as
scalars, one has to work with ‘block matrices’ in [...]
into picture.
(iii) Working with only division rings or fields, one can avoid the term ‘module’ and use
only ‘vector spaces’. This is an economy which is only apparent: practically all the
work has to be done anyway. In particular, while one crucial fact, namely that a linear
operator T : V → V [where V is a vector space over a field F] in fact just makes V a module
over the polynomial ring F[θ], is never mentioned, all the computational work has still to be
done. So all one gains is that a concept has not been given its proper name.
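To make the remark about F[θ] concrete: a polynomial p(θ) acts on a vector v by p(θ)·v := p(T)v. The sketch below is only an illustration under stated assumptions: the names `apply_operator` and `act`, the representation of T as a list of rows, and the coefficient list [c0, c1, ..., ck] for c0 + c1 θ + ... + ck θ^k are all hypothetical choices, not notation from this course.

```python
def apply_operator(T, v):
    """Return T v for a square matrix T (list of rows) and a vector v (list)."""
    return [sum(T[i][j] * v[j] for j in range(len(v))) for i in range(len(T))]


def act(poly, T, v):
    """Return p(T) v, where poly = [c0, c1, ..., ck] stands for c0 + c1*θ + ... + ck*θ^k.

    Horner's scheme: start from 0 and repeatedly form T*(result) + c*v,
    taking the coefficients from the highest degree downwards.
    """
    result = [0 for _ in v]
    for c in reversed(poly):
        result = apply_operator(T, result)
        result = [r + c * x for r, x in zip(result, v)]
    return result
```

With T = [[0, 1], [-1, 0]] one has T² = −Id, so the polynomial θ² + 1, written `[1, 0, 1]`, acts as zero on every vector.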
(iv) If every module over K [...] K-module, K must be a division ring. Thus the best results
are certainly developed in the context of vector spaces [which are just modules over
division rings] and if one uses the very convenient characterization in terms of de[...]
vector spaces over fields”. However, it becomes an unnecessary journey into surprises
later when one finds that many things do not work in more [...] matrices are not invertible [...]
true, and so on; but it does not mean that one should never move out of the world
of real numbers.
(v) Some textbooks deal with only complex numbers as scalars; this is to prepare the
student for “Applicable functional analysis” and hence for many rich and exciting
subjects like quantum mechanics, wavelets, differential equations and so [...] (classical)
computation, one frequently meets with Zn, and since there is hardly any saving in [...]
(vi) Clearly, the ‘choice of reasonable scalars’ is a subjective choice. We hope our option [...]
a preferred conjugation’ prepares the users for a number of situations that may arise [...]
(vii) In all this, let us note that in Zn we have [n] = [0], so that, for instance, λ ∈ Z2, µ ∈ Z2
does not ensure that (λ + µ)/2 ∈ Z2. Many results will need “let F be a field of characteristic
not equal to 2”. [There are fields other than Z2 which have characteristic 2.]
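The role of the characteristic in (vii) can be checked by brute force in Zn. The following sketch is illustrative only; `characteristic_of_Zn` and `halves_exist` are hypothetical helper names. The first adds [1] to itself until [0] is reached, the second tests whether [2] has an inverse, which is what an expression like (λ + µ)/2 requires.

```python
def characteristic_of_Zn(n):
    """Least k > 0 with k·[1] = [0] in Z_n, found by adding [1] repeatedly."""
    total, k = 1 % n, 1
    while total != 0:
        total = (total + 1) % n
        k += 1
    return k


def halves_exist(n):
    """True if [2] is invertible in Z_n, so that division by 2 makes sense there."""
    return any((2 * c) % n == 1 for c in range(n))
```

In Z2 the class [2] equals [0], so `halves_exist(2)` is false, whereas in Z5 the class [3] serves as [2]^(−1).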
5. Let us note that the collection of all vectors in R3 is a ring with cross product as
multiplication; this multiplication is not associative [a × (b × c) = (a × b) × c fails],
is not commutative [a × b = b × a fails] and does not have an identity. Users of this course
can treat it as an object of curiosity in the beginning, but at the advanced level of linear
algebra, nonassociative multiplications enter into the picture [when one deals [...]
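A quick numerical check of the failures just listed, as a minimal sketch; the function name `cross` and the use of tuples for vectors in R3 are assumptions made for illustration.

```python
def cross(a, b):
    """Cross product of two vectors in R^3 given as 3-tuples."""
    return (
        a[1] * b[2] - a[2] * b[1],
        a[2] * b[0] - a[0] * b[2],
        a[0] * b[1] - a[1] * b[0],
    )


e1, e2, e3 = (1, 0, 0), (0, 1, 0), (0, 0, 1)

# Associativity fails: (e1 × e1) × e2 is the zero vector, but e1 × (e1 × e2) is not.
left = cross(cross(e1, e1), e2)
right = cross(e1, cross(e1, e2))

# Commutativity fails: e1 × e2 and e2 × e1 differ by a sign.
forward = cross(e1, e2)
backward = cross(e2, e1)
```

There is no identity either: a × b is always orthogonal to b, so it cannot equal b unless b = 0.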
= −q1 + q0 e1 + q3 e2 − q2 e3, while
e1 q = e1 q0 + e1 q1 e1 + e1 q2 e2 + e1 q3 e3
= q0 e1 + q1 e1 e1 + q2 e1 e2 + q3 e1 e3 [qi ∈ R ⊆ cen H]
= −q1 + q0 e1 − q3 e2 + q2 e3
which provides q3 = 0 = q2 [∵ Σ_{i=0}^{3} qi ei = Σ_{i=0}^{3} qi′ ei iff qi = qi′ for each i]
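The two expansions can be verified mechanically. The sketch below assumes the usual multiplication table e1 e2 = e3, e2 e3 = e1, e3 e1 = e2 with ei ei = −1, which agrees with the computation above, and represents q0 + q1 e1 + q2 e2 + q3 e3 by the tuple (q0, q1, q2, q3); the names `qmul` and `commutes_with_e1` are illustrative.

```python
def qmul(p, q):
    """Hamilton product of quaternions given as tuples (q0, q1, q2, q3)."""
    a1, b1, c1, d1 = p
    a2, b2, c2, d2 = q
    return (
        a1 * a2 - b1 * b2 - c1 * c2 - d1 * d2,
        a1 * b2 + b1 * a2 + c1 * d2 - d1 * c2,
        a1 * c2 - b1 * d2 + c1 * a2 + d1 * b2,
        a1 * d2 + b1 * c2 - c1 * b2 + d1 * a2,
    )


E1 = (0, 1, 0, 0)


def commutes_with_e1(q):
    """True iff q e1 == e1 q, which by the computation above means q2 = 0 = q3."""
    return qmul(q, E1) == qmul(E1, q)
```

For q = (1, 2, 3, 4) the products q e1 and e1 q differ exactly in the signs of the e2 and e3 coefficients, as in the two displayed expansions.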
[...] λ = λ0 + e1 λ1 + e2 λ2 + e3 λ3 ∈ H as the standard involution, but λ* = λ0 + e1 λ1 − e2 λ2 − e3 λ3 is
also an involution and there could be others [such as ?]. We do not attempt an explanation [...]
(i) (a) If f : Z → Z is a ring morphism, we have f(1) = 1 and thus f(m) = f(m·1) = m f(1) = m
for each m ∈ Z.
(c) If f : Q → Q is a ring morphism, we have n f(m/n) = f(n · m/n) = f(m) = f(m·1) =
m f(1) = m = n · (m/n), and hence f(m/n) = m/n for any r ∈ Q, r = m/n, m, n ∈ Z, n > 0.
(d) Suppose K and L are rings, f : K → L a ring morphism. Write ker f := {λ ∈ K | f(λ) = 0}.
Then if λ ∈ ker f and a ∈ K, we have f(λa) = f(λ)f(a) = 0 f(a) = 0 i.e. λa ∈
ker f; similarly aλ ∈ ker f. [This makes ker f an ideal of K; for the definition
of ‘ideal’ see below.] Now if K and L are division rings, 0 ≠ λ ∈ ker f, then
1 = λλ^(−1) ∈ ker f, i.e. f(1) = 0, which contradicts f(1) = 1 ≠ 0; hence ker f = {0}.
To sum up: If K and L are division rings (and in particular, if they are fields),
any ring morphism f : K → L is necessarily injective.
(ii) If f : R → R is a ring morphism, then by above it is injective and, arguing as in (i)(c),
we have f(r) = r for r ∈ Q. If x ∈ R, x > 0 then f(x) = f(√x √x) = (f(√x))² > 0
[f(√x) ≠ 0 as f is injective], so that if a > b and thus a − b > 0, we get
f(a − b) = f(a) − f(b) > 0 i.e. f(a) > f(b). This means f must
be order-preserving. Now let x ∈ R and choose rn, sn ∈ Q such that rn < x < sn with
∩_{n=1}^{∞} [rn, sn] = {x}. Then rn = f(rn) < f(x) < f(sn) = sn for each n, so that
f(x) ∈ ∩_{n=1}^{∞} [rn, sn] = {x}, i.e. f(x) = x.
None of the rings Z, Zn, Q, R admits any ring morphism from itself to itself other
than the identity. Since each of them is commutative, morphisms are the same as
anti-morphisms. Therefore none of them admits a nontrivial involution.
[...] isomorphisms are the same as antiisomorphisms. If f : C → C is
an isomorphism such that f(x) = x for x ∈ R, then f(x + iy) = f(x) + f(i)f(y) = x + f(i)y with
(f(i))² = f(i)f(i) = f(i²) = f(−1) = −1 so that f(i) = ±i. To sum up: [...]
(iii) If d ≠ 0, 1 is an integer which is square free [in the sense that its prime factorization
has no square], we write Z[√d] := {a + b√d | a, b ∈ Z} and Q[√d] := {a + b√d | a, b ∈ Q}.
Then under obvious multiplication and addition, they are commutative rings and
Q[√d] is actually a field; both carry the nontrivial involution a + b√d ↦ a − b√d.
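The arithmetic of Q[√d] can be modelled exactly with rational coefficients. The class below is a minimal sketch under stated assumptions: the name `QuadraticNumber` and its methods are illustrative, and d is supplied by the caller as a square-free integer different from 0 and 1 (this is not checked). It shows the ‘obvious’ multiplication, the involution, and the inverse obtained from (a + b√d)(a − b√d) = a² − db².

```python
from fractions import Fraction


class QuadraticNumber:
    """The element a + b*sqrt(d) of Q[sqrt(d)], with a, b rational."""

    def __init__(self, a, b, d):
        self.a, self.b, self.d = Fraction(a), Fraction(b), d

    def __eq__(self, other):
        return (self.a, self.b, self.d) == (other.a, other.b, other.d)

    def __add__(self, other):
        return QuadraticNumber(self.a + other.a, self.b + other.b, self.d)

    def __mul__(self, other):
        # (a + b√d)(a' + b'√d) = (aa' + d bb') + (ab' + ba')√d
        return QuadraticNumber(
            self.a * other.a + self.d * self.b * other.b,
            self.a * other.b + self.b * other.a,
            self.d,
        )

    def conjugate(self):
        """The involution a + b√d ↦ a − b√d."""
        return QuadraticNumber(self.a, -self.b, self.d)

    def inverse(self):
        """Inverse of a nonzero element; a² − d b² is nonzero because d is not a square."""
        norm = self.a * self.a - self.d * self.b * self.b
        return QuadraticNumber(self.a / norm, -self.b / norm, self.d)
```

Since the ring is commutative, the involution property reduces to `(x * y).conjugate() == x.conjugate() * y.conjugate()`, which can be tried on sample elements.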
(iv) For a prime number p, Zp is a finite field with exactly p elements; this has been
introduced in 2(viii) page 2 above. We supply the following facts without proof; the
proofs are not exactly obvious and are, properly speaking, part of an algebra course [...]
(c) For the finite field with p^r elements, the characteristic is p. Further, λ ↦ λ^p is [...]
(e) Let us examine the field F with p² elements for p = 2. Quite clearly, this cannot
be Z4 [in fact the field with p^r elements, r > 1, will never be Zq, q = p^r] because
Zn is a field iff n is a prime [2(viii), page 2 above]. [...] Verify that it carries addition
and multiplication given by the following tables:
 + | 0 1 a b          · | 0 1 a b
---+---------        ---+---------
 0 | 0 1 a b          0 | 0 0 0 0
 1 | 1 0 b a   and    1 | 0 1 a b
 a | a b 0 1          a | 0 a b 1
 b | b a 1 0          b | 0 b 1 a

Verify that this is the same as the additive fragment of Z3. There are no finite division [...]
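The verification asked for in (e) can be carried out by exhaustive checking. The sketch below encodes the addition and multiplication tables of the four-element field {0, 1, a, b} as strings; the names `is_field` and `units_match_Z3` are illustrative. The second function reads the comparison with Z3 as a statement about the nonzero elements {1, a, b} under multiplication, via 1 ↦ [0], a ↦ [1], b ↦ [2]; that reading is an assumption.

```python
ELEMENTS = "01ab"
ADD_ROWS = ["01ab", "10ba", "ab01", "ba10"]   # row x, column y gives x + y
MUL_ROWS = ["0000", "01ab", "0ab1", "0b1a"]   # row x, column y gives x · y


def table(rows):
    """Turn the rows of a table into a dictionary keyed by pairs (x, y)."""
    return {(x, y): rows[i][j]
            for i, x in enumerate(ELEMENTS)
            for j, y in enumerate(ELEMENTS)}


add, mul = table(ADD_ROWS), table(MUL_ROWS)


def is_field():
    """Check the field axioms for the two tables by brute force."""
    E = ELEMENTS
    assoc = all(add[add[x, y], z] == add[x, add[y, z]]
                and mul[mul[x, y], z] == mul[x, mul[y, z]]
                for x in E for y in E for z in E)
    comm = all(add[x, y] == add[y, x] and mul[x, y] == mul[y, x]
               for x in E for y in E)
    dist = all(mul[x, add[y, z]] == add[mul[x, y], mul[x, z]]
               for x in E for y in E for z in E)
    ident = all(add["0", x] == x and mul["1", x] == x for x in E)
    neg = all(any(add[x, y] == "0" for y in E) for x in E)
    inv = all(any(mul[x, y] == "1" for y in E) for x in E if x != "0")
    return assoc and comm and dist and ident and neg and inv


PHI = {"1": 0, "a": 1, "b": 2}  # assumed identification of {1, a, b} with Z3


def units_match_Z3():
    """True iff multiplication on {1, a, b} corresponds to addition modulo 3 under PHI."""
    return all(PHI[mul[x, y]] == (PHI[x] + PHI[y]) % 3 for x in PHI for y in PHI)
```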
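\[ KI = \begin{pmatrix} 0 & i \\ i & 0 \end{pmatrix} \begin{pmatrix} i & 0 \\ 0 & -i \end{pmatrix} = \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix} = J \]
\[ IK = \begin{pmatrix} i & 0 \\ 0 & -i \end{pmatrix} \begin{pmatrix} 0 & i \\ i & 0 \end{pmatrix} = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix} = -J \]
\[ IJ = \begin{pmatrix} i & 0 \\ 0 & -i \end{pmatrix} \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix} = \begin{pmatrix} 0 & i \\ i & 0 \end{pmatrix} = K \]
\[ JI = \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix} \begin{pmatrix} i & 0 \\ 0 & -i \end{pmatrix} = \begin{pmatrix} 0 & -i \\ -i & 0 \end{pmatrix} = -K \]
\[ J^2 = \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix} \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix} = \begin{pmatrix} -1 & 0 \\ 0 & -1 \end{pmatrix} = -\mathrm{Id}_2 \]
\[ K^2 = \begin{pmatrix} 0 & i \\ i & 0 \end{pmatrix} \begin{pmatrix} 0 & i \\ i & 0 \end{pmatrix} = \begin{pmatrix} -1 & 0 \\ 0 & -1 \end{pmatrix} = -\mathrm{Id}_2 \]
\[ I^2 = \begin{pmatrix} i & 0 \\ 0 & -i \end{pmatrix} \begin{pmatrix} i & 0 \\ 0 & -i \end{pmatrix} = \begin{pmatrix} -1 & 0 \\ 0 & -1 \end{pmatrix} = -\mathrm{Id}_2 \]
If α = a + bi, β = c + di, a, b, c, d ∈ R,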
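we can write a quaternion uniquely as q = a Id2 + bI + cJ + dK, and conversely a matrix of the
sort a Id2 + bI + cJ + dK would stand for the quaternion [...]. This can be done by correspondence [...]
table for quaternions corresponds to the matrix multiplication performed above; since the
addition obviously does [...] the mapping f(e0) = Id2, f(e1) = I, f(e2) = J, f(e3) = K establishes [...]
introduced just now. So the two are the same. Further,
\[ f(\alpha) = \begin{pmatrix} \alpha & 0 \\ 0 & \bar{\alpha} \end{pmatrix} = a\,\mathrm{Id}_2 + bI \]
for α = a + bi ∈ C embeds the field C into H via
\[ a + bi = \alpha \mapsto \begin{pmatrix} \alpha & 0 \\ 0 & \bar{\alpha} \end{pmatrix}. \]
Similarly, with a ∈ R being identified to $\begin{pmatrix} a & 0 \\ 0 & a \end{pmatrix}$, we can show that the matrices
\[ \begin{pmatrix} a & -b \\ b & a \end{pmatrix} = a\,\mathrm{Id}_2 - bJ, \quad a, b \in \mathbb{R}, \]
form a field under multiplication and addition which is isomorphic to C via
\[ a + bi \leftrightarrow \begin{pmatrix} a & -b \\ b & a \end{pmatrix}. \]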
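The matrix identities used for the quaternions, and the multiplicativity of the correspondence between a + bi and the 2 × 2 real matrix with rows (a, −b) and (b, a), can be confirmed with a few lines of code. This is a minimal sketch using plain nested lists and Python's built-in complex numbers; the helper names `matmul`, `neg` and `complex_as_matrix` are illustrative assumptions.

```python
def matmul(A, B):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def neg(A):
    """Entrywise negative of a matrix."""
    return [[-x for x in row] for row in A]


ID2 = [[1, 0], [0, 1]]
I = [[1j, 0], [0, -1j]]
J = [[0, 1], [-1, 0]]
K = [[0, 1j], [1j, 0]]


def complex_as_matrix(a, b):
    """The real matrix with rows (a, -b), (b, a) corresponding to a + bi."""
    return [[a, -b], [b, a]]
```

One can check, for example, that `matmul(I, J) == K`, that each of I, J, K squares to `neg(ID2)`, and that multiplying `complex_as_matrix(a, b)` by `complex_as_matrix(c, d)` gives `complex_as_matrix(a*c - b*d, a*d + b*c)`.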