Elementary Row Operations and Row-Echelon Matrices
Elementary Row Operations and Row-Echelon Matrices
2007/2/16
page 139
i i
For Problems 6–8, determine the coefficient matrix, A, the 12. x1 = −4x1 + 3x2 + 4t, x2 = 6x1 − 4x2 + t 2 .
right-hand-side vector, b, and the augmented matrix A# of
the given system. 13. x1 = t 2 x1 − tx2 , x2 = (− sin t)x1 + x2 .
x1 + 2x2 − 3x3 = 1,
14. x1 = e2t x2 , x2 + (sin t)x1 = 1.
6. 2x1 + 4x2 − 5x3 = 2,
7x1 + 2x2 − x3 = 3.
15. x1 = (− sin t)x2 + x3 + t, x2 = −et x1 + t 2 x3 + t 3 ,
x + y + z − w = 3, x3 = −tx1 + t 2 x2 + 1.
7.
2x + 4y − 3z + 7w = 2.
For Problems 16–17 verify that the given vector function x
x1 + 2x2 − x3 = 0, defines a solution to x = Ax + b for the given A and b.
8. 2x1 + 3x2 − 2x3 = 0, 4t
5x1 + 6x2 − 5x3 = 0. e 2 −1 0
16. x(t) = , A = , b(t) = .
−2e4t −2 3 0
For Problems 9–10, write the system of equations with the
given coefficient matrix and right-hand-side vector.
4e−2t + 2 sin t 1 −4
17. x(t) = , A = ,
1 −1 2 3 1 3e−2t − cos t −3 2
9. A = 1 1 −2 6 , b = −1 . −2(cos t + sin t)
b(t) = .
3 1 4 2 2 7 sin t + 2 cos t
In the next section we will develop methods for solving a system of linear equations.
These methods will consist of reducing a given system of equations to a new system that
has the same solution set as the given system but is easier to solve. In this section we
introduce the requisite mathematical results.
i i
i i
i i “main”
2007/2/16
page 140
i i
Solution: If we permute (i.e., interchange), say, Equations (2.4.1) and (2.4.2), the
resulting system is
2x1 − 5x2 + 3x3 = 6,
x1 + 2x2 + 4x3 = 2,
4x1 + 6x2 − 7x3 = 8,
which certainly has the same solution set as the original system. Returning to the original
system, if we multiply, say, Equation (2.4.2) by 5, we obtain the system
x1 + 2x2 + 4x3 = 2,
10x1 − 25x2 + 15x3 = 30,
4x1 + 6x2 − 7x3 = 8,
which again has the same solution set as the original system. Finally, if we add, say,
twice Equation (2.4.1) to Equation (2.4.3), we obtain the system
We can verify that, if (2.4.4)–(2.4.6) are satisfied, then so are (2.4.1)–(2.4.3), and
vice versa. It follows that the system of equations (2.4.4)–(2.4.6) has the same solution
set as the original system of equations (2.4.1)–(2.4.3).
More generally, similar reasoning can be used to show that the following three
operations can be performed on any m × n system of linear equations without altering
the solution set:
1. Permute equations.
2. Multiply an equation by a nonzero constant.
3. Add a multiple of one equation to another equation.
Since these operations involve changes only in the system coefficients and constants
(and not changes in the variables), they can be represented by the following operations
on the rows of the augmented matrix of the system:
1. Permute rows.
2. Multiply a row by a nonzero constant.
3. Add a multiple of one row to another row.
i i
i i
i i “main”
2007/2/16
page 141
i i
These three operations, called elementary row operations, will be a basic computational
tool throughout the text, even in cases when the matrix under consideration is not derived
from a system of linear equations. The following notation will be used to describe
elementary row operations performed on a matrix A.
Furthermore, the notation A ∼ B will mean that matrix B has been obtained from
matrix A by a sequence of elementary row operations. To reference a particular elemen-
tary row operation used in, say, the nth step of the sequence of elementary row operations,
n
we will write ∼ B.
Example 2.4.2 The one-step operations performed on the system in Example 2.4.1 can be described as
follows using elementary row operations on the augmented matrix of the system:
1 2 4 2 2 −5 3 6
2 −5 3 6 ∼ 1
1 2 4 2 1. P12 . Permute (2.4.1) and (2.4.2).
4 6 −7 8 4 6 −7 8
1 2 4 2 1 2 4 2
2 −5 3 6 ∼ 10 −25 15 30 1. M2 (5). Multiply (2.4.2) by 5.
1
4 6 −7 8 4 6 −7 8
1 2 4 2 1 2 4 2
2 −5 3 6 ∼ 2 −5 3 6
1
1. A13 (2). Add 2 times (2.4.1) to (2.4.3).
4 6 −7 8 6 10 1 12
It is important to realize that each elementary row operation is reversible; we can “un-
do” a given elementary row operation by another elementary row operation to bring the
modified linear system back into its original form. Specifically, in terms of the notation
introduced above, the reverse operations are determined as follows (ERO refers here to
“elementary row operation”):
We introduce a special term for matrices that are related via elementary row operations.
DEFINITION 2.4.3
Let A be an m × n matrix. Any matrix obtained from A by a finite sequence of
elementary row operations is said to be row-equivalent to A.
i i
i i
i i “main”
2007/2/16
page 142
i i
Thus, all of the matrices in the previous example are row-equivalent. Since elemen-
tary row operations do not alter the solution set of a linear system, we have the next
theorem.
Theorem 2.4.4 Systems of linear equations with row-equivalent augmented matrices have the same
solution sets.
Row-Echelon Matrices
Our methods for solving a system of linear equations will consist of using elementary
row operations to reduce the augmented matrix of the given system to a simple form.
But how simple a form should we aim for? In order to answer this question, consider the
system
x1 + x2 − x3 = 4, (2.4.7)
x2 − 3x3 = 5, (2.4.8)
x3 = 2. (2.4.9)
This system can be solved most easily as follows. From Equation (2.4.9), x3 = 2.
Substituting this value into Equation (2.4.8) and solving for x2 yields x2 = 5 + 6 = 11.
Finally, substituting for x3 and x2 into Equation (2.4.7) and solving for x1 , we obtain
x1 = −5. Thus, the solution to the given system of equations is (−5, 11, 2), a single
vector in R3 . This technique is called back substitution and could be used because the
given system has a simple form. The augmented matrix of the system is
1 1 −1 4
0 1 −3 5
0 0 1 2
We see that the submatrix consisting of the first three columns (which corresponds
to the matrix of coefficients) is an upper triangular matrix with the leftmost nonzero
entry in each row equal to 1. The back-substitution method will work on any system of
linear equations with an augmented matrix of this form. Unfortunately, not all systems
of equations have augmented matrices that can be reduced to such a form. However,
there is a simple type of matrix to which any matrix can be reduced by elementary row
operations, and which also represents a system of equations that can be solved (if it has
a solution) by back substitution. This is called a row-echelon matrix and is defined as
follows:
DEFINITION 2.4.5
An m × n matrix is called a row-echelon matrix if it satisfies the following three
conditions:
1. If there are any rows consisting entirely of zeros, they are grouped together at
the bottom of the matrix.
2. The first nonzero element in any nonzero row4 is a 1 (called a leading 1).
3. The leading 1 of any row below the first row is to the right of the leading 1 of
the row above it.
4
A nonzero row (nonzero column) is any row (column) that does not consist entirely of zeros.
i i
i i
i i “main”
2007/2/16
page 143
i i
whereas
1 0 0
1 0 −1 0
0 1 2 0 0
and 0 1 −1
0 1 −1
0 0 1
i i
i i
i i “main”
2007/2/16
page 144
i i
hand calculations, fewer algebraic errors result if we avoid the use of fractions. In this
case, we can obtain a leading 1 without the use of fractions by adding −1 times row 3
to row 2.
1 −1 2 1
3 0 1 −6 −4
∼ 0 2 1 5
Step 3 row operation: Add −1 times row 3 to row 2.
0 2 −3 1
Step 4: Use the leading 1 in the (2, 2) position to put zeros beneath it in column 2.
We now add appropriate multiples of row 2 to the rows beneath it. For row-echelon
form, we need not be concerned about the row above it, however.
1 −1 2 1
4 0 1 −6 −4 Add −2 times row 2 to row 3.
∼0 0 13 13
Step 4 row operations:
Add −2 times row 2 to row 4.
0 0 9 9
0 0 9 9
Step 6: Use the leading 1 in the (3, 3) position to put zeros beneath it in column 3.
The appropriate row operation is to add −9 times row 3 to row 4.
1 −1 2 1
6 0 1 −6 −4
∼ 0 0 1 1
0 0 0 0
This is a row-echelon matrix, hence the given matrix has been reduced to row-echelon
form. The specific operations used at each step are given next, using the notation intro-
duced previously in this section. In future examples, we will simply indicate briefly the
elementary row operation used at each step. The following shows this description for the
present example.
Remarks
1. Notice that in steps 2 and 4 of the preceding example we have performed multi-
ple elementary row operations of the type Aij (k) in a single step. With this one
exception, the reader is strongly advised not to combine multiple elementary row
operations into a single step, particularly when they are of different types. This is
a common source of calculation errors.
i i
i i
i i “main”
2007/2/16
page 145
i i
2. The reader may have noticed that the particular steps taken in the preceding exam-
ple are not uniquely determined. For instance, we could have achieved a leading 1
in the (1, 1) position in step 1 by multiplying the first row by 1/2, rather than per-
muting the first two rows. Therefore, we may have multiple strategies for reducing
a matrix to row-echelon form, and indeed, many possible row-echelon forms for
a given matrix A. In this particular case, we chose not to multiply the first row by
1/2 in order to avoid introducing fractions into the calculations.
The reader is urged to study the foregoing example very carefully, since it illustrates
the general procedure for reducing an m×n matrix to row-echelon form using elementary
row operations. This procedure will be used repeatedly throughout the text. The idea
behind reduction to row-echelon form is to start at the upper left-hand corner of the
matrix and proceed downward and to the right in the matrix. The following algorithm
formalizes the steps that reduce any m × n matrix to row-echelon form using a finite
number of elementary row operations and thereby provides a proof of Theorem 2.4.7.
An illustration of this algorithm is given in Figure 2.4.1.
ze
ze
* Pivot position * . *
ro
ro
ro
s
s
* * 0 *
Nonzero elements New submatrix
Submatrix Pivot column
i i
i i
i i “main”
2007/2/16
page 146
i i
1 0 −3 4 1 0 −3 4 0 −1 −2 3
6 6
Pivot column
Pivot column Pivot column
@R
@
1 1 −1 1 1 1 −1 1 1 1 −1 1
∼ 0 1 2 1 ∼ 0 1 2 1 ∼ 0 1 2 1
3 4 5
0 −1 −2 3 0 0 0 4j 0 0 0 1
6
Pivot position
This is a row-echelon matrix and hence we are done. The row operations used are
summarized here:
Theorem 2.4.10 Let A be an m × n matrix. All row-echelon matrices that are row-equivalent to A have
the same number of nonzero rows.
Theorem 2.4.10 associates a number with any m × n matrix A—namely, the number
of nonzero rows in any row-echelon form of A. As we will see in the next section, this
number is fundamental in determining the solution properties of linear systems, and
indeed it plays a central role in linear algebra in general. For this reason, we give it a
special name.
DEFINITION 2.4.11
The number of nonzero rows in any row-echelon form of a matrix A is called the
rank of A and is denoted rank(A).
3 1 4
Example 2.4.12 Determine rank(A) if A = 4 3 5.
2 −1 3
i i
i i
i i “main”
2007/2/16
page 147
i i
2 −1 3 2 −1 3 0 −5 1 0 −5 1 0 0 0
Since there are two nonzero rows in this row-echelon form of A, it follows from Defini-
tion 2.4.11 that rank(A) = 2.
In the preceding example, the original matrix A had three nonzero rows, whereas any
row-echelon form of A has only two nonzero rows. We can interpret this geometrically as
follows. The three row vectors of A can be considered as vectors in R3 with components
c1 a1 + c2 a2 + c3 a3 ,
and thus the rows of a row-echelon form of A are all of this form. We have been combining
the vectors linearly. The fact that we obtained a row of zeros in the row-echelon form
means that there are values of the constants c1 , c2 , c3 such that
c1 a1 + c2 a2 + c3 a3 = 0,
where 0 denotes the zero vector (0, 0, 0). Equivalently, one of the vectors can be written
in terms of the other two vectors, and therefore the three vectors lie in a plane. Reducing
the matrix to row-echelon form has uncovered this relationship among the three vectors.
We shall have much more to say about this in Chapter 4.
DEFINITION 2.4.13
An m × n matrix is called a reduced row-echelon matrix if it satisfies the following
conditions:
1. It is a row-echelon matrix.
2. Any column that contains a leading 1 has zeros everywhere else.
i i
i i
i i “main”
2007/2/16
page 148
i i
3 −1 22
Example 2.4.16 Determine the reduced row-echelon form of A = −1 5 2.
2 4 24
Solution: We apply the row-reduction algorithm, but put 0s above and below the
leading 1s. In so doing, it is immaterial whether we first reduce A to row-echelon form
and then arrange 0s above the leading 1s, or arrange 0s both above and below the leading
1s as we proceed from left to right.
3 −1 22 1 9 26 1 9 26 1 9 26 1 0 8
A = −1 5 2 ∼ −1 5 2 ∼ 0 14 28 ∼ 0 2 ∼ 0 1 2
1 2 3 4
1
2 4 24 2 4 24 0 −14 −28 0 −14 −28 0 0 0
1. A21 (2) 2. A12 (1), A13 (−2) 3. M2 (1/14) 4. A21 (−9), A23 (14)
i i
i i
i i “main”
2007/2/16
page 149
i i
i i
i i
i i “main”
2007/2/16
page 150
i i
Solution: We first use elementary row operations to reduce the augmented matrix of
the system to row-echelon form.
3 −2 2 9 1 −2 1 5 1 −2 1 5
1 −2 1 5 ∼ 1
3 −2 2 9 ∼ 0 4 −1 − 6
2
2 −1 −2 −1 2 −1 −2 −1 0 3 −4 −11
1 −2 1 5 1 −2 1 5 1 −2 1 5
∼ 0 1 3 5 ∼ 0 1 5 ∼ 0 1 3 5 .
3 4 5
3
0 3 −4 −11 0 0 −13 −26 0 0 12
i i
i i