SUMMARY OF VECTOR/MATRIX OPERATIONS
A.1 DEFINITION
A.1.1 Vectors
A vector is a linear collection of elements. We use a lower case bold letter to
denote vectors, which by default are assumed to be column vectors. For
example,
$$\mathbf{a} = \begin{bmatrix} a_1 \\ a_2 \\ a_3 \end{bmatrix} = [\,a_1 \;\; a_2 \;\; a_3\,]^T.$$
A vector is called unit or normalized when the sum of the squared elements is equal to 1: $\sum_{i=1}^{n} a_i^2 = 1$ for an n-element vector a.
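As a quick numerical illustration (not part of the original text), the following NumPy sketch normalizes an arbitrary vector so that the sum of its squared elements equals 1; the data and variable names are hypothetical.

```python
import numpy as np

a = np.array([3.0, 4.0, 12.0])       # an arbitrary 3-element vector
a_unit = a / np.linalg.norm(a)       # divide by the square root of the sum of squares

print(np.sum(a_unit**2))             # 1.0 (within round-off): a_unit is a unit (normalized) vector
```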
A.1.2 Matrices
A matrix is a two-dimensional collection of elements. We use bold upper case letters
to denote matrices. A square matrix is symmetric if Aij = Aji for all i and j, that is, if A = AT. For example,
$$\mathbf{A} = \begin{bmatrix} 2 & 5 & 9 \\ 5 & 1 & 3 \\ 9 & 3 & 4 \end{bmatrix}$$

is symmetric.
A.1.2.3 Toeplitz Matrix A square matrix is Toeplitz if all elements along the
upper left to lower right diagonals are equal: Ai,j = Ai−1,j−1. For example,
$$\mathbf{A} = \begin{bmatrix} 1 & 2 & -1 & 4 \\ 3 & 1 & 2 & -1 \\ 5 & 3 & 1 & 2 \\ -2 & 5 & 3 & 1 \end{bmatrix}$$
is Toeplitz.
A.1.2.4 Identity Matrix A square matrix that is all zero except for ones along
the main diagonal is the identity matrix, denoted as I. For example,
$$\begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{bmatrix}$$
is a 4 × 4 identity matrix. A subscript is often added to indicate the dimension; for example, In denotes an n × n identity matrix.
A.1.2.5 Triangular Matrix All elements of a lower triangular matrix above the
main diagonal are zero. All elements of an upper triangular matrix below the main
diagonal are zero. For example,
$$\begin{bmatrix} 1 & 7 & 4 & -4 \\ 0 & 2 & 6 & 1 \\ 0 & 0 & 3 & 9 \\ 0 & 0 & 0 & 5 \end{bmatrix}$$
is upper triangular.
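The special structures above are easy to test numerically. The sketch below (illustrative only; the matrices repeat the examples given above and the checks are not from the book) verifies symmetry, the Toeplitz property, and upper triangularity.

```python
import numpy as np

A_sym = np.array([[2., 5., 9.],
                  [5., 1., 3.],
                  [9., 3., 4.]])
A_toep = np.array([[ 1.,  2., -1.,  4.],
                   [ 3.,  1.,  2., -1.],
                   [ 5.,  3.,  1.,  2.],
                   [-2.,  5.,  3.,  1.]])
U = np.array([[1., 7., 4., -4.],
              [0., 2., 6.,  1.],
              [0., 0., 3.,  9.],
              [0., 0., 0.,  5.]])

print(np.allclose(A_sym, A_sym.T))          # symmetric: A equals its transpose
# Toeplitz: every diagonal is constant, i.e., A[i, j] == A[i-1, j-1]
print(all(A_toep[i, j] == A_toep[i - 1, j - 1]
          for i in range(1, 4) for j in range(1, 4)))
print(np.allclose(U, np.triu(U)))           # upper triangular: nothing below the main diagonal
```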
A.2.1 Transpose
The transpose of a matrix, denoted with superscript T, is formed by interchanging
row and column elements: B = AT is the transpose of A where Bji = Aij. For
example,
$$\mathbf{B} = \begin{bmatrix} A_{11} & A_{12} & A_{13} \\ A_{21} & A_{22} & A_{23} \end{bmatrix}^T = \begin{bmatrix} A_{11} & A_{21} \\ A_{12} & A_{22} \\ A_{13} & A_{23} \end{bmatrix}.$$
A.2.2 Addition
Two or more vectors or matrices of the same dimensions may be added or subtracted by adding/subtracting individual elements. For example, if
$$\mathbf{A} = \begin{bmatrix} 1 & 2 & 3 \\ 4 & 5 & 6 \end{bmatrix}, \qquad \mathbf{B} = \begin{bmatrix} 3 & 7 & 5 \\ 2 & 1 & -2 \end{bmatrix}$$

then

$$\mathbf{C} = \mathbf{A} + \mathbf{B} = \begin{bmatrix} 4 & 9 & 8 \\ 6 & 6 & 4 \end{bmatrix}.$$
Matrix addition is commutative; that is, C = A + B = B + A.
A.2.5 Multiplication
Two matrices, where the column dimension of the first (m) is equal to the row dimension of the second, may be multiplied by forming the dot products of the rows of the first matrix and the columns of the second; that is, $C_{ij} = \sum_{k=1}^{m} A_{ik} B_{kj}$. For example, if
$$\mathbf{A} = \begin{bmatrix} 1 & 2 & 3 \\ 4 & 5 & 6 \end{bmatrix}, \qquad \mathbf{B} = \begin{bmatrix} 3 & 2 \\ 0 & 1 \\ 5 & -2 \end{bmatrix}$$

then

$$\mathbf{C} = \mathbf{A}\mathbf{B} = \begin{bmatrix} 18 & -2 \\ 42 & 1 \end{bmatrix}.$$
Matrix multiplication is not commutative; that is, C = AB ≠ BA. A matrix multiplying or multiplied by the identity I is unchanged; that is, AI = IA = A.
The transpose of the product of two matrices is the reversed product of the
transposes: (AB)T = BTAT.
Vector-matrix multiplication is defined in the same way as matrix-matrix multiplication. If matrix A is m × n and vector x has m elements, y = xTA, or

$$y_j = \sum_{i=1}^{m} x_i A_{ij} \quad \text{for } j = 1, 2, \ldots, n.$$
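A short NumPy sketch (illustrative, not from the text; the matrices repeat the example above) reproduces the product C = AB, checks the transpose-of-a-product rule (AB)T = BTAT, and forms the vector-matrix product y = xTA for an arbitrary x.

```python
import numpy as np

A = np.array([[1., 2., 3.],
              [4., 5., 6.]])
B = np.array([[3.,  2.],
              [0.,  1.],
              [5., -2.]])

C = A @ B
print(C)                                   # [[18. -2.] [42.  1.]]
print(np.allclose((A @ B).T, B.T @ A.T))   # transpose of a product: (AB)^T = B^T A^T

x = np.array([1., -1.])                    # x has m = 2 elements, A is 2 x 3
y = x @ A                                  # y = x^T A, an n = 3 element row vector
print(y)
```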
The inverse of a square matrix A, denoted A−1, satisfies AA−1 = A−1A = I and exists only when A is nonsingular. The inverse of the product of two matrices is the reversed product of the inverses:
(AB)−1 = B−1A−1. Nonsquare matrices generally do not have an inverse, but left or
right inverses can be defined; for example, for m × n matrix A, ((ATA)−1AT)A = In,
so (ATA)−1AT is a left inverse provided that (ATA)−1 exists, and A(AT(AAT)−1) = Im
so AT(AAT)−1 is a right inverse provided that (AAT)−1 exists.
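For a tall matrix of full column rank, the left inverse (ATA)−1AT can be formed directly. A minimal sketch (illustrative only; the data are arbitrary):

```python
import numpy as np

A = np.array([[1., 0.],
              [1., 1.],
              [1., 2.]])                     # 3 x 2, full column rank

left_inv = np.linalg.inv(A.T @ A) @ A.T      # (A^T A)^{-1} A^T, a 2 x 3 left inverse
print(np.allclose(left_inv @ A, np.eye(2)))  # left_inv @ A = I_2
```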
A square matrix is called orthogonal when ATA = AAT = I. Thus the transpose
is also the inverse: A−1 = AT. If rectangular matrix A is m × n, it is called column
orthogonal when ATA = I since the columns are orthonormal. This is only possible
when m ≥ n. If AAT = I for m ≤ n, matrix A is called row orthogonal because the
rows are orthonormal.
A symmetric positive definite matrix is a square symmetric matrix for which xTAx > 0
for all nonzero vectors x; positive definiteness guarantees that the matrix is invertible. A symmetric positive semi-definite or non-negative definite
matrix is one for which xTAx ≥ 0 for all x.
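Positive definiteness is usually tested numerically through the eigenvalues or a Cholesky factorization rather than by checking xTAx directly. A small illustrative check (not from the text; the test matrix is arbitrary):

```python
import numpy as np

A = np.array([[4., 1.],
              [1., 3.]])                      # symmetric test matrix

print(np.all(np.linalg.eigvalsh(A) > 0))      # all eigenvalues positive => positive definite

try:
    np.linalg.cholesky(A)                     # succeeds only for (numerically) positive definite A
    print("Cholesky factorization exists")
except np.linalg.LinAlgError:
    print("not positive definite")
```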
Inverses of partitioned matrices are frequently needed. Consider a square matrix partitioned as

$$\begin{bmatrix} \mathbf{A} & \mathbf{B} \\ \mathbf{C} & \mathbf{D} \end{bmatrix}$$

where the four bold letters indicate smaller matrices. We express the inverse as

$$\begin{bmatrix} \mathbf{E} & \mathbf{F} \\ \mathbf{G} & \mathbf{H} \end{bmatrix}$$

and write

$$\begin{bmatrix} \mathbf{A} & \mathbf{B} \\ \mathbf{C} & \mathbf{D} \end{bmatrix} \begin{bmatrix} \mathbf{E} & \mathbf{F} \\ \mathbf{G} & \mathbf{H} \end{bmatrix} = \begin{bmatrix} \mathbf{I} & \mathbf{0} \\ \mathbf{0} & \mathbf{I} \end{bmatrix}$$
or
AE + BG = I (A3-1)
AF + BH = 0 (A3-2)
CE + DG = 0 (A3-3)
CF + DH = I. (A3-4)
Using equations (A3-2), (A3-4), (A3-1), and (A3-3) in that order, we obtain

$$\begin{aligned}
\mathbf{F} &= -\mathbf{A}^{-1}\mathbf{B}\mathbf{H} \\
\mathbf{H} &= (\mathbf{D} - \mathbf{C}\mathbf{A}^{-1}\mathbf{B})^{-1} \\
\mathbf{E} &= \mathbf{A}^{-1}(\mathbf{I} - \mathbf{B}\mathbf{G}) \quad \text{(intermediate)} \\
\mathbf{G} &= -(\mathbf{D} - \mathbf{C}\mathbf{A}^{-1}\mathbf{B})^{-1}\mathbf{C}\mathbf{A}^{-1} = -\mathbf{H}\mathbf{C}\mathbf{A}^{-1} \\
\mathbf{E} &= \mathbf{A}^{-1} + \mathbf{A}^{-1}\mathbf{B}\mathbf{H}\mathbf{C}\mathbf{A}^{-1} \qquad \text{(A3-5)}
\end{aligned}$$

Alternately, using equations (A3-3), (A3-1), (A3-4), and (A3-2) in that order,

$$\begin{aligned}
\mathbf{G} &= -\mathbf{D}^{-1}\mathbf{C}\mathbf{E} \\
\mathbf{E} &= (\mathbf{A} - \mathbf{B}\mathbf{D}^{-1}\mathbf{C})^{-1} \\
\mathbf{H} &= \mathbf{D}^{-1}(\mathbf{I} - \mathbf{C}\mathbf{F}) \\
\mathbf{F} &= -(\mathbf{A} - \mathbf{B}\mathbf{D}^{-1}\mathbf{C})^{-1}\mathbf{B}\mathbf{D}^{-1} = -\mathbf{E}\mathbf{B}\mathbf{D}^{-1} \\
\mathbf{H} &= \mathbf{D}^{-1} + \mathbf{D}^{-1}\mathbf{C}\mathbf{E}\mathbf{B}\mathbf{D}^{-1} \qquad \text{(A3-6)}
\end{aligned}$$
or

$$\begin{bmatrix} \mathbf{A} & \mathbf{B} \\ \mathbf{C} & \mathbf{D} \end{bmatrix}^{-1} = \begin{bmatrix} (\mathbf{A}-\mathbf{B}\mathbf{D}^{-1}\mathbf{C})^{-1} & -(\mathbf{A}-\mathbf{B}\mathbf{D}^{-1}\mathbf{C})^{-1}\mathbf{B}\mathbf{D}^{-1} \\ -\mathbf{D}^{-1}\mathbf{C}(\mathbf{A}-\mathbf{B}\mathbf{D}^{-1}\mathbf{C})^{-1} & \mathbf{D}^{-1}+\mathbf{D}^{-1}\mathbf{C}(\mathbf{A}-\mathbf{B}\mathbf{D}^{-1}\mathbf{C})^{-1}\mathbf{B}\mathbf{D}^{-1} \end{bmatrix}. \qquad \text{(A3-8)}$$
When the partitioned matrix is symmetric so that C = BT,

$$\begin{bmatrix} \mathbf{A} & \mathbf{B} \\ \mathbf{B}^T & \mathbf{D} \end{bmatrix}^{-1} = \begin{bmatrix} (\mathbf{A}-\mathbf{B}\mathbf{D}^{-1}\mathbf{B}^T)^{-1} & -(\mathbf{A}-\mathbf{B}\mathbf{D}^{-1}\mathbf{B}^T)^{-1}\mathbf{B}\mathbf{D}^{-1} \\ -\mathbf{D}^{-1}\mathbf{B}^T(\mathbf{A}-\mathbf{B}\mathbf{D}^{-1}\mathbf{B}^T)^{-1} & \mathbf{D}^{-1}+\mathbf{D}^{-1}\mathbf{B}^T(\mathbf{A}-\mathbf{B}\mathbf{D}^{-1}\mathbf{B}^T)^{-1}\mathbf{B}\mathbf{D}^{-1} \end{bmatrix} \qquad \text{(A3-10)}$$

where A and D are also symmetric.
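The partitioned inverse in equation (A3-8) can be verified numerically. The sketch below (illustrative only; the random test matrix and block sizes are arbitrary) assembles the four blocks of the inverse and compares them against a direct inverse.

```python
import numpy as np

rng = np.random.default_rng(0)
n1, n2 = 3, 2
M = rng.standard_normal((n1 + n2, n1 + n2)) + 5 * np.eye(n1 + n2)   # well-conditioned test matrix
A, B = M[:n1, :n1], M[:n1, n1:]
C, D = M[n1:, :n1], M[n1:, n1:]

Dinv = np.linalg.inv(D)
E = np.linalg.inv(A - B @ Dinv @ C)          # (A - B D^-1 C)^-1
F = -E @ B @ Dinv                            # -(A - B D^-1 C)^-1 B D^-1
G = -Dinv @ C @ E                            # -D^-1 C (A - B D^-1 C)^-1
H = Dinv + Dinv @ C @ E @ B @ Dinv           # D^-1 + D^-1 C (A - B D^-1 C)^-1 B D^-1

M_inv_blocks = np.block([[E, F], [G, H]])
print(np.allclose(M_inv_blocks, np.linalg.inv(M)))   # True: matches equation (A3-8)
```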
A.3.4 Determinant
The determinant of a square matrix is a measure of scale change when the matrix
is viewed as a linear transformation. When the determinant of a matrix is zero, the
matrix is indeterminate or singular, and cannot be inverted. The rank of matrix A
is the dimension of the largest square array in A that has a nonzero determinant.
The determinant of matrix A is denoted as det(A) or |A|. Laplace’s method for
computing determinants uses cofactors, where a cofactor of a given matrix element
ij is Cij = (−1)^(i+j) |Mij| and Mij, called the minor of ij, is the matrix formed by deleting
the i-th row and j-th column of matrix A. For the 2 × 2 matrix

$$\begin{bmatrix} A_{11} & A_{12} \\ A_{21} & A_{22} \end{bmatrix},$$
the cofactors are
C11 = A22, C12 = − A21, C21 = − A12, C22 = A11.
The determinant is the sum of the products of matrix elements and cofactors for
any row or column. Thus
$$|\mathbf{A}| = A_{11}C_{11} + A_{12}C_{12} = A_{11}C_{11} + A_{21}C_{21} = A_{21}C_{21} + A_{22}C_{22} = A_{12}C_{12} + A_{22}C_{22} = A_{11}A_{22} - A_{12}A_{21}.$$
For a 3 × 3 matrix A, cofactor expansion along the first row gives

$$|\mathbf{A}| = A_{11}(A_{22}A_{33} - A_{23}A_{32}) - A_{12}(A_{21}A_{33} - A_{23}A_{31}) + A_{13}(A_{21}A_{32} - A_{22}A_{31}).$$
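The Laplace (cofactor) expansion just described can be coded directly, although it is far too slow for large matrices. A small recursive sketch (illustrative only, not from the book), compared against numpy.linalg.det:

```python
import numpy as np

def det_cofactor(A):
    """Determinant by cofactor expansion along the first row (O(n!) cost; illustration only)."""
    n = A.shape[0]
    if n == 1:
        return A[0, 0]
    total = 0.0
    for j in range(n):
        minor = np.delete(np.delete(A, 0, axis=0), j, axis=1)   # delete row 0 and column j
        total += (-1) ** j * A[0, j] * det_cofactor(minor)       # cofactor C_0j = (-1)^(0+j) |M_0j|
    return total

A = np.array([[2., 5., 9.],
              [5., 1., 3.],
              [9., 3., 4.]])
print(det_cofactor(A), np.linalg.det(A))    # both give the same value (within round-off)
```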
A.3.5 Trace
The trace of a square matrix, denoted tr(A), is the sum of its diagonal elements. For an n-element vector a,

$$\sum_{i=1}^{n} a_i^2 = \mathbf{a}^T\mathbf{a} = \mathrm{tr}[\mathbf{a}\mathbf{a}^T]. \qquad \text{(A3-20)}$$

This rearrangement of vector order often allows solutions for minimum variance problems; it has been used repeatedly in previous chapters.
Since the trace only involves diagonal elements, tr(AT) = tr(A).
Unlike the determinant, the trace of a matrix product is not the product of the traces.
If A is an n × m matrix and B is an m × n matrix,

$$\mathrm{tr}(\mathbf{AB}) = \sum_{i=1}^{n}\sum_{j=1}^{m} A_{ij}B_{ji} = \sum_{j=1}^{m}\sum_{i=1}^{n} B_{ji}A_{ij} = \mathrm{tr}(\mathbf{BA}). \qquad \text{(A3-23)}$$
However, this commutative property only works for pairs of matrices, or interchange of "halves" of the matrix product:

$$\mathrm{tr}(\mathbf{ABC}) = \mathrm{tr}(\mathbf{C}(\mathbf{AB})) = \mathrm{tr}(\mathbf{BCA}) \ne \mathrm{tr}(\mathbf{ACB}) \ne \mathrm{tr}(\mathbf{CBA}). \qquad \text{(A3-24)}$$

When the three individual matrices are square and symmetric, any permutation works: tr(ABC) = tr(ACB) = tr(BAC) = tr(BCA) = tr(CAB) = tr(CBA). This permutation property does not extend to four or more symmetric matrices.
Permutation can also be used to express the weighted quadratic form aTWa as the trace tr(WaaT).
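The trace identities above are easy to confirm numerically. A minimal sketch (illustrative only; random test matrices):

```python
import numpy as np

rng = np.random.default_rng(1)
A = rng.standard_normal((3, 4))
B = rng.standard_normal((4, 3))
C = rng.standard_normal((3, 3))

print(np.isclose(np.trace(A @ B), np.trace(B @ A)))            # tr(AB) = tr(BA), equation (A3-23)
print(np.isclose(np.trace(A @ B @ C), np.trace(C @ A @ B)))    # cyclic permutation, equation (A3-24)

a = rng.standard_normal(4)
W = rng.standard_normal((4, 4))
print(np.isclose(a @ W @ a, np.trace(W @ np.outer(a, a))))     # a^T W a = tr(W a a^T)
```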
A.3.6 Derivatives
When matrix A is a function of a scalar variable t and B is constant, the chain rule gives

$$\frac{\partial(\mathbf{BA})}{\partial t} = \frac{\partial(\mathbf{BA})}{\partial \mathbf{A}}\,\frac{\partial \mathbf{A}}{\partial t}. \qquad \text{(A3-28)}$$

The differential of the matrix inverse, d(A−1) = dB = −A−1(dA)A−1, is two-dimensional but has perturbations with respect to all elements of dA; that is, each i, j element (dB)ij is a matrix of derivatives for all perturbations dA. Thus

$$\frac{\partial(\mathbf{A}^{-1})}{\partial t} = -\mathbf{A}^{-1}\left(\frac{\partial \mathbf{A}}{\partial t}\right)\mathbf{A}^{-1}. \qquad \text{(A3-29)}$$
The derivative of the trace of matrix products is computed by examining the derivative with respect to an individual element:

$$\frac{\partial\,\mathrm{tr}(\mathbf{AB})}{\partial B_{ij}} = \frac{\partial}{\partial B_{ij}}\left(\sum_{k=1}^{m}\sum_{l=1}^{m} A_{kl}B_{lk}\right) = A_{ji}.$$

Thus

$$\frac{\partial\,\mathrm{tr}(\mathbf{AB})}{\partial \mathbf{B}} = \mathbf{A}^T. \qquad \text{(A3-30)}$$

For products of three matrices,

$$\frac{\partial\,\mathrm{tr}(\mathbf{ABC})}{\partial \mathbf{B}} = \frac{\partial\,\mathrm{tr}(\mathbf{CAB})}{\partial \mathbf{B}} = (\mathbf{CA})^T. \qquad \text{(A3-31)}$$
The derivative of the determinant is computed by rearranging equation (A3-18) as

$$|\mathbf{A}|\,\mathbf{I} = \mathbf{A}\,[\mathbf{C}]^T. \qquad \text{(A3-32)}$$

Thus for any diagonal element i = 1, 2, …, n,

$$|\mathbf{A}| = \sum_{k=1}^{n} A_{ik}C_{ik}.$$

Since the cofactor Cik does not depend on Aik, the derivative with respect to the full matrix is

$$\frac{\partial|\mathbf{A}|}{\partial \mathbf{A}} = [\mathbf{C}] = |\mathbf{A}|\,\mathbf{A}^{-T}. \qquad \text{(A3-33)}$$
By a similar development,

$$\frac{\partial|\mathbf{A}|}{\partial t} = |\mathbf{A}|\;\mathrm{tr}\!\left(\mathbf{A}^{-1}\frac{\partial \mathbf{A}}{\partial t}\right). \qquad \text{(A3-34)}$$
A.3.7 Norms
Norms of vectors or matrices are often useful when analyzing the growth of numerical
errors. The Hölder p-norms for vectors are defined as

$$\|\mathbf{x}\|_p = \left(\sum_{i=1}^{n} |x_i|^p\right)^{1/p}. \qquad \text{(A3-35)}$$
Induced matrix norms measure the ability of matrix A to modify the magnitude of a vector; that is,

$$\|\mathbf{A}\| = \max_{\|\mathbf{x}\|=1}\left(\frac{\|\mathbf{Ax}\|}{\|\mathbf{x}\|}\right).$$

The l1-norm is $\|\mathbf{A}\|_1 = \max_j \|\mathbf{a}_{:j}\|_1$, where a:j is the j-th column of A, and the l∞-norm is $\|\mathbf{A}\|_\infty = \max_i \|\mathbf{a}_{i:}\|_1$, where ai: is the i-th row of A. It is more difficult to compute an l2-norm based on this definition than a Frobenius norm: ‖A‖2 is equal to the square root of the maximum eigenvalue of ATA, or equivalently the largest singular value of A. These terms are defined in Section A.4.
Norms of matrix products obey inequality conditions:
$$\|\mathbf{Ax}\| \le \|\mathbf{A}\|\,\|\mathbf{x}\| \quad \text{or} \quad \|\mathbf{AB}\| \le \|\mathbf{A}\|\,\|\mathbf{B}\|. \qquad \text{(A3-38)}$$
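NumPy provides these norms directly. A short sketch (illustrative, not from the text) relating the induced l1, l2, and l∞ norms to column sums, singular values, and row sums, and checking the inequality conditions:

```python
import numpy as np

rng = np.random.default_rng(3)
A = rng.standard_normal((4, 3))
x = rng.standard_normal(3)

print(np.isclose(np.linalg.norm(A, 1),      np.abs(A).sum(axis=0).max()))             # max column sum
print(np.isclose(np.linalg.norm(A, np.inf), np.abs(A).sum(axis=1).max()))             # max row sum
print(np.isclose(np.linalg.norm(A, 2),      np.linalg.svd(A, compute_uv=False)[0]))   # largest singular value

# inequality conditions of equation (A3-38)
print(np.linalg.norm(A @ x) <= np.linalg.norm(A, 2) * np.linalg.norm(x))
```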
A.4.1 LU Decomposition
LU decomposition has been mentioned previously. Crout reduction is used to factor
square matrix A = LU where L is unit lower triangular and U is upper triangular.
This is often used for matrix inversion or when repeatedly solving equations of the form Ax = y for x. The equation Lz = y is first solved for z using forward substitution, and then Ux = z is solved for x using backward substitution.
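In SciPy the factor-once, solve-many pattern described above can be written roughly as follows (an illustrative sketch, not from the book; scipy.linalg.lu_factor performs the LU factorization with partial pivoting):

```python
import numpy as np
from scipy.linalg import lu_factor, lu_solve

A = np.array([[4., 3., 0.],
              [3., 4., -1.],
              [0., -1., 4.]])
lu, piv = lu_factor(A)            # factor A once (LU with partial pivoting)

for y in (np.array([1., 2., 3.]), np.array([0., 1., 0.])):
    x = lu_solve((lu, piv), y)    # solve Ax = y by forward then backward substitution
    print(np.allclose(A @ x, y))  # True for each right-hand side
```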
For a square n × n matrix A with eigenvalues λi and corresponding eigenvectors xi (Axi = λi xi), the eigen decomposition can be written

$$\mathbf{AM} = \mathbf{M}\boldsymbol{\Lambda} \qquad \text{(A4-1)}$$

or

$$\mathbf{A} = \mathbf{M}\boldsymbol{\Lambda}\mathbf{M}^{-1} \qquad \text{(A4-2)}$$

where

$$\mathbf{M} = [\,\mathbf{x}_1 \;\; \mathbf{x}_2 \;\; \cdots \;\; \mathbf{x}_n\,], \qquad \boldsymbol{\Lambda} = \begin{bmatrix} \lambda_1 & 0 & \cdots & 0 \\ 0 & \lambda_2 & \cdots & 0 \\ \vdots & & \ddots & \vdots \\ 0 & 0 & \cdots & \lambda_n \end{bmatrix}.$$
Matrix M is called the modal matrix and the λi are the eigenvalues. The eigenvalues are
the roots of the characteristic polynomial p(s) = |sI − A|, and they define the spectral
response of the linear system ẋ(t) = Ax(t), where x(t) is the system state vector (not an
eigenvector). Eigen decomposition is a similarity transformation, and thus

$$|\mathbf{A}| = \lambda_1 \lambda_2 \cdots \lambda_n. \qquad \text{(A4-3)}$$

Also

$$\mathrm{tr}(\mathbf{A}) = \lambda_1 + \lambda_2 + \cdots + \lambda_n. \qquad \text{(A4-4)}$$

When real A is symmetric and nonsingular, the λi are all real and the eigenvectors
can be chosen mutually orthogonal. Thus M−1 = MT and A = MΛMT.
Eigenvectors and eigenvalues are computed in LAPACK using either generalized QR decomposition or a divide-and-conquer approach.
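A quick numerical illustration (not from the text; the symmetric test matrix is arbitrary) of equations (A4-2) through (A4-4) using numpy.linalg.eig:

```python
import numpy as np

A = np.array([[2., 1., 0.],
              [1., 3., 1.],
              [0., 1., 4.]])                      # symmetric, so the eigenvalues are real

lam, M = np.linalg.eig(A)                         # eigenvalues and modal matrix M of eigenvectors
L = np.diag(lam)

print(np.allclose(A, M @ L @ np.linalg.inv(M)))   # A = M Lambda M^-1          (A4-2)
print(np.isclose(np.linalg.det(A), np.prod(lam))) # |A| = product of eigenvalues (A4-3)
print(np.isclose(np.trace(A), np.sum(lam)))       # tr(A) = sum of eigenvalues   (A4-4)
print(np.allclose(M.T @ M, np.eye(3)))            # symmetric A: eigenvectors orthonormal, M^-1 = M^T
```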
A related factorization, the singular value decomposition (SVD), factors any m × n matrix as

$$\mathbf{A} = \mathbf{U}\mathbf{S}\mathbf{V}^T \qquad \text{(A4-5)}$$

where U and V are orthogonal and S is diagonal with nonnegative singular values.
A.4.6 Pseudo-Inverse
When A is rectangular or singular, A does not have an inverse. However, Penrose
(1955) defined a pseudo-inverse A# uniquely determined by four properties:
$$\begin{aligned}
\mathbf{A}\mathbf{A}^{\#}\mathbf{A} &= \mathbf{A} \\
\mathbf{A}^{\#}\mathbf{A}\mathbf{A}^{\#} &= \mathbf{A}^{\#} \\
(\mathbf{A}\mathbf{A}^{\#})^T &= \mathbf{A}\mathbf{A}^{\#} \\
(\mathbf{A}^{\#}\mathbf{A})^T &= \mathbf{A}^{\#}\mathbf{A} \qquad \text{(A4-6)}
\end{aligned}$$
For the least-squares problem y = Hx, decompose H = USVT and let S1# be the pseudo-inverse of the nonzero square portion of S: each nonzero singular value is replaced by its reciprocal, and singular values that are exactly zero are replaced with zero in the same location when forming S1#. Thus

$$\hat{\mathbf{x}} = \mathbf{V}\,[\,\mathbf{S}_1^{\#} \;\; \mathbf{0}\,]\,\mathbf{U}^T\mathbf{y}.$$

This shows that a pseudo-inverse can be computed even when HTH is singular.
The pseudo-inverse provides the minimal norm solution for a rank-deficient
(rank < min(m, n)) H matrix.
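numpy.linalg.pinv computes the Moore-Penrose pseudo-inverse via the SVD. The sketch below (illustrative only; the rank-deficient test matrix is arbitrary) builds the pseudo-inverse explicitly from U, S, and V and checks the four Penrose properties of equation (A4-6).

```python
import numpy as np

H = np.array([[1., 2., 3.],
              [2., 4., 6.],     # row 2 = 2 x row 1, so H is rank deficient (rank 2)
              [1., 0., 1.],
              [0., 1., 1.]])

U, s, Vt = np.linalg.svd(H, full_matrices=False)
s_inv = np.where(s > 1e-12 * s.max(), 1.0 / s, 0.0)      # reciprocate nonzero singular values, keep zeros
H_pinv = Vt.T @ np.diag(s_inv) @ U.T                     # pseudo-inverse assembled from the SVD

print(np.allclose(H_pinv, np.linalg.pinv(H, rcond=1e-12)))   # matches numpy's pinv (same cutoff)
print(np.allclose(H @ H_pinv @ H, H))                    # A A# A = A
print(np.allclose(H_pinv @ H @ H_pinv, H_pinv))          # A# A A# = A#
print(np.allclose((H @ H_pinv).T, H @ H_pinv))           # (A A#)^T = A A#
print(np.allclose((H_pinv @ H).T, H_pinv @ H))           # (A# A)^T = A# A
```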
The condition number of a matrix measures the sensitivity of the solution of a linear system to errors in the data. It is defined as

$$\kappa_p(\mathbf{A}) = \|\mathbf{A}\|_p\,\|\mathbf{A}^{-1}\|_p \qquad \text{(A4-7)}$$

when using an lp induced matrix norm for A. Because it is generally not convenient to compute condition numbers by inverting a matrix, they are most often
computed from the singular values of matrix A. Decomposing A = USVT, where U
and V are orthogonal and the Si are the singular values in S, then

$$\kappa_2(\mathbf{A}) = \frac{\max_i(S_i)}{\min_i(S_i)}. \qquad \text{(A4-8)}$$
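A brief numerical illustration (not from the text; the nearly singular test matrix is arbitrary) of equations (A4-7) and (A4-8) using NumPy:

```python
import numpy as np

A = np.array([[1.0, 1.0],
              [1.0, 1.0001]])                      # nearly singular, so poorly conditioned

s = np.linalg.svd(A, compute_uv=False)             # singular values, largest first
print(s.max() / s.min())                           # kappa_2(A) from equation (A4-8)
print(np.linalg.cond(A, 2))                        # same value from numpy's cond
print(np.linalg.norm(A, 2) * np.linalg.norm(np.linalg.inv(A), 2))   # definition (A4-7)
```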