Matrix Algebra: Theory, Computations and Applications in Statistics 3rd Edition James E. Gentle
Matrix Algebra: Theory, Computations and Applications in Statistics 3rd Edition James E. Gentle
com
https://textbookfull.com/product/matrix-algebra-theory-
computations-and-applications-in-statistics-3rd-edition-
james-e-gentle/
OR CLICK HERE
DOWLOAD EBOOK
https://textbookfull.com/product/linear-algebra-and-matrix-
computations-with-matlab-1st-edition-dingyu-xue/
textbookfull.com
https://textbookfull.com/product/matrix-differential-calculus-with-
applications-in-statistics-and-econometrics-3rd-edition-jan-r-magnus/
textbookfull.com
https://textbookfull.com/product/basic-matrix-theory-leonard-e-fuller/
textbookfull.com
https://textbookfull.com/product/biota-grow-2c-gather-2c-cook-loucas/
textbookfull.com
Computations and Combinatorics in Commutative Algebra EACA
School Valladolid 2013 1st Edition Anna M. Bigatti
https://textbookfull.com/product/computations-and-combinatorics-in-
commutative-algebra-eaca-school-valladolid-2013-1st-edition-anna-m-
bigatti/
textbookfull.com
https://textbookfull.com/product/inorganic-chemistry-3rd-edition-
james-e-house/
textbookfull.com
https://textbookfull.com/product/applied-linear-algebra-and-matrix-
analysis-second-edition-thomas-s-shores/
textbookfull.com
https://textbookfull.com/product/matrix-methods-applied-linear-
algebra-and-sabermetrics-4th-edition-richard-bronson/
textbookfull.com
Springer Texts in Statistics
James E. Gentle
Matrix
Algebra
Theory, Computations and Applications
in Statistics
Third Edition
Springer Texts in Statistics
Series Editors
G. Allen, Department of Statistics, Rice University, Houston, TX, USA
R. De Veaux, Department of Mathematics and Statistics, Williams College,
Williamstown, MA, USA
R. Nugent, Department of Statistics, Carnegie Mellon University, Pittsburgh,
PA, USA
Springer Texts in Statistics (STS) includes advanced textbooks from 3rd- to
4th-year undergraduate levels to 1st- to 2nd-year graduate levels. Exercise sets
should be included. The series editors are currently Genevera I. Allen, Richard
D. De Veaux, and Rebecca Nugent. Stephen Fienberg, George Casella, and
Ingram Olkin were editors of the series for many years.
James E. Gentle
Matrix Algebra
Theory, Computations and
Applications in Statistics
Third Edition
James E. Gentle
Fairfax, VA, USA
This Springer imprint is published by the registered company Springer Nature Switzerland
AG
The registered company address is: Gewerbestrasse 11, 6330 Cham, Switzerland
This book is different from the several other books on the general topic of
“matrix algebra and statistics” or “linear algebra and statistics” in its more
extensive coverage of the applications to statistical linear models (mostly in
Part II, especially Chap. 9) and the discussions of numerical computations
(mostly in Part III). This book also includes numerous examples of R in
matrix computations.
The lengths of the chapters vary; I emphasized unity of topics rather than
consistency of numbers of pages. Some topics receive attention at multiple
places in the book. The repetition is intentional because of different motiva-
tions or points of view. The book has extensive cross-references and a large
index to facilitate finding discussions in other parts of the book.
The book can serve different audiences. It can be used as a general reference
book for an applied mathematician, a self-learner, or someone who just needs
a refresher. The extensive index should be useful for such persons.
The book, especially Part I, the first seven chapters, can serve as a text for
a mathematically oriented course in linear algebra for advanced undergradu-
ates or graduates.
It could also serve as a textbook for a course in linear models. The primary
emphasis would be on Part II, with extensive cross-references to the relevant
theory in other chapters.
A course in statistical computing or numerical analysis could be based on
the nitty-gritty of computer arithmetic, the computational issues, and general
design of algorithms in Part III. The emphasis would be on numerical linear
algebra, and there would likely be extensive cross-references to the underlying
theory in other chapters.
There are separate sections in several chapters that discuss the R pro-
gramming system, starting from the basics and going through some of the
more advanced features of R. There are also many examples and numerous
exercises using R. This could serve as a quick course or a refresher course in
R. R is also used for illustration in various places in the text.
VIII Preface to Third Edition
As in the revisions for the second edition, in this third edition I have
corrected all known remaining typos and other errors; I have (it is hoped)
clarified certain passages; I have added some additional material; and I have
enhanced the Index. I have also added exercises in some of the chapters.
The overall organization of chapters has been preserved, but some sections
have been changed. The two chapters that have been changed most are the
original Chap. 4, which is now Chap. 7 and has more coverage of multivariate
probability distributions, and Chap. 9, with more material on linear models.
In this edition, I discuss the R software system much more frequently. It
is likely that most readers know R, but I give a brief introduction to R in
Chap. 1. The most commonly used objects in this book are of class matrix,
but I use data.frame for the linear models of Chap. 9. I do not use any of the
objects, functions, or operators in the set of Tidy packages, which are very
popular nowadays.
I require use of R in several exercises, and assume the reader/user has, or
will develop, at least a moderate level of competence in use of R. R, as any
software system, is learned through usage. Some of the exercises, especially in
Part III, also require competence in Fortran or C.
The notation and terms that I use are “standard”; that is, they are (I
think) among the most commonly used ones in discussions of matrices and
linear algebra, especially by statisticians. Before delving into the book, the
reader may want to take a quick look at Appendix A, and then refer to it
whenever it is necessary to refresh the recognition of a symbol or term.
In previous editions, I had included answers to selected exercises. In this
edition, I have moved the solutions online.
I thank the readers of the first and second editions who informed me
of errors, or who made suggestions for improvement. I particularly thank
Professor M. Yatov for extensive comments on notation and definitions, as
well as for noting several errata and gaps in logic. I also thank Shijia Jin
for many general comments and suggestions. Any remaining typos, omissions,
and so on are entirely my own responsibility.
I thank John Chambers, Robert Gentleman, and Ross Ihaka for their foun-
dational work on R. I thank the R Core Team and the many package devel-
opers and those who maintain the packages that make R more useful.
Again, I thank my wife, María, to whom this book is dedicated, for everything.
Preface to Third Edition IX
In this second edition, I have corrected all known typos and other errors; I
have (it is hoped) clarified certain passages; I have added some additional
material; and I have enhanced the Index.
I have added a few more comments about vectors and matrices with com-
plex elements, although, as before, unless stated otherwise, all vectors and
matrices in this book are assumed to have real elements. I have begun to
use “.det(A)” rather than “.|A|” to represent the determinant of A, except in
a few cases. I have also expressed some derivatives as the transposes of the
expressions I used formerly.
I have put more conscious emphasis on “user-friendliness" in this edition.
In a book, user-friendliness is primarily a function of references, both internal
and external, and of the index. As an old software designer, I’ve always thought
that user-friendliness is very important. To the extent that internal references
were present in the first edition, the positive feedback I received from users of
that edition about the friendliness of those internal references (“I liked the fact
that you said ‘equation (x.xx) on page yy’, instead of just ‘equation (x.xx)’.
”) encouraged me to try to make the internal references even more useful.
It’s only when you’re “eating your own dogfood”, that you become aware of
where details matter, and in using the first edition, I realized that the choice of
entries in the Index was suboptimal. I have spent significant time in organizing
it, and I hope that the user will find the Index to this edition to be very useful.
I think that it has been vastly improved over the index in the first edition.
The overall organization of chapters has been preserved, but some sec-
tions have been changed. The two chapters that have been changed most are
Chaps. 3 and 12. Chapter 3, on the basics of matrices, got about 30 pages
longer. It is by far the longest chapter in the book, but I just didn’t see any
reasonable way to break it up. In Chap. 12 of the first edition, “Software for
Numerical Linear Algebra”, I discussed four software systems or languages:
C/C++, Fortran, MATLAB, and R, and did not express any preference for
one over another. In this edition, although I occasionally mention various
languages and systems, I now limit most of my discussion to Fortran and R.
XII Preface to Second Edition
There are many reasons for my preference for these two systems. R is ori-
ented toward statistical applications. It is open source and freely distributed.
As for Fortran versus C/C++, Python, or other programming languages, I
agree with the statement by Hanson and Hopkins (2013, page ix), “. . . Fortran
is currently the best computer language for numerical software.” Many people,
however, still think of Fortran as the language their elders (or they themselves)
used in the 1970s. (On a personal note, Richard Hanson, who passed away
recently, was a member of my team that designed the IMSL C Libraries in
the mid1980s. Not only was C much cooler than Fortran at the time, but the
ANSI committee working on updating the Fortran language was so fractured
by competing interests, that approval of the revision was repeatedly delayed.
Many numerical analysts who were not concerned with coolness turned to C
because it provided dynamic storage allocation and it allowed flexible argu-
ment lists, and the Fortran constructs could not be agreed upon.)
Language preferences are personal, of course, and there is a strong “cool-
ness factor” in choice of a language. Python is currently one of the coolest
languages, but I personally don’t like the language for most of the stuff I do.
Although this book has separate parts on applications in statistics and
computational issues as before, statistical applications have informed the
choices I made throughout the book, and computational considerations have
given direction to most discussions.
I thank the readers of the first edition who informed me of errors. Two
people in particular made several meaningful comments and suggestions. Clark
Fitzgerald not only identified several typos, he made several broad suggestions
about organization and coverage that resulted in an improved text (I think).
Andreas Eckner found, in addition to typos, some gaps in my logic, and also
suggested better lines of reasoning at some places. (Although I don’t follow
an itemized “theorem-proof” format, I try to give reasons for any non-obvious
statements I make.) I thank Clark and Andreas especially for their comments.
Any remaining typos, omissions, gaps in logic, and so on are entirely my
responsibility.
Again, I thank my wife, María, to whom this book is dedicated, for everything.
I used TEX via LATEX 2ε to write the book. I did all of the typing, program-
ming, etc., myself, so all misteaks (mistakes!) are mine. I would appreciate
receiving suggestions for improvement and notification of errors. Notes on
this book, including errata, are available at
https://mason.gmu.edu/~jgentle/books/matbk/
are not covered in those chapters, the topics addressed are oriented more to-
ward computational algorithms. Chapter 4 covers methods for decomposing
matrices into useful factors.
Chapter 5 addresses applications of matrices in setting up and solving
linear systems, including overdetermined systems. We should not confuse sta-
tistical inference with fitting equations to data, although the latter task is a
component of the former activity. In Chap. 5, we address the more mechanical
aspects of the problem of fitting equations to data. Applications in statistical
data analysis are discussed in Chap. 9. In those applications, we need to make
statements (that is, assumptions) about relevant probability distributions.
Chapter 6 discusses methods for extracting eigenvalues and eigenvectors.
There are many important details of algorithms for eigenanalysis, but they are
beyond the scope of this book. As with other chapters in Part I, Chap. 6 makes
some reference to statistical applications, but it focuses on the mathematical
and mechanical aspects of the problem.
Although the first part is on “theory”, the presentation is informal; neither
definitions nor facts are highlighted by such words as “Definition”, “Theorem”,
“Lemma”, and so forth. It is assumed that the reader follows the natural
development. Most of the facts have simple proofs, and most proofs are given
naturally in the text. No “Proof” and “Q.E.D.” or “ ” appear to indicate
beginning and end; again, it is assumed that the reader is engaged in the
development. For example, on page 378:
If A is nonsingular and symmetric, then .A−1 is also symmetric because
−1 T
.(A ) = (AT )−1 = A−1 .
The first part of that sentence could have been stated as a theorem and
given a number, and the last part of the sentence could have been introduced
as the proof, with reference to some previous theorem that the inverse and
transposition operations can be interchanged. (This had already been shown
before page 378 — in an unnumbered theorem of course!)
None of the proofs are original (at least, I don’t think they are), but in most
cases I do not know the original source, or even the source where I first saw
them. I would guess that many go back to C. F. Gauss. Most, whether they
are as old as Gauss or not, have appeared somewhere in the work of C. R. Rao.
Some lengthier proofs are only given in outline, but references are given for
the details. Very useful sources of details of the proofs are Harville (1997),
especially for facts relating to applications in linear models, and Horn and
Johnson (1991) for more general topics, especially those relating to stochastic
matrices. The older books by Gantmacher (1959) provide extensive coverage
and often rather novel proofs. These two volumes have been brought back into
print by the American Mathematical Society.
I also sometimes make simple assumptions without stating them explicitly.
For example, I may write “for all i” when i is used as an index to a vector.
I hope it is clear that “for all i” means only “for i that correspond to indices
of the vector”. Also, my use of an expression generally implies existence. For
Preface to First Edition XV
point numbers. (Appendix A provides definitions for this and other notation
that I use.)
Chapter 10 also covers some of the fundamentals of algorithms, such as
iterations, recursion, and convergence. It also discusses software development.
Software issues are revisited in Chap. 12.
While Chap. 10 deals with general issues in numerical analysis, Chap. 11
addresses specific issues in numerical methods for computations in linear al-
gebra.
Chapter 12 provides a brief introduction to software available for com-
putations with linear systems. Some specific systems mentioned include the
IMSL™ libraries for Fortran and C, Octave or MATLAB® (or MATLAB®),
and R or S-PLUS® (or S-Plus®). All of these systems are easy to use, and
the best way to learn them is to begin using them for simple problems. I do
not use any particular software system in the book, but in some exercises, and
particularly in Part III, I do assume the ability to program in either Fortran or
C and the availability of either R or S-Plus, Octave or MATLAB, and Maple®
or Mathematica®. My own preferences for software systems are Fortran and
R, and occasionally these preferences manifest themselves in the text.
Appendix A collects the notation used in this book. It is generally “stan-
dard” notation, but one thing the reader must become accustomed to is the
lack of notational distinction between a vector and a scalar. All vectors are
“column” vectors, although I usually write them as horizontal lists of their
elements. (Whether vectors are “row” vectors or “column” vectors is generally
only relevant for how we write expressions involving vector/matrix multipli-
cation or partitions of matrices.)
I write algorithms in various ways, sometimes in a form that looks similar
to Fortran or C and sometimes as a list of numbered steps. I believe all of the
descriptions used are straightforward and unambiguous.
This book could serve as a basic reference either for courses in statistical
computing or for courses in linear models or multivariate analysis. When the
book is used as a reference, rather than looking for “Definition” or “Theo-
rem”, the user should look for items set off with bullets or look for numbered
equations, or else should use the Indexor Appendix A, beginning on page 653.
The prerequisites for this text are minimal. Obviously some background in
mathematics is necessary. Some background in statistics or data analysis and
some level of scientific computer literacy are also required. References to rather
advanced mathematical topics are made in a number of places in the text. To
some extent this is because many sections evolved from class notes that I
developed for various courses that I have taught. All of these courses were at
the graduate level in the computational and statistical sciences, but they have
had wide ranges in mathematical level. I have carefully reread the sections
that refer to groups, fields, measure theory, and so on, and am convinced that
if the reader does not know much about these topics, the material is still
understandable, but if the reader is familiar with these topics, the references
add to that reader’s appreciation of the material. In many places, I refer to
XVIII Preface to First Edition
Acknowledgments
I thank John Kimmel of Springer for his encouragement and advice on this
book and other books on which he has worked with me. I especially thank
Ken Berk for his extensive and insightful comments on a draft of this book.
I thank my student Li Li for reading through various drafts of some of the
chapters and pointing out typos or making helpful suggestions. I thank the
anonymous reviewers of this edition for their comments and suggestions. I also
thank the many readers of my previous book on numerical linear algebra who
informed me of errors and who otherwise provided comments or suggestions
for improving the exposition. Whatever strengths this book may have can be
attributed in large part to these people, named or otherwise. The weaknesses
can only be attributed to my own ignorance or hardheadedness.
I thank my wife, María, to whom this book is dedicated, for everything.
I used TEX via LATEX 2ε to write the book. I did all of the typing, program-
ming, etc., myself, so all misteaks are mine. I would appreciate receiving sug-
gestions for improvement and notification of errors.
1 Introduction. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.1 Vectors. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.2 Arrays. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.3 Matrices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.4 Representation of Data. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.5 What You Compute and What You Don’t. . . . . . . . . . . . . . . . . . . 7
1.6 R. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
1.6.1 R Data Types. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
1.6.2 Program Control Statements. . . . . . . . . . . . . . . . . . . . . . . . . 9
1.6.3 Packages. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
1.6.4 User Functions and Operators. . . . . . . . . . . . . . . . . . . . . . . 11
1.6.5 Generating Artificial Data. . . . . . . . . . . . . . . . . . . . . . . . . . . 13
1.6.6 Graphics Functions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
1.6.7 Special Data in R. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.6.8 Determining Properties of a Computer System in R. . . . . 14
1.6.9 Documentation: Finding R Functions and Packages. . . . . 15
1.6.10 Documentation: R Functions, Packages, and Other
Objects. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
1.6.11 The Design of the R Programming Language. . . . . . . . . . 16
1.6.12 Why I Use R in This Book. . . . . . . . . . . . . . . . . . . . . . . . . . 16
XXII Contents
Our website is not just a platform for buying books, but a bridge
connecting readers to the timeless values of culture and wisdom. With
an elegant, user-friendly interface and an intelligent search system,
we are committed to providing a quick and convenient shopping
experience. Additionally, our special promotions and home delivery
services ensure that you save time and fully enjoy the joy of reading.
textbookfull.com