0% found this document useful (0 votes)

6 views

Chapter-5-slides

Uploaded by

levinali1225

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views

Chapter-5-slides

Uploaded by

levinali1225

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 73

STATS 20: Chapter 5 - Matrices

Thomas Maierhofer

Fall 2024

1 / 73
Learning Objectives

▶ Create matrices with matrix(), rbind(), and cbind().

▶ Understand how R stores and performs operations on matrices.
▶ Add row and column names to two-dimensional objects.
▶ Extract and assign values to two-dimensional objects.
▶ Differentiate between * and %*% for matrices.
▶ Compute the inverse of a square matrix.

2 / 73
Basic Definitions and Functions

3 / 73
Basic Definitions and Functions

Why Matrices in Statistics?

▶ Matrices play an important role in statistics, especially in multivariate analysis
(e.g., multiple regression, covariance matrices).
▶ We will not cover linear algebra in depth, we will introduce the matrix object in
R how to use it.

Why Matrices in R?
▶ Matrices offer a natural introduction to data frames, which are the most
commonly used objects in R for storing rectangular data.
▶ Much of the syntax and functions for matrices also apply to data frames, making
them essential for understanding data analysis in R.

4 / 73
The matrix() Function
▶ A matrix is a two-dimensional (rectangular) array of values.
▶ In R, every value in a matrix must be of the same type (integer, double,
character, or logical).
Creating a Matrix
▶ The matrix() function takes a vector of values (data) and arranges them into a
matrix.
▶ You specify the number of rows (nrow) and columns (ncol).

A <- matrix(1:6, nrow = 2, ncol = 3)

## [,1] [,2] [,3]

## [1,] 1 3 5
## [2,] 2 4 6
5 / 73
Filling Matrices with matrix()
▶ By default, the matrix() function fills matrices by column.
▶ For example, in matrix A, the first two elements fill the first column, the next two fill
the second column, and so on.
▶ To fill a matrix by row instead, use the byrow argument:
▶ The default is byrow = FALSE (fills by column).
▶ Set byrow = TRUE to fill the matrix row by row.

B <- matrix(1:9, nrow = 3, ncol = 3, byrow = TRUE)

## [,1] [,2] [,3]

## [1,] 1 2 3
## [2,] 4 5 6
## [3,] 7 8 9

6 / 73
Behavior of nrow and ncol arguments

▶ The default values for the matrix() function are nrow = 1 and ncol = 1.
▶ If both nrow and ncol are left blank, R will produce a matrix with a single
column, i.e., a column vector

# matrix with default ncol and nrow

matrix(1:5)

## [,1]
## [1,] 1
## [2,] 2
## [3,] 3
## [4,] 4
## [5,] 5

7 / 73
If only nrow or ncol is defined, the other value is automatically computed based
on the length of the data vector.

# specifying only nrow (compute ncol automatically)

matrix(1:6, nrow = 2)

## [,1] [,2] [,3]

## [1,] 1 3 5
## [2,] 2 4 6

# specifying only nrow (compute ncol automatically)

matrix(1:6, ncol = 2)

## [,1] [,2]
## [1,] 1 4
## [2,] 2 5
## [3,] 3 6

Key Takeaway: R will automatically compute missing dimensions based on input

8 / 73
Caution: Recycling in matrix()
Caution: If the length of the data vector is too short to fill the entire matrix, the
values will be recycled.
▶ R behaves similarly to vector recycling (as covered in Chapter 2):
▶ If the vector is recycled a whole number of times, R fills the matrix without a
warning.
▶ If the vector is not fully recycled, the matrix will still be filled, but R will throw a
warning.
▶ Complete Recycling (No Warning):

# Complete Recycling (No Warning)

# Recycles 1:4 twice, fills matrix without warning
matrix(1:4, nrow = 2, ncol = 6)

## [,1] [,2] [,3] [,4] [,5] [,6]

## [1,] 1 3 1 3 1 3
## [2,] 2 4 2 4 2 4
9 / 73
# Incomplete Recycling (Warning):
# Recycles 1:4 incompletely, throws a warning
matrix(1:4, nrow = 2, ncol = 5)

## Warning in matrix(1:4, nrow = 2, ncol = 5): data length [4] is not a

## sub-multiple or multiple of the number of columns [5]

## [,1] [,2] [,3] [,4] [,5]

## [1,] 1 3 1 3 1
## [2,] 2 4 2 4 2

Key Takeaway: Be cautious when the data vector is shorter than the matrix
dimensions—recycling can happen silently or with a warning.

10 / 73
dim() The Dimension of a Matrix
▶ The dimension of a matrix is defined by the number of rows and the number of
columns.
▶ This is often written as nrow × ncol (read as “nrow by ncol”).
▶ For example, matrix A with 2 rows and 3 columns is a 2 × 3 matrix.
▶ A matrix is called square if the number of rows equals the number of columns.
The dim() function returns a numeric vector specifying the dimension of a matrix.

A <- matrix(1:6, nrow = 2, ncol = 3)

B <- matrix(1:9, nrow = 3, ncol = 3, byrow = TRUE)
dim(A)

## [1] 2 3

dim(B)

## [1] 3 3
11 / 73
nrow() and ncol() The Number of Rows and Columns of a Matrix
▶ The functions nrow() and ncol() input a two-dimensional object and output the
number of rows or columns, respectively.
▶ These will produce the individual entries from the dim() function.

nrow(A) # The number of rows of A

## [1] 2

ncol(A) # The number of columns of A

## [1] 3

Question: How would you extract the number of rows from the output of dim(A)? In
other words, how would you write an nrow() function that relies on the dim()
function?
12 / 73
The cbind() and rbind() Functions
▶ The cbind() (column bind) and rbind() (row bind) functions allow you to
create matrices by binding columns or rows together.
▶ Each column (for cbind()) or row (for rbind()) is provided as a separate
unnamed argument.
# An alternative way to create the matrix A
cbind(c(1, 2), c(3, 4), c(5, 6))

## [,1] [,2] [,3]

## [1,] 1 3 5
## [2,] 2 4 6
# An alternative way to create the matrix B
rbind(c(1, 2, 3), c(4, 5, 6), c(7, 8, 9))

## [,1] [,2] [,3]

## [1,] 1 2 3
## [2,] 4 5 6
## [3,] 7 8 9
13 / 73
Conformable Matrices for cbind() and rbind()
▶ To successfully combine matrices with rbind() or cbind(), the matrices must be
conformable.
▶ This means the number of rows (for cbind()) or the number of columns (for
rbind()) must match.
▶ If the matrices are not conformable, an error will be produced.
# Bind the rows of A (2x3) and B (3x3) together (i.e., stack A on top of B)
rbind(A, B)

## [,1] [,2] [,3]

## [1,] 1 3 5
## [2,] 2 4 6
## [3,] 1 2 3
## [4,] 4 5 6
## [5,] 7 8 9
# This will give an error, because the number of rows are not conformable
cbind(A, B)

14 / 73
Recycling in rbind() and cbind()
▶ Side Note: The rbind() and cbind() functions recycle values when necessary.
▶ This is useful for adding columns or rows of repeated values.
Application to Statistics
▶ In linear regression, the observed values of the predictor variables are often
organized into a design matrix X .
▶ The design matrix usually includes a column of 1’s to account for the intercept
term in the model (see matrix formulation of linear regression in STATS 101A for
details).

cbind(1, B) # Append a column of 1's to matrix B

## [,1] [,2] [,3] [,4]

## [1,] 1 1 2 3
## [2,] 1 4 5 6
## [3,] 1 7 8 9
15 / 73
Matrices are Stored as Two-Dimensional Vectors

▶ Every value in a matrix must be of the same type because matrices are
internally stored as vectors in R.
▶ This can be verified using the mode() function on a matrix.

mode(A) # A is stored as a numeric vector

## [1] "numeric"

16 / 73
Matrices vs. Vectors

▶ Matrices are vectors with an additional dimension attribute (dim).

▶ Vectors have no dimension attribute, which means they are not simply
one-dimensional matrices.
▶ The attributes() function allows us to see the attributes of an R object.

attributes(A)

## $dim
## [1] 2 3

attributes(1:6)

## NULL

17 / 73
We could strip the A matrix of its dim attribute by assigning NULL to its
attributes(A) object. The matrix object will revert back to a vector.

attributes(A) <- NULL # Remove all of A's attributes

A # A is now a vector

## [1] 1 2 3 4 5 6

attributes(A)

## NULL

We can also give a vector the dim attribute in a similar way to convert a vector into a
matrix.

# Assign the dim attribute to A (with 2 rows and 3 columns)

attributes(A) <- list("dim" = c(2, 3)) # we'll talk about lists later
# A is now a matrix again
18 / 73
The attributes() and attr() Functions
▶ Note: The attributes() function accesses and can assign (or reassign) all
attributes of an object.
▶ When you use attributes() <-, any existing attributes will be overwritten with
only the specified attributes (e.g., dim).
▶ To directly access or assign one specific attribute, use the attr() function.

attr(A, "dim") # Shows the dim attribute of A

## [1] 2 3

attr(A, "dim") <- NULL # Removes the dim attribute from A

A # A is now a vector

## [1] 1 2 3 4 5 6

attr(A, "dim") # The dim attribute of A no longer exists, outputs NULL

19 / 73
Naming Rows and Columns of Two-Dimensional Objects

20 / 73
Naming One-Dimensional Objects (Vectors) in R
▶ A named vector in R is a regular vector where each element is associated with a
name.
▶ The names are stored as an attribute of the vector and can be used for more
intuitive indexing.
# Create a named vector
heights <- c("Leslie" = 62, "Ron" = 71, "April" = 66, "Tom" = 68)
heights

## Leslie Ron April Tom

## 62 71 66 68

You can access elements by name as well as by numeric index:

heights["April"]

## April
## 66
21 / 73
get and set vector names
You can also add names to an existing vector using the names() function:

weights <- c(115, 201, 119, 154)

names(weights) <- c("Leslie", "Ron", "April", "Tom")
weights

## Leslie Ron April Tom

## 115 201 119 154

You can also access the names using the names() function:

names(weights) # this returns a character vector

## [1] "Leslie" "Ron" "April" "Tom"

22 / 73
Naming Rows and Columns of Matrices in R
Suppose we have data on a few employees at the Pawnee Parks and Recreation
Department, shown in a data table below.

Name Height (inches) Weight (pounds) Income ($/month)

Leslie 62 115 4000
Ron 71 201 (Redacted)
April 66 119 2000

We can input the numeric data into a matrix in R.

parks_mat <- cbind(c(62, 71, 66), c(115, 201, 119), c(4000, NA, 2000))
parks_mat # Make sure the data was entered correctly

## [,1] [,2] [,3]

## [1,] 62 115 4000
## [2,] 71 201 NA
## [3,] 66 119 2000 23 / 73
Row and Column Names for Matrices in R
▶ row and column names are often needed to give meaning to the data in
two-dimensional objects.
▶ Use the rownames() and colnames() functions to access or set row and column
names.
▶ Access the current names (if any) by calling the functions on the object:

rownames(parks_mat) # none exist, output NULL

## NULL

colnames(parks_mat) # none exist, output NULL

## NULL

Heads up: Whenever you feel like your matrix really needs row and column names you
should probably use a data frame instead. More later.
24 / 73
Setting Row and Column Names

You can set names by creating a character vector of names and assigning it using the
<- operator.

rownames(parks_mat) <- c("Leslie", "Ron", "April")

colnames(parks_mat) <- c("Height", "Weight", "Income")
parks_mat

## Height Weight Income

## Leslie 62 115 4000
## Ron 71 201 NA
## April 66 119 2000

The names are now associated with the rows and columns of the matrix, providing
more context to the data.
25 / 73
Attributes and dimnames
▶ Technically, the rownames() and colnames() functions modify the attributes of
the object.
▶ Setting names will add a dimnames attribute to the object.

attributes(parks_mat)

## $dim
## [1] 3 3
##
## $dimnames
## $dimnames[[1]]
## [1] "Leslie" "Ron" "April"
##
## $dimnames[[2]]
## [1] "Height" "Weight" "Income"

The attributes(parks_mat) object is a list, and the dimnames attribute contained 26 / 73

Naming Matrices using dimnames()

Side Note: The dimnames() function can get and set both the row and column name
attributes at once. The assignment input using dimnames() needs to be a list with
two vector components.

dimnames(parks_mat) <- NULL # Remove the row and column names

parks_mat

## [,1] [,2] [,3]

## [1,] 62 115 4000
## [2,] 71 201 NA
## [3,] 66 119 2000

27 / 73
# Add the same names as before
dimnames(parks_mat) <- list(c("Leslie", "Ron", "April"),
c("Height", "Weight", "Income"))
parks_mat

## Height Weight Income

## Leslie 62 115 4000
## Ron 71 201 NA
## April 66 119 2000

28 / 73
Naming Matrices using matrix(„dimnames = list())

Side Note: There are a few other ways to add names to rows and columns. The
matrix() function has an optional dimnames argument that allows us to add names
directly when creating a matrix object. The syntax is the same as the dimnames()
function.

matrix(1:9, nrow = 3, ncol = 3,

dimnames = list(c("a", "b", "c"), c("A", "B", "C")))

## A B C
## a 1 4 7
## b 2 5 8
## c 3 6 9

29 / 73
Naming Matrices created using rbind() and cbind()
The rbind() and cbind() functions allow us to name, respectively, each row or
column by just typing the name of the row or column in quotation marks, as shown
below.

rbind("a" = 1:3, "b" = 4:6, "c" = 7:9) # name each row

## [,1] [,2] [,3]

## a 1 2 3
## b 4 5 6
## c 7 8 9

cbind("A" = 1:3, "B" = 4:6, "C" = 7:9) # name each column

## A B C
## [1,] 1 4 7
## [2,] 2 5 8
## [3,] 3 6 9 30 / 73
Extracting Data From Two-Dimensional Objects

31 / 73
Extracting Data From Two-Dimensional Objects
▶ Recall that square brackets are used to extract specific parts of data from
objects in R.
▶ Vectors are one-dimensional, so you can extract elements by providing a single
index inside square brackets.
▶ For two-dimensional objects, such as matrices or data frames:
▶ Use two index coordinates inside square brackets, separated by a comma.
▶ The general format is [i, j], where:
▶ i is the row index.
▶ j is the column index.
▶ This extracts the entry in the ith row and jth column, also called the ijth entry.

A[2, 3] # extract the entry in the second row third column

## [1] 6

32 / 73
Numeric Indices in Two-Dimensional Objects
▶ Leaving one value blank means extracting all the values in that dimension.
▶ Positive, negative, and fractional indices work the same way as they do for
vectors.
▶ Row and column indices are independent, allowing for mixed positive and
negative indices.
B # Notice the row and column indices in the output

## [,1] [,2] [,3]

## [1,] 1 2 3
## [2,] 4 5 6
## [3,] 7 8 9

B[2, 1] # Extract the (2,1) element

## [1] 4
33 / 73
B[2, ] # Extract the second row

## [1] 4 5 6

B[, 3] # Extract the third column

## [1] 3 6 9

B[, -2] # Remove the second column

## [,1] [,2]
## [1,] 1 3
## [2,] 4 6
## [3,] 7 9

34 / 73
B[-1, c(2, 3)] # Remove the first row and extract the second and third colu

## [,1] [,2]
## [1,] 5 6
## [2,] 8 9

Note: Notice that when the resulting output is one-dimensional (i.e., a single row or a
single column), the output object is a vector, not a one-dimensional matrix.

35 / 73
Caution: Single Indexing in Matrices
▶ Caution: Using a single index [] instead of an index pair [,] will not throw a
warning or error for matrix objects.
▶ Matrices are stored as one long column-major vector:
▶ Values are stored top-to-bottom down the first column, then the second column, and
so on.
▶ Using a single index [] will return the corresponding entries in the vector

B[8] # extracts the 8th value from the matrix as if it were a vector

## [1] 6

B[c(2, 3)] # extracts the second and third element as if it were a vector

## [1] 4 7

B[2, 3] # extracts the element in the second row and third column

36 / 73
Logical Indices
▶ Logical vectors can also be used to subset two-dimensional objects.
▶ The behavior of logical indices is similar to vectors, allowing us to extract rows
or columns that satisfy specific conditions.

# Which heights are above 65 inches?

tall_index <- parks_mat[, 1] > 65
# Extract only the rows/observations for people who are taller than 65 inch
parks_mat[tall_index, ]

## Height Weight Income

## Ron 71 201 NA
## April 66 119 2000

Question: How can we use the tall_index vector to extract only the
rows/observations for people in the data who are at most 65 inches tall?
37 / 73
Named (Character) Indices
If the rows or columns of a two-dimensional object are named, we can use the name as
an index.
▶ row names can only be used as a row index,
▶ column names can only be used as a column index.

parks_mat["Leslie", ] # Extract the row of data for Leslie

## Height Weight Income

## 62 115 4000

parks_mat[, "Income"] # Extract the column of data for Income

## Leslie Ron April

## 4000 NA 2000
38 / 73
parks_mat["Ron", "Height"] # Extract the height of Ron

## [1] 71

parks_mat[c("Leslie", "April"), "Weight"] # Extract the weight of Leslie an

## Leslie April
## 115 119

Note: Notice that we do not need to know the numeric index for the observations or
variables. Using names as indices can also increase the readability of your code.

39 / 73
Matrix Operations

40 / 73
Entrywise Arithmetic Operations
▶ Matrices are stored as vectors, so arithmetic operations (+, -, *, /, etc.) on
numeric matrices work just like on numeric vectors.
▶ Entrywise operations: Arithmetic operations are applied to each entry in the
matrix.
▶ Matrices must be conformable (same dimensions) for entrywise operations
between two matrices.
A + 10 # Add 10 to every entry in A

## [,1] [,2] [,3]

## [1,] 11 13 15
## [2,] 12 14 16
Aˆ2 # Square each entry in A

## [,1] [,2] [,3]

## [1,] 1 9 25
## [2,] 4 16 36
41 / 73
# Construct a matrix C with the same dimensions as A
C <- matrix(1:3, nrow = 2, ncol = 3)
C

## [,1] [,2] [,3]

## [1,] 1 3 2
## [2,] 2 1 3
A + C # Add A and C

## [,1] [,2] [,3]

## [1,] 2 6 7
## [2,] 4 5 9
AˆC # Exponentiate A by C

## [,1] [,2] [,3]

## [1,] 1 27 25
## [2,] 4 4 216
42 / 73
A * C # Multiply A and C (entrywise)

## [,1] [,2] [,3]

## [1,] 1 9 10
## [2,] 4 4 18

If we try to apply the operators to matrices that are not conformable, R will throw an
error.

A * B # A and B are not conformable

## Error in A * B: non-conformable arrays

43 / 73
Matrix Multiplication (the one from MATH 33A)

Caution: The entrywise multiplication (∗ ) is not the same as matrix multiplication in

linear algebra.
Requirements for Matrix Multiplication
▶ For matrix multiplication, two matrices are conformable if:
▶ The number of columns in the left matrix equals the number of rows in the right
matrix.
▶ If A is an m × n matrix and B is an n × p matrix:
▶ Then A and B are conformable, and their product AB will be an m × p matrix.
Sidenote: Having one or all of the dimensions m, n, p equal to 1 is allowed and no
problem.

44 / 73
Review: The Dot Product

For two vectors a = [a1 , a2 , . . . , an ] and b = [b1 , b2 , . . . , bn ], the dot product is:
n
X
a·b= ai bi = a1 b1 + a2 b2 + · · · + an bn
i=1

45 / 73
Your turn: Computing the Dot Product

For vectors        
v1 1 w1 2
v = v2  =  3  and w = w2  = −4 .
       
v3 −5 w3 6

compute the dot product v · w (which is the same as the matrix multiplication v T w ).
What is the dimensionality of your result?

46 / 73
Matrix Multiplication is a generalization of the Dot Product

▶ The (i, j)th entry of the product AB is the dot product of the ith row of A with
the jth column of B.
▶ Multiplying a matrix with only one row A and another matrix with only one
column B (i.e., matrix A this actually a row vector and matrix B that is actually a
column vector), then matrix multiplication simplifies to the dot product

47 / 73
For example, let A be a 2 × 3 matrix and B be a 3 × 2 matrix, denoted by
 
" # b11 b12
a11 a12 a13
A= and B = b21 b22  .
 
a21 a22 a23
b31 b32

then the matrix product AB is the 2 × 2 matrix given by

" #
a b + a12 b21 + a13 b31 a11 b12 + a12 b22 + a13 b32
AB = 11 11 .
a21 b11 + a22 b21 + a23 b31 a21 b12 + a22 b22 + a23 b32

" n #
X
Written generally, if A = [aik ]m×n and B = [bkj ]n×p , then AB = aik bkj .
k=1 m×p

48 / 73
Your turn: Computing the Matrix Multiplication

Compute the product of the matrices

 
1 2 " #
7 8 9
A = 3 4 and B = .
 
10 11 12
5 6

Questions
▶ What is the dimensionality of the resulting matrix?
▶ What is the resulting matrix AB?

49 / 73
Matrix Multiplication in R
In R, Matrix multiplication is performed using the %*% operator:

A %*% C # Non-conformable error

## Error in A %*% C: non-conformable arguments

A %*% B # Matrix multiplication of A and B

## [,1] [,2] [,3]

## [1,] 48 57 66
## [2,] 60 72 84

Important:
▶ Matrices A ∈ [2 × 3] and C ∈ [2 × 3] are conformable for entrywise
multiplication but not for matrix multiplication.
▶ Matrices A ∈ [2 × 3] and B ∈ [3 × 2] are conformable for matrix multiplication
but not for entrywise multiplication. 50 / 73
What’s up with the two Matrix Multiplications?
Yes, there are two versions of matrix multiplication (in this class).
1. Element-wise matrix multiplication: For A ∈ [m × n] and B ∈ [m × n], compute
the ijth entry of A ∗ B ∈ [m × n] as
[A ∗ B]ij = aij ∗ bij
2. Standard matrix product: For A ∈ [m × n] and B ∈ [n × p], compute the ijth
entry of AB ∈ [m × p] as
n
X
[AB]ij = aik bkj
k=1
So which one should I use?
That depends on what you are doing. Keep in mind that:
▶ A matrix is nothing but a clever way to arrange a lot of numbers.
▶ Matrix operators / multiplication are just a clever way to notate concisely what
computations to perform on all these numbers.
▶ There is no deeper logic, it’s just different ways to crunch large sets of numbers.
51 / 73
The Transpose of a Matrix

▶ The transpose of an m × n matrix A is the unique m × n matrix AT such where

the rows of AT are the columns of A and the columns of AT are the rows of A.
▶ You can also think of it as mirroring the matrix along its diagonal.
▶ Note that the transpose of the transpose of a matrix is the original matrix:
T
AT =A
 
" # 1 2 T " #
1 3 5 T T 1 3 5
A= , A = 3 4 , A = ,
 
2 4 6 2 4 6
5 6

52 / 73
Using the t() Function to Transpose a Matrix
In R, you can compute the transpose of a matrix using the t() function

t(A) # transpose of A

## [,1] [,2]
## [1,] 1 2
## [2,] 3 4
## [3,] 5 6

t(t(A)) # transpose of the transpose of A is A itself

## [,1] [,2] [,3]

## [1,] 1 3 5
## [2,] 2 4 6
53 / 73
B # B

## [,1] [,2] [,3]

## [1,] 1 2 3
## [2,] 4 5 6
## [3,] 7 8 9

t(B) # transpose of B

## [,1] [,2] [,3]

## [1,] 1 4 7
## [2,] 2 5 8
## [3,] 3 6 9

54 / 73
The Identity Matrix

▶ The identity matrix of size n, denoted by In (or simply I if the dimension is

implicit), is an n × n square matrix.
▶ The identity matrix has:
▶ 1’s on the main diagonal (where row = column, or i = j).
▶ 0’s everywhere else.

Mathematical Definition
(
1 if i = j
[In ]ij =
̸ j
0 if i =

Purpose
The identity matrix acts like the 1 in scalar multiplication but for matrix
multiplication, leaving other matrices unchanged when multiplying them by In .
55 / 73
Identity Matrices Example

Examples:
Identity matrices can be any size, they just have to be square (number of rows equals
number of columns)
 
" # 1 0 0
h i 1 0
I1 = 1 , I2 = , I3 = 0 1 0 , . . .
 
0 1
0 0 1

Example use:
Your turn: Compute AI3 or I2 A. Pick one, it does not matter which one.

56 / 73
Diagonal Matrices
A diagonal matrix is a matrix in which the entries outside the main diagonal (entries
Aij where i = j) are all zero.
 
   1  0 0
1 0 0 1 0 0 0 
 0 5 0
0 12 0 , 0 5 0 0 , 
   
0 0 10

0 0 4 0 0 10 0
0 0 0

The identity matrix is a matrix is a square (number of rows = number of columns)

diagonal matrix with only 1s on the diagonal.

···
 
1 0 0 0
0 1 0 ··· 0
0 0 1 ··· 0
. .. .. .. 

. ..
. . . . .
0 0 0 ··· 1

57 / 73
Creating Diagonal Matrices using the diag() Function

The diag() function has two main functionalities. By inputting a number, the diag()
function will generate an identity matrix of that size.

# inputting a number
diag(4) # Create a 4x4 identity matrix

## [,1] [,2] [,3] [,4]

## [1,] 1 0 0 0
## [2,] 0 1 0 0
## [3,] 0 0 1 0
## [4,] 0 0 0 1

58 / 73
By inputting a vector, the diag() function will generate a diagonal matrix (the only
nonzero entries are along the diagonal) with the vector values along the diagonal.

# inputting a vector
diag(c(1, 2, 3)) # Create a diagonal matrix with 1, 2, 3 along the diagonal

## [,1] [,2] [,3]

## [1,] 1 0 0
## [2,] 0 2 0
## [3,] 0 0 3

59 / 73
The nrow and ncol arguments can also be used to specify the dimensions for a
rectangular (non-square) matrix.

# specifying a rectangular (non square diagonal matrix)

diag(c(1, 2, 3), nrow = 3, ncol = 4)

## [,1] [,2] [,3] [,4]

## [1,] 1 0 0 0
## [2,] 0 2 0 0
## [3,] 0 0 3 0

60 / 73
The Inverse of a Matrix

▶ The inverse of an n × n square matrix A is the unique n × n matrix, denoted A−1 ,

such that:
AA−1 = A−1 A = In .
▶ The inverse of a matrix is similar to the reciprocal (or multiplicative inverse) of a
number:
▶ For example, the reciprocal of 2 is 2−1 = 21 , since:

1 1
2× = × 2 = 1.
2 2

61 / 73
Inverting Matrices using the solve() function
The function solve() computes the inverse of the inputted matrix:

M <- matrix(c(1, 4, 2, 1), nrow = 2, ncol = 2) # Create a matrix M

## [,1] [,2]
## [1,] 1 2
## [2,] 4 1

M_inv <- solve(M) # Compute the inverse of M

M_inv

## [,1] [,2]
## [1,] -0.1428571 0.2857143
## [2,] 0.5714286 -0.1428571
# Verify that M_inv is the inverse of M
62 / 73
Caution: Not all Matrices are Invertible

▶ Not all square matrices have an inverse.

▶ A matrix that has an inverse is called invertible or nonsingular. If no inverse
exists, the matrix is called singular.
▶ Trying to invert a singular matrix in R will result in an error:

solve(B)

Error in solve.default(B): system is computationally singular:

reciprocal condition
number = 2.59052e-18

63 / 73
Singular Matrices in Statistics

As statisticians it can (and will) happen that when estimating linear regression
coefficients −1
β = XTX XT y,

the matrix X T X is not invertible. Common causes are: - multicollinearity of our

predictor variables, for example: - You include height in inches as well as height in cm
as predictor variables in the same model - You include a sum total and its parts as
predictor variables, for example last months revenue by state and last months total
revenue in a regression model predicting this months revenue. - overdetermination of
the model which usually happens when we fewer observations than predictor variables.

64 / 73
Operations on Matrix Columns and Rows

65 / 73
The apply() Function

Suppose we want to compute the mean of each variable in the parks_mat matrix.

parks_mat

## Height Weight Income

## Leslie 62 115 4000
## Ron 71 201 NA
## April 66 119 2000

66 / 73
We could compute the mean of each variable individually, but it would require
repetitive code (or a for loop):

mean(parks_mat[, "Height"], na.rm = TRUE) # Or mean(parks_mat[, 1], na.rm =

## [1] 66.33333

mean(parks_mat[, "Weight"], na.rm = TRUE) # Or mean(parks_mat[, 2], na.rm =

## [1] 145

mean(parks_mat[, "Income"], na.rm = TRUE) # Or mean(parks_mat[, 3], na.rm =

## [1] 3000

For large matrices (or other data objects you will see later), using repetitive code is
inefficient and cumbersome.
67 / 73
The apply() function

The apply() function is used to apply a function to the rows or columns (the
margins) of matrices, arrays (higher dimension matrices), and data frames (which you
will see soon).
Similar to vapply(), the syntax of apply() is apply(X, MARGIN, FUN, ...),
where the arguments are:
▶ X: A matrix or data frame
▶ MARGIN: A vector giving the subscript(s) over which the function will be applied
over. A 1 indicates rows, 2 indicates columns, and c(1, 2) indicates rows and
columns.
▶ FUN: The function to be applied.
▶ ...: Any optional arguments to be passed to the FUN function (for example,
na.rm = TRUE).

68 / 73
Using apply() to Compute Row / Column Means
Using apply(), we can apply the mean() function to each column in parks_mat
simultaneously with a single command.
# Compute the mean of every column of the parks_mat matrix
apply(X = parks_mat, MARGIN = 2, FUN = mean, na.rm = TRUE)

## Height Weight Income

## 66.33333 145.00000 3000.00000

To compute the mean of each row, we can change the margin argument MARGIN from
2 (columns) to 1 (rows).
# Compute the mean of every row of the parks_mat matrix
apply(X = parks_mat, MARGIN = 1, FUN = mean, na.rm = TRUE)

## Leslie Ron April

## 1392.3333 136.0000 728.3333
69 / 73
Behavior of apply() Output
▶ Note: The structure of apply() output depends on the result of the function
specified in the FUN argument:
▶ If the function in FUN returns a single value, the output of apply() will be a
vector.
▶ If the function in FUN returns a vector with multiple values, the output of
apply() will be a matrix.
▶ Unlike the vapply() function, apply() is smart enough to figure out the
dimensionality of its output by itself, we do not need to specify a FUN.VALUE
argument.
range(parks_mat[,1]) # Compute the range (min and max), this is a vector of length 2

## [1] 62 71
apply(X = parks_mat, MARGIN = 2, FUN = range, na.rm = TRUE) # range of every column

## Height Weight Income

## [1,] 62 115 2000
## [2,] 71 201 4000
70 / 73
Using Custom Functions with apply()
▶ The FUN argument in apply() does not have to be a built-in function.
▶ We can create our own functions and apply them to each row or column.
Suppose we want to compute the squared deviations from the mean for each variable
in parks_mat:

squared_devs <- function(x, na.rm = FALSE) {

# This function inputs a vector and computes the squared deviations away
(x - mean(x, na.rm = na.rm))ˆ2
}
# Apply the squared_devs() function to every column of the parks_mat matrix
apply(X = parks_mat, MARGIN = 2, FUN = squared_devs, na.rm = TRUE)

## Height Weight Income

## Leslie 18.7777778 900 1e+06
## Ron 21.7777778 3136 NA
## April 0.1111111 676 1e+06 71 / 73
The function can also be written directly into the FUN argument without having to
save it as a separate object.

# Creates the same object as apply(parks_mat, 2, squared_devs, na.rm = TRUE

apply(X = parks_mat, MARGIN = 2, FUN = function(x) {
(x - mean(x, na.rm = TRUE))ˆ2
})

## Height Weight Income

## Leslie 18.7777778 900 1e+06
## Ron 21.7777778 3136 NA
## April 0.1111111 676 1e+06

72 / 73
apply() also Follows the Split-Apply-Combine Strategy

1. Split the data set into a set of row vectors (MARGIN = 1) or column vectors
(MARGIN = 2)
2. Apply a function (specified in FUN) to each vector
3. Combine all results into a vector (if individual results were scalars) or a matrix (if
individual results were vectors)

73 / 73

Application To Congruences
No ratings yet
Application To Congruences
16 pages
Assignment 3 (27 09 2010)
100% (1)
Assignment 3 (27 09 2010)
50 pages
Unit1 Matrix and Array
No ratings yet
Unit1 Matrix and Array
19 pages
R 03 Matrices Handouts
No ratings yet
R 03 Matrices Handouts
14 pages
Lenguaje R C4
No ratings yet
Lenguaje R C4
15 pages
03 Matrices
No ratings yet
03 Matrices
60 pages
Mod 2 Summary Table
No ratings yet
Mod 2 Summary Table
16 pages
Question 4: How To Create Matrices and Access Matrix Elements in R (Show Commands)
No ratings yet
Question 4: How To Create Matrices and Access Matrix Elements in R (Show Commands)
2 pages
Unit 2 Matrices
No ratings yet
Unit 2 Matrices
65 pages
Module2 DAR
No ratings yet
Module2 DAR
40 pages
Rbasics
No ratings yet
Rbasics
96 pages
Lecture 1
No ratings yet
Lecture 1
42 pages
N2 Data in R
No ratings yet
N2 Data in R
7 pages
r Programming Unit 3 Qb Solved
No ratings yet
r Programming Unit 3 Qb Solved
272 pages
Lab3-Lists, Matrices and Arrays
No ratings yet
Lab3-Lists, Matrices and Arrays
6 pages
Intr2R Week2 2020
No ratings yet
Intr2R Week2 2020
13 pages
R - Lecture 3
No ratings yet
R - Lecture 3
21 pages
R Nuts and Bolts
No ratings yet
R Nuts and Bolts
9 pages
1 - Introduction To Programming With R
No ratings yet
1 - Introduction To Programming With R
13 pages
IDS-UNIT-3-BY
No ratings yet
IDS-UNIT-3-BY
109 pages
M2_DAR_
No ratings yet
M2_DAR_
46 pages
Data in R
No ratings yet
Data in R
7 pages
R Matrix
No ratings yet
R Matrix
18 pages
Data Analysis Using R - 3
No ratings yet
Data Analysis Using R - 3
32 pages
Lecture 2: More Data Structures: Outline
No ratings yet
Lecture 2: More Data Structures: Outline
16 pages
Fall 2005 Statistics 579 R Tutorial: Vectors, Matrices, and Arrays
No ratings yet
Fall 2005 Statistics 579 R Tutorial: Vectors, Matrices, and Arrays
8 pages
Matrix
No ratings yet
Matrix
20 pages
Week 12 - Lecture Notes Special Matrices
No ratings yet
Week 12 - Lecture Notes Special Matrices
25 pages
Programming R - 3
No ratings yet
Programming R - 3
16 pages
R - Chapter 2
No ratings yet
R - Chapter 2
18 pages
Vector List
No ratings yet
Vector List
9 pages
IDS - Unit 3 - 5
No ratings yet
IDS - Unit 3 - 5
80 pages
Data Structures
No ratings yet
Data Structures
8 pages
Dar lecture 7
No ratings yet
Dar lecture 7
24 pages
R Programming Merged PDF
No ratings yet
R Programming Merged PDF
365 pages
Data Types
No ratings yet
Data Types
27 pages
Obejcts in R A13
No ratings yet
Obejcts in R A13
8 pages
Introduction To R
No ratings yet
Introduction To R
21 pages
R Data Structures_07_3
No ratings yet
R Data Structures_07_3
35 pages
Introduction To R
No ratings yet
Introduction To R
74 pages
R Programming 101 Part 1
No ratings yet
R Programming 101 Part 1
53 pages
Create and Name Matrices
No ratings yet
Create and Name Matrices
12 pages
R Software Notes
No ratings yet
R Software Notes
5 pages
Chap 3 - BSD2223
No ratings yet
Chap 3 - BSD2223
29 pages
R Programming: © 2016 SMART Training Resources Pvt. LTD
No ratings yet
R Programming: © 2016 SMART Training Resources Pvt. LTD
28 pages
Chapter 1 Introduction To R
No ratings yet
Chapter 1 Introduction To R
33 pages
Matrices
No ratings yet
Matrices
10 pages
WIN SEM (2022-23) CSE4027 ETH AP2022236000324 Reference Material I 25-Jan-2023 Module-1 Topic-3 - R Datatypes
No ratings yet
WIN SEM (2022-23) CSE4027 ETH AP2022236000324 Reference Material I 25-Jan-2023 Module-1 Topic-3 - R Datatypes
41 pages
R-Data Structures
No ratings yet
R-Data Structures
14 pages
Introduction To Spatial Data Handling in R
No ratings yet
Introduction To Spatial Data Handling in R
25 pages
R Prog
No ratings yet
R Prog
27 pages
STAT 04 Simplify Notes
No ratings yet
STAT 04 Simplify Notes
34 pages
Matrices Are The R Objects in Which The Elements Are Arranged in A Two
No ratings yet
Matrices Are The R Objects in Which The Elements Are Arranged in A Two
4 pages
List and Data Frame
No ratings yet
List and Data Frame
18 pages
R Objects
No ratings yet
R Objects
10 pages
Network Analysis and Visualization With R and Igraph
No ratings yet
Network Analysis and Visualization With R and Igraph
62 pages
Unit 1.1
No ratings yet
Unit 1.1
85 pages
R - Lecture 2
No ratings yet
R - Lecture 2
51 pages
RCourse Lecture12 Calculations Matrices - Watermark
No ratings yet
RCourse Lecture12 Calculations Matrices - Watermark
15 pages
DSF 9-10
No ratings yet
DSF 9-10
25 pages
The Essential R Reference
From Everand
The Essential R Reference
Mark Gardener
No ratings yet
Advanced C Concepts and Programming: First Edition
From Everand
Advanced C Concepts and Programming: First Edition
Gayatri
3/5 (1)
Chapter-4-slides
No ratings yet
Chapter-4-slides
55 pages
Chapter-6-slides
No ratings yet
Chapter-6-slides
34 pages
Final Exam
No ratings yet
Final Exam
5 pages
Chapter-7-slides (1)
No ratings yet
Chapter-7-slides (1)
104 pages
Notes of AEM (3CS1-01,3AM1-01,3AD1-01) - Unit 2 by Dr. RM - 2024-25
No ratings yet
Notes of AEM (3CS1-01,3AM1-01,3AD1-01) - Unit 2 by Dr. RM - 2024-25
46 pages
LINFO2262: Decision Trees + Random Forests: Pierre Dupont
No ratings yet
LINFO2262: Decision Trees + Random Forests: Pierre Dupont
43 pages
Fourier Transform PPT
No ratings yet
Fourier Transform PPT
18 pages
Math 1090 Exponential and Logarithmic Project 5
No ratings yet
Math 1090 Exponential and Logarithmic Project 5
3 pages
Genetic Algorithms To Predict Problems in Crops
No ratings yet
Genetic Algorithms To Predict Problems in Crops
6 pages
CS8080 Irt Unit 3 23 24
No ratings yet
CS8080 Irt Unit 3 23 24
48 pages
Using Tournament Trees T O Sort Alexander Stepanov and Aaron Kershenbaum
No ratings yet
Using Tournament Trees T O Sort Alexander Stepanov and Aaron Kershenbaum
39 pages
Complete Time Series Analysis in Python 1673057003
No ratings yet
Complete Time Series Analysis in Python 1673057003
56 pages
MATH 685/ CSI 700/ OR 682 Lecture Notes: Optimization Problems
No ratings yet
MATH 685/ CSI 700/ OR 682 Lecture Notes: Optimization Problems
69 pages
Digital Control System
No ratings yet
Digital Control System
2 pages
AVL Example
No ratings yet
AVL Example
21 pages
Unmasking The Face Expression
No ratings yet
Unmasking The Face Expression
11 pages
Unit I Introduction
No ratings yet
Unit I Introduction
55 pages
7 Karnaugh Maps - 1
No ratings yet
7 Karnaugh Maps - 1
21 pages
Further Inequalities Excersises
No ratings yet
Further Inequalities Excersises
5 pages
Applied Information Processing Systems 2022
100% (1)
Applied Information Processing Systems 2022
588 pages
Ch-2 Mat MGMT
No ratings yet
Ch-2 Mat MGMT
14 pages
Page
No ratings yet
Page
1 page
2.13 Simultaneous Equations: C Pearson Education LTD 2000
No ratings yet
2.13 Simultaneous Equations: C Pearson Education LTD 2000
2 pages
Cepstrum
No ratings yet
Cepstrum
5 pages
Big Data Machine Learning
100% (1)
Big Data Machine Learning
6 pages
m575 Chapter 10
No ratings yet
m575 Chapter 10
18 pages
K Means Questions
No ratings yet
K Means Questions
2 pages
Review of Serial and Parallel Min-Cut/Max-Flow Algorithms For Computer Vision
No ratings yet
Review of Serial and Parallel Min-Cut/Max-Flow Algorithms For Computer Vision
20 pages
Kode BAUDOT
No ratings yet
Kode BAUDOT
2 pages
Supp E
No ratings yet
Supp E
92 pages
Econ 217 3
No ratings yet
Econ 217 3
16 pages
DSM Presentation I 13 April 07
No ratings yet
DSM Presentation I 13 April 07
27 pages