Matrix Calculus Tutorial
Tanmay Devale
1 Kronecker Product
Let A be an m × n matrix and B be a p × q matrix. The Kronecker (or tensor) product of A and B, denoted A ⊗ B, is the mp × nq matrix C with elements defined by $c_{\alpha\beta} = a_{ij} b_{kl}$, where $\alpha = p(i-1) + k$ and $\beta = q(j-1) + l$.
For example, consider
\[
A = \begin{pmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{pmatrix}
\quad\text{and}\quad
B = \begin{pmatrix} b_{11} & b_{12} \\ b_{21} & b_{22} \\ b_{31} & b_{32} \end{pmatrix}.
\]
Then
\[
A \otimes B
= \begin{pmatrix} a_{11} B & a_{12} B \\ a_{21} B & a_{22} B \end{pmatrix}
= \begin{pmatrix}
a_{11} b_{11} & a_{11} b_{12} & a_{12} b_{11} & a_{12} b_{12} \\
a_{11} b_{21} & a_{11} b_{22} & a_{12} b_{21} & a_{12} b_{22} \\
a_{11} b_{31} & a_{11} b_{32} & a_{12} b_{31} & a_{12} b_{32} \\
a_{21} b_{11} & a_{21} b_{12} & a_{22} b_{11} & a_{22} b_{12} \\
a_{21} b_{21} & a_{21} b_{22} & a_{22} b_{21} & a_{22} b_{22} \\
a_{21} b_{31} & a_{21} b_{32} & a_{22} b_{31} & a_{22} b_{32}
\end{pmatrix}.
\]
Say
\[
A = \begin{pmatrix} 2 & 0 \\ 1 & 3 \end{pmatrix}
\quad\text{and}\quad
B = \begin{pmatrix} 5 & -1 \\ -1 & 4 \end{pmatrix}.
\]
Then
\[
A \otimes B
= \begin{pmatrix} 2B & 0B \\ 1B & 3B \end{pmatrix}
= \begin{pmatrix}
10 & -2 & 0 & 0 \\
-2 & 8 & 0 & 0 \\
5 & -1 & 15 & -3 \\
-1 & 4 & -3 & 12
\end{pmatrix}.
\]
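As a sanity check, Kronecker products can also be computed numerically; NumPy's kron implements exactly the block construction above. A minimal sketch reproducing this example:

    import numpy as np

    # A and B from the worked example above.
    A = np.array([[2, 0],
                  [1, 3]])
    B = np.array([[5, -1],
                  [-1, 4]])

    # np.kron builds the mp x nq block matrix whose (i, j) block is a_ij * B.
    print(np.kron(A, B))
    # [[10 -2  0  0]
    #  [-2  8  0  0]
    #  [ 5 -1 15 -3]
    #  [-1  4 -3 12]]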
1.1 Exercise
1. For each of the following pairs of matrices, find A ⊗ B.
   (a) $A = \begin{pmatrix} 3 \\ 2 \end{pmatrix}$ and $B = \begin{pmatrix} 1 & 0 \\ 2 & 7 \end{pmatrix}$
   (b) $A = \begin{pmatrix} 1 & -1 \end{pmatrix}$ and $B = \begin{pmatrix} 1 & 0 & 5 \end{pmatrix}$
   (c) $A = \begin{pmatrix} 3 & 6 \end{pmatrix}$ and $B = \begin{pmatrix} 1 \\ 0 \\ 1 \end{pmatrix}$
   (d) $A = \begin{pmatrix} 1 & 0 \\ 0 & 2 \end{pmatrix}$ and $B = \begin{pmatrix} -1 & 3 \end{pmatrix}$
   (e) $A = \begin{pmatrix} 2 \\ 1 \end{pmatrix}$ and $B = \begin{pmatrix} 3 \\ 8 \end{pmatrix}$
2. Is A ⊗ B = B ⊗ A? Provide a proof or a counterexample.
2 Matrix Differentiation
We are going to use the following notation:
1. x denotes scalars
2. ⃗x denotes vectors (specifically, column vectors)
3. X denotes matrices
We are interested in the following nine derivatives:
\[
\begin{array}{ccc}
\frac{dy}{dx} & \frac{d\vec{y}}{dx} & \frac{dY}{dx} \\[4pt]
\frac{dy}{d\vec{x}} & \frac{d\vec{y}}{d\vec{x}} & \frac{dY}{d\vec{x}} \\[4pt]
\frac{dy}{dX} & \frac{d\vec{y}}{dX} & \frac{dY}{dX}
\end{array}
\]
Table 1: Derivatives of interest
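As a preview of the table's first row (derivatives with respect to a scalar x), these derivatives are taken entry by entry. A minimal SymPy sketch, using an illustrative matrix of my own rather than one from the exercises:

    import sympy as sp

    x = sp.symbols('x')

    # dY/dx is taken entrywise: differentiate each entry w.r.t. the scalar x.
    Y = sp.Matrix([[x**2, sp.exp(x)],
                   [sp.sin(x), sp.log(x)]])
    print(Y.diff(x))  # Matrix([[2*x, exp(x)], [cos(x), 1/x]])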
2.1.1 Exercise
Given the notation as specified above, find the derivative of y, ⃗y, and Y w.r.t. x.
1. $y = \sin(x^2)$, $\vec{y} = \begin{pmatrix} x^2 \\ \cos(x) \\ 2x^2 \end{pmatrix}$, $Y = \begin{pmatrix} x + 1 & \cos(x) \\ \sin(x) & x - 1 \end{pmatrix}$
2. $y = e^{x^2}$, $\vec{y} = \begin{pmatrix} \ln(x) \\ \sin(x) \\ \cos(x^2) \end{pmatrix}$, $Y = \begin{pmatrix} x^3 + x^2 + 1 & e^x \\ 2x & \sin(x) \end{pmatrix}$
3. $y = \ln(x^{10})$, $\vec{y} = \begin{pmatrix} \cos^2(x) \\ \sin(\pi x) \\ \tan(x) \\ x^4 + x + 2023 \end{pmatrix}$, $Y = \begin{pmatrix} \sin(x^2) & \pi x & e^{\pi} \\ \cos(\sec(x)) & \csc(x^3) & x \end{pmatrix}$
2.2.1 Exercise
1. Given the notation as specified above, find the derivative of y, ⃗y, and Y w.r.t. ⃗x, where $\vec{x} = \begin{pmatrix} x \\ y \\ z \end{pmatrix}$.
   (a) $y = \sin(x + yz)$, $\vec{y} = \begin{pmatrix} e^{xyz} \\ x^2 z \\ xyz \end{pmatrix}$, $Y = \begin{pmatrix} x^2 yz & x y^2 z \\ \ln(xyz) & e^{x y^2 z} \end{pmatrix}$
   (b) $y = 5xyz$, $\vec{y} = \begin{pmatrix} \ln(xyz) \\ y \\ x\cos(z) \end{pmatrix}$, $Y = \begin{pmatrix} x^3 + x^2 + yz & \pi\cos(x + y + z) \\ xyz & 2\sin(\cos(x + y))\,z \end{pmatrix}$
2. Consider functions $f : \mathbb{R}^n \to \mathbb{R}^m$ and $g : \mathbb{R}^n \to \mathbb{R}^m$.
   (a) Show that for $\vec{x} \in \mathbb{R}^n$,
   \[
   \frac{d(f(\vec{x}) + g(\vec{x}))}{d\vec{x}} = \frac{df(\vec{x})}{d\vec{x}} + \frac{dg(\vec{x})}{d\vec{x}}
   \]
   (b) Show that for $\vec{x} \in \mathbb{R}^n$ and $a \in \mathbb{R}$,
   \[
   \frac{d\,a f(\vec{x})}{d\vec{x}} = a \frac{df(\vec{x})}{d\vec{x}}
   \]
3. The quadratic form $x^T A x$ is a form we will encounter often. In this question, we are interested in $\frac{dx^T A x}{dx}$. Assume that A is not a function of x.
   (a) Evaluate $x^T A x$ when $x = \begin{pmatrix} x_1 \\ x_2 \end{pmatrix}$ and the $(i, j)$th element of A is $A_{ij}$. Why do you think $x^T A x$ is called the quadratic form?
   (b) Which definition of the derivative do we need in order to evaluate $\frac{dx^T A x}{dx}$?
   (c) Assume $x \in \mathbb{R}^2$ and $A \in \mathbb{R}^{2\times 2}$. Evaluate $\frac{dx^T A x}{dx}$.
   (d) Generalize the previous result to when $x \in \mathbb{R}^n$ and $A \in \mathbb{R}^{n\times n}$ and evaluate $\frac{dx^T A x}{dx}$. Can you express the result in matrix form?
   (e) What happens when A is a symmetric matrix? (A symbolic check follows this list.)
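For comparison once you have worked parts (c)-(e), here is a SymPy sketch for the 2 × 2 case (the concrete A is an arbitrary choice of mine). It differentiates the expanded quadratic form directly and checks the result against $x^T(A + A^T)$, the closed form stated as identity 3(e) in Section 4:

    import sympy as sp

    x1, x2 = sp.symbols('x1 x2')
    x = sp.Matrix([x1, x2])
    A = sp.Matrix([[1, 2],
                   [3, 4]])          # arbitrary and deliberately non-symmetric

    q = (x.T * A * x)[0, 0]          # the scalar x^T A x
    print(sp.expand(q))              # x1**2 + 5*x1*x2 + 4*x2**2, quadratic in x

    # Row vector of partial derivatives, matching the layout used in Section 4.
    grad = sp.Matrix([[q.diff(x1), q.diff(x2)]])
    print(grad)                                 # Matrix([[2*x1 + 5*x2, 5*x1 + 8*x2]])
    print(sp.simplify(grad - x.T * (A + A.T)))  # Matrix([[0, 0]]): they agree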
2.3.1 Exercise
1. Given the notation as specified above, find the derivative of y, ⃗y, and Y w.r.t. X.
   (a) $y = 4x_3 + 3x_2 + 2x_1 + x_0$, $\vec{y} = \begin{pmatrix} e^{x_0 x_1} \\ e^{x_2 x_3} \end{pmatrix}$, $Y = \begin{pmatrix} \sin(x_0 + 2x_1) & 2x_1 + x_3 \\ 2x_0 + x_2 & \cos(2x_2 + x_3) \end{pmatrix}$ w.r.t. $X = \begin{pmatrix} x_0 & x_1 \\ x_2 & x_3 \end{pmatrix}$ (a worked sketch for this part's scalar y follows the exercise)
   (b) $y = \ln(x_5^2 x_4^3 x_3 x_2^2 x_1^0)$, $\vec{y} = \begin{pmatrix} \sin(x_5) + \cos(x_4 x_3) + x_2^2 + 2x_1 x_0 \\ e^{i\pi} + 1 \end{pmatrix}$, $Y = \begin{pmatrix} x_1 x_5 & 2x_4 x_0 \\ 2x_4 + x_1 & \tan(2x_2 + x_4) \\ \cot(x_0) & \csc(x_4 + x_1) \end{pmatrix}$ w.r.t. $X = \begin{pmatrix} x_0 & x_1 & x_2 \\ x_3 & x_4 & x_5 \end{pmatrix}$
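To make the scalar-by-matrix case concrete, here is a SymPy sketch for part (a)'s scalar y; arranging the entrywise partial derivatives in the shape of X is one common convention, and that arrangement choice is mine:

    import sympy as sp

    x0, x1, x2, x3 = sp.symbols('x0 x1 x2 x3')
    X = sp.Matrix([[x0, x1],
                   [x2, x3]])
    y = 4*x3 + 3*x2 + 2*x1 + x0

    # Differentiate the scalar y w.r.t. each entry of X, keeping X's shape.
    dy_dX = X.applyfunc(lambda entry: sp.diff(y, entry))
    print(dy_dX)  # Matrix([[1, 2], [3, 4]])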
3 Chain Rule
3.1 The basics
Recall that for $h(x) = f(g(x))$ the chain rule is
\[
\frac{dh}{dx} = \frac{df}{dg}\frac{dg}{dx}
\]
For the multivariate case $h(x) = f(g_1(x), g_2(x), g_3(x))$, the chain rule extends to
\[
\frac{dh}{dx} = \frac{\partial f}{\partial g_1}\frac{dg_1}{dx} + \frac{\partial f}{\partial g_2}\frac{dg_2}{dx} + \frac{\partial f}{\partial g_3}\frac{dg_3}{dx}
\]
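A quick symbolic check of this expansion (a SymPy sketch; the concrete choices of f and the g_i are mine):

    import sympy as sp

    x = sp.symbols('x')
    g1, g2, g3 = sp.sin(x), x**2, sp.exp(x)

    # Take f(g1, g2, g3) = g1*g2 + g3, so df/dg1 = g2, df/dg2 = g1, df/dg3 = 1.
    h = g1*g2 + g3
    direct = sp.diff(h, x)
    term_by_term = g2*sp.diff(g1, x) + g1*sp.diff(g2, x) + 1*sp.diff(g3, x)
    print(sp.simplify(direct - term_by_term))  # 0: both derivatives agree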
3.1.1 Exercises
Evaluate $\frac{\partial f}{\partial x}$ and $\frac{\partial f}{\partial y}$ for each of the following:

Our previous operations can be thought of as adding up all of the components that contribute to the change of h. Building on this, we can extend the chain rule to also work in matrix calculus. For a detailed proof of why the chain rule still holds in matrix calculus, please refer to reference 3.
3.1.2 Exercise
Consider $\vec{x} \in \mathbb{R}^p$, $\vec{y} \in \mathbb{R}^r$, $\vec{z} \in \mathbb{R}^n$. Which of the following are true?
\[
\frac{d\vec{z}}{d\vec{x}} = \frac{d\vec{z}}{d\vec{y}}\frac{d\vec{y}}{d\vec{x}}
\quad\text{or}\quad
\frac{d\vec{z}}{d\vec{x}} = \frac{d\vec{y}}{d\vec{x}}\frac{d\vec{z}}{d\vec{y}}
\]
Consider a scalar-valued function $J = f(g(\vec{w}))$, where $g$ maps the vector $\vec{w}$ to a scalar and $f$ is a scalar function. In such a case, we can apply the vector chain rule to obtain the following:
\[
\nabla J = \frac{\partial J}{\partial \vec{w}} = \nabla g(\vec{w}) \underbrace{f'(g(\vec{w}))}_{\text{scalar}}
\]
In this case, the order of multiplication does not matter, because one of the factors in the product is a scalar. Note that this result is used frequently in machine learning, because many loss functions in machine learning are computed by applying a scalar function $f(\cdot)$ to the dot product of $\vec{w}$ with a training point $\vec{a}$. In other words, we have $g(\vec{w}) = \vec{w} \cdot \vec{a}$. Note that $\vec{w} \cdot \vec{a}$ can also be written as $\vec{w}^T (I) \vec{a}$, where $I$ represents the identity matrix. This is in the form of one of the matrix identities listed in Section 4. In such a case, one can use the chain rule to obtain the following:
\[
\frac{\partial J}{\partial \vec{w}} = \underbrace{[f'(g(\vec{w}))]}_{\text{scalar}} \vec{a}
\]
This result is extremely useful: it can be used to compute the derivatives of the loss functions of many models, such as least-squares regression, SVMs, and logistic regression. The vector $\vec{a}$ is simply replaced with the vector of the training point at hand, and the function $f(\cdot)$ defines the specific form of the loss function for the model at hand.
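As an illustration, here is a numerical sketch of this pattern; the particular f, a logistic-style loss, is my own choice, and the chain-rule gradient $f'(\vec{w} \cdot \vec{a})\,\vec{a}$ is checked against finite differences:

    import numpy as np

    rng = np.random.default_rng(0)
    w = rng.normal(size=5)   # weight vector
    a = rng.normal(size=5)   # one training point

    f  = lambda z: np.log(1 + np.exp(-z))   # a logistic-style scalar loss
    fp = lambda z: -1 / (1 + np.exp(z))     # its derivative f'(z)

    grad = fp(w @ a) * a                    # chain rule: dJ/dw = f'(w . a) a

    # Finite-difference check, one coordinate at a time.
    eps = 1e-6
    numeric = np.array([(f((w + eps*e) @ a) - f((w - eps*e) @ a)) / (2*eps)
                        for e in np.eye(5)])
    print(np.max(np.abs(grad - numeric)))   # ~1e-10: the two gradients agree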
3.2.1 Exercise
1. Evaluate $\frac{d}{dx}\sigma(x)$, where $\sigma(x) = \frac{1}{1+e^{-x}}$. Is this function familiar? What is it commonly called? (A symbolic check follows this list.)
2. Express your answer to the previous question using only $\sigma(x)$.
3. Consider a weight vector $\vec{w}$ and a sample point $\vec{x}$. We perform the affine transformation $z = \vec{w}^T \vec{x}$ and then apply $\sigma(z)$. Find $\frac{\partial \sigma(z)}{\partial \vec{w}}$.
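Answers to these exercises can be checked symbolically; a minimal SymPy sketch, where the candidate closed form in the last line is one well-known possibility:

    import sympy as sp

    x = sp.symbols('x')
    sigma = 1 / (1 + sp.exp(-x))

    d = sp.simplify(sp.diff(sigma, x))
    print(d)                                   # closed form of sigma'(x)
    print(sp.simplify(d - sigma*(1 - sigma)))  # 0, so sigma' = sigma*(1 - sigma)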
4 Matrix Identities
Assume identity 3(a) and prove all other identities.
1. $\frac{\partial c}{\partial \vec{x}} = 0^T$
2. $\frac{\partial \vec{u}\vec{v}}{\partial \vec{x}} = \vec{u}\frac{\partial \vec{v}}{\partial \vec{x}} + \frac{\partial \vec{u}}{\partial \vec{x}}\vec{v}$
3. Note the change in notation. Let $u$, $v$, $x$ be variable column vectors, let $a$, $b$ be constant column vectors, and let $A$ be a constant matrix. Then:
   (a) $\frac{\partial u^T A v}{\partial x} = u^T A \frac{\partial v}{\partial x} + v^T A^T \frac{\partial u}{\partial x}$
   (b) $\frac{\partial u^T v}{\partial x} = u^T \frac{\partial v}{\partial x} + v^T \frac{\partial u}{\partial x}$
   (c) $\frac{\partial a^T x}{\partial x} = a^T$
   (d) $\frac{\partial b^T A x}{\partial x} = b^T A$
   (e) $\frac{\partial x^T A x}{\partial x} = x^T (A + A^T)$ (checked numerically after this list)
   (f) $\frac{\partial \lVert x \rVert^2}{\partial x} = 2x^T$
   (g) $\frac{\partial a^T u}{\partial x} = a^T \frac{\partial u}{\partial x}$
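These identities lend themselves to numerical spot checks. A sketch for identity 3(e) using finite differences (the random test setup is my own):

    import numpy as np

    rng = np.random.default_rng(1)
    n = 4
    A = rng.normal(size=(n, n))   # constant and deliberately non-symmetric
    x = rng.normal(size=n)

    f = lambda v: v @ A @ v       # the quadratic form v^T A v

    analytic = x @ (A + A.T)      # identity 3(e): the derivative x^T (A + A^T)

    eps = 1e-6
    numeric = np.array([(f(x + eps*e) - f(x - eps*e)) / (2*eps) for e in np.eye(n)])
    print(np.max(np.abs(analytic - numeric)))   # ~1e-9, i.e. they agree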
5 References
1. Weisstein, Eric W. "Kronecker Product." From MathWorld–A Wolfram Web Resource. https://mathworld.wolfram.com/KroneckerProduct.html
2. Taboga, Marco (2021). "Kronecker product", Lectures on matrix algebra. https://www.statlect.com/matrix-algebra/Kronecker-product