Lecture Note 3 - Introduction to Vector and Matrix Differentiation
In this note we expand on Verbeek (2004, Appendix A.7) on matrix differentiation. We first present the conventions for derivatives of scalar and vector functions; we then present the derivatives of a number of special functions particularly useful in econometrics; and, finally, we apply the ideas to derive the ordinary least squares (OLS) estimator in a linear regression model. It should be emphasized that this note is cursory reading; the particular results needed in this course are indicated with a (∗).
Outline
§1 Conventions for Scalar Functions
§2 Conventions for Vector Functions
§3 Some Special Functions
§4 The Linear Regression Model
1 Conventions for Scalar Functions
Let $\beta = (\beta_1, \ldots, \beta_k)'$ be a $k \times 1$ vector and let $f(\beta) = f(\beta_1, \ldots, \beta_k)$ be a real-valued function that depends on $\beta$, i.e. $f(\cdot): \mathbb{R}^k \mapsto \mathbb{R}$ maps the vector $\beta$ into a single number, $f(\beta)$. Then the derivative of $f(\cdot)$ with respect to $\beta$ is defined as
\[
\frac{\partial f(\beta)}{\partial \beta} =
\begin{pmatrix}
\frac{\partial f(\beta)}{\partial \beta_1} \\
\vdots \\
\frac{\partial f(\beta)}{\partial \beta_k}
\end{pmatrix}.
\tag{1}
\]
This is a $k \times 1$ column vector with typical elements given by the partial derivatives $\partial f(\beta)/\partial \beta_i$. Sometimes this vector is referred to as the gradient. It is useful to remember that the derivative of a scalar function with respect to a column vector gives a column vector as the result. (Note that Wooldridge (2006, p. 815) does not follow this convention, and lets $\partial f(\beta)/\partial \beta$ be a row vector.)
Similarly, the derivative of a scalar function with respect to the row vector $\beta'$ yields the $1 \times k$ row vector
\[
\frac{\partial f(\beta)}{\partial \beta'} =
\begin{pmatrix}
\frac{\partial f(\beta)}{\partial \beta_1} & \cdots & \frac{\partial f(\beta)}{\partial \beta_k}
\end{pmatrix}.
\tag{2}
\]

2 Conventions for Vector Functions

Now let $f(\beta) = (f_1(\beta), \ldots, f_n(\beta))'$ be an $n \times 1$ vector function of the $k \times 1$ vector $\beta$, i.e. $f(\cdot): \mathbb{R}^k \mapsto \mathbb{R}^n$. The derivative of $f(\beta)$ with respect to the row vector $\beta'$ is then defined as the matrix
\[
\frac{\partial f(\beta)}{\partial \beta'} =
\begin{pmatrix}
\frac{\partial f_1(\beta)}{\partial \beta_1} & \cdots & \frac{\partial f_1(\beta)}{\partial \beta_k} \\
\vdots & \ddots & \vdots \\
\frac{\partial f_n(\beta)}{\partial \beta_1} & \cdots & \frac{\partial f_n(\beta)}{\partial \beta_k}
\end{pmatrix},
\]
where each row, $i = 1, 2, \ldots, n$, contains the derivative of the scalar function $f_i(\cdot)$ with respect to the elements in $\beta'$. The result is therefore an $n \times k$ matrix of derivatives with typical element $(i, j)$ given by $\partial f_i(\beta)/\partial \beta_j$. If the vector function is defined as a row vector, it is natural to take the derivative with respect to the column vector, $\beta$.
We can note that it holds in general that
\[
\frac{\partial \left( f(\beta)' \right)}{\partial \beta} = \left( \frac{\partial f(\beta)}{\partial \beta'} \right)',
\tag{3}
\]
which in the case above is a $k \times n$ matrix.
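To make the shape conventions concrete, the following sketch (an added illustration, not part of the original note; it assumes NumPy and uses a made-up vector function) approximates the $n \times k$ derivative matrix by central differences and illustrates the transpose relation in (3).

```python
import numpy as np

# A hypothetical vector function f: R^2 -> R^3, used only for illustration.
def f(beta):
    b1, b2 = beta
    return np.array([b1**2, b1 * b2, np.sin(b2)])

def jacobian(func, beta, h=1e-6):
    """Approximate df(beta)/dbeta' (an n x k matrix) by central differences."""
    k = beta.size
    n = func(beta).size
    J = np.zeros((n, k))
    for j in range(k):
        step = np.zeros(k)
        step[j] = h
        J[:, j] = (func(beta + step) - func(beta - step)) / (2 * h)
    return J

beta = np.array([1.0, 2.0])
J = jacobian(f, beta)   # n x k: row i holds the derivatives of f_i
print(J.shape)          # (3, 2)
# Relation (3): the derivative of f(beta)' with respect to beta is the transpose, a k x n matrix.
print(J.T.shape)        # (2, 3)
```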
Applying the conventions in (1) and (2) we can define the Hessian matrix of second derivatives of a scalar function $f(\beta)$ as
\[
\frac{\partial^2 f(\beta)}{\partial \beta \, \partial \beta'} = \frac{\partial}{\partial \beta'}\left( \frac{\partial f(\beta)}{\partial \beta} \right) =
\begin{pmatrix}
\frac{\partial^2 f(\beta)}{\partial \beta_1 \partial \beta_1} & \cdots & \frac{\partial^2 f(\beta)}{\partial \beta_1 \partial \beta_k} \\
\vdots & \ddots & \vdots \\
\frac{\partial^2 f(\beta)}{\partial \beta_k \partial \beta_1} & \cdots & \frac{\partial^2 f(\beta)}{\partial \beta_k \partial \beta_k}
\end{pmatrix},
\]
which is a $k \times k$ matrix with typical element $(i, j)$ given by the second derivative $\partial^2 f(\beta)/(\partial \beta_i \, \partial \beta_j)$. Note that it does not matter if we first take the derivative with respect to the column or the row.
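The gradient and Hessian conventions can also be checked numerically. The sketch below is an added illustration (it assumes NumPy and uses an arbitrary scalar function that does not appear in the note): it approximates the $k \times 1$ gradient and the $k \times k$ Hessian by central differences and confirms that the order of differentiation does not matter, i.e. that the Hessian is (numerically) symmetric.

```python
import numpy as np

# Illustrative scalar function g: R^3 -> R.
def g(beta):
    return beta[0]**2 + 3 * beta[0] * beta[1] + np.exp(beta[2])

def gradient(func, beta, h=1e-6):
    """Approximate the k x 1 gradient by central differences."""
    k = beta.size
    grad = np.zeros(k)
    for i in range(k):
        e = np.zeros(k)
        e[i] = h
        grad[i] = (func(beta + e) - func(beta - e)) / (2 * h)
    return grad

def hessian(func, beta, h=1e-4):
    """Approximate the k x k Hessian by differencing the gradient."""
    k = beta.size
    H = np.zeros((k, k))
    for j in range(k):
        e = np.zeros(k)
        e[j] = h
        H[:, j] = (gradient(func, beta + e) - gradient(func, beta - e)) / (2 * h)
    return H

beta = np.array([1.0, -0.5, 0.2])
H = hessian(g, beta)
# The order of differentiation does not matter: H is symmetric up to numerical error.
print(np.max(np.abs(H - H.T)) < 1e-4)
```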
3 Some Special Functions

First, let $c$ be a $k \times 1$ vector of constants and define the scalar function
\[
f(\beta) = c'\beta = c_1 \beta_1 + c_2 \beta_2 + \cdots + c_k \beta_k.
\]
Applying the conventions above, it holds that
\[
\frac{\partial (c'\beta)}{\partial \beta} = c
\tag{4∗}
\]
and
\[
\frac{\partial (c'\beta)}{\partial \beta'} = c'.
\tag{5}
\]
Next, let $A$ be an $n \times k$ matrix of constants and define the $n \times 1$ vector function $f(\beta) = A\beta$. Then it holds that
\[
\frac{\partial (A\beta)}{\partial \beta'} = A.
\tag{6∗}
\]
To see this, write the function as
\[
f(\beta) = A\beta =
\begin{pmatrix}
a_{11}\beta_1 + a_{12}\beta_2 + \cdots + a_{1k}\beta_k \\
\vdots \\
a_{n1}\beta_1 + a_{n2}\beta_2 + \cdots + a_{nk}\beta_k
\end{pmatrix}
\]
and find the derivative
\[
\frac{\partial (A\beta)}{\partial \beta'} =
\begin{pmatrix}
\frac{\partial (a_{11}\beta_1 + \cdots + a_{1k}\beta_k)}{\partial \beta_1} & \cdots & \frac{\partial (a_{11}\beta_1 + \cdots + a_{1k}\beta_k)}{\partial \beta_k} \\
\vdots & \ddots & \vdots \\
\frac{\partial (a_{n1}\beta_1 + \cdots + a_{nk}\beta_k)}{\partial \beta_1} & \cdots & \frac{\partial (a_{n1}\beta_1 + \cdots + a_{nk}\beta_k)}{\partial \beta_k}
\end{pmatrix}
=
\begin{pmatrix}
a_{11} & \cdots & a_{1k} \\
\vdots & \ddots & \vdots \\
a_{n1} & \cdots & a_{nk}
\end{pmatrix}
= A.
\]
Finally, let $A$ be a $k \times k$ matrix and consider the quadratic form $f(\beta) = \beta' A \beta$, which is a scalar function of $\beta$. Its derivative with respect to $\beta$ is the $k \times 1$ vector
\[
\frac{\partial (\beta' A \beta)}{\partial \beta} = (A + A')\beta,
\tag{7}
\]
which for the special case of a symmetric matrix $A$ reduces to
\[
\frac{\partial (\beta' A \beta)}{\partial \beta} = 2A\beta.
\tag{8∗}
\]
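As a quick numerical illustration of the rules (4∗), (6∗) and (8∗), one may compare them with finite-difference approximations. The sketch below is an added example assuming NumPy; the vectors and matrices are randomly generated and do not come from the note itself.

```python
import numpy as np

rng = np.random.default_rng(0)
k, n = 3, 4
c = rng.normal(size=k)         # k x 1 vector of constants
A = rng.normal(size=(n, k))    # n x k matrix of constants
S = rng.normal(size=(k, k))
S = S + S.T                    # symmetric k x k matrix
beta = rng.normal(size=k)
h = 1e-6

def num_grad(func, b):
    """Central-difference gradient of a scalar function (k x 1)."""
    g = np.zeros_like(b)
    for i in range(b.size):
        e = np.zeros_like(b)
        e[i] = h
        g[i] = (func(b + e) - func(b - e)) / (2 * h)
    return g

# (4*): d(c'beta)/dbeta = c
print(np.allclose(num_grad(lambda b: c @ b, beta), c))

# (6*): d(A beta)/dbeta' = A, approximated column by column
J = np.column_stack([(A @ (beta + h * np.eye(k)[:, j]) - A @ (beta - h * np.eye(k)[:, j])) / (2 * h)
                     for j in range(k)])
print(np.allclose(J, A))

# (8*): d(beta' S beta)/dbeta = 2 S beta for S symmetric
print(np.allclose(num_grad(lambda b: b @ S @ b, beta), 2 * S @ beta))
```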
4 The Linear Regression Model
To illustrate the use of matrix differentiation consider the linear regression model in matrix
notation,
\[
y = X\beta + \varepsilon,
\]
where $y$ is an $n \times 1$ vector of stacked left-hand-side variables, $X$ is an $n \times k$ matrix of explanatory variables, $\beta$ is a $k \times 1$ vector of parameters to be estimated, and $\varepsilon$ is an $n \times 1$ vector of error terms. Here $k$ is the number of explanatory variables and $n$ is the number of observations.
One way to motivate the ordinary least squares (OLS) principle is to choose the estimator, $\hat{\beta}$, as the value of $\beta$ that minimizes the sum of squared residuals, i.e.
\[
\hat{\beta} = \arg\min_{\beta} \sum_{i=1}^{n} \varepsilon_i^2 = \arg\min_{\beta} \varepsilon'\varepsilon.
\]
Inserting $\varepsilon = y - X\beta$, the sum of squared residuals can be written as
\[
\begin{aligned}
\varepsilon'\varepsilon &= (y - X\beta)'(y - X\beta) \\
&= \left( y' - \beta'X' \right)(y - X\beta) \\
&= y'y - y'X\beta - \beta'X'y + \beta'X'X\beta \\
&= y'y - 2y'X\beta + \beta'X'X\beta,
\end{aligned}
\]
where the last line uses the fact that $y'X\beta$ and $\beta'X'y$ are identical scalar variables.
Note that $\varepsilon'\varepsilon$ is a scalar function and taking the first derivative with respect to $\beta$ yields the $k \times 1$ vector
\[
\frac{\partial (\varepsilon'\varepsilon)}{\partial \beta} = \frac{\partial \left( y'y - 2y'X\beta + \beta'X'X\beta \right)}{\partial \beta} = -2X'y + 2X'X\beta,
\]
where we have used the results in (4∗) and (8∗) for $A = X'X$ symmetric. Solving the $k$ equations
\[
\left. \frac{\partial (\varepsilon'\varepsilon)}{\partial \beta} \right|_{\beta = \hat{\beta}} = -2X'y + 2X'X\hat{\beta} = 0
\]
yields the OLS estimator
\[
\hat{\beta} = (X'X)^{-1} X'y,
\]
provided that $X'X$ is non-singular.
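As an added illustration (assuming NumPy and simulated data, none of which appear in the original note), the estimator can be computed directly from this formula and cross-checked against a standard least-squares routine. In practice it is numerically preferable to solve the normal equations $X'X\beta = X'y$ rather than to form the inverse explicitly.

```python
import numpy as np

rng = np.random.default_rng(1)
n, k = 100, 3
X = np.column_stack([np.ones(n), rng.normal(size=(n, k - 1))])  # design matrix with a constant
beta_true = np.array([1.0, 2.0, -0.5])
y = X @ beta_true + rng.normal(size=n)

# OLS estimator beta_hat = (X'X)^{-1} X'y, computed by solving the normal equations.
beta_hat = np.linalg.solve(X.T @ X, X.T @ y)

# Cross-check against NumPy's least-squares routine.
beta_lstsq, *_ = np.linalg.lstsq(X, y, rcond=None)
print(np.allclose(beta_hat, beta_lstsq))
```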
To make sure that $\hat{\beta}$ is a minimum of $\varepsilon'\varepsilon$ and not a maximum, we should formally ensure that the matrix of second derivatives is positive definite. The $k \times k$ Hessian matrix of second derivatives is given by
\[
\frac{\partial^2 (\varepsilon'\varepsilon)}{\partial \beta \, \partial \beta'} = \frac{\partial \left( -2X'y + 2X'X\beta \right)}{\partial \beta'} = 2X'X,
\]
which is a positive definite matrix by construction whenever $X'X$ is non-singular, i.e. whenever $X$ has full column rank.
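A short numerical check (again an added sketch assuming NumPy and an arbitrary full-column-rank design matrix) confirms that the Hessian $2X'X$ has strictly positive eigenvalues and is therefore positive definite.

```python
import numpy as np

rng = np.random.default_rng(2)
n, k = 100, 3
X = np.column_stack([np.ones(n), rng.normal(size=(n, k - 1))])

H = 2 * X.T @ X  # Hessian of the sum of squared residuals

# All eigenvalues strictly positive => positive definite (X has full column rank here).
print(np.all(np.linalg.eigvalsh(H) > 0))
```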
References
Verbeek, M. (2004): A Guide to Modern Econometrics. John Wiley & Sons, 2nd edn.