Math Econ Lecture 5
Math Econ Lecture 5
Mathematical Economics
Lecture 5
1 / 35
Single Variable Optimization Partial Differentiation Multiple Variable Optimization
Unconstrained Optimization
Most Economic Models are about optimization, which is a key solution principle.
Assume f (x) is at least twice continuously differentiable and possesses a max and/or min.
2 / 35
Single Variable Optimization Partial Differentiation Multiple Variable Optimization
∗ ′ ∗ ∗ ′ ∗
f (x ) ≥ f (x) ⇒ f (x ) = 0 or f (x ) ≤ f (x) ⇒ f (x ) = 0
If f ′ (x∗ ) ̸= 0 then it is always possible to find feasible dx ̸= 0 such that dy > 0 or dy < 0
Because this condition is based on the first order derivative, it is also known as the first order
condition.
3 / 35
Single Variable Optimization Partial Differentiation Multiple Variable Optimization
4 / 35
If x∗ is a local min, then f ′ (x∗ ) = 0.
Single Variable Optimization Partial Differentiation Multiple Variable Optimization
Example:
claiming that x∗ maximizes the function might be wrong if it really gives a min or point of
inflexion.
′ ∗ ′′ ∗ ∗
f (x ) = 0 and f (x ) < 0 ⇒ x gives a max.
′ ∗ ′′ ∗ ∗
f (x ) = 0 and f (x ) > 0 ⇒ x gives a min.
Since we apply this by checking whether f ′′ (x∗ ) < 0 in case of a max, or f ′′ (x∗ ) > 0 in case of
a min, these inequalities are called second order conditions.
6 / 35
Single Variable Optimization Partial Differentiation Multiple Variable Optimization
To prove the above sufficiency, and also to see what happens when f ′′ (x∗ ) = 0, Taylor Series
Expansion around x∗ :
Thus,
′′ ∗ ∗ f ′′ (x∗ )(x − x∗ )2 ∗
f (x ) < 0 ⇒ f (x) = f (x ) + =⇒ f (x) < f (x ).
2!
| {z }
<0
′′ ∗ ∗ f ′′ (x∗ )(x − x∗ )2 ∗
f (x ) > 0 ⇒ f (x) = f (x ) + =⇒ f (x) > f (x ).
2!
| {z }
>0
If f ′′ (x∗ ) = 0, continue the expansion until the non-zero even higher-order derivative and apply
the same argument or the ”n’th derivative test”
7 / 35
Single Variable Optimization Partial Differentiation Multiple Variable Optimization
Economic Applications
Monopoly with Linear Demand and Costs:
Cost function:
C(q) = 25q.
Profit function:
π(q) = p(q)q − C(q) = (100 − q) × q − 25q = 75q − q 2 .
′ ∗ ∗
π (q) = 75 − 2q = 0 =⇒ 2q = 75,
′′ ∗
π (q) = −2 < 0 =⇒ q is the global max.
9 / 35
Single Variable Optimization Partial Differentiation Multiple Variable Optimization
Economic Applications
If a book publisher sets the book’s price and Inverse demand function is p(q) = 100 − q, and the
Cost function is C(q) = 25q.
If the author is paid a royalty of 10% of the book’s price, then her income is based on the book
sales:
2 2
Y (q) = 0.1p(q)q = (100 − q) × q = 0.1(100q − q ) = 10q − 0.1q .
2
max Y (q) = 10q − 0.1q .
q
10 / 35
Single Variable Optimization Partial Differentiation Multiple Variable Optimization
Economic Applications
2
max π(q) = 65q − 0.9q .
q
The conflict of interest arises because the publisher wants to maximize profit, while the author
wants to maximize sales revenue.
11 / 35
Single Variable Optimization Partial Differentiation Multiple Variable Optimization
Economic Applications
Recall ϵ as the price elasticity of demand (p′ (q) < 0 for downward sloping demand):
dq
q p dq p 1 p 1 q ′ 1
ϵ=− =− = − dp = − =⇒ p (q) = − .
dp q dp q q p′ (q) p ϵ
p dq
′ ′
R (q) = p(q) + qp (q)
q ′ 1
= p(1 + p (q)) = p(1 − )
p ϵ
= p(1 − 1/ϵ).
12 / 35
Single Variable Optimization Partial Differentiation Multiple Variable Optimization
Economic Applications
Competitive market supply and demand functions:
S = bPS , D = a0 − a1 PB , a0 , a1 , b > 0.
PS = PB − t where t ≥ 0 is a specific tax per unit bought and sold, i.e. $t per unit.
∗
Solve for the equilibrium PB :
∗ a0 b ∗ ∗
PB = + t, PS = PB − t,
a1 + b a1 + b
∗ ∗ ∗ a0 bt a1 a1 b
S = D = a0 − a1 PB = a0 − a1 + = a0 (1 − )− t,
a1 + b a1 + b a1 + b a1 + b
∗
So, equilibrium PB increases with the tax t, but the equilibrium quantity bought D ∗ and sold S ∗
∗
decreases with PB .
13 / 35
Single Variable Optimization Partial Differentiation Multiple Variable Optimization
Economic Applications
a1 a1 b a1 a1 b 2 2
R(t) = t a0 (1 − )− t)t = a0 (1 − )t − t = αt − βt ,
a1 + b a1 + b a1 + b a1 + b
where
a1 a1
α = a0 (1 − ) > 0, since a1 + b > a1 , 0 < < 1, a0 > 0, a1 > 0, b > 0,
a1 + b a1 + b
a1 b
β= > 0.
a1 + b
′ ∗ ∗ α
R (t) = α − 2βt = 0 =⇒ t = > 0, since α > 0, β > 0.
2β
′′ ∗
R (t ) = −2β < 0, since β > 0.
14 / 35
Single Variable Optimization Partial Differentiation Multiple Variable Optimization
Economic Applications
In fact, this is a strictly concave quadratic function with unique stationary value at t∗ = α/2β.
For this curve, if t > t∗ , then a reduction in the tax increases tax revenue, gladdening politicians
and taxpayers.
If a function is everywhere strictly concave (f ′′ (x) < 0 at all points in its domain) and has a
stationary value, then this is the unique global maximum .
If a function is everywhere strictly convex (f ′′ (x) > 0 at all points in its domain) and has a
stationary value, then this is the unique global minimum.
15 / 35
Single Variable Optimization Partial Differentiation Multiple Variable Optimization
Partial Differentiation
Continuity, differentiability, and differentiation rules also apply to functions of many variables,
which adds to the richness in the economic applications of many more economics variables:
y = f(x1 , x2 , . . . , xn ),
Example:
Consider y = f(x1 , x2 ).
If x2 is fixed at some value, and treat y as a function of only x1 , then we can compute the
derivative of f (x1 , x2 ) with respect to x1 as if f (x1 , x2 ) was a function of only x1 .
Similarly, we can do this for x2 changing while holding x1 fixed at some value.
Thus, we find a pair of first order partial derivatives, which are often simply referred to as the first
derivatives of f(x1 , x2 ) .
We can do this iteratively for any number of variables; i.e., for y = f(x1 , x2 , . . . , xn ).
16 / 35
Single Variable Optimization Partial Differentiation Multiple Variable Optimization
Partial Differentiation
Formally,
17 / 35
Single Variable Optimization Partial Differentiation Multiple Variable Optimization
Partial Differentiation
Graphically, partial derivatives show the slopes in the direction of change of the relevant variable:
18 / 35
Single Variable Optimization Partial Differentiation Multiple Variable Optimization
Since the partial derivative of y = f(x1 , x2 , . . . , xn ) with respect to any of the xi , say xj , is simply the rate of
change of f (or y) with respect to xj while holding all other variables xi , i ̸= j, fixed.
The same Differentiation rules for a function of a single variable apply to a function of multiple variables.
Example:
19 / 35
Single Variable Optimization Partial Differentiation Multiple Variable Optimization
Gradient
The first-order derivatives of f are frequently displayed as a vector, also known as the gradient vector.
Each element fi indicates the steepness or grade of the function as one ‘moves’ in the direction of xi .
Example:
f (x1 , x2 ) = 5 − 2x1 + 3x2 where f1 = -2, and f2 =3, so ∇f = [f1 f2 ] = [−2, 3]:
Note:
The gradient decent algorithm.
20 / 35
Single Variable Optimization Partial Differentiation Multiple Variable Optimization
However, sometimes it might be helpful to look at the special cases where such interaction
(compound) effects among the variables don’t exist.
= x1 + x 2 + . . . + x n
Pn
= gk (xi )
i=1
is additively separable.
21 / 35
Single Variable Optimization Partial Differentiation Multiple Variable Optimization
∂fi (x1 , x2 , . . . , xn )
fij ≡ , i, j = 1, 2, . . . , n.
∂xj
For n = 2, this gives
22 / 35
Single Variable Optimization Partial Differentiation Multiple Variable Optimization
23 / 35
Single Variable Optimization Partial Differentiation Multiple Variable Optimization
Economics Applications
Previous Example show there can be ‘interaction’ effects among the variables (i.e., the partial
derivative with respect to x2 depends on the levels of the other variables x1 and x3 :
∂y 2 3 6
= 20x1 x2 x3 .
∂x2
These interaction effects among the variables reflect important economic properties.
Example:
The Cobb-Douglas production function with labor L and capital K which are complimentary inputs
needed in production: Y = ALα K β , where A is the technological parameter, L, K > 0, α < 1, and
β < 1,
∂Y α−1 β
MP L = = αAL K , (1)
∂L
where the marginal product of labor L also depends on the K or the capital invested.
∂Y α β−1
MP K = = βAL K , (2)
∂K
where the marginal product of capital K also depends on the labor L tending to the capital invested.
24 / 35
Single Variable Optimization Partial Differentiation Multiple Variable Optimization
Economics Applications
Unconstrained Optimization
A function has extreme values (maxima and minima) and stationary values, which may not be the
same.
For functions of n variables y = f (x1 , x2 , ..., xn ), it is useful also to write these as f (x), where
x = [x1 , x2 , ..., xn ] is a point in Rn .
Need both necessary and sufficient conditions to ensure that functions’ stationary values give
optimum solutions (global extreme values).
26 / 35
Single Variable Optimization Partial Differentiation Multiple Variable Optimization
∗ ∗ ∗
f1 (x1 , x2 , ..., xn ) = 0
∗ ∗ ∗
f2 (x1 , x2 , ..., xn ) = 0
...............
∗ ∗ ∗
fn (x1 , x2 , ..., xn ) =0
28 / 35
Single Variable Optimization Partial Differentiation Multiple Variable Optimization
∗ ∗ ∗
dy = f1 (x )dx1 + f2 (x )dx2 + ... + fn (x )dxn ,
if any one fi (x∗ ) ̸= 0, then it is always possible to find a feasible dxi ̸= 0 such that dy ≷ 0 and
so the function cannot be at an extreme value.
To find an optimum using the first order conditions means having to solve a set of quite possibly
nonlinear simultaneous equations, which is not easy and sufficient.
This is why we prefer well-behaved economic models - first order conditions relatively easy to
solve, albeit insufficient.
29 / 35
Single Variable Optimization Partial Differentiation Multiple Variable Optimization
The first order conditions may give inflexion points or saddle points, which may not be the true
(local or global) max or min, so need second-order conditions for optimum.
Intuitively, if any small deviation in any direction away from a stationary value at x∗ , results in a
decrease in the value of the function, then x∗ must yield a local max.
Consider again the total differential of the function at the stationary point x∗ :
∗ ∗ ∗
dy = f1 (x )dx1 + f2 (x )dx2 + ... + fn (x )dxn = 0.
If d(dy) = d2 y = d[f1 (x∗ )dx1 + f2 (x∗ )dx2 + ... + fn (x∗ )dxn ] < 0, then the value of dy
must fall as we move away from x∗ in any direction, implying that dy is negative in a small
neighbourhood of x∗ .
This implies that the value of the function is decreasing as we move from x∗ in any direction.
30 / 35
Single Variable Optimization Partial Differentiation Multiple Variable Optimization
n n
X X
2 ∗ T
d y= fij (x )dxi dxj = dx Hdx < 0,
i=1 j=1
where dx = [dx1 , dx2 , ..., dxn ] and dxT is its transpose, and H is the Hessian matrix of
second order partial derivatives.
For concave and convex functions of n variables, if the function is globally strictly concave
(convex) and possesses a stationary value, then this must be the unique global max (min).
31 / 35
Single Variable Optimization Partial Differentiation Multiple Variable Optimization
Quadratic Forms
n n
X X
T
q(x) = x Ax = aij xi xj .
i=1 j=1
If A is not symmetric, then we can always define a symmetric matrix A∗ that yields the same quadratic
form as A.
If q(x) is positive for some x and negative for some other x, then A is indefinite.
32 / 35
Single Variable Optimization Partial Differentiation Multiple Variable Optimization
Economic Applications
Monopoly Price Discrimination:
A monopoly firm sells an identical good in two separate markets with no possibility of arbitrage.
The inverse demand functions for both markets are p1 = 12 − q1 , and p2 = 10 − 2q2 , where pi are
prices, qi are quantities, and its cost function is C = 1
2 (q1 + q2 )2 . Profit function:
1 2 2 2 1 2
π(q1 , q2 ) = p1 q1 +p2 q2 −C = (12−q1 )q1 +(10−2q2 )q2 − (q1 +q2 ) = 12q1 +10q2 −q1 −2q2 − (q1 +q2 ) .
2 2
∂π ∗ ∗ ∗ ∗ ∗ ∗ ∗ ∗ ∗
= π1 (q1 , q2 ) = 12 − 2q1 − (q1 + q2 ) = 12 − 3q1 − q2 = 0 =⇒ q2 = 12 − 3q1 ,
∂q1
∂π ∗ ∗ ∗ ∗ ∗ ∗ ∗
= π2 (q1 , q2 ) = 10 − 4q2 − (q1 + q2 ) = 10 − q1 − 5q2 = 0,
∂q2
as the necessary condition for profit maximization, which implies equating marginal revenues, since the
marginal costs of the two outputs are identical at (q1∗ + q2∗ ).
Optimal outputs are q1∗ = 3.57, q2∗ = 1.29, so the firm sells more in market 1. Optimal prices
are p∗ ∗ ∗ ∗
1 = 12 − q1 = 8.43, p2 = 10 − 2q2 = 7.43, so price is higher in market 1.
Economic Applications
∂2 π ∂2 π
h i
∂π1 ∂π1
∂q 2 ∂q1 2 ∂q1 ∂q2
−3 −1
H = 1 = = ,
∂2 π ∂2 π ∂π2 ∂π2
−1 −5
∂q2 1 ∂q 2 ∂q1 ∂q2
2
where π1 (q1 , q2 ) = 12 − 3q1 − q2 , π2 (q1 , q2 ) = 10 − q1 − 5q2 , and π12 (q1 , q2 ) = π21 (q1 , q2 ) by Young’s
Theorem.
Then test for Hessian matrix of second order partial derivatives H’s negative definiteness:
Recall the a quadratic form q(x) = xT Ax, where n × n matrix A, and an n × 1 vector x for all x ̸= 0.
h i
1
Since H is 2 × 2, just need any 2 × 1 vector x, with x ̸= 0, such as x = .
0
h ih i h i h i
T −3 −1 1 1 1
x Hx = [1, 0] = [1×(−3)+0×(−1), 1×(−1)+0×(−5)] = [−3, −1] =
−1 −5 0 0 0
(3)
Since xT Hx < 0, for x ̸= 0, then H is a negative definite matrix, so π ∗ (q1∗ , q2∗ ) is the true maximum.
34 / 35
Single Variable Optimization Partial Differentiation Multiple Variable Optimization
∗ ∗ p∗ 1 − 1/ϵ2
M R1 = M R2 =⇒ p1 (1 − 1/ϵ1 ) = p2 (1 − 1/ϵ2 ) =⇒ 1
= . (4)
p∗2 1 − 1/ϵ1
This says that the market with the lower demand elasticity ϵ1 (demand not as responsive to price
changes) will have the higher price p∗
1 , with q1 = 12 − p1 , and q2 = 5 − 2 p2 .
1
Recall the monopolist always produces where ϵi > 1, which is confirmed at the 2 markets’
corresponding price points p∗ ∗
1 = 8.43, and p2 = 7.43:
At p∗
1 in Market 1:
dq1 p∗ 8.43
ϵ1 = − 1
= −(−1) × = 2.36 > 1. (5)
dp1 q1∗ 3.57
At p∗
2 in Market 2:
dq2 p∗ 1 7.43
ϵ2 = − 2
= −(− ) × = 2.89 > 1. (6)
dp2 q2∗ 2 1.29