Lecture 4
Lecture 4
other things)
1 Where we are
• So far, we’ve stated the firm’s problem under our basic model of production,
max p · y
y∈Y
and shown some properties of the firm’s profit function π and supply correspondence Y ∗
• We’ve shown in general that optimal supply must satisfy the Law of Supply,
(p0 − p) · (y 0 − y) ≥ 0
• A big result that you hopefully all watched in the last week was the Envelope Theorem,
relating the derivative of an objective function to the derivative of the maximized value:
if V (t) is defined as maxx∈X f (x, t),
∂f
the Envelope Theorem says that whever V 0 exists, V 0 (t) = ∂t (x, t) for any x ∈ x∗ (t)
∂π ∂ X
= ( pj yj ) = yi |y=y∗ (p) = yi∗ (p)
∂pi ∂pi
j
y=y ∗ (p)
53
• For the special case where π is twice differentiable, and y(p) = Y ∗ (p) is therefore differentiable,
we’ve seen some additional properties having to do with the Jacobian of y
• With finite data, this boils down to the Weak Axiom of Profit Maximization,
or p · y ≥ p · y 0 for any two observations (p, y) and (p0 , y 0 ) in the data
• When π is differentiable and we have data from every possible price vector,
ratinoalizability is equivalent to π being convex and Hotelling’s Lemma holding
• (More specifically, we showed that if π and y are both observed and satisfy adding-up,
then the data is rationalizable if and only if π is convex and Hotelling’s Lemma holds.
54
2 (Skip in lecture – in notes for completeness)
• The result we saw last time:
2. π is convex
• (To prove this, we would let π(p) = p · y(p), so adding-up would automatically hold;
we’d then confirm that Hotelling holds and that π is convex,2 and invoke the main result.)
• (Something cool here: we know the profit function must be homogeneous of degree 1,
and the optimal supply correspondence must be homogeneous of degree 0
• If we observe both π and y, though, we don’t have to check – these conditions are redundant:
if π and y satisfy both Hotelling’s lemma and the adding-up condition,
they have to be homogeneous of the right degree
• But if we only observe y or π, we need to check that it’s homogeneous of the right degree)
1
To prove it, note that π is assumed to be homogeneous of degree 1, or π(λp) = λπ(p); differentiating this with
∂π ∂π
respect to λ gives p1 ∂p 1
(p) + . . . + pk ∂pk
(p) = π(p) which is exactly the adding-up condition since yi (p) was defined
∂π
as ∂pi (p).
2 ∂π
= yi (p) + kj=1 pj ∂p
∂yi
P
Differentiating π(p) = p · y(p) gives ∂p i j
. Since y is homogeneous of degree 0, 0 =
∂
P k ∂yi ∂π 2
y (λp) =
∂λ i j=1 pj ∂pj , so ∂pi = yi and Hotelling holds. Once we have Hotelling, Dp π(p) = Dp y(p), which is
positive semidefinite, so π is convex.
55
3 Let’s think about solving the firm’s problem
• Next, we’re going to think a little more about how we might actually go about solving a firm’s
profit maximization problem
max p · y
y∈Y
3.1 Graphically...
• If we’re given the production set graphically, then conceptually, this is pretty simple
(draw it in two dimensions)
• But that’s not usually how we’re going to be given a production set
• This is a function
T : Rk → R
• That is, T is weakly negative whenever y is feasible, and strictly positive when y is infeasible
• (Informally, I like to think of T being how many years in the future a given technology is.
If a technology is a positive number of years away in the future, it’s not feasible today;
if T (y) is zero or negative, the technology has already arrived.)
∂T /∂yi
M RTi,j (y) =
∂T /∂yj
as the extra amount of good j obtained for each unit reduction of good i (meaning an increase
in its use as an input or a decrease of its production)
or the slope of the boundary at y)
56
– (If i and j are both being used as inputs,
this is the rate at which you can substitute one for the other –
how many extra units of labor will replace a unit of something else
– If j is an output,
this is how much more of output j you can get by increasing your use of input i
– If they’re both outputs, it’s how much more of one you can get with the same inputs,
by sacrificing some of the other)
• But in practice, when we want to solve the firm’s problem (or prove sharper results),
we typically simplify the problem by considering firms with a single output
– In IO, multi-product firms face a more complicated problem because their products may
be substitutes for each other –
if Samsung gets more people to buy one model Galaxy,
fewer people will buy a different model –
but in our price-taking model, this doesn’t matter
– If you think of a large firm as having separate teams and maybe even separate factories
to produce its different products,
there’s little problem thinking of each product as a separate single-product firm
57
4 Single-Output Firms
• So now consider a single-output firm
f : Rm
+ → R+
where f (z) is the most output the firm can product with inputs z
(Note the signs – now inputs are positive, not negative)
• If our firm also has free disposal, we can link back to the old notation by observing that
58
• Similar to the Marginal Rate of Transformation,
we can define the Marginal Rate of Technical Substitution
∂f /∂z`
M RT Sk,` =
∂f /∂zk
as the ratio at which you can substitute input k for good ` while holding production constant
• If we draw “isoquants” in input space (the set of input combinations that give the same
output level), the MRTS is the slope
• Of course, if p > 0, the firm will maximize profits by setting q = f (z) (not throwing away
any of its possible output), so we could write this as
59
4.2 Decomposing the Firm’s Problem
• However, instead of just solving the firm’s problem all at once,
we can also decompose it into two pieces:
1. For each output level q, given input prices w, what is the cheapest way to produce q?
2. Taking that minimum cost to produce q as given,
what output level q maximizes revenue minus costs?
• Well, for one thing, the cost function will accord with lots of earlier economic intuition about
marginal costs, average costs, and stuff like that
• For another, the firm’s optimal choice of q sometimes stretches the assumptions of our model
60
• These are all interesting questions,
but they make it hard sometimes to work with the “simple” model and feel good about it
• On the other hand, however the firm chooses q – whether or not it fits our model well –
whatever q is and however it’s chosen,
the firm will still want to choose the cost-minimizing way to produce it
as its solution.
61
4.4 Properties of c(q, w) and Z ∗ (q, w)
• Cost Minimization has nearly the same structure as Profit Maximization:
in both cases, the firm is optimizing a function over some set –
either the production set Y ,
or the set of input vectors that generate enough output
• So the properties of c(q, w) and Z ∗ (q, w) will be almost exactly the same as the properties of
π(p) and Y ∗ (p),
and we won’t bother to re-do the proofs
• The main difference is that, since now we’re minimizing instead of maximizing,
the cost function will be concave in prices rather than convex,
and the Jacobian matrix of optimal supply choices will be negative semidefinite rather than
positive semidefinite
• Proposition.
62
• The Law of Supply also holds at a fixed level of output:
if z ∈ Z ∗ (q, w) and z 0 ∈ Z ∗ (q, w0 ) then
(z 0 − z) · (w0 − w) ≤ 0
so zi is decreasing in wi
• One result that doesn’t have an analog in what we’ve done so far:
Proposition. If f is concave, then c is convex in q (marginal costs are rising).
(This is trivial with one input, but a result with more than one;
it’s actually problem 1 on the next homework.)
• (On the homework, you’ll show that a convex production set implies a concave production
function,
and a concave production function implies a convex cost function.
Once you know this result, it’s the key piece of the Weitzman proof of separating hyperplanes
that I posted as “bonus material” on the Canvas site.)
• The result on rationalizability looks just like the case of profit maximization:
• Proposition. Let W ⊂ Rm
+ be an open, convex subset of the input price space.
63
4.6 solving it...
• Once we have the cost function, the firm’s profit maximization problem is just
• As you’ll show on the homework, if the production set is convex (or the production function
is concave), then the cost function is convex in q, so the problem
pq − c(q)
is concave, and so the first-order condition is sufficient: setting price equal to marginal cost,
the firm is maximizing profits
64
• We can rewrite firm profits as
c(q, w)
pq − c(q, w) = q p−
q
c
so we’re interested in the firm’s average cost function q
• A common assumption people like to make is that marginal costs are u-shaped –
decreasing at first, then increasing
3
Proof. If f has IRS, it means f (λz) ≥ λf (z) for λ > 1. So fix input prices w, and let z ∗ ∈ Z(q, w) be a
cost-minimizing way to produce q; by definition, c(q, w) = w · z ∗ . With increasing returns, f (λz ∗ ) ≥ λf (z ∗ ) = λq,
so if you need to produce λq, λz ∗ gives you at least enough output; this means c(λq, w) ≤ w · (λz ∗ ) = λc(q, w), so
c(λq) ∗ ∗
λq
≤ λw·z
λq
= w·z
q
= c(q)
q
. With decreasing returns, the proof is identical, just with all the statements we just
made holding for λ < 1 rather than λ > 1.
65
5 The Monopolist’s Problem
• Now, so far we’ve assumed firms are price takers – both for their inputs, and for their outputs
• We can also think of a firm that makes a unique product, and is able to set the price of
that product, but faces a downward-sloping demand curve – a tradeoff between price and the
quantity they’ll sell
• Assume the firm is still a price taker in inputs, so the cost minimization problem is unchanged,
and take c(q) as given
• There are two ways we can formulate the firm’s problem: choosing price, or choosing quantity
• First, imagine the firm sets price p, knowing that leads to demand Q(p)
• For the second formulation, the firm chooses quantity, knowing this determines the price it
will have to sell at:
we let P (q) be the inverse demand function
• This formulation is easier to work with; if both P and c are differentiable, we get the FOC
• Since P 0 is negative, this tells us that at the optimum, P > c0 , or price is greater than marginal
costs
• If c is convex, this implies the monopolist sells less than a price-taking firm with the same
technology
the first term is the change in infra-marginal profit – the reduction in the profits I get from
the other customers I’m selling to anyway if I lower my price to serve another customer – and
the second term is the profit I make on this new marginal customer
66
6 Motivating Monotone Comparative Statics
• Consider a firm maximizing profits, given a production set Y
• If the price of steel goes up, we know the firm will use less steel;
should we expect the firm to use more aluminum?
should we expect the firm to use fewer steel-piercing drill bits?
• On the other extreme, if we want results that will hold for any Y ,
the Law of Supply tells us something, but it leaves open a pretty wide range of possibilities
67
• In the late 1980s, Paul Milgrom and John Roberts looked closely at US manufacturing,4
and noticed that many firms were breaking away from the traditional, highly-specialized
production line (which had been around since Henry Ford)
and had made a number of changes to their production process:
introducing more flexible tools and programmable, multi-task production equipment;
producing output in smaller batches;
having shorter production cycles, smaller inventories, and lower levels of back orders
(summarized as “a general emphasis on speeding up all aspects of the firm’s operation”);
along with broader product lines, a widespread emphasis on higher quality,
and new organizational strategies and workforce management policies
• Milgrom and Roberts noted that these changes tended to happen together –
many firms had adopted all or most of these changes,
while very few firms had adopted a small subset of them –
so they concluded the changes were likely complementary to each other –
a firm that had adopted some, would be more likely to find it worthwhile to adopt others
• They argue that adoption was triggered by technological advances which effectively lowered
the costs of certain inputs, but not necessarily all of them –
focusing on reductions in the cost of data collecting and processing, computer-aided design,
and flexible manufacturing,
due to advances in computers, computer networks, and robots and other programmable pro-
duction equipment
4
Paul Milgrom and John Roberts (1990), “The Economics of Modern Manufacturing: Technology, Strategy and
Organization,” American Economic Review 80.3. Note that Theorems 7, 8 and 9 are not correct as published. If you
want to go deep into the weeds on this, see Bushnell and Shepard (1995), “The Economics of Modern Manufacturing:
Comment”; Donald Topkis (1995), “The Economics of Modern Manufacturing: Comment”; and Milgrom and Roberts
(1995), “The Economics of Modern Manufacturing: Reply,” all in the American Economic Review 85.4. Or just use
Milgrom and Roberts (1990) for context, motivation, and to understand the general setup.
68
• Milgrom and Roberts thus wanted a mathematical framework for considering these changes
which allowed for non-convexities in firms’ choice sets –
allowing for discrete jumps in technology, not just continuous incremental changes –
and they therefore needed mathematical tools which don’t depend on differentiability
• The approach they introduced has come to be known as Monotone Comparative Statics –
Comparative Statics being the question of how the solution to an optimization problem (like
a firm’s choice of production) changes in response to an exogenous parameter change (like a
falling input price),
and Monotone emphasizing that the approach focuses on predicting the sign of a change, but
not necessarily its magnitude
• Today, I’ll introduce a “baby version” of the Monotone Comparative Statics approach,
to give some intuition for how it works
69
7 Monotone Comparative Statics
7.1 General Setup – one dimension
• We’ll start with the simple case – a one-dimensional choice variable – and then move on to
the more interesting case of a multi-dimensional problem
• Like when we introduced the Envelope Theorem, we’ll start out with a generic parameterized
optimization problem
max g(x, t)
x∈X
• Like with the Envelope Theorem, the choice set X can be continuous or discrete –
so x could be how much output to produce, or how many factories to open
• We’ll let
x∗ (t) = arg max g(x, t)
x∈X
70
7.2 When is a set above another set?
• One complication is that the problems we’re considering won’t always have a unique solution
• So how do we say that one set of solutions is “greater than” another set?
• Let A, B ⊂ R be sets
A ≥SSO B
• This means that all the points that are in B but not A
are below all the points that are in both sets
which are below all the points that are in A and not B
• If A and B have just one element each, then A ≥SSO B just means a ≥ b –
so this really can be thought of as a generalization of “greater than”,
to allow for sets that have nontrivial overlap
• That’s how we’ll rank sets – and if we have a set defined as a function of a parameter, X(t),
we’ll say that X is increasing in t if it’s increasing via the strong set order – that is, if
whenever t0 > t
71
7.3 Increasing Differences
• We’ll say our objective function g has increasing differences
if the change in g when you move from a lower value of x to a higher value
is increasing in t
g(x0 , t) − g(x, t)
is weakly increasing in t; or
∂g ∂g
increasing in t or increasing in x
∂x ∂t
∂2g
≥ 0
∂x∂t
• Our main result for today will be that if g has increasing differences,
then x∗ (t) is increasing in t
72
7.4 Result – Topkis in One Dimension
Theorem (“Baby Topkis”). Let
where X ⊆ R. If g has increasing differences, then x∗ (t) is increasing in t via the Strong Set Order.
73
Proof of Topkis in one dimension
g(x, t) − g(x0 , t) ≥ 0
• Increasing differences, with x > x0 , means g(x, ·) − g(x0 , ·) is increasing in the parameter, so
74
7.5 How do we use the result?
• Consider a single-output firm with cost function c
• So now we’re guaranteed that q ∗ (p) is increasing in p (via the strong set order)
• If the firm has a unique optimal production level q, then q must be weakly increasing in p
75