0% found this document useful (0 votes)

148 views135 pages

Linear Optimization Geometry Guide

Uploaded by

Udit Jethva

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

148 views135 pages

Linear Optimization Geometry Guide

Uploaded by

Udit Jethva

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

ISyE/Math/CS/Stat 525

Linear Optimization

2. The geometry of linear programming

Prof. Alberto Del Pia

University of Wisconsin-Madison

Based on the book Introduction to Linear Optimization by D. Bertsimas and J.N. Tsitsiklis
Outline

Sec. 2.1 We define a polyhedron as a set described by a finite number

of linear equality and inequality constraints.
Sec. 2.2 We study the basic geometric properties of polyhedra, with
emphasis on their “corner points”.
Sec. 2.3 We focus on the case where the feasible set is in the standard
form {x | Ax = b, x ≥ 0}.
Sec. 2.4 We study what happens when corner points arise in a
degenerate manner.
Sec. 2.5 We see under which conditions a polyhedron has corner points.
Sec. 2.6 We see that in this case, the search for optimal solutions to
LP problems can be restricted to corner points.
Sec. 2.7 We provide an alternative representation of polyhedra.
Sec. 2.8 We present the oldest method for solving LP problems.
General Optimization Problem
Consider the optimization problem
minimize f(x)
subject to x ∈ S,
where f : Rn → R and S ⊆ Rn .

Definition
The problem is infeasible if S = ∅.

Definition
The problem is unbounded if for each real number K there
exists x ∈ S such that f(x) < K.

Definition
The problem has an optimal solution if there exists x∗ ∈ S
such that f(x∗ ) ≤ f(x) for all x ∈ S.
General Optimization Problem
▶ In general, an optimization problem can be neither infeasible,
nor unbounded, nor have an optimal solution.
▶ Example:
minimize 1/x
subject to x ≥ 1.
LP Problem
Consider the LP problem
minimize c′ x
subject to Ax ≥ b,
where c ∈ Rn , b ∈ Rm and A ∈ Rm×n .

We will prove the following fundamental property:

Property 1:
Any LP problem is either infeasible, or it is unbounded, or it
has an optimal solution.
Local vs Global optimal solutions
Consider the optimization problem
minimize f(x)
subject to x ∈ S,
where f : Rn → R and S ⊆ Rn .

Definition
A vector x∗ ∈ S is called a (global) optimal solution if

f(x∗ ) ≤ f(x) for all x ∈ S.

Definition
A vector x∗ ∈ S is called a local optimal solution if ∃ϵ > 0
and a neighborhood N(x∗ , ϵ) = {x ∈ Rn : ∥x∗ − x∥ ≤ ϵ} of
x∗ such that f(x∗ ) ≤ f(x) for all x ∈ S ∩ N(x∗ , ϵ).
Local vs Global optimal solutions
Consider the optimization problem
minimize f(x)
subject to x ∈ S,
where f : Rn → R and S ⊆ Rn .

There can be local optimal solutions that are NOT global

optimal solutions.
Local vs Global optimal solutions
Consider the LP problem
minimize c′ x
subject to Ax ≥ b,

where c ∈ Rn , b ∈ Rm and A ∈ Rm×n .

We will prove the following fundamental property:

Property 2:
In a LP problem each local optimal solution is also a global
optimal solution.
2.1 Polyhedra and convex sets
Hyperplanes, halfspaces, and polyhedra
Hyperplanes and halfspaces

Definition 2.3
Let a be a nonzero vector in Rn and
let b be a scalar.
(a) The set {x ∈ Rn | a′ x = b} is
called a hyperplane.
(b) The set {x ∈ Rn | a′ x ≥ b} is
called a halfspace.

▶ A hyperplane is the boundary of a corresponding halfspace.

▶ A hyperplane is equal to the intersection of two halfspaces.
▶ The vector a in the definition of the hyperplane is
perpendicular to the hyperplane itself. Exercise: prove it!
Polyhedra

Definition 2.1
A polyhedron is a set that can be described in the form

{x ∈ Rn | Ax ≥ b},

where A is an m × n matrix and b is a vector in Rm .

▶ A polyhedron is the intersection of a finite number of

halfspaces.
Polyhedra

Definition 2.1
A polyhedron is a set that can be described in the form

{x ∈ Rn | Ax ≥ b},

where A is an m × n matrix and b is a vector in Rm .

▶ A polyhedron is the intersection of a finite number of

halfspaces.
Polyhedra

▶ As discussed in Section 1.1, the feasible set of any LP problem

can be described by inequality constraints of the form Ax ≥ b,
and is therefore a polyhedron.
▶ A set of the form

{x ∈ Rn | Ax = b, x ≥ 0}

is also a polyhedron, in a standard form representation.

Bounded Sets and Polytopes
A set can either “extend to infinity” or be confined in a finite
region.

Definition 2.2
A set S ⊂ Rn is bounded if there exists a constant K such
that the absolute value of every component of every element
of S is less than or equal to K.

O K x1
Bounded Sets and Polytopes
A set can either “extend to infinity” or be confined in a finite
region.

Definition 2.2
A set S ⊂ Rn is bounded if there exists a constant K such
that the absolute value of every component of every element
of S is less than or equal to K.

Definition
A bounded polyhedron is called a polytope.
Why is LP special?
[spoilers ahead...]

Property 1:
Any LP problem is either infeasible, or it is unbounded, or it
has an optimal solution.

Property 2:
In a LP problem each local optimal solution is also a global
optimal solution.
Why is LP special?
[spoilers ahead...]

Property 1:
Any LP problem is either infeasible, or it is unbounded, or it
has an optimal solution.

Property 2:
In a LP problem each local optimal solution is also a global
optimal solution.

Property 3:
If the feasible region of a LP problem is a nonempty polytope,
then there exists an optimal solution that is a ‘corner point’.
2.2 Extreme points, vertices, and basic feasible
solutions
Extreme points, vertices, and basic feasible solutions

▶ We observed in Section 1.4 that an optimal solution to a LP

problem tends to occur at a “corner” of the polyhedron over
which we are optimizing.
▶ We now introduce three different ways of defining the concept
of a “corner”.
▶ We then show that all three definitions are equivalent.
Extreme points

Definition 2.6
Let P be a polyhedron. A vector x ∈ P is an extreme point of
P if we cannot find two vectors y, z ∈ P, both different from x,
and a scalar λ ∈ [0, 1], such that x = λy + (1 − λ)z.

▶ This definition is entirely

geometric: It depends on P,
but it does not depend on
the (algebraic)
representation of P.
Vertices

Definition 2.7
Let P be a polyhedron. A vector x ∈ P is a vertex of P if
there exists some c such that c′ x < c′ y for all y satisfying
y ∈ P and y ̸= x.

▶ A vertex of a polyhedron P
is the unique optimal
solution to some LP
problem with feasible set P.

▶ Also this definition is entirely geometric.

▶ We would like to have a definition which reduces to an
algebraic test.
Active constraints in x∗ ∈ Rn
Consider a polyhedron P ⊂ Rn defined in terms of the linear
equality and inequality constraints

a′i x ≥ bi , i ∈ M1 ,
a′i x ≤ bi , i ∈ M2 ,
a′i x = bi , i ∈ M3 ,

where M1 , M2 , and M3 are finite index sets, each ai is a vector in

Rn , and each bi is a scalar.

Definition 2.8
If a vector x∗ satisfies a′i x∗ = bi for i
in M1 , M2 , or M3 , we say that
constraint i is active at x∗ .

Example: P = {(x1 , x2 , x3 ) | x1 + x2 + x3 = 1, x1 , x2 , x3 ≥ 0}.

Basic (feasible) solutions

Definition 2.9
Consider a polyhedron P defined by linear equality and
inequality constraints, and let x∗ be an element of Rn .
(a) The vector x∗ is a basic solution if:
(i) All equality constraints are active;
(ii) Out of the constraints that are active at x∗ , there are n
of them that are linearly independent.
(b) If x∗ is a basic solution that satisfies all of the
constraints, we say that it is a basic feasible solution.

▶ We say that certain constraints a′i x ∼ bi are linearly

independent, meaning that the corresponding vectors ai are
linearly independent.
Basic (feasible) solutions

Example:
P = {(x1 , x2 , x3 ) : x1 + x2 + x3 = 1, x1 , x2 , x3 ≥ 0}.
Basic (feasible) solutions

▶ If the number m of constraints used to define a polyhedron

P ⊂ Rn is less than n, the number of active constraints at any
given point must also be less than n, and there are no basic or
basic feasible solutions.
Equivalence of definitions
▶ We have given three different definitions that are meant to
capture the same concept.
▶ Two of them are geometric (extreme point, vertex) and the
third is algebraic (basic feasible solution).
▶ All three definitions are equivalent and, for this reason, the
three terms can be used interchangeably.

Theorem 2.3
Let P be a nonempty polyhedron and let x∗ ∈ P. Then, the
following are equivalent:
(a) x∗ is a vertex;
(b) x∗ is an extreme point;
(c) x∗ is a basic feasible solution.
Equivalence of definitions
▶ To prove Theorem 2.3 we need the following result.

Theorem 2.2 (edited)

Let a′i x = bi , i ∈ I, be constraints in Rn . The following are
equivalent:
(a) There exist n constraints among a′i x = bi , i ∈ I, which are
linearly independent.
(b) The span of the vectors ai , i ∈ I, is all of Rn , that is, every
element of Rn can be expressed as a linear combination of
the vectors ai , i ∈ I.
(c) The system of equations a′i x = bi , i ∈ I, has a unique
solution.

Proof: Exercise. You can prove Theorem 2.2 using basic facts from
linear algebra (Theorem 1.3(a) in Section 1.5).
Now let’s prove Theorem 2.3!
Equivalence of definitions

▶ Theorem 2.3 states that a vector is a basic feasible solution if

and only if it is an extreme point.
▶ We have seen that the definition of an extreme point does not
depend on the representation of a polyhedron.
▶ We conclude that the property of being a basic feasible
solution is also independent of the representation of the
polyhedron.
Equivalence of definitions

▶ Theorem 2.3 states that a vector is a basic feasible solution if

Corollary 2.1
Given a finite number of linear inequality constraints, there
can only be a finite number of basic or basic feasible
solutions.

Proof.
▶ Consider a system of m linear inequality constraints in Rn .
▶ At any basic solution, there are n linearly independent
active constraints that define it.
▶ Therefore, the number of basic solutions is bounded above
by the number of ways that we can choose n constraints
out of m.
▶ This number is finite.
Equivalence of definitions

Corollary 2.1
Given a finite number of linear inequality constraints, there
can only be a finite number of basic or basic feasible
solutions.

▶ Although the number of basic feasible solutions is

guaranteed to be finite, it can be very large.
▶ For example, the unit cube

{x ∈ Rn | 0 ≤ xi ≤ 1, i = 1, . . . , n}

is defined by 2n constraints, but has 2n basic feasible

solutions.
Adjacent basic solutions
Adjacent basic solutions

▶ Two distinct basic solutions

to a set of linear constraints
in Rn are said to be adjacent
if we can find n − 1 linearly
independent constraints that
are active at both of them.
▶ If two adjacent basic
solutions are also feasible,
then the line segment that
joins them is called an edge
of the feasible set.
Adjacent basic solutions

▶ Two distinct basic solutions

▶ The definition of a basic solution refers to general polyhedra.

▶ We will now specialize to polyhedra in standard form.

▶ Let P be a polyhedron in standard form

P = {x ∈ Rn | Ax = b, x ≥ 0}.

▶ Let the dimensions of A be m × n.

The full row rank assumption on A

▶ In most of our discussion of standard form problems, we will

make the assumption that the m rows of the matrix A are
linearly independent.
▶ We now see that when P is nonempty, linearly dependent
rows of A correspond to redundant constraints that can be
discarded.
▶ Therefore, our linear independence assumption can be made
without loss of generality.
The full row rank assumption on A

Theorem 2.5
Let P = {x | Ax = b, x ≥ 0} be a nonempty polyhedron,
where A is a matrix of dimensions m × n, with rows a′1 , . . . , a′m .
Suppose that rank(A) = k < m and that the rows a′i1 , . . . , a′ik
are linearly independent. Consider the polyhedron

Q = {x | a′i1 x = bi1 , . . . , a′ik x = bik , x ≥ 0}.

Then Q = P.

Proof: Easy exercise.

Example 2.3

▶ Consider the nonempty polyhedron defined by

2x1 + x2 + x3 = 2
x1 + x2 = 1
x1 + x3 = 1
x1 , x2 , x3 ≥ 0.
Example 2.3

▶ Consider the nonempty polyhedron defined by

2x1 + x2 + x3 = 2
x1 + x2 = 1
x1 + x3 = 1
x1 , x2 , x3 ≥ 0.

▶ The corresponding matrix A has rank two.

▶ This is because the last two rows (1, 1, 0) and (1, 0, 1) are
linearly independent, but the first row (2, 1, 1) is equal to the
sum of the other two.
Example 2.3

▶ Consider the nonempty polyhedron defined by

2x1 + x2 + x3 = 2
x1 + x2 = 1
x1 + x3 = 1
x1 , x2 , x3 ≥ 0.

▶ The corresponding matrix A has rank two.

▶ This is because the last two rows (1, 1, 0) and (1, 0, 1) are
linearly independent, but the first row (2, 1, 1) is equal to the
sum of the other two.
▶ Thus, the first constraint is redundant and after it is
eliminated, we still have the same polyhedron.
The full row rank assumption on A

▶ Notice that the polyhedron Q in Theorem 2.5 is in standard

form like P:
Q = {x | Dx = f, x ≥ 0}.
▶ Moreover, D is a k × n submatrix of A, with rank equal to k.
▶ We conclude that as long as the feasible set is nonempty, a
LP problem in standard form can be reduced to an equivalent
standard form LP problem (with the same feasible set) in
which the equality constraints are linearly independent.
Polyhedra in standard form: basic solutions

▶ Let’s get back to our task of specializing the definition of a

basic solution to polyhedra in standard form.
▶ Let P be a polyhedron in standard form

P = {x ∈ Rn | Ax = b, x ≥ 0}.

▶ Let the dimensions of A be m × n.

▶ We can now assume, without loss of generality, that the m
rows of the matrix A are linearly independent.
▶ Since the rows are n-dimensional, this requires that m ≤ n.
Polyhedra in standard form: basic solutions

▶ Recall that at any basic solution, there must be n linearly

independent constraints that are active.
▶ Furthermore, every basic solution must satisfy the equality
constraints Ax = b, which provides us with m active
constraints; these are linearly independent because of our
assumption on the rows of A.
▶ In order to obtain a total of n active constraints, we need to
choose n − m of the variables xi and set them to zero, which
makes the corresponding nonnegativity constraints xi ≥ 0
active.
▶ However, for the resulting set of n active constraints to be
linearly independent, the choice of these n − m variables is not
entirely arbitrary.
Polyhedra in standard form: basic solutions

Theorem 2.4
Consider the constraints Ax = b and x ≥ 0 and assume that
the m × n matrix A has linearly independent rows. A vector
x ∈ Rn is a basic solution if and only if we have Ax = b, and
there exist distinct indices B(1), . . . , B(m) ∈ {1, . . . , n} such
that:
(a) The columns AB(1) , . . . , AB(m) of A are linearly
independent;
(b) If i ̸= B(1), . . . , B(m), then xi = 0.

Proof idea:
n m n−m
( ) ( )
m A m B C
Active constraints: =
n−m 0 I n−m 0 I
Polyhedra in standard form: basic solutions

In view of Theorem 2.4, all basic solutions to a standard form

polyhedron can be constructed according to the following
procedure.

Procedure for constructing basic solutions

1. Choose m linearly independent columns AB(1) , . . . , AB(m) .
2. Let xi = 0 for all i ̸= B(1), . . . , B(m).
3. Solve the system of m equations Ax = b for the unknowns
xB(1) , . . . , xB(m) .
Polyhedra in standard form: basic solutions

We can use a similar procedure to construct all basic feasible

solutions to a standard form polyhedron.
▶ If a basic solution constructed according to this procedure is
nonnegative, then it is feasible, and it is a basic feasible
solution.
▶ Conversely, since every basic feasible solution is a basic
solution, it can be obtained from this procedure.
Example 2.1

Let the constraint Ax = b be of the form

   
1 1 2 1 0 0 0 8
0 1 6 0 1 0 0 12
 x =  .
1 0 0 0 0 1 0 4
0 1 0 0 0 0 1 6
Example 2.1

Let the constraint Ax = b be of the form

   
1 1 2 1 0 0 0 8
0 1 6 0 1 0 0 12
 x =  .
1 0 0 0 0 1 0 4
0 1 0 0 0 0 1 6

▶ B(1) = 4, B(2) = 5, B(3) = 6, B(4) = 7.

▶ The columns A4 , A5 , A6 , A7 are linearly independent.
▶ The corresponding basic solution is

x = (0, 0, 0, 8, 12, 4, 6).

▶ x≥0 ⇒ x is a basic feasible solution.

Example 2.1

Let the constraint Ax = b be of the form

   
1 1 2 1 0 0 0 8
0 1 6 0 1 0 0 12
 x =  .
1 0 0 0 0 1 0 4
0 1 0 0 0 0 1 6

▶ B(1) = 3, B(2) = 5, B(3) = 6, B(4) = 7.

▶ The columns A3 , A5 , A6 , A7 are linearly independent.
▶ The corresponding basic solution is

x = (0, 0, 4, 0, −12, 4, 6).

▶ x5 = −12 < 0 ⇒ x is not feasible.

Polyhedra in standard form: basic solutions

▶ If x is a basic solution, the variables

xB(1) , . . . , xB(m)

are called basic variables; the remaining variables are called

nonbasic.
▶ Similarly, B(1), . . . , B(m) are called basic indices, and
{1, . . . , n} \ {B(1), . . . , B(m)} are called nonbasic indices.
▶ The columns
AB(1) , . . . , AB(m)
are called the basic columns and, since they are linearly
independent, they form a basis of Rm .
Polyhedra in standard form: basic solutions
▶ By arranging the m basic columns next to each other, we
obtain an m × m matrix B, called a basis matrix:
[ ]
B = AB(1) AB(2) · · · AB(m) .

▶ Note that a basis matrix B is invertible because the basic

columns are linearly independent.
▶ We can similarly define a vector xB with the values of the
basic variables:  
xB(1)
 
xB =  ...  .
xB(m)
▶ The basic variables are determined by solving the equation
BxB = b whose unique solution is given by

xB = B−1 b.
Example 2.1
   
1 1 2 1 0 0 0 8
0 1 6 0 1 0 
0 12
 x= .
1 0 0 0 0 1 0  4
0 1 0 0 0 0 1 6

▶ We chose columns A4 , A5 , A6 , A7 and obtained the basic feasible

solution x = (0, 0, 0, 8, 12, 4, 6).
▶ Basic indices: 4, 5, 6, 7.
▶ Basic variables: x4 , x5 , x6 , x7 .
▶ Basic columns: A4 , A5 , A6 , A7 .
Example 2.1
   
1 1 2 1 0 0 0 8
0 1 6 0 1 0 
0 12
 x= .
1 0 0 0 0 1 0  4
0 1 0 0 0 0 1 6

▶ Basis matrix B and vector xB :

   
1 0 0 0 x4
0 1 0 0 x5 
B= 0 0 1 0 ,
 xB =  
x6  .
0 0 0 1 x7

▶ Note that Bi = AB(i) for every i = 1, . . . , m.

Different bases

▶ We say that two bases are distinct or different if they involve

different sets {B(1), . . . , B(m)} of basic indices.
▶ If two bases involve the same set of indices in a different
order, they will be viewed as one and the same basis.
Example 2.1

Suppose there is an eighth column A8 = A7 .

   
1 1 2 1 0 0 0 0 8
0 1 6 0 1 0 0 0   
 x = 12 .
1 0 0 0 0 1 0 0 4
0 1 0 0 0 0 1 1 6

▶ The two sets of columns {A3 , A5 , A6 , A7 } and

{A3 , A5 , A6 , A8 } coincide.
▶ The corresponding sets of basic indices {3, 5, 6, 7} and
{3, 5, 6, 8} are different.
▶ We have two different bases.
Example 2.1

Suppose there is an eighth column A8 = A7 .

   
1 1 2 1 0 0 0 0 8
0 1 6 0 1 0 0 0   
 x = 12 .
1 0 0 0 0 1 0 0 4
0 1 0 0 0 0 1 1 6

▶ The two sets of columns {A3 , A5 , A6 , A7 } and

{A3 , A5 , A6 , A8 } coincide.
▶ The corresponding sets of basic indices {3, 5, 6, 7} and
{3, 5, 6, 8} are different.
▶ We have two different bases.
Polyhedra in standard form: basic solutions
Intuitive view of basic solutions.
▶ Recall our interpretation of the
constraint
∑
n
Ax = b ⇔ Ai xi = b
i=1

as a requirement to synthesize the

vector b ∈ Rn using the resource
vectors Ai (Section 1.1).
▶ In a basic solution, we use only m Question: In this example
of the resource vectors, those m = 2. Which are the
associated with the basic variables. bases that yield basic
▶ In a basic feasible solution, this is feasible solutions?
accomplished using a nonnegative
amount of each basic vector.
Correspondence of bases and basic solutions
Correspondence of bases and basic solutions

▶ A basis uniquely determines a basic solution, thus different

basic solutions must correspond to different bases.
▶ However, two different bases may lead to the same basic
solution.
▶ Example: If we have b = 0, then every basis matrix leads to
the same basic solution, namely, the zero vector.
▶ This phenomenon will have some important algorithmic
implications.
Adjacent basic solutions and adjacent bases
Adjacent basic solutions and adjacent bases

▶ Recall that two distinct basic solutions are said to be adjacent

if there are n − 1 linearly independent constraints that are
active at both of them.
▶ For standard form problems, we define two bases to be
adjacent if they share all but one basic columns.

Exercise: Show that:

1. Adjacent basic solutions can always be obtained from two
adjacent bases.
2. Conversely, if two adjacent bases lead to distinct basic
solutions, then the latter are adjacent.
(Hint: Use proof of Theorem 2.4.)
Example 2.2

In Example 2.1 we considered constraint Ax = b of the form

   
1 1 2 1 0 0 0 8
0 1 6 0 1 0 0   
 x = 12 .
1 0 0 0 0 1 0 4
0 1 0 0 0 0 1 6
Example 2.2

In Example 2.1 we considered constraint Ax = b of the form

   
1 1 2 1 0 0 0 8
0 1 6 0 1 0 0   
 x = 12 .
1 0 0 0 0 1 0 4
0 1 0 0 0 0 1 6

▶ The bases {A4 , A5 , A6 , A7 } and {A3 , A5 , A6 , A7 } are

adjacent because all but one columns are the same.
▶ The corresponding basic solutions x = (0, 0, 0, 8, 12, 4, 6)
and x = (0, 0, 4, 0, −12, 4, 6) are adjacent: we have n = 7
and a total of six common linearly independent active
constraints: the four equality constraints, x1 ≥ 0, and
x2 ≥ 0.
Example 2.2

In Example 2.1 we considered constraint Ax = b of the form

   
1 1 2 1 0 0 0 8
0 1 6 0 1 0 0   
 x = 12 .
1 0 0 0 0 1 0 4
0 1 0 0 0 0 1 6

▶ The bases {A4 , A5 , A6 , A7 } and {A3 , A5 , A6 , A7 } are

▶ We now consider again general polyhedra.

▶ According to our definition, at a basic solution, we must have
n linearly independent active constraints.
▶ This allows for the possibility that the number of active
constraints is greater than n.
▶ In this case, we say that we have a degenerate basic solution.

Definition 2.10
A basic solution x ∈ Rn is said to be degenerate if more than n
of the constraints are active at x.
Degeneracy

▶ If n = 2, a degenerate basic
solution is at the intersection of
three or more lines.
Degeneracy

▶ If n = 3, a degenerate basic
solution is at the intersection of
four or more planes.

▶ The presence of degeneracy can strongly affect the behavior

of LP algorithms.
Example 2.4
Consider the polyhedron P defined by the constraints

x1 + x2 + 2x3 ≤ 8 (1)
x2 + 6x3 ≤ 12 (2)
x1 ≤ 4 (3)
x2 ≤ 6 (4)
x1 , x2 , x3 ≥ 0.
Example 2.4
Consider the polyhedron P defined by the constraints