0% found this document useful (0 votes)

52 views32 pages

17 Dynamic Programming Matrix Chain Multiplication No Pause

The document discusses the matrix chain multiplication problem. It involves finding the most efficient way to parenthesize and multiply a sequence of matrices. There are multiple ways to parenthesize the matrices, and the goal is to minimize the number of scalar multiplications. The problem exhibits optimal substructure - optimal solutions to subproblems are contained within the optimal solution. A dynamic programming algorithm is presented that solves the problem in O(n^3) time by filling a table in a bottom-up manner.

Uploaded by

Abdallahi Sidi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

52 views32 pages

17 Dynamic Programming Matrix Chain Multiplication No Pause

Uploaded by

Abdallahi Sidi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 32

Lecture 17/18: Dynamic Programming - Matrix

Chain Parenthesization
COMS10007 - Algorithms

Dr. Christian Konrad

01.04.2019 and 02.04.2019

Dr. Christian Konrad Lecture 17/18: Matrix Chain Parenthesization 1 / 18

Matrix Multiplication

Problem: Matrix-Multiplication
1 Input: Matrices A, B with A.columns = B.rows
2 Output: Matrix product A × B

Example:
 q   r 
2 3 r 6 2 4
1 0 q 0 1 2

0 1 2
2 6 × 2 0 0 = 12
p 4 p
  
2
0 9 18 0 0

Notation: p × q matrix: p rows and q columns

p × q matrix times q × r matrix gives a p × r matrix

(A × B)i,j = row i of A times column j of B

Dr. Christian Konrad Lecture 17/18: Matrix Chain Parenthesization 2 / 18

Algorithm for Matrix-Multiplication
Algorithm: (A × B)i,j = row i of A times column j of B

Require: Matrices A, B with A.columns = B.rows

Let C be a new A.rows × B.columns matrix
for i ← 1 . . . A.rows do
for j ← 1 . . . B.columns do
Cij ← 0
for k ← 1 . . . A.columns do
Cij ← Cij + Aik · Bkj
return C
Algorithm Matrix-Multiply(A, B)

Runtime:
Three nested loops: O(A.rows · B.columns · A.columns)
Number of Multiplications: A.rows · B.columns · A.columns
Multiplying two n × n matrices: runtime O(n3 )
Dr. Christian Konrad Lecture 17/18: Matrix Chain Parenthesization 3 / 18
Background: Faster Matrix Multiplication

History: Multiplying two n × n matrices

before 1969: O(n3 )

1969: Strassen O(n2.8074 ) (divide-and-conquer)
1990: Coppersmith-Winograd O(n2.3755 )
2010: Stothers O(n2.374 )
2011: Virginia Williams O(n2.3728642 )
2014: Le Gall O(n2.3728639 )

Important Problem:

Many algorithms rely on fast matrix multiplication

Better bound for matrix multiplication improves many
algorithms

Dr. Christian Konrad Lecture 17/18: Matrix Chain Parenthesization 4 / 18

The Matrix-chain Multiplication Problem

Problem: Matrix-Chain-Multiplication
1 Input: A sequence (chain) of n matrices A1 , A2 , A3 , . . . , An
2 Output: The product A1 × A2 × A3 × · · · × An

Discussion:
Ai .columns = Ai+1 .rows for every 1 ≤ i < n
Assume Ai has dimension pi−1 × pi , for vector p[0 . . . n]
Matrix product is associative:

(A1 × A2 ) × A3 = A1 × (A2 × A3 )

Exploit Associativity: Parenthesize A1 × A2 × A3 × . . . An so as

to minimize the number of scalar multiplications (and thus the
runtime)

Dr. Christian Konrad Lecture 17/18: Matrix Chain Parenthesization 5 / 18

Order matters
Example: Three matrices A1 , A2 , A3 with dimensions

A1 : 10 × 100 A2 : 100 × 5 A3 : 5 × 50

(p0 = 10, p1 = 100, p2 = 5, p3 = 50)

Computation of (A1 × A2 ) × A3 :
A1 × A2 = A12 requires 10 · 100 · 5 = 5000 multiplications
A12 × A3 requires 10 · 5 · 50 = 2500 multiplications
Total: 7500 multiplications

Computation of A1 × (A2 × A3 ):
A2 × A3 = A23 requires 100 · 5 · 50 = 25000 multiplications
A1 × A23 requires 10 · 100 · 50 = 50000 multiplications
Total: 75000 multiplications
Dr. Christian Konrad Lecture 17/18: Matrix Chain Parenthesization 6 / 18
The Matrix-Chain-Parenthesization Problem
Problem: Matrix-Chain-Parenthesization
1 Input: A sequence (chain) of n matrices A , A , A , . . . , A
1 2 3 n
2 Output: A parenthesization of A × A × A × · · · × A that
1 2 3 n
minimizes the number of scalar multiplications

How many Parenthesizations P(n) are there?

We write: Aij for the product Ai × Ai+1 × · · · × Aj
There is a final matrix multiplication: A1k × A(k+1)n , for some
1 ≤ k ≤ n − 1. Hence:
(
1 if n = 1 ,
P(n) = Pn−1
k=1 P(k)P(n − k) if n ≥ 2 .

Example: Four matrices A1 , A2 , A3 , A4

A1 × A24 A12 × A34 A13 × A4

Dr. Christian Konrad Lecture 17/18: Matrix Chain Parenthesization 7 / 18

Number of Parenthesizations
Example (continued): Four matrices A1 , A2 , A3 , A4
A1 × A24 A12 × A34 A13 × A4
2
X
P(3) = P(k)P(n − k) = P(1)P(2) + P(2)P(1) = 2
k=1
3
X
P(4) = P(k)P(n − k) = P(1)P(3) + P(2)P(2) + P(3)P(1)
k=1
= P(3) + 1 + P(3) = 2P(3) + 1 = 5 .
1 A1 × ((A2 × A3 ) × A4 )
2 A1 × ((A2 × (A3 × A4 ))
3 (A1 × A2 ) × (A3 × A4 )
4 ((A1 × A2 ) × A3 ) × A4
5 (A1 × (A2 × A3 )) × A4

Dr. Christian Konrad Lecture 17/18: Matrix Chain Parenthesization 8 / 18

Number of Parenthesizations (2)

A Bound on the Number of Parenthesizations:

(
1 if n = 1 ,
P(n) = Pn−1
k=1 P(k)P(n − k) if n ≥ 2 .

1, 1, 2, 5, 14, 42, 132, 429, 1430, 4862, 16796, 58786, 208012, 742900, . . .

It can be seen that there are Ω(2n ) possibilities

An efficient algorithm thus cannot try out all possibilities
We will give a dynamic programming algorithm

Dr. Christian Konrad Lecture 17/18: Matrix Chain Parenthesization 9 / 18

Optimal Substructure
Optimal Substructure
We say that a problem P exhibits optimal substructure if:

An optimal solution to P contains within it optimal solutions to

subproblems of P.

Optimal Substructure in Matrix-Chain-Parenthesization

Consider optimal solution to instance of size n
Suppose that last product is A1k × A(k+1)n
Then the optimal solution contains optimal parenthesizations
of A1 × A2 × · · · × Ak and Ak+1 × Ak+2 × . . . An
Proof. Suppose it did not contain optimal parenthesizations
of A1 × A2 × · · · × Ak and of Ak+1 × Ak+2 × . . . An . Then
picking optimal parenthesizations of the two subproblems
would give better solution to initial instance.
Dr. Christian Konrad Lecture 17/18: Matrix Chain Parenthesization 10 / 18
Recursive Solution
Optimal Solution to Subproblem:
m[i, j] : minimum number of scalar multiplications needed to
compute Ai × Ai+1 × · · · × Aj = Aij
Observe that m[i, i] = 0 (chain consists of single matrix Ai )
Suppose j > i. Suppose last multiplication in optimal solution
is: Aik × A(k+1)j , for some k
Then: cost of multiplying Aik × A(k+1)j
m[i, j] = m[i, k] + m[k + 1, j] + pi−1 pk pj
(Aik : pi−1 × pk matrix, A(k+1)j : pk × pj matrix)
Since we do not know k, we try out all possibilities and
choose the best solution:
(
0 if i = j ,
m[i, j] =
mini≤k<j {m[i, k] + m[k + 1, j] + pi−1 pk pj } if i < j .

Dr. Christian Konrad Lecture 17/18: Matrix Chain Parenthesization 11 / 18

Computing the Optimal Costs

(
0 if i = j ,
m[i, j] =
mini≤k<j {m[i, k] + m[k + 1, j] + pi−1 pk pj } if i < j .

Algorithmic Considerations:
As in Pole-Cutting, we could implement this recursive
formula directly. → exponential runtime
Instead, we compute the table m[i, j] bottom up
Observe that there are less than n2 subproblems m[i, j] (i and
j take values in {1, . . . , n})
We will see that computing one value m[i, j] takes O(n) time
This yields an O(n3 ) time algorithm

Dr. Christian Konrad Lecture 17/18: Matrix Chain Parenthesization 12 / 18

Dynamic Programming Algorithm

Require: Integer n, vector of dimensions of matrices p so that

matrix Ai has dimensions pi−1 × pi
Let m[1 . . . n, 1 . . . n] be a new array
for i ← 1 . . . n do
m[i, i] ← 0
for l ← 2 . . . n do {chain length}
for i ← 1 . . . n − l + 1 do {left position}
j ← i + l − 1 {right position}
m[i, j] ← ∞
for k ← i . . . j − 1 do
m[i, j] ← min{m[i, j], m[i, k] + m[k + 1, j] + pi−1 pk pj }
return m
Algorithm Matrix-Chain-Value(n, p)
Pn Pn−l+1 Pi+l−2
Runtime: O(n3 ) (by evaluating l=2 i=1 k=1 O(1))

Dr. Christian Konrad Lecture 17/18: Matrix Chain Parenthesization 13 / 18

Runtime Evaluation

Useful Formula:
b
X
1=b−a+1
i=a

n n−l+1
X X i+l−2
X n n−l+1
X X i+l−2
X
O(1) = O(1) · 1
l=2 i=1 k=1 l=2 i=1 k=1
Xn X
n Xn Xn Xn n X
X n
≤ O(1) · 1 = O(1) · n = O(1) · n 1
l=1 i=1 k=1 l=1 i=1 l=1 i=1
Xn n
X
= O(1) · n n = O(1) · n2 1 = O(1) · n2 · n = O(1)n3
l=1 l=1
3
= O(n ) .