0% found this document useful (0 votes)

6 views6 pages

Strassen_algorithm

Uploaded by

Michael Margolese

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views6 pages

Strassen_algorithm

Uploaded by

Michael Margolese

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Strassen algorithm

In linear algebra, the Strassen algorithm, named after Volker Strassen, is an algorithm for matrix multiplication. It is faster
than the standard matrix multiplication algorithm for large matrices, with a better asymptotic complexity, although the naive
algorithm is often better for smaller matrices. The Strassen algorithm is slower than the fastest known algorithms for
extremely large matrices, but such galactic algorithms are not useful in practice, as they are much slower for matrices of
practical size. For small matrices even faster algorithms exist.

Strassen's algorithm works for any ring, such as plus/multiply, but not all semirings, such as min-plus or boolean algebra,
where the naive algorithm still works, and so called combinatorial matrix multiplication.

History
Volker Strassen first published this algorithm in 1969 and thereby proved that the general matrix multiplication
algorithm was not optimal.[1] The Strassen algorithm's publication resulted in more research about matrix multiplication that
led to both asymptotically lower bounds and improved computational upper bounds.

Algorithm

The left column visualizes the calculations necessary to determine the result of a 2x2 matrix multiplication. Naïve matrix multiplication
requires one multiplication for each "1" of the left column. Each of the other columns (M1-M7) represents a single one of the 7
multiplications in the Strassen algorithm. The sum of the columns M1-M7 gives the same result as the full matrix multiplication on the
left.

Let , be two square matrices over a ring , for example matrices whose entries are integers or the real numbers. The
goal of matrix multiplication is to calculate the matrix product . The following exposition of the algorithm assumes
that all of these matrices have sizes that are powers of two (i.e., ), but this is only conceptually
necessary — if the matrices , are not of type , the "missing" rows and columns can be filled with zeros to
obtain matrices with sizes of powers of two — though real implementations of the algorithm do not do this in practice.

The Strassen algorithm partitions , and into equally sized block matrices

with . The naive algorithm would be:

This construction does not reduce the number of multiplications: 8 multiplications of matrix blocks are still needed to
calculate the matrices, the same number of multiplications needed when using standard matrix multiplication.

The Strassen algorithm defines instead new values:

using only 7 multiplications (one for each ) instead of 8. We may now express the in terms of :

We recursively iterate this division process until the submatrices degenerate into numbers (elements of the ring ). If, as
mentioned above, the original matrix had a size that was not a power of 2, then the resulting product will have zero rows
and columns just like and , and these will then be stripped at this point to obtain the (smaller) matrix we really
wanted.

Practical implementations of Strassen's algorithm switch to standard methods of matrix multiplication for small enough
submatrices, for which those algorithms are more efficient. The particular crossover point for which Strassen's algorithm is
more efficient depends on the specific implementation and hardware. Earlier authors had estimated that Strassen's algorithm
is faster for matrices with widths from 32 to 128 for optimized implementations.[2] However, it has been observed that this
crossover point has been increasing in recent years, and a 2010 study found that even a single step of Strassen's algorithm is
often not beneficial on current architectures, compared to a highly optimized traditional multiplication, until matrix sizes
exceed 1000 or more, and even for matrix sizes of several thousand the benefit is typically marginal at best (around 10% or
less).[3] A more recent study (2016) observed benefits for matrices as small as 512 and a benefit around 20%.[4]

Winograd form
It is possible to reduce the number of matrix additions by instead using the following form discovered by Winograd:

where .

This reduces the number of matrix additions and subtractions from 18 to 15. The number of matrix multiplications is still 7,
and the asymptotic complexity is the same.[5]

Asymptotic complexity
The outline of the algorithm above showed that one can get away with just 7, instead of the traditional 8, matrix-matrix
multiplications for the sub-blocks of the matrix. On the other hand, one has to do additions and subtractions of blocks,
though this is of no concern for the overall complexity: Adding matrices of size requires only operations
whereas multiplication is substantially more expensive (traditionally addition or multiplication operations).
The question then is how many operations exactly one needs for Strassen's algorithms, and how this compares with the
standard matrix multiplication that takes approximately (where ) arithmetic operations, i.e. an asymptotic
complexity .

The number of additions and multiplications required in the Strassen algorithm can be calculated as follows: let be the
number of operations for a matrix. Then by recursive application of the Strassen algorithm, we see that
, for some constant that depends on the number of additions performed at each application of the
algorithm. Hence , i.e., the asymptotic complexity for multiplying matrices of size using the
Strassen algorithm is . The reduction in the number of arithmetic
operations however comes at the price of a somewhat reduced numerical stability,[6] and the algorithm also requires
significantly more memory compared to the naive algorithm. Both initial matrices must have their dimensions expanded to
the next power of 2, which results in storing up to four times as many elements, and the seven auxiliary matrices each
contain a quarter of the elements in the expanded ones.

Strassen's algorithm needs to be compared to the "naive" way of doing the matrix multiplication that would require 8 instead
of 7 multiplications of sub-blocks. This would then give rise to the complexity one expects from the standard approach:
. The comparison of these two algorithms shows that asymptotically, Strassen's algorithm
is faster: There exists a size so that matrices that are larger are more efficiently multiplied with Strassen's
algorithm than the "traditional" way. However, the asymptotic statement does not imply that Strassen's algorithm is always
faster even for small matrices, and in practice this is in fact not the case: For small matrices, the cost of the additional
additions of matrix blocks outweighs the savings in the number of multiplications. There are also other factors not captured
by the analysis above, such as the difference in cost on today's hardware between loading data from memory onto
processors vs. the cost of actually doing operations on this data. As a consequence of these sorts of considerations,
Strassen's algorithm is typically only used on "large" matrices. This kind of effect is even more pronounced with alternative
algorithms such as the one by Coppersmith and Winograd: While asymptotically even faster, the cross-over point
is so large that the algorithm is not generally used on matrices one encounters in practice.

Rank or bilinear complexity

The bilinear complexity or rank of a bilinear map is an important concept in the asymptotic complexity of matrix
multiplication. The rank of a bilinear map over a field F is defined as (somewhat of an abuse of notation)

In other words, the rank of a bilinear map is the length of its shortest bilinear computation.[7] The existence of Strassen's
algorithm shows that the rank of matrix multiplication is no more than seven. To see this, let us express this algorithm
(alongside the standard algorithm) as such a bilinear computation. In the case of matrices, the dual spaces A* and B* consist
of maps into the field F induced by a scalar double-dot product, (i.e. in this case the sum of all the entries of a Hadamard
product.)
Standard algorithm Strassen algorithm

It can be shown that the total number of elementary multiplications required for matrix multiplication is tightly
asymptotically bound to the rank , i.e. , or more specifically, since the constants are known, .
One useful property of the rank is that it is submultiplicative for tensor products, and this enables one to show that
matrix multiplication can be accomplished with no more than elementary multiplications for any . (This
-fold tensor product of the matrix multiplication map with itself — an -th tensor power—is realized by the
recursive step in the algorithm shown.)

Cache behavior
Strassen's algorithm is cache oblivious. Analysis of its cache behavior algorithm has shown it to incur

cache misses during its execution, assuming an idealized cache of size (i.e. with lines of length ).[8]: 13

Implementation considerations
The description above states that the matrices are square, and the size is a power of two, and that padding should be used if
needed. This restriction allows the matrices to be split in half, recursively, until limit of scalar multiplication is reached. The
restriction simplifies the explanation, and analysis of complexity, but is not actually necessary;[9] and in fact, padding the
matrix as described will increase the computation time and can easily eliminate the fairly narrow time savings obtained by
using the method in the first place.

A good implementation will observe the following:

It is not necessary or desirable to use the Strassen algorithm down to the limit of scalars. Compared to
conventional matrix multiplication, the algorithm adds a considerable workload in
addition/subtractions; so below a certain size, it will be better to use conventional multiplication. Thus, for
instance, a does not need to be padded to , since it could be subdivided down to
matrices and conventional multiplication can then be used at that level.
The method can indeed be applied to square matrices of any dimension.[3] If the dimension is even, they are
split in half as described. If the dimension is odd, zero padding by one row and one column is applied first.
Such padding can be applied on-the-fly and lazily, and the extra rows and columns discarded as the result is
formed. For instance, suppose the matrices are . They can be split so that the upper-left portion is
and the lower-right is . Wherever the operations require it, dimensions of are zero
padded to first. Note, for instance, that the product is only used in the lower row of the output, so is
only required to be rows high; and thus the left factor used to generate it need only be rows
high; accordingly, there is no need to pad that sum to rows; it is only necessary to pad to
columns to match .
Furthermore, there is no need for the matrices to be square. Non-square matrices can be split in half using the same
methods, yielding smaller non-square matrices. If the matrices are sufficiently non-square it will be worthwhile reducing the
initial operation to more square products, using simple methods which are essentially , for instance:

A product of size can be done as 20 separate operations,

arranged to form the result;
A product of size can be done as 10 separate operations,
summed to form the result.
These techniques will make the implementation more complicated, compared to simply padding to a power-of-two square;
however, it is a reasonable assumption that anyone undertaking an implementation of Strassen, rather than conventional
multiplication, will place a higher priority on computational efficiency than on simplicity of the implementation.

In practice, Strassen's algorithm can be implemented to attain better performance than conventional multiplication even for
matrices as small as , for matrices that are not at all square, and without requiring workspace beyond buffers that
are already needed for a high-performance conventional multiplication.[4]

See also
Computational complexity of mathematical operations
Gauss–Jordan elimination
Computational complexity of matrix multiplication
Z-order curve
Karatsuba algorithm, for multiplying n-digit integers in instead of in time
A similar complex multiplication algorithm multiplies two complex numbers using 3 real multiplications
instead of 4
Toom-Cook algorithm, a faster generalization of the Karatsuba algorithm that permits recursive divide-and-
conquer decomposition into more than 2 blocks at a time

References
1. Strassen, Volker (1969). "Gaussian Elimination is not Optimal". Numer. Math. 13 (4): 354–356.
doi:10.1007/BF02165411 (https://doi.org/10.1007%2FBF02165411). S2CID 121656251 (https://api.semantic
scholar.org/CorpusID:121656251).
2. Skiena, Steven S. (1998), "§8.2.3 Matrix multiplication", The Algorithm Design Manual, Berlin, New York:
Springer-Verlag, ISBN 978-0-387-94860-7.
3. D'Alberto, Paolo; Nicolau, Alexandru (2005). Using Recursion to Boost ATLAS's Performance (https://www.ic
s.uci.edu/~paolo/Reference/paoloA.ishp-vi.pdf) (PDF). Sixth Int'l Symp. on High Performance Computing.
4. Huang, Jianyu; Smith, Tyler M.; Henry, Greg M.; van de Geijn, Robert A. (13 Nov 2016). Strassen's Algorithm
Reloaded (https://www.researchgate.net/publication/315365781). SC16: The International Conference for
High Performance Computing, Networking, Storage and Analysis (https://ieeexplore.ieee.org/xpl/conhome/78
75333/proceeding). IEEE Press. pp. 690–701. doi:10.1109/SC.2016.58 (https://doi.org/10.1109%2FSC.2016.
58). ISBN 9781467388153. Retrieved 1 Nov 2022.
5. Knuth (1997), p. 500.
6. Webb, Miller (1975). "Computational complexity and numerical stability". SIAM J. Comput. 4 (2): 97–107.
doi:10.1137/0204009 (https://doi.org/10.1137%2F0204009).
7. Burgisser; Clausen; Shokrollahi (1997). Algebraic Complexity Theory. Springer-Verlag. ISBN 3-540-60582-7.
8. Frigo, M.; Leiserson, C. E.; Prokop, H.; Ramachandran, S. (1999). Cache-oblivious algorithms (http://superte
ch.csail.mit.edu/papers/FrigoLePr99.pdf) (PDF). Proc. IEEE Symp. on Foundations of Computer Science
(FOCS). pp. 285–297.
9. Higham, Nicholas J. (1990). "Exploiting fast matrix multiplication within the level 3 BLAS" (http://www.maths.
manchester.ac.uk/~higham/papers/high90s.pdf) (PDF). ACM Transactions on Mathematical Software. 16 (4):
352–368. doi:10.1145/98267.98290 (https://doi.org/10.1145%2F98267.98290). hdl:1813/6900 (https://hdl.ha
ndle.net/1813%2F6900). S2CID 5715053 (https://api.semanticscholar.org/CorpusID:5715053).

Thomas H. Cormen, Charles E. Leiserson, Ronald L. Rivest, and Clifford Stein. Introduction to Algorithms,
Second Edition. MIT Press and McGraw-Hill, 2001. ISBN 0-262-03293-7. Chapter 28: Section 28.2:
Strassen's algorithm for matrix multiplication, pp. 735–741.
Knuth, Donald (1997). The Art of Computer Programming, Seminumerical Algorithms. Vol. II (3rd ed.).
Addison-Wesley. ISBN 0-201-89684-2.

External links
Weisstein, Eric W. "Strassen's Formulas" (https://mathworld.wolfram.com/StrassenFormulas.html).
MathWorld. (also includes formulas for fast matrix inversion)
Tyler J. Earnest, Strassen's Algorithm on the Cell Broadband Engine (https://web.archive.org/web/20100612
150812/http://www.mc2.umbc.edu/docs/earnest.pdf)

Retrieved from "https://en.wikipedia.org/w/index.php?title=Strassen_algorithm&oldid=1188592387"

The Right To Remain Silent - A New Answer To An Old Question by James Duane
No ratings yet
The Right To Remain Silent - A New Answer To An Old Question by James Duane
3 pages
Automating Program Compilation - Writing Makefiles PDF
No ratings yet
Automating Program Compilation - Writing Makefiles PDF
6 pages
class4_1
No ratings yet
class4_1
18 pages
LUFA Library_ VID and PID values
No ratings yet
LUFA Library_ VID and PID values
2 pages
Nanowire electrodes for high-density stimulation and measurement of neural circuits
No ratings yet
Nanowire electrodes for high-density stimulation and measurement of neural circuits
5 pages
Answer Algo3
No ratings yet
Answer Algo3
2 pages
5. Read Various Algorithms Listed
No ratings yet
5. Read Various Algorithms Listed
11 pages
ADA Madhav
No ratings yet
ADA Madhav
5 pages
Everything You Need To Know About Semaphores And Mutexes
No ratings yet
Everything You Need To Know About Semaphores And Mutexes
5 pages
Automatic_Reproduction_of_a_Genius_Algorithm_Strassens_Algorithm_Revisited_by_Genetic_Search
No ratings yet
Automatic_Reproduction_of_a_Genius_Algorithm_Strassens_Algorithm_Revisited_by_Genetic_Search
6 pages
Strassens
No ratings yet
Strassens
2 pages
unit 2 strassen's algo
No ratings yet
unit 2 strassen's algo
7 pages
Lecture 9 _ 10(Divide and Conquer)
No ratings yet
Lecture 9 _ 10(Divide and Conquer)
16 pages
Strassen
No ratings yet
Strassen
11 pages
Nano VNA v2 - 2
No ratings yet
Nano VNA v2 - 2
1 page
DAA-Q Bank (THEORY) Solved
No ratings yet
DAA-Q Bank (THEORY) Solved
63 pages
Strassen Algorithm
No ratings yet
Strassen Algorithm
2 pages
Analog Engineers Circuit Cookbook - Data Converters (2nd Edition) PDF
100% (2)
Analog Engineers Circuit Cookbook - Data Converters (2nd Edition) PDF
303 pages
Strassian Matrix
No ratings yet
Strassian Matrix
18 pages
Lab Manual DAA
No ratings yet
Lab Manual DAA
42 pages
Strassen
No ratings yet
Strassen
11 pages
Matrix Multiplication Algorithms With Better Time Complexity
No ratings yet
Matrix Multiplication Algorithms With Better Time Complexity
9 pages
How To Estimate Motor Regeneration and VM Pumping - ssztch8
No ratings yet
How To Estimate Motor Regeneration and VM Pumping - ssztch8
7 pages
Fast Fourier Transform (FFT) FAQ
No ratings yet
Fast Fourier Transform (FFT) FAQ
4 pages
Analysis of Time Complexity of Strassens Algo
No ratings yet
Analysis of Time Complexity of Strassens Algo
4 pages
Strassen's Matrix Multiplication
100% (5)
Strassen's Matrix Multiplication
11 pages
Strassen's Algorithm For MATRIX MULTIPLICATION
No ratings yet
Strassen's Algorithm For MATRIX MULTIPLICATION
13 pages
Matrix_multiplication_algorithm
No ratings yet
Matrix_multiplication_algorithm
9 pages
Final Report Daa Case Study 1
No ratings yet
Final Report Daa Case Study 1
19 pages
Strassen Matrix Multiplication: Under The Guidance of
No ratings yet
Strassen Matrix Multiplication: Under The Guidance of
10 pages
DAA
No ratings yet
DAA
8 pages
Breaking Through Memory Limitation in GPU Parallel Processing Using Strassen Algorithm
No ratings yet
Breaking Through Memory Limitation in GPU Parallel Processing Using Strassen Algorithm
5 pages
Using Strassen's Algorithm To Accelerate The Solution of Linear Systems
No ratings yet
Using Strassen's Algorithm To Accelerate The Solution of Linear Systems
15 pages
Strassen
No ratings yet
Strassen
11 pages
Strassen's Matrix Multiplication
100% (1)
Strassen's Matrix Multiplication
12 pages
Copy of Algo_Presentation
No ratings yet
Copy of Algo_Presentation
20 pages
Strassens Matrix Multiplication
No ratings yet
Strassens Matrix Multiplication
18 pages
DAA IA-1 Case Study Material-CSE
No ratings yet
DAA IA-1 Case Study Material-CSE
9 pages
The Macross Saga Errata 1.5
No ratings yet
The Macross Saga Errata 1.5
5 pages
A Derivation of The EM Updates For Finding The Maximum Likelihood Parameter Estimates of The Student's T Distribution
No ratings yet
A Derivation of The EM Updates For Finding The Maximum Likelihood Parameter Estimates of The Student's T Distribution
5 pages
1 Matrix Multiplication: Strassen's Algorithm: Tuan Nguyen, Alex Adamson, Andreas Santucci
No ratings yet
1 Matrix Multiplication: Strassen's Algorithm: Tuan Nguyen, Alex Adamson, Andreas Santucci
8 pages
Strassen's Matrix Multiplication
No ratings yet
Strassen's Matrix Multiplication
10 pages
Internet of Things A Survey On Enabling PDF
No ratings yet
Internet of Things A Survey On Enabling PDF
30 pages
Strassens Matrix Multiflication
No ratings yet
Strassens Matrix Multiflication
14 pages
Strassen Matrix
No ratings yet
Strassen Matrix
24 pages
What Are Derivatives of Displacement
No ratings yet
What Are Derivatives of Displacement
10 pages
Strassen's Matrix Multiplcation
No ratings yet
Strassen's Matrix Multiplcation
13 pages
Strassen's Matrix Multiplication Algorithm: Problem Description
No ratings yet
Strassen's Matrix Multiplication Algorithm: Problem Description
5 pages
Csce411 Divideconquer2
No ratings yet
Csce411 Divideconquer2
12 pages
Strassen S
No ratings yet
Strassen S
10 pages
Matrix Multiplication1
No ratings yet
Matrix Multiplication1
10 pages
Strassen Matrix Multiplication
No ratings yet
Strassen Matrix Multiplication
29 pages
FALLSEM2023-24 CSE2012 ETH VL2023240103657 2023-10-06 Reference-Material-I
No ratings yet
FALLSEM2023-24 CSE2012 ETH VL2023240103657 2023-10-06 Reference-Material-I
11 pages
Tiny Encryption Algorithm (TEA) For The Compact Framework: Download Source Files - 96.1 KB
No ratings yet
Tiny Encryption Algorithm (TEA) For The Compact Framework: Download Source Files - 96.1 KB
5 pages
Performing Worst-Case Circuit Analysis With LTspice - Technical Articles
No ratings yet
Performing Worst-Case Circuit Analysis With LTspice - Technical Articles
10 pages
10 Strassens Matrix Multiplication
No ratings yet
10 Strassens Matrix Multiplication
25 pages
Strassen's Matrix Multiplication
No ratings yet
Strassen's Matrix Multiplication
11 pages
Matrix Mult 09
No ratings yet
Matrix Mult 09
12 pages
Strassens Matrix Multiplication
No ratings yet
Strassens Matrix Multiplication
11 pages
Numerical Implementation of The Hilbert Transform - XianglingWang
No ratings yet
Numerical Implementation of The Hilbert Transform - XianglingWang
147 pages
Strassen Algorithm
No ratings yet
Strassen Algorithm
4 pages
Strassen S
No ratings yet
Strassen S
3 pages
Fast Sparse Matrix Multiplication
No ratings yet
Fast Sparse Matrix Multiplication
11 pages
Stassen Matrix Multiplication
No ratings yet
Stassen Matrix Multiplication
5 pages
Computing The Hilbert Transform and Its Inverse PDF
No ratings yet
Computing The Hilbert Transform and Its Inverse PDF
28 pages
Back To Basics:: Loop Vs Line Power
No ratings yet
Back To Basics:: Loop Vs Line Power
4 pages
CS124 Spring 2011: (N) Is The Number of Comparisons, Then T (N) 2T (n/2) + 2. (The 2T (n/2) Term Comes From
No ratings yet
CS124 Spring 2011: (N) Is The Number of Comparisons, Then T (N) 2T (n/2) + 2. (The 2T (n/2) Term Comes From
4 pages
Strassen PDF
No ratings yet
Strassen PDF
27 pages
A Short Fuzzy Logic Tutorial
No ratings yet
A Short Fuzzy Logic Tutorial
6 pages
Stationary & Non-Stationary Processes
No ratings yet
Stationary & Non-Stationary Processes
17 pages
Strassen's Matrix Multiplication
No ratings yet
Strassen's Matrix Multiplication
13 pages
Strassen Matrix Multiplication
No ratings yet
Strassen Matrix Multiplication
11 pages
Strassen's Matrix Multiplication: Sibel Kirmizigül
No ratings yet
Strassen's Matrix Multiplication: Sibel Kirmizigül
11 pages
Applying The Wheatstone Bridge Circuit
0% (1)
Applying The Wheatstone Bridge Circuit
36 pages
Lecture 33 Algebraic Computation and FFTs
No ratings yet
Lecture 33 Algebraic Computation and FFTs
16 pages
Strassen's 2 × 2 Matrix Multiplication Algorithm: A Conceptual Perspective
No ratings yet
Strassen's 2 × 2 Matrix Multiplication Algorithm: A Conceptual Perspective
6 pages
Libya Free High Study Academy / Misrata: - Report of Address
No ratings yet
Libya Free High Study Academy / Misrata: - Report of Address
19 pages
Matrices with MATLAB (Taken from "MATLAB for Beginners: A Gentle Approach")
From Everand
Matrices with MATLAB (Taken from "MATLAB for Beginners: A Gentle Approach")
Peter Kattan
3/5 (4)
Random Sample Consensus: Robust Estimation in Computer Vision
From Everand
Random Sample Consensus: Robust Estimation in Computer Vision
Fouad Sabry
No ratings yet
Kernel Methods: Fundamentals and Applications
From Everand
Kernel Methods: Fundamentals and Applications
Fouad Sabry
No ratings yet
Backpropagation: Fundamentals and Applications for Preparing Data for Training in Deep Learning
From Everand
Backpropagation: Fundamentals and Applications for Preparing Data for Training in Deep Learning
Fouad Sabry
No ratings yet

Strassen_algorithm

Uploaded by

Strassen_algorithm

Uploaded by

Strassen algorithm

with . The naive algorithm would be:

The Strassen algorithm defines instead new values:

Rank or bilinear complexity

A good implementation will observe the following:

A product of size can be done as 20 separate operations,

Retrieved from "https://en.wikipedia.org/w/index.php?title=Strassen_algorithm&oldid=1188592387"

You might also like