Parallel Processing Lecture1
Parallel Processing Lecture1
Lecture 1
WHY PARALLEL PROCESSING?
The quest for higher-performance digital
computers seems unending. In the past two
decades, the performance of microprocessors
has enjoyed an exponential growth. The
growth of microprocessor speed/performance
by a factor of 2 every 18 months (or about
60% per year) is known as Moore’s law.
TIPS
Projection,
circa 1998
Projection,
Processor performance
circa 2012
1.6 / yr
GIPS The number of cores
Pentium II R10000 has been increasing
from a few in 2005
Pentium to the current 10s, and
68040 is projected to reach
80486 100s by 2020
80386 Cores
MIPS
68000 1000
80286
100
10
KIPS 1
1980 1990 2000 2010 2020
Calendar year
From:
“Robots After All,”
by H. Moravec,
CACM, pp. 90-97,
October 2003.
Processor performance
1.6 / yr
Denser circuits; Architectural improvements GIPS
Pentium II R10000
68000
Floating-point operations per second 80286
Clock
Power
Cores
Year of Introduction
NRC Report (2011): The Future of Computing Performance: Game Over or Next Level?
Trends in Processor Chip Density, Performance,
Clock Speed, Power, and Number of Cores
Year of Introduction
Original data up to 2010 collected/plotted by M. Horowitz et al.; Data for 2010-2017 extension collected by K. Rupp
Shares of Technology and Architecture in Processor
Performance Improvement
Koomey’s Law:
Exponential improvement in
energy-efficient computing,
with computations
performed per KWh
doubling every 1.57 years
$30M MPPs
CM-5 Vector supers
TFLOPS
CM-5
CM-2
Micros
Y-MP
GFLOPS
Alpha
Cray
X-MP 80860
80386
MFLOPS
1980 1990 2000 2010
Calendar year
Fig. 1.2 The exponential growth in supercomputer performance over
the past two decades (from [Bell92], with ASCI performance goals and
microprocessor peak FLOPS superimposed as dotted lines).
The ASCI Program
1000 Plan Develop Use
Performance (TFLOPS)
100+ TFLOPS, 20 TB
100 ASCI Purple
30+ TFLOPS, 10 TB
ASCI Q
10+ TFLOPS, 5 TB
10 ASCI White ASCI
3+ TFLOPS, 1.5 TB
ASCI Blue
1+ TFLOPS, 0.5 TB
1 ASCI Red
1995 2000 2005 2010
Calendar year
Fig. 24.1 Milestones in the Accelerated Strategic (Advanced Simulation &)
Computing Initiative (ASCI) program, sponsored by the US Department of
Energy, with extrapolation up to the PFLOPS level.
The Quest for Higher Performance
Top Three Supercomputers in 2005 (IEEE Spectrum, Feb. 2005, pp. 15-16)
Parallelism = Concurrency
Doing more than one thing at a time
Has been around for decades, since early computers
I/O channels, DMA, device controllers, multiple ALUs
Bit-vector n
1 2
1 2 n
(b)
2 |
| 3 11 | 19 29 31
5 | 7 13|17 23
p = 3, t = 499
Communi-
c ation n/p+1 2n/p
n <n/p
n–n/p+1 n
Ideal speedup
Computation
Communication
Ideal speedup
Solution time
Actual speedup
Computation
I/O time