Unit4 Session1 Intro To Parallel Computing
Unit4 Session1 Intro To Parallel Computing
Architecture (μpCA)
Unit 4: Introduction to Parallel Computing
UE22CS251B
Session : 4.1
Microprocessor & Computer Architecture (μpCA)
Sequential Computing
Vs
Microprocessor & Computer Architecture (μpCA)
A Computational Problem on a Parallel System
• Virtually all stand-alone computers today are parallel from a hardware perspective
• Multiple functional units
L1 cache, L2 cache, branch, prefetch, decode,
floating-point, graphics processing (GPU), integer, etc.
• Multiple execution units/cores
• Multiple hardware threads
IBM BG/Q Compute Chip with 18 cores (PU) and 16 L2 Cache units(L2)
Microprocessor & Computer Architecture (μpCA)
Parallel Computers
• Networks connect multiple stand-alone computers (nodes)
to make larger parallel computer clusters.
Working in parallel shortens completion time Parallel computing can solve increasingly complex problems
Microprocessor & Computer Architecture (μpCA)
Why use Parallel Computers ?
Large problems
• 10,000 x 10,000 x 10,000 grid
• 10^12 grid points
• 4x10^12 double variables
• 32x10^12 bytes
• 32 Tera-Bytes.
Microprocessor & Computer Architecture (μpCA)
Quantifying the Capability to Solve Problem
ASCI White
ASCI Red
Pacific
1 TFlop/s
(1012)
TMC CM-5 Cray T3D
2X Transistors/Chip Vector
Every 1.5 Years TMC CM-2
Cray 2
1 GFlop/s Cray X-MP
(109) Super Scalar
Cray 1
1941 1 (Floating Point operations / second, Flop/s)
CDC 7600 IBM 360/195 1945 100
1 MFlop/sScalar 1949 1,000 (1 KiloFlop/s, KFlop/s)
1951 10,000
(106) CDC 6600 1961 100,000
1964 1,000,000 (1 MegaFlop/s, MFlop/s)
IBM 7090 1968 10,000,000
1975 100,000,000
1987 1,000,000,000 (1 GigaFlop/s, GFlop/s)
1992 10,000,000,000
1993 100,000,000,000
1 KFlop/s 1997 1,000,000,000,000 (1 TeraFlop/s, TFlop/s)
(103) UNIVAC 1 2000 10,000,000,000,000
EDSAC 1 2005 131,000,000,000,000 (131 Tflop/s)
•During the past 20+ years, the trends indicated by ever faster networks,
distributed systems, and multi-processor computer architectures (even at the
desktop level) clearly show that parallelism is the future of computing.
•In this same time period, there has been a greater than 500,000x increase in
supercomputer performance, with no end currently in sight.
4 5
Microprocessor & Computer Architecture (μpCA)
Where Do We live?
Rpeak
Rmax (TFlop/s Power
Rank System Cores (TFlop/s) ) (kW)
165 Supercomputer SERC - Cray XC40, Xeon E5-2680v3 12C 31,104 901.5 1,244.2 608
Education and Research 2.5GHz, Aries interconnect
Centre (SERC), Indian Cray Inc.
Institute of Science
India
261 Indian Institute of iDataPlex DX360M4, Xeon E5-2670 8C 38,016 719.2 790.7 790
Tropical Meteorology 2.600GHz, Infiniband FDR
India IBM
356 Indian Lattice Gauge TIFR - Cray XC30, Intel Xeon E5- 11,424 558.8 730.7 320
Theory Initiative (ILGTI), 2680v2 10C 2.8GHz, Aries
Tata Institute of interconnect , NVIDIA K20x
Fundamental Research Cray Inc.
(TIFR)
India
392 Indian Institute of HP Apollo 6000 Xl230/250 , Xeon E5- 22,572 524.4 1,170.1 498
Technology Delhi 2680v3 12C 2.5GHz, Infiniband FDR,
India NVIDIA Tesla K40m
HPE
Microprocessor & Computer Architecture (μpCA)
Top 5 Super Computers of India
https://www.digit.in/general/top-5-supercomputers-in-india-29784.html
Microprocessor & Computer Architecture (μpCA)
Growth & Change in Trend
THANK YOU
Team MPCA
Department of Computer Science and Engineering