2-Parallel Hardware
2-Parallel Hardware
Computer History
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
Intel 4004
● The Intel 4004 was the world’s first
microprocessor - GPCPU
● Released in March 1971, and using
cutting-edge silicon-gate technology
● Manufactured in 1970
● 4Bit CPU
● 2,250 transistors
● 12 mm2
● 108 KHz
https://spectrum.ieee.org/chip-hall-of-fame-intel-4004-microprocessor#toggle-gdpr
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
http://p2k.unhamzah.ac.id/IT/en/3073-2970/8086_2467_p2k-unhamzah.html
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
Intel 80486
● Introduced in 1989
● 1,200,000 transistors
● 81 mm2
● 25 MHz
● 1st pipelined
● implementation of IA32
● 1 st processor with on-chip cache
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
Pentium
● Introduced in 1993
● 3,100,000 transistors
● 296 mm2
● 60 MHz
● 1st superscalar implementation of IA32
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
Pentium 4
● Introduced in 2000
● 55,000,000 transistors
● 146 mm2
● 3 GHz
http://www.chip-architect.com
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
https://www.computerhope.com/history/processor.htm
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
https://www.computerhope.com/history/processor.htm
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
https://semiengineering.com/knowledge_centers/compute-architectures/von-neumann-architecture/
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
● Pipelinining:
○ The hardware divided into stages
○ temporal parallelism
● Number of stages increases with each generation
● Maximum CPI (Cycles Per Instruction) = 1
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
Flynn’s Taxonomy
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
Flynn’s Taxonomy
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
SIMD
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
SIMD
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
SIMD
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
SIMD
● All ALUs are required to execute the same instruction(s), or remain idle.
● In classic design, they must also operate synchronously.
● The ALUs have no instruction storage.
● Efficient for large data parallel problems, but not other types of more complex
parallel problems.
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
MIMD
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
Cache
https://searchstorage.techtarget.com/definition/cache
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
Cache Coherence
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
Cache Coherence
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
● Uses a data structure called a directory that stores the status of each
cache line.
● When a variable is updated, the directory is consulted, and the cache
controllers of the cores that have that variable’s cache line in their
caches are invalidated.
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
Example: MESI Protocol
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
Super Computers
https://www.ibm.com/topics/supercomputing
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BM5351 PARALEL HESAPLAMA YÖNTEMLERİ
1 Supercomputer Fugaku - Supercomputer Fugaku, A64FX 48C 2.2GHz, Tofu 7,630,848 442,010.0 537,212.0 29,899
interconnect D, Fujitsu
RIKEN Center for Computational Science
Japan
2 Summit - IBM Power System AC922, IBM POWER9 22C 3.07GHz, NVIDIA 2,414,592 148,600.0 200,794.9 10,096
Volta GV100, Dual-rail Mellanox EDR Infiniband, IBM
DOE/SC/Oak Ridge National Laboratory
United States
3 Sierra - IBM Power System AC922, IBM POWER9 22C 3.1GHz, NVIDIA 1,572,480 94,640.0 125,712.0 7,438
Volta GV100, Dual-rail Mellanox EDR Infiniband, IBM / NVIDIA / Mellanox
DOE/NNSA/LLNL
United States
https://top500.org/lists/top500/2021/06/
These slides are adopted from Prof Dr Zahran's Parallel Computing lecture notes.
BBM101- Bilgisayar Programlamaya Giriş-I
Sorular