TLP

Overview

¨ Announcement
¤ Homework 4 is due on Dec. 11th

¨ This lecture
¤ Thread level parallelism (TLP)
¤ Parallel architectures for exploiting TLP
n Hardware multithreading
n Symmetric multiprocessors
n Chip multiprocessing
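Flynn’s Taxonomy
¨ Forms of computer architectures, classified by instruction stream and data stream
¤ Single instruction stream, single data stream: Single-Instruction, Single Data (SISD)
n uniprocessors
¤ Multiple instruction streams, single data stream: Multiple-Instruction, Single Data (MISD)
n systolic arrays
¤ Single instruction stream, multiple data streams: Single-Instruction, Multiple Data (SIMD)
n vector processors
¤ Multiple instruction streams, multiple data streams: Multiple-Instruction, Multiple Data (MIMD)
n multiprocessors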
Basics of Threads
¨ A thread is a single sequential flow of control within a program, including its instructions and state
¤ The register state is called the thread context
¨ A program may be single- or multi-threaded
¤ A single-threaded program can handle only one task at a time
¨ Modern operating systems multitask by context switching: the old thread’s context is written back to memory and the new thread’s context is loaded
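
The context switch described above can be sketched as a toy model. This is only an illustration: the `ThreadContext` layout (one PC plus four registers), the `ToyCPU` class, and the `context_switch` helper are hypothetical names, not part of the lecture or of any real operating system.

```python
from dataclasses import dataclass, field


@dataclass
class ThreadContext:
    """Saved register state of one thread (hypothetical layout)."""
    pc: int = 0
    regs: list = field(default_factory=lambda: [0] * 4)


class ToyCPU:
    """A CPU that holds exactly one thread context at a time."""

    def __init__(self):
        self.pc = 0
        self.regs = [0] * 4

    def save(self) -> ThreadContext:
        # Copy the running thread's registers so they can be written to memory.
        return ThreadContext(pc=self.pc, regs=list(self.regs))

    def load(self, ctx: ThreadContext) -> None:
        # Bring a previously saved context back onto the CPU.
        self.pc = ctx.pc
        self.regs = list(ctx.regs)


def context_switch(cpu, memory, old_tid, new_tid):
    """Write the old thread's context to memory, then load the new one."""
    memory[old_tid] = cpu.save()
    cpu.load(memory[new_tid])


cpu = ToyCPU()
memory = {0: ThreadContext(), 1: ThreadContext(pc=100, regs=[7, 0, 0, 0])}
cpu.load(memory[0])
cpu.pc, cpu.regs[0] = 12, 42      # thread 0 makes some progress
context_switch(cpu, memory, old_tid=0, new_tid=1)
print(cpu.pc, cpu.regs)           # thread 1's state is now on the CPU
print(memory[0])                  # thread 0's progress is kept in memory
```

Both the save and the load touch every register, which is one way to see why context switching has a cost.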
Thread Level Parallelism (TLP)
¨ Users often run multiple applications at the same time
¤ Piping applications in Linux
n gunzip -c foo.gz | grep bar | perl some-script.pl
¤ Your favorite applications while working in the office
n Music player, web browser, terminal, etc.
¨ Many applications are amenable to parallelism
¤ Explicitly multi-threaded programs
n Pthreaded applications
¤ Parallel languages and libraries
n Java, C#, OpenMP
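
As a rough illustration of an explicitly multi-threaded program, the sketch below splits a line-matching job across worker threads using Python's standard `concurrent.futures` module. The function names and the input data are made up for this example. In standard CPython builds the global interpreter lock usually prevents threads from running Python bytecode in parallel, so the sketch shows program structure rather than a guaranteed speedup.

```python
from concurrent.futures import ThreadPoolExecutor


def count_matches(lines, word):
    """Count the lines that contain `word` (the work done by one thread)."""
    return sum(1 for line in lines if word in line)


def parallel_count(lines, word, n_threads=4):
    """Split `lines` into n_threads chunks and count matches in each chunk."""
    # Interleaved slicing gives every worker roughly the same amount of work.
    chunks = [lines[i::n_threads] for i in range(n_threads)]
    with ThreadPoolExecutor(max_workers=n_threads) as pool:
        partial = pool.map(count_matches, chunks, [word] * n_threads)
        return sum(partial)


lines = ["foo bar", "baz", "bar bar", "qux"] * 1000
print(parallel_count(lines, "bar"))
```

Each chunk is independent of the others, which is what makes this kind of job amenable to thread level parallelism.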
Thread Level Parallel Architectures
¨ Architectures for exploiting thread-level parallelism
¨ Hardware multithreading
¤ Multiple threads run on the same processor pipeline
¤ Multithreading levels
n Coarse grained multithreading (CGMT)
n Fine grained multithreading (FGMT)
n Simultaneous multithreading (SMT)
¨ Multiprocessing
¤ Different threads run on different processors
¤ Two general types
n Symmetric multiprocessors (SMP): single CPU per chip
n Chip multiprocessors (CMP): multiple CPUs per chip
Hardware Multithreading
¨ Observation: CPUs become idle due to the latency of memory operations, dependent instructions, and branch resolution
¨ Key idea: utilize idle resources to improve performance
¤ Support multiple thread contexts in a single processor
¤ Exploit thread level parallelism
¨ Challenge: the energy and performance costs of context switching
Coarse Grained Multithreading
¨ A single thread runs until a costly stall, e.g. a last level cache miss
¨ Another thread starts during the first thread's stall
¤ Pipeline fill time requires several cycles!
¨ At any time, only one thread is in the pipeline
¨ Does not cover short stalls
¨ Needs hardware support
¤ PC and register file for each thread
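
The behavior listed above can be explored with a small cycle-counting model. It is a sketch under simplifying assumptions, not a description of any real processor: the single-issue pipeline, the `switch_threshold` that decides which stalls count as costly, and the `refill_cycles` penalty are all hypothetical parameters.

```python
def simulate_cgmt(threads, switch_threshold=10, refill_cycles=3):
    """Toy coarse grained multithreading model; every parameter is hypothetical.

    Each thread is a list of stall latencies: entry k is the number of stall
    cycles that follow instruction k (0 means no stall).
    Returns (total_cycles, idle_cycles).
    """
    n = len(threads)
    pcs = [0] * n        # next instruction index of each thread
    ready_at = [0] * n   # cycle at which each thread's costly stall ends
    cycle = idle = 0
    current = 0

    def runnable(t):
        return pcs[t] < len(threads[t]) and ready_at[t] <= cycle

    while any(pcs[t] < len(threads[t]) for t in range(n)):
        if not runnable(current):
            others = [t for t in range(n) if runnable(t)]
            if not others:
                # Nobody can run: the pipeline idles for one cycle.
                cycle += 1
                idle += 1
                continue
            # Switch threads and pay the pipeline fill time.
            current = others[0]
            cycle += refill_cycles
            idle += refill_cycles
        stall = threads[current][pcs[current]]
        pcs[current] += 1
        cycle += 1
        if stall >= switch_threshold:
            # Costly stall: the thread sleeps and another one may take over.
            ready_at[current] = cycle + stall
        else:
            # Short stall: not covered, the pipeline simply waits.
            cycle += stall
            idle += stall
    return cycle, idle


miss_thread = [0, 20, 0]   # second instruction misses in the last level cache
busy_thread = [0] * 10     # ten instructions without stalls
print(simulate_cgmt([miss_thread, busy_thread]))
print(simulate_cgmt([miss_thread]), simulate_cgmt([busy_thread]))
```

With these made-up numbers, running both threads together takes fewer cycles than running them one after the other, because the second thread fills part of the first thread's miss latency.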
Coarse Grained Multithreading
¨ Superscalar vs. CGMT
[Figure: issue slots of functional units FU1 to FU4 over time, conventional superscalar vs. coarse grained multithreading]
Fine Grained Multithreading
¨ Two or more threads interleave instructions
¤ Round-robin fashion
¤ Skip stalled threads
¨ Needs hardware support
¤ Separate PC and register file for each thread
¤ Hardware to control alternating pattern
¨ Naturally hides delays
¤ Data hazards, cache misses
¤ Pipeline runs with rare stalls
¨ Does not make full use of multi-issue architecture
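
A minimal sketch of the round-robin interleaving follows, assuming a single-issue pipeline in which a thread cannot issue again until the stall after its previous instruction has passed. The function name and the stall latencies are hypothetical.

```python
def simulate_fgmt(threads):
    """Toy fine grained multithreading model (single issue per cycle).

    Each thread is a list of stall latencies: entry k is the number of cycles
    the thread must wait after issuing instruction k.
    Returns (total_cycles, idle_cycles).
    """
    n = len(threads)
    pcs = [0] * n        # next instruction index of each thread
    ready_at = [0] * n   # earliest cycle at which each thread may issue again
    cycle = idle = 0
    last = -1            # thread that issued most recently
    while any(pcs[t] < len(threads[t]) for t in range(n)):
        # Round-robin order starting after the last issuer; skip stalled threads.
        order = [(last + 1 + k) % n for k in range(n)]
        pick = next(
            (t for t in order
             if pcs[t] < len(threads[t]) and ready_at[t] <= cycle),
            None,
        )
        if pick is None:
            idle += 1    # every unfinished thread is stalled
        else:
            stall = threads[pick][pcs[pick]]
            pcs[pick] += 1
            ready_at[pick] = cycle + 1 + stall
            last = pick
        cycle += 1
    return cycle, idle


hazard_thread = [2, 2, 0]   # two instructions each followed by a 2-cycle hazard
print(simulate_fgmt([hazard_thread]))
print(simulate_fgmt([hazard_thread] * 3))
```

In this model a lone thread leaves idle cycles after each hazard, while three interleaved copies of the same thread keep the pipeline busy, which matches the claim that FGMT naturally hides short delays.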

Fine Grained Multithreading
¨ CGMT vs. FGMT
[Figure: issue slots of functional units FU1 to FU4 over time, coarse grained multithreading vs. fine grained multithreading]
Simultaneous Multithreading
¨ Instructions from multiple threads are issued in the same cycle
¤ Uses the register renaming and dynamic scheduling facilities of a multi-issue architecture
¨ Needs more hardware support
¤ Register file and PC for each thread
¤ Temporary result registers before commit
¤ Support to sort out which threads get results from which instructions
¨ Maximizes utilization of execution units
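
To see why issuing from several threads in the same cycle raises utilization, the sketch below counts filled issue slots on a hypothetical 4-wide machine. It is a deliberately crude model: each thread is described only by how many independent instructions it has available at each step, and renaming, commit, and stalls are ignored. Setting `simultaneous=False` restricts each cycle to one thread, as in fine grained multithreading.

```python
def simulate_issue(threads, width=4, simultaneous=True):
    """Toy issue-slot model for a multi-issue pipeline (hypothetical).

    Each thread is a list of positive integers: entry k is the number of
    independent instructions the thread can issue at its k-th step.
    Returns (cycles, fraction_of_issue_slots_used).
    """
    remaining = [list(t) for t in threads]   # work left, copied per thread
    n = len(remaining)
    cycles = used = 0
    start = 0                                # rotating priority between threads
    while any(remaining):
        free = width
        for k in range(n):
            t = (start + k) % n
            if not remaining[t] or free == 0:
                continue
            take = min(free, remaining[t][0])
            remaining[t][0] -= take
            free -= take
            if remaining[t][0] == 0:
                remaining[t].pop(0)
            if not simultaneous:
                break                        # only one thread issues per cycle
        used += width - free
        start = (start + 1) % n
        cycles += 1
    utilization = used / (cycles * width) if cycles else 0.0
    return cycles, utilization


work = [[2, 1, 2], [1, 2, 1]]   # made-up instruction level parallelism per step
print(simulate_issue(work, simultaneous=False))
print(simulate_issue(work, simultaneous=True))
```

With this made-up workload, neither thread alone can fill four slots, so sharing each cycle between the two threads finishes in fewer cycles and uses a larger fraction of the slots.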
Simultaneous Multithreading
¨ FGMT vs. SMT
[Figure: issue slots of functional units FU1 to FU4 over time, fine grained multithreading vs. simultaneous multithreading]
Multiprocessing
Symmetric Multiprocessors
¨ Multiple CPU chips share the same memory
¨ From the OS’s point of view
¤ All of the CPUs have equal compute capabilities
¤ The main memory is equally accessible by the CPU chips
¨ OS runs every thread on a CPU
¨ Every CPU has its own power distribution and cooling system
[Figure: apps running on an OS on top of CPU 0 to CPU 3; example: AMD Opteron]
Chip Multiprocessors
¨ Can be viewed as a simple SMP on a single chip
¨ CPUs are now called cores
¤ One thread per core
¨ Shared higher level caches
¤ Typically the last level
¤ Lower latency
¤ Improved bandwidth
¨ Not necessarily homogeneous cores!
[Figure: Core 0, Core 1, …, Core 3 on top of a shared cache; example: Intel Nehalem (Core i7)]
Why Chip Multiprocessing?
¨ CMP exploits parallelism at lower cost than SMP
¤ A single interface to the main memory
¤ Only one CPU socket is required on the motherboard
¨ CMP requires less off-chip communication
¤ Lower power and energy consumption
¤ Better performance due to improved average memory access time (AMAT)
¨ CMP better employs the additional transistors made available by Moore’s law
¤ More cores rather than more complicated pipelines
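
The AMAT claim can be made concrete with the usual textbook formula, AMAT = hit time + miss rate × miss penalty. The numbers below are invented purely for illustration and are not measurements of any real SMP or CMP; the only assumption carried over from the slide is that a miss served on-chip has a smaller penalty than one that goes off-chip.

```python
def amat(hit_time, miss_rate, miss_penalty):
    """Average memory access time, in the same unit as hit_time (cycles here)."""
    return hit_time + miss_rate * miss_penalty


# Hypothetical values: same cache hit time and miss rate in both cases,
# but misses are served off-chip (SMP-like) or on-chip (CMP-like).
off_chip = amat(hit_time=1, miss_rate=0.05, miss_penalty=200)
on_chip = amat(hit_time=1, miss_rate=0.05, miss_penalty=40)
print(off_chip, on_chip)
```

Because the miss penalty is multiplied by the miss rate, even a modest reduction in off-chip traffic shows up directly in the average.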
