
Multi-threaded Architecture

In computer architecture, multithreading is the ability of a central processing
unit (CPU) (or a single core in a multi-core processor) to provide multiple
threads of execution concurrently, supported by the operating system. This
approach differs from multiprocessing. In a multithreaded application, the
threads share the resources of a single or multiple cores, which include the
computing units, the CPU caches, and the translation lookaside buffer (TLB).
Where multiprocessing systems include multiple complete processing units in one
or more cores, multithreading aims to increase utilization of a single core by
using thread-level parallelism, as well as instruction-level parallelism. As the two
techniques are complementary, they are sometimes combined in systems with
multiple multithreading CPUs and with CPUs with multiple multithreading cores.
The multithreading paradigm has become more popular as efforts to further
exploit instruction-level parallelism have stalled since the late 1990s. This allowed
the concept of throughput computing to re-emerge from the more specialized field
of transaction processing. Even though it is very difficult to further speed up a
single thread or single program, most computer systems are actually multitasking
among multiple threads or programs. Thus, techniques that improve the throughput
of all tasks result in overall performance gains.

Two major techniques for throughput computing are multithreading and
multiprocessing.
Advantages
If a thread gets a lot of cache misses, the other threads can continue taking
advantage of the unused computing resources, which may lead to faster overall
execution, as these resources would have been idle if only a single thread were
executed. Also, if a thread cannot use all the computing resources of the CPU
(because its instructions depend on one another's results), running another
thread may prevent those resources from becoming idle.
Disadvantages
Multiple threads can interfere with each other when sharing hardware resources
such as caches or translation lookaside buffers (TLBs). As a result, execution times
of a single thread are not improved and can be degraded, even when only one
thread is executing, due to lower frequencies or additional pipeline stages that are
necessary to accommodate thread-switching hardware.
Overall efficiency varies; Intel claims up to 30% improvement with its Hyper-
Threading Technology,[1] while a synthetic program just performing a loop of non-
optimized dependent floating-point operations actually gains a 100% speed
improvement when run in parallel. On the other hand, hand-tuned assembly
language programs using MMX or AltiVec extensions and performing data pre-
fetches (as a good video encoder might) do not suffer from cache misses or idle
computing resources. Such programs therefore do not benefit from hardware
multithreading and can indeed see degraded performance due to contention for
shared resources.
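
The dependent floating-point case is easy to reproduce. Below is a minimal
C++ sketch, not a rigorous benchmark: the function name dependent_chain, the
iteration count, and the timing harness are all invented for illustration,
and the two software threads only exercise SMT if the operating system
schedules them onto the same physical core (affinity pinning is OS-specific
and omitted here). On an SMT machine the two-thread run can approach half the
single-thread time, because each hardware thread fills the issue slots that
the other's dependence chain leaves empty.

    // Hedged sketch: one long dependence chain vs. the same work split
    // across two threads. All numbers and names are illustrative assumptions.
    #include <chrono>
    #include <cstdio>
    #include <thread>

    // A serial dependence chain: each step needs the previous result, so a
    // single thread exposes almost no instruction-level parallelism.
    double dependent_chain(long iters) {
        volatile double x = 1.0;               // volatile: keep the loop honest
        for (long i = 0; i < iters; ++i)
            x = x * 1.0000001 + 0.0000001;
        return x;
    }

    int main() {
        const long ITERS = 50'000'000;         // illustrative workload size
        auto t0 = std::chrono::steady_clock::now();
        dependent_chain(2 * ITERS);            // all work on one thread
        auto t1 = std::chrono::steady_clock::now();
        std::thread a(dependent_chain, ITERS); // same total work, split in two
        std::thread b(dependent_chain, ITERS);
        a.join(); b.join();
        auto t2 = std::chrono::steady_clock::now();
        using ms = std::chrono::duration<double, std::milli>;
        std::printf("one thread:  %.1f ms\n", ms(t1 - t0).count());
        std::printf("two threads: %.1f ms\n", ms(t2 - t1).count());
    }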

Types of multithreading
Interleaved/Temporal multithreading
Temporal multithreading issues instructions from only one thread in any given
cycle; it comes in two variants, coarse-grained and interleaved (fine-grained),
described below.
Coarse-grained multithreading
The simplest type of multithreading occurs when one thread runs until it is blocked
by an event that normally would create a long-latency stall. Such a stall might be a
cache miss that has to access off-chip memory, which might take hundreds of CPU
cycles for the data to return. Instead of waiting for the stall to resolve, a threaded
processor would switch execution to another thread that was ready to run. Only
when the data for the previous thread had arrived, would the previous thread be
placed back on the list of ready-to-run threads.
For example:

1. Cycle i: instruction j from thread A is issued.
2. Cycle i + 1: instruction j + 1 from thread A is issued.
3. Cycle i + 2: instruction j + 2 from thread A is issued, which is a load
instruction that misses in all caches.
4. Cycle i + 3: thread scheduler invoked, switches to thread B.
5. Cycle i + 4: instruction k from thread B is issued.
6. Cycle i + 5: instruction k + 1 from thread B is issued.
Conceptually, it is similar to cooperative multi-tasking used in real-time
operating systems, in which tasks voluntarily give up execution time when they
need to wait upon some type of event. This type of multithreading is known as
block, cooperative or coarse-grained multithreading.
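
The switch-on-stall policy can be expressed as a toy simulator. The sketch
below is illustrative only: the thread traces, the four-cycle stall (standing
in for the hundreds of cycles of a real off-chip miss), and the one-cycle
switch penalty are all invented.

    // Toy coarse-grained (block) multithreading: stay on one thread and
    // switch only when it stalls on a long-latency event or runs out of work.
    #include <cstdio>
    #include <vector>

    struct Thread {
        const char* name;
        int pc = 0;                      // index of the next instruction
        int stall_until = 0;             // cycle at which the thread is ready again
        std::vector<bool> is_load_miss;  // true = long-latency load at that slot
    };

    int main() {
        const int STALL_CYCLES = 4;      // stand-in for a real off-chip miss
        std::vector<Thread> threads = {
            {"A", 0, 0, {false, false, true,  false}},  // 3rd instruction misses
            {"B", 0, 0, {false, false, false, false}},
        };
        int current = 0;
        for (int cycle = 0; cycle < 10; ++cycle) {
            Thread& t = threads[current];
            if (cycle < t.stall_until || t.pc >= (int)t.is_load_miss.size()) {
                current = (current + 1) % (int)threads.size();  // switch on stall
                std::printf("cycle %d: scheduler switches to thread %s\n",
                            cycle, threads[current].name);
                continue;                // the switch itself costs one cycle
            }
            std::printf("cycle %d: issue instruction %d from thread %s\n",
                        cycle, t.pc, t.name);
            if (t.is_load_miss[t.pc]) t.stall_until = cycle + STALL_CYCLES;
            ++t.pc;
        }
    }

Run for ten cycles, this reproduces the pattern above: thread A issues until
its load misses, the scheduler spends a cycle switching, and thread B issues
while A waits for its data.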
Interleaved multithreading
The purpose of interleaved multithreading is to remove all data dependency stalls
from the execution pipeline. Since one thread is relatively independent from other
threads, there is less chance of one instruction in one pipelining stage needing an
output from an older instruction in the pipeline. Conceptually, it is similar
to preemptive multitasking used in operating systems; an analogy would be that the
time slice given to each active thread is one CPU cycle.
For example:

1. Cycle i: an instruction from thread A is issued.
2. Cycle i + 1: an instruction from thread B is issued.
3. Cycle i + 2: an instruction from thread C is issued.
This type of multithreading was first called barrel processing, in which the staves
of a barrel represent the pipeline stages and their executing threads. Interleaved,
preemptive, fine-grained or time-sliced multithreading are more modern
terminology.
In addition to the hardware costs discussed in the block type of multithreading,
interleaved multithreading has an additional cost of each pipeline stage tracking the
thread ID of the instruction it is processing. Also, since there are more threads
being executed concurrently in the pipeline, shared resources such as caches and
TLBs need to be larger to avoid thrashing between the different threads.
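
A strict round-robin issue loop captures fine-grained interleaving in a few
lines. In this illustrative sketch the thread names are invented and the
per-stage thread-ID bookkeeping is reduced to a tag printed with each issued
instruction.

    // Toy interleaved (fine-grained) multithreading: the core moves to the
    // next thread every single cycle, so adjacent pipeline stages almost
    // never hold dependent instructions from the same thread.
    #include <cstdio>
    #include <vector>

    int main() {
        std::vector<const char*> threads = {"A", "B", "C"};
        std::vector<int> pc(threads.size(), 0);
        for (int cycle = 0; cycle < 6; ++cycle) {
            int t = cycle % (int)threads.size(); // one cycle per thread, rotating
            std::printf("cycle %d: issue instruction %d from thread %s "
                        "(tagged with thread ID %d)\n",
                        cycle, pc[t], threads[t], t);
            ++pc[t];
        }
    }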

Simultaneous multithreading
The most advanced type of multithreading applies to superscalar processors.
Whereas a normal superscalar processor issues multiple instructions from a single
thread every CPU cycle, in simultaneous multithreading (SMT) a superscalar
processor can issue instructions from multiple threads every CPU cycle.
Recognizing that any single thread has a limited amount of instruction-level
parallelism, this type of multithreading tries to exploit parallelism available across
multiple threads to decrease the waste associated with unused issue slots.
For example:

1. Cycle i: instructions j and j + 1 from thread A and instruction k from
thread B are simultaneously issued.
2. Cycle i + 1: instruction j + 2 from thread A, instruction k + 1 from
thread B, and instruction m from thread C are all simultaneously issued.
3. Cycle i + 2: instruction j + 3 from thread A and instructions m + 1 and
m + 2 from thread C are all simultaneously issued.
To distinguish the other types of multithreading from SMT, the term "temporal
multithreading" is used to denote when instructions from only one thread can be
issued at a time.
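
The SMT issue pattern above can be reproduced with a toy model of a 3-wide
core. The per-cycle counts of ready instructions below are contrived purely to
match the numbered example; a real core would select instructions by
dependence and resource checks rather than from a fixed table.

    // Toy SMT: each cycle, a 3-wide superscalar fills its issue slots from
    // whichever threads have independent instructions ready.
    #include <algorithm>
    #include <cstdio>
    #include <vector>

    int main() {
        const int WIDTH = 3;                 // issue slots per cycle
        // ready[cycle][thread]: instructions that thread can offer that cycle
        std::vector<std::vector<int>> ready = {
            {2, 1, 0},   // cycle i:     two from A, one from B
            {1, 1, 1},   // cycle i + 1: one each from A, B, C
            {1, 0, 2},   // cycle i + 2: one from A, two from C
        };
        const char* names = "ABC";
        std::vector<int> pc(3, 0);
        for (int cycle = 0; cycle < (int)ready.size(); ++cycle) {
            int slots = WIDTH;
            std::printf("cycle %d:", cycle);
            for (int t = 0; t < 3 && slots > 0; ++t) {
                int n = std::min(ready[cycle][t], slots);
                for (int k = 0; k < n; ++k)
                    std::printf(" %c%d", names[t], pc[t]++);  // e.g. "A0"
                slots -= n;
            }
            std::printf("\n");
        }
    }

The output lists instructions A0 A1 B0, then A2 B1 C0, then A3 C1 C2,
mirroring cycles i through i + 2 in the numbered example.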
