CA Classes-136-140

The document discusses the Tomasulo algorithm for dynamic scheduling and its key aspects. It describes how the algorithm avoids data hazards through register renaming and using reservation stations and load/store buffers. It also compares Tomasulo's algorithm to scoreboarding.

Uploaded by

SrinivasaRao

Computer Architecture Unit 6

6.4 Dynamic Scheduling Algorithm – The Tomasulo Approach

The dynamic scheduling algorithm described here was proposed by Robert
Tomasulo. Tomasulo’s scheme combines the key elements of the scoreboard
method with the introduction of register renaming. The scheme has many
variants. The basic idea behind the algorithm is “avoiding WAR and WAW
data hazards by the use of register renaming”.
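
To make the renaming idea concrete, here is a minimal Python sketch. It is an illustration only, not the hardware mechanism of the IBM 360/91: it assumes a hypothetical (op, dest, src1, src2) instruction tuple and an unlimited supply of fresh names T0, T1, ..., and the function name `rename` is made up for this example.

```python
from itertools import count

def rename(instrs):
    """Give every destination a fresh name; sources read the latest mapping."""
    fresh = (f"T{i}" for i in count())
    latest = {}                      # architectural register -> current name
    out = []
    for op, dest, src1, src2 in instrs:
        s1 = latest.get(src1, src1)  # look up sources before redefining dest
        s2 = latest.get(src2, src2)
        latest[dest] = next(fresh)
        out.append((op, latest[dest], s1, s2))
    return out

program = [
    ("DIV", "F0", "F2", "F4"),
    ("ADD", "F6", "F0", "F8"),    # RAW on F0: a true dependence, it must remain
    ("SUB", "F8", "F10", "F14"),  # WAR on F8 with the ADD
    ("MUL", "F6", "F10", "F8"),   # WAW on F6 with the ADD
]
renamed = rename(program)
for instr in renamed:
    print(instr)
```

After renaming, the SUB writes T2 instead of F8 and the MUL writes T3 instead of F6, so the WAR and WAW conflicts with the ADD disappear, while the ADD still reads T0 from the DIV.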
The Tomasulo algorithm
It was devised for the IBM 360/91 in 1967, about three years after the
CDC 6600. The algorithm concentrates on the floating-point unit (FPU) and
is described here in the context of a pipelined FPU for DLX. The key
difference between DLX and the IBM 360 is that the IBM 360 has
register-memory instructions. Because Tomasulo’s algorithm uses a load
functional unit (FU), no major changes are needed to add register-memory
addressing modes; the most significant addition is an extra bus. The IBM
360/91 also had pipelined FUs rather than multiple FUs. The only
difference is that a pipelined FU can start at most one operation per
clock cycle; apart from that, there is no major difference between
pipelined and multiple FUs as far as the algorithm is concerned. The IBM
360/91 could hold 3 operations for the FP (floating-point) adder and 2 for
the FP multiplier. In addition, up to 6 FP loads (memory references) and
up to 3 FP stores could be outstanding. Load data buffers and store data
buffers are used for this purpose.
Tomasulo’s scheme differs from scoreboarding in several ways:
• In Tomasulo’s scheme the control logic and buffers are distributed
among the FUs (functional units), whereas in the scoreboard technique
they are centralised. Tomasulo’s scheme uses register renaming to
avoid WAR and WAW hazards; the scoreboard technique performs no
register renaming.
• In Tomasulo’s scheme the CDB (common data bus) broadcasts results to
all FUs, whereas the scoreboard technique writes results to the
registers.
• In Tomasulo’s scheme operands can be read from the registers or from
the CDB, and results are written only to the CDB. In the scoreboard
technique operands are read from, and results written to, the
registers.

Manipal University of Jaipur B1648 Page No. 136

• In Tomasulo’s scheme an instruction can issue only when a reservation
station (RS) is free, whereas in the scoreboard technique it can issue
only when the FU is free.
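
The issue rule in the last point can be sketched in a few lines of Python. This is a simplified model under stated assumptions: the class name `StationPool` and its methods are hypothetical, and only the station counts (3 for the FP adder, 2 for the FP multiplier) come from the text.

```python
class StationPool:
    """Counts busy reservation stations in front of one functional unit."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.busy = 0

    def try_issue(self):
        # Tomasulo-style rule: issue needs a free reservation station,
        # even if the functional unit itself is still executing.
        if self.busy == self.capacity:
            return False          # structural stall at issue
        self.busy += 1
        return True

    def release(self):
        # Called when an instruction completes and frees its station.
        self.busy -= 1

pools = {"add": StationPool(3), "mul": StationPool(2)}
results = [pools["mul"].try_issue() for _ in range(3)]
print(results)
```

The third multiply cannot issue because both multiplier stations are occupied; it can issue again as soon as one of them is released.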
Figure 6.4 shows the basic structure of a Tomasulo-based floating-point
unit for DLX.

Figure 6.4: Basic Structure of a DLX Floating-Point Unit using Tomasulo’s Algorithm

The reservation stations hold the following:

• issued instructions that are waiting to be executed by the FU,
• the operands of those instructions if they have already been computed
(otherwise, the source of the operands),
• the information needed to control each instruction once it has started
execution.
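
The contents listed above can be modelled as a small record. The sketch below is one possible layout, not the exact hardware format: the field names `vj`, `vk` (operand values) and `qj`, `qk` (tags naming the producing station) follow a common textbook convention and are assumptions here, as are the helper `issue` and the station names.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class Station:
    name: str
    busy: bool = False
    op: Optional[str] = None
    vj: Optional[float] = None   # operand values, once they are known
    vk: Optional[float] = None
    qj: Optional[str] = None     # tags: which station will produce the operand
    qk: Optional[str] = None

def issue(station, op, src1, src2, dest, regs, producer):
    """Fill a free station and record it as the producer of dest."""
    station.busy, station.op = True, op
    station.qj = producer.get(src1)
    station.vj = None if station.qj else regs[src1]
    station.qk = producer.get(src2)
    station.vk = None if station.qk else regs[src2]
    producer[dest] = station.name

regs = {"F0": 0.0, "F2": 2.0, "F4": 4.0, "F6": 0.0}
producer = {}                    # register -> tag of the station writing it
mul1, add1 = Station("Mult1"), Station("Add1")
issue(mul1, "MUL", "F2", "F4", "F0", regs, producer)
issue(add1, "ADD", "F0", "F2", "F6", regs, producer)
print(add1)
```

The ADD does not copy the stale value of F0; it records the tag Mult1 instead, which is how the station holds "the source of the operands" until the value is produced.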

The data and addresses that come from or go to memory are held in the load
buffers and store buffers. A pair of buses connects the FP registers to
the FUs, and a single bus connects the FP registers to the store buffers.
The common data bus carries results from the FUs and from memory to every
unit except the load buffers. The buffers and reservation stations (RS)
contain tag fields that are used for hazard control.
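
How the tag fields and the common bus work together can be sketched as follows. This is a minimal model that assumes each waiting station is a plain dictionary with hypothetical keys `qj`/`qk` (pending tags) and `vj`/`vk` (values); the function names `broadcast` and `ready` are illustrative.

```python
def broadcast(tag, value, stations, regs, producer):
    """Deliver (tag, value) from the common bus to every waiting consumer."""
    for st in stations:
        if st["qj"] == tag:
            st["vj"], st["qj"] = value, None
        if st["qk"] == tag:
            st["vk"], st["qk"] = value, None
    for reg, waiting_on in list(producer.items()):
        if waiting_on == tag:        # register still expects this result
            regs[reg] = value
            del producer[reg]

def ready(st):
    # An instruction may start executing once no operand tag is pending.
    return st["qj"] is None and st["qk"] is None

stations = [
    {"name": "Add1", "qj": "Mult1", "vj": None, "qk": None, "vk": 2.0},
    {"name": "Add2", "qj": "Mult1", "vj": None, "qk": "Add1", "vk": None},
]
regs = {"F0": 0.0}
producer = {"F0": "Mult1"}
broadcast("Mult1", 8.0, stations, regs, producer)
print([ready(st) for st in stations])
```

A single broadcast updates both waiting stations and the register at once. Add1 becomes ready, while Add2 still waits for the result tagged Add1.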
Tomasulo’s scheme is appealing when designers are forced to pipeline an
architecture for which code is hard to schedule or which has too few
registers. Measured against its cost, however, the benefit of the
Tomasulo approach over compiler scheduling for an efficient single-issue
pipeline is small. With the growing demand for greater issue capability
and for better performance on difficult-to-schedule code, dynamic
scheduling and register renaming are becoming more widespread.
Self Assessment Questions
9. Tomasulo’s scheme was invented by ______________.
10. The ________________ could hold 3 operations for the FP adder and
2 for the FP multiplier.
11. The ____________ and ______________ are used to store the data/
addresses that come from or go to memory.

Activity 1:
Imagine yourself as a computer architect. Explain the measures you will
take to overcome data hazards with dynamic scheduling.

6.5 High Performance Instruction Delivery

In the MIPS 5-stage pipeline, the address of the next instruction to fetch
must be known before the current Instruction Fetch (IF) cycle completes.
For a zero branch penalty, therefore, the processor must know whether the
fetched (as yet undecoded) instruction is a branch and, if it is, what the
next PC (Program Counter) should be. This is achieved by adding a cache
that holds the address of the instruction following a branch, covering
both the taken and the not-taken case. This cache is known as the
Branch-Target Cache or Branch-Target Buffer (BTB).
The branch-prediction buffer is accessed during the ID stage, after the
instruction has been decoded; that is, the branch-target address is known
at the end of the ID stage and is then used to fetch the next predicted
instruction. This is shown in figure 6.5.

Figure 6.5: Branch Prediction

6.5.1 Branch target buffer

A Branch Target Buffer entry has three fields:
• Lookup: the addresses of the known branch instructions (those
predicted as taken)
• Predicted PC: the PC of the instruction to fetch next when the branch
is predicted taken
• Prediction State (optional): extra prediction-state bits
The Branch Target Buffer has the following complications:
• A complication arises when a 2-bit predictor is used, because it needs
information for both taken and not-taken branches.
• PowerPC processors resolve this complication by using both a target
buffer and a prediction buffer.
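
The three fields can be illustrated with a small Python model. It is a sketch under simplifying assumptions, not a description of any real processor: the class `BranchTargetBuffer` is hypothetical, it has unlimited capacity, and it keeps an entry (with a 2-bit saturating counter as the optional prediction state) even when the counter currently predicts not taken.

```python
class BranchTargetBuffer:
    """Maps a branch's PC to a predicted next PC plus a 2-bit counter."""

    def __init__(self):
        self.entries = {}   # lookup field: branch PC -> [target, state]

    def predict(self, pc, fallthrough):
        entry = self.entries.get(pc)
        if entry is None or entry[1] < 2:   # miss, or counter says not taken
            return fallthrough
        return entry[0]                     # hit and predicted taken

    def update(self, pc, taken, target):
        entry = self.entries.setdefault(pc, [target, 1])
        entry[0] = target
        # Saturating 2-bit counter: 0 and 1 mean not taken, 2 and 3 mean taken.
        entry[1] = min(3, entry[1] + 1) if taken else max(0, entry[1] - 1)

btb = BranchTargetBuffer()
first = btb.predict(0x40, fallthrough=0x44)    # miss: fetch falls through
btb.update(0x40, taken=True, target=0x80)
second = btb.predict(0x40, fallthrough=0x44)   # hit, predicted taken
print(hex(first), hex(second))
```

Because the lookup uses only the PC of the instruction being fetched, the prediction is available during IF, before the instruction has been decoded.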
The penalty can be calculated by considering the probability of two events:
(i) Branch is predicted taken but ends up not taken
= buffer hit rate x fraction of incorrect predictions
= 0.95 x 0.1 = 0.095
(ii) Branch is taken but is not found in the buffer
= probability that a taken branch misses the buffer
= 0.1
The penalty in both cases is 2 cycles; therefore,
Branch Penalty = (0.095 + 0.1) x 2 = 0.195 x 2 = 0.39 cycles
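
The same arithmetic can be checked with a short Python snippet. The variable names are illustrative; the numeric values are the ones used in the calculation above.

```python
hit_rate = 0.95              # probability the branch is found in the buffer
wrong_prediction = 0.10      # probability a buffered prediction is wrong
taken_not_in_buffer = 0.10   # probability of event (ii), as used above
penalty_cycles = 2           # same penalty for both events

p_event_i = hit_rate * wrong_prediction
p_event_ii = taken_not_in_buffer
branch_penalty = (p_event_i + p_event_ii) * penalty_cycles
print(round(branch_penalty, 3))   # 0.39
```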
Example:
Consider a branch-target buffer implemented for conditional branches only
in a pipelined processor.
Assume that:
• Misprediction penalty = 4 cycles
• Buffer miss penalty = 3 cycles
• Hit rate and prediction accuracy = 90% each
• Branch frequency = 15%
Solution:
The speedup with a Branch Target Buffer versus no BTB is expressed as:
Speedup = CPI_noBTB / CPI_BTB
= (CPI_base + Stalls_noBTB) / (CPI_base + Stalls_BTB)
The stalls are determined as:
Stalls = Σ (Frequency x Penalty)
That is, the stalls are the sum, over all stall cases, of the frequency of
each case multiplied by its stall penalty.
i) Stalls_noBTB = 0.15 x 2 = 0.30
ii) To find Stalls_BTB, we have to consider each possible outcome of the
BTB lookup. There are three possibilities:
a) The branch misses the BTB:
Frequency = 15% x 0.1 = 1.5% = 0.015
Penalty = 3
Stalls = 0.045
b) The branch hits and is correctly predicted:
Frequency = 15% x 0.9 (hit) x 0.9 (prediction) = 12.15% = 0.1215
Penalty = 0
Stalls = 0
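
The stall formula can be evaluated with a short Python sketch. Two things in it are assumptions rather than statements from the text: the third case (a hit that is mispredicted) is derived from the stated hit rate, accuracy and misprediction penalty, and `cpi_base = 1.0` is an assumed value, since no base CPI is given.

```python
branch_freq = 0.15
hit_rate = 0.90
accuracy = 0.90
mispredict_penalty = 4   # cycles
miss_penalty = 3         # cycles
no_btb_penalty = 2       # cycles per branch without a BTB, as used in step i)
cpi_base = 1.0           # ASSUMPTION: not stated in the text

cases = [
    # (frequency, penalty in cycles)
    (branch_freq * (1 - hit_rate), miss_penalty),                   # a) BTB miss
    (branch_freq * hit_rate * accuracy, 0),                         # b) hit, correct
    (branch_freq * hit_rate * (1 - accuracy), mispredict_penalty),  # c) hit, wrong (derived)
]
stalls_btb = sum(freq * penalty for freq, penalty in cases)
stalls_no_btb = branch_freq * no_btb_penalty
speedup = (cpi_base + stalls_no_btb) / (cpi_base + stalls_btb)
print(round(stalls_btb, 3), round(speedup, 2))
```

Under these assumptions the stalls with a BTB come to about 0.099 cycles per instruction, against 0.30 without one, so the speedup is roughly 1.18. A different base CPI would change the ratio.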
