CS60003 High Performance Computer Architecture

This document contains a 10 question mid-semester examination for a high performance computer architecture course. The questions cover topics like branch prediction accuracy, cost/performance analysis of superscalar processors, branch predictor design choices, pipelined processor characteristics and performance, and analyzing loops for branch prediction. Students are instructed to answer all questions briefly and concisely, showing relevant computations and assumptions.

Uploaded by

Narayan Kunal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

224 views3 pages

CS60003 High Performance Computer Architecture

Uploaded by

Narayan Kunal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Indian Institute of Technology, Kharagpur

Department of Computer Science and Engineering

Mid-Semester Examination

High Performance Computer Architecture (C560003)

Time=2 Hours Max Marks=65

Important Instructions:
• Answer all questions.
• No clarifications to any question shall be provided. In case you have any queries, you can make suitable
assumptions, but please write down your assumptions clearly.
• All answers should be brief and concise. Lengthy and irrelevant answers will be penalized.

1. What is the asymptotic prediction accuracy of a two-bit branch predictor on the following repeating
pattern for a certain branch? ... NNTNNTNNNTNNTNNTT .... (length of repeating segment =17) [3]

2. Assume that the cost of a processor using a simple 5 stage MIPS pipeline is 25% of the total cost of a
computer system. The disks, main memory, power supply and enclosure make up the other 75% of
the total cost. It is now proposed to increase the speed of the processor by a factor of 10 using a
superscalar design approach. But this will increase the cost of the processor by a factor of 10.
Further, simulation studies show that the superscalar processor would have wait on the average
30% of time for I/O. On the other hand, the original pipelined processor stalled only 10% time for
I/O. From a cost/performance viewpoint, is increasing the speed by a factor of 10 desirable?
Assume that in both the processors there are no stalls on account of data or control hazards. Justify
your answer with a quantitative analysis of the two computers. [5]

3. An engineer is trying to design a branch predictor for the MIPS 5 stage processor. Branches
constitute 20% of all instructions. The engineer has essentially two options for the branch predictor:
PicoPruner and ChartCooser. For both these predictors, the branch mispredict penalty is 3 cycles.
Branches correctly predicted undergo no penalty cycles. Simulation of PicoPruner shows 10%
misprediction rate, but implementing PicoPruner will increase the cycle time by 20%. Simulation of
ChartChooser shows 20% misprediction rate, but its implementation will increase the cycle time
by only 10%. Which predictor should the designer choose? Clearly show all details of your
computations. [8]

4. A program has the following instruction mix: 40% integer operations, 40% loads and stores, and
20% floating point operations. The processor uses a diversified pipeline in which integer operations
take 1 cycle, loads and stores take 2 cycles, and floating point operations take 3 cycles. A compiler
writer suggests a transformation in which every floating point operation is replaced with 4 integer
operations. The hardware-designers indicate that by not imptementing floating point arithmetic,
the clock cycle time can be decreased by 15%. Will the combination of these two changes improve
or degrade the overall performance of the processor, and by how much? [5]

1
5. A certain 6 stage pipe lined processor has two branch delay slots. An optimizing compiler can fill the
first slot 80% of the time and can also fill the second slot 20% of the time. Of the filled slots, 10%
eventually get discarded on account of being taken from the fall through path. What is the
percentage improvement in performance achieved by this optimizing compiler relative to a
compiler that does not fill any of the branch delay slots? Assume a branch occurs once every 7
instructions on the average. [7]

6. Suppose a processor has a 10-stage pipeline with the following stages:

F1- start the fetch and predict - if the instr. is a branch, whether it is taken or not, and if taken, the
target address.
F2 - complete the fetch
D1- decode the instruction - know it's a branch at the end of D1,
D2 - complete decode
RR- read the registers- compute branch target address
A1- start ALU operation; resolve branch condition and update CPU
A2 - complete ALU operation
M1- start memory operation
M2 - complete memory operation
WB - write the result to registers
a) What is the penalty for a mispredicted branch? [2]
b) What is the penalty when a branch is correctly predicted as taken, but branch target address is
incorrectly predicted? [2]
c) Assuming the following characteristics of the processor and the program, what is the CPI for
the processor? [4]
• There are no stalls in this pipeline except for branch instructions.
• Branch instructions account for 25% of all instructions
• 60% of branch instructions are taken
• Branch direction is correctly predicted 90% of the time
• The target address for a taken branch is correctly predicted 80% of the time

7. Consider a 5 stage MIPS pipeline in which a dynamic predictor with target prediction is used at the
IF stage. A tagged branch target prediction (BTB) scheme for taken branches is used. The programs
run on this processor have 20% branches. Of these, on the average 60%. are taken and 40% are not
taken. For taken branches, there is a 10% chance of miss in the branch target buffer. Also, 10% of
the branches matching the BTB turn out to be not taken. Further, the branch target address is
predicted with 90% accuracy. In case of misprediction, the PC is updated with correct target
address in MEM stage. On account of data hazards, the pipeline stalls 30% of time. What would be
the CPI for this processor? Clearly show all steps of your computation. [8]

2
8. For the simple 5-stage pipe lined MIPS processor, we are required to design a new more
complicated instruction. Two design options (given below) are being considered. Determine which
design option would be superior for executing large programs and how much faster (ratio of CPls)
is it expected to run as compared to the other option? You may ignore the effect of hazards. Clearly
show your work out. [5]
a) Adding extra logic circuits to the execute stage of the pipeline. This would increase the
latency of the EXEstage by 20%.
b) Adding a new stage after the EXE stage altogether, making it a 6-stage pipeline. This
arrangement would leave the cycle time unaffected.

9. Consider the following pseudo code:

int i=l, j, c:
loop1: j = 1;
loop2: c = c + i + j;
j++;
if (j <= 3) goto loop2;
i++;
if (i <= 1000) goto loop1;
The compiler will produce conditional branches for the two if statements. Using a branch history table
with 2-bit saturating counters, exactly how many mispredictions will there be? Assume there are
enough table entries to avoid conflicts. Assume all table entries are initialized to O. Clearly show your
reasoning and computations. [8]

10. In the following double-nested loop 51; 52; ... 5k; are the statements that form the body of the inner
loop and are simple arithmetic statements.

for(i=O ; i < m ; i++)

for(j=O; j<n ; j++){
51; 52; ... 5k;}

a) Suppose we have. the option of using either a I-bit local predictor or a 2-bit saturating counter
local predictor for the inner loop. Which predictor would be more accurate for the inner loop
and by how much? [4]
b) Suppose a (1,2) correlating predictor is used. Compare the performance of the (1,2) correlating
predictor with the 2-bit saturating counter. [4]

--- The End ---

Lab 3
No ratings yet
Lab 3
1 page
Homework 1 - Computer Architecture - HCMIU
No ratings yet
Homework 1 - Computer Architecture - HCMIU
3 pages
High Performance Computer Architecture (CS60003)
No ratings yet
High Performance Computer Architecture (CS60003)
2 pages
CS-3010 (HPC) - CS Mid Sept 2023
No ratings yet
CS-3010 (HPC) - CS Mid Sept 2023
7 pages
CS/COE 1541 Term 2174 Quiz 1: (Solutions)
No ratings yet
CS/COE 1541 Term 2174 Quiz 1: (Solutions)
2 pages
Homework Set - 5
No ratings yet
Homework Set - 5
2 pages
PS4 Solution
No ratings yet
PS4 Solution
6 pages
Sample Problems
No ratings yet
Sample Problems
5 pages
CompEng 361 - Homework 3 Solutions
No ratings yet
CompEng 361 - Homework 3 Solutions
6 pages
Nmam Institute of Technology: Department of Computer Science and Engineering
No ratings yet
Nmam Institute of Technology: Department of Computer Science and Engineering
8 pages
Fall 2022 Qs
No ratings yet
Fall 2022 Qs
15 pages
Cse590490 HW2
No ratings yet
Cse590490 HW2
5 pages
Practice Final Soln
No ratings yet
Practice Final Soln
17 pages
CSE 530 Homework #1 Due September 26 Anthony Dotterer: C C C T C T C C T T
No ratings yet
CSE 530 Homework #1 Due September 26 Anthony Dotterer: C C C T C T C C T T
9 pages
Karaikudi Institute of Technology: Year/Dept: Ii/Cse Date: 07.08.2014 Time:09:20 A.M. - 11:00 A.M
No ratings yet
Karaikudi Institute of Technology: Year/Dept: Ii/Cse Date: 07.08.2014 Time:09:20 A.M. - 11:00 A.M
2 pages
COE301 Final Solution 162
No ratings yet
COE301 Final Solution 162
10 pages
Midtermarch 2
No ratings yet
Midtermarch 2
9 pages
Instructions: Csce 212: Final Exam Spring 2009
No ratings yet
Instructions: Csce 212: Final Exam Spring 2009
5 pages
Department of Computer Science and Engineering: State University of Bangladesh
No ratings yet
Department of Computer Science and Engineering: State University of Bangladesh
2 pages
Pipeline History
No ratings yet
Pipeline History
30 pages
Compre 23
No ratings yet
Compre 23
3 pages
Assignment - 1
0% (1)
Assignment - 1
4 pages
Computer Architecture Midterm1 Cmu
No ratings yet
Computer Architecture Midterm1 Cmu
30 pages
Computer Arch Test
No ratings yet
Computer Arch Test
8 pages
Quiz Questions
No ratings yet
Quiz Questions
2 pages
Coss MidSemester Regular
No ratings yet
Coss MidSemester Regular
3 pages
National University of Computer and Emerging Sciences, Lahore Campus
No ratings yet
National University of Computer and Emerging Sciences, Lahore Campus
4 pages
Illinois Exam2 Practice Solfa08
No ratings yet
Illinois Exam2 Practice Solfa08
4 pages
CS704 Mid Term
No ratings yet
CS704 Mid Term
4 pages
L-3rr-l/CSE Date:: Iw Iw
No ratings yet
L-3rr-l/CSE Date:: Iw Iw
30 pages
Cs433 Fa12 Hw4 Sol Correct
No ratings yet
Cs433 Fa12 Hw4 Sol Correct
14 pages
CA Fall 2022 Final Exam
No ratings yet
CA Fall 2022 Final Exam
6 pages
Midterm1 Soln Fall09 PDF
No ratings yet
Midterm1 Soln Fall09 PDF
6 pages
hw2 Sols Ece570 w14
No ratings yet
hw2 Sols Ece570 w14
9 pages
Correlating (Global) Branch Predictors Correlating Branch Predictors
No ratings yet
Correlating (Global) Branch Predictors Correlating Branch Predictors
3 pages
2018 Second
No ratings yet
2018 Second
7 pages
Ca PDF
No ratings yet
Ca PDF
10 pages
2EC319 IR December 2015
No ratings yet
2EC319 IR December 2015
4 pages
Midterm1 s15 Sol
No ratings yet
Midterm1 s15 Sol
26 pages
Sample Problems Pipe&Memory
No ratings yet
Sample Problems Pipe&Memory
57 pages
106
No ratings yet
106
80 pages
البحث الثاني
No ratings yet
البحث الثاني
10 pages
CS704 Mid Term Papers
No ratings yet
CS704 Mid Term Papers
3 pages
Co HW 1
No ratings yet
Co HW 1
2 pages
CS433 hw1 Fall 07
No ratings yet
CS433 hw1 Fall 07
3 pages
Ca Mid1 2017
No ratings yet
Ca Mid1 2017
9 pages
L02 Branch Prediction V2021
No ratings yet
L02 Branch Prediction V2021
82 pages
COSS MidSem 2020.07.05 MakeUp With Key COPYM06Tq# Name-Rana
No ratings yet
COSS MidSem 2020.07.05 MakeUp With Key COPYM06Tq# Name-Rana
5 pages
Lecture-6-13.01.2025 HPC
No ratings yet
Lecture-6-13.01.2025 HPC
17 pages
L13 MIPS Control Hazards
No ratings yet
L13 MIPS Control Hazards
40 pages
Midsem Final
No ratings yet
Midsem Final
3 pages
05 - Pipelining - Branch Prediction
No ratings yet
05 - Pipelining - Branch Prediction
20 pages
Indian Institute of Technology, Kharagpur: Mid-Spring Semester 2021-22
No ratings yet
Indian Institute of Technology, Kharagpur: Mid-Spring Semester 2021-22
4 pages
Archi Second 2013 2014 JCE
No ratings yet
Archi Second 2013 2014 JCE
2 pages
Ceng400 - Assessment - Fall 2024-2025 - Mips v3 - Ems 1
No ratings yet
Ceng400 - Assessment - Fall 2024-2025 - Mips v3 - Ems 1
15 pages
C Programming for the Pc the Mac and the Arduino Microcontroller System
From Everand
C Programming for the Pc the Mac and the Arduino Microcontroller System
Peter D Minns
No ratings yet
CCNA Exam Excellence: Study Guide & Practice Tests
From Everand
CCNA Exam Excellence: Study Guide & Practice Tests
SUJAN
No ratings yet
IGNOU PGDCA MCS 202 Computer Organisation Previous Years Unsolved Papers
From Everand
IGNOU PGDCA MCS 202 Computer Organisation Previous Years Unsolved Papers
Manish Soni
No ratings yet
Projects With Microcontrollers And PICC
From Everand
Projects With Microcontrollers And PICC
Guillermo Perez Guillen
5/5 (1)
ISA Certified Automation Professional (CAP) Associate: Certification Exam Prep: 500 Practice Exam Questions and Explanations
From Everand
ISA Certified Automation Professional (CAP) Associate: Certification Exam Prep: 500 Practice Exam Questions and Explanations
Steve Brown
No ratings yet
Computer Aided Design of Electrical Machines
From Everand
Computer Aided Design of Electrical Machines
K.M. Vishnu Murthy
No ratings yet
Cs501 Mcqs Mid Term by Vu Topper RM
No ratings yet
Cs501 Mcqs Mid Term by Vu Topper RM
62 pages
Coam2 - ST
No ratings yet
Coam2 - ST
74 pages
Interview Preparation: RISC-V Pipeline Architecture SV - UVM Verification Q/A
No ratings yet
Interview Preparation: RISC-V Pipeline Architecture SV - UVM Verification Q/A
6 pages
e-PG PATHSHALA-Computer Science Computer Architecture
No ratings yet
e-PG PATHSHALA-Computer Science Computer Architecture
9 pages
Unit 4 Branch Instructions
No ratings yet
Unit 4 Branch Instructions
23 pages
CPU Structure and Functions
No ratings yet
CPU Structure and Functions
39 pages
Itanium Processor: Presented by
No ratings yet
Itanium Processor: Presented by
26 pages
Unit 3-2 COA
No ratings yet
Unit 3-2 COA
58 pages
CS1601 Computer Architecture
100% (1)
CS1601 Computer Architecture
389 pages
HPC Lecture4
No ratings yet
HPC Lecture4
13 pages
MPCA Assignment 11 B - 66
No ratings yet
MPCA Assignment 11 B - 66
5 pages
Advanced Computer Architecture
No ratings yet
Advanced Computer Architecture
1 page
Pipelining in Modern CPU's: Sangita Sah 221235 Nepal College of Information Technology
No ratings yet
Pipelining in Modern CPU's: Sangita Sah 221235 Nepal College of Information Technology
2 pages
QUESTION BANK UNIT 5 - Computer Organization and Architecture
No ratings yet
QUESTION BANK UNIT 5 - Computer Organization and Architecture
9 pages
3-RISC Architecture
100% (1)
3-RISC Architecture
13 pages
CH 3 Branch Call Delay
No ratings yet
CH 3 Branch Call Delay
32 pages
CO
No ratings yet
CO
6 pages
Questions Ch5 1
No ratings yet
Questions Ch5 1
2 pages
Pipeline Hazards
No ratings yet
Pipeline Hazards
94 pages
Lec11 Pipeline 1 Notes
No ratings yet
Lec11 Pipeline 1 Notes
26 pages
Basic State Transitions - Machine Cycle and Instruction Cycles - Timing Diagram - Data Transfer Instructions
No ratings yet
Basic State Transitions - Machine Cycle and Instruction Cycles - Timing Diagram - Data Transfer Instructions
71 pages
Coa Unit 4
No ratings yet
Coa Unit 4
10 pages
Microsequencer
No ratings yet
Microsequencer
38 pages
Tomasulo
No ratings yet
Tomasulo
54 pages
Elet 3405 HW 4
0% (1)
Elet 3405 HW 4
6 pages
722 9 5 2011 Review
No ratings yet
722 9 5 2011 Review
101 pages
Ca Unit 3 Prabu
100% (1)
Ca Unit 3 Prabu
24 pages
Arm
100% (2)
Arm
44 pages
IT3030E CA Chap5 CPU
No ratings yet
IT3030E CA Chap5 CPU
98 pages

CS60003 High Performance Computer Architecture

Uploaded by

CS60003 High Performance Computer Architecture

Uploaded by

Indian Institute of Technology, Kharagpur

Department of Computer Science and Engineering

High Performance Computer Architecture (C560003)

Time=2 Hours Max Marks=65

6. Suppose a processor has a 10-stage pipeline with the following stages:

9. Consider the following pseudo code:

for(i=O ; i < m ; i++)

--- The End ---

You might also like