High Performance Computer Architecture (CS60003)

This document is the mid-semester examination for a course on high performance computer architecture. It consists of 8 questions testing various concepts related to parallelization, pipelining, instruction scheduling, and dependencies. The questions range from 3 to 10 points and cover topics such as speedup from parallelization, cache sizing for branch prediction, impact of clock rate and CPI on performance, identifying dependencies in code sequences, and using compilation techniques to overcome hazards and improve pipeline efficiency.

Uploaded by

Venkata Pranav

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

214 views2 pages

High Performance Computer Architecture (CS60003)

Uploaded by

Venkata Pranav

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

\

Indian Institute of Technology, Kharagpur

Department of Computer Science and Engineering

j; Mid-Semester Examination
High Performance Computer Architecture (CS60003)

Time=2 Hours Max Marks=75

Important Instructions:
• Answer all questions.
• No clarification to any of the questions shall be provided. In case you have any querie~, you can make
suitable assumptions, but please write down your assumptions clearly.
• All answers should be brief and concise. Lengthy and irrelevant answers will be penalized.

1. A software engineer decides to rewrite a portion of a program that accounts for 60% of execution time of
the program, so that this portion can be run on multiple processors in parallel. What is the maximum speedup
that the software engineer can hope to achieve? (3]

2. Consider a 32 bit, 5 stage MIPS processor.

a) What is the size of cache memory required to implement a (3,2) correlating branch predictor?
Assume that 4K entries have to be maintained in the prediction buffer, and one of the 4K entries
would be selected based on the lower 12 bits of a branch address. [4]

b) What would be the size of the cache memory required for implementing the (3,2) correlating
prediction scheme of the part a) of this question, if the target address is also to be stored in the
prediction table for target address prediction? [2]

3. Estimate the speedup that would be obtained by replacing a CPU having an average CPI (clock cycles per
instruction) of 5 with another CPU having an average CPI of 3.5, with the clock period increased from I OOns
to 120ns. (5]

4. An unpipelined processor A is a single-cycle processor (that is, CPI=l) and uses a 1GHz clock. Processor 8
is a pipelined version of the processor A with a 12 stage instruction pipe. Assuming ideal pipelining, what
would be the clock rate ofB? Give two reasons as to why B will probably not attain this clock rate. (2+4)

5. IdentifY and indicate all the true data dependences, anti dependences, and output dependences in the
following MIPS code sequence (Mark the dependences using annotated arrows between the
corresponding instructions). [8]
I~ $3, 0($2)
add $3, $3, $1
lw $1, 0($2)
add $4, $3, $1
sw $4. 0($2)

6. A processor has a six-stage pipeline with the stages: Fetch, Decode, Register Read, Execute, Memory
Access, Register Write. The processor is able to execute one instruction per cycle in the absence of
branches. The branch condition and target address are both generated during the Execute pipeline stage.
For a typical program, branches account for 20% of all the instructions. Of all branches, !0% are
unconditional branches. Of the conditional branches, 40% are taken on the aver~ge.
a) What will be the execution rate of this processor? Express your answer in clock cycles per
instruction (CPI). [10]
b) What would be the impact on CPJ of using a "Not Taken static branch predictor" in the

I
'\},~

I proces~or? [5]
;

7. The table below shows the supported instruction types and the CPI of varic,us instruction types of a 3GHz
Stone Bear pro•:essor. Assume that a benchmark program having a total, of 2 '1 09 instructions is to be run on
the processor. c! "he percentage of the different types of instructions present in this benchmark program is also
shown in the table.

T~
Instruction Instruction Count Percentage
Integer operation 55%
Load/Store 30%
Branch 15%

a) What is the expected total execution time of the given benchmark.program? [4]

b) A revision of the StoneBear processor raises the clock rate to 4GHz, but also at the same time
increases the CPI of Load/Store instructions to 12. What speedup is expected to be achieved by this
revised processor compared to the original? (6]

7. Consider the following code segment:

for(i=O;i< lOOO;i++X
a[i]=b[i]+c[i];
c[i]=a[i]+d[i];
}
a) List all the dependences (control, output, anti, and true) in the given code fragment. Indicate
whe~her the true dependences are loop-carried or not. [3]
b) Give a scheme of how the true and false dependence can be overcome using compilation
techniques to help run the for loop efficiently in a 5 stage pipelined MIPS processor. Give the
restructured code. [5) 1
c) Can both data and control hazards be overcome for the given code segment, or at least largely
reduced using compilation techniques? Briefly outline your scheme and give the restructured
code. [6]

8. Identify all the data hazards in the following sequence of MIPS instructions. For each hazard, state the
register involved, the writing instruction (by nuniber) and the reading instruction (by number). Is it
possible to resolve any of the hazards you identified in the previous part by reordering the instructions so
that forwarding would be unnecessary? If yes, show how. If not, explain why not. [4+4]
add $tO, $sO, $s1 ;1
xor $t1, $tO, $s2 ;2
lw $sO, -12($a0) ;3
sub $s5, $sO, $s1 ;4

---The End---

I 2

"\
'.

11th Computer Science EM EC Guide Sample Notes English Medium PDF Download
No ratings yet
11th Computer Science EM EC Guide Sample Notes English Medium PDF Download
20 pages
ParallelProgramminginCwithMPIandOpenMP PDF
No ratings yet
ParallelProgramminginCwithMPIandOpenMP PDF
272 pages
Mobile Computing CF-19 Overview, Accessories & Mounting Options
100% (2)
Mobile Computing CF-19 Overview, Accessories & Mounting Options
18 pages
Chapter 4 (Processors and Memory Hierarchy)
100% (1)
Chapter 4 (Processors and Memory Hierarchy)
17 pages
CANstress Manual en
100% (1)
CANstress Manual en
92 pages
20-On Board Maintenance Systems
100% (3)
20-On Board Maintenance Systems
67 pages
Processor and Memory Organization
No ratings yet
Processor and Memory Organization
17 pages
Ch01 Basic Concepts and Computer Evolution
No ratings yet
Ch01 Basic Concepts and Computer Evolution
36 pages
Computer Hardware Installation and Maintenance Lab Manual
100% (1)
Computer Hardware Installation and Maintenance Lab Manual
40 pages
Instruction-Level Parallelism (ILP), Since The
100% (1)
Instruction-Level Parallelism (ILP), Since The
57 pages
Chapter 5 - CPU Scheduling
100% (1)
Chapter 5 - CPU Scheduling
41 pages
PowerPoint Slides To Chapter 07
No ratings yet
PowerPoint Slides To Chapter 07
49 pages
Image Filtering
0% (1)
Image Filtering
56 pages
OS PYQs
No ratings yet
OS PYQs
23 pages
TR Front Office Services NC II
No ratings yet
TR Front Office Services NC II
68 pages
Real Time Software
No ratings yet
Real Time Software
272 pages
SPM 6
No ratings yet
SPM 6
77 pages
Computer Organization Hamacher Instructor Manual Solution - Chapter 7
67% (3)
Computer Organization Hamacher Instructor Manual Solution - Chapter 7
13 pages
1) Define MIPS. CPI and MFLOPS.: Q.1 Attempt Any FOUR
No ratings yet
1) Define MIPS. CPI and MFLOPS.: Q.1 Attempt Any FOUR
10 pages
Robotics and Machine Vision Internal 3 Important Questions
No ratings yet
Robotics and Machine Vision Internal 3 Important Questions
1 page
Microinstructions With Next Address Field
0% (1)
Microinstructions With Next Address Field
11 pages
Mes Manual 2022-23
No ratings yet
Mes Manual 2022-23
39 pages
PS4 Solution
No ratings yet
PS4 Solution
6 pages
Test 6 PracticeQuestion Cachememory 1
No ratings yet
Test 6 PracticeQuestion Cachememory 1
21 pages
Xzno22222222222 PDF
No ratings yet
Xzno22222222222 PDF
278 pages
Clean Desk Policy V1.0
No ratings yet
Clean Desk Policy V1.0
1 page
Ecs Usermanual
No ratings yet
Ecs Usermanual
441 pages
Pipeline Hazards
No ratings yet
Pipeline Hazards
39 pages
Instruction Pipeline
No ratings yet
Instruction Pipeline
27 pages
L 1 ParallelProcess Challenges
No ratings yet
L 1 ParallelProcess Challenges
82 pages
Embedded System Design
No ratings yet
Embedded System Design
4 pages
Embedded System Case Study
No ratings yet
Embedded System Case Study
6 pages
NEUST Student Profile: Student Id Last Name First Name Middle Name
No ratings yet
NEUST Student Profile: Student Id Last Name First Name Middle Name
10 pages
MS-7B49-1.1 (Intel - Coffeelake Plamform Z370)
No ratings yet
MS-7B49-1.1 (Intel - Coffeelake Plamform Z370)
65 pages
Microprocessor and Interfacing Techniques: (Course Code: CET208A) Credits-3
No ratings yet
Microprocessor and Interfacing Techniques: (Course Code: CET208A) Credits-3
147 pages
Neural Network Complete Notes
No ratings yet
Neural Network Complete Notes
46 pages
Slide 2 ARM Architecture and Instruction Set
No ratings yet
Slide 2 ARM Architecture and Instruction Set
234 pages
Ch02 OS9e
No ratings yet
Ch02 OS9e
97 pages
Question Bank: Department of Information Technology
No ratings yet
Question Bank: Department of Information Technology
14 pages
00 - ASUS VivoBook S15 S510UA - 0409 - E12415 - X510 - A PDF
No ratings yet
00 - ASUS VivoBook S15 S510UA - 0409 - E12415 - X510 - A PDF
100 pages
MMX Unit 1
No ratings yet
MMX Unit 1
33 pages
Untitled
No ratings yet
Untitled
25 pages
Moore and Mealy Machines: By: Engr - Syed Atir Iftikhar
No ratings yet
Moore and Mealy Machines: By: Engr - Syed Atir Iftikhar
21 pages
E5888 M4a88td-V Evo-Usb3 Contents v2 Print
No ratings yet
E5888 M4a88td-V Evo-Usb3 Contents v2 Print
128 pages
Sswwgyour: in Ari
No ratings yet
Sswwgyour: in Ari
82 pages
Unit 1 Introduction To Embedded System Design
No ratings yet
Unit 1 Introduction To Embedded System Design
67 pages
I2c Eeprom's
No ratings yet
I2c Eeprom's
44 pages
Branch Prediction Techniques
No ratings yet
Branch Prediction Techniques
48 pages
Aramco - All - As of 17.nov.2022
No ratings yet
Aramco - All - As of 17.nov.2022
103 pages
CD Unit 4 Compiler Design Jntuk r20
No ratings yet
CD Unit 4 Compiler Design Jntuk r20
17 pages
Information and Cbis
No ratings yet
Information and Cbis
15 pages
CEN468 Lab 3 V2
No ratings yet
CEN468 Lab 3 V2
14 pages
Chapter 10
No ratings yet
Chapter 10
12 pages
4th Sem End Semester Question Papers
No ratings yet
4th Sem End Semester Question Papers
15 pages
OpenMP Presentation
No ratings yet
OpenMP Presentation
51 pages
CS60003 High Performance Computer Architecture
No ratings yet
CS60003 High Performance Computer Architecture
3 pages
What Is Network-Attached Storage A Complete Guide
No ratings yet
What Is Network-Attached Storage A Complete Guide
51 pages
Systolic Array
No ratings yet
Systolic Array
42 pages
VTU Exam Question Paper With Solution of 18CS34 Computer Organization Dec-2019-Gopika D
No ratings yet
VTU Exam Question Paper With Solution of 18CS34 Computer Organization Dec-2019-Gopika D
19 pages
Position of Code Generator: Principles of Compiler Design Lecture Notes
No ratings yet
Position of Code Generator: Principles of Compiler Design Lecture Notes
16 pages
Solutions Ch4
No ratings yet
Solutions Ch4
7 pages
Collision Free Scheduling
No ratings yet
Collision Free Scheduling
18 pages
An4758 Proprietary Code Readout Protection On Stm32l4 Stm32l4 Stm32g4 and Stm32wb Series Mcus Stmicroelectronics
No ratings yet
An4758 Proprietary Code Readout Protection On Stm32l4 Stm32l4 Stm32g4 and Stm32wb Series Mcus Stmicroelectronics
43 pages
Paradise Manual
No ratings yet
Paradise Manual
9 pages
Device Order Data
No ratings yet
Device Order Data
19 pages
HW2 S24 Sol
No ratings yet
HW2 S24 Sol
15 pages
QX-Mini V2.2.7
No ratings yet
QX-Mini V2.2.7
17 pages
Cs433 Fa12 Hw4 Sol Correct
No ratings yet
Cs433 Fa12 Hw4 Sol Correct
14 pages
A History of Microsoft Windows OS
No ratings yet
A History of Microsoft Windows OS
20 pages
Dlms-Network V 1.0: Digital Load Measurement System
No ratings yet
Dlms-Network V 1.0: Digital Load Measurement System
16 pages
Computer Architecture - A Quantitative Approach Chapter 5 Solutions
No ratings yet
Computer Architecture - A Quantitative Approach Chapter 5 Solutions
14 pages
Design of Power Efficient Posit Multiplier Using Compressor Based Adder
No ratings yet
Design of Power Efficient Posit Multiplier Using Compressor Based Adder
8 pages
CS 6290: High-Performance Computer Architecture Spring 2009 Final Exam
No ratings yet
CS 6290: High-Performance Computer Architecture Spring 2009 Final Exam
14 pages
Openmp Tutorial: Seung-Jai Min
No ratings yet
Openmp Tutorial: Seung-Jai Min
30 pages
C# Project Proposal2
No ratings yet
C# Project Proposal2
18 pages
Processor Verification Using Open Source Tools and The GCC Regression Test Suite: A Case Study
No ratings yet
Processor Verification Using Open Source Tools and The GCC Regression Test Suite: A Case Study
13 pages
VHDL Implementation of A Mips-32 Pipeline Processor
No ratings yet
VHDL Implementation of A Mips-32 Pipeline Processor
5 pages
Computer Organization July 2005 Old
No ratings yet
Computer Organization July 2005 Old
2 pages
Reconfigurable Hardware Design Approach For Economic Neural Network
No ratings yet
Reconfigurable Hardware Design Approach For Economic Neural Network
5 pages
Microprocessor - Homework 1 Your Group Name: Nguyễn Thế An: b. ORG
No ratings yet
Microprocessor - Homework 1 Your Group Name: Nguyễn Thế An: b. ORG
6 pages
Midtermarch 2
No ratings yet
Midtermarch 2
9 pages
Computer Organization Jan 2010 OLD
No ratings yet
Computer Organization Jan 2010 OLD
1 page
P79 MLB Board Functional Test Coverage: APPLE - Need To Know Only
No ratings yet
P79 MLB Board Functional Test Coverage: APPLE - Need To Know Only
3 pages
Advanced Computer Architecture Test-1 Answer
No ratings yet
Advanced Computer Architecture Test-1 Answer
2 pages
Computer Architecture and Organization Ch#2 Examples
No ratings yet
Computer Architecture and Organization Ch#2 Examples
6 pages
Instruction Format
No ratings yet
Instruction Format
4 pages
Midterm Exam Architecture
No ratings yet
Midterm Exam Architecture
2 pages
Operating System
No ratings yet
Operating System
2 pages
Ipc B300
No ratings yet
Ipc B300
2 pages
Branch Prediction
No ratings yet
Branch Prediction
2 pages

High Performance Computer Architecture (CS60003)

Uploaded by

High Performance Computer Architecture (CS60003)

Uploaded by

\

Indian Institute of Technology, Kharagpur

Time=2 Hours Max Marks=75

2. Consider a 32 bit, 5 stage MIPS processor.

7. Consider the following code segment:

You might also like