Assignment 3 - 2024 HPC
Assignment 3 - 2024 HPC
(Deemed to be University)
SCHOOL OF COMPUTER ENGINEERING
Date: 04/09/2024
Subject: HPC
ASSIGNMENT - 3
1. Consider the loop on MIPS without any scheduling will execute as follows:-
Clock Cycles Issued
LOOP: LOAD F2, 0(R7) 1
STALL 2
ADD F0, F2, F4 3
STALL 4
STALL 5
SUB F6, F0, F2 6
STALL 7
STALL 8
STORE F0, 0(R7) 9
STALL 10
ADD R7, R7, #+8 11
STALL 12
BEQ F1, F5, LOOP 13
Use the benefits of instruction scheduling and loop unrolling to reduce the number of clock cycle
required to complete the execution of the above given code.
2. Assume we have 2 multipliers, 2 adders, 1 divide & 1 integer unit. The following set of MIPS
instruction is going to be executed in a pipelined system.
4. Assume a fully associative write back cache with many cache entries that starts empty. Below is a sequence of 7
memory operations (the address is in square brackets).
WriteMem [100]
ReadMem [300 ]
WriteMem [200]
ReadMem [100 ]
ReadMem [400 ]
WriteMem [200]
WriteMem [300]
What are the numbers of hits and misses when using no-write allocate Vs write allocate?
5. What is cache miss penalty time? Explain how to improve cache memory performance by reducing cache miss
penalty.
6. Draw and explain the states of directory-based cache protocol which implements both CPU request and BUS
request for each cache block access.
7. What is a distributed shared memory architecture? Explain cache coherence in symmetric shared-memory
multiprocessors with an example.