COA Assignments
COA Assignments
4) A 4 way set Associative cache memory in MERS X486 computer system has 4 words in each
set. A replacement procedure based on the LRU algorithm is implemented by means of 2 bit
counters associated with each word in the set. A value in the range 0 to 3 is thus recorded for
each word. When a hit occurs , the counter is associated with the referenced word is set to 0.,
those counters with values originally lower than referenced one are incremented by 1 , and all
others remain unchanged. If a miss occurs , the word with counter value 3 is replaced with the
new word resetting the counter to 0, and other three counters are incremented by 1.
Show that this procedure works for the following sequence of word reference
A,B,C,D,B,E,D,A,C,E,C,E ( Start with A ,B,C,D as the initial four words with A being the least
recently used.
5) Array Processors Pipeline design: In certain scientific computation it is necessary to perform
the arithmetic operation (Pi +Qi)(Xi+Yi) with a stream of numbers. Design a pipeline
configuration to carry out this task for i=1 to 6. Show clearly all the intermediate registers
used for the design.
6) An 8-bit computer has a 16bit address bus. The first 15 lines of address are used to select a
bank of 32K bytes of memory. The higher order bit of address is used to select a register which
receives the contents of the data bus? Explain how this configuration can be used to extend the
memory capacity of system to eight banks of 32 K bytes each, for a total of 256K bytes of
memory.
7) Consider having a program that runs in 50 s on machine - A, which has a 500 MHz clock. We
would like to run the same program on another machine-B, in 20 s. If machine B requires 2.5
times as many clock cycles as machine A for the same program, what clock rate must machine B
have in MHz?
A [B+C + D]
8) Write the program to evaluate the expression X ¿ using zero, one and two address
E [F−G]
instructions. The instructions available to use as follows.
131840 clocks
2)Divide the I loop iterations among the 16 cluster as follows.Computer1 executes the first 32
iterations , processor 2 executes the next 32 and so on.What is the total execution time and
speed up factor as compared to 1. Is this a balanced design.
3)explain how to modify the parallelizing to facilitate the balanced parallel execution of all the
16 processors. Find the execution time and speed up over a single computer.
8240= 16480 clocks