Exercises 6

Parallel Processors and Programs

Computer Organization and Components / Datorteknik och komponenter (IS1500), 9 hp
Computer Hardware Engineering / Datorteknik, grundkurs (IS1200), 7.5 hp

KTH Royal Institute of Technology

Wednesday 15th March, 2017

Concurrency, Parallelism, and Concepts

1. Consider the following three cases:

(a) An embedded system consisting of a MIPS microcontroller (uniprocessor) that runs
two periodic tasks with the period times 5 ms and 20 ms, respectively. The tasks are
invoked using a timer interrupt.
(b) A SIMD uniprocessor that solves a numerical problem and achieves high speedup
because several floating-point instructions can be executed in parallel.
(c) A web server that is implemented on an Intel Core i7 processor system with 4 cores.
Each time an HTTP request comes to the server, the web server creates a new thread
that handles the request.

For each of the three systems, state whether the system executes programs sequentially or
concurrently and whether the hardware executes in parallel. Motivate your answer.

2. Draw a 2 by 2 matrix, where each element represents one of SISD, SIMD, MISD, and MIMD.

(a) Explain what the acronyms stand for.

(b) Place the following words and acronyms inside the matrix and explain why they
belong in one or more places: MIPS uniprocessor, task-level parallelism, AVX,
Intel Core i7, GPU, data-level parallelism, and ILP.

Speedup and Amdahl’s law

3. Assume that you are performing a speedup performance measurement for an image
processing algorithm. Someone else has created a very efficient sequential implementation
I_s, and you have implemented a new parallel version I_p of the algorithm. The sequential
implementation has been executed on a benchmark B on a computer C1 with two cores,
each running at 4 GHz. You have the source code for both I_s and I_p and access to a
machine with 8 cores. Each core is running at 2.2 GHz with hardware multi-threading, which
makes the OS believe that there are 16 cores in the machine.
Explain how you would perform a fair speedup evaluation of your parallel implementation.

4. Assume we have a program where 10% of the execution time is purely sequential and that
the rest of the execution time can be improved by parallelization. For the part of the code
that can be parallelized, each core gives only 80% improvement. For instance, 5 cores
give 5 × 80% = 4 times improvement.

(a) Create a speedup chart, showing speedup on the Y-axis and the number of cores on
the X-axis. Show the graph for 1 to 200 cores, for instance by plotting at intervals
of 25 cores.
(b) What is the maximal speedup that can be achieved regardless of how many cores we
add?
(c) What is it called if we increase the problem size linearly with the number of
cores? What kind of scaling was used in part (a)? Why would either of these
scaling approaches make sense?
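
The numbers for the chart in exercise 4 can be tabulated with a few lines of C. The sketch below is one possible reading of the setup, not an official solution: it assumes that the parallelizable 90% of the execution time is divided by 0.8 × n when n cores are used, and that the sequential 10% is unaffected. The function names speedup and print_speedup_table are made up for this illustration.

```c
#include <assert.h>
#include <stdio.h>

/* Speedup for n cores under the assumptions stated above:
 * seq_fraction of the execution time is purely sequential, and the
 * remaining time is divided by efficiency * n. */
double speedup(double seq_fraction, double efficiency, int n)
{
    assert(n > 0);
    double parallel_gain = efficiency * n;   /* e.g. 5 cores -> 4 times */
    return 1.0 / (seq_fraction + (1.0 - seq_fraction) / parallel_gain);
}

/* Print one line per data point, from step cores up to max_cores. */
void print_speedup_table(int max_cores, int step)
{
    assert(step > 0);
    for (int n = step; n <= max_cores; n += step)
        printf("%3d cores: speedup %.2f\n", n, speedup(0.10, 0.80, n));
}
```

Calling print_speedup_table(200, 25) prints the eight data points from 25 to 200 cores. With the 5-core example from the text, speedup(0.10, 0.80, 5) evaluates to 1 / (0.1 + 0.9 / 4), which is roughly 3.08. Note that under this reading a single core comes out below 1, so the 80% rule is presumably meant for multi-core configurations only.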

Instruction Level Parallelism

5. Consider the following C function.

void reverse_double(int *src, int *dst, int n){
    int i;
    for(i=0; i < n; i++){
        dst[i] = src[n - i - 1] * 2;
    }
}

(a) Explain what the C function is doing.

(b) Translate the C program into MIPS code, which is executed on a 1-issue MIPS
processor. Assume the conditional branch calculation for beq can be computed in
the decode stage.
(c) Compute the IPC for executing the function with n = 5. Do not include the cost of
calling the function, but include the return cost. Assume a 1-issue, 5-stage pipelined
MIPS processor with static branch prediction assuming branch-not-taken. The
comparison for beq is assumed to be done in the decode stage.
(d) Assume we have a static 2-issue processor that can execute any type of instruction
in two different slots. Show the optimized MIPS code for each slot.
(e) Compute the IPC again for executing the function on the 2-issue processor with
n = 5. Again, do not include the cost of calling the function, but include the return
cost.
(f) What speedup do we achieve by using a 2-issue processor in this case?
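
Before translating the function in exercise 5 to MIPS, it can help to run it once in C with the n = 5 input size used in parts (c) and (e). The following is a small sketch for that purpose. The helper names arrays_equal and check_n5 and the input values are assumptions chosen for this illustration; only reverse_double comes from the exercise.

```c
#include <assert.h>

/* The function from the exercise, unchanged apart from layout. */
void reverse_double(int *src, int *dst, int n)
{
    int i;
    for (i = 0; i < n; i++) {
        dst[i] = src[n - i - 1] * 2;
    }
}

/* Hypothetical helper: returns 1 if the first n elements match. */
int arrays_equal(const int *a, const int *b, int n)
{
    for (int i = 0; i < n; i++)
        if (a[i] != b[i])
            return 0;
    return 1;
}

/* Hypothetical check for n = 5 with made-up input values. */
void check_n5(void)
{
    int src[5] = {1, 2, 3, 4, 5};
    int dst[5] = {0, 0, 0, 0, 0};
    int expected[5] = {10, 8, 6, 4, 2};

    reverse_double(src, dst, 5);
    assert(arrays_equal(dst, expected, 5));
}
```

The loop body runs exactly n times, which is the iteration count to use when counting executed instructions for the IPC calculations.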

Concurrent Programming and Semaphores

6. In this task, you should consider a multi-threaded producer-consumer problem. There are
two threads, a producer thread and a consumer thread. The producer thread is writing data
into a first-in-first-out (FIFO) buffer and the consumer thread is reading from the buffer.
The FIFO buffer can hold between 0 and n elements.
The task is to create both a thread-safe consumer function and a thread-safe producer
function with proper synchronization. If the buffer is empty, the consumer needs to wait
until there is an available item in the buffer. If the buffer is full (holds n elements),
the producer has to wait until there is space available, before it writes any data items
into the buffer. Solve the problem by using semaphores. You may write the solution as
pseudocode, as long as you clearly explain the program semantics.
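
As a point of comparison for a pseudocode answer to exercise 6, the classic three-semaphore pattern can be sketched in C. This is one possible solution under stated assumptions, not the course's reference answer: it assumes POSIX threads and unnamed POSIX semaphores (sem_init, as available on Linux), a fixed capacity N standing in for n, and integer data items. All identifiers (fifo_init, produce, consume, run_demo, and so on) are invented for this sketch.

```c
#include <assert.h>
#include <stddef.h>
#include <pthread.h>
#include <semaphore.h>

#define N 4                  /* buffer capacity (the exercise's n) */

static int buffer[N];        /* circular FIFO storage */
static int head = 0;         /* next slot to read */
static int tail = 0;         /* next slot to write */

static sem_t empty_slots;    /* counts free slots, starts at N */
static sem_t full_slots;     /* counts stored items, starts at 0 */
static sem_t mutex;          /* binary semaphore guarding the buffer */

void fifo_init(void)
{
    head = tail = 0;
    sem_init(&empty_slots, 0, N);
    sem_init(&full_slots, 0, 0);
    sem_init(&mutex, 0, 1);
}

void fifo_destroy(void)
{
    sem_destroy(&empty_slots);
    sem_destroy(&full_slots);
    sem_destroy(&mutex);
}

/* Producer side: blocks while the buffer holds N elements. */
void produce(int item)
{
    sem_wait(&empty_slots);  /* wait for a free slot */
    sem_wait(&mutex);        /* enter critical section */
    buffer[tail] = item;
    tail = (tail + 1) % N;
    sem_post(&mutex);        /* leave critical section */
    sem_post(&full_slots);   /* announce one more item */
}

/* Consumer side: blocks while the buffer is empty. */
int consume(void)
{
    sem_wait(&full_slots);   /* wait for an available item */
    sem_wait(&mutex);
    int item = buffer[head];
    head = (head + 1) % N;
    sem_post(&mutex);
    sem_post(&empty_slots);  /* announce one more free slot */
    return item;
}

static void *producer_thread(void *arg)
{
    int count = *(int *)arg;
    for (int i = 1; i <= count; i++)
        produce(i);
    return NULL;
}

/* Hypothetical demo: a producer thread writes 1..count while the
 * calling thread consumes; returns the sum of the consumed items. */
long run_demo(int count)
{
    pthread_t producer;
    long sum = 0;

    fifo_init();
    pthread_create(&producer, NULL, producer_thread, &count);
    for (int i = 1; i <= count; i++) {
        int item = consume();
        assert(item == i);   /* items arrive in FIFO order */
        sum += item;
    }
    pthread_join(producer, NULL);
    fifo_destroy();
    return sum;
}
```

The order of the two waits matters: each function waits on its counting semaphore before taking the mutex. Taking the mutex first could deadlock, because a producer blocked on a full buffer would then hold the lock that the consumer needs in order to free a slot.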

Data-Level Parallelism

7. Consider the following lines of assembly code:

vmovapd (%r10), %ymm0
vmovapd (%r11), %ymm1
vmulpd %ymm0, %ymm1, %ymm1
vaddpd (%r10), %ymm1, %ymm1
vmovapd %ymm1, (%r11)

(a) What kind of assembly code is this?

(b) Explain what each line of code is doing.
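
To check a line-by-line explanation for exercise 7, the five instructions can be modeled in plain scalar C. The sketch below assumes AT&T operand order (sources first, destination last) and 256-bit %ymm registers that each hold four doubles. The names a and b stand for the memory that %r10 and %r11 point to; they, LANES, and avx_model are hypothetical names introduced for this illustration. The model ignores the 32-byte alignment that vmovapd requires of its memory operand.

```c
#include <assert.h>
#include <stddef.h>

#define LANES 4   /* a 256-bit %ymm register holds 4 doubles */

/* Scalar model of the five instructions; a plays the role of the
 * memory at (%r10) and b the memory at (%r11). */
void avx_model(const double *a, double *b)
{
    double ymm0[LANES], ymm1[LANES];
    int i;

    assert(a != NULL && b != NULL);

    for (i = 0; i < LANES; i++)   /* vmovapd (%r10), %ymm0 */
        ymm0[i] = a[i];
    for (i = 0; i < LANES; i++)   /* vmovapd (%r11), %ymm1 */
        ymm1[i] = b[i];
    for (i = 0; i < LANES; i++)   /* vmulpd %ymm0, %ymm1, %ymm1 */
        ymm1[i] = ymm1[i] * ymm0[i];
    for (i = 0; i < LANES; i++)   /* vaddpd (%r10), %ymm1, %ymm1 */
        ymm1[i] = ymm1[i] + a[i];
    for (i = 0; i < LANES; i++)   /* vmovapd %ymm1, (%r11) */
        b[i] = ymm1[i];
}
```

Each inner loop corresponds to one instruction that the hardware applies to all four lanes at once, so the net effect under these assumptions is b[i] = a[i] * b[i] + a[i] for i = 0..3.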
