
Exercise 1 - Parallel Programming

Foundations of HPC @ UniTS-DSSC 2022


Assignment for the Foundations of High Performance Computing course at “Data
Science and Scientific Computing”, University of Trieste, 2022-2023

Stefano Cozzini stefano.cozzini at areasciencepark.it

Luca Tornatore luca.tornatore at inaf.it

Due time: 1 week before taking the oral exam

v1.2

26th Jan, 2023

History of changes

v1.2 : added a section about the expected structure of the report; added a detail about
how to run hybrid code when calling mpirun .

v1.1 : minor adjustments & clarifications

v1.0 : first release

Game of Life
You’ll be prompted to implement a parallel version of a variant of the famous Conway’s
“game of life” (references: 1, 2), that is, a cellular automaton that plays autonomously on
an infinite 2D grid.

The game itself has several exciting properties and pertains to a very interesting field, but
we leave it to the reader to dig deeper into it at will.

Playground

As said, the game evolves on a discrete 2D world, where the tiniest position is a single
cell; you can imagine it as a point on a system of integer coordinates.

The neighbours of a cell are the 8 immediately adjacent cells, i.e. those that on a grid
representation share an edge or a corner with the considered cell, as depicted in Fig. 1 below.

Figure 1: The cell’s neighbours are the 8 immediately adjacent cells

The playground, which will be a grid of size k × k, has periodic boundary conditions at the
edges. That means that cells at an edge have to be considered neighbours of the cells at the
opposite edge along the same axis. For instance, cell (0, j) will have cells (k-1, j)
and (1, j) among its neighbours.

Figure 2: Periodic boundary conditions: the cells at the edges with the same colour are
adjacent.
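
As a concrete illustration (this fragment is ours, not part of the assignment’s material),
the wrap-around can be obtained with modulo arithmetic; here i, j are the coordinates of a
cell on a k × k grid:

// hypothetical sketch: visiting the 8 neighbours of cell (i, j)
// on a k x k grid with periodic boundary conditions
for ( int di = -1; di <= 1; di++ )
    for ( int dj = -1; dj <= 1; dj++ ) {
        if ( di == 0 && dj == 0 )
            continue;                   // skip the cell itself
        int ni = (i + di + k) % k;      // adding k avoids a negative operand
        int nj = (j + dj + k) % k;
        // ... use neighbour (ni, nj) ...
    }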

Rules of the game

Each cell can be either “alive” or “dead” depending on the conditions of the neighboring
cells:
a cell becomes, or remains, alive if 2 to 3 cells in its neighborhood are alive;

Figure 3: examples for a cell to become or to remain alive


a cell dies, or does not generate new life, if either fewer than 2 cells or more than 3
cells in its neighborhood are alive (under-population or over-population
conditions, respectively).

Figure 4: examples for a cell going to die

In this way, the evolution of the system is completely determined by the initial conditions
and by the rules adopted to decide the update of the cells’ status.

Upgrading the cells’ status and evolving the universe

Classically, the cells are upgraded in row-major order, like in the code:

for ( int i = 0; i < k_i; i++ )
    for ( int j = 0; j < k_j; j++ )
        upgrade_cell(i, j);   // calculate the status of the neighbouring
                              // cells and update cell (i, j)
Since, obviously, changing a cell’s status impacts the fate of adjacent cells, this inserts
a “spurious” signal into the system evolution. Let’s call that “ordered evolution”. Note that
this evolution is intrinsically serial because of the inherent dependency that descends
from its definition; in that case, the only issue related to the parallelism is the correct
propagation of the updates among the OpenMP threads.

A second option is to disentangle the status evaluation from the status update of each cell:
the status of all the cells (i.e., the computation of how many alive neighbours each of
them has) is evaluated first, freezing the system, and the status of the cells is updated
only afterwards. Let’s call this “static evolution”.
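
A minimal sketch of the static evolution could look like the following (the names current,
next and the helper count_alive_neighbours are ours, not prescribed by the assignment;
255 and 0 are the alive/dead values used in the PGM files, see below):

int count_alive_neighbours( const unsigned char *grid, int k, int i, int j );  // hypothetical helper

void static_evolution_step( const unsigned char *current, unsigned char *next, int k )
{
    // first pass: evaluate every cell on the frozen `current` grid
    for ( int i = 0; i < k; i++ )
        for ( int j = 0; j < k; j++ ) {
            int alive = count_alive_neighbours( current, k, i, j );
            // the rule above: alive with 2 or 3 alive neighbours
            next[i*k + j] = ( alive == 2 || alive == 3 ) ? 255 : 0;
        }
    // the caller then swaps `current` and `next` for the next step
}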

A third option, among many others, is that the ordered evolution does not always start
from the same point but from a random position, propagating in all directions as a
square wave, as illustrated in Fig. 5.

An additional simple option is to evolve the cells in a “white-black” order, i.e.
considering the playground as if it were a chessboard and evolving the white positions
first and the black ones afterwards; a minimal sketch follows.
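
A possible sketch of that ordering, reusing the upgrade_cell() routine and the k_i, k_j
bounds of the loop above:

// hypothetical sketch: cell (i, j) is "white" when (i + j) is even,
// "black" otherwise; white cells are upgraded first
for ( int colour = 0; colour <= 1; colour++ )
    for ( int i = 0; i < k_i; i++ )
        for ( int j = 0; j < k_j; j++ )
            if ( ((i + j) & 1) == colour )
                upgrade_cell(i, j);   // same routine as in the ordered loop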

Figure 5: cells upgrade starting from a random point, propagating as a square wave.

Requirements

1. You have to implement a hybrid MPI + OpenMP code, using a domain
decomposition to distribute the workload among the MPI tasks (a possible
halo-exchange sketch is given after this list). Each MPI task
will then further parallelize its own work using OpenMP threads (see the
appendix for some comments on hybrid codes).

2. The playground must be a general k × k checker (you are free to generalize to
any rectangular k1 × k2), with an unlimited upper value for k.
3. Initialize a playground and write it to a file, in a binary format (see the section
“File formats” below).
4. Load a playground from a file and run it for a given number of steps.

5. Along a run, a dump of the system has to be saved with a given frequency.

6. You are required to provide a scalability study:


a. OpenMP scalability: fix the number of MPI tasks to 1 per socket, and
report the behaviour of the code when you increase the number of
threads per task from 1 up to the number of cores present on the
socket;
b. Strong MPI scalability: given a fixed size (you may opt for several
increasing sizes), show the run-time behaviour when you increase the
number of MPI tasks (use as many nodes as possible, depending on the
machine you run on).
7. [ OPTIONAL ] To spice up the game, let’s introduce a further rule beside the two
original and classical ones; let’s call it “Finger of God”: due to a random
interaction with, say, a cosmic ray, a living cell may die or new life may be
generated, with two given probabilities (these are two parameters whose values
have to be provided at run-time).

If a cell is set alive by chance and it has fewer than 2 alive neighbours, then it
spreads life to neighbouring cells so that at least 2 of them are set alive. It’s easy to
understand that, given the previous rules, whether this new life will survive
depends on its initial shape (you will quickly discover that a surprising number
of life forms can even move).

You are free to define whatever set of initial live patterns to be used in this case,
and to experiment with it.

An obvious piece of advice is to use very small values for these probabilities, to keep
the Finger of God a perturbation rather than the dominant force.
8. [ OPTIONAL ] implement the evolution with a square-wave signal from a grid
point randomly chosen at every time-step.

9. [ OPTIONAL ] implement the white-black evolution.
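
As announced in requirement 1, here is a minimal sketch of a halo exchange for a 1D
(row-block) domain decomposition; every name here is ours and purely illustrative: each
task owns local_rows rows of width k, stored between one ghost row above and one below:

#include <mpi.h>

void exchange_halos( unsigned char *grid, int local_rows, int k,
                     int rank, int nranks, MPI_Comm comm )
{
    int up   = (rank - 1 + nranks) % nranks;   // periodic also across tasks
    int down = (rank + 1) % nranks;

    // send the first owned row up, receive the bottom ghost row from below
    MPI_Sendrecv( grid + k,                 k, MPI_UNSIGNED_CHAR, up,   0,
                  grid + (local_rows+1)*k,  k, MPI_UNSIGNED_CHAR, down, 0,
                  comm, MPI_STATUS_IGNORE );
    // send the last owned row down, receive the top ghost row from above
    MPI_Sendrecv( grid + local_rows*k,      k, MPI_UNSIGNED_CHAR, down, 1,
                  grid,                     k, MPI_UNSIGNED_CHAR, up,   1,
                  comm, MPI_STATUS_IGNORE );
}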

Your code should accept and handle the following command-line arguments:

ARGUMENT        MEANING
-i              initialize a playground
-r              run a playground
-k num. value   playground size
-e [0|1]        evolution type; 0 means "ordered", 1 means "static"
-f string       the name of the file to be either read or written
-n num. value   number of steps to be calculated
-s num. value   every how many steps a dump of the system is saved on a file
                (0 meaning only at the end)

A couple of examples to clarify the usage of the command-line options:

to create initial conditions:

$executable -i -k 10000 -f $my_initial_condition

this produces a file named $my_initial_condition that contains a playground
of size 10000x10000

to run a playground:

$executable -r -f $initial_conditions -n 10000 -e 1 -s 1

this evolves the initial status of the file $initial_conditions for 10000 steps
with the static evolution mode, producing a snapshot at every time-step.

CLARIFICATION: following a question from a student, let it be clear that
$my_initial_condition and $initial_conditions are just placeholders that stand for
any name the files may have. So, the -f option is intended to give the program the
name of the file that must be created (when used with the -i option) or read (with the -r
option).
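
A minimal sketch of how the options above could be parsed with the standard getopt()
(the defaults and variable names are ours, purely illustrative):

#include <stdlib.h>
#include <string.h>
#include <unistd.h>

int main( int argc, char **argv )
{
    int  action = 0;                 // 0: none, 1: initialize, 2: run
    int  k = 0, e = 1, n = 0, s = 0;
    char fname[256] = "";

    int c;
    while ( (c = getopt(argc, argv, "irk:e:f:n:s:")) != -1 )
        switch ( c ) {
        case 'i': action = 1; break;
        case 'r': action = 2; break;
        case 'k': k = atoi(optarg); break;
        case 'e': e = atoi(optarg); break;
        case 'f': strncpy(fname, optarg, sizeof(fname)-1); break;
        case 'n': n = atoi(optarg); break;
        case 's': s = atoi(optarg); break;
        }

    // ... dispatch on `action` ...
    return 0;
}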

File format

The adopted file format is the pgm binary format for both the initial conditions and the
evolution snapshots; this allows any snapshot to be used as a new initial-conditions file
for a new experiment.

The snapshot file name should be:

snapshot_nnnnn

where nnnnn (with 5 digits, padded with zeros) is the time-step it refers to.
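
For instance, the zero-padded name can be produced with snprintf() from <stdio.h>, where
step is assumed to hold the current time-step:

char fname[32];
snprintf( fname, sizeof(fname), "snapshot_%05d", step );   // e.g. snapshot_00042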

Appendix I - Reading/Writing a PGM image

The PGM image format, companion of the PBM and PPM formats, is a quite simple and
portable one.

It consists of a small header, written in ASCII, and of the pixels that compose the image,
written one after the other as integer values.

Each pixel may occupy either 1 or 2 bytes, and its value in PGM corresponds to the grey
level of the pixel expressed in the range [0..255] or [0..65535]; since in our case the only
possibility is "dead/alive", we adopt the single-byte representation, and the only two
meaningful values are 0 and 255.

Even if the pixels can also be written in ASCII format, we require the usage of a binary
format.

The header is a string that can be formatted like the following:

printf( "%2s %d %d\n%d\n", magic, width, height, maximum_value );

where magic is a magic number, which for binary PGM is "P5", width and height are the
dimensions of the image in pixels, and maximum_value is either < 256 or < 65536.

If maximum_value < 256, then 1 byte is sufficient to represent all the possible values and
each pixel will be stored as 1 byte. Instead, if 256 <= maximum_value < 65536, 2 bytes
are needed to represent each pixel (which in the current case would be a waste of bytes).
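
The actual routines are provided in read_write_pgm_image.c (see below); purely as an
illustration of the format, a 1-byte binary PGM could be written along these lines (the
function name is ours):

#include <stdio.h>

void write_pgm_sketch( const char *name, const unsigned char *pixels,
                       int width, int height )
{
    FILE *f = fopen( name, "wb" );
    fprintf( f, "P5 %d %d\n%d\n", width, height, 255 );   // ASCII header
    fwrite( pixels, 1, (size_t)width * height, f );       // binary pixel data
    fclose( f );
}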

In the sample file read_write_pgm_image.c that you find in the folder, there are the
functions write_pgm_image() and read_pgm_image() that you can use to write and read,
respectively, such a file.

In the same file, there is a sample code that generates a square image and writes it using
the write_pgm_image() function. It generates a vertical gradient of pixels, where the
width and the height of the image are parameters.


Whether the image is made of single-byte or 2-byte pixels is decided by the maximum
colour value, which is also a parameter.

The usage of the code is as follows:

cc -o read_write_pgm_image read_write_pgm_image.c
./read_write_pgm_image [ max_val ] [ width height ]

as output you will find the image image.pgm , which should be easily rendered by any
decent visualizer.

NOTE: the pbm file format is conceptually similar to the proposed pgm ; every pixel
requires only one bit of information because it is a black&white encoding. As such - and
that is the slightly trickier part - every byte in the file corresponds to 8 pixels that are
mapped onto its single bits. In the pgm format, instead, a pixel corresponds to a byte in
the file, which is easier to code. Optionally, you may want to implement also the pbm
format, which is perfectly adequate for our case because every cell (that is, a pixel in the
image) may just be dead (0) or alive (1).
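
As a purely illustrative fragment of that trickier part (cells and base are hypothetical
names for the cell array and the index of the first of the 8 cells), 8 cells can be packed
into one pbm byte, most significant bit first:

unsigned char byte = 0;
for ( int b = 0; b < 8; b++ )
    if ( cells[base + b] )         // a non-zero cell is alive
        byte |= 1 << (7 - b);      // a set bit marks the pixel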
Appendix II - A note about hybrid MPI+OpenMP
As we mentioned in class, a simple hybridization of MPI with OpenMP is quite
straightforward. As you have seen, it is obviously not a requirement but just an
opportunity for those among you who like to be challenged.

As long as you use OpenMP regions in an MPI process for computation only, and not to
execute MPI calls, everything is basically safe and you can proceed as usual with both MPI
and OpenMP calls and constructs.

At a more advanced level, the same thread that initializes an OpenMP region (i.e. thread
0), and only that one, can make MPI calls from within an OpenMP region
(“funneled” mode).

Alternatively, every thread could call MPI routines, but only one at a time (“serialized”
mode).

Finally, multiple threads can make MPI calls at the same time, which has to be handled
carefully (“multiple” mode).

Initialize the MPI library with a call slightly different from MPI_Init() :

#include <mpi.h>
#include <stdio.h>
#include <stdlib.h>

int main( int argc, char **argv )
{
    int mpi_provided_thread_level;

    // request the "funneled" thread support level
    MPI_Init_thread( &argc, &argv, MPI_THREAD_FUNNELED,
                     &mpi_provided_thread_level );

    if ( mpi_provided_thread_level < MPI_THREAD_FUNNELED ) {
        printf( "a problem arose when asking for MPI_THREAD_FUNNELED level\n" );
        MPI_Finalize();
        exit( 1 );
    }

    // ... your code ...

    MPI_Finalize();
    return 0;
}

Running a hybrid code

On several platforms you may notice that running something like the following

export OMP_NUM_THREADS=$MY_NUM_THREADS
export OMP_PLACES=$MY_PLACE_CHOICE
export OMP_PROC_BIND=$MY_BIND_CHOICE
export OMP_DISPLAY_ENV=TRUE
mpirun -np $MPI_NTASKS $MY_EXEC ...

results in all the threads spawned by an MPI task running on the same physical core.

That is due to the default mapping of MPI tasks to physical resources. You know that it is
possible to request a given mapping of the pool of MPI tasks onto the hardware by using,
for instance, --map-by .

According to the standard:

Supported options include slot, hwthread, core, L1cache, L2cache,
L3cache, socket, numa, board, node, sequential, distance, and ppr
[ from the mpirun man page ]

So, the set of resources visible to a given MPI task will include only those available at the
required mapping level. For instance, if you require --map-by core , then the resources
visible to a given MPI task will include only the logical threads of the core it is assigned
to.

If you require --map-by L3cache , the resource set will consist of all the cores that share
the L3 cache with the core that hosts the task itself. And so on.

Hence, we have this effect:

--MAP-BY   VISIBLE RESOURCE                         IMPACT ON THREADS
hwthread   only the logical core on which           whatever the PLACES and BIND options,
           the task runs                            all of them will run on the same
                                                    logical core
core       the physical core on which the           they will distribute over the visible
           task runs                                logical cores according to your
                                                    policy choice
L?cache    the physical cores that share the        same as above
           ?-level cache with the core the
           task runs on
socket     the physical cores on the same           same as above
           socket the task runs on
numa       the NUMA region to which the core        same as above
           that hosts the task belongs
node       all the cores of the node the            same as above
           task runs on
Hence, we suggest running with --map-by socket at least; a concrete example follows.
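
As an example (the values are hypothetical and machine-dependent), on a two-socket node
with 12 cores per socket one could run:

export OMP_NUM_THREADS=12      # e.g., the number of cores per socket
export OMP_PLACES=cores
export OMP_PROC_BIND=close
mpirun -np 2 --map-by socket --bind-to socket $MY_EXEC ...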

Appendix III - Structure of the Report


The Report that you must submit is the major source of documentation for your work. As
such, please take care of its overall quality.

In the following we suggest a structure that is fairly typical for any report/paper of
this kind.

SECTION                DESCRIPTION
Introduction           Brief overview and description of the problem tackled by your
                       work. Be concise and focused, leaving a comprehensive
                       discussion to some well-motivated reference.
Methodology            A discussion at an abstract level of your approach and
                       algorithms. Describe the possible options, if any, and
                       describe and motivate your choices.
Implementation         Present and discuss the significant technical details of your
                       implementation. For instance, for a given task you may have
                       opted for one technical option among several possible ones:
                       present, motivate and discuss your choice, also quoting a
                       code snippet if needed. Example: how do you allocate the
                       memory? Did you opt for either domain- or functional-
                       decomposition among MPI tasks? Why? And how did you design
                       it? How is your code threaded? etc.
Results &              Present and discuss your results, which at least are the
Discussion             scaling relations listed above in the text. Add any other
                       results and related figures from the optional topics, if you
                       did select any.
Conclusions            Here you put your final remarks. What is the quality that you
                       achieved with your code? Did you understand the pitfalls?
                       And, above all: how would you improve it?
