0% found this document useful (0 votes)

5 views

Assignment_2-3

The assignment requires students to compute partial degree centrality measures for an undirected graph representing social connections at a university using MPI. Students must implement three functions to calculate centrality scores, initialize and finalize MPI, and return the top k influential nodes for each color. The submission includes specific files and a report detailing optimizations and performance analysis.

Uploaded by

Aneeket1 Yadav

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views

Assignment_2-3

Uploaded by

Aneeket1 Yadav

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

Assignment 2: Parallel Graph Centrality Computation

Using MPI
COL380
February 16, 2025

Due Dates
• Due Date 1: 11:59 PM, March 12, 2025

• Due Date 2: 11:59 PM, March 17, 2025

Objective
In this assignment, you are given an undirected graph representing social connections in a
University, where nodes are individuals, undirected edges represent relationships, and colors
represent the houses they belong to. Each node has a color represented by a number. Your
task is to compute partial degree centrality measures in parallel using MPI and then use this
score to identify the top k most influential nodes for each color.
Degree Centrality: This measure indicates how connected a node is within the graph. Unlike
the standard degree centrality, in this problem, only connections to nodes of specified colors
are counted. For a node ′ u′ and color ′ c′ , the degree centrality is computed as:

Degree Centrality(u, c) = count of v ,where v ∈ neighbors of u and color(v) = c

Remember, color(u) need not be c. The graph is undirected and u is a neighbor of v if there
is an edge between u and v.
After computing the centrality scores for each color, you will find the top k most influential
nodes in the graph for each color. (k ≪ |v| and c ≪ |v|, where |v| the number of vertices in
the graph)

Problem Statement
You will define the 3 functions as follows:

Function 1:
vector<vector<int>> degree cen(vector<pair<int,int>> partial edge list,
map<int,int> partial vector color,int k)

Inputs:

1
• partial edge list : A list of outgoing edges from a subset of vertices.
• partial vector color : A mapping of vertices in the subset to their corresponding colors.
• k : The number of nodes to be returned for each color.
Outputs:
• A vector of vectors containing the top k nodes in the given graph based on their color
degree centrality for each color. You will have to find the number of distinct colors for
this. each color is represented by a non-negative integer (zero or any positive integer).
You will only consider the color if at least one node of that color exists. The color code
need not be consecutive. Example c = 3 might include color codes [0, 2, 3].
• The order of vectors should be in lexicographic order of colors, i.e., in the above case of
[0,2,3] the first vector corresponds to color 0, the second to color 2, and the last to color
3.
Instructions:
1. Compute the color degree centrality for each node u for each color c, which is the number
of edges connecting u to nodes of the color c. Use MPI for parallel processing as all the
processes only have a part of the graph information, not the whole graph.
2. For each color i (where i is the color code of at least one vertex), select the top k nodes with
the highest color degree centrality. In the case of ties, the node with the lexicographically
smaller label is ranked higher.
3. Ensure that the resulting vectors are ordered lexicographically by color.
4. Optimize the function for performance, as runtime will be measured only for this function.
5. You need to send the final results to the rank 0 process as we will only consider the rank
0 returned value as the final output. All other rank outputs will be ignored.

Function 2:
void init mpi(int argc, char* argv[])

This function should initialize MPI. The driver code won’t directly initialize MPI and will call
this function instead at the beginning and assume the initialization is done inside this function.
MPI initialization uses argc and argv which will be passed directly.

Function 3:
void end mpi()
This function should finalize the MPI. We will call this at the end of the code.

Solving an Example Graph

Let’s manually compute the color degree centrality, and find the top k nodes for each color for
the following graph:
Graph:
• Nodes: {11 , 21 , 32 , 41 , 52 } (The subscript represent the color codes.)
• Edges: {(1, 2), (1, 3), (2, 3), (2, 4), (3, 4), (3, 5), (4, 5)}

2
Step 1: Degree Centrality
For each node, we count the degree centrality for each color:

Degree Centrality(1, 1) = 1(to 2) ⇒ 2

Degree Centrality(2, 1) = 2(to 1, 4) ⇒ 1
Degree Centrality(3, 1) = 3(to 1, 2, 4) ⇒ 3
Degree Centrality(4, 1) = 4(to 1, 2) ⇒ 3
Degree Centrality(5, 1) = 1(to 4) ⇒ 1

Degree Centrality(1, 2) = 1(to 3) ⇒ 2

Degree Centrality(2, 2) = 1(to 3) ⇒ 1
Degree Centrality(3, 2) = 1(to 5) ⇒ 3
Degree Centrality(4, 2) = 2(to 3, 5) ⇒ 3
Degree Centrality(5, 2) = 1(to 3) ⇒ 1

Step 3: Returning top k

Let’s say k = 2: You will return [[3,4], [4,1]] (Note the order)

Evaluation Criteria
• Correctness: The accuracy of the computed centrality measures and top k nodes.

• Performance: Efficiency and scalability of the parallel implementation with different

numbers of nodes (up to 6 or 8) and processes per nodes (up to 6 or 8).

• Report: Clarity and depth of performance analysis.

Starter Code
The starter code is provided here.
You are given a ”check.cpp” file, a ”template.hpp” file, and a ”template.cpp” file. You are
not allowed to change the ”check.cpp” and ”template.hpp” files. You have to implement your
functions in the ”template.cpp” file, without changing the signature of the functions.
”check.cpp” has the read graph file code and write to output file code. It also calls the three
functions implemented in ”template.cpp”.
You are also given sample input folders and the corresponding output.txt files. Read file and
write output file functions are already present in the ”check.cpp” file. This ”TestCase1” case
is for V = 1000, E = 20000, k = 10, and C = 5. And SmallTestCase is V = 40, E = 160, K =
10, C = 2. We will test on much larger test cases.

3
Report
Your report should list your optimizations done and a discussion on performance in a text
file named readme.pdf, and a CSV file showing your timings for degree cen function across
different values of k on varying numbers of cores.
Format for Final Scalability Analysis: The performance table records timings (in
milliseconds, up to two decimal points) for varying the number of nodes (Use 2 processes per
node for the analysis) for the test cases given by varying k and other test cases you might use (if
any), in CSV format. Note the timings for all the optimizations you mentioned in the report.
The last rows should correspond to your final submission. Below is the formatted version for
clarity:
Optimizations V E C k 1 node 2 nodes 3 nodes 4 nodes
O1 V1 E1 C1 k1
O1 V1 E1 C1 k2
O2 V1 E1 C1 k1
O2 V1 E1 C1 k2
Here V = Number of vertices, E = Number of Edges in the given graph, C = Number of
distinct colors, and k = Number of top nodes to return.

Execution Instructions
Module Requirements
Before running the code, you must load ONLY the following modules:
1. module load compiler/gcc/9.1.0
2. module load compiler/gcc/9.1/mpich/3.3.1
Run module purge before loading any modules so that just the two modules are loaded.

Running the Code

To Compile:
mpic++ -o check check.cpp template.cpp
To run:
mpirun -np (Number_of_cores) ./check (Input_folder_name) (Output_file_name) (k)
Example: mpirun -np 4 ./check TestCase1 output.txt 10
Do not run this directly on the login node. You may use a PBS script.

Submission Instructions
Submit the following files on Gradescope:
|-- template.cpp
|-- readme.pdf
|-- data.csv
|-- Makefile

4
Note that if you submit more or less than these 4 files in the final submission, you will get a
0. Please stick to the given file names. In the make file mention the compile statement so that
when we run ”make compile”, we get an executable named check. This is done so that you can
add your own optimization flags to the compile statement.
The first submission is for correctness check, so only submit ”template.cpp” file (Only 1 file).

JtdmoMJK64 hw4
No ratings yet
JtdmoMJK64 hw4
10 pages
OpenMPCoursework2
100% (1)
OpenMPCoursework2
5 pages
Project
No ratings yet
Project
4 pages
Gd Script
From Everand
Gd Script
Marijo Trkulja
No ratings yet
PDC Report
No ratings yet
PDC Report
22 pages
Rust finedoc
No ratings yet
Rust finedoc
16 pages
14_The GAP Benchmark Suite
No ratings yet
14_The GAP Benchmark Suite
16 pages
ML Clustering
No ratings yet
ML Clustering
5 pages
Graph Theory and its Applications_ What Can Graphs Do for Your Software_ _ by Héla Ben Khalfallah _ Sep, 2024 _ ITNEXT
No ratings yet
Graph Theory and its Applications_ What Can Graphs Do for Your Software_ _ by Héla Ben Khalfallah _ Sep, 2024 _ ITNEXT
52 pages
"General - IO - HPP" "General - Defs - HPP" "Maxchordal - Subgraph - HPP"
No ratings yet
"General - IO - HPP" "General - Defs - HPP" "Maxchordal - Subgraph - HPP"
2 pages
Cse-106 ( Discrete Mathematics)
No ratings yet
Cse-106 ( Discrete Mathematics)
7 pages
Using MPI Portable Programming With The Message Pa PDF
No ratings yet
Using MPI Portable Programming With The Message Pa PDF
8 pages
I210277 I210461 ProjectProposal
No ratings yet
I210277 I210461 ProjectProposal
8 pages
Performance DebuggingTools
No ratings yet
Performance DebuggingTools
21 pages
A Dynamic Graph-Based Malware Classifier
No ratings yet
A Dynamic Graph-Based Malware Classifier
119 pages
HPC 2025 (1)
No ratings yet
HPC 2025 (1)
16 pages
CENG 213 PA3 2023-v1
No ratings yet
CENG 213 PA3 2023-v1
11 pages
INF_3201_h24_Assignment_1
No ratings yet
INF_3201_h24_Assignment_1
4 pages
dslr-sheet
No ratings yet
dslr-sheet
1 page
Understanding Networks Through Clustering
No ratings yet
Understanding Networks Through Clustering
5 pages
Jeffrey D. Ullman: Stanford University
No ratings yet
Jeffrey D. Ullman: Stanford University
52 pages
IO Efficient Generation of Hyperbolic Random Graphs
No ratings yet
IO Efficient Generation of Hyperbolic Random Graphs
109 pages
AI Networks - Ultra Series - Research 00z0021
No ratings yet
AI Networks - Ultra Series - Research 00z0021
5 pages
Graph Journeyman (1) Final
No ratings yet
Graph Journeyman (1) Final
25 pages
Clustering Networks
No ratings yet
Clustering Networks
5 pages
EE_36800_Project_3
No ratings yet
EE_36800_Project_3
3 pages
mpi_book
No ratings yet
mpi_book
673 pages
CS 484 - ParallelProgramming
No ratings yet
CS 484 - ParallelProgramming
5 pages
Learn C++
From Everand
Learn C++
Durgesh
4.5/5 (9)
SWE2017 - Lab Assignment 1pages-7
No ratings yet
SWE2017 - Lab Assignment 1pages-7
5 pages
C Programming
From Everand
C Programming
Netra
No ratings yet
White-Box and Black-Box Testing Techniques Example in Detail
No ratings yet
White-Box and Black-Box Testing Techniques Example in Detail
8 pages
Untitled document
No ratings yet
Untitled document
23 pages
Programming Assignment 2: Decomposition of Graphs
No ratings yet
Programming Assignment 2: Decomposition of Graphs
16 pages
CS 170 Efficient Algorithms and Intractable Problems Spring 2020 A. Chiesa and J. Nelson Problem Statement
No ratings yet
CS 170 Efficient Algorithms and Intractable Problems Spring 2020 A. Chiesa and J. Nelson Problem Statement
5 pages
Homework 4: ME 570 - Prof. Tron
No ratings yet
Homework 4: ME 570 - Prof. Tron
15 pages
Learn Excel Functions: Count, Countif, Sum and Sumif
From Everand
Learn Excel Functions: Count, Countif, Sum and Sumif
Rajan
5/5 (4)
Mpi
No ratings yet
Mpi
46 pages
Project4 (1)
No ratings yet
Project4 (1)
4 pages
Fallsem2019-20 Cse4001 Eth Vl2019201001348 Reference Material Cse4001 Parallel and Distributed Computing May 2019 (003) 18
No ratings yet
Fallsem2019-20 Cse4001 Eth Vl2019201001348 Reference Material Cse4001 Parallel and Distributed Computing May 2019 (003) 18
4 pages
EECS3311-W24-Final Review
No ratings yet
EECS3311-W24-Final Review
38 pages
US Airports: Usairport (Http://konect - Uni-Koblenz - De/networks/opsahl-Usairport)
No ratings yet
US Airports: Usairport (Http://konect - Uni-Koblenz - De/networks/opsahl-Usairport)
3 pages
CP4292-MCAP(1)
No ratings yet
CP4292-MCAP(1)
15 pages
Parallelizing Particle-In-Cell Codes With Openmp and Mpi: Nils Magnus Larsgård
No ratings yet
Parallelizing Particle-In-Cell Codes With Openmp and Mpi: Nils Magnus Larsgård
74 pages
Final Project
No ratings yet
Final Project
12 pages
Graphs: CS 302 - Data Structures Section 9.3
No ratings yet
Graphs: CS 302 - Data Structures Section 9.3
78 pages
Group38
No ratings yet
Group38
8 pages
Brandes U., Erlebach T. Eds. Network Analysis. Methodological Foundations 2005ã. 482ñ. ISBN ISBN10 3-540-24979-6 PDF
100% (1)
Brandes U., Erlebach T. Eds. Network Analysis. Methodological Foundations 2005ã. 482ñ. ISBN ISBN10 3-540-24979-6 PDF
482 pages
CSC3100 Data Structures HW4 Fall 2024(1)
No ratings yet
CSC3100 Data Structures HW4 Fall 2024(1)
9 pages
Data Structures (Spring 2021) Semester Project: Network Analysis Using Graphs
100% (1)
Data Structures (Spring 2021) Semester Project: Network Analysis Using Graphs
5 pages
EXERCISE- 4[1] (1)
No ratings yet
EXERCISE- 4[1] (1)
8 pages
PPT10-W10-Graph Analytics For Big Data
No ratings yet
PPT10-W10-Graph Analytics For Big Data
55 pages
Unit 4 - Non-Linear Data Structure - Binary - Graph - 1923081007
No ratings yet
Unit 4 - Non-Linear Data Structure - Binary - Graph - 1923081007
105 pages
Algorithms Unit 2
No ratings yet
Algorithms Unit 2
71 pages
Eij Kh Out Parallel Programming
No ratings yet
Eij Kh Out Parallel Programming
838 pages
04 Graph1
No ratings yet
04 Graph1
27 pages
Assessment Brief Template 2024-2025_CS3003 - Copy (1) 8
No ratings yet
Assessment Brief Template 2024-2025_CS3003 - Copy (1) 8
3 pages
MPI2
No ratings yet
MPI2
3 pages
Line Drawing Algorithm: Mastering Techniques for Precision Image Rendering
From Everand
Line Drawing Algorithm: Mastering Techniques for Precision Image Rendering
Fouad Sabry
No ratings yet
SriVenkataSaiRavitejaPasupuleti-project2-both
No ratings yet
SriVenkataSaiRavitejaPasupuleti-project2-both
9 pages
Non Consequentalism: Good Points of Duty-Based Ethics
No ratings yet
Non Consequentalism: Good Points of Duty-Based Ethics
4 pages
UENR15860001
0% (1)
UENR15860001
23 pages
HD8765 47 PDF
No ratings yet
HD8765 47 PDF
109 pages
RP 030343
No ratings yet
RP 030343
59 pages
Fresadora HF - Manual Peças
No ratings yet
Fresadora HF - Manual Peças
39 pages
Resistive Sensing Elements - POT, RTD, Thermistor
No ratings yet
Resistive Sensing Elements - POT, RTD, Thermistor
33 pages
Acoustic Terminology
No ratings yet
Acoustic Terminology
4 pages
Six Box Model
No ratings yet
Six Box Model
3 pages
Technical Notice SLS 6 Rev 3 - Fire Protection Systems Appliances and Compressed Gas Cylinder PDF
No ratings yet
Technical Notice SLS 6 Rev 3 - Fire Protection Systems Appliances and Compressed Gas Cylinder PDF
20 pages
Control of Dual Active Bridge
No ratings yet
Control of Dual Active Bridge
15 pages
Ex4 Intro To Game Theory
No ratings yet
Ex4 Intro To Game Theory
36 pages
Third Party Risk Management Solution - Web
No ratings yet
Third Party Risk Management Solution - Web
16 pages
Personal Computer (PC) - Based Flood Monitoring System Using Cloud Computing
No ratings yet
Personal Computer (PC) - Based Flood Monitoring System Using Cloud Computing
7 pages
Layout Mata101n
No ratings yet
Layout Mata101n
5 pages
Savera Spares Rist
No ratings yet
Savera Spares Rist
41 pages
Design Calculation For External Pipe Rack Section PK 12 - Grid 90 To 109
No ratings yet
Design Calculation For External Pipe Rack Section PK 12 - Grid 90 To 109
24 pages
Syllabus College of Business, Accountancy, and Auditing: ST ST
No ratings yet
Syllabus College of Business, Accountancy, and Auditing: ST ST
47 pages
India Hydroponics Forecast
No ratings yet
India Hydroponics Forecast
9 pages
NIR - Multivariate Calibration - 3rd Edition 2014
No ratings yet
NIR - Multivariate Calibration - 3rd Edition 2014
12 pages
SCD-C_FO_101_76_EN
No ratings yet
SCD-C_FO_101_76_EN
21 pages
Jacket Vessels. Overall Coefficients
No ratings yet
Jacket Vessels. Overall Coefficients
1 page
Final Guidelines For No Objection Certificate For EV Charging
No ratings yet
Final Guidelines For No Objection Certificate For EV Charging
12 pages
Science6 - Q1-WK-3 FOR STUDENT
No ratings yet
Science6 - Q1-WK-3 FOR STUDENT
18 pages
UMAT Practice Paper 1
No ratings yet
UMAT Practice Paper 1
41 pages
Design of Common Raft: Loads On Column at Base
No ratings yet
Design of Common Raft: Loads On Column at Base
20 pages
Shooters Catalog
75% (12)
Shooters Catalog
16 pages
Carl Savigny
No ratings yet
Carl Savigny
16 pages
Nabard Project Boq
No ratings yet
Nabard Project Boq
34 pages
Re Evaluation Result of Answer Scripts of 5th Semester 6th BP Semester Winter 2023 Exam
No ratings yet
Re Evaluation Result of Answer Scripts of 5th Semester 6th BP Semester Winter 2023 Exam
24 pages
The Impact of Motivation Ability Role Perception o
No ratings yet
The Impact of Motivation Ability Role Perception o
10 pages

Assignment_2-3

Uploaded by

Assignment_2-3

Uploaded by

Assignment 2: Parallel Graph Centrality Computation

• Due Date 2: 11:59 PM, March 17, 2025

Degree Centrality(u, c) = count of v ,where v ∈ neighbors of u and color(v) = c

Solving an Example Graph

Degree Centrality(1, 1) = 1(to 2) ⇒ 2

Degree Centrality(1, 2) = 1(to 3) ⇒ 2

Step 3: Returning top k

• Performance: Efficiency and scalability of the parallel implementation with different

• Report: Clarity and depth of performance analysis.

Running the Code

You might also like