Graph (Abstract Data Type) - Wikipedia
Graph (Abstract Data Type) - Wikipedia
data type)
In computer science, a graph is an abstract data type that is meant to implement the
undirected graph and directed graph concepts from the field of graph theory within
mathematics.
A directed graph with three vertices (blue circles) and three edges (black arrows).
A graph data structure consists of a finite (and possibly mutable) set of vertices (also called
nodes or points), together with a set of unordered pairs of these vertices for an undirected
graph or a set of ordered pairs for a directed graph. These pairs are known as edges (also
called links or lines), and for a directed graph are also known as edges but also sometimes
arrows or arcs. The vertices may be part of the graph structure, or may be external entities
represented by integer indices or references.
A graph data structure may also associate to each edge some edge value, such as a symbolic
label or a numeric attribute (cost, capacity, length, etc.).
Operations
adjacent(G, x, y): tests whether there is an edge from the vertex x to the vertex y;
neighbors(G, x): lists all vertices y such that there is an edge from the vertex x to the
vertex y;
add_edge(G, x, y): adds the edge from the vertex x to the vertex y, if it is not there;
remove_edge(G, x, y): removes the edge from the vertex x to the vertex y, if it is there;
get_edge_value(G, x, y): returns the value associated with the edge (x, y);
set_edge_value(G, x, y, v): sets the value associated with the edge (x, y) to v.
Adjacency list[2]
Vertices are stored as records or objects, and every vertex stores a list of adjacent vertices.
This data structure allows the storage of additional data on the vertices. Additional data
can be stored if edges are also stored as objects, in which case each vertex stores its
incident edges and each edge stores its incident vertices.
Adjacency matrix[3]
A two-dimensional matrix, in which the rows represent source vertices and columns
represent destination vertices. Data on edges and vertices must be stored externally. Only
the cost for one edge can be stored between each pair of vertices.
Incidence matrix[4]
A two-dimensional matrix, in which the rows represent the vertices and columns represent
the edges. The entries indicate the incidence relation between the vertex at a row and edge
at a column.
The following table gives the time complexity cost of performing various operations on
graphs, for each of these representations, with |V| the number of vertices and |E| the number
of edges. In the matrix representations, the entries encode the cost of following an edge. The
cost of edges that are not present are assumed to be ∞.
Store graph
Add vertex
Add edge
Remove vertex
Remove edge
Adjacency lists are generally preferred because they efficiently represent sparse graphs. An
adjacency matrix is preferred if the graph is dense, that is the number of edges |E| is close to
the number of vertices squared, |V|2, or if one must be able to quickly look up if there is an
edge connecting two vertices.[5][6]
Parallel representations
Shared memory
In the case of a shared memory model, the graph representations used for parallel
processing are the same as in the sequential case,[9] since parallel read-only access to the
graph representation (e.g. an adjacency list) is efficient in shared memory.
Distributed memory
In the distributed memory model, the usual approach is to partition the vertex set of the
graph into sets . Here, is the amount of available processing elements
(PE). The vertex set partitions are then distributed to the PEs with matching index,
additionally to the corresponding edges. Every PE has its own subgraph representation,
where edges with an endpoint in another partition require special attention. For standard
communication interfaces like MPI, the ID of the PE owning the other endpoint has to be
identifiable. During computation in a distributed graph algorithms, passing information along
these edges implies communication.[9]
Partitioning the graph needs to be done carefully - there is a trade-off between low
communication and even size partitioning[10] But partitioning a graph is a NP-hard problem,
so it is not feasible to calculate them. Instead, the following heuristics are used.
1D partitioning: Every processor gets vertices and the corresponding outgoing edges.
This can be understood as a row-wise or column-wise decomposition of the adjacency
matrix. For algorithms operating on this representation, this requires an All-to-All
communication step as well as message buffer sizes, as each PE potentially has
outgoing edges to every other PE.[11]
2D partitioning: Every processor gets a submatrix of the adjacency matrix. Assume the
processors are aligned in a rectangle , where and are the amount of
processing elements in each row and column, respectively. Then each processor gets a
submatrix of the adjacency matrix of dimension . This can be visualized
as a checkerboard pattern in a matrix.[11] Therefore, each processing unit can only have
outgoing edges to PEs in the same row and column. This bounds the amount of
communication partners for each PE to out of possible ones.
Compressed representations
Graphs with trillions of edges occur in machine learning, social network analysis, and other
areas. Compressed graph representations have been developed to reduce I/O and memory
requirements. General techniques such as Huffman coding are applicable, but the adjacency
list or adjacency matrix can be processed in specific ways to increase efficiency.[12]
See also
Graph traversal for graph walking strategies
Graph rewriting for rule based transformations of graphs (graph data structures)
Graph drawing software for software, systems, and providers of systems for drawing
graphs
References
1. See, e.g. Goodrich & Tamassia (2015), Section 13.1.2: Operations on graphs, p. 360. For a more
detailed set of operations, see Mehlhorn, K.; Näher, S. (1999). "Chapter 6: Graphs and their data
structures". LEDA: A platform for combinatorial and geometric computing (https://people.mpi-inf.mp
g.de/~mehlhorn/ftp/LEDAbook/Graphs.pdf) (PDF). Cambridge University Press. pp. 240–282.
2. Cormen et al. (2001), pp. 528–529; Goodrich & Tamassia (2015), pp. 361-362.
3. Cormen et al. (2001), pp. 529–530; Goodrich & Tamassia (2015), p. 363.
5. Cormen, Thomas H.; Leiserson, Charles E.; Rivest, Ronald L.; Stein, Clifford (2001). "Section 22.1:
Representations of graphs". Introduction to Algorithms (Second ed.). MIT Press and McGraw-Hill.
pp. 527–531. ISBN 0-262-03293-7.
6. Goodrich, Michael T.; Tamassia, Roberto (2015). "Section 13.1: Graph terminology and
representations". Algorithm Design and Applications. Wiley. pp. 355–364. ISBN 978-1-118-33591-8.
7. Bader, David; Meyerhenke, Henning; Sanders, Peter; Wagner, Dorothea (January 2013). Graph
Partitioning and Graph Clustering (https://www.ams.org/conm/588/) . Contemporary Mathematics.
588. American Mathematical Society. doi:10.1090/conm/588/11709 (https://doi.org/10.1090%2Fcon
m%2F588%2F11709) . ISBN 978-0-8218-9038-7.
8. Lumsdaine, Andrew; Gregor, Douglas; Hendrickson, Bruce; Berry, Jonathan (March 2007). "Challenges
in Parallel Graph Processing". Parallel Processing Letters. 17 (1): 5–20.
doi:10.1142/s0129626407002843 (https://doi.org/10.1142%2Fs0129626407002843) . ISSN 0129-
6264 (https://www.worldcat.org/issn/0129-6264) .
9. Sanders, Peter; Mehlhorn, Kurt; Dietzfelbinger, Martin; Dementiev, Roman (2019). Sequential and
Parallel Algorithms and Data Structures: The Basic Toolbox (https://www.springer.com/gp/book/978
3030252083) . Springer International Publishing. ISBN 978-3-030-25208-3.
11. Buluç, A.; Madduri, Kamesh (2011). "Applications". Parallel breadth-first search on distributed memory
systems. 2011 International Conference for High Performance Computing, Networking, Storage and
Analysis. doi:10.1145/2063384.2063471 (https://doi.org/10.1145%2F2063384.2063471) .
ISBN 978-1-4503-0771-0. S2CID 6540738 (https://api.semanticscholar.org/CorpusID:6540738) .
12. Besta, Maciej; Hoefler, Torsten (27 April 2019). "Survey and Taxonomy of Lossless Graph
Compression and Space-Efficient Graph Representations". arXiv:1806.01799 (https://arxiv.org/abs/18
06.01799) .
External links
Retrieved from
"https://en.wikipedia.org/w/index.php?
title=Graph_(abstract_data_type)&oldid=1052445
998"
Wikipedia