Merged Lecture Notes

Contents

1 Sorting and Tries 2
2 Graphs and Graph Algorithms 29
3 Strings and text algorithms 101
4 NP Completeness 185
5 Computability 242
Algorithmics I 2022

Algorithmics I

Section 1 – Sorting and Tries

Dr. Gethin Norman

School of Computing Science


University of Glasgow

[email protected]



Sorting - Recap
Naïve sorting algorithms: O(n^2) in the worst/average case
− Selectionsort, Insertionsort, Bubblesort

Clever sorting algorithms: O(n log n) in the worst/average case


− Mergesort, Heapsort (which we have just seen)

The fastest sorting algorithm in practice is Quicksort


− O(n log n) on average
− but no better than O(n^2) in the worst case (unless a clever variant is used)

Question: can we come up with a sorting algorithm that is better


than O(n log n) in the worst case?
− for example an O(n) algorithm

Sorting - Comparison based sorting
Claim: no sorting algorithm that is based on pairwise comparison
of values can be better than O(n log n)

Justification: describe the algorithm by a decision tree (binary tree)
− each node represents a comparison between two elements
− path branches left or right depending on the outcome of the comparison
− an execution of the algorithm is a path from the root to a leaf node
− the number of leaf nodes in the tree must be at least the number of
‘outcomes’ of the algorithm
− the number of outcomes is the number of possible orderings of n items
− that is, there are at least n! leaf nodes (remember permutations from AF2)

[decision tree figure: the root comparison a1>b1 branches ‘no’/‘yes’ to
the comparisons a2>b2 and a3>b3, each of which branches ‘no’/‘yes’ again]

Sorting - Comparison based sorting
We have shown the decision tree has at least n! leaf nodes

The worst-case complexity of the algorithm is no better than O(h)


− where h is the height of the tree
− an execution is a path from the root node to a leaf node
− we perform a comparison at each branch node, so h operations in the
worst case

A decision tree is a binary tree (two branches ‘yes’ and ‘no’)


and hence the number of leaf nodes is less than or equal to 2^(h+1)-1
− a binary tree of height h has at most 2^(h+1)-1 nodes

Combining these properties it follows that n! ≤ 2^(h+1)-1 ≤ 2^(h+1)

Sorting - Comparison based sorting
We have shown: complexity is no better than O(h) and 2^(h+1) ≥ n!
− h is the height of the decision tree
− n is the number of items to be sorted

Taking log_2 of both sides of 2^(h+1) ≥ n! yields:

h+1 ≥ log_2(n!)
    > log_2((n/2)^(n/2))                (since n! > (n/2)^(n/2))
    = (n/2)·log_2(n/2)                  (since log a^b = b·log a)
    = (n/2)·log_2 n − (n/2)·log_2 2     (since log(a/b) = log a − log b)
    = (n/2)·log_2 n − n/2               (since log_a a = 1)

Giving a complexity of at least O(n log n) as required
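
For example, for n = 8 there are 8! = 40320 possible orderings, and
log_2(40320) ≈ 15.3, so h+1 ≥ 16; that is, the decision tree must have
height at least 15 and some input of 8 items needs at least 15 comparisons.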

Sorting – Radix sorting
We have shown that no sorting algorithm based on pairwise
comparisons can be better than O(n log n) in the worst case
− therefore to improve on this worst case bound, we have to devise a
method based on something other than comparisons

Radix sort uses a different approach to achieve an O(n) complexity


− but the algorithm has to exploit the structure of the items being sorted,
so may be less versatile
− in practice, it is faster than O(n log n) algorithms only for very large n

Assume items to sort can be treated as bit-sequences of length m


− let b be a chosen factor of m
− so b and m are constants for any particular instance

Sorting – Radix sorting - Algorithm
Each item has bit positions labelled 0,1,…,m-1
− bit 0 being the least significant (i.e. the right-most)
The algorithm uses m/b iterations
− in each iteration the items are distributed into 2^b buckets
− a bucket is just a list
− the buckets are labelled 0,1,…,2^b-1 (or, equivalently, 00...0 to 11...1)
− during the ith iteration an item is placed in the bucket corresponding to
the integer represented by the bits in positions b×i−1,…,b×(i−1)
• e.g. for b=4 and i=2, consider the bits in positions 7,…,4
item = 0010100100110001
• these bits, 0011, represent the integer 3
• so the item is placed in the bucket labelled 3 (or, equivalently, 0011)
− at the end of an iteration the buckets are concatenated to give a new
sequence which will be used as the starting point of the next iteration

Sorting – Radix sorting - Example
Suppose we want to sort the following sequence with Radix sort

15 43 5 27 60 18 26 2

Binary encodings are given by

15 = 001111 43 = 101011 5 = 000101 27 = 011011


60 = 111100 18 = 010010 26 = 011010 2 = 000010

− items have bit positions 0,…,5, hence m=6


− b must be a factor of m, so let's choose b=2

This means in Radix sort we have:


− 2^b = 2^2 = 4 buckets labelled 0,1,2,3 (or equivalently 00,01,10,11)
and m/b = 3 iterations are required

Sorting – Radix sorting - Example
Sequence: 15 43 5 27 60 18 26 2

Binary encodings: 15 = 001111 43 = 101011 5 = 000101 27 = 011011


60 = 111100 18 = 010010 26 = 011010 2 = 000010

First iteration of radix


− items are distributed into 4 buckets (a bucket is just a list)
− during the 1st iteration, an item is placed in a bucket corresponding to
the integer represented by the bits in positions 1,…,0
− buckets concatenated at the end of an iteration to give input sequence
for the next iteration
1st iteration:
bucket 00: 60
bucket 01: 5
bucket 10: 18 26 2
bucket 11: 15 43 27
new sequence: 60 5 18 26 2 15 43 27
Sorting – Radix sorting - Example
New sequence: 60 5 18 26 2 15 43 27

Binary encodings: 60 = 111100 5 = 000101 18 = 010010 26 = 011010


2 = 000010 15 = 001111 43 = 101011 27 = 011011

Second iteration of radix


− items are distributed into 4 buckets (a bucket is just a list)
− during the 2nd iteration, an item is placed in a bucket corresponding to
the integer represented by the bits in positions 3,…,2
− buckets concatenated at the end of an iteration to give input sequence
for the next iteration
2nd iteration:
bucket 00: 18 2
bucket 01: 5
bucket 10: 26 43 27
bucket 11: 60 15
new sequence: 18 2 5 26 43 27 60 15

Sorting – Radix sorting - Example
New sequence: 18 2 5 26 43 27 60 15

Binary encodings: 18 = 010010 2 = 000010 5 = 000101 26 = 011010


43 = 101011 27 = 011011 60 = 111100 15 = 001111

Third (and final) iteration of radix


− items are distributed into 4 buckets (a bucket is just a list)
− during the 3rd iteration, an item is placed in a bucket corresponding to
the integer represented by the bits in positions 5,…,4
− buckets concatenated at the end of an iteration to give input sequence
for the next iteration
3rd iteration:
bucket 00: 2 5 15
bucket 01: 18 26 27
bucket 10: 43
bucket 11: 60
sorted sequence: 2 5 15 18 26 27 43 60

Sorting – Radix sorting - Pseudocode

// assume we have the following method which returns the value


// represented by the b bits of x when starting at position pos
private int bits(Item x, int b, int pos)

// suppose that:
// a is the sequence to be sorted
// m is the number of bits in each item of the sequence a
// b is the ‘block length’ of radix sort

int numIterations = m/b; // number of iterations required for sorting


int numBuckets = (int) Math.pow(2, b); // number of buckets

// represent sequence a to be sorted as an ArrayList of Items


ArrayList<Item> a = new ArrayList<Item>();

// represent the buckets as an array of ArrayLists


ArrayList<Item>[] buckets = new ArrayList[numBuckets];
for (int i=0; i<numBuckets; i++) buckets[i] = new ArrayList<Item>();

Sorting – Radix sorting - Pseudocode

for (int i=1; i<=numIterations; i++){

// clear the buckets


for (int j=0; j<numBuckets; j++) buckets[j].clear();

// distribute the items (in order from the sequence a)


for (Item x : a){
// find the value of the b bits starting from position (i-1)*b in x
int k = bits(x, b, (i-1)*b); // find the correct bucket for item x
buckets[k].add(x); // add item to this bucket
}

a.clear(); // clear the sequence

// concatenate the buckets (in sequence) to form the new sequence


for (int j=0; j<numBuckets; j++) a.addAll(buckets[j]);
}
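
For concreteness, a minimal runnable version of the above (an assumption of
this sketch: items are non-negative int values, so bits can be implemented
by shifting and masking; names follow the pseudocode):

import java.util.ArrayList;

public class RadixSort {

    // value represented by the b bits of x starting at position pos
    private static int bits(int x, int b, int pos) {
        return (x >> pos) & ((1 << b) - 1);
    }

    /** sort a sequence of m-bit non-negative integers using radix b (b divides m) */
    public static void radixSort(ArrayList<Integer> a, int m, int b) {
        int numIterations = m / b;
        int numBuckets = 1 << b;                       // 2^b buckets
        ArrayList<ArrayList<Integer>> buckets = new ArrayList<>();
        for (int j = 0; j < numBuckets; j++) buckets.add(new ArrayList<>());
        for (int i = 1; i <= numIterations; i++) {
            for (ArrayList<Integer> bucket : buckets) bucket.clear();
            for (int x : a) buckets.get(bits(x, b, (i-1)*b)).add(x); // distribute
            a.clear();
            for (ArrayList<Integer> bucket : buckets) a.addAll(bucket); // concatenate
        }
    }
}

On the earlier example (15 43 5 27 60 18 26 2 with m=6 and b=2) this
produces the sorted sequence 2 5 15 18 26 27 43 60.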

Sorting – Radix sorting - Correctness
Let x and y be two items with x<y
− need to show that x precedes y in the final sequence

Suppose j is the last iteration for which relevant bits of x and y differ
− since x<y and j is the last iteration that x and y differ
the relevant bits of x must be smaller than those of y
− therefore x goes into an ‘earlier’ bucket than y
and hence x precedes y in the sequence after this iteration

− since j is the last iteration where bits differ:


in all later iterations x and y go in the same bucket
so their relative order is unchanged

Sorting – Radix sorting - Complexity
Number of iterations is m/b and number of buckets is 2^b

During each of the m/b iterations


− the sequence is scanned and items are allocated buckets: O(n) time
− buckets are concatenated: O(2^b) time

Therefore the overall complexity is O((m/b)·(n+2^b))


− this is O(n), since m and b are constants

Time-space trade-off
− the larger the value of b, the smaller the multiplicative constant (m/b) in
the complexity function and so the faster the algorithm will become
− however an array of size 2^b is required for the buckets
therefore increasing b will increase the space requirements

Tries (retrieval)
Binary search trees are comparison-based data structures

Tries are to binary search trees as Radix sort is to comparison-based sorting


− stored items have a key value that is interpreted as a sequence of bits,
or characters, …
− there is a multiway branch at each node where each branch has an
associated symbol and no two siblings have the same symbol
− the branch taken at level i during a search, is determined by the ith
element of the key value (ith bit, ith character, …)
− tracing a path from the root to a node spells out the key value of the item

Example: use a trie to store items with a key value that is a string
− say the words in a dictionary

Tries - Examples
An example trie containing words from the 4 letter alphabet {a, e, r, t}

[trie figure: a multiway tree in which each branch is labelled by a letter;
tracing a path from the root spells out a string, e.g. a➝r spells the
string ‘ar’ (an intermediate node, not a word) and a➝r➝t spells the
word ‘art’]

• Two kinds of nodes
− nodes representing words
− internal/intermediate nodes (not representing a word)

Tries – Search algorithm (pseudo code)
// searching for a word w in a trie t
Node n = root of t; // current node (start at root)
int i = 0; // current position in word w (start at beginning)

while (true) {
if (n has a child c labelled w.charAt(i)) {
// can match the character of word in the current position
if (i == w.length()-1) { // end of word
if (c is an 'intermediate' node) return "absent";
else return "present";
}
else { // not at end of word
n = c; // move to child node
i++; // move to next character of word
}
}
else return "absent"; // cannot match current character
}
Tries – Insertion algorithm (pseudo code)

// inserting a word w in a trie t


Node n = root of t; // current node (start at root)

for (int i=0; i < w.length(); i++){ // go through chars of word


if (n has no child c labelled w.charAt(i)){
// need to add new node
create such a child c;
mark c as intermediate;
} // otherwise let c be the existing child labelled w.charAt(i)
n = c; // move to child node
}
mark n as representing a word;

Tries - Algorithms
Deletion of a string from a trie
− exercise

Complexity of trie operations


− (almost) independent of the number of items
− essentially linear in the string length

Tries - Implementation
Various possible implementations
− using an array (of pointers to represent the children of each node)
− using linked lists (to represent the children of each node)
− time/space trade-off

List implementation: each node stores its letter, a word flag (T/F), a link
to its first child and a link to its next sibling
− e.g. the trie with root children a, e, r, t, where ‘a’ and its child ‘e’
mark the words "a" and "ae", becomes the list structure:
root (′′, F) with child list (′a′, T) ➝ (′e′, F) ➝ (′r′, F) ➝ (′t′, F),
where (′a′, T) has child list (′e′, T)

Tries – Class to represent dictionary tries
public class Node { // node of a trie
private char letter; // label on incoming branch
private boolean isWord; // true when node represents a word
private Node sibling; // next sibling (when it exists)
private Node child; // first child (when it exists)

/** create a new node with letter c */


public Node(char c){
letter = c;
isWord = false;
sibling = null;
child = null;
}
// include accessors and mutators for the various components of class
}
public class Trie {
private Node root;
public Trie() {
root = new Node(Character.MIN_VALUE); // null character in root
}

Tries – Method to search
private enum Outcomes {PRESENT, ABSENT, UNKNOWN}
/** search trie for word w */
public boolean search(String w) {
Outcomes outcome = Outcomes.UNKNOWN;
int i = 0; // position in word so far searched (start at beginning)
Node current = root.getChild(); // start with first child of root
while (outcome == Outcomes.UNKNOWN) {
if (current == null) outcome = Outcomes.ABSENT; // dead-end
else if (current.getLetter() == w.charAt(i)) { // positions match
if (i == w.length()-1) outcome = Outcomes.PRESENT; // matched word
else { // descend one level…
current = current.getChild(); // in trie
i++; // in word being searched
}
}
else current = current.getSibling(); // try next sibling
}
if (outcome != Outcomes.PRESENT) return false;
else return current.getIsWord(); // true if current node represents a word
}

Tries – Method to insert
public void insert(String w){ /* insert word w into trie */
int i = 0; // position in word (start at beginning)
Node current = root; // current node of trie (start at root)
Node next = current.getChild(); // child of current node we are testing
while (i < w.length()) { // not reached the end of the word
if (next == null) { // no more children to try: need new node
Node x = new Node(w.charAt(i)); // label with ith element of word
x.setSibling(current.getChild()); // sibling: first child of current
current.setChild(x); // make it first child of current node
current = x; // move to the new node
next = current.getChild(); // update child node (null for a new node)
i++; // next position in word
} else if (next.getLetter() == w.charAt(i)) { // chars match: descend a level
current = next; // update current to the child node
next = current.getChild(); // update child node
i++; // next position in word
} else next = next.getSibling(); // try next sibling
}
current.setIsWord(true); // current represents word w
}
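
For concreteness, a short usage sketch of the Trie class and the two methods
above (assuming the accessors and mutators mentioned in the Node class, e.g.
getChild and setSibling, are implemented as standard getters and setters):

public class TrieDemo {
    public static void main(String[] args) {
        Trie t = new Trie();
        t.insert("art");
        t.insert("ar");  // marks an existing intermediate node as a word
        System.out.println(t.search("ar"));  // true
        System.out.println(t.search("art")); // true
        System.out.println(t.search("a"));   // false: intermediate node only
        System.out.println(t.search("rat")); // false: no such path
    }
}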
Algorithmics I 2022

Algorithmics I

Section 2 – Graphs & graph algorithms

Dr. Gethin Norman


School of Computing Science
University of Glasgow
[email protected]



Graph basics
(undirected) graph G = (V,E)
− V is finite set of vertices (the vertex set)
− E is set of edges, each edge is a subset of V of size 2 (the edge set)
Pictorially:
− a vertex is represented by a point
− an edge by a line joining the relevant pair of points
− a graph can be drawn in different ways
− e.g. two representations of the same graph

V = {a,b,c,x,y,z}
E = { {a,x},{a,y},{a,z},
      {b,x},{b,y},{b,z},
      {c,x},{c,y},{c,z} }

[figure: two drawings of this graph, one with a,b,c above x,y,z and one
with the six vertices arranged in a circle]

Graph basics
[figure: the same graph drawn in two ways, as on the previous slide]

In this graph:
− vertices a & z are adjacent that is {a,z} is an element of the edge set E
− vertices a & b are non-adjacent that is {a,b} is not an element of E
− vertex a is incident to edge {a,x}
− a➝x➝b➝y➝c is a path of length 4 (number of edges)
− a➝x➝b➝y➝a is a cycle of length 4
− all vertices have degree 3
• i.e. all vertices are incident to three edges

Graph basics - Definitions
A graph is connected if every pair of vertices is joined by a path

[figure: an example graph on vertices u,v,w,x,y,z]
A non-connected graph has two or more connected components

• A graph is a tree if it is connected and acyclic (no cycles)


a tree with n vertices has n-1 edges
- at least n-1 edges to be connected
- at most n-1 edges to be acyclic

• A graph is a forest if it is acyclic, i.e. each of its connected components is a tree


Graph basics - Definitions
A graph is complete (a clique) if every pair of vertices is joined by an edge

[figure: K6, the clique on 6 vertices]

A graph is bipartite if the vertices are in two disjoint sets U & W
and every edge joins a vertex in U to a vertex in W

[figure: the complete bipartite graph K3,3, with U = {a,b,c} and W = {x,y,z};
it is complete since all edges between vertices in U and W are present]

− bipartite graphs do not need to be complete

Graph basics – Directed graphs
A directed graph (digraph) D = (V,E)
− V is the finite set of vertices and E is the finite set of edges
− here each edge is an ordered pair (x,y) of vertices

Pictorially: edges are drawn as directed lines/arrows

[figure: a digraph on vertices u,v,w,x,y,z; for example (u,v),(w,y),(y,w) ∈ E]
− u is adjacent to v and v is adjacent from u
− y has in-degree 2 and out-degree 1

In a digraph, paths and cycles must follow edge directions


• e.g. u ➝ w ➝ x is a path and w ➝ y ➝ w is a cycle

Graph representations – Undirected graphs
Undirected graph: Adjacency matrix
− one row and column for each vertex
− row i, column j contains a 1 if ith and jth vertices adjacent, 0 otherwise

Undirected graph: Adjacency lists


− one list for each vertex
− list i contains an entry for j if the vertices i and j are adjacent

Graph representations – Undirected graphs
Undirected graph G
[figure: the graph G on vertices u,v,w,x,y,z]

Adjacency matrix for G Adjacency lists for G

u v w x y z
u: 0 1 0 1 0 0 u: v➝x
v: 1 0 1 1 1 0 v: u➝w➝x➝y
w: 0 1 0 1 1 0 w: v➝x➝y
x: 1 1 1 0 1 0 x: u➝v➝w➝y
y: 0 1 1 1 0 1 y: v➝w➝x➝z
z: 0 0 0 0 1 0 z: y
|V|×|V| array; the adjacency lists contain 2×|E| entries in all

Graph representations – Directed graphs
Directed graph: Adjacency matrix
− one row and column for each vertex
− row i, column j contains a 1 if there is an edge from i to j
and 0 otherwise

Directed graph: Adjacency lists


− one list for each vertex
− the list for vertex i contains vertex j if there is an edge from i to j

Graph representations – Directed graphs
Directed graph D

[figure: the digraph D from the earlier slide]

Adjacency matrix for D Adjacency lists for D


u v w x y z u: v➝w
u: 0 1 1 0 0 0 v:
v: 0 0 0 0 0 0 w: x➝y
w: 0 0 0 1 1 0 x:
x: 0 0 0 0 0 0 y: w
y: 0 0 1 0 0 0 z: y
z: 0 0 0 0 1 0
|V|×|V| array; the adjacency lists contain |E| entries in all

Implementation – Adjacency lists
Recall adjacency list for an undirected graph
− one list for each vertex
− list i contains an element for j if the vertices i and j are adjacent

graph G and its adjacency lists

[figure: a graph on vertices v,w,x,y,z]

v: w➝x➝y
w: v➝x➝y
x: v➝w➝y
y: v➝w➝x➝z
z: y

Implementation: define classes for


− the entries of adjacency lists
− the vertices (includes a linked list representing its adjacency list)
− graphs (includes the size of the graph and an array of vertices)
• array allows for efficient access using “index” of a vertex
Implementation – Adjacency lists
/** class to represent an entry in the adjacency list of a vertex
in a graph */
public class AdjListNode {

private int vertexIndex; // the vertex index of the entry

// possibly other fields, for example representing properties


// of the edge such as weight, capacity, …

/** creates a new entry for vertex indexed i */


public AdjListNode(int i){
vertexIndex = i;
}
public int getVertexIndex(){ // gets the vertex index of the entry
return vertexIndex;
}
public void setVertexIndex(int i){ // sets vertex index to i
vertexIndex = i;
}
}

Implementation – Adjacency lists
import java.util.LinkedList; // we require the linked list class

/** class to represent a vertex in a graph */


public class Vertex {

private int index; // the index of this vertex


private LinkedList<AdjListNode> adjList; // the adjacency list of vertex

// possibly other fields, e.g. representing data stored at the node

/** create a new instance of vertex with index i */


public Vertex(int i) {
index = i; // set index
adjList = new LinkedList<AdjListNode>();// create empty adjacency list
}

/** return the index of the vertex */


public int getIndex(){
return index;
}
Implementation – Adjacency lists
// class Vertex continued

/** set the index of the vertex */


public void setIndex(int i){
index = i;
}
/** return the adjacency list of the vertex */
public LinkedList<AdjListNode> getAdjList(){
return adjList;
}
/** add vertex with index j to the adjacency list */
public void addToAdjList(int j){
adjList.addLast(new AdjListNode(j));
}
/** return the degree of the vertex */
public int vertexDegree(){
return adjList.size();
}
}

Implementation – Adjacency lists
import java.util.LinkedList; // again require the linked list class
/** class to represent a graph */
public class Graph {

private Vertex[] vertices; // the vertices


private int numVertices = 0; // number of vertices

// possibly other fields representing properties of the graph

/** Create a Graph with n vertices indexed 0,...,n-1 */


public Graph(int n) {
numVertices = n;
vertices = new Vertex[n];
for (int i = 0; i < n; i++) vertices[i] = new Vertex(i);
}
/** returns number of vertices in the graph */
public int size(){
return numVertices;
}
}
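
For concreteness, a sketch showing how the 6-vertex graph G from the
adjacency-lists slide could be built with these classes; the addEdge helper
is not part of the class shown above but is a hypothetical addition to Graph:

// hypothetical helper, added to class Graph
/** add the undirected edge {i,j} to both adjacency lists */
public void addEdge(int i, int j) {
    vertices[i].addToAdjList(j);
    vertices[j].addToAdjList(i);
}

// usage, e.g. in a main method, with u,v,w,x,y,z indexed 0,...,5
Graph g = new Graph(6);
g.addEdge(0, 1); g.addEdge(0, 3);                   // {u,v}, {u,x}
g.addEdge(1, 2); g.addEdge(1, 3); g.addEdge(1, 4);  // {v,w}, {v,x}, {v,y}
g.addEdge(2, 3); g.addEdge(2, 4);                   // {w,x}, {w,y}
g.addEdge(3, 4); g.addEdge(4, 5);                   // {x,y}, {y,z}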

Graph search and traversal algorithms
Graph search and traversal algorithms
− a systematic way to explore a graph (when starting from some vertex)

[figure: the example digraph D on vertices u,v,w,x,y,z]
Example: web crawler collects data from hypertext documents
by traversing a directed graph D where
− vertices are hypertext documents
− (u,v) is an edge if document u contains a hyperlink to document v

A search/traversal visits all vertices by travelling along edges


− traversal is efficient if it explores graph in O(|V|+|E|) time

Depth first search/traversal (DFS)
From starting vertex
− follow a path of unvisited vertices until path can be extended no further
− then backtrack along the path until an unvisited vertex can be reached
− continue until we cannot find any unvisited vertices
Repeat for other components (if any)

The edges traversed form a spanning tree (or forest)


− a depth-first spanning tree (forest)
− spanning tree of a graph is tree composed of all the vertices and some
(or perhaps all) of the edges of the graph

Depth first traversal - Example
[figure: an undirected graph G and a depth-first spanning tree of G;
the vertices are numbered 1-8 in the order they are visited]

Implementation – DFS – Add to vertex class

private boolean visited; // has vertex been visited in a traversal?

private int pred; // index of the predecessor vertex in a traversal

public boolean getVisited(){


return visited;
}
public void setVisited(boolean b){
visited = b;
}
public int getPred(){
return pred;
}
public void setPred(int i){
pred = i;
}

Implementation – DFS – Add to graph class
/** visit vertex v, with predecessor index p, during a dfs */
private void visit(Vertex v, int p){
v.setVisited(true); // update as now visited
v.setPred(p); // set predecessor (indicates edge used to find vertex)
LinkedList<AdjListNode> L = v.getAdjList(); // get adjacency list

for (AdjListNode node : L){ // go through all adjacent vertices


int i = node.getVertexIndex(); // index of the adjacent vertex
if (!vertices[i].getVisited()) // if vertex has not been visited
visit(vertices[i], v.getIndex()); // continue dfs search from it
// setting the predecessor vertex index to the index of v
}
}
/** carry out a depth first search/traversal of the graph */
public void dfs(){
for (Vertex v : vertices) v.setVisited(false); // initialise
for (Vertex v : vertices) if (!v.getVisited()) visit(v,-1);
// if vertex is not yet visited, then start dfs on vertex
// -1 is used to indicate v was not found through an edge of the graph
}

Analysis – Depth first search
Each vertex is visited, and each element in the adjacency lists is
processed, so overall O(n+m)
− where n is the number of vertices and m the number of edges

Can be adapted to the adjacency matrix representation


− but now O(n^2) since we look at every entry of the adjacency matrix

Some applications
− to determine if a given graph is connected
− to identify the connected components of a graph
− to determine if a given graph contains a cycle (see tutorial questions)
− to determine if a given graph is bipartite (see tutorial questions)

Breadth first search/traversal (BFS)
Search fans out as widely as possible at each vertex
− from the current vertex, visit all the adjacent vertices
this is referred to as processing the current vertex
− vertices are processed in the order in which they are visited
− continue until all vertices in current component have been processed
− then repeat for other components
(if there are any)

Again the edges traversed form a spanning tree (or forest)


− a breadth-first spanning tree (forest)
− spanning tree of a graph is tree composed of all the vertices and some
(or perhaps all) of the edges of the graph
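
For concreteness, a sketch of a bfs method for the Graph class of the
previous slides, reusing the visited and pred fields introduced for DFS;
a queue holds vertices that have been visited but not yet processed:

import java.util.LinkedList;
import java.util.Queue;

/** carry out a breadth first search/traversal of the graph */
public void bfs() {
    for (Vertex v : vertices) v.setVisited(false);  // initialise
    Queue<Vertex> queue = new LinkedList<>();
    for (Vertex v : vertices) {
        if (!v.getVisited()) {                      // start of a new component
            v.setVisited(true);
            v.setPred(-1);                          // root of this BF spanning tree
            queue.add(v);
            while (!queue.isEmpty()) {
                Vertex u = queue.remove();          // process u
                for (AdjListNode node : u.getAdjList()) {
                    int i = node.getVertexIndex();
                    if (!vertices[i].getVisited()) { // visit adjacent vertex
                        vertices[i].setVisited(true);
                        vertices[i].setPred(u.getIndex());
                        queue.add(vertices[i]);
                    }
                }
            }
        }
    }
}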

Breadth first traversal - Example
[figure: an undirected graph G and a breadth-first spanning tree of G on
vertices a-h]

Analysis – Breadth first search
Complexity
− each vertex is visited and queued exactly once
− each adjacency list is traversed once
− so overall O(n+m) (n is the number of vertices and m number of edges)
− can adapt to adjacency matrix representation but O(n^2) (as for DFS)

Example application
− finding the distance between two vertices, say v and w, in a graph
− the distance is the number of edges in the shortest path from v to w
− assign distance to v to be 0
− carry out a breadth-first search from v
− when visiting a new vertex for first time, assign its distance to be
1 + the distance to its predecessor in the BF spanning tree
− stop when w is reached

Distance between two vertices - Example
Distance between v and w
− assign distance to v to be 0
− carry out a breadth-first search from v
− when visiting a new vertex for first time
assign its distance to be 1 + the distance
to its predecessor in the BF spanning tree

[figure: an example graph; the number beside each vertex indicates its
distance from v (0, 1, 2, 3), and the shortest path from v to w has length 3]

Weighted graphs
Each edge e has an integer weight given by wt(e)>0
− graph may be undirected or directed
− weight may represent length, cost, capacity, etc
− if an edge is not part of the graph its weight is infinity

[figure: a weighted graph on vertices u,v,w,x,y,z with the edge weights
shown in the weight matrix on the next slide]

Example: cost of sending a message down a particular edge


− could be a monetary cost or some combination of time and distance
− can be used to formulate the shortest path problem for routing packets
Weighted graphs - Representation
Adjacency matrix becomes weight matrix
Adjacency lists include the weight in each node

[figure: the weighted graph from the previous slide]

adjacency matrix
adjacency list
u v w x y z
u 0 4 5 7 0 0 u:v(4)➝w(5)➝x(7)
v 4 0 5 6 0 0 v:u(4)➝w(5)➝x(6)
w 5 5 0 0 4 5 w:u(5)➝v(5)➝y(4)➝z(5)
x 7 6 0 0 8 6 x:u(7)➝v(6)➝y(8)➝z(6)
y 0 0 4 8 0 7 y:w(4)➝x(8)➝z(7)
z 0 0 5 6 7 0 z:w(5)➝x(6)➝y(7)

Weighted graphs - Shortest Paths
Given a weighted (un)directed graph and two vertices u and v
find a shortest path between u and v (for directed from u to v)
− where the length of a path is the sum of the weights of its edges

Example: weights are distances between airports


− shortest path between San Francisco and Miami

Applications include:
− flight reservations
− internet packet routing
− driving directions

Edsger Dijkstra, in an interview in 2010...
"… the algorithm for the shortest path, which I designed in
about 20 minutes. One morning I was shopping in Amsterdam
with my young fiancé, and tired, we sat down on the cafe
terrace to drink a cup of coffee, and I was just thinking about
whether I could do this, and I then designed the algorithm
for the shortest path."
Dijkstra, E.W. A note on two problems in Connexion with graphs.
Numerische Mathematik 1, 269–271 (1959)
Dijkstra describes the algorithm in English in 1956 (he was 26 years old)
− most people were programming in assembly language
− only one high-level language: Fortran by John Backus at IBM and not quite finished

No big O notation in 1959, in the paper, Dijkstra says: “my solution is preferred
to another one … the amount of work to be done seems considerably less.”

Dijkstra’s algorithm (as seen in NOSE2)
Algorithm finds shortest path between one vertex u and all others
− based on maintaining a set S containing all vertices for which the shortest
path from u is currently known
− S initially contains only u (obviously the shortest path from u to u has length 0)
− eventually S contains all the vertices (so all shortest paths are known)

Each vertex v has a label d(v) indicating the length of a shortest
path between u and v passing only through vertices in S
− if no such path exists then we set d(v) to infinity
− if v is in S, then d(v) is the length of the shortest path between u and v
− at each step we add to S the vertex v not in S such that d(v) is minimum
− after having added a vertex v to S, carry out edge relaxation operations
i.e. we update the length d(w) for all vertices w still not in S
• d(w) is the length of a shortest path between u and w passing only
through vertices in S
• and S has changed since we have added vertex v to S

Invariant of the algorithm: if v is in S and w is not, then the length of
the shortest path between u and w is at least that between u and v
− in particular, d(w) ≥ d(v) for the vertex v most recently added to S
Dijkstra’s algorithm – Edge relaxation
Each vertex v has a label d(v) indicating the length of a shortest
path between u and v passing only through vertices in S
− suppose v and w are not in S then we know
• the shortest path between u and v passing only through S equals d(v)
• the shortest path between u and w passing only through S equals d(w)
− now suppose v is added to S and the edge e = {v,w} has weight wt(e)
− calculate the shortest path between u and w passing only through S∪{v}

the shortest path is either:
− the original path through S, of length d(w)
− the path combining edge e and the shortest path between u and v,
which has length d(v) + wt(e)

therefore the length is updated to: d(w) = min{ d(w), d(v) + wt(e) }

Dijkstra’s algorithm – Pseudo code

// S is set of vertices for which shortest path with u is known


// d(w) represents length of a shortest path between u and w
// passing only through vertices of S

S = {u}; // initialise S
for (each vertex w) d(w) = wt(u,w); // initialise lengths

while (S != V){ // still vertices to add to S


find v not in S with d(v) minimum;
add v to S;
for (each w not in S and adjacent to v) // perform relaxation
d(w) = min{ d(w) , d(v)+wt(v,w) };
}
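
For concreteness, a runnable sketch of this pseudocode; an assumption of
this example is that the graph is given as an n×n weight matrix wt, with a
large INF value standing for ‘infinity’ when an edge is absent (this is the
unordered-array version analysed on the next slide):

public class Dijkstra {
    static final int INF = Integer.MAX_VALUE / 2;  // 'infinity', safe to add to

    /** lengths of shortest paths from u to every vertex */
    public static int[] shortestPaths(int[][] wt, int u) {
        int n = wt.length;
        boolean[] inS = new boolean[n];            // the set S
        int[] d = new int[n];
        for (int w = 0; w < n; w++) d[w] = wt[u][w]; // initialise lengths
        d[u] = 0;
        inS[u] = true;                             // S = {u}
        for (int step = 1; step < n; step++) {     // while S != V
            int v = -1;                            // find v not in S with d(v) minimum
            for (int w = 0; w < n; w++)
                if (!inS[w] && (v == -1 || d[w] < d[v])) v = w;
            inS[v] = true;                         // add v to S
            for (int w = 0; w < n; w++)            // perform relaxation
                if (!inS[w] && wt[v][w] < INF)
                    d[w] = Math.min(d[w], d[v] + wt[v][w]);
        }
        return d;
    }
}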

Dijkstra’s algorithm – Complexity
Analysis (n vertices and m edges) using unordered array for lengths
− O(n) to initialise lengths
− finding minimum is O(n^2) overall
• each time it takes O(n) and there are n-1 to find
− relaxation is O(m) overall
• each edge is considered once and updating a length takes O(1)
• note: we are not considering each iteration of the while loop but overall ops
hence O(n^2) overall (the number of edges is at most n(n-1))
Dijkstra’s algorithm – Complexity
Analysis (n vertices and m edges) using a heap for lengths
− O(n) to initialise lengths and create heap
− finding minimum is O(n log n) overall
• each time it takes O(log n) and there are n-1 to find
− relaxation is O(m log n) overall
• each edge is considered once and updating a length takes O(log n)
• note: this involves updating a specific value in the heap, not the root,
so care must be taken (need to keep track of positions of vertices in the heap)
hence O(m log n) overall (assuming at least as many edges as vertices)
− note: a graph with n vertices has O(n^2) edges
Spanning trees
Spanning tree:
− subgraph (subset of edges) which is both a tree and ‘spans’ every vertex
− a spanning tree is obtained from a connected graph by deleting edges
− the weight of a spanning tree is the sum of the weights of its edges

Problem: for a weighted connected undirected graph, find a


minimum weight spanning tree
− this represents the ‘cheapest’ way of interconnecting the vertices

Applications include:
− design of networks for computer, telecommunications, transportation,
gas, electricity, ...
− clustering, approximating the travelling salesman problem

Weighted graphs – Example – Spanning tree
Weighted graph G and two of its spanning trees
− a spanning tree is a subgraph which is both a tree and ‘spans’ every vertex
− it is obtained by deleting edges while the remaining edges still span the
vertices; when no more edges can be deleted we have a tree

[figure: the weighted graph G from earlier together with two of its
spanning trees, one of weight 28 and one of weight 24]

Minimum weight spanning tree problem
An example of a problem in combinatorial optimisation
− find ‘best’ way of doing something among a (large) number of candidates
− can always be solved, at least in theory, by exhaustive search
− however this may be infeasible in practice
− typically an exponential-time algorithm
− e.g. Kn (the clique of size n) has n^(n-2) spanning trees (Cayley’s formula)
• recall: a graph is a clique if every pair of vertices is joined by an edge

− a much more efficient algorithm may be possible


and is true in the case of minimum weight spanning trees

Minimum weight spanning tree problem
The Prim-Jarnik minimum spanning tree algorithm


− an example of a greedy algorithm
− it makes a sequence of decisions based on local optimality
− and ends up with the globally optimal solution

For many problems, greedy algorithms do not yield optimal solution


− see examples later in the course

The Prim-Jarnik algorithm
Min spanning tree is constructed by choosing a sequence of edges:
set an arbitrary vertex r to be a tree-vertex (tv);
set all other vertices to be non-tree-vertices (ntv);
while (number of ntv > 0){
find edge e = {p,q} of graph such that
p is a tv;
q is an ntv;
wt(e) is minimised over such edges;
adjoin edge e to the (spanning) tree;
make q a tv;
}
Analysis (n is the number of vertices)
− initialisation O(n) (n operations to set vertices to be tv or ntv)
− the outer loop is executed n-1 times
− the inner loop checks all edges from a tree-vertex to a non-tree-vertex
− there can be O(n^2) of these each time so overall the algorithm is O(n^3)

The Prim-Jarnik algorithm – Example

[figure: the weighted graph G from earlier and a minimum spanning tree for
G of weight 24, grown edge by edge from the starting vertex u]

Dijkstra’s refinement
Introduce an attribute bestTV for each non-tree vertex (ntv) q
− bestTV is set to the tree vertex (tv) p for which wt({p,q}) is minimised

set an arbitrary vertex r to be a tree-vertex (tv);


set all other vertices to be non-tree-vertices (ntv);
for (each ntv s) set s.bestTV = r; // r is the only tv

while (size of ntv > 0){


find ntv q for which wt({q, q.bestTV}) is minimal;
adjoin {q, q.bestTV} to the tree;
make q a tv;

for (each ntv s) update s.bestTV;
// update bestTV as tree vertices have changed
}
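
For concreteness, a runnable sketch of the refined algorithm; assumptions of
this example: the graph is a connected n×n symmetric weight matrix wt with
INF for missing edges, and only the total weight of the tree is returned
(recording the edges needs just a little more bookkeeping):

public class PrimJarnik {
    static final int INF = Integer.MAX_VALUE / 2;  // 'infinity'

    /** weight of a minimum spanning tree, O(n^2) bestTV version */
    public static int mstWeight(int[][] wt, int r) {
        int n = wt.length;
        boolean[] tv = new boolean[n];       // tree vertices
        int[] bestTV = new int[n];           // best tree vertex for each ntv
        tv[r] = true;                        // r is the only tv initially
        for (int s = 0; s < n; s++) bestTV[s] = r;
        int total = 0;
        for (int step = 1; step < n; step++) {
            int q = -1;                      // ntv q with wt({q, q.bestTV}) minimal
            for (int s = 0; s < n; s++)
                if (!tv[s] && (q == -1 || wt[s][bestTV[s]] < wt[q][bestTV[q]])) q = s;
            total += wt[q][bestTV[q]];       // adjoin edge {q, q.bestTV}
            tv[q] = true;                    // make q a tv
            for (int s = 0; s < n; s++)      // update bestTV: compare with new tv q
                if (!tv[s] && wt[s][q] < wt[s][bestTV[s]]) bestTV[s] = q;
        }
        return total;
    }
}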

Dijkstra’s refinement – Analysis
− initialisation is O(n)
− the while loop is executed n-1 times
− the first part takes O(n)
• O(n) to find the minimal ntv and O(1) to adjoin the edge and make q a tv
− the second part (inner loop) also takes O(n)
• for each ntv s we only need to compare the weights for s.bestTV and the
new tree vertex (i.e. q) to update the value of s.bestTV
− overall the algorithm is O(n^2)
Dijkstra’s refinement - Example

[figure: the weighted graph G and its minimum spanning tree of weight 24,
together with a table tracing, for each ntv q, the values of q.bestTV and
wt({q.bestTV, q}) as the algorithm runs]

The Prim-Jarnik algorithm – Correctness
Is the algorithm correct ?
− i.e. does it return a minimum weight spanning tree for any graph G

Proof will not be part of the exam

Proof:
− suppose for graph G the algorithm returns the tree T
− compare T with a minimum spanning tree X of G
− if they are the same we are happy (it is a minimum weight spanning tree)
− therefore remains to consider the case when they are different…

The Prim-Jarnik algorithm – Correctness
Suppose that T and X are different
− T is the tree returned by the algorithm and X a minimum spanning tree of G
− let e be the first edge chosen to be in T that is not in X
− adding e to X we get a cycle C (since X is a spanning tree)
− let S be the set of tree vertices (tvs) at the point when the algorithm
selected e; by definition of the algorithm one end of the edge e is in S
and the other is not in S
− it follows that C must have another edge f that connects a vertex in S
with one that is not, i.e. a tv with a ntv
− we also have wt(f) ≥ wt(e), since the algorithm picks e and not f
− we can replace f by e in X to get another spanning tree Y
− since wt(f) ≥ wt(e), the weight of Y cannot be greater than that of X,
and since X is minimal, Y is minimal
− continuing the process we can convert X to T maintaining minimality,
which proves that T is indeed a minimal spanning tree
− hence the algorithm is correct

[figures: T and X drawn side by side; adding e to X creates the cycle C,
which crosses the boundary between S and the remaining vertices and so
must contain another crossing edge f]

Directed Acyclic Graphs - Topological ordering
A Directed Acyclic Graph (DAG) is a directed graph with no cycles

A topological order on a DAG is a labelling of the vertices 1,…,n


such that (u,v)∈E implies label(u)<label(v)
− many applications, e.g. scheduling, PERT networks, deadlock detection

A directed graph D has a topological order if and only if it is a DAG


− obviously impossible if D has a cycle (try to label the vertices in a cycle)

A source is a vertex of in-degree 0 and a sink has out-degree 0

Basic fact: a DAG has at least one source and at least one sink
− forms the basis of a topological ordering algorithm

Directed Acyclic Graphs - Example
Directed acyclic graph D
− with more than one source and more than one sink

[figure: a DAG D with its source vertices (in-degree equals 0) and sink
vertices (out-degree equals 0) highlighted, and a topological ordering of D
labelling the vertices 1,…,9]

A topological order on a DAG is a labelling of the vertices 1,…,n
such that (u,v) ∈ E implies label(u) < label(v)

Topological ordering algorithm
// assume each vertex has 2 integer attributes: label and count
// count is the number of incoming edges from unlabelled vertices
// label will give the topological ordering

for (each vertex v) v.setCount(v.getInDegree()); // initial count values

Set up an empty sourceQueue

for (each vertex v) // add vertices with no incoming edges to the queue
if (v.getCount() == 0) add v to sourceQueue; // i.e. source vertices

int nextLabel = 1; // initialise labelling (gives topological ordering)


while (sourceQueue is non-empty){
dequeue v from sourceQueue;
v.setLabel(nextLabel++); // label vertex (and increment nextLabel)
for (each w with (v,w) ∈ E){ // consider each vertex w adjacent from v
w.setCount(w.getCount() - 1); // update attribute count
// add w to source queue if no incoming edges from unlabelled vertices remain
if (w.getCount() == 0) add w to sourceQueue;
}
}
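
For concreteness, a runnable sketch of this algorithm; an assumption of this
example is that the digraph is given as adjacency lists in an int[][] array,
where adj[v] lists the vertices adjacent from v. It returns the labels, or
null when some vertex cannot be labelled, i.e. the digraph contains a cycle
(see deadlock detection below):

import java.util.ArrayDeque;
import java.util.Queue;

public class TopologicalOrder {

    /** labels 1,...,n giving a topological order, or null if there is a cycle */
    public static int[] order(int[][] adj) {
        int n = adj.length;
        int[] count = new int[n];              // incoming edges from unlabelled vertices
        for (int[] list : adj)
            for (int w : list) count[w]++;     // initial counts are the in-degrees
        Queue<Integer> sourceQueue = new ArrayDeque<>();
        for (int v = 0; v < n; v++)
            if (count[v] == 0) sourceQueue.add(v);  // source vertices
        int[] label = new int[n];
        int nextLabel = 1;
        while (!sourceQueue.isEmpty()) {
            int v = sourceQueue.remove();
            label[v] = nextLabel++;            // label vertex
            for (int w : adj[v])               // each w adjacent from v
                if (--count[w] == 0) sourceQueue.add(w);
        }
        return (nextLabel == n + 1) ? label : null; // null: cycle detected
    }
}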
Topological ordering - Example

[figure: a DAG D on vertices r,s,t,u,v,w,x,y,z; the trace shows the source
queue as the algorithm runs, and a resulting topological ordering of D
labels the vertices 1,…,9]

Topological ordering algorithm - Correctness
A vertex is given a label only when the number of incoming edges
from unlabelled vertices is zero
− all predecessor vertices must already be labelled with smaller numbers
− dependent on using a queue (first in first out for labelling)

Analysis (n vertices, m edges)


• for adjacency matrix representation
− finding the in-degree of each vertex is O(n^2) (scan each column)
− main loop is executed n times and within it one row is scanned: O(n)
− so overall the algorithm is O(n^2)

Topological ordering algorithm - Analysis
Analysis (n vertices, m edges)
• for adjacency lists representation
− finding in-degree of each vertex is O(n+m) (scan adjacency lists)
− main loop is executed n times and within it one list is scanned
(and the same list is never scanned twice)
− so overall the algorithm is O(n+m)

Deadlock detection
Determining whether a digraph contains a cycle

Method 1 (an adaptation of the topological ordering algorithm)


− if the source queue becomes empty before all vertices are labelled,
then there must be a cycle
− if all vertices can be labelled, then the digraph is acyclic

Method 2 (an adaptation of depth-first-search)


− when a vertex u is ‘visited’ check whether there is an edge from u to a
vertex v which is on the current path from the current starting vertex
− the existence of such a vertex indicates a cycle
(adaptation of depth first search since need to ‘remember’ current path)
− see tutorials for more detail

Algorithmics I 2022

Algorithmics I

Section 3 – Strings and text algorithms

Dr. Gethin Norman


School of Computing Science
University of Glasgow
[email protected]



Text compression
A special case of data compression
− saves disk space and transmission time

Text compression must be lossless


− i.e. the original must be recoverable without error

original file ➝ compression algorithm ➝ compressed file
compressed file ➝ decompression algorithm ➝ original file (unchanged)

Some other forms of compression can afford to be lossy


− e.g. for pictures, sound, etc. (not considered here)

Text compression
Examples of text compression
− compress, gzip in Unix, ZIP utilities for Windows, …
− two main approaches: statistical and dictionary

Compression ratio: x/y


− x is the size of compressed file and y is the size of original file
− e.g. measured in B, KB, MB, …
− compressing a 10MB file to 2MB would yield a compression ratio of 2/10=0.2

Percentage space saved: (1 – “compression ratio”)×100%


− space saved expressed as a percentage of the original file size
− compressing a 10MB file to 2MB yields a percentage space savings of 80%

Space savings in the range 40% - 60% are typical


− obviously the higher the saving the better the compression
Text compression – Huffman encoding
The classical statistical method
− now mostly superseded in practice by more effective dictionary methods
− fixed (ASCII) code replaced by variable length code for each character
− every character is represented by a unique codeword (bit string)
− frequently occurring characters are represented by shorter codewords

The code has the prefix property


− no codeword is a prefix of another (gives unambiguous decompression)

Based on a Huffman tree (a proper binary tree)


− each character is represented by a leaf node
− codeword for a character is given by the path from the root to the
appropriate leaf (left=0 and right=1)
− the prefix property follows from this

Huffman tree construction - Example
Space E A T I S R O N U H C D
Character frequencies: 15 11 9 8 7 7 7 6 4 3 2 1 1

Next, while there is more than one parentless node


− add new parent to nodes of smallest weight
− weight of new node equals the sum of the weights of the child nodes

[Huffman tree diagram: root 81 = 49 + 32; 49 = 28 + 21; 32 = Space(15) + 17;
28 = 14 + 14; 21 = E(11) + 10; 17 = T(8) + A(9); 14 = I(7) + S(7);
14 = R(7) + 7; 10 = O(6) + N(4); 7 = 4 + U(3); 4 = 2 + H(2); 2 = C(1) + D(1)]

Algorithmics I, 2022 5
3 Strings and text algorithms 105
Huffman tree construction - Pseudocode
// set up the leaf nodes
for (each distinct character c occurring in the text){
make a new parentless node n;
int f = frequency count for c;
n.setWeight(f); // weight equals the frequency
n.setCharacter(c); // set character value
// leaf so no children
n.setLeftChild(null);
n.setRightChild(null);
}
// construct the branch nodes and links
while (no. of parentless nodes > 1){
make a new parentless node z; // new node
x, y = 2 parentless nodes of minimum weight; // its children
z.setLeftChild(x); // set x to be the left child of new node
z.setRightChild(y); // set y to be the right child of new node
int w = x.getWeight()+y.getWeight(); // calculate weight of node
z.setWeight(w); // set the weight of the new node
}
// the final node z is root of Huffman tree

Algorithmics I, 2022 6
3 Strings and text algorithms 106
Huffman code - Example
Space E A T I S R O N U H C D
Character frequencies: 15 11 9 8 7 7 7 6 4 3 2 1 1

Huffman tree: [as constructed on the previous slide]

Huffman code:
Space 10    E 010     A 111      T 110
I 0000      S 0001    R 0011     O 0110     N 0111
U 00101     H 001001  C 0010000  D 0010001
Algorithmics I, 2022 7
3 Strings and text algorithms 107
Huffman code - Example
Space E A T I S R O N U H C D
Character frequencies: 15 11 9 8 7 7 7 6 4 3 2 1 1

Huffman tree: [as above]

prefix property: no codeword is a prefix of another
equivalently: no path to one character is a prefix of
another (since characters are only found at leaves)
Algorithmics I, 2022 8
3 Strings and text algorithms 108
Huffman encoding - Optimality
Weighted path length (WPL) of a tree T
− ∑ (weight)×(distance from root) where sum is over all leaf nodes
− for the example tree: WPL equals: 7×4 + 7×4 + 1×7 + 1×7 + 2×6 +
3×5 + 7×4 + 11×3 + 6×4 + 4×4 + 15×2 + 8×3 + 9×3 = 279

[Huffman tree as above; leaf depths: Space 2; E, T, A 3; I, S, R, O, N 4; U 5; H 6; C, D 7]

Algorithmics I, 2022 9
3 Strings and text algorithms 109
Huffman encoding - Optimality
Weighted path length (WPL) of a tree T
− ∑ (weight)×(distance from root) where sum is over all leaf nodes
− for the example tree: WPL equals: 7×4 + 7×4 + 1×7 + 1×7 + 2×6 +
3×5 + 7×4 + 11×3 + 6×4 + 4×4 + 15×2 + 8×3 + 9×3 = 279

Huffman tree has minimum WPL over all binary trees with the
given leaf weights
− Huffman tree need not be unique (e.g. when more than two nodes have minimum weight)
− however all Huffman trees for a given set of frequencies have same WPL
− so what?
− weighted path length (WPL) is the number of bits in compressed file
• bits = sum over chars (frequency of char × code length of char)
− so a Huffman tree minimises this number
− hence Huffman coding is optimal among all codes built in this way (prefix codes)
Algorithmics I, 2022 10
3 Strings and text algorithms 110
Huffman encoding – Algorithmic requirements
Building the Huffman tree
− if the text length equals n and there are m distinct chars in text
− O(n) time to find the frequencies
− O(m log m) time to construct the code, for example using a (min-)heap
to store the parentless nodes and their weights
• initially build a heap where nodes correspond to the m characters labelled
by their frequencies, therefore takes O(m) time to build the heap
• one iteration takes O(log m) time:
• find and remove (O(log m)) two minimum weights
• then insert (O(log m)) new weight (sum of minimum weights found)
• and there are m-1 iterations before the heap is empty
• each iteration decreases the size of the heap by 1
− so O(n + m log m) overall
− in fact, m is essentially a constant, so it is really O(n)
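A possible Java sketch of the heap-based construction using java.util.PriorityQueue; the Node class is an assumption matching the earlier pseudocode, not the notes' own code:

import java.util.*;

class Node {                                     // assumed node type
    int weight; char character; Node left, right;
    Node(int w, char c) { weight = w; character = c; }   // leaf node
    Node(Node x, Node y) {                                // branch node
        weight = x.weight + y.weight; left = x; right = y;
    }
}

/** builds the Huffman tree from the m character frequencies in O(m log m) */
static Node buildHuffmanTree(Map<Character, Integer> freq) {
    PriorityQueue<Node> heap =                   // min-heap ordered by weight
        new PriorityQueue<>((a, b) -> a.weight - b.weight);
    for (Map.Entry<Character, Integer> e : freq.entrySet())
        heap.add(new Node(e.getValue(), e.getKey()));     // one leaf per character
    while (heap.size() > 1) {                    // m-1 iterations
        Node x = heap.remove();                  // two parentless nodes of minimum
        Node y = heap.remove();                  // weight: O(log m) each
        heap.add(new Node(x, y));                // new parent, weight = sum: O(log m)
    }
    return heap.remove();                        // root of the Huffman tree
}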

Algorithmics I, 2022 11
3 Strings and text algorithms 111
Huffman encoding – Algorithmic requirements
Compression & decompression are both O(n) time
− assuming m is constant

Compression uses a code table (an array of codes, indexed by char)


− O(m log m) to build the table:
• m characters so m paths of length O(log m)
− O(n) to compress: n characters in the text so n lookups in the array, each O(1)
− so O(m log m) + O(n) overall

Decompression uses the tree directly (repeatedly trace paths in tree)


− O(n log m) as n characters so n paths of length O(log m)
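A possible sketch of decompression reusing the Node type above; representing the compressed file as an array of bits is a simplifying assumption:

/** decodes a bit sequence by repeatedly tracing paths in the Huffman tree */
static String decompress(Node root, boolean[] bits) {
    StringBuilder out = new StringBuilder();
    Node node = root;
    for (boolean bit : bits) {
        node = bit ? node.right : node.left;     // left = 0, right = 1
        if (node.left == null && node.right == null) {
            out.append(node.character);          // reached a leaf: one character decoded
            node = root;                         // restart at the root
        }
    }
    return out.toString();
}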

Algorithmics I, 2022 12
3 Strings and text algorithms 112
Huffman encoding – Algorithmic requirements
Problem: some representation of the Huffman tree must be stored
with the compressed file
− otherwise decompression would be impossible

Alternatives
− use a fixed set of frequencies based on typical values for text
• but this will usually reduce the compression ratio
− use adaptive Huffman coding: the (same) tree is built and adapted by the
compressor and by the decompressor as characters are encoded/decoded
• this slows down compression and decompression (but not by much if
done in a clever way)

Algorithmics I, 2022 13
3 Strings and text algorithms 113
LZW compression
A popular dictionary-based method
− the basis of compress and gzip in Unix also used in gif and tiff formats
− due to Lempel, Ziv and Welch
− algorithm was patented by Unisys (but the patent has now expired)

The dictionary is a collection of strings


− each with a codeword that represents it
− the codeword is a bit pattern
− but it can be interpreted as a non-negative integer

Whenever a codeword is output during compression, what is


written to the compressed file is the bit pattern
− using a number of bits determined by the current codeword length
− so at any point all bit patterns are the same length

Algorithmics I, 2022 14
3 Strings and text algorithms 114
LZW compression
The dictionary is built dynamically during compression
− and also during decompression

Initially dictionary contains all possible strings of length 1

Throughout the dictionary is closed under prefixes


− i.e. if the string s is represented in the dictionary, so is every prefix of s

It follows that a trie is an ideal representation of the dictionary


− every node in the trie represents a 'word' in the dictionary
− a trie is effective and efficient for other reasons too

Algorithmics I, 2022 15
3 Strings and text algorithms 115
LZW compression
Key question: how many bits are in a codeword?
− in the most used version of the algorithm, this value changes as the
compression (or decompression) algorithm proceeds

At any given time during compression (or decompression)


− there is a current codeword length k
− so there are exactly 2k distinct codewords available
• i.e. all possible bit-strings of length k
− this limits the size of the dictionary
− however the codeword length can be incremented when necessary
− thereby doubling the number of available codewords
− initial value of k should be large enough to encode all strings of length 1

Algorithmics I, 2022 16
3 Strings and text algorithms 116
LZW compression – Pseudo code
set current text position i to 0;
initialise codeword length k (say to 8);
initialise the dictionary d;

while (the text t is not exhausted) {

identify the longest string s, starting at position i of text t


that is represented in the dictionary d;
// there is such a string, as all strings of length 1 are in d

output codeword for the string s; // using k bits

// move to the next position in the text


i += s.length(); // move forward by the length of string just encoded
c = character at position i in t; // character in next position

add string s+c to dictionary d, paired with next available codeword;


// may have to increment the codeword length k to make this possible
}
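A possible Java sketch of compression; as simplifying assumptions (not the notes' implementation) it uses a Map from strings to integer codewords instead of a trie, assumes an ASCII alphabet, and ignores the bit-level packing of codewords:

import java.util.*;

/** returns the sequence of codewords for text t (codewords as plain integers) */
static List<Integer> lzwCompress(String t) {
    Map<String, Integer> d = new HashMap<>();
    int nextCode = 0;
    for (char c = 0; c < 128; c++)               // all strings of length 1
        d.put(String.valueOf(c), nextCode++);
    List<Integer> out = new ArrayList<>();
    int i = 0;
    while (i < t.length()) {
        int j = i + 1;                           // grow the match while it stays in d
        while (j <= t.length() && d.containsKey(t.substring(i, j))) j++;
        String s = t.substring(i, j - 1);        // longest string in the dictionary
        out.add(d.get(s));                       // output its codeword
        i += s.length();                         // move past s
        if (i < t.length())                      // if the text is not exhausted,
            d.put(s + t.charAt(i), nextCode++);  // add s + next character
    }
    return out;
}

The repeated substring lookups make this sketch quadratic in the worst case; a trie-based dictionary avoids this by extending the current match one character at a time.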

Algorithmics I, 2022 17
3 Strings and text algorithms 117
LZW compression - Variants
Constant codeword length: fix the codeword length for all time
− the dictionary has fixed capacity: when full, just stop adding to it

Dynamic codeword length (the version described here)


− start with shortest reasonable codeword length, say, 8 for normal text
− whenever dictionary becomes full
• add 1 to current codeword length (doubles the number of codewords)
• does not affect the sequence of codewords already output
− may specify a maximum codeword length, as increasing the size
indefinitely may become counter-productive

LRU version: when dictionary full and codeword length maximal


− current string replaces Least Recently Used string in dictionary

Algorithmics I, 2022 18
3 Strings and text algorithms 118
LZW compression - Example
Text = G A C G A T A C G A T A C G
File size = 14 bytes, or 28 bits if 2 bits/char

Compressed file: 10 000 001 100 011 0101 0111 1001


file size = 26 bits

step   position   longest string   output   add to       code
       in text    in dictionary    bits     dictionary
1 1 G 10 GA 4
2 2 A 000 AC 5
3 3 C 001 CG 6
4 4 GA 100 GAT 7
5 6 T 011 TA 8
6 7 AC 0101 ACG 9
7 9 GAT 0111 GATA 10
8 12 ACG 1001 - -

Algorithmics I, 2022 19
3 Strings and text algorithms 119
LZW decompression
Decompression algorithm builds same dictionary as compression
algorithm
− but one step out of phase

Algorithmics I, 2022 20
3 Strings and text algorithms 120
LZW decompression – Pseudo code
initialise codeword length k;
initialise the dictionary;

read the first codeword x from the compressed file f; // i.e. read k bits
String s = d.lookUp(x); // look up codeword in dictionary
output s; // output decompressed string

while (f is not exhausted){

String oldS = s; // remember last string decompressed (Strings are immutable)

if (d is full) k++; // dictionary full so increase the code word length

get next codeword x from f; // i.e. read k bits


s = d.lookUp(x); // look up codeword in dictionary
output s; // output decompressed string

String newS = oldS + s.charAt(0); // string to add to dictionary


add string newS to dictionary d paired with next available codeword;
}

Algorithmics I, 2022 21
3 Strings and text algorithms 121
LZW decompression - Example
Compressed file: 10000001100011010101111001
file size = 26 bits

Uncompressed Text = G A C G A T A C G A T A C G

step   position   old      code from   string from   add to       code
       in file    string   file        dictionary    dictionary
0 1 - 10 G - -
1 3 G 000 A GA 4
2 6 A 001 C AC 5
3 9 C 100 GA CG 6
4 12 GA 011 T GAT 7
5 15 T 0101 AC TA 8
6 19 AC 0111 GAT ACG 9
7 23 GAT 1001 ACG GATA 10

Algorithmics I, 2022 22
3 Strings and text algorithms 122
LZW decompression – Special case
It is possible to encounter a codeword that is not (yet) in the
dictionary
− because decompression is ‘out of phase’ with compression
− but in that case it is possible to deduce what string it must represent
− consider: A A B A B A B A A
and work through compression and decompression for this text

The solution: if (lookUp fails) s = oldS + oldS.charAt(0);

Example of this special case is available on moodle
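A possible Java sketch of decompression including the special case, under the same simplifying assumptions as the compression sketch above (codewords as plain ints, bit-level details ignored); tracing it on the codewords produced for A A B A B A B A A reproduces that text:

import java.util.*;

/** rebuilds the text from the codeword sequence, one step out of phase */
static String lzwDecompress(List<Integer> codes) {
    List<String> d = new ArrayList<>();          // codeword = index into this list
    for (char c = 0; c < 128; c++) d.add(String.valueOf(c));
    StringBuilder out = new StringBuilder();
    String s = d.get(codes.get(0));              // first codeword: a length-1 string
    out.append(s);
    for (int k = 1; k < codes.size(); k++) {
        String oldS = s;                         // last string decompressed
        int x = codes.get(k);
        if (x < d.size()) s = d.get(x);          // codeword already in the dictionary
        else s = oldS + oldS.charAt(0);          // special case: deduce the string
        out.append(s);
        d.add(oldS + s.charAt(0));               // same entry the compressor added
    }
    return out.toString();
}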

Algorithmics I, 2022 23
3 Strings and text algorithms 123
LZW decompression
Appropriate data structure for decompression is a simple table

Complexity of compression and decompression both O(n)


− for a text of length n (if suitably implemented)
− algorithms essentially involves just one pass through the text

Algorithmics I, 2022 24
3 Strings and text algorithms 124
Strings - Notation
For a string s=s0s1…sm-1
− m is the length of the string
− s[i] is the (i+1)th element of the string, i.e. si
− s[i..j] is the substring from the ith to jth position, i.e. sisi+1…sj

Prefixes and suffixes


− jth prefix is the first j characters of s denoted s[0..j-1]
• i.e. s[0..j-1] = s0s1…sj-1
• s[0..0-1]=s[0..-1] (the 0th prefix) is the empty string
− jth suffix is the last j characters of s denoted s[m-j..m-1]
• i.e. s[m-j..m-1] = sm-jsm-j+1…sm-1
• s[m..m-1] (the 0th suffix) is the empty string

Algorithmics I, 2022 25
3 Strings and text algorithms 125
String comparison
Fundamental question: how similar, or how different, are 2 strings?
− applications include:
• biology (DNA and protein sequences)
• file comparison (diff in Unix, and other similar file utilities)
• spelling correction, speech recognition,…

A more precise formulation:


given strings s=s0s1…sm-1 and t=t0t1…tn-1 of lengths m and n,
what is the smallest number of basic operations needed to transform s to t?

‘Basic’ operations for transforming strings:


− insert a single character
− delete of a single character
− substitute one character by another

Algorithmics I, 2022 26
3 Strings and text algorithms 126
String comparison – String distance
The distance between s and t is defined to be the smallest
number of basic operations needed to transform s to t
− for example consider the strings s and t

s: a b a d c d b
t: a c b a c a c b

− we can show an alignment between s and t that illustrates how 4 steps


would suffice to transform s into t
− hence the distance between s and t is less than or equal to 4

insert ‘c’ delete ‘d’ substitute ‘a’ for ‘d’ insert ‘c’

s: a - b a d c d - b
t: a c b a - c a c b

Algorithmics I, 2022 27
3 Strings and text algorithms 127
String comparison – String distance
The distance between s and t is defined to be the smallest
number of basic operations needed to transform s into t
− for example for the strings

s: a b a d c d b
t: a c b a c a c b

the distance between s and t is less than or equal to 4

s: a - b a d c d - b
t: a c b a - c a c b

But could it be done in 3 steps?


− the answer is no, proof later based on our algorithm to find the
distance for any two strings, so above alignment is an optimal alignment

Algorithmics I, 2022 28
3 Strings and text algorithms 128
String comparison – String distance
More complex models are possible
− e.g. we can allocate a cost to each basic operation
− our methods adapt easily but we will stick to the unit-cost model

String comparison algorithms use dynamic programming


− the problem is solved by building up solutions to sub-problems of ever
increasing size
− often called the tabular method (it builds up a table of relevant values)
− eventually, one of the values in the table gives the required answer

The dynamic programming technique has applications to many


different problems

Algorithmics I, 2022 29
3 Strings and text algorithms 129
String distance – Dynamic programming
Recall the ith prefix of string s is the first i characters of s
− let d(i,j) be the distance between ith prefix of s and the jth prefix of t
− distance between s and t is then d(m,n)
(since s and t of lengths m and n)

The basis of dynamic programming method is a recurrence relation


− more precisely we define the distance d(i,j) between ith prefix of s and
the jth prefix of t in terms of the distance between shorter prefixes
• i.e. in terms of the distances d(i-1,j-1), d(i,j-1) and d(i-1,j)

− in the base cases we set d(i,0)=i and d(0,j)=j for all i≤m and j≤n
− since the distance from/to an empty string to/from a string of length k
is equal to k (we require k insertions/deletions)

Algorithmics I, 2022 30
3 Strings and text algorithms 130
String distance – Dynamic programming
In an optimal alignment of the ith prefix of s with the jth prefix of t
the last position of the alignment must either be of the form:

s:  *                          -   *   *
t:  *   if s[i-1] = t[j-1],    *   -   $   otherwise

where - is a gap, while * and $ are arbitrary but different characters

In this case, no operations are required and the distance is given by


that between the i-1th and j-1th prefixes of s and t

d(i-1,j-1) if s[i-1]=t[j-1]
d(i,j) =
otherwise

Algorithmics I, 2022 31
3 Strings and text algorithms 131
String distance – Dynamic programming
In an optimal alignment of the ith prefix of s with the jth prefix of t
the last position of the alignment must either be of the form:

s:  *                          -   *   *
t:  *   if s[i-1] = t[j-1],    *   -   $   otherwise

where - is a gap, while * and $ are arbitrary but different characters

In this case, insert an element into s and the distance is given by 1 (for the
insertion) plus the distance between the ith prefix of s and the j-1th prefix of t

d(i-1,j-1) if s[i-1]=t[j-1]
d(i,j) =
1 + min{ d(i,j−1) } otherwise

Algorithmics I, 2022 32
3 Strings and text algorithms 132
String distance – Dynamic programming
In an optimal alignment of the ith prefix of s with the jth prefix of t
the last position of the alignment must either be of the form:

s:  *                          -   *   *
t:  *   if s[i-1] = t[j-1],    *   -   $   otherwise

where - is a gap, while * and $ are arbitrary but different characters

In this case, delete an element from s and distance given by 1 plus


distance between the i-1th prefix of s and the jth prefix of t

d(i-1,j-1) if s[i-1]=t[j-1]
d(i,j) =
1 + min{ d(i,j−1), d(i−1,j), } otherwise

Algorithmics I, 2022 33
3 Strings and text algorithms 133
String distance – Dynamic programming
In an optimal alignment of the ith prefix of s with the jth prefix of t
the last position of the alignment must either be of the form:

s:  *                          -   *   *
t:  *   if s[i-1] = t[j-1],    *   -   $   otherwise

where - is a gap, while * and $ are arbitrary but different characters

In this case, substitute an element in s and distance given by 1


plus the distance between the i-1th prefix of s and the j-1th prefix of t

d(i-1,j-1) if s[i-1]=t[j-1]
d(i,j) =
1 + min{ d(i,j−1), d(i−1,j), d(i−1,j−1) } otherwise

Algorithmics I, 2022 34
3 Strings and text algorithms 134
String distance – Dynamic programming
In an optimal alignment of the ith prefix of s with the jth prefix of t
the last position of the alignment must either be of the form:

s:  *                          -   *   *
t:  *   if s[i-1] = t[j-1],    *   -   $   otherwise

where - is a gap, while * and $ are arbitrary but different characters

We take the minimum when s[i-1]≠t[j-1] as we want the optimal


(minimal) distance

d(i-1,j-1) if s[i-1]=t[j-1]
d(i,j) =
1 + min{ d(i,j−1), d(i−1,j), d(i−1,j−1) } otherwise

Algorithmics I, 2022 35
3 Strings and text algorithms 135
String distance – Dynamic programming
The complete recurrence relation is given by:

d(i-1,j-1) if s[i-1]=t[j-1]
d(i,j) =
1+min{ d(i,j−1),d(i−1,j),d(i−1,j−1)} otherwise

subject to d(i,0)=i and d(0,j)=j for all i≤m and j≤n
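A possible Java sketch of the tabular method that follows directly from the recurrence (illustrative, not from the notes):

/** returns the distance between strings s and t using the recurrence above */
static int distance(String s, String t) {
    int m = s.length(), n = t.length();
    int[][] d = new int[m + 1][n + 1];
    for (int i = 0; i <= m; i++) d[i][0] = i;    // base cases: distance to/from
    for (int j = 0; j <= n; j++) d[0][j] = j;    // the empty string
    for (int i = 1; i <= m; i++)
        for (int j = 1; j <= n; j++)
            if (s.charAt(i - 1) == t.charAt(j - 1))
                d[i][j] = d[i - 1][j - 1];       // match: no operation needed
            else                                 // insertion, deletion or substitution
                d[i][j] = 1 + Math.min(d[i][j - 1],
                          Math.min(d[i - 1][j], d[i - 1][j - 1]));
    return d[m][n];
}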

Algorithmics I, 2022 36
3 Strings and text algorithms 136
String distance – Dynamic programming

The dynamic programming algorithm for string distance comes


immediately from the formula
− fill in the entries of an (m+1)×(n+1) table row by row, and column by column

Time and space complexity both O(mn)


− a consequence of the size of the table
− can easily reduce the space complexity to O(m+n)
− just keep the most recent entry in each column of the table

But what about obtaining an optimal alignment?


− can use a ‘traceback’ in the table (see example below)
− less obvious how this can be done using only O(m+n) space
− but in fact it turns out that it's still possible (Hirschberg's algorithm)

Algorithmics I, 2022 37
3 Strings and text algorithms 137
String distance - Example
s\t 0 1 2 3 4 5 6 7 8
a c b a c a c b
0 0 1 2 3 4 5 6 7 8
1 a 1 0 1 2 3 4 5 6 7
2 b 2 1 1 1 2 3 4 5 6
3 a 3 2 2 2 1 2 3 4 5
4 d 4 3 3 3 2 2 3 4 5
5 c 5 4 3 4 3 2 3 3 4
6 d 6 5 4 4 4 3 3 4 4
7 b 7 6 5 4 5 4 4 4 4

The entries are calculated one by one by application of the formula


− the final table: d(7,8)=4 so the string distance is 4

Algorithmics I, 2022 38
3 Strings and text algorithms 138
String distance – Dynamic programming
The traceback phase used to construct an optimal alignment
− trace a path in the table from bottom right to top left
− draw an arrow from an entry to the entry that led to its value

Interpretation
− vertical steps as deletions
− horizontal steps as insertions
− diagonal steps as matches or substitutions
• a match if the distance does not change and a substitution otherwise

The traceback is not necessarily unique


− since there can be more than one optimal alignment

Algorithmics I, 2022 39
3 Strings and text algorithms 139
String distance – Example (traceback)
s\t 0 1 2 3 4 5 6 7 8
a c b a c a c b
0 0 1 2 3 4 5 6 7 8
1 a 1 0 1 2 3 4 5 6 7
2 b 2 1 1 1 2 3 4 5 6
3 a 3 2 2 2 1 2 3 4 5
4 d 4 3 3 3 2 2 3 4 5
5 c 5 4 3 4 3 2 3 3 4
6 d 6 5 4 4 4 3 3 4 4
7 b 7 6 5 4 5 4 4 4 4

s: a - b a d - c d b
t: a c b a c a c - b
Corresponding alignment: step: d h d d d h d v d
(d=diagonal, v = vertical, h = horizontal)
Algorithmics I, 2022 40
3 Strings and text algorithms 140
String/pattern search
Searching a (long) text for a (short) string/pattern
− many applications including
• information retrieval
• text editing
• computational biology

Many variants, such as exact or approximate matches


− first occurrence or all occurrences
− one text and many strings/patterns
− many texts and one string/pattern

We describe three different solutions to the basic problem:


− given a text t (of length n) and a string/pattern s (of length m)
− find the position of the first occurrence (if it exists) of s in t
− usually n is large and m is small
Algorithmics I, 2022 41
3 Strings and text algorithms 141
String search – Brute force algorithm
Given a text t (of length n) and a string/pattern s (of length m) find
the position of the first occurrence (if any) of s in t

The naive brute force algorithm


− also known as exhaustive search (as we simply test all possible positions)
− set the current starting position in the text to be zero
− compare text and string characters left-to-right until the entire string is
matched or a character mismatches
− in the case of a mismatch
advance the starting position in the text by 1 and repeat
− continue until a match is found or the text is exhausted

Algorithms expressed with char arrays rather than strings in Java

Algorithmics I, 2022 42
3 Strings and text algorithms 142
String search – Brute force algorithm
/** return smallest k such that s occurs in t starting at position k */
public int bruteForce (char[] s, char[] t){
int m = s.length; // length of string/pattern
int n = t.length; // length of text
int sp = 0; // starting position in text t
int i = 0; // curr position in text
int j = 0; // curr position in string/pattern s
while (sp <= n-m && j < m) { // not reached end of text/string
if (t[i] == s[j]){ // chars match
i++; // move on in text
j++; // move on in string/pattern
} else { // a mismatch
j = 0; // start again in string
sp++; // advance starting position
i = sp; // back up in text to new starting position
}
}
if (j == m) return sp; // occurrence found (reached end of string)
else return -1; // no occurrence (reached end of text)
}

Algorithmics I, 2022 43
3 Strings and text algorithms 143
String search – Brute force algorithm
Worst case is no better than O(mn)
− e.g. search for s = aa … ab in t = aa ... aaaa … ab
length m length n

− m character comparisons needed at each of the n-m+1 starting positions in the text


before the text/pattern is found

Typically, the number of comparisons from each point will be small


− often just 1 comparison needed to show a mismatch
− so we can expect O(n) on average

Challenges: can we find a solution that is…


1. linear, i.e. O(m+n) in the worst case?
2. (much) faster than brute force on average?

Algorithmics I, 2022 44
3 Strings and text algorithms 144
String search – KMP algorithm
The Knuth-Morris-Pratt (KMP) algorithm
− addresses first challenge: linear (O(m+n)) in the worst case

It is an on-line algorithm
− i.e., it removes the need to back-up in the text
− involves pre-processing the string to build a border table
− border table: an array b with entry b[j] for each position j of the string

If we get a mismatch at position j in the string/pattern


− we remain on the current text character (do not back-up)
− the border table tells us which string character should next be compared
with the current text character

Algorithmics I, 2022 45
3 Strings and text algorithms 145
String search – KMP algorithm
A substring of string s is a sequence of consecutive characters of s
− if s has length n, then s[i..j] is a substring for i and j with 0≤i≤j≤n-1

A prefix of s is a substring that begins at position 0


− i.e. s[0..j] for any j with 0≤j≤n-1

A suffix of s is a substring that ends at position n-1


− i.e. s[i..n-1] for any i with 0≤i≤n-1

A border of a string s is a substring that is both a prefix and a suffix


and cannot be the string itself
− e.g. s = a c a c g a t a c a c
− a c and a c a c are borders and a c a c is the longest border
Many strings have no border
− we then say that the empty string ε (of length 0) is the longest border

Algorithmics I, 2022 46
3 Strings and text algorithms 146
String search – Border table
KMP algorithm requires the border table of the string pattern
− a border of a string s is a substring that is both a prefix and a suffix and
cannot be the string itself
Border table b: array which has the same size as the string
− b[j] = the length of the longest border of s[0..j-1]
= max { k | s[0..k-1] = s[j-k..j-1] ∧ k<j }

Example
string/pattern s a b a b a c a

j 0 1 2 3 4 5 6

b[j] 0 0 0 1 2 3 0

− no common prefix/suffix of ababac so set to 0
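A possible Java sketch of computing the border table in O(m) time; this is the standard method whose details the notes omit later, so treat it as an illustration:

/** b[j] = length of the longest border of s[0..j-1] */
static int[] borderTable(char[] s) {
    int m = s.length;
    int[] b = new int[m];                        // b[0] = b[1] = 0 by default
    for (int j = 2; j < m; j++) {
        int k = b[j - 1];                        // longest border of s[0..j-2]
        while (k > 0 && s[k] != s[j - 1]) k = b[k]; // fall back to shorter borders
        b[j] = (s[k] == s[j - 1]) ? k + 1 : 0;   // extend the border if possible
    }
    return b;
}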

Algorithmics I, 2022 47
3 Strings and text algorithms 147
String search – Brute force versus KMP
Example - Mismatch between s and t at position 9 in s
jnew j

0 1 2 3 4 5 6 7 8 9 10 11 12 13
string/pattern s a g a g c a g a g a g c a g
text t a g a g c a g a g t * * * * …

inew i
Applying the brute force algorithm, after the mis-match:
− s has to be ‘moved along’ one position relative to t
− then we start again at position 0 in s and jump back j-1 positions in t

Algorithmics I, 2022 48
3 Strings and text algorithms 148
String search – Brute force versus KMP
Example - Mismatch between s and t at position 9 in s
j

0 1 2 3 4 5 6 7 8 9 10 11 12 13
string/pattern s a g a g c a g a g a g c a g
text t a g a g c a g a g t * * * * …

i
Applying the KMP algorithm, after the mis-match:
− s has to be ‘moved along’ until the characters to the left of i again match

Algorithmics I, 2022 49
3 Strings and text algorithms 149
String search – Brute force versus KMP
mis-match
j
string/pattern s s[0..j-1] $ …

text t … s[0..j-1] * …
i
Need to move s along until the characters to the left of i match
therefore need start of s[0..j-1] to match end of s[0..j-1]
− therefore use longest border of s[0..j-1]
− i.e. longest substring that is both a prefix and a suffix of s[0..j-1]

string/pattern s $ …

text t … * …

Algorithmics I, 2022 50
3 Strings and text algorithms 150
String search – Brute force versus KMP
Example - Mismatch between s and t at position 9 in s
jnew j

0 1 2 3 4 5 6 7 8 9 10 11 12 13
string/pattern s a g a g c a g a g a g c a g
text t a g a g c a g a g * * * * * …

i
Applying the KMP algorithm, after the mis-match:
− s has to be ‘moved along’ until the characters to the left of i again match
− this determines the new value of j, the value of i is unchanged
− length of the longest border of s[0..j-1] is 4 in this case
• i.e. longest substring that is both a prefix and a suffix of s[0..j-1]
− so the new value of j is 4

Algorithmics I, 2022 51
3 Strings and text algorithms 151
String search – Brute force versus KMP
Example - Mismatch between s and t at position 9 in s
jnew j

0 1 2 3 4 5 6 7 8 9 10 11
string/pattern s t g a g c a g a g a g c
text t t g a g c a g a g t * * * * …

i
Applying the KMP algorithm, after the mis-match:
− s has to be ‘moved along’ until the characters to the left of i again match

If we cannot move s along to get a match, then we need to


− reset j (i.e. return to the start of the string) and i remains unchanged

Algorithmics I, 2022 52
3 Strings and text algorithms 152
String search – Brute force versus KMP
Example - Mismatch between s and t at position 0 in s
j

0 1 2 3 4 5 6 7 8 9 10 11 12 13
string/pattern s t g a g c a g a g a g c a g
text t a g a g c a g a g t * * * * …

i inew
Applying the KMP algorithm, after the mis-match:
− s has to be ‘moved along’ until the characters to the left of i again match

If we cannot move s along to get a match, then we need to


− reset j (i.e. return to the start of the string) and i remains unchanged
− unless j is already 0 and in this case increment i

Algorithmics I, 2022 53
3 Strings and text algorithms 153
KMP search - Implementation
/** return smallest k such that s occurs from position k in t or -1 if no k exists */
public int kmp(char[] t, char[] s) {
int m = s.length; // length of string/pattern
int n = t.length; // length of text
int i = 0; // current position in text
int j = 0; // current position in string s
int [] b = new int[m]; // create border table
setUp(s, b); // set up the border table from the string/pattern
while (i < n) { // not reached end of text
if (t[i] == s[j]){ // if positions match
i++; // move on in text
j++; // move on in string
if (j == m) return i - j; // reached end of string so a match
} else { // mismatch adjust current position in string using the border table
if (b[j] > 0) // there is a common prefix/suffix
j = b[j]; // change position in string (position in text unchanged)
else { // no common prefix/suffix
if (j == 0) i++; // move forward one position in text if not advanced
else j = 0; // else start from beginning of the string
}
}
}
return -1; // no occurrence
}
54
Algorithmics I, 2022
3 Strings and text algorithms 154
KMP - Example

string/pattern s a b a b a c a

text t b a c b a b a b a b a c a a b

String/pattern has been found position in string j=6

string/pattern s a b a b a c a

j 0 1 2 3 4 5 6

b[j] 0 0 0 1 2 3 0

Algorithmics I, 2022 55
3 Strings and text algorithms 155
KMP search - Analysis
while (i<n)
if (t[i] == s[j]){
i++; j++;
}
else {
if (b[j]>0) j = b[j];
else {
if (j == 0) i++;
else j = 0;
}
}

For the complexity we need to know the number of loop iterations


Consider values of i and k (where k=i-j) during the iterations
− clearly i≤n and since j is never negative we also have k≤n
− in each iteration either i or k is incremented and neither is decremented

Algorithmics I, 2022 56
3 Strings and text algorithms 156
KMP search - Analysis
while (i<n)
if (t[i] == s[j]){
i++; j++;
}
else {
if (b[j]>0) j = b[j];
else {
if (j == 0) i++;
else j = 0;
}
}

For the complexity we need to know the number of loop iterations


Consider values of i and k (where k=i-j) during the iterations
− clearly i≤n and since j is never negative we also have k≤n
− in each iteration either i or k is incremented and neither is decremented
• i++ > i and (i++)-(j++) = i-j

Algorithmics I, 2022 57
3 Strings and text algorithms 157
KMP search - Analysis
while (i<n)
if (t[i] == s[j]){
i++; j++;
}
else {
if (b[j]>0) j = b[j];
else {
if (j == 0) i++;
else j = 0;
}
}

For the complexity we need to know the number of loop iterations


Consider values of i and k (where k=i-j) during the iterations
− clearly i≤n and since j is never negative we also have k≤n
− in each iteration either i or k is incremented and neither is decremented
• i = i and i-b[j] > i-j
• since b[j]<j as b[j] longest border in a string of length j

Algorithmics I, 2022 58
3 Strings and text algorithms 158
KMP search - Analysis
while (i<n)
if (t[i] == s[j]){
i++; j++;
}
else {
if (b[j]>0) j = b[j];
else {
if (j == 0) i++;
else j = 0;
}
}

For the complexity we need to know the number of loop iterations


Consider values of i and k (where k=i-j) during the iterations
− clearly i≤n and since j is never negative we also have k≤n
− in each iteration either i or k is incremented and neither is decremented
• i++ > i and (i++)-j > i-j

Algorithmics I, 2022 59
3 Strings and text algorithms 159
KMP search - Analysis
while (i<n)
if (t[i] == s[j]){
i++; j++;
}
else {
if (b[j]>0) j = b[j];
else {
if (j == 0) i++;
else j = 0;
}
}

For the complexity we need to know the number of loop iterations


Consider values of i and k (where k=i-j) during the iterations
− clearly i≤n and since j is never negative we also have k≤n
− in each iteration either i or k is incremented and neither is decremented
• i = i and i-0 > i-j
• since j>0 must hold for the else case to be taken

Algorithmics I, 2022 60
3 Strings and text algorithms 160
KMP search - Analysis
while (i<n)
if (t[i] == s[j]){
i++; j++;
}
else {
if (b[j]>0) j = b[j];
else {
if (j=0) i++;
else j = 0;
}
}

For the complexity we need to know the number of loop iterations


Consider values of i and k (where k=i-j) during the iterations
− clearly i≤n and since j is never negative we also have k≤n
− in each iteration either i or k is incremented and neither is decremented
− so the number of iterations of the loop is at most 2n
Hence KMP is O(n) in the worst case
Algorithmics I, 2022 61
3 Strings and text algorithms 161
KMP search - Analysis
KMP search is O(n) in the worst case

Creating the border table


− naïve method requires O(j^2) steps to evaluate b[j] giving O(m^3) overall
− a more efficient method is possible that requires just O(m) steps in total
involves a subtle application of the KMP algorithm (details are omitted)

Overall complexity of KMP search


− KMP can be implemented to run in O(m+n) time
− O(m) for setting up the border table
− O(n) for conducting the search

Have addressed challenge 1


− KMP algorithm is linear (i.e. O(m+n))

Algorithmics I, 2022 62
3 Strings and text algorithms 162
Boyer-Moore Algorithm
Challenge 1: can we find a solution that is linear in the worst case?

Yes: KMP

Challenge 2: can we find a solution that is (much) faster than brute


force on average?

Boyer-Moore: almost always faster than brute force or KMP


− variants are used in many applications
− typically, many text characters are skipped without even being checked
− the string/pattern is scanned right-to-left
− text character involved in a mismatch is used to decide next comparison

Algorithmics I, 2022 63
3 Strings and text algorithms 163
Boyer-Moore Algorithm – Example
Search for ‘pill’ in ‘the caterpillar’

the caterpillar
pill
   ^

Search for string from right to left


− start by comparing mth element of text with last character of string
m is the length of the string, i.e. equals 4

Algorithmics I, 2022 64
3 Strings and text algorithms 164
Boyer-Moore Algorithm – Example
Search for ‘pill’ in ‘the caterpillar’

the caterpillar
         pill
            ^

Search for string from right to left


− continue search from the last position in the string
− ‘p’ matches and we have found the string in the text

Algorithmics I, 2022 65
3 Strings and text algorithms 165
Boyer-Moore Algorithm – Simplified version
The string is scanned right-to-left
− text character involved in a mismatch is used to decide next comparison
− involves pre-processing the string to record the position of the last
occurrence of each character c in the alphabet
− therefore the alphabet must be fixed in advance of the search

Last occurrence position of character c in the string s


− equals max{k | s[k]=c } if such a k exists and -1 otherwise

Want to store last occurrence position of c in an array element p[c]


− in Java a char used as an array index is implicitly widened to an int,
so p[c] works directly; alternatively the static method
Character.getNumericValue(c) can be used to compute an appropriate array index

Simplified version (often called the Boyer–Moore–Horspool algorithm)

Algorithmics I, 2022 66
3 Strings and text algorithms 166
Boyer-Moore Algorithm – Simplified version
In our pseudocode we assume an array p[c] indexed by characters
− the characters range over the underlying alphabet of the text
− p[c] records the position in the string of the last occurrence of char c
− if the character c is absent from the string s, then let p[c]=-1

Assume ASCII character set (128 characters)


− for Unicode (more than 107,000 characters), p would be a large array
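A possible Java sketch of the set-up, assuming an ASCII alphabet and using the char directly as an array index (illustrative, not from the notes):

/** p[c] = position of the last occurrence of c in s, or -1 if c is absent */
static int[] lastOccurrence(char[] s) {
    int[] p = new int[128];                      // one entry per ASCII character
    for (int c = 0; c < 128; c++) p[c] = -1;     // -1: character absent from s
    for (int k = 0; k < s.length; k++)
        p[s[k]] = k;                             // later occurrences overwrite earlier
    return p;
}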

On finding a mismatch there is a jump step in the algorithm


− if the mismatch is between s[j] and t[i]
− ‘slide s along’ so that position p[t[i]] of s aligns with t[i]
• i.e. align last position in s of character t[i] with position i of t
− if this moves s in the ‘wrong direction’, instead move s one position right
− if t[i] does not appear in string, ‘slide string’ past t[i]
• i.e. align position -1 of s with position i of t
Algorithmics I, 2022 67
3 Strings and text algorithms 167
Boyer-Moore Algorithm – Jump step – Case 1
Assume a mismatch between position s[j] and position t[i]
Case 1: the last position of character t[i] in s is before position j
sp i
text t * * * * * * a * * * * * * …

string/pattern s . . a . . . b . .

p[t[i]] j
reminder: p[t[i]] records the position
in s the last occurrence of character t[i]

− i records the current position in the text we are checking


− j records the current position in the string we are checking
− sp records the current starting position of string in the text

Algorithmics I, 2022 68
3 Strings and text algorithms 168
Boyer-Moore Algorithm – Jump step – Case 1
Assume a mismatch between position s[j] and position t[i]
Case 1: the last position of character t[i] in s is before position j
sp i inew
text t * * * * * * a * * * * * * …

string/pattern s . . a . . . b . .
j

(m-1)-p[t[i]]
p[t[i]]
m-1
− i records the current position in the text we are checking
− new value of i equals i+(m-1)-p[t[i]]

Algorithmics I, 2022 69
3 Strings and text algorithms 169
Boyer-Moore Algorithm – Jump step – Case 1
Assume a mismatch between position s[j] and position t[i]
Case 1: the last position of character t[i] in s is before position j
sp i
text t * * * * * * a * * * * * * …

string/pattern s . . a . . . b . .
jnew
p[t[i]] j

− j records the current position in the string we are checking


− new value of j equals m-1 (start again from the end of the string/pattern)

Algorithmics I, 2022 70
3 Strings and text algorithms 170
Boyer-Moore Algorithm – Jump step – Case 1
Assume a mismatch between position s[j] and position t[i]
Case 1: the last position of character t[i] in s is before position j
sp spnew
i
text t * * * * * * a * * * * * * …

string/pattern s . . a . . . b . .

p[t[i]] j

j-p[t[i]]

− sp records the current starting position of string in the text


− new value of sp equals sp+j-p[t[i]] as this is the amount the pattern/
string has been moved forward

Algorithmics I, 2022 71
3 Strings and text algorithms 171
Boyer-Moore Algorithm – Jump step – Case 2
Assume a mismatch between position s[j] and position t[i]
Case 2: last position of character t[i] in s is at least at position j
sp i
text t * * * * a * * * * * * * …

string/pattern s . . b . . . a
. . .
j p[t[i]]

move string along by one place and start again from the end of the string

− i records the current position in the text we are checking


− j records the current position in the string we are checking
− sp records the current starting position of string in the text

Algorithmics I, 2022 72
3 Strings and text algorithms 172
Boyer-Moore Algorithm – Jump step – Case 2
Assume a mismatch between position s[j] and position t[i]
Case 2: last position of character t[i] in s is at least at position j
sp i inew
text t * * * * a * * * * * * * …

string/pattern s . . b . . . a . .
j position m-1
position j-1
(m-1)–(j-1)

− i records the current position in the text we are checking


− new value of i equals i+(m-1)–(j-1) = i+(m-j)

Algorithmics I, 2022 73
3 Strings and text algorithms 173
Boyer-Moore Algorithm – Jump step – Case 2
Assume a mismatch between position s[j] and position t[i]
Case 2: last position of character t[i] in s is at least at position j
sp i
text t * * * * a * * * * * * * …

string/pattern s . . b . . . a . .
jnew
j

− j records the current position in the string we are checking


− new value of j equals m-1

Algorithmics I, 2022 74
3 Strings and text algorithms 174
Boyer-Moore Algorithm – Jump step – Case 2
Assume a mismatch between position s[j] and position t[i]
Case 2: last position of character t[i] in s is at least at position j
sp spnew
i
text t * * * * a * * * * * * * …

string/pattern s . . b . . . a . .
j

− sp records the current starting position of string in the text


− new value of sp equals sp+1

Algorithmics I, 2022 75
3 Strings and text algorithms 175
Boyer-Moore Algorithm – Jump step – Case 3
Assume a mismatch between position s[j] and position t[i]
Case 3: character t[i] does not appear in s (i.e. we have p[j]=-1)
sp i
text t * * * * * * a * * * * * * * * .
* …

string/pattern s . . . . . . b . .

− i records the current position in the text we are checking


− j records the current position in the string we are checking
− sp records the current starting position of string in the text

Algorithmics I, 2022 76
3 Strings and text algorithms 176
Boyer-Moore Algorithm – Jump step – Case 3
Assume a mismatch between position s[j] and position t[i]
Case 3: character t[i] does not appear in s (i.e. we have p[j]=-1)
sp i inew
text t * * * * * * a * * * * * * * * .
* …

string/pattern s . . . . . . b . .

m-1

− i records the current position in the text we are checking


− new value of i equals i+m

Algorithmics I, 2022 77
3 Strings and text algorithms 177
Boyer-Moore Algorithm – Jump step – Case 3
Assume a mismatch between position s[j] and position t[i]
Case 3: character t[i] does not appear in s (i.e. we have p[j]=-1)
sp i inew
text t * * * * * * a * * * * * * * * .
* …

string/pattern s . . . . . . b . .
jnew
j

− j records the current position in the string we are checking


− new value of j equals m-1 (start again from the end of the string/pattern)

Algorithmics I, 2022 79
3 Strings and text algorithms 179
Boyer-Moore Algorithm – Jump step – Case 3
Assume a mismatch between position s[j] and position t[i]
Case 3: character t[i] does not appear in s (i.e. we have p[j]=-1)
sp spnew
i inew
text t * * * * * * a * * * * * * * * .
* …

string/pattern s . . . . . . b . .

j+1
− sp records the current starting position of string in the text
− new value of sp equals sp+(j+1) as this is the amount the pattern/
string has been moved forward

Algorithmics I, 2022 80
3 Strings and text algorithms 180
Boyer-Moore Algorithm – All cases
Case 1: p[t[i]]<j and p[t[i]]≥0
− new value of i equals i+m-1-p[t[i]]
− new value of j equals m-1
− new value of sp equals sp+j-p[t[i]]
Case 2: p[t[i]]>j
− new value of i equals i+m-j
− new value of j equals m-1
− new value of sp equals sp+1
Case 3: p[t[i]]=-1
− new value of i equals i+m
− new value of j equals m-1
− new value of sp equals sp+j+1

Note: p[t[i]] cannot equal j, as p[t[i]] is the last position of character t[i]
in s and there is a mismatch between t[i] and s[j]

Algorithmics I, 2022 81
3 Strings and text algorithms 181
Boyer-Moore Algorithm – All cases
We find that we can express these updates as follows:
− new value of i equals i + m – min(1+p[t[i]],j)
− new value of j equals m-1
− new value of sp equals sp + max(j-p[t[i]],1)

You do not need to learn these updates, just how the algorithm works
− this is sufficient for running it on an example (as you saw)
− and for working out what the updates are if needed (again as you saw)

Algorithmics I, 2022 82
3 Strings and text algorithms 182
Boyer-Moore Algorithm - Implementation
/** return smallest k such that s occurs at k in t or -1 if no k exists */
public int bm(char[] t, char[] s) {
int m = s.length; // length of string/pattern
int n = t.length; // length of text
int sp = 0; // current starting position of string in text
int i = m-1; // current position in text
int j = m-1; // current position in string/pattern
int[] p = new int[128]; // a suitable last-occurrence array (assuming ASCII)
setUp(s, p); // set up the last occurrence array
while (sp <= n-m && j >= 0) {
if (t[i] == s[j]){ // current characters match
i--; // move back in text
j--; // move back in string
} else { // current characters do not match
sp += Math.max(1, j - p[t[i]]); // slide string along (see jump-step cases)
i += m - Math.min(j, 1 + p[t[i]]);
j = m-1; // return to end of string
}
}
if (j < 0) return sp; else return -1; // occurrence found yes/no
}
Algorithmics I, 2022 83
3 Strings and text algorithms 183
Boyer-Moore Algorithm - Complexity
Worst case is no better than O(mn)
− e.g. search for s = ba … aa in t = aa … aaaa … aa
length m length n

− m character comparisons needed at each of the n-m+1 starting positions in the text


before the text/pattern is found

There is an extended version which is linear, i.e. O(m+n)


− this uses the good suffix rule (sometimes called ‘magic’)

Algorithmics I, 2022 84
3 Strings and text algorithms 184
Algorithmics I 2022

Algorithmics I

Section 4 – NP completeness

Dr. Gethin Norman


School of Computing Science
University of Glasgow
[email protected]

4 NP Completeness 185
Some efficient algorithms we have seen

We have seen algorithms for a wide range of problems so far, giving


us a spectrum of worst-case complexity functions:

• searching a sorted list O(log n) (for an array/list of length n)

• finding the max value O(n) (for an array/list of length n)

• sorting O(n log n) (for an array/list of length n)

• distance between two strings O(n^2) (for two strings of length n)

• finding a shortest path O(n^2) (for weighted graph with n vertices)

These are all examples of problems that admit polynomial-time


algorithms: their worst-case complexity is O(n^c) for some constant c

Algorithmics I, 2022 2
4 NP Completeness 186
Recall the Eulerian cycle problem (AF2)

G undirected graph: decide whether G admits an Euler cycle


− an Eulerian cycle is a cycle that traverses each edge exactly once

[figure: a connected graph whose edges are numbered 1 to 8 in the order they
are traversed by an Euler cycle]

Theorem (Euler, 1736). A connected undirected graph has an


Euler cycle if and only if each vertex has even degree
therefore we can test whether G has an Euler cycle (and find one) in:
− O(n^2) time if G is represented by an adjacency matrix
− O(m+n) time if G is represented by adjacency lists
− where m=|E| and n=|V|

Algorithmics I, 2022 3
4 NP Completeness 187
Recall the Hamiltonian cycle problem (AF2)

G undirected graph, decide whether G admits an Hamiltonian cycle


− a Hamiltonian cycle is a cycle that visits each vertex exactly once

This problem is superficially similar to the Euler cycle problem


− however in an algorithmic sense it is very different
− nobody has found a polynomial-time algorithm for Hamiltonian cycle

Algorithmics I, 2022 4
4 NP Completeness 188
Recall the Hamiltonian cycle problem (AF2)
Brute force algorithm:
− generate all permutations of vertices
− check each one to see if it is a cycle, i.e. corresponding edges are present

Complexity of the algorithm (n is the number of vertices)


− n! permutations will be generated in the worst case
− for each permutation π, O(n^2) operations to check whether π is a
Hamiltonian cycle (assuming G is represented by adjacency lists)
• worst case: to check an edge is present have to traverse adjacency list of
length n-1 and have n edges to check

Therefore worst-case number of operations is O(n^2 × n!)


− this is an example of an exponential algorithm
− an algorithm whose time complexity is no better than O(b^n) for some
constant b (and so cannot be expressed as O(n^c) for any constant c)
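A possible Java sketch of the brute force algorithm; as a simplifying assumption it uses an adjacency matrix, so each edge test is O(1) rather than the O(n) list scan assumed in the analysis above:

/** returns true if the graph (adjacency matrix adj) has a Hamiltonian cycle */
static boolean hamiltonianCycle(boolean[][] adj) {
    int n = adj.length;
    int[] perm = new int[n];
    for (int i = 0; i < n; i++) perm[i] = i;
    return tryPerms(adj, perm, 1);               // fix vertex 0 as the starting point
}

static boolean tryPerms(boolean[][] adj, int[] perm, int k) {
    int n = perm.length;
    if (k == n) {                                // a complete permutation:
        for (int i = 0; i < n; i++)              // check all cycle edges are present
            if (!adj[perm[i]][perm[(i + 1) % n]]) return false;
        return true;
    }
    for (int i = k; i < n; i++) {                // generate permutations by swapping
        int tmp = perm[k]; perm[k] = perm[i]; perm[i] = tmp;
        if (tryPerms(adj, perm, k + 1)) return true;
        tmp = perm[k]; perm[k] = perm[i]; perm[i] = tmp; // swap back
    }
    return false;
}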

Algorithmics I, 2022 5
4 NP Completeness 189
Polynomial versus exponential time
Table shows running time of algorithms with various complexities
(assuming 10^9 operations per second)

        n = 20      n = 40       n = 50          n = 60            n = 70
n       .00001 sec  .00003 sec   .00004 sec      .00005 sec        .00006 sec
n^2     .0001 sec   .0009 sec    .0016 sec       .0025 sec         .0036 sec
n^3     .001 sec    .027 sec     .064 sec        .125 sec          .216 sec
n^5     .1 sec      24.3 secs    1.7 mins        5.2 mins          13.0 mins
2^n     .001 sec    17.9 mins    12.7 days       35.7 years        366 centuries
3^n     .059 sec    6.5 years    3855 centuries  2×10^8 centuries  1.3×10^13 centuries
n!      3.6 secs    8.4×10^16 centuries  2.6×10^32 centuries  9.6×10^48 centuries  2.6×10^66 centuries

As n grows, distinction between polynomial and exponential time


algorithms becomes dramatic

Algorithmics I, 2022 6
4 NP Completeness 190
Polynomial versus exponential time
This behaviour still applies even with increases in computing power
− sizes of largest instance solvable in 1 hour on a current computer
− what happens when computers become faster?
        current computer   100 times faster   1000 times faster
n       N1                 100 N1             1000 N1
n^2     N2                 10 N2              31.6 N2
n^3     N3                 4.64 N3            10 N3
n^5     N4                 2.5 N4             3.98 N4
2^n     N5                 N5 + 6.64          N5 + 9.97
3^n     N6                 N6 + 4.19          N6 + 6.29
n!      N7                 ≤ N7 + 1           ≤ N7 + 1

A thousand-fold increase in computing power only adds 6 to the size of the


largest problem instance solvable in 1 hour, for an algorithm with complexity 3^n

Algorithmics I, 2022 7
4 NP Completeness 191
Polynomial versus exponential time

The message:
• Exponential-time algorithms are in general “bad”
− increases in processor speeds do not lead to significant changes in this
slow behaviour when the input size is large
• Polynomial-time algorithms are in general “good”

When we refer to “efficient algorithms” we mean polynomial-time


− often polynomial-time algorithms require some extra insight
− often exponential-time algorithms are variations on exhaustive search

A problem is polynomial-time solvable if it admits a polynomial-time


algorithm

Algorithmics I, 2022 8
4 NP Completeness 192
A brief interlude
You are asked to find a polynomial-time algorithm for the
Hamiltonian cycle problem
− this could be a difficult task, you do not want to have to report:

perhaps instead you could try to prove that the


problem is intractable

“I cannot find an efficient algorithm, I guess I’m too dumb”

Algorithmics I, 2022 9
4 NP Completeness 193
A brief interlude
Definition: a problem Π is intractable if there does not exist a
polynomial-time algorithm that solves Π
− you could try to prove that the Hamiltonian Cycle problem is intractable

it can be very difficult to prove that a problem is


intractable, and such proofs are rare

“I cannot find an efficient algorithm, because no such algorithm is possible!”

Algorithmics I, 2022 10
4 NP Completeness 194
A brief interlude
You could try to prove that the Hamiltonian cycle problem is “just as
hard” as a whole family of other difficult problems

these difficult problems are known as the


NP-complete problems

“I cannot find an efficient algorithm, but neither can all these famous people!”
Algorithmics I, 2022 11
4 NP Completeness 195
A brief interlude
State of the Art for Hamiltonian cycle
− no polynomial-time algorithm has been found
− similarly, no proof of intractability has been found
− the problem is known to be an NP-complete problem

So what can we do in these circumstances?


− search for a polynomial-time algorithm should be given a lower priority
− could try to solve only “special cases” of the problem
− could look for an exponential-time algorithm that does reasonably well in
practice
− could search for a polynomial-time algorithm that meets only some of
the problem specifications

Algorithmics I, 2022 12
4 NP Completeness 196
NP-complete problems
No polynomial-time algorithm is known for a NP-complete problem
− however, if one of them is solvable in polynomial time, then they all are

No proof of intractability is known for a NP-complete problem


− however, if one of them is intractable, then they all are

There is a strong belief in the


community that NP-complete
problems are intractable
− we can think of all of them
as being of equivalent
difficulty

Algorithmics I, 2022 13
4 NP Completeness 197
Intractable problems
Two different causes of intractability (no polynomial algorithm):
1. polynomial time is not sufficient in order to discover a solution
2. solution itself is so large that exponential time is needed to output it

We will be concerned with case 1


− there are intractability proofs for case 1
− some problems have been shown to be undecidable
i.e. no algorithm of any sort could solve them (examples later)
− some decidable problems have been shown to be intractable

Example of case 2:
− consider problem of generating all cycles for a given graph

Algorithmics I, 2022 14
4 NP Completeness 198
Intractable problems - Roadblock
A decidable problem that is intractable: Roadblock
− there are two players: A and B
− there is a network of roads, comprising intersections connected by roads
− each road is coloured either black, blue or green
− some intersections are marked either “A wins” or “B wins”
− a player has a fleet of cars located at intersections
• at most one per intersection

Player A begins, and subsequently players make moves in turn


− by moving one of their cars on one or more roads of the same colour
− a car may not stop at or pass over an intersection which already has a car

The problem is to decide, for a given starting configuration, whether


A can win, regardless of what moves B takes
Algorithmics I, 2022 15
4 NP Completeness 199
Intractable problems – Roadblock - Example

A moves first and A can win, no matter what B does. How?


− A moves (along the green road)
− B moves (along the black road) to try and stop A from winning on its
next turn
(if B does not do this, A could move to the same place and win)

[figure: the road network, with the players’ cars and intersections marked
‘A wins’ and ‘B wins’]
Algorithmics I, 2022 16
4 NP Completeness 200
Intractable problems – Roadblock - Example

A moves first and A can win, no matter what B does. How?


− A moves (along the green road)
− B moves (along the black road) to try and stop A from winning
− but A can still win (by moving along the black road)
[figure: the same network after the moves described above]
Algorithmics I, 2022 17
4 NP Completeness 201
Summary

[diagram: polynomial-time solvable problems  ?  NP-complete problems  ?  intractable problems]

One of the question marks must be an ’equals’ sign, while the other
must be a ’not-equals’ sign

Algorithmics I, 2022 18
4 NP Completeness 202
Problem and problem instances
A problem is usually characterised by (unspecified) parameters
− typically there are infinitely many instances for a given problem
A problem instance is created by giving these parameters values

An NP-complete problem:
− Name: Hamiltonian Cycle (HC)
− Instance: a graph G
− Question: does G contain a cycle that visits
each vertex exactly once?

This is an example of a decision problem


− the answer is ’yes’ or ’no’
− every instance is either a ’yes’-instance or a ’no’-instance

Algorithmics I, 2022 19
4 NP Completeness 203
Other NP-complete problems
Name: Travelling Salesman Decision Problem (TSDP)
Instance: a set of n cities and integer distance d(i,j) between each
pair of cities i, j, and a target integer K
Question: is there a permutation p1p2…pn-1pn of 1,2,…,n such that
d(p1,p2) + d(p2,p3) + … + d(pn-1,pn) + d(pn,p1) ≤ K ?
− i.e. is there a ‘travelling salesman tour’ of length ≤ K

Example: [figure: four cities c1, c2, c3, c4 with distances d(1,2)=10,
d(1,3)=5, d(1,4)=9, d(2,3)=6, d(2,4)=9, d(3,4)=9]
− there is a travelling salesman tour of length 29
• d(1,3)+d(3,2)+d(2,4)+d(4,1)=5+6+9+9=29
− there is no tour of length < 29
The travelling salesman decision problem is NP-complete

Algorithmics I, 2022 20
4 NP Completeness 204
Other NP-complete problems
Name: Clique Problem (CP)
Instance: a graph G and a target integer K
Question: does G contain a clique of size K?
− i.e. a set of K vertices for which there is an edge between all pairs

Example:
− there is a clique of size 4
− there is no clique of size 5

The clique decision problem is NP-complete

Algorithmics I, 2022 21
4 NP Completeness 205
Other NP-complete problems
Name: Graph Colouring Problem (GCP)
Instance: a graph G and a target integer K
Question: can one of K colours be attached to each vertex of G so
that adjacent vertices always have different colours?

Example:
− there is a colouring using 3 colours
− there is no colouring using 2 colours

The graph colouring decision problem is NP-complete


Algorithmics I, 2022 22
4 NP Completeness 206
Other NP-complete problems
Name: Satisfiability (SAT)
Instance: Boolean expression B in conjunctive normal form (CNF)
− CNF: C1 ∧ C2 ∧ … ∧ Cn where each Ci is a clause
− Clause C: (l1 ∨ l2 ∨ … ∨ lm) where each lj is a literal
− Literal l: a variable x or its negation ¬x
Question: is B satisfiable?
− i.e. can values be assigned to the variables that make B true?

Example:
− B = (x1∨x2∨¬x3)∧(¬x1∨x3∨¬x4)∧(¬x2∨x4)∧(x2∨¬x3∨x4)
− B is satisfiable: x1=true, x2=false, x3=true, x4=true
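The check can be written directly as a Java expression (an illustrative snippet):

boolean x1 = true, x2 = false, x3 = true, x4 = true;
boolean B = (x1 || x2 || !x3) && (!x1 || x3 || !x4)
         && (!x2 || x4) && (x2 || !x3 || x4);    // evaluates to true: B is satisfied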

The satisfiability problem is NP-complete

Algorithmics I, 2022 23
4 NP Completeness 207
Optimisation and search problems
An optimisation problem: find the maximum or minimum value
− e.g. the travelling salesman optimisation problem (TSOP) is to find the
minimum length of a tour

A search problem: find some appropriate optimal structure


− e.g. the travelling salesman search problem (TSSP) is to find a minimum
length tour

NP-completeness deals primarily with decision problems


− corresponding to each instance of an optimisation or search problem
− is a family of instances of a decision problem by setting ’target’ values
− almost invariably, an optimisation or search problem can be solved in
polynomial time if and only if the corresponding decision problem can
(we will consider some examples of this in the tutorials)

Algorithmics I, 2022 24
4 NP Completeness 208
The class P
P is the class of all decision problems that can be solved in
polynomial time

Fortunately, many problems are in P


− is there a path of length ≤K from vertex u to vertex v in a graph G?
− is there a spanning tree of weight ≤K in a graph G?
− is a graph G bipartite?
− is a graph G connected?
− deadlock detection: does a directed graph D contain a cycle?
− text searching: does a text t contain an occurrence of a string s?
− string distance: is d(s,t)≤K for strings s and t?
− …

P often extended to include search and optimisation problems

Algorithmics I, 2022 25
4 NP Completeness 209
The class NP

The decision problems solvable in non-deterministic polynomial time


− a non-deterministic algorithm can make non-deterministic choices
• the algorithm is allowed to guess (so when run can give different answers)
− hence is apparently more powerful than a normal deterministic algorithm

P is certainly contained within NP


− a deterministic algorithm is just a special case of a non-deterministic one

But is that containment strict?


− there is no problem known to be in NP and known not to be in P

The relationship between P and NP is the most notorious unsolved


question in computing science
− there is a million dollar prize if you can solve this question

Algorithmics I, 2022 26
4 NP Completeness 210
Non-deterministic algorithms (NDAs)
Such an algorithm has an extra operation: non-deterministic choice

int nonDeterministicChoice(int n)
// returns a positive integer chosen from the range 1,…,n

− an NDA has many possible executions depending on values returned

An NDA “solves” a decision problem Π if


− for a ‘yes’-instance I of Π there is some execution that returns ’yes’
− for a ‘no’-instance I of Π there is no execution that returns ’yes’

and “solves” a decision problem Π in polynomial time if


− for every ‘yes’-instance I of Π there is some execution that returns
‘yes’ which uses a number of steps bounded by a polynomial in the input
− for a ‘no’-instance I of Π there is no execution that returns ’yes’
Algorithmics I, 2022 27
4 NP Completeness 211
Non-deterministic algorithms (NDAs)
An NDA “solves” a decision problem Π if
− for a ‘yes’-instance I of Π there is some execution that returns ’yes’
− for a ‘no’-instance I of Π there is no execution that returns ’yes’

Clearly such algorithms are not useful in practice


− who would use an algorithm that sometimes gives the right answer

However they are a useful mathematical concept for defining the


classes of NP and NP-complete problems

Algorithmics I, 2022 28
4 NP Completeness 212
Non-deterministic algorithms - Example
Graph colouring

// return true if graph g is k-colourable and false otherwise


boolean nDGC(Graph g, int k){

for (each vertex v in g) v.setColour(nonDeterministicChoice(k));

for (each edge {u,v} in g)


if (u.getColour() == v.getColour()) return false;
return true;
}

“guess” a colour
“verify” the for each vertex
colouring

Algorithmics I, 2022 29
4 NP Completeness 213
Non-deterministic algorithms
An non-deterministic algorithm can be viewed as
− a guessing stage (non-deterministic)
− a checking stage (deterministic and polynomial time)

guess a verify the stop


start
’certificate’ certificate

non-deterministic polynomial time


algorithm algorithm

Algorithmics I, 2022 30
4 NP Completeness 214
Polynomial time reductions
A polynomial-time reduction (PTR) is a mapping f from a decision
problem Π1 to a decision problem Π2 such that:

for every instance I1 of Π1 we have


− the instance f(I1) of Π2 can be constructed in polynomial time
− f(I1) is a ’yes’-instance of Π2 if and only if I1 is a ’yes’-instance of Π1

We write Π1 ∝ Π2 as an abbreviation for:


there is a polynomial-time reduction from Π1 to Π2

Algorithmics I, 2022 31
4 NP Completeness 215
Polynomial time reductions - Properties
Transitivity: Π1 ∝ Π2 and Π2 ∝ Π3 implies that Π1 ∝ Π3

Since Π1 ∝ Π2 and Π2 ∝ Π3 we have


− a PTR f from Π1 to Π2
− a PTR g from Π2 to Π3

Now for any instance I1 of Π1 since f is PTR we have


− I2=f(I1) is an instance of Π2 that can be constructed in polynomial time
− I2 has the same answer as I1
and since g is a PTR we have
− I3=g(I2) is an instance of Π3 that can be constructed in polynomial time
− I3 has the same answer as I2

Algorithmics I, 2022 32
4 NP Completeness 216
Polynomial time reductions - Properties
Transitivity: Π1 ∝ Π2 and Π2 ∝ Π3 implies that Π1 ∝ Π3

Since Π1 ∝ Π2 and Π2 ∝ Π3 we have


− a PTR f from Π1 to Π2
− a PTR g from Π2 to Π3

Putting the results together: for any instance I1 of Π1


− I3=g(f(I1)) is an instance of Π3 constructed in polynomial time
− I3 has the same answer as I1
− i.e. the composition of f and g is a PTR from from Π1 to Π3

Algorithmics I, 2022 33
4 NP Completeness 217
Polynomial time reductions - Properties
Relevance to P: Π1 ∝ Π2 and Π2∈P implies that Π1∈P
− to solve an instance of Π1, reduce it to an instance of Π2
− roughly speaking, Π1 ∝ Π2 means that Π1 is ‘no harder’ than Π2
i.e. if we can solve Π2, then we can solve Π1 without much more effort
• just need to additional perform a polynomial time reduction

Algorithmics I, 2022 34
4 NP Completeness 218
Polynomial time reductions - Example
Reducing Hamiltonian cycle problem to travelling salesman problem

Hamiltonian Cycle Problem (HC)


− instance: a graph G
− question: does G contain a cycle that visits
each vertex exactly once?

Travelling Salesman Decision Problem (TSDP)


− instance: a set of n cities and integer distance c1

d(i,j) between each pair of cities i,j, and a target integer K


9
− question: is there a permutation p of {1,2,…,n} such that 10 5

d(p1,p2)+d(p2,p3)+…+d(pn-1,pn)+d(pn,p1)≤K ? 6
c3 9

• i.e. is there a ’travelling salesman tour’ of length ≤K c2 9


c4

Algorithmics I, 2022 35
4 NP Completeness 219
Polynomial time reductions - Example
Reducing Hamiltonian cycle problem to travelling salesman problem
− G = (V,E) is an instance of HC
− construct TSDP instance f(G) where
• cities = V
• d(u,v)=1 if {u,v}∈E and 2 otherwise (is not an edge of G)
• K = |V|

a b a b
1
1 1
1
c 2
e e c
2 1
1
2 1
G
f(G)
d d

Algorithmics I, 2022 36
4 NP Completeness 220
Polynomial time reductions - Example
Reducing Hamiltonian cycle problem to travelling salesman problem
− G = (V,E) is an instance of HC
− construct TSDP instance f(G)
a b a b
1
1 1
1
c 2
e e c
2 1
1
2 1
G
f(G)
d d
− f(G) can be constructed in polynomial time
− f(G) has a tour of length ≤|V| if and only if G has a Hamiltonian cycle
(tour includes |V| edges so cannot take any of the edges with weight 2)
− therefore TSDP∈P implies that HC∈P
− equivalently HC∉P implies that TSDP∉P (contrapositive)

Algorithmics I, 2022 37
4 NP Completeness 221
NP-completeness
A decision problem Π is NP-complete if
1. Π∈NP
2. for every problem Π’ in NP: Π’ is polynomial-time reducable to Π

Consequences of definition
− if Π is NP-complete and Π∈P, then P = NP
− every problem in NP can be solved in polynomial time by reduction to Π
− supposing P ≠ NP, if Π is NP-complete, then Π∉P

The structure of NP if P ≠ NP
NP
P NP-complete

Algorithmics I, 2022 38
4 NP Completeness 222
Proving NP-completeness
A decision problem Π is NP-complete if
1. Π∈NP
2. for every problem Π’ in NP: Π’ is polynomial-time reducable to Π

How can we possibly prove any problem to be NP-complete?


− it is not feasible to describe a reduction from every problem in NP
− however, suppose we knew just one NP-complete problem Π1

To prove Π2 is NP-complete enough to show


− Π2 is in NP
− there exists a polynomial-time reduction from Π1 to Π2

Algorithmics I, 2022 39
4 NP Completeness 223
Proving NP-completeness
A decision problem Π is NP-complete if
1. Π∈NP
2. for every problem Π’ in NP: Π’ is polynomial-time reducable to Π

Suppose we knew just one NP-complete problem Π1, then to prove


Π2 is NP-complete it is enough to show
− Π2 is in NP
− there exists a polynomial-time reduction from Π1 to Π2

Correctness of the approach


− for any Π∈NP, since Π1 is NP-complete we have Π ∝ Π1
− since Π ∝ Π1, Π1 ∝ Π2 and ∝ is transitive, it follows that Π ∝ Π2
− since Π∈NP was arbitrary, Π ∝ Π2 for all Π∈NP
− and hence Π2 is NP-complete

Algorithmics I, 2022 40
4 NP Completeness 224
Proving NP-completeness
The first NP-complete problem?

Name: Satisfiability (SAT)


Instance: Boolean expression B in conjunctive normal form (CNF)
− CNF: C1 ∧ C2 ∧ … ∧ Cn where each Ci is a clause
− Clause C: (l1 ∨ l2 ∨ … ∨ lm) where each lj is a literal
− Literal l: a variable x or its negation ¬x
Question: is B satisfiable?
− i.e. can values be assigned to the variables that make B true?

Example:
− B = (x1∨x2∨¬x3)∧(¬x1∨x3∨¬x4)∧(¬x2∨x4)∧(x2∨¬x3∨x4)
− B is satisfiable: x1=true, x2=false, x3=true, x4=true

Algorithmics I, 2022 41
4 NP Completeness 225
Proving NP-completeness
The first NP-complete problem?

Cook’s Theorem (1971): Satisfiability (SAT) is NP-complete


− the proof consists of a generic polynomial-time reduction to SAT
from an abstract definition of a general problem in the class NP
− the generic reduction could be instantiated to give an actual
reduction for each individual NP problem

Given Cook’s theorem, to prove a decision problem Π is NP-complete


it is sufficient to show that:
− Π is in NP
− there exists a polynomial-time reduction from SAT to Π

Algorithmics I, 2022 42
4 NP Completeness 226
Clique is NP-complete
Name: Clique Problem (CP)
Instance: a graph G and a target integer K
Question: does G contain a clique of size K?
− i.e. a set of K vertices for which there is an edge between all pairs

To prove Clique is NP –complete


− show CP is in NP (straightforward)
− there exists a polynomial-time reduction from SAT to CP

Algorithmics I, 2022 43
4 NP Completeness 227
Clique is NP-complete
To complete the proof we need to show SAT µ CP
− i.e. a polynomial time reduction from SAT to CP

This is not examinable – this is just to show you that it is possible to


build PTRs between very different problems

Algorithmics I, 2022 44
4 NP Completeness 228
Clique is NP-complete
To complete the proof we need to show SAT µ CP
− i.e. a polynomial time reduction from SAT to CP

Given an instance B of SAT we construct (G,K) an instance of CP


− K number of clauses of B
− vertices of G are pairs (l,C) where l is a literal in clause C
− {(l,C),(m,D)} is an edge of G if and only if l≠¬m and C≠D
• recall that ¬(¬x)=x so l≠¬m is equivalent to ¬l≠m
• edge if distinct literals from different clauses can be satisfied simultaneously
− polynomial time construction (O(n2) where n is the number of literals)
• worst case: to construct edges we need to compare every literal with
every other literal
This is a polynomial time reduction since:
− B has a satisfying assignment if and only if G has a clique of size K

Algorithmics I, 2022 45
4 NP Completeness 229
Clique is NP-complete
To prove it is a polynomial time reduction we can show:

If B has a satisfying assignment, then


− if we choose a true literal in each clause the corresponding vertices
form a clique of size K in G

If G has a clique of size K, then


− assigning each literal associated with a vertex in the clique to be true
yields a satisfying assignment for B

Algorithmics I, 2022 46
4 NP Completeness 230
Clique is NP-complete
Why does the construction work?

{(l,C),(m,D)} is an edge if and only if l≠¬m and C≠D


− only edges between literals in distinct clauses
− only edges between literals that can be satisfied simultaneously

Therefore in a clique of size K (recall K is the number of clauses)


− must include one literal from each clause (i.e. from K clauses)
− we can satisfy all the literals in the clique simultaneously
− this means we can satisfy all clauses
• a clause is a disjunction of literals and we can satisfy one of them
− and therefore satisfy B
• B is the conjunction of the clauses

Algorithmics I, 2022 47
4 NP Completeness 231
Clique is NP-complete - Example

B = (x1∨x2∨¬x3)∧(¬x1∨x3∨¬x4)∧(¬x2∨x4)∧(x2∨¬x3∨x4)
− there are K = 4 clauses
¬x3 ¬x1
C1 C2
The graph G x3
x2
− vertices of G are pairs
(l,C) where l is a literal ¬x4
x1
in clause C
− {(l,C),(m,D)} is an edge
if and only if l≠¬m and C≠D
x4
x2

¬x2 C3
C4 ¬x3
x4
Algorithmics I, 2022 48
4 NP Completeness 232
Clique is NP-complete

B = (x1∨x2∨¬x3)∧(¬x1∨x3∨¬x4)∧(¬x2∨x4)∧(x2∨¬x3∨x4)
− there are K = 4 clauses
¬x3 ¬x1
C1 C2
The graph G x3
x2

G has a clique of size 4 ¬x4


x1
if and only if
B has a satisfying assignment

x4
x2
satisfying assignment
clique of size 4 ¬x2 C3
C4 ¬x3
x4
Algorithmics I, 2022 49
4 NP Completeness 233
Problem restrictions
A restriction of a problem consists of a subset of the instances of the
original problem
− if a restriction of a given decision problem Π is NP-complete, then so is Π
− given NP-complete problem Π, a restriction of Π might be NP-complete

For example a clique restricted to cubic graphs is in P


− (a cubic graph is a graph in which every vertex belongs to 3 edges)
− a largest clique has size at most 4 so exhaustive search is O(n4)

While graph colouring restricted to cubic graphs is NP-complete


− not proved here

Algorithmics I, 2022 50
4 NP Completeness 234
Problem restrictions

K-colouring
− restriction of Graph Colouring for for a fixed number K of colours
− 2-colouring is in P (it reduces to checking the graph is bipartite)
− 3-colouring is NP-complete

K-SAT
− restriction of SAT in which every clause contains exactly K literals
− 2-SAT is in P (proof is a tutorial exercise)
− 3-SAT is NP-complete
− showing 3-SAT ∈ NP is easy we will just show SAT ∝ 3-SAT

Algorithmics I, 2022 51
4 NP Completeness 235
SAT ∝ 3-SAT
Given instance B of SAT will construct an instance B’ of 3-SAT
For each clause C of B we construct a number of clauses of B’

− if C=l1, we introduce 2 addition variables x1 and x2 and add the


clauses (l1∨x1∨x2),(l1∨x1∨¬x2),(l1∨¬x1∨x2),(l1∨¬x1∨¬x2) to B’

− B’ holds if and only if all the clauses (l1∨x1∨x2), (l1∨x1∨¬x2),


(l1∨¬x1∨x2), (l1∨¬x1∨¬x2) hold (B’ is a conjunction of clauses)

− for any assignment to x1 and x2 this requires l1 holds


i.e. all clauses hold if and only if the clause C hold

Algorithmics I, 2022 52
4 NP Completeness 236
SAT ∝ 3-SAT
Given instance B of SAT will construct an instance B’ of 3-SAT
For each clause C of B we construct a number of clauses of B’

− if C=l1, we introduce 2 addition variables x1 and x2 and add the


clauses (l1∨x1∨x2),(l1∨x1∨¬x2),(l1∨¬x1∨x2),(l1∨¬x1∨¬x2) to B’

− if C=(l1∨l2), we introduce 1 addition variable y and add the clauses


(l1∨l2∨y) and (l1∨l2∨¬y) to B’

− B’ holds if and only if both the clauses (l1∨l2∨y) and (l1∨l2∨¬y) hold

− for any assignment to y this requires (l1∨l2) holds


i.e. both clauses hold if and only if the clause C holds

Algorithmics I, 2022 53
4 NP Completeness 237
SAT ∝ 3-SAT
Given instance B of SAT will construct an instance B’ of 3-SAT
For each clause C of B we construct a number of clauses of B’

− if C=l1, we introduce 2 addition variables x1 and x2 and add the


clauses (l1∨x1∨x2),(l1∨x1∨¬x2),(l1∨¬x1∨x2),(l1∨¬x1∨¬x2) to B’

− if C=(l1∨l2), we introduce 1 addition variable y and add the clauses


(l1∨l2∨y) and (l1∨l2∨¬y) to B’

− if C=(l1∨l2∨l3), we add the clause C to B’

Algorithmics I, 2022 54
4 NP Completeness 238
SAT ∝ 3-SAT
Given instance B of SAT will construct an instance B’ of 3-SAT
For each clause C of B we construct a number of clauses of B’

− if C=l1, we introduce 2 addition variables x1 and x2 and add the


clauses (l1∨x1∨x2),(l1∨x1∨¬x2),(l1∨¬x1∨x2),(l1∨¬x1∨¬x2) to B’

− if C=(l1∨l2), we introduce 1 addition variable y and add the clauses


(l1∨l2∨y) and (l1∨l2∨¬y) to B’

− if C=(l1∨l2∨l3), we add the clause C to B’

− if C=(l1∨…∨lk) and k>3, we introduce k-3 addition variables z1,…,zk-3


and add the clauses (l1∨l2∨z1), (¬z1∨l3∨z2),(¬z2∨l4∨z3),…,
(¬zk-4∨lk-2∨zk-3), (¬zk-3∨lk-1∨lk) to B’

Algorithmics I, 2022 55
4 NP Completeness 239
SAT ∝ 3-SAT
Given instance B of SAT will construct an instance B’ of 3-SAT
For each clause C of B we construct a number of clauses of B’

− if C=l1, we introduce 2 addition variables x1 and x2 and add the


clauses (l1∨x1∨x2),(l1∨x1∨¬x2),(l1∨¬x1∨x2),(l1∨¬x1∨¬x2) to B’

− if C=(l1∨l2), we introduce 1 addition variable y and add the clauses


(l1∨l2∨y) and (l1∨l2∨¬y) to B’

− if C=(l1∨l2∨l3), we add the clause C to B’

− if C=(l1∨…∨lk) and k>3, we introduce k-3 addition variables z1,…,zk-3


and add the clauses (l1∨l2∨z1), (¬z1∨l3∨z2),(¬z2∨l4∨z3),…,
(¬zk-4∨lk-2∨zk-3), (¬zk-3∨lk-1∨lk) to B’
− again all clauses hold if and only if C holds

Algorithmics I, 2022 56
4 NP Completeness 240
Coping with NP-completeness
What to do if faced with an NP-complete problem?
Maybe only a restricted version is of interest (which maybe in P)
− e.g. 2-SAT, 2-colouring are in P
Seek an exponential-time algorithm improving on exhaustive search
− e.g. backtracking (as in the assessed exercise), branch-and-bound
− should extend the set of solvable instances
For an optimisation problem (e.g. calculating min/max value)
− settle for an approximation algorithm that runs in polynomial time
− especially if it gives a provably good result (within some factor of optimal)
− use a heuristic
• e.g. genetic algorithms, simulated annealing, neural networks
For a decision problem
− settle for a probabilistic algorithm correct answer with high probability

Algorithmics I, 2022 57
4 NP Completeness 241
Algorithmics I 2022

Algorithmics I

Section 5 - Computability

Dr. Gethin Norman

School of Computing Science


University of Glasgow

[email protected]

5 Computability 242
Introduction to Computability
What is a computer?

input x black box output f(x)

What can the black box do?


− it computes a function that maps an input to an output

Computability concerns which functions can be computed


− a formal way of answering ‘what problems can be solved by a computer?’
− or alternatively ‘what problems cannot be solved by a computer?’

To answer such questions we require a formal definition


− i.e. a definition of what a computer is
− or what an algorithm is if we view a computer as a device that can
execute an algorithm

Algorithmics I, 2022 2
5 Computability 243
Unsolvable problems
Some problems cannot be solved by a computer
− even with unbounded time
Example: The Tiling Problem
− a tile is a 1×1 square, divided into 4 triangles by its diagonals with each
triangle is given a colour
− each tile has a fixed orientation (no rotations allowed)
− example tiles:

Instance: a finite set S of tile descriptions


Question: can any finite area, of any size, be completely covered
using only tiles of types in S, so that adjacent tiles colour match?

Algorithmics I, 2022 3
5 Computability 244
Tiling problem - Tiling a 5×5 square
Available tiles:

We can use these tiles to tile a 5×5 square as follows:

Algorithmics I, 2022 4
5 Computability 245
Tiling problem - Extending to a larger region
Overlap the top two rows with
the bottom two rows
− obtain an 8×5 tiled area
Next place two of
these 8×5 rectangles
side by side
− with the right hand
rectangle one row
above the left hand
rectangle
By repeating this pattern it
follows that any finite area
can be tiled
Algorithmics I, 2022 5
5 Computability 246
Tiling problem - Altering the tiles
Original tiles:

New tiles:

Now impossible to tile a 3×3 square

There are 39=19,683 possibilities if you want to try them all out…

Algorithmics I, 2022 6
5 Computability 247
Tiling problem
Tiling problem: given a set of tile descriptions, can any finite area, of
any size, be completely ‘tiled’ using only tiles from this set?

There is no algorithm for the tiling problem


− for any algorithm A that we might try to formulate there is a set of tiles S
for which either A does not terminate or A gives the wrong answer

The problem is that:


− “any size” means we have to check all finite areas and there are infinitely
many of these
− and for certain sets of tile descriptions that can tile any area, there is no
“repeated pattern” we can use
− so to be correct the algorithm would really have to check all finite areas

Algorithmics I, 2022 7
5 Computability 248
Undecidable problems
A problem Π that admits no algorithm is called non-computable or
unsolvable

If Π is a decision problem and Π admits no algorithm it is called


undecidable

The Tiling Problem is undecidable

Algorithmics I, 2022 8
5 Computability 249
Post’s correspondence problem (PCP)
A word is a finite string over some given finite alphabet

Instance: two finite sequences of words X1,…,Xn and Y1,…,Yn


− the words are all over the same alphabet
Question: does there exist a sequence i1,i2,…,ir of integers chosen
from {1,…,n} such that Xi1Xi2…Xir = Yi1Yi2…Yir ?
− i.e. concatenating the Xij's and the Yij's gives the same result

Example: n=5
− X1=abb, X2=a, X3=bab, X4=baba, X5=aba
− Y1=bbab, Y2=aa, Y3=ab, Y4=aa, Y5=a
− correspondence is given by the sequence 2, 1, 1, 4, 1, 5
• word constructed from Xi’s: aabbabbbabaabbaba
• word constructed from Yi’s: aabbabbbabaabbaba

Algorithmics I, 2022 9
5 Computability 250
Post’s correspondence problem (PCP)
A word is a finite string over some given finite alphabet

Instance: two finite sequences of words X1,…,Xn and Y1,…,Yn


− the words are all over the same alphabet
Question: does there exist a sequence i1,i2,…,ir of integers chosen
from {1,…,n} such that Xi1Xi2…Xir = Yi1Yi2…Yir ?
− i.e. concatenating the Xij's and the Yij's gives the same result

Example: n=5 (with first letter from X1 and Y1 removed)


− X1=bb, X2=a, X3=bab, X4=bab, X5=aba
− Y1=bab, Y2=aa, Y3=ab, Y4=aa, Y5=a
− to get a match we must start with either 2 or 5
− follows that we can now never get a correspondence
Post’s Correspondence Problem is undecidable

Algorithmics I, 2022 10
5 Computability 251
The halting problem
An impossible project: write a program Q that takes as input
− a legal program X (say in Java)
− an input string S for program X
and returns as output
− yes if program X halts when run with input S
− no if program X enters an infinite loop when run with input S

We will prove that no such program Q can exists, meaning the


halting problem is undecidable

Algorithmics I, 2022 11
5 Computability 252
The halting problem
Example (small) programs

public void test(int n){


if (n == 1)
while (true)
null;
}

The program ‘test’ will terminates if and only if input n≠1

Algorithmics I, 2022 12
5 Computability 253
The halting problem
Example (small) programs

public int erratic(int n){


while (n != 1)
if (n % 2 == 0) n = n/2;
else n = 3*n + 1;
}

For example if ‘erratic’ is called with n=7 sequence of values:

22, 11, 34, 17, 52, 26, 13, 40, 20, 10, 5, 16, 8, 4, 2, 1

Nobody knows whether ‘erratic’ terminates for all values of n

Algorithmics I, 2022 13
5 Computability 254
The halting problem - Undecidability
A formal definition of the halting problem (HP)
Instance: a legal Java program X and an input string S for X
− can substitute any language for Java
Question: does X halt when run on S?

Theorem: HP is undecidable proof (by contradiction):


− suppose we have an algorithm A that decides (solves) HP
− let Q be an implementation of this algorithm as a Java method
with X and S as parameters

yes output:"yes"
program X Q
does X
input string S halt on S?
no output:"no"

Algorithmics I, 2022 14
5 Computability 255
The halting problem - Undecidability
Define a new program P with input a legal program W in Java

program P(W)
yes while (true) null;
input program W Q
does W
program W input string W halt on W?
no exit

− P makes a copy of W and calls Q(W,W)


− Q terminates by assumption, returning either "yes" or "no"
− if Q returns "yes", then P enters an infinite loop
− if Q returns ”no", then P terminates

Algorithmics I, 2022 15
5 Computability 256
The halting problem - Undecidability
Define a new program P with input a legal program W in Java

program P(W)
yes while (true) null;
input program W Q
does W
program W input string W halt on W?
no exit

Now let the input W be the program P itself

program P(P)
yes while (true) null;
program P Q
input
does P
program P input string P halt on P?
no exit

Algorithmics I, 2022 16
5 Computability 257
The halting problem - Undecidability
Now let the input W to P be the program P itself
program P(P)
yes while (true) null;
input program P Q
does P
program P input string P halt on P?
no exit

P calls Q(P,P)
− Q terminates by assumption, returning either "yes" or "no"
− recall we have assumed Q solves the halting problem
− suppose Q returns "yes", then by definition of Q this means P terminates
− but this also means P does not terminate (it enters the infinite loop)
− this is a contradiction therefore Q must return "no"

Algorithmics I, 2022 17
5 Computability 258
The halting problem - Undecidability
Now let the input W to P be the program P itself
program P(P)
yes while (true) null;
input program P Q
does P
program P input string P halt on P?
no exit

P calls Q(P,P)
− Q terminates by assumption, returning either "yes" or "no"
− recall we have assumed Q solves the halting problem
− therefore Q must return "no"
− this means by definition of Q that P does not terminate
− but this also means P does terminate
− so again a contradiction

Algorithmics I, 2022 18
5 Computability 259
The halting problem - Undecidability
Now let the input W to P be the program P itself
program P(P)
yes while (true) null;
input program P Q
does P
program P input string P halt on P?
no exit

P calls Q(P,P)
− Q terminates by assumption, returning either "yes" or "no"
− recall we have assumed Q solves the halting problem
− therefore Q can return neither "yes" nor "no"
− meaning no such program Q can exist
− if no such Q can exist, then no algorithm can solve the halting problem
− hence the problem is undeciable

Algorithmics I, 2022 19
5 Computability 260
The halting problem - Undecidability
To summarise the proof
− we assumed the existence of an algorithm A that solved HP
− implemented this algorithm as the program Q
− then constructed a program P which contains Q as a subroutine
− showing that if Q gives the answer "yes", we reach a contradiction
− so Q must give the answer "no", but this also leads to a contradiction
− the contradiction stems from assuming that Q, and hence A exists
− therefore no algorithm A exists and HP is undecidable

Notice we are not concerned with the complexity of A just the


existence of A

Algorithmics I, 2022 20
5 Computability 261
Proving undecidability by reduction
Suppose we can reduce any instance I of Π1 into an instance J of Π2
such that
− I has a ‘yes’-answer for Π1 if and only if J has a "yes"-answer for Π2
(like PTRs but no need for J to be constructed in polynomial time)

If Π1 is undecidable and we can perform such a reduction,


then Π2 is undecidable
− suppose for a contradiction Π2 is decidable
− then using this reduction we can decide Π1
− however Π1 is undecidable, therefore Π2 cannot be decidable

Algorithmics I, 2022 21
5 Computability 262
Hierarchy of decision problems

Undecidable
e.g. Tiling Problem,
Halting Problem

Intractable
e.g. Roadblock

NP-complete
e.g. SAT, HC, TSDP
Exactly one of
these lines is real
Polynomial-time solvable
e.g. String distance, (depends on whether
Eulerian cycle P equals NP)

Algorithmics I, 2022 22
5 Computability 263
Models of computation

input x black box output f(x)

Attempts to define "the black box”


− we will look at three classical models of computation of increasing power
• Finite-State Automata
− simple machines with a fixed amount of memory
− have very limited (but still useful) problem-solving ability
• Pushdown Automata (PDA)
− simple machines with an unlimited memory that behaves like a stack
• Turing machines (TM)
− simple machines with an unlimited memory that can be used essentially
arbitrarily
− these have essentially the same power as a typical computer

Algorithmics I, 2022 23
5 Computability 264
Deterministic finite-state automata
Simple machines with limited memory which recognise input on
a read-only tape

A DFA consists of
− a finite input alphabet Σ
− a finite set of states Q
− a initial/start state q0 ∈ Q and set of accepting states F ⊆ Q
− control/program or transition relation T ⊆ (Q × Σ) × Q
• ((q,a),q’) ∈ T means if in state q and read a, then move to state q’

− deterministic means that if


((q,a1),q1), ((q,a2),q2) ∈ T either a1≠a2 or q1=q2
− i.e. for any state and action there is at most one move (i.e. no choice)

Algorithmics I, 2022 24
5 Computability 265
Deterministic finite-state automata
Simple machines with limited memory which recognise input on
a read-only tape

A DFA consists of add input tape (finite sequence of


elements/actions from the alphabet)
− a finite input alphabet Σ
− a finite set of states Q
− a initial/start state q0 ∈ Q and set of accepting states F ⊆ Q
− control/program or transition relation T ⊆ (Q × Σ) × Q
b control/program
((q0,a), q1)
a b a ((q0,b), q3)
q0 q1 q2 q3
((q1,a), q1)
a b a,b ((q1,b), q2)
((q2,a), q3)
((q2,b), q2)
((q3,a), q3)
Algorithmics I, 2022 25
5 Computability ((q3,b), q3) 266
Deterministic finite-state automata
A DFA define a language
− determines whether the string on the input tape belongs to that language
− in other words, it solves a decision problem

More precisely a DFA recognises or accepts a language


− the input strings which when ‘run’ end in an accepting state

Question: what language does this DFA recognise?

a a b b b
b

a b a
q0 q1 q2 q3
string is accepted
a b a,b

Algorithmics I, 2022 26
5 Computability 267
Deterministic finite-state automata
A DFA define a language
− determines whether the string on the input tape belongs to that language
− in other words, it solves a decision problem

More precisely a DFA recognises or accepts a language


− the input strings which when ‘run’ end in an accepting state

Question: what language does this DFA recognise?

a a b b b a
b

a b a
q0 q1 q2 q3
string is not accepted
a b a,b

Algorithmics I, 2022 27
5 Computability 268
Deterministic finite-state automata
A DFA define a language
− determines whether the string on the input tape belongs to that language
− in other words, it solves a decision problem

More precisely a DFA recognises or accepts a language


− the input strings which when ‘run’ end in an accepting state

Question: what language does this DFA recognise?

answer: the language


b consisting of the set of
all strings comprising
a b a
q0 q1 q2 q3 one or more a's followed
a b a,b by one or more b's

Algorithmics I, 2022 28
5 Computability 269
Deterministic finite-state automata
Recognises the language of strings containing two
consecutive a’s
b

a a
q0 q1 q2
b a,b

Recognises the complement, i.e., the language of strings that do not


contain two consecutive a’s

a a
q0 q1 q2
b a,b

Algorithmics I, 2022 29
5 Computability 270
Another example
a
b b
q0 q1 q2
b

Recognises strings that start and end with b


However this is not a DFA, but a non-deterministic finite-state
automaton (NFA)
− in state q1 under b can move to q1 or q2

Recognition for NFA is similar to non-deterministic algorithms


“solving” a decision problem
− only require there exists a ‘run’ that ends in an accepting state
− i.e. under one possible resolution of the nondeterministic choices

Algorithmics I, 2022 30
5 Computability 271
Another example
a
b b
q0 q1 q2
b

Recognises strings that start and end with b


However this is not a DFA, but a non-deterministic finite-state
automaton (NFA)
− in state q1 under b can move to q1 or q2

But any NFA can be converted into a DFA

Therefore non-determinism does not expand the class of languages


that can be recognised by finite state automata
− being able to guess does not give us any extra power

Algorithmics I, 2022 31
5 Computability 272
NFA to DFA reduction
Can reduce a NFA to a DFA using the subset construction
− states of the DFA are sets of states of the NFA
− construction can cause a blow-up in the number of states
• in the worst case from N states to 2N states

Example (without blow-up)


− recognises strings that start and end with b

a
b b
− NFA q0 q1 q2
b

a
b b
− DFA {q0} {q1} {q1,q2} b
a

Algorithmics I, 2022 32
5 Computability 273
Regular languages and regular expressions
The languages that can be recognised by finite-state automata
are called the regular languages

A regular language (over an alphabet Σ) can be specified by


a regular expression over Σ
− ε (the empty string) is a regular expression
− σ is a regular expression (for any σ∈Σ)

if R and S are regular expressions, then so are


− RS which denotes concatenation
− R | S which denotes choice between R or S
− R* which denotes 0 or more copies of R (sometimes called closure)
− (R) which is needed to override precedence between operators

Algorithmics I, 2022 33
5 Computability 274
Regular expressions
Order of precedence (highest first)
− closure (*) then concatenation then choice (|)
− as mentioned brackets can be used to override this order

Example: suppose Σ = {a,b,c,d}


− R = (ac|a*b)d means ( ( ac ) | ( (a*) b ) ) d
− corresponding language LR is
{acd, bd, abd, aabd, aaabd, aaaabd, … }

Additional operations
− complement ¬x
• equivalent to the 'or' of all characters in Σ except x
− any single character ?
• equivalent to the 'or' of all characters

Algorithmics I, 2022 34
5 Computability 275
Regular expressions - Examples
The examples from earlier

1) the language comprising one or more a's followed by one or more b’s
− aa*bb*

2) the language of strings containing two consecutive a’s


− (a|b)*aa(a|b)*

3) the language of strings that do not contain two consecutive a’s (harder)
− b*(abb*)*(ε|a)

4) the language of strings that start and end with b


− b(a|b)*b

Algorithmics I, 2022 35
5 Computability 276
Regular expressions - Closure
To clarify what R* means
− corresponds to 0 or more copies of the regular expression R

Let L(R) be the language corresponding to the regular expression R


− then concatenation is given by L(RS)={ rs | r∈L(R) and s∈L(S) }
and L(R*)=L(R0)∪L(R1)∪L(R2)… where L(R0)={ε} and L(Ri+1)=L(RRi)
− note (a*b*)* is in fact equivalent to (a|b)*

L(R*) does not mean { r* | r∈L(R) }


− which for certain regular expressions cannot be recognized by any DFA
− essentially for such a language would need a memory to remember which
string in r∈L(R) is repeated and there might be an unbounded number

Algorithmics I, 2022 36
5 Computability 277
Regular expressions - Example
Consider the language (aa*bb*)*
− i.e. zero or more sequences which consist of a non-zero number of a’s
followed by a non-zero number of b’s
Corresponding DFA:
a b
a b
q0 q1 q2

b
a
q3
a,b a b a a b

Algorithmics I, 2022 37
5 Computability 278
Regular expressions - Example
A DFA cannot recognise { r* | r∈L(aa*bb*) }
− i.e. { (ambn)* | m>0 and n>0 }
− the problem is the DFA would need to remember the m and n to check
that a string is in the langauge
− but there are infinitely many values for m and n
− hence the DFA would need infinitely many states
− and we only have a finite number (DFA = deterministic finite automaton)

Similarly a DFA cannot recognise { anbn | n>0 }


− i.e. a number of a's followed by the same number of b’s

Languages that are recognised by DFAs are called regular languages


so, for example { anbn | n>0 } is not regular

Algorithmics I, 2022 38
5 Computability 279
Regular expressions - Example
How can we recognising strings of the form anbn?
− i.e. a number of a's followed by the same number of b's

It turns out that there is no DFA that can recognise this language
− it cannot be done without some form of memory, e.g. a stack

Idea: as you read a’s, push them onto a stack, then pop the stack as
you read b’s, i.e. the stack works like a counter
So there are some functions (languages) that we would regard as
computable that cannot be computed by a finite-state automaton
− DFAs are not an adequate model of a general-purpose computer
i.e. our 'black box’

Next: pushdown automata extend finite-state automata with a stack

Algorithmics I, 2022 39
5 Computability 280
Pushdown automata
A pushdown automaton (PDA) consists of:
− a finite input alphabet Σ, a finite set of stack symbols G
− a finite set of states Q including start state and set of accepting states
− control or transition relation T ⊆ (Q×ΣÈ{ε}×GÈ{ε})×(Q×GÈ{ε})

ε – empty string
current tape old stack new new stack
state symbol symbol state symbol
or ε or ε or ε

tape control stack


a b a b a
w top
head v

Algorithmics I, 2022 40
5 Computability 281
Pushdown automata
Transition relation T ⊆ (Q × ΣÈ{ε} × GÈ{ε}) × (Q × GÈ{ε})
tape stack
control
a b a b a
w top
head v

Informally, the transition (q1,a,w) ➝ (q2,v) means that


− if we are in state q1
− if a≠ε, then the symbol a is at the head of the tape
− if w≠ε, then the symbol w is is on top of the stack
− then move to state q2 and
− if a≠ε, then move head forward one position
− if w≠ε, then pop w from the stack
− if v≠ε, then push v onto the stack

Algorithmics I, 2022 41
5 Computability 282
Pushdown automata
A PDA accepts an input if and only if after the input has been read,
the stack is empty and control is in an accepting state

Example tuples from a PDA program when in state q1


− (q1,ε,ε)➝(q2,ε) move to q2

− (q1,a,ε)➝(q2,ε) if head of tape is a, move to q2 & move head forward

− (q1,a,ε)➝(q2,v) if head of tape is a, move to q2, move head forward


& push v onto stack

− (q1,a,w)➝(q2,ε) if head of tape is a & w is top stack, move to q2, move


head forward & pop w from stack

− (q1,a,w)➝(q2,v) if head of tape is a & w is top of stack, move to q2,


move head forward, pop w & push v onto stack

Algorithmics I, 2022 42
5 Computability 283
Pushdown automata
There is no explicit test that the stack is empty
− this can be achieved by adding a special symbol ($) to the stack at the
start of the computation
− i.e. we add the symbol to the stack when we know the stack is empty
and we never add $ at any other point during the computation
• unless we pop it from the stack as at this point we again know its empty
− then can check for emptiness by checking $ is on top of the stack

− when we want to finish in an accepting state we just need to make


sure we pop $ from the stack (we will see this in an example later)

Algorithmics I, 2022 43
5 Computability 284
Pushdown automata
Note PDA defined here are non-deterministic (NDPDA)
− deterministic PDAs (DPDAs) are less powerful
− this differs from DFAs where non-determinism does not add power
− i.e. there are languages that can be recognised by a NDPDA but
not by a DPDA, e.g. the language of palindromes
• palindromes: strings that read the same forwards and backwards

Algorithmics I, 2022 44
5 Computability 285
Pushdown automata - Palindromes
Palindrones are sequences of characters that read the same forwards
and backwards (second half is the reverse of the first half)

How to recognize palindrones with a pushdown automaton?


− push the first half of the sequence onto the stack
− then as we read each new character check it is the same as the top
element on the the stack and pop this element
− then enter an accepting state if all checks succeed

Why do we need non-determinism?


− we need to “guess” where the middle of the stack is
• and if there are even or odd number of characters
− cannot work this out first and then check the string as would need an
unbounded number of states as the string could be of any finite length

Algorithmics I, 2022 45
5 Computability 286
Pushdown automata - Example
Consider the following PDA program (alphabet is {a,b})
− q0 is the start state and q0 and q3 are the only accepting states
− (q0,ε,ε)➝(q1,$) move to q1 and push $ onto stack ($ - special symbol)
− (q1,a,ε)➝(q1,1) read a & push 1 onto stack
− (q1,b,1)➝(q2,ε) read b & 1 is top of stack, pop stack & move to q2
− (q2,b,1)➝(q2,ε) read b & 1 is top of stack, pop stack
− (q2,ε,$)➝(q3,ε) if $ is the top of the stack, pop stack & move to q3

tape
a a b b (empty)
stack
ε,ε➝$
head q0 q1 a,ε➝1

b,1➝ε

ε,$➝ε
q3 q2 b,1➝ε
Algorithmics I, 2022 46
5 Computability 287
Pushdown automata - Example
Consider the following PDA program (alphabet is {a,b})
− q0 is the start state and q0 and q3 are the only accepting states
− (q0,ε,ε)➝(q1,$) move to q1 and push $ onto stack ($ - special symbol)
− (q1,a,ε)➝(q1,1) read a & push 1 onto stack
− (q1,b,1)➝(q2,ε) read b & 1 is top of stack, pop stack & move to q2
− (q2,b,1)➝(q2,ε) read b & 1 is top of stack, pop stack
− (q2,ε,$)➝(q3,ε) if $ is the top of the stack, pop stack & move to q3

Example Inputs
− if you try to recognise aabb, all of the input is read, as we have just seen
end up in an accepting state, and the stack is empty
− if you try to recognise aaabb, all the input is read, you end up in state q2
and the stack in not empty
− if you try to recognise aabbb, you are left with b on the tape, which
cannot be read because of an empty stack

Algorithmics I, 2022 47
5 Computability 288
Pushdown automata - Example
Consider the following PDA program (alphabet is {a,b})
− q0 is the start state and q0 and q3 are the only accepting states
− (q0,ε,ε)➝(q1,$) move to q1 and push $ onto stack ($ - special symbol)
− (q1,a,ε)➝(q1,1) read a & push 1 onto stack
− (q1,b,1)➝(q2,ε) read b & 1 is top of stack, pop stack & move to q2
− (q2,b,1)➝(q2,ε) read b & 1 is top of stack, pop stack
− (q2,ε,$)➝(q3,ε) if $ is the top of the stack, pop stack & move to q3

Automaton recognises the language: { an bn | n≥0 }

Algorithmics I, 2022 48
5 Computability 289
Pushdown automata
Pushdown automata are more powerful than finite-state automata
− a PDA can recognise some languages that cannot be recognised by a DFA
− e.g. {anbn | n≥0} is recognised by the PDA example

The languages that can be recognised by a PDA are the context-free


languages

Are all languages regular or context-free?


i.e. is a PDA an adequate model of a general purpose computer (our 'black box')?

No, for example, consider the language {anbncn | n≥0}


− this cannot be recognised by a PDA
− but it is easy to write a program (say in Java) to recognise it

Algorithmics I, 2022 49
5 Computability 290
Turing machines
A Turing Machine T to recognise a particular language consists of

• a finite alphabet Σ, including a blank symbol (denoted by #)


• an unbounded tape of squares
− each can hold a single symbol of Σ
− tape unbounded in both directions
• a tape head that scans a single square
− it can read from it and write to the square
− then moves one square left or right along the tape
• a set S of states
− includes a single start state s0 and two halt (or terminal) states sY and sN
• a transition function
− essentially the inbuilt program

Algorithmics I, 2022 50
5 Computability 291
Turing machines - Computation
The transition function is of the form
f : ((S⁄{sY,sN}) × Σ) ➝ (S × Σ × {Left, Right})

For each non-terminal state and symbol the function f specifies


− a new state (perhaps unchanged)
− a new symbol (perhaps unchanged)
− a direction to move along the tape

f(s,σ)=(s¢,σ¢,d) means reading symbol σ from the tape in state s


− move to state s¢∈S
− overwrite the symbol σ on the tape with the symbol σ¢∈Σ
• if you do not want to overwrite the symbol write the symbol you read
− move the tape head one square in direction d∈{Left, Right}

Algorithmics I, 2022 52
5 Computability 292
Turing machines - Computation
The (finite) input string is placed on the tape
− assume initially all other squares of the tape contain blanks

The tape head is placed on the first symbol of the input

T starts in state s0 (scanning the first symbol)


− if T halts in state sY, the answer is ‘yes’ (accepts the input)
− if T halts in state sN, the answer is ‘no’ (rejects the input)

Algorithmics I, 2022 53
5 Computability 293
The palindrome problem
Instance: a finite string Y
Question: is Y a palindrome, i.e. is Y equal to the reverse of itself
− simple Java method to solve the above:

public boolean isPalindrome(String s){


int n = s.length();
if (n < 2) return true;
else
if (s.charAt(0) != s.charAt(n-1)) return false;
else return isPalindrome(s.substring(1,n-2));
}

We will design a Turing Machine that solves this problem


− in fact, as stated previously, a NDPDA can recognise palindromes

For simplicity, we assume that the string is composed of a's and b's

Algorithmics I, 2022 54
5 Computability 294
The palindrome problem – Turing machine
Formally defining a Turing Machine for even simple problems is hard
− much easier to design a pseudocode version

Recall: for pushdown automata we needed nondeterminism to solve


the palindrome problem
− needed to guess where the middle of the palindrome was

However as we will show using Turing machines we do not need


nondeterminism

Algorithmics I, 2022 55
5 Computability 295
The palindrome problem – Turing machine
Formally defining a Turing Machine for even simple problems is hard
− much easier to design a pseudocode version
TM Algorithm for the Palindrome problem
read the symbol in the current square;
erase this symbol;
enter a state that 'remembers' it;
move tape head to the end of the input;
if (only blank characters remain)
enter the accepting state and halt;
else if (last character matches the one erased)
erase it too;
else
enter rejecting state and halt;
if (no input left)
enter accepting state and halt;
else
move to start of remaining input;
repeat from first step;

Algorithmics I, 2022 56
5 Computability 296
The palindrome problem – Turing machine
We need the following states (assuming alphabet is Σ={#,a,b}):

− s0 reading and erasing the leftmost symbol

− s1, s2 moving right to look for the end, remembering the symbol erased
• i.e. s1 when read (and erased) a and s2 when read (and erased) b

− s3, s4 testing for the appropriate rightmost symbol


• i.e. s3 testing against a and s4 testing against b

− s5 moving back to the leftmost symbol

Algorithmics I, 2022 57
5 Computability 297
The palindrome problem – Turing machine
Transitions:
− from s0, we enter sY if a blank is read, or move to s1 or s2 depending on
whether an a or b is read, erasing it in either case
− we stay in s1/s2 moving right until a blank is read, at which point we
enter s3/s4 and move left
− from s3/s4 we enter sY if a blank is read, sN if the 'wrong' symbol is read,
otherwise erase it, enter s5, and move left
− in s5 we move left until a blank is read, then move right and enter s0
States:
− s0 reading, erasing and remembering the leftmost symbol
− s1, s2 moving right to look for the end, remembering the symbol erased
− s3, s4 testing for the appropriate rightmost symbol
− s5 moving back to the leftmost symbol

Algorithmics I, 2022 58
5 Computability 298
The palindrome problem – Turing machine
A Turing machine can be described by its state transition diagram
which is a directed graph where
− each state is represented by a vertex
− f(s,σ) = (s¢,σ¢,d) is represented by an edge from vertex s to vertex s¢,
labelled (σ➝σ¢,d)
• edge from s to s’ represents moving to state s’
• σ➝σ¢ represents overwriting the symbol σ on the tape with the symbol σ¢
• d represents moving the tape head one square in direction d

TM for the Palindrome problem (see next slide)


− alphabet is Σ = {#,a,b}
− states are S = {s0,s1,s2,s3,s4,s5,sY,sN}

Algorithmics I, 2022 59
5 Computability 299
The palindrome problem – Turing machine

(a®a,R)
(b®b,R)
(#®#,L)
(a®#,R) s1 s3 (a®#,L)
(a®a,L)
(#®#,L) (b®b,L) (b®b,L)

s0 sY sN s5
(#®#,R)
(#®#,L) (a®a,L)
(#®#,L)
(b®#,R) s2 s4 (b®#,L)
(a®a,R)
(b®b,R)

Algorithmics I, 2022 (#®#,R) 60


5 Computability 300
Turing machines - Functions
The Turing machine that accepts language L actually computes the
function f where f(x) equals 1 if x∈L and 0 otherwise

The definition of a TM can be amended as follows:


− to have a set H of halt states
− the function it computes is defined by f(x)=x¢ where
• x is the initial string on the tape
• x¢ is the string on the tape when the machine halts

For example, the palindrome TM could be redefined such that it


deletes the tape contents and
− instead of entering sY it writes 1 on the tape and enters a halt state
− instead of entering sN it writes 0 on the tape and enters a halt state

Algorithmics I, 2022 61
5 Computability 301
Turing machines – Functions - Example
Design a Turing machine to compute the function f(k) = k+1
− where the input is in binary
Example 1
− input: 1 0 0 0 1 0
pattern: replace right-most 0 with 1
− output: 1 0 0 0 1 1
then moving right:
Example 2 if 1 replace with 0 and continue right
− input: 1 0 0 1 1 1 if blank halt
− output: 1 0 1 0 0 0
Example 3 (special case)
− input 1 1 1 1 1 special case: no right-most 0, i.e. only 1’s
in the input pattern:
− output: 1 0 0 0 0 0
replace first blank before input with 1
then moving right:
if 1 replace with 0 and continue right
if blank halt
Algorithmics I, 2022 62
5 Computability 302
Turing machines – Functions - Example
Design a Turing machine to compute the function f(k) = k+1
− where the input is in binary
TM Algorithm for the function f(k) = k+1

move right seeking first blank square;


move left looking for first 0 or blank;
when 0 or blank found
change it to 1;
move right changing each 1 to 0;
halt when blank square reached;

Now to translate this pseudocode into a TM description


− identify the states and specify the transition function

Algorithmics I, 2022 63
5 Computability 303
Turing machines – Functions - Example
We need the following states
− s0: (start state) moving right seeking start of the input (first blank)
− s1: moving left to right-most 0 or blank
− s2: find first 0 or blank, changed it to 1 and moving right changing 1s to 0s
− s3: the halt state

and the following transitions


− from s0 we enter s1 at the first blank
− from s1 we enter s2 if a 0 (found right-most 0) or blank is read
− from s2 we enter s3 (halt) at the first blank

Algorithmics I, 2022 64
5 Computability 304
Transition state diagram

(1➝1,R) (1➝1,L) (1➝0,R)

(0➝1,R) (#➝#,L)
s0 s1 s2 s3
(#➝#,L) (#➝1,R)

(0➝0,R)

Exercise: execute this TM for inputs:


− 1 0 0 1 1 1
− 1 0 0 0 1 0
− 1 1 1 1 1

Algorithmics I, 2022 65
5 Computability 305
Turing recognizable and decidable

A language L is Turing-recognizable if some Turing Machine


recognizes it, that is given an input string x:
− if x∈L, then the TM halts in state sY
− if xÏL, then the TM halts in state sN or fails to halt

A language L is Turing-decidable if some Turing Machine decides it,


that is given an input string x:
− if x∈L, then the TM halts in state sY
− if xÏL, then the TM halts in state sN

Every decidable language is recognizable, but not every recognizable


language is decidable
− e.g., the language corresponding to the Halting Problem
(if a program terminates we will enter sY, but not sN if it does not)
Algorithmics I, 2022 66
5 Computability 306
Turing computable

A function f: Σ*➝Σ* is Turing-computable if there is a Turing


machine M such that
− for any input x, the machine M halts with output f(x)

Algorithmics I, 2022 67
5 Computability 307
Enhanced Turing machines
A Turing machines may be enhanced in various ways:
− two or more tapes, rather than just one, may be available
− a 2-dimensional 'tape' may be available
− the TM may operate non-deterministically
• i.e. the transition 'function’ may be a relation rather than a function
− and many more …

None of these enhancements change the computing power


− every language/function that is recognizable/decidable/computable with
an enhanced TM is recognizable/decidable/computable with a basic TM
• so nondeterminism adds power to pushdown automata but neither to
finite-state automata or Turing machines…
− proved by showing that a basic TM can simulate any of these enhanced
Turing machines

Algorithmics I, 2022 68
5 Computability 308
Turing machines – P and NP
The class P is often introduced as the class of decision problems
solvable by a Turing machine in polynomial time

and the class NP is introduced as the class of decision problems


solvable by a non-deterministic Turing machine in polynomial time
− in a non-deterministic TM the transition function is replaced by a relation
f ⊆ ( (S × Σ) × (S × Σ × {Left, Right}) )
i.e. can make a number of different transitions based on the current state
and the symbol at the tape head
− nondeterminism does to change what can be computed, but can speed up
the computation
Hence to show P ≠ NP sufficient to show a (standard) Turing machine
cannot solve an NP-complete problem in polynomial time

Algorithmics I, 2022 69
5 Computability 309
Counter programs
A completely different model of computation
− all general purpose programming languages have essentially the
same computational power
− a program written in one language could be translated (or compiled) into
a functionally equivalent program in any other

So how simple can a programming language be and still have this


same computational power?

Algorithmics I, 2022 70
5 Computability 310
Counter programs
Counter programs have

• variables of type int

• labelled statements are of the form:


− L : unlabelled_statement

• unlabelled statements are of the form:


− x = 0; (set a variable to zero)
− x = y+1; (set a variable to be the value of another variable plus 1)
− x = y-1; (set a variable to be the value of another variable minus 1)
− if x==0 goto L; (conditional goto where L is a label of a statement)
− halt; (finished)

Algorithmics I, 2022 71
5 Computability 311
Counter programs - Example
A counter program to evaluate the product x·y
(A, B and C are labels)
// initialise some variables
u = 0;
z = 0; // this will be the product of x and y when we finish

A: if x==0 goto C; // end of outer for loop


x = x-1; // perform this loop x times
v = y+1; // each time around the loop we set v to equal y
v = v-1; // in a slightly contrived way

B: if v==0 goto A; // end of inner for loop (return to outer loop)


v = v-1; // perform this loop v times (i.e. y times)
z = z+1; // each time incrementing z
// so really added y to z by the end of the inner loop
if u==0 goto B; // really just goto B (return to start of inner loop)

C: halt;

Algorithmics I, 2022 72
5 Computability 312
The Church-Turing Thesis
So is the Turing machine an appropriate model for the ‘black box’?

The answer is ‘yes’ this is known as the Church-Turing thesis


− it is based on the fact that a whole range of different computational
models turn out to be equivalent in terms of what they can compute
− so it is reasonable to infer that any one of these models encapsulates
what is effectively computable

Put simply it states that everything “effectively computable” is


computable by a Turing machine
− a thesis not a theorem as uses the informal term “effectively computable”
− means there is an effective procedure for computing the value
of the function including all computers/programming languages that we
know about at present and even those that we do not

Algorithmics I, 2022 73
5 Computability 313
The Church-Turing Thesis
So is the Turing machine an appropriate model for the ‘black box’?

The answer is ‘yes’ this is known as the Church-Turing thesis


− it is based on the fact that a whole range of different computational
models turn out to be equivalent in terms of what they can compute
− so it is reasonable to infer that any one of these models encapsulates
what is effectively computable

Equivalent computational models (each can 'simulate' all others)


− Lambda calculus (Church)
− Turing machines (Turing)
− Recursive functions (Kleene)
− Production systems (Post)
− Counter programs and all general purpose programming languages

Algorithmics I, 2022 74
5 Computability 314

You might also like