Lecture Notes
• NameError occurs in Python code if a variable/function cannot be found in the namespace at the
point where it is used.
Week2
• https://www.youtube.com/watch?v=zS1dLE66smA
• Example of calculating time complexity (the function being analyzed appears as an image in the
original notes):
• The complexity of that function can be represented as f(n) = 2n^2 + 2n + 3. To calculate the
asymptotic complexity of f(n), we find another function g(n) and constants c and n0 such that
f(n) <= c*g(n) for all n >= n0, i.e., c*g(n) stays above f(n) beyond n0.
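Since the original function isn't reproduced here, the following is a hypothetical Python function
with a step count of roughly this shape, to make the counting concrete:

    def f(n):
        c = 0                       # 1 step
        for i in range(n):
            for j in range(n):
                c = c + 1           # 2 steps, executed n*n times -> 2n^2
            c = c + 1               # 2 steps, executed n times -> 2n
        return c                    # with setup and return, ~2n^2 + 2n + 3 steps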
• This is represented graphically in the original notes: the curve c*g(n) stays above f(n) for all n
beyond n0.
• A loop in which s is halved on every iteration runs only O(log n) times; in the notes' example it
iterates only 5 times.
• If f1(n) is O(g1(n)) and f2(n) is O(g2(n)), then f1(n) + f2(n) is O(max(g1(n), g2(n)))
• How to find the time complexity of recursive algorithms by unwinding?
For example, consider a recursive function f with the following recurrence.
f(n) = 2*f(n-1) + 1
Unwinding repeatedly: f(n) = 2*f(n-1) + 1 = 4*f(n-2) + 2 + 1 = 8*f(n-3) + 4 + 2 + 1 = … =
2^(n-1)*f(1) + (2^(n-1) - 1).
Since f(1) = 1, f(n) = 2^(n-1) + 2^(n-1) - 1 = 2^n - 1, which is O(2^n).
Search algorithms
Naïve search - best O(1), average O(n), worst O(n)
say, we're searching for v in l. A runnable Python version:

    def naive_search(v, l):
        for item in l:            # check each item in the list
            if item == v:         # found what we're searching for
                return True
        return False              # didn't find it anywhere
Binary search
Implementation 1 - best O(1), average O(logn), worst O(logn)
say, you're searching for v in sorted list l, between indices s and e:

    def binary_search(v, l, s, e):
        if e < s:                                  # empty range: v isn't here
            return False
        m = (s + e) // 2                           # divide the list at the midpoint m
        if l[m] == v:                              # found it (best case)
            return True
        elif v > l[m]:                             # recurse on the right half
            return binary_search(v, l, m + 1, e)
        else:                                      # recurse on the left half
            return binary_search(v, l, s, m - 1)
Sorting algorithms
Selection sort - best O(n^2), average O(n^2), worst O(n^2)
say, you're sorting l in place (assume l has n elements):

    def selection_sort(l):
        n = len(l)
        for i in range(n):
            least = i                        # find the least among remaining elements
            for j in range(i + 1, n):
                if l[j] < l[least]:
                    least = j
            if least != i:                   # if "least" beats the current element, swap
                l[i], l[least] = l[least], l[i]
        return l
For the above algorithm, the time complexity is O(n^2). It's obtained as follows
(n - 1) + (n - 2) + (n - 3) + … + 2 + 1 = n * (n - 1)/2 = O(n^2)
Week3
Quick sort - best O(nlogn), average O(nlogn), worst O(n^2); a sketch follows.
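A minimal Python sketch, assuming the last element is chosen as the pivot (the lectures may use a
different pivot rule):

    def partition(l, s, e):
        pivot = l[e]
        wall = s                                 # items before the wall are < pivot
        for i in range(s, e):
            if l[i] < pivot:
                l[i], l[wall] = l[wall], l[i]
                wall += 1
        l[wall], l[e] = l[e], l[wall]            # pivot lands at the wall index
        return wall

    def quicksort(l, s, e):
        if s < e:
            w = partition(l, s, e)               # pivot is now in its final place
            quicksort(l, s, w - 1)
            quicksort(l, w + 1, e)

For example, quicksort(l, 0, len(l) - 1) sorts l in place.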
Linked Lists
say, you're inserting a new node at the start of a linked list:

    class Node:
        def __init__(self, e):
            self.value = e
            self.next = None

    def add_head(L, e):               # O(1): no traversal needed
        newest = Node(e)
        newest.next = L.head
        L.head = newest
However, deleting the tail will take O(n) time, since in order to set the previous node’s next to
None, previous must be located first, which takes O(n) time. In the case of a doubly-linked list, this
operation will take O(1), since previous is readily available for each node.
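A minimal sketch of that O(n) tail deletion for a singly-linked list (assuming a non-empty list
built from the Node class above):

    def delete_tail(L):
        if L.head.next is None:                 # single node: head is the tail
            L.head = None
            return
        prev = L.head
        while prev.next.next is not None:       # O(n) walk to locate "previous"
            prev = prev.next
        prev.next = None                        # unlink the tail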
Week4 (Graphs)
BFS (Breadth-First Search)
Given an adjacency list and a node, search breadth first.
Initialize the "visited" dictionary for all keys in the list to False.
Consider the first key, add it to the queue. Mark it "visited"
Repeat as long as there are elements in the queue {
Take next element from the queue.
Get the list of values for the key from the adjacency list. These represent the incident nodes.
Append each of them, in the same order, into the queue and mark it "visited", if it isn't already "visited"
}
After the queue is emptied, return the visited dictionary.
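A minimal Python version of the above steps (AList is assumed to be a dict mapping each vertex to
its list of neighbours):

    from collections import deque

    def bfs(AList, start):
        visited = {v: False for v in AList}   # initialize "visited" to False for all keys
        q = deque([start])
        visited[start] = True
        while q:                              # repeat while the queue has elements
            u = q.popleft()
            for w in AList[u]:                # incident vertices, in order
                if not visited[w]:
                    visited[w] = True
                    q.append(w)
        return visited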
Topological sort
Given adjacency list, find the topological sort
Initialize indegree of all keys in adjacency list to 0.
Calculate indegree of all nodes (incident on keys) in the adjacency list //indegree
Add all nodes whose indegree is 0 to the queue //zerodegreeq
Repeat as long as there are items in zerodegreeq {
Take the next element from the queue, and add it to "toposortlist"
Reduce the indegree of each of its neighbors by 1
If a neighbor's indegree drops to 0, add it to the queue //zerodegreeq
}
return “toposortlist”
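A minimal Python sketch of the above (again assuming AList maps each vertex to its out-neighbours):

    from collections import deque

    def topological_sort(AList):
        indegree = {v: 0 for v in AList}         # initialize indegree of all keys to 0
        for u in AList:
            for w in AList[u]:
                indegree[w] += 1
        zerodegreeq = deque(v for v in AList if indegree[v] == 0)
        toposortlist = []
        while zerodegreeq:
            u = zerodegreeq.popleft()
            toposortlist.append(u)
            for w in AList[u]:                   # removing u lowers its neighbours' indegree
                indegree[w] -= 1
                if indegree[w] == 0:
                    zerodegreeq.append(w)
        return toposortlist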
• Number of vertices in a graph is termed Order of the graph; Number of edges in a graph is termed
Size of the graph
• In a complete graph, every pair of distinct vertices is connected by an edge, and the degrees of all
vertices are equal.
• In a connected graph, there is a path between every pair of vertices.
• The maximum number of edges in an undirected graph with n vertices is n(n - 1)/2.
• The maximum number of edges in directed graph with n vertices is n(n - 1)
• Maximum number of edges in a directed acyclic graph (DAG) is n(n - 1)/2.
• Maximum path length in a DAG is (n - 1).
• The maximum number of edges in a disconnected graph with n vertices is (n - 1)(n - 2)/2; any
simple graph with more edges is necessarily connected. (A connected graph can have up to
n(n - 1)/2 edges, i.e., the complete graph.)
• The minimum number of edges in a connected graph with n vertices is (n - 1). This happens when
the graph is a tree.
• Every complete graph is also a connected graph (every pair of distinct vertices is joined directly
by an edge), but a connected graph need not be complete. For example, a path on three vertices is
connected but not complete.
• In any graph (both directed and undirected), the sum of the degrees of all the vertices is equal to
twice the number of edges. Thus, 2m = ∑deg(v). This is known as the handshaking lemma.
• In an undirected graph
o Sum of degrees of all vertices is even.
o Number of vertices with an odd degree is even.
• By the four color theorem, at most 4 colors are needed to color any planar graph; a particular planar graph may need fewer.
• Complexity of DFS is O(n^2) using an adjacency matrix, and O(m + n) using an adjacency list.
• Given adjacency list, the time complexity for finding the indegrees of all n vertices in a graph with m
edges, is O(m + n)
• BFS/DFS can be used to identify the count of connected components.
• In a depth-first traversal (DFS) of a graph G with n vertices, k edges are marked as tree edges. The
number of connected components is (n - k). See https://youtu.be/iPd5Q_MRmgM?&t=6553
• DFS can be used to detect cycles in a graph using pre/post numbering during traversal.
• While performing pre/post numbering using the DFS algorithm on a graph, if (u, v) is an edge of the
graph such that [pre(v), post(v)] contains [pre(u), post(u)], then the graph has a cycle: (u, v) is a
back-edge from u to its ancestor v.
• A DAG will have at least one vertex without incoming edges, and might have more than one.
• A DAG will have at least one topological sort sequence, and might have more than one.
• It is not possible to topologically sort a graph with cycles. Thus, topological sort can be used as a
mechanism to identify if the graph has cycles.
• DFS will always produce same number of tree edges, irrespective of the order in which vertices are
considered. If the graph is connected, it’ll always produce (n-1) edges, if there are n vertices.
• Time complexity of topological sort using an adjacency list, given that m = #edges and n = #vertices, is
O(m + n). Use of an adjacency matrix will increase this to O(n^2).
• Time complexity of finding the longest path in a DAG using an adjacency list, given that m = #edges and
n = #vertices, is O(m + n). Use of an adjacency matrix will increase this to O(n^2).
• Time complexity of an algorithm to compute incoming edges for each vertex, given that m=#edges
and n = #vertices, is O(m + n)
• If (u, v) is an edge of G that is not in the tree T generated as part of a BFS, and d(x) denotes the
shortest distance of vertex x from the starting vertex, then the possible values of d(u) - d(v) are -1, 0,
or 1. If (u, v) is not an edge in G, then u is a leaf in T.
• Problem:
A connected undirected graph has 1081 edges. What are the possible limits on the number of vertices? (A worked solution follows.)
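A worked solution (not spelled out in the original notes): a connected graph on n vertices needs at
least (n - 1) edges, so n - 1 <= 1081, i.e., n <= 1082. A simple graph on n vertices has at most
n(n - 1)/2 edges, so n(n - 1)/2 >= 1081, i.e., n >= 47 (47 * 46/2 = 1081 exactly). Hence
47 <= n <= 1082.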
• Problem:
In a graph with 4 nodes and 6 edges (with weights 1-6), what’s maximum possible weight that a
minimum weight spanning tree can have.
The most important thing to remember in this question is that the graph layout is not
known – it's only known that the graph has 4 nodes and 6 edges (so it must be the complete
graph K4). The original notes show two candidate layouts of the weights; in the worst
arrangement, weight 3 closes a cycle with weights 1 and 2, forcing the MST to use edges
1, 2 and 4. Thus, the maximum possible weight is 1 + 2 + 4 = 7.
Week5
• Dijkstra’s algorithm to find shortest path will not always work, when the graph has negative
weights. Use Bellman-Ford algorithm in this case.
• Bellman-Ford algorithm can work with negative weight edges, but not with negative weight
cycles.
• Bellman-Ford iterates through all edges in a set, each time relaxing the edges.
• Bellman-Ford is run for (n - 1) iterations; after the i-th iteration it has found every shortest
path that uses at most i edges (hops). Thus, the first iteration settles all paths of at most one
hop to each vertex, the second all paths of at most two hops, and so on, until we've covered
(n - 1) iterations.
• Bellman-Ford can detect negative weight cycles by running an nth iteration. If any distance still
reduces after (n - 1) iterations, there's a negative cycle.
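A minimal Python sketch over an edge list (representing edges as (u, v, w) triples is an assumption):

    def bellman_ford(vertices, edges, source):
        dist = {v: float('inf') for v in vertices}
        dist[source] = 0
        for _ in range(len(vertices) - 1):       # (n - 1) rounds of relaxation
            for (u, v, w) in edges:              # relax every edge in the set
                if dist[u] + w < dist[v]:
                    dist[v] = dist[u] + w
        for (u, v, w) in edges:                  # the nth pass: any improvement => cycle
            if dist[u] + w < dist[v]:
                raise ValueError("negative weight cycle")
        return dist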
• Dijkstra and Bellman-Ford can work on directed or undirected graphs.
• Even in a graph where all edge weights are unique, there can be multiple shortest paths between
two vertices. This is because the number of edges might differ between the paths while their total
weights tie.
• Shortest path in a graph with n vertices can have 1 to (n - 1) edges.
• While finding the all-pairs shortest path using Floyd-Warshall algorithm on a directed weighted
graph, if any of the diagonal elements contain a negative weight at the end of the process, then
there exists a negative weighted cycle.
• Dijkstra has a time complexity of O(V^2) using a simple array, and O((E + V)logV) using a binary heap
for the priority queue implementation.
• Bellman-Ford has a time complexity of O(VE). Normally, this is larger than Dijkstra’s since E > V
• All-pairs shortest paths obtained using the Floyd-Warshall algorithm has a time complexity of O(n^3)
• Formula to compute the shortest path from vertex i to j, with k representing an intermediate vertex
(shown as an image in the original notes; the standard recurrence is):
SP(i, j, k) = min(SP(i, j, k-1), SP(i, k, k-1) + SP(k, j, k-1))
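A minimal Python sketch of the recurrence, assuming an n x n weight matrix W with 0 on the diagonal
and float('inf') for missing edges:

    def floyd_warshall(W):
        n = len(W)
        SP = [row[:] for row in W]          # start from direct edge weights
        for k in range(n):                  # allow vertex k as an intermediate
            for i in range(n):
                for j in range(n):
                    SP[i][j] = min(SP[i][j], SP[i][k] + SP[k][j])
        return SP                           # a negative SP[i][i] signals a negative cycle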
Spanning trees
• In a graph with n vertices, a spanning tree is a subset of the graph with the n vertices, and (n - 1)
edges.
• Number of spanning trees that can be constructed from a graph with n vertices and m edges is
C(m, n - 1) minus the number of those (n - 1)-edge subsets that contain a cycle.
• Maximum number of spanning trees that can be constructed from a graph with n vertices is n^(n - 2)
(Cayley's formula). This happens when the graph is complete.
• Adding an edge to a spanning tree will create a cycle.
• In a spanning tree, every pair of nodes is connected by a unique path.
• For a weighted graph, multiple spanning trees can be constructed, but only the minimum-cost ones
are of interest. Using Prim's or Kruskal's algorithm might yield different MSTs, but the cost of
both trees will be the same.
• Both Kruskal's and Prim's can be used with arbitrary (including negative) weights and even negative
weight cycles. Only shortest-path algorithms are affected by negative weight cycles.
• It only makes sense to find minimal spanning trees for undirected graphs.
• Prim's algorithm (a sketch follows):
o Select the edge with the least weight/cost.
o Select the least-cost edge connected to the tree built so far.
o Repeat this until (n - 1) edges have been selected.
o Works very similarly to Dijkstra, except for the relaxation step, which assigns distances[v]
= min(w(u, v), distances[v]) instead of min(distances[u] + w(u, v), distances[v]).
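A minimal O(n^2) Python sketch, assuming an n x n weight matrix W with float('inf') where there is
no edge:

    def prim(W):
        n = len(W)
        visited = [False] * n
        distances = [float('inf')] * n      # cheapest edge linking each vertex to the tree
        distances[0] = 0
        total = 0
        for _ in range(n):
            u = min((v for v in range(n) if not visited[v]), key=lambda v: distances[v])
            visited[u] = True
            total += distances[u]
            for v in range(n):              # Prim's relaxation: edge weight alone
                if not visited[v] and W[u][v] < distances[v]:
                    distances[v] = W[u][v]
        return total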
• In a disconnected graph with multiple components, a spanning tree doesn't exist.
• If there are multiple components in the graph, Kruskal's algorithm can still find a minimum cost
spanning tree for each of the components (a minimum spanning forest). Note that this isn't possible
using Prim's, which only spans the component containing its starting vertex.
• Kruskal's algorithm (a sketch follows):
o Sort edges in increasing order of weight, and select the one with the least weight/cost.
o Repeat this until (n - 1) edges have been selected, each time selecting the least remaining
weight/cost. It's not necessary that the selected edges are connected to each other.
o Make sure, at each selection, that no cycle is formed at any point.
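A minimal Python sketch, using a naive component-label array instead of a full union-find, and
assuming edges are given as (w, u, v) triples with vertices numbered 0..n-1:

    def kruskal(n, edges):
        component = list(range(n))           # component label of each vertex
        mst = []
        for (w, u, v) in sorted(edges):      # increasing order of weights
            if component[u] != component[v]: # adding (u, v) forms no cycle
                mst.append((u, v, w))
                old, new = component[u], component[v]
                for x in range(n):           # naive O(n) union of the two components
                    if component[x] == old:
                        component[x] = new
                if len(mst) == n - 1:
                    break
        return mst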
• Time complexity for Kruskal's/Prim's algorithm is O(n^2). Using a min-heap, this can be reduced to
O(nlogn).
• Kruskal’s could find the “missing” edge costs. https://youtu.be/4ZlRH0eK-qQ?t=1001
• Kruskal’s might be able to find multiple “minimum cost spanning trees”, but the costs of all such
will be equal.
• In general, when the edge weights are unique and the graph is connected, both Prim's and
Kruskal's algorithms will produce the same minimum spanning tree. However, in certain cases
where the edge weights are not unique or the graph is not connected, the algorithms may
produce different trees.
Week6
• In Kruskal,
o Cost of a union (of components) operation per pair of components is n. Considering that
this has to be repeated (n - 1) times, the total cost is n^2; we can improve this to nlogn.
o This improvement can be achieved by maintaining a members and sizes dictionary.
o Sorting of edges is mlogm time, using merge sort.
o Total time complexity can be improved to (m + n) logn time.
• In Prim,
o The major bottleneck is finding the minimum cost edge from the graph, which is O(n).
Considering that this has to be repeated (n - 1) times, the total cost is n^2.
o This can be reduced by keeping the costs in a 2-dimensional priority queue structure.
Each row in the matrix is kept sorted. An additional column keeps track of the number
of items in each row.
o Time complexity to insert into the matrix, while keeping a row sorted, is O(logn)
o Time complexity to remove the minimum from the matrix is O(logn)
o Performing these operations n times, the time complexity is O(nlogn)
o We can store the edge weights in a binary tree (heap), in order to improve the time
complexity.
o Using heap in Prim is advantageous over sorted arrays, since the distances are
recalculated each time an edge is added to the MCST. Since this step is missing in
Kruskal, using heap in Kruskal isn’t advantageous.
• Heaps are complete binary trees, with following constraints,
o Structural constraint, wherein the tree has to be filled up level by level, starting with
level 0. Within each level, the nodes must be filled up from left to right. In other words,
only the lowest level (and only on its right-hand side) can be incomplete.
o Value constraint, wherein each node's value must satisfy the heap property with respect
to its children (at least as high as the children in a max-heap; reversed for a min-heap).
• In the max-heap, value of each node (except the leaves) >= its children.
• In the min-heap, value of each node (except the leaves) <= its children.
• While inserting a node into the heap, every node must be reconciled with its parent for its
priority. If the node’s priority is more than the parent, it must be swapped with the parent.
• The number of nodes that fill up levels 0 through k is 2^0 + 2^1 + 2^2 + 2^3 + … + 2^k = 2^(k+1) - 1
• If we have n nodes in the heap, the number of levels cannot exceed log(n + 1). This is also the
maximum height you'll have to navigate and perform swap operations, and hence the time
complexity for inserting a node into a heap is O(logn), where n represents the number of nodes.
• To “get” the highest priority node (delete_max) from a max_heap, follow these steps:
o Remove the root node; its value is what gets returned at the end.
o Move the last node in the tree (right-most at the last level) into the root's position.
o Find the largest among that node's children; if it is larger than the node, swap them.
o Repeat this until a leaf is reached or no swap is needed.
• The time complexity for the delete_max operation is also O(logn)
• If the nodes in a heap are stored in a list from left to right, then (2i + 1) and (2i + 2) are the
indices to the children of the ith node. Similarly, parent of ith node has an index (i - 1)//2
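A minimal Python sketch of a max-heap stored in a list, using the index formulas above:

    def insert(heap, v):
        heap.append(v)
        i = len(heap) - 1
        while i > 0 and heap[(i - 1) // 2] < heap[i]:    # sift up past smaller parents
            heap[i], heap[(i - 1) // 2] = heap[(i - 1) // 2], heap[i]
            i = (i - 1) // 2

    def delete_max(heap):
        maxval = heap[0]
        heap[0] = heap[-1]                   # move the last node into the root's position
        heap.pop()
        i = 0
        while True:
            largest = i
            for c in (2 * i + 1, 2 * i + 2): # children of node i
                if c < len(heap) and heap[c] > heap[largest]:
                    largest = c
            if largest == i:                 # reached a leaf or no swap needed
                break
            heap[i], heap[largest] = heap[largest], heap[i]
            i = largest
        return maxval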
• Since the insert and delete_max operations are each done n times in Prim's, the total complexity is
O(nlogn)
• To build a heap from an unsorted array, one can repeatedly "insert" each element, which costs
O(nlogn). A bottom-up build (calling "heapify" on each node from the last internal node up to the
root) does it in O(n) time: although a single "heapify" is O(logn) in the worst case, most nodes sit
near the bottom and need far less work, and the total telescopes to O(n).
• A binary tree (not a binary search tree) is a structure that has at most two children per parent. In the
extreme case, this can degenerate into a linear structure.
• A binary tree is said to be complete, when each level is complete and can hold no more children.
• A strict binary tree with n leaves will have (n - 1) internal nodes, including the root.
• A heap is said to be an almost-complete binary tree, since, except for the bottom-most level (leaves),
all other levels in a heap are complete. In a heap, it's mandatory that within every level, nodes are
inserted from left to right, with no holes in between.
• Searching through a heap for an item will take O(n) time, because there’s no order to storing the
elements in a heap other than what parent-child relationship requires. Note that searching
through a “balanced” binary search tree is O(logn).
• Extracting the largest element of a BST is O(n) in the worst case: the largest is the right-most
node, and in an unbalanced tree the right spine may contain all n nodes.
• Heap and Binary search tree are two different data structures, and it’s not necessary that a heap
is a BST or a BST is a heap.
• Binary search trees are best implemented using recursion. There are 3 traversal mechanisms:
in-order, pre-order and post-order (depicted as a picture in the original notes; a sketch follows).
• In-order is left-root-right, pre-order is root-left-right, post-order is left-right-root.
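A minimal Python sketch of the three traversals (assuming nodes with value/left/right attributes and
None for an empty tree):

    def inorder(t):                      # left-root-right
        if t is None:
            return []
        return inorder(t.left) + [t.value] + inorder(t.right)

    def preorder(t):                     # root-left-right
        if t is None:
            return []
        return [t.value] + preorder(t.left) + preorder(t.right)

    def postorder(t):                    # left-right-root
        if t is None:
            return []
        return postorder(t.left) + postorder(t.right) + [t.value]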
• In the case of BST, we can reconstruct the original tree, in the following cases.
o if only pre-order traversal is known, first node of which is always the root.
o if only post-order traversal is known, last node of which is always the root.
• Note that it’s not possible to reconstruct the tree, if only in-order traversal is known
• In-order traversal of a BST is always sorted.
• However, in the case of general binary tree, it’s not possible to reconstruct original tree, from
pre-order or post-order traversals, unless also supplied with in-order traversal order.
• Note that in the case of a traversal (pre-order, in-order or post-order), the time complexity
remains O(n).
• Every node in the binary search tree has a value, a left tree and a right tree. All leaves of the
tree have an empty node below it, in order to make it simpler for the recursion to work (serves
as a base case).
• Inserting a value into the binary search tree tries to locate the value to be inserted in the tree. If
the value is found, then it returns without doing anything (No duplicates are allowed in BST). If
not found, it sets its value and creates a new tree (node) on the left or the right of the current
node.
• It's possible to have an unbalanced tree, for which the complexity is O(n). If the tree is
balanced, all operations (including insert and delete) are O(logn)
• In a strict binary tree (where every node has 0 or 2 children), the following property holds -
#leaves = (#nodes + 1)/2. Thus, if there are 21 leaves in a strict binary tree, it has 41 nodes.
https://discourse.onlinedegree.iitm.ac.in/t/no-of-nodes/56693/2
• A strict binary tree (each internal node has two children) with n leaves has (n - 1) internal nodes.
Similarly, a binary tree (strict or otherwise) with n leaves has (n - 1) internal nodes with 2
children. Both of these are derivable from the previous note.
Week7
• Minimum number of nodes in AVL tree of height h is S(h) = S(h-2) + S(h-1) + 1 where S(0) = 0 and
S(1) = 1. In order to obtain this easily, use the formula S(h) = fib[h+2] - 1. For example, the
minimum number of nodes required to construct an AVL tree with height 12, S(12) = fib(14) - 1.
Since, fib(14) = 377. S(12) = 376.
• Similarly, the maximum number of nodes in an AVL tree of height h is 2^h - 1 (under the same height
convention). For example, the maximum number of nodes in an AVL tree with height 12 is 2^12 - 1 = 4095
• Huffman encoding uses variable length encoding of letters.
• In this scheme, no letter's code is a prefix of another letter's code; otherwise, unambiguous
decoding isn't possible.
• The calculation involved (repeatedly merging the two lowest-frequency letters) is shown as a worked
figure in the original notes; a sketch follows below.
• Binary trees can be used to represent encoding, where letters are leaves and path to leaf
describes encoding – 0 is left and 1 is right.
• In the above representation, every node in the tree has 0 or 2 children (no node with a single
child), thus constructing it as a full tree. In other words, none or two letters can be represented
with a certain number of bits.
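A minimal Python sketch of building the Huffman tree with heapq (representing the tree as nested
(left, right) tuples is an assumption):

    import heapq
    from itertools import count

    def huffman(freq):                       # freq: dict mapping letter -> frequency
        tiebreak = count()                   # avoids comparing subtrees on equal frequency
        h = [(f, next(tiebreak), c) for c, f in freq.items()]
        heapq.heapify(h)
        while len(h) > 1:
            f1, _, t1 = heapq.heappop(h)     # the two lowest-frequency subtrees
            f2, _, t2 = heapq.heappop(h)
            heapq.heappush(h, (f1 + f2, next(tiebreak), (t1, t2)))  # left = 0, right = 1
        return h[0][2]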
Week8
• Given two orderings of the same n symbols, an inversion is a pair (i, j) such that i comes before j
in one ordering but after j in the other. The maximum number of possible inversions is n(n - 1)/2,
where n is the number of elements.
• In order to count all inversions, it'll take O(n^2) time using the naïve method. With the divide and
conquer approach (like merge-sort), it'll take O(nlogn) time; a sketch follows.
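A minimal Python sketch of counting inversions while merge-sorting a list:

    def count_inversions(a):
        if len(a) <= 1:
            return a, 0
        mid = len(a) // 2
        left, linv = count_inversions(a[:mid])
        right, rinv = count_inversions(a[mid:])
        merged, inv = [], linv + rinv
        i = j = 0
        while i < len(left) and j < len(right):
            if left[i] <= right[j]:
                merged.append(left[i])
                i += 1
            else:
                merged.append(right[j])
                j += 1
                inv += len(left) - i         # right[j] inverts all remaining left items
        merged += left[i:] + right[j:]
        return merged, inv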
• The naïve algorithm for integer multiplication has the recurrence relation T(n) = 4T(n/2) + n = O(n^2)
• Karatsuba's algorithm for integer multiplication has the recurrence relation T(n) = 3T(n/2) + n =
O(n^(log2 3)) ≈ O(n^1.585)
• Quick-select uses the "wall index" returned by the partitioning algorithm: the element at the wall is
in its final sorted position, so quick-select recurses on one side until the wall index equals k, at
which point the kth least element has been found.
• The time complexity of Quick-select algorithm is O(n) on average and O(n^2) in the worst case,
and is dependent on the partitioning strategy.
• The worst case time complexity of Fast-select algorithm is O(n).
• Time complexity of the brute force closest pair problem is O(n^2), and using the divide-and-conquer
approach it is O(nlogn)
• Time complexity of recurrence-based algorithms at level i can generally be represented as r^i *
f(n/c^i), where r is the number of recursive calls per level, c is the division factor and f(n)
represents the time spent on non-recursive work.
• For a recurrence relation T(n) = 2T(n/8) + O(n), the time complexity is O(n). This is the decreasing
case above.
• For a recurrence relation T(n) = 4T(n/4) + O(n), the time complexity is O(nlogn). This is the equal
case above. Example: 4-way merge sort.
• For a recurrence relation T(n) = T(n - 1) + O(n), the time complexity is O(n^2). Example: worst case of
quicksort.
• For a recurrence relation T(n) = T(n/2) + O(1), the time complexity is O(logn). Example: binary
search.
• For a recurrence relation T(n) = T(n/2) + O(n), the time complexity is O(n). This is the decreasing
case above. Example: finding the kth smallest (largest) element using fastselect. In this case, the
algorithm uses divide and conquer, and uses MoM (median of medians) to find an approximate middle
element.
Week9
• Two requirements for a problem to be solvable using dynamic programming are:
o Optimal substructure
o Overlapping subproblems
• Inductive structure for the Grid Paths problem (shown as an image in the original notes); the standard
recurrence is paths(i, j) = paths(i - 1, j) + paths(i, j - 1), with paths = 1 along the bottom row and
left column.
• Assuming that the grid has size m x n, the time complexity of a memoized or dynamic programming
solution for the Grid Paths problem is O(mn)
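A minimal bottom-up Python sketch of the recurrence:

    def grid_paths(m, n):
        # paths[i][j]: number of paths from (0, 0) to (i, j) moving only right/up
        paths = [[1] * (n + 1) for _ in range(m + 1)]   # a single path along each border
        for i in range(1, m + 1):
            for j in range(1, n + 1):
                paths[i][j] = paths[i - 1][j] + paths[i][j - 1]
        return paths[m][n]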
• Inductive structure for the Longest Common Sub-word (LCW) problem (shown as an image in the original
notes); the standard recurrence is LCW(i, j) = 1 + LCW(i + 1, j + 1) if u[i] == v[j], else 0, with the
answer being the maximum over all (i, j).
• Assuming that the words have m and n characters respectively, the time complexity of a
memoized/dynamic programming solution for LCW is O(mn)
• Inductive structure for the Longest Common Subsequence (LCS) problem (shown as an image in the
original notes); the standard recurrence is LCS(i, j) = 1 + LCS(i + 1, j + 1) if u[i] == v[j], else
max(LCS(i + 1, j), LCS(i, j + 1)).
• Assuming that the words have m and n characters respectively, the time complexity of a
memoized/dynamic programming solution for LCS is O(mn)
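A minimal bottom-up Python sketch of the LCS recurrence:

    def lcs(u, v):
        m, n = len(u), len(v)
        L = [[0] * (n + 1) for _ in range(m + 1)]   # L[i][j]: LCS length of u[i:], v[j:]
        for i in range(m - 1, -1, -1):
            for j in range(n - 1, -1, -1):
                if u[i] == v[j]:
                    L[i][j] = 1 + L[i + 1][j + 1]
                else:
                    L[i][j] = max(L[i + 1][j], L[i][j + 1])
        return L[0][0]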
• Inductive structure for the Edit Distance problem (shown as an image in the original notes); the
standard recurrence is ED(i, j) = ED(i + 1, j + 1) if u[i] == v[j], else 1 + min(ED(i + 1, j + 1),
ED(i + 1, j), ED(i, j + 1)).
• Assuming that the documents have m and n characters respectively, the time complexity of a
memoized/dynamic programming solution for the Edit Distance problem is O(mn)
• In order to multiply two matrices whose sizes are m x n and n x p respectively, it takes O(mnp) time.
• The associativity chosen when multiplying a sequence of matrices affects the time complexity; a worked example follows.
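A worked example with hypothetical dimensions: for A (10 x 100), B (100 x 5) and C (5 x 50),
computing (A x B) x C costs 10*100*5 + 10*5*50 = 7,500 scalar multiplications, whereas A x (B x C)
costs 100*5*50 + 10*100*50 = 75,000 – a tenfold difference for the same product.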
Week10
• Boyer-Moore
• If the text has m characters and pattern has n characters, number of comparisons in the worst-case
of Boyer-Moore is (m - n + 1) * n. For example, see this post
https://discourse.onlinedegree.iitm.ac.in/t/practice-q-6-isnt-the-worst-case-complexity-of-boyer-
moore-m-n/84668
• In Rabin-Karp, the problem of matching text with a pattern is reduced to a numeric match, by replacing
each letter of the alphabet with a number. This means comparison of text and pattern doesn't have to
be done character by character; each window of the text can be compared to the pattern using simple
arithmetic, in O(1) rather than O(n) per window.
• The algorithm converts the text into blocks (windows) before doing the arithmetic.
• However, the issue is that this depends on the size of the alphabet. If there were only 10 characters in
the alphabet, you could use the numbers 0-9, but with 80+ characters (ASCII) the gain in time
complexity would be offset by the cost of doing the above arithmetic with very large numbers.
So, the algorithm resorts to using modulo arithmetic on the text block and the pattern – modulo a
specific prime (say, 13). If the modulo of the text block doesn't match that of the pattern, the
pattern is definitely not in that block; if it does match, the block must still be verified character
by character, since different numbers can share a remainder. Repeat with the next text block. In the
best case, the time complexity of Rabin-Karp is O(m + n), but in the worst case it can be O(mn). A
sketch follows.
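A minimal Python sketch with a rolling hash (using the prime 13 from the note; treating characters by
their ord values is an assumption):

    def rabin_karp(text, pattern, q=13, d=256):
        m, n = len(text), len(pattern)
        if n > m:
            return []
        h = pow(d, n - 1, q)                  # weight of a window's leading character
        p_hash = t_hash = 0
        for i in range(n):                    # hashes of the pattern and the first window
            p_hash = (d * p_hash + ord(pattern[i])) % q
            t_hash = (d * t_hash + ord(text[i])) % q
        matches = []
        for s in range(m - n + 1):
            if p_hash == t_hash and text[s:s + n] == pattern:   # verify on hash match
                matches.append(s)
            if s < m - n:                     # roll the hash to the next window
                t_hash = (d * (t_hash - ord(text[s]) * h) + ord(text[s + n])) % q
        return matches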
• To draw the graph that represents the finite automaton for matching text against a given
pattern, refer to https://www.youtube.com/watch?v=kuMuFu9IRtw. The process elaborated in this
video helps create a table that can be converted into a graph. Pattern matching a given text is
done by tracing through this graph. The input text is deemed to have matched the pattern only if
the last node (accepting state) of the graph can be reached while using the input text to trace
through the states depicted in the graph.
• To produce the LPS array (used in the KMP algorithm), part of the process depicted in the above video
needs to be followed.
Week11
• Ford-Fulkerson algorithm - https://www.youtube.com/watch?v=GiN3jRdgxU4
• Time complexity of the Ford-Fulkerson algorithm, when BFS/DFS is used to find an augmenting path, is
O(E * max_flow)
• A subset S of V is called an independent set of G if no two vertices in S are adjacent.
• minimum vertex cover size + maximum independent set size = total number of vertices
• In linear programming, variables can be multiplied by constants, but we cannot multiply a variable
by another variable in the objective function or the constraints.
• P includes those problems that are solvable in polynomial time. NP includes those whose solutions
can be verified in polynomial time (not, as the name might suggest, those that are "solvable in
non-polynomial time"). Every problem in P can trivially be verified in polynomial time, so P is a
subset of NP. However, it has not yet been proved whether P is a proper subset of NP, or P equals NP.
The prevailing assumption, however, is that P != NP.
• If problem A is reducible to problem B, and B is solvable, then A is solvable too.
• If problem A is reducible to problem B, it implies that problem B is at least as hard as problem A. This
is because if we can solve problem B, then we can also solve problem A by transforming it into
problem B and then solving it. So if problem B is easy to solve, then so is problem A. Conversely, if
problem A is hard to solve, then so is problem B.
• In A -> B (A reduced to B), it's not possible to infer the exact time complexity of A unless more
details of the transformation/reduction are known. It's only possible to infer that B is at least as
hard as A (its time complexity is at least A's, up to the cost of the reduction).
• NP-hard problems are the class of problems that are at least as hard as every NP problem. It's
possible that verifying these, too, takes non-polynomial time. Problems that are NP-hard but whose
solutions can be verified in polynomial time are called NP-complete problems.
• NP-complete problems lie in the intersection between NP and NP-hard problems. They’re at least as
hard as all NP problems.
Appendix
Check out this spreadsheet for the time complexities of some common algorithms learnt in the course.