
APS1070, Fall 2022: Lecture 2

Samin Aref

Based on course material by Mark C. Wilson


Class representative of APS1070
If you have any complaints or suggestions about this course, you can
email me directly. Alternatively, email one of the class representatives
who will then communicate your (anonymous) message to me and the
instruction team.
Haoyu (Bruce) Li <[email protected]>
Mengbo (Jason) Zhan <[email protected]>
Ayesha Patnaik <[email protected]>

Conflict with the time of the midterm


If you have a schedule conflict with Oct 21st, 9:00-13:00, please
reply to this Piazza post by Sep 26:
https://piazza.com/class/l4pso5v31t025i/post/22
What is Lecture 2 about?

▶ An introduction to the analysis of algorithms and data


structures.
▶ Asymptotic complexity analysis.
▶ Hashing and the dictionary abstract data type.
▶ Prerequisite: fundamental concepts from discrete
mathematics (sets, functions, inequalities, limits), covered as
“background” in this slide deck.
▶ This is a “theory” lecture but we also assess ability to
implement these abstract structures and algorithms in
projects.
Background: Mathematics review
Sets

▶ A set is an unordered collection of objects (called elements).


▶ Important sets:
▶ N = {0, 1, 2, . . . }, the set of natural numbers.
▶ R, the set of real numbers.
▶ ∅ = {}, the empty set having no elements.
▶ Notation: X = {2, 3, 5, 7, 11} or
X = {x ∈ N : x is prime and x < 12}.
▶ Operations:
▶ A ∩ B = {x : x ∈ A and x ∈ B}
▶ A ∪ B = {x : x ∈ A or x ∈ B}
▶ |A| is the number of elements of A.
Functions

▶ A function is a mapping f from a set X (the domain) to a set
Y (the codomain) such that every x ∈ X maps to a unique
y ∈ Y.
▶ Important functions from R to R:
▶ Power functions f(x) = x, f(x) = x^2, f(x) = x^3, etc.
▶ Exponential functions f(x) = 2^x, f(x) = (1.5)^x, etc.
▶ Logarithm (inverse of exponential) has a different domain.
▶ Ceiling rounds up to the nearest integer, e.g. ⌈3.7⌉ = 4. Floor
rounds down, e.g. ⌊3.7⌋ = 3 = ⌊3⌋.
Basic properties of important functions

▶ For a > 1 the exponential f(x) = a^x is increasing and
positive, and satisfies a^(x+y) = a^x a^y for all x, y ∈ R.
▶ For a > 1 the logarithm log_a is increasing and satisfies
log_a(xy) = log_a x + log_a y for all x, y > 0.
▶ We write ln = log_e and lg = log_2. Note that
log_a x = (log_a b)(log_b x).
▶ Derivatives: d/dx e^x = e^x, d/dx ln x = 1/x.
Sums

▶ A sequence is a function f : N → R.
▶ Notation: f_0, f_1, f_2, ... where f_i = f(i).
▶ Sum: f_m + f_{m+1} + ... + f_n = ∑_{i=m}^{n} f_i.
▶ Important sums:

∑_{i=1}^{n} i = n(n + 1)/2,        ∑_{i=m}^{n} a^i = (a^{n+1} − a^m)/(a − 1).
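
These identities are easy to sanity-check numerically; a throwaway Python sketch (the values of n, m, a are arbitrary illustrative choices):

n, m, a = 10, 3, 2
assert sum(range(1, n + 1)) == n * (n + 1) // 2
assert sum(a**i for i in range(m, n + 1)) == (a**(n + 1) - a**m) // (a - 1)
print("both identities hold for n =", n)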
Part 1

Asymptotic Analysis of Algorithms


Warm call: Why should we analyse algorithms?
What is an algorithm?

▶ An algorithm is a sequence of clearly stated rules that specify


a step-by-step method for solving a given problem.
▶ The rules should be unambiguous and sufficiently detailed
that they can be carried out without creativity.
▶ Examples of algorithms: a (sufficiently detailed) cake recipe;
the primary-school method for multiplication of decimal integers;
quicksort.
▶ Algorithms predate electronic computers by thousands of years
(example: Euclid’s greatest common divisor algorithm).
▶ A program is a sequence of computer instructions
implementing the algorithm.
Why analyse an algorithm?

▶ Experience shows that enormously more performance gains


can be achieved by optimizing the algorithm than by
optimizing other factors such as:
▶ processor
▶ language
▶ compiler
▶ human programmer
▶ The analysis process often results in us discovering simpler
algorithms.
▶ Many algorithms have parameters that must be set before
implementation. Analysis allows us to set the optimal values.
▶ Algorithms that have not been analysed for correctness often
lead to major bugs in programs.
Fibonacci numbers

This sequence is recursively defined by


F(n) = n                        if n = 0 or n = 1;
F(n) = F(n − 1) + F(n − 2)      if n ≥ 2.

This immediately suggests a recursive algorithm.


Algorithm 1 Slow method for computing Fibonacci numbers
1: function slowfib(integer n)
2: if n < 0 then return 0
3: else if n = 0 then return 0
4: else if n = 1 then return 1
5: else return slowfib(n − 1) + slowfib(n − 2)
Improving over slowfib

▶ The algorithm slowfib is obviously correct, but does a lot of


repeated computation. With a small (fixed) amount of extra
space, we can do better, by working from the bottom up
instead of from the top down.
Algorithm 2 Fast method for computing Fibonacci numbers
1: function fastfib(integer n)
2: if n < 0 then return 0
3: else if n = 0 then return 0
4: else if n = 1 then return 1
5: else
6: a←1 ▷ stores F (i) at bottom of loop
7: b←0 ▷ stores F (i − 1) at bottom of loop
8: for i ← 2 to n do
9: t←a
10: a ← a+b
11: b←t
12: return a
Analysis of the fast algorithm

▶ Even a bad implementation in a slow interpreted language on


an ancient machine of fastfib will beat the best
implementation of slowfib, once n becomes big enough.
Colab time

▶ Implement fastfib and slowfib in Python. Which is faster


for which values of n? What is the maximum value of n for
which each gives a result in a reasonable time?
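
One possible Python translation of the two pseudocode routines, with a crude timing harness (the timing loop is illustrative and not part of the slides):

import time

def slowfib(n):
    """Direct recursive method (Algorithm 1): exponential time."""
    if n < 0:
        return 0
    if n <= 1:
        return n
    return slowfib(n - 1) + slowfib(n - 2)

def fastfib(n):
    """Bottom-up method (Algorithm 2): linear time, constant extra space."""
    if n < 0:
        return 0
    if n <= 1:
        return n
    a, b = 1, 0  # a stores F(i), b stores F(i-1) at the bottom of the loop
    for _ in range(2, n + 1):
        a, b = a + b, a
    return a

for f in (slowfib, fastfib):
    start = time.perf_counter()
    f(30)
    print(f.__name__, time.perf_counter() - start, "seconds")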
How to measure running time?
Basic performance measures

There are three main characteristics of an algorithm designed to


solve a given problem.
▶ Domain of definition: the set of legal inputs.
▶ Correctness: it gives correct output for each legal input. This
depends on the problem we are trying to solve, and can be
tricky to prove.
▶ Resource use: usually computing time and memory space.
▶ This depends on the input, and on the implementation
(hardware, programmer skill, compiler, language, ...).
▶ It usually grows as the input size grows.
▶ There is a tradeoff between resources (for example, time vs
space).
▶ Running time is usually more important than space use.
In this lecture we mainly consider how to estimate resource use of
an algorithm, ignoring implementation as much as possible.
Warm call: How to compare algorithms?

▶ Given an algorithm A, the actual running time on a given


input ι depends on many implementation details. Can we
compare algorithms when we do not know the exact input and
details of implementation?
▶ The running time usually grows with the size of the input.
Running time for very small inputs is not usually important; it
is large inputs that cause problems if the algorithm is
inefficient.
Running time: input

▶ We define a notion of input size on the data. This is a


positive integer.
▶ Example: number of records in a database to be sorted.
Elementary operations

▶ We use the concept of elementary operation as our basic


measuring unit of running time. This is any operation whose
execution time does not depend on the size of the input.
▶ The running time T (ι) of algorithm A on input ι is the
number of elementary operations used when ι is fed into A.
Running time vs input size

Function         Notation    10      100      1000     10^7
Constant         1           1       1        1        1
Logarithmic      log n       1       2        3        7
Linear           n           1       10       100      10^6
“Linearithmic”   n log n     1       20       300      7 × 10^6
Quadratic        n^2         1       100      10000    10^12
Cubic            n^3         1       1000     10^6     10^18
Exponential      2^n         1       10^27    10^298   10^3010296

Note: there are about 3 × 10^18 nanoseconds in a century.


Warm call

▶ Algorithm A takes n^2 elementary operations to sort a file of n
lines, while Algorithm B takes 50 n log n. Which algorithm is
better when n = 10?
▶ When n = 10^6? How do we decide which algorithm to use?
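
A quick numeric comparison of the two operation counts (a sketch; base-2 logarithms are an assumption, since the slide leaves the base unspecified):

import math

def ops_A(n):
    return n ** 2

def ops_B(n):
    return 50 * n * math.log2(n)  # log base assumed to be 2

for n in (10, 10**6):
    winner = "A" if ops_A(n) < ops_B(n) else "B"
    print(n, ops_A(n), round(ops_B(n)), "-> algorithm", winner)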
How to measure running time?
Running time techniques: easy

▶ Running time of disjoint blocks adds.


▶ Running time of nested loops with non-interacting variables
multiplies.
▶ Example: single, double, triple loops with fixed number of
elementary operations inside the inner loop yields linear,
quadratic, cubic running time.
Algorithm 3 Swapping two elements in an array
Require: 0 ≤ i ≤ j ≤ n − 1
function swap(array a[0..n − 1], integer i, integer j)
t ← a[i]
a[i] ← a[j]
a[j] ← t
return a

▶ Running time?
▶ This is a constant time algorithm.
Algorithm 5 Finding the maximum in an array
function findmax(array a[0..n − 1])
k←0 ▷ location of maximum so far
for j ← 1 to n − 1 do
if a[k] < a[j] then
k←j
return k

▶ Running time?
▶ This is a linear time algorithm, since it makes one pass
through the array and does a constant amount of work each
time.
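
A direct Python translation of findmax (a sketch mirroring the pseudocode):

def findmax(a):
    """Return the index of the maximum element: one pass, linear time."""
    k = 0  # location of maximum so far
    for j in range(1, len(a)):
        if a[k] < a[j]:
            k = j
    return k

print(findmax([3, 1, 4, 1, 5, 9, 2]))  # prints 5, the index of 9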
Snippet: other loop increments

Algorithm 7 Example: exponential change of variable in loop


i←1
while i ≤ n do
i←2∗i
print i

▶ Running time?
▶ This runs in logarithmic time because i doubles about lg n
times until reaching n.
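
The same snippet in Python, counting iterations to confirm the logarithmic behaviour (a quick sketch):

def count_doublings(n):
    """Count iterations of the doubling loop; roughly lg n of them."""
    i, count = 1, 0
    while i <= n:
        i = 2 * i
        count += 1
    return count

print(count_doublings(1000))  # prints 10, since 2**10 = 1024 > 1000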
Example: nested loops

Algorithm 9 Snippet: Nested loops


for i ← 1 to n do
for j ← i to n do
print i + j

▶ Running time?
▶ The first iteration of the outer loop takes n elementary
operations. The second iteration of the outer loop takes n − 1
operations and so forth. Therefore, the algorithm takes
n + (n − 1) + · · · + 1 = n(n + 1)/2 elementary operations for
input size n.
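
Counting the inner-loop executions in Python confirms the triangular-number total (a sketch):

def nested_count(n):
    """Count how many times the inner statement of the nested loops runs."""
    count = 0
    for i in range(1, n + 1):
        for j in range(i, n + 1):
            count += 1
    return count

n = 100
assert nested_count(n) == n * (n + 1) // 2  # 5050 for n = 100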
Warm call

▶ What do we know about the running time T (n) of slowfib


for term n of the Fibonacci sequence?
▶ slowfib makes F (n) function calls each of which involves a
constant number of elementary operations. It turns out that
F (n) grows exponentially in n, so this is an exponential time
algorithm.
Asymptotic notation
Asymptotic comparison of functions

▶ In order to compare running times of algorithms we want a


way of comparing the growth rates of functions.
▶ We want to see what happens for large values of n — small
ones are not relevant.
▶ We are not usually interested in constant factors and only
want to consider the dominant term.
▶ The standard mathematical approach is to use asymptotic
notation O, Ω, Θ which we will now describe.
Big-O notation

▶ Suppose that f and g are functions from N to R, which take


on nonnegative values.
▶ Say f is O(g) (“f is Big-Oh of g”) if there is some C > 0 and
some n0 ∈ N such that for all n ≥ n0 , f (n) ≤ Cg(n).
Informally, f grows at most as fast as g.
▶ Say f is Ω(g) (“f is big-Omega of g”) if g is O(f ).
Informally, f grows at least as fast as g.
▶ Say f is Θ(g) (“f is big-Theta of g”) if f is O(g) and g is
O(f ).
Informally, f grows at the same rate as g.
▶ Note that we could always reduce n0 at the expense of a
bigger C but it is often easier not to.
Asymptotic comparison — examples

▶ Every linear function f (n) = an + b, a > 0, is O(n).


▶ Proof: an + b ≤ an + |b| ≤ (a + |b|)n for n ≥ 1.
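
For one more concrete instance (a worked example of our own, not from the slides): f(n) = 3n + 5 is O(n), since with C = 8 and n0 = 1 we have 3n + 5 ≤ 3n + 5n = 8n for all n ≥ 1.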
What happens if there are many inputs of a given size?

▶ We usually don’t want to have to consider the distribution of


running time over all possible inputs of a given size. There
may be (infinitely) many inputs of a given size, and running
time may vary widely on these.
▶ For example, for sorting the integers 1, . . . , n, there are n!
possible inputs, and this is large even for n = 10.
▶ We consider statistics of T (ι) such as worst-case W (n) or
average-case A(n) running time for instances ι of size n.
What are the pros and cons of worst and average case
analysis?

▶ Worst-case bounds are valid for all instances: this is important


for mission-critical applications.
▶ Worst-case bounds are often easier to derive mathematically.
▶ Worst-case bounds often hugely exceed typical running time
and have little predictive or comparative value.
▶ Average-case running time is often more realistic. Quicksort is
a classic example.
▶ Average-case analysis requires a good understanding of the
probability distribution of the inputs.
▶ Conclusion: a good worst-case bound is always useful, but it
is just a first step and we should aim to refine the analysis for
important algorithms. Average-case analysis is often more
practically useful, provided the algorithm will be run on
“random” data.
Why can constants often be ignored?
▶ A linear time algorithm when implemented will take at most
An + B seconds to run on an instance of size n, for some
implementation-specific constants A, B.
▶ For large n, this is well approximated by An. Small n are not
usually of interest anyway, since almost any algorithm is good
enough for tiny instances.
▶ No matter what A is, we can easily work out how the running
time scales with increasing problem size (linearly!).
▶ The difference between a linear and a quadratic time
algorithm is usually huge, no matter what the constants are.
For large enough n, a linear time algorithm will always beat a
quadratic time one.
▶ Conclusion: in practice we often need to make only crude
distinctions. We only need to know whether the running time
scales like n, n^2, n^3, n log n, 2^n, and so on. If we need finer
distinctions, we can do more analysis.
Can we always ignore constants?

▶ When we want to choose between two good algorithms for


the same problem (“is my linear-time algorithm faster than
your linear-time algorithm?”), we may need to know
constants. These must be determined empirically.
▶ For important algorithms that will be used many times, it is
worth being more precise about the constants. Even small
savings will be worth the trouble.
▶ An algorithm with running time 10^−10 n^2 is probably better in
practice than one with running time 10^10 n, since the latter
will eventually beat the former, but only on instances of size
at least 10^20, which is rarely met in practice.
▶ Conclusion: we should have at least a rough feel for the
constants of competing algorithms. However, in practice the
constants are usually of moderate size.
Summary

▶ Our goal is to find an asymptotic approximation for the (worst


or average case) running time of a given algorithm. Ideally we
can find a simple function f and prove that the running time
is Θ(f (n)).
▶ The main f (n) occurring in applications are
log n, n, n log n, n^2, n^3, 2^n, and each grows considerably faster
than the previous one. The gap between n and n log n is the
smallest.
Part 2

Dictionary Abstract Data Type (ADT)


Dictionary ADT

▶ An abstract data type that supports operations to insert, find,


and delete an element with given search key.
▶ Used for databases. Other names are table ADT and
associative array.
▶ There are many ways in which this could be implemented:
▶ unsorted list;
▶ sorted list;
▶ binary search tree;
▶ hash table.
Unsorted list

▶ Inserting an element is constant time.


▶ The only way to find an element is to check each element.
▶ This takes time in O(n) for any reasonable implementation.
▶ Deletion is also O(n): we must first find the element, and then
the remaining elements must be shifted left (in an array implementation).
Sorted list

▶ In a sorted list, inserting an element is harder because you


have to find the right place and therefore it is O(n).
▶ In a sorted list, finding an element is easier because the order
allows us to perform a binary search, which is O(log n).
▶ Again, deletion is O(n) because the remaining elements must be
shifted left (in an array implementation).
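
A standard binary search over a sorted Python list, illustrating the O(log n) find (a sketch, not part of the slides):

def binary_search(a, key):
    """Return the index of key in sorted list a, or -1 if absent."""
    lo, hi = 0, len(a) - 1
    while lo <= hi:
        mid = (lo + hi) // 2  # halve the search range each iteration
        if a[mid] == key:
            return mid
        if a[mid] < key:
            lo = mid + 1
        else:
            hi = mid - 1
    return -1

print(binary_search([2, 3, 5, 7, 11], 7))  # prints 3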
Efficiency of various implementations of Dictionary ADT

Table: Average case running time (asymptotic order)

Data structure          Insert    Delete    Find
Unsorted list (array)   1         n         n
Sorted list (array)     n         n         log n
Binary search tree      log n     log n     log n
???                     1         1         1

Image from Tamara Nelson-Fromm, UIUC
https://courses.engr.illinois.edu/cs225/sp2019/notes/bst/
Hashing
Hashing

▶ A hash function is a function h that outputs an integer value


for each key. A hash table is an array implementation of the
table ADT, where each key is mapped via a hash function to
an array index.
▶ The number of possible keys is usually enormously more than
the actual number of keys. Thus allocating an array with
enough size to fit all possible keys would be very inefficient.
So hash functions are not 1-to-1; that is, two keys may be
mapped to the same index (a collision).
Hash functions

▶ There are several desirable properties of a hash function:


▶ it should be computable quickly (constant time).
▶ if keys are drawn uniformly at random, then the hashed values
should be uniformly distributed.
▶ keys that are “close” should have their hash values “spread
out”.
▶ A hash function should be deterministic, but appear
“random”; in other words, it should pass some statistical tests
(similar to pseudorandom number generators).
Collision resolution policies

▶ We need a collision resolution policy to prescribe what to do


when collisions occur.
▶ We discuss two policies for resolving collisions (open
addressing and chaining).
[Figure: collision resolution via open addressing, from our textbook DGW2016]
Policy 1: Collision resolution via open addressing

▶ Open addressing uses no extra space: every element is stored


in the hash table. If it gets overfull, we can reallocate space
and rehash.
▶ If a key k hashes to a value h(k) that is already occupied, we
probe (look for an empty space).
▶ The most common probing method is linear probing, which
moves left one index at a time, wrapping around if necessary,
until it finds an empty address.
▶ Another method is double hashing. Use a second hash
∆(k) = t to find a place t and if it is occupied again move to
the left by a fixed step size t, wrapping around if necessary,
until we find an empty address.
[Figure: collision resolution via double hashing, from our textbook DGW2016]
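
A minimal open-addressing table with linear probing, probing to the left as described above (a sketch only; a real implementation would also handle deletion markers and resize when the load factor grows):

class LinearProbingTable:
    """Open addressing: every element lives in the table itself."""
    def __init__(self, m):
        self.m = m
        self.slots = [None] * m  # each slot is a (key, value) pair or None

    def insert(self, key, value):
        i = hash(key) % self.m
        for _ in range(self.m):
            if self.slots[i] is None or self.slots[i][0] == key:
                self.slots[i] = (key, value)
                return
            i = (i - 1) % self.m  # probe one slot to the left, wrapping
        raise RuntimeError("table is full")

    def find(self, key):
        i = hash(key) % self.m
        for _ in range(self.m):
            if self.slots[i] is None:
                return None
            if self.slots[i][0] == key:
                return self.slots[i][1]
            i = (i - 1) % self.m
        return None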
Policy 2: Collision resolution via chaining

▶ Chaining uses an “overflow” list for each element in the hash


table.
▶ Elements that hash to the same slot are placed in a list.
▶ A drawback is the additional space overhead. Also, the
distribution of sizes of lists turns out to be very uneven.
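
A chaining table in the same sketch style, with plain Python lists standing in for the overflow lists:

class ChainingTable:
    """Chaining: each slot holds a list of (key, value) pairs."""
    def __init__(self, m):
        self.buckets = [[] for _ in range(m)]

    def insert(self, key, value):
        bucket = self.buckets[hash(key) % len(self.buckets)]
        for i, (k, _) in enumerate(bucket):
            if k == key:
                bucket[i] = (key, value)  # overwrite an existing key
                return
        bucket.append((key, value))

    def find(self, key):
        bucket = self.buckets[hash(key) % len(self.buckets)]
        for k, v in bucket:
            if k == key:
                return v
        return None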
Warm call

▶ When hashing n keys into a table with m slots, how often do


you think collisions occur when n is much smaller than m?
Analysis of hashing

▶ We count the cost (running time) by number of key


comparisons.
▶ We often use the simple uniform hashing model. That is, each
of the n keys is equally likely to hash into any of the m slots.
So we can consider a “balls in bins” model.
▶ If n is much smaller than m, collisions will be few and most
slots will be empty. If n is much larger than m, collisions will
be many and no slots will be empty. The most interesting
behaviour is when m and n are of comparable size.
▶ Define the load factor to be λ := n/m.
How often do collisions occur?

▶ The probability of no collisions when n balls are thrown into


m bins uniformly at random is Q(m, n).
▶ Note that when the load factor is very small λ → 0, collisions
are unlikely (for example Q(m, 0) = 1 and Q(m, 1) = 1).
▶ At the other extreme case of λ > 1, collisions are absolutely
certain (i.e. Q(m, n) = 0) according to the pigeonhole
principle.
▶ Birthday Paradox: If there are 23 or more people in a room,
the chance is greater than 50% that two or more of them
have the same birthday.
▶ While a table with 365 slots is only 23/365 = 6.3% full after
inserting 23 elements, there is more than a 50% chance of a collision.
Analysis of balls in bins

▶ The probability of no collisions when n balls are thrown into


m boxes uniformly at random is Q(m, n).
▶ The probability of no collisions is

Q(m, n) = (m/m) · ((m − 1)/m) · · · ((m − n + 1)/m) = m! / ((m − n)! · m^n).
▶ Q(m, n) = 0 unless 0 ≤ n ≤ m.
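
Q(m, n) is easy to compute directly; the product form below avoids huge factorials (a sketch):

def Q(m, n):
    """Probability of no collisions: n balls into m bins, uniformly at random."""
    if not 0 <= n <= m:
        return 0.0
    p = 1.0
    for i in range(n):
        p *= (m - i) / m  # ball i + 1 must miss the i occupied bins
    return p

print(Q(365, 23))  # about 0.49: the birthday paradox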
Plots of Q(m, n) against n, m = 25, 100, 400, 1600

▶ Actually, the point where collisions become almost certain
scales with m. This means that, for example, in a table with
1600 slots, we almost certainly get a collision after filling 128
elements (the table being 92% empty).
Running time for chaining (under simple uniform hashing)

▶ The expected length of a chain is λ = n/m under the “balls


in bins” model because each ball goes to a given slot with the
probability 1/m and there are n balls.
▶ The average cost for unsuccessful search is the average list
length, namely λ because we exhaust the list.
▶ The average cost for successful search is roughly λ/2 because
on average we will find it half-way down the list.
▶ Thus provided the load factor is kept bounded, find, delete,
and insert run in constant time, O(λ) on average.
Efficiency of various implementations of Dictionary ADT

Table: Average case running time (asymptotic order)

Data structure          Insert    Delete    Find
Unsorted list (array)   1         n         n
Sorted list (array)     n         n         log n
Binary search tree      log n     log n     log n
Hash table (chaining)   λ         λ         λ
Efficiency of various implementations of Dictionary ADT

Table: Worst case running time (asymptotic order)

Data structure          Insert    Delete    Find
Unsorted list (array)   1         n         n
Sorted list (array)     n         n         log n
Binary search tree      n         n         n
Hash table (chaining)   n         n         n
Questions

▶ What hashing methods are used by major programming


languages?
Hashing in practice (Dec 2018)

▶ Java Collections Framework uses chaining to implement


HashMap, resizing when λ > 0.75, and table size a power of 2.
▶ C++ uses chaining to implement unordered_map, resizing
when λ > 1, and prime table size.
▶ C# uses chaining, resizing when λ > 1, and prime table size.
▶ Python uses open addressing, resizing when λ > 0.66, and
table size a power of 2.
Some hash functions

▶ Division method: h(k) = k mod m
▶ Especially bad if m has some common factors with k.
▶ Multiplication method: h(k) = (a · k mod 2^w) >> (w − r)
▶ Does not work well for some situations.
▶ Universal hashing: h_{a,b}(k) = ((a · k + b) mod p) mod m
▶ Requires selecting a prime p with p > |U| and selecting a, b with
0 ≤ a, b ≤ p − 1.
▶ For worst-case keys, the probability of collision is bounded by 2/m.
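
A sketch of the universal-hashing scheme above; the prime p and table size m are illustrative choices of our own:

import random

p = 2**31 - 1  # a prime, assumed larger than the key universe |U|
m = 1024       # number of slots

a = random.randrange(1, p)  # random parameters fixed once per table
b = random.randrange(0, p)

def h(k):
    """Universal hash: ((a*k + b) mod p) mod m."""
    return ((a * k + b) % p) % m

print(h(42), h(43))  # nearby keys are typically spread far apart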
Next time

▶ Reading assignment 2 (due in 5 days as per course schedule)

▶ Week 3 Lab: Q&A support session


ask all your questions and ask many!

▶ Week 3 Lecture – Foundations of Learning

▶ Project 1 Due - Oct. 6 at 21:00
