0% found this document useful (0 votes)

7 views

04-Analysis Of Algorithms

The document discusses the analysis of algorithms, focusing on running time and performance prediction. It highlights the importance of understanding algorithm efficiency through examples like the Discrete Fourier Transform and N-body simulations, comparing brute force methods to optimized algorithms. Additionally, it emphasizes the scientific method in analyzing performance and includes practical examples and empirical analysis techniques.

Uploaded by

imeddabbech

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views

04-Analysis Of Algorithms

Uploaded by

imeddabbech

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 135

Datenstrukturen

und Algorithmen

Analysis of Algorithms
1.4 A NALYSIS OF A LGORITHMS

h tt p : / / a l g s 4 . c s . p r i n c e t o n . e d u

2
Running time
“ As soon as an Analytic Engine exists, it will necessarily guide the future
course of the science. Whenever any result is sought by its aid, the question
will arise—By what course of calculation can these results be arrived at by
the machine in the shortest time? ” — Charles Babbage (1864)

Analytic Engine
3
Running time
“ As soon as an Analytic Engine exists, it will necessarily guide the future
course of the science. Whenever any result is sought by its aid, the question
will arise—By what course of calculation can these results be arrived at by
the machine in the shortest time? ” — Charles Babbage (1864)

how many times do you

have to turn the crank?

Analytic Engine
3
Cast of characters

4
Cast of characters
Programmer needs to develop
a working solution.

Client wants to solve

problem ef ciently.

4
fi
Cast of characters
Programmer needs to develop
a working solution.

Client wants to solve

problem ef ciently.

Theoretician wants
to understand.

4
fi
Cast of characters
Programmer needs to develop
a working solution.

Student might play

Client wants to solve any or all of these

problem ef ciently. roles someday.

Theoretician wants
to understand.

4
fi
Predict performance.

Compare algorithms. this course

Provide guarantees.

Understand theoretical basis. „Berechenbarkeit und Komplexität“

Primary practical reason: avoid performance issues.

client gets poor performance because programmer

did not understand performance characteristics

5
Discrete Fourier transform.
• Break down waveform of N samples into periodic components.

• Applications: DVD, JPEG, MRI, astrophysics, ….

• Brute force: N2 steps.

• FFT algorithm: N log N steps, enabled new technology.

time
quadratic
64T

32T

16T
linearithmic
8T
linear

size 1K 2K 4K 8K 6
N-body simulation
• Simulate gravitational interactions among N bodies.

• Brute force: N 2 steps.

• Barnes-Hut algorithm: N log N steps, enabled new research.

7
N-body simulation
• Simulate gravitational interactions among N bodies.

• Brute force: N 2 steps.

• Barnes-Hut algorithm: N log N steps, enabled new research.

7
The challenge

Q: Will my program be able to solve a large practical input?

8
The challenge
Why is my program so slow ?

Q: Will my program be able to solve a large practical input?

8
The challenge
Why is my program so slow ? Why does it run out of memory ?

Q: Will my program be able to solve a large practical input?

8
The challenge
Why is my program so slow ? Why does it run out of memory ?

Q: Will my program be able to solve a large practical input?

Insight: [Knuth 1970s] Use scienti c method to understand
performance.
8
fi
Scienti c method

9
fi
Scienti c method

• Observe some feature of the natural world.

9
fi
Scienti c method

• Observe some feature of the natural world.

• Hypothesize a model that is consistent with the observations.

9
fi
Scienti c method

• Observe some feature of the natural world.

• Hypothesize a model that is consistent with the observations.

• Predict events using the hypothesis.

9
fi
Scienti c method

• Observe some feature of the natural world.

• Hypothesize a model that is consistent with the observations.

• Predict events using the hypothesis.

• Verify the predictions by making further observations.

9
fi
Scienti c method

• Observe some feature of the natural world.

• Hypothesize a model that is consistent with the observations.

• Predict events using the hypothesis.

• Verify the predictions by making further observations.

• Validate by repeating until the hypothesis and observations agree.

9
fi
Principles

• Experiments must be reproducible.

• Hypotheses must be falsi able.

10
fi
Example: 3-Sum
Given N distinct integers, how many triples sum to exactly zero?

% cat 8ints.txt
a[i] a[j] a[k] sum
30 -40 -20 -10 40 0 10 5
30 -40 10 0

% python3 3Sum.py 8ints.txt 30 -20 -10 0

4
-40 40 0 0

-10 0 10 0

11
Example: 3-Sum
Given N distinct integers, how many triples sum to exactly zero?

% cat 8ints.txt
a[i] a[j] a[k] sum
30 -40 -20 -10 40 0 10 5
30 -40 10 0

% python3 3Sum.py 8ints.txt 30 -20 -10 0

4
-40 40 0 0

-10 0 10 0

11
3-Sum
brute-force algorithm
import sys
import DSA

def count(a):
N = len(a)
count = 0
for i in range(0,N):
for j in range(i+1,N):
for k in range(j+1,N):
if a[i] + a[j] + a[k] == 0: check each triple
count+=1
return count

f = DSA.In(sys.argv[1])
a = f.readAllInts()
print(count(a))

12
Q: How to time a program? Manual!
% python3 3Sum.py 1Kints.txt

tick tick tick

70
% python3 3Sum.py 2Kints.txt

tick tick tick tick tick tick tick tick

tick tick tick tick tick tick tick tick
tick tick tick tick tick tick tick tick

528
% python3 3Sum.py 4Kints.txt
tick tick tick tick tick tick tick tick
tick tick tick tick tick tick tick tick
tick tick tick tick tick tick tick tick
% python3 3Sum.py 1Kints.txt
tick tick tick tick tick tick tick tick
tick tick tick tick tick tick tick tick
tick tick tick tick tick tick tick tick tick tick tick
tick tick tick tick tick tick tick tick
tick tick tick tick tick tick tick tick
tick tick tick tick tick tick tick tick
70 tick tick tick tick tick tick tick tick
% python3 3Sum.py 2Kints.txt tick tick tick tick tick tick tick tick
tick tick tick tick tick tick tick tick
tick tick tick tick tick tick tick tick
tick tick tick tick tick tick tick tick tick tick tick tick tick tick tick tick
tick tick tick tick tick tick tick tick tick tick tick tick tick tick tick tick
tick tick tick tick tick tick tick tick
tick tick tick tick tick tick tick tick
tick tick tick tick tick tick tick tick
528 tick tick tick tick tick tick tick tick
% python3 3Sum.py 4Kints.txt tick tick tick tick tick tick tick tick
tick tick tick tick tick tick tick tick
tick tick tick tick tick tick tick tick tick tick tick tick tick tick tick tick
tick tick tick tick tick tick tick tick tick tick tick tick tick tick tick tick
tick tick tick tick tick tick tick tick tick tick tick tick tick tick tick tick
tick tick tick tick tick tick tick tick tick tick tick tick tick tick tick tick
tick tick tick tick tick tick tick tick 4039
tick tick tick tick tick tick tick tick
tick tick tick tick tick tick tick tick
tick tick tick tick tick tick tick tick
tick tick tick tick tick tick tick tick 13
Automatic
class Stopwatch (part of DSA.py )

Stopwatch() create a new stopwatch

float : elapsedTime() time since creation (in seconds)

f = DSA.In(sys.argv[1])
a = f.readAllInts()
s = DSA.Stopwatch()
c = count(a)
time = s.elapsedTime()
print("elapsed time:",time,"seconds")
print(c)
client code

14
Empirical analysis

Run the program for

various input sizes
and measure running
time.

15
Empirical analysis

Run the program for

various input sizes
and measure running
time.

15
Empirical analysis
N time (seconds) †

250 0

Run the program for 500 0

various input sizes 1.000 0,1

and measure running 2.000 0,8

time. 4.000 6,4

8.000 51,1

16.000 ?

16
Standard plot. Plot running time T(N) vs. input size N.

standard plot 50 log-log plot 51.2 straight line

of slope 3
25.6

40 12.8

running time T(N)

6.4

lg(T(N))
30 3.2

1.6

20 .8

10 .2

1K 2K 4K 8K 1K 2K 4K 8K
problem size N lg N
Analysis of experimental data (the running time of ThreeSum)

17
Log-log plot: Plot running time T(N) vs. input size N using log-log scale.

log-log plot 51.2 straight line

of slope 3
25.6

lg (T (N)) = b ⋅ lg (N) + c
12.8

6.4
b = 2.999

lg(T(N))
3.2

1.6
c = − 33.2103
.8
b c
.4 T (N) = a ⋅ N , where a = 2
.2

1K 2K 4K 8K 1K 2K 4K 8K
problem size N lg N
Analysis of experimental data (the running time of ThreeSum)
b
Regression: Fit straight line through data points: a ⋅ N .
−10 2.999
Hypothesis: The running time is about 1.006 ⋅ 10 ⋅N seconds.
18
−10 2.999
Hypothesis: The running time is about 1.006 ⋅ 10 ⋅N seconds.

Predictions:

• 51.0 seconds for N = 8,000.

• 408.1 seconds for N = 16,000.

Observations: N time (seconds)

8.000 51,1

8.000 51

8.000 51,1

16.000 410,8

validates hypothesis!

19
Doubling hypothesis
Run program, doubling the size of the input.
b
T (2N) a ⋅ (2N) b
= = 2
T (N) a⋅N b

Quick way to estimate b in a power-law relationship.

20
b
Doubling Hypothesis: Running time is about a ⋅ N with b = lg(ratio)

N time (seconds) † ratio lg ratio

250 0 –

b
T (2N) a ⋅ (2N)
500 0 4,8 2,3
b
1.000 0,1 6,9 2,8 = = 2
T (N) a ⋅ Nb
2.000 0,8 7,7 2,9

4.000 6,4 8 3 lg (6.4 / 0.8) = 3.0

8.000 51,1 8 3

seems to converge to a constant b ≈ 3

Hypothesis: Running time is about 0.998 ⋅ 10 −10 3

⋅N

21
Experimental algorithmics
• System independent effects:
• Algorithm
determines exponent

• Input data in power law

determines
• System dependent effects. constant in power

law
• Hardware: CPU, memory, cache, …

• Software: compiler, VM, garbage collector, …

• System: operating system, network, other apps, …

22
Mathematical models
for running time

23
Donald Knuth’s approach
Total running time:
sum of cost * frequency for operations.

• Need to analyze program to Donald Knuth

determine set of operations. 1974 Turing Award

• Cost depends on machine, compiler.

• Frequency depends on algorithm, input data.

24
Challenge: How to estimate
constants.
operation example nanoseconds †

integer add a + b 2,1

integer multiply a * b 2,4

integer divide a // b 5,4

oating-point add a + b 4,6

oating-point multiply a * b 4,2

oating-point divide a / b 13,5

sine math.sin(theta) 91,3

arctangent math.atan2(y, x) 129

... ... ...

† Running OS X on Macbook Pro 2.2GHz with 2GB RAM

25
fl
fl
fl
Observation: Most primitive operations take
constant time.
operation example nanoseconds †

assignment statement a = b c2

integer compare a < b c3

array element access a[i] c4

array length len(a) c5

1D array allocation intArray(N) c6 N

Caveat: Non-primitive operations often take more than constant time.

e.g. "+ operator" (concatenate) two strings
26
Example: 1-Sum
Q: How many instructions as a function of input size N ?

operation frequency

variable declaration 2
count = 0 assignment statement 2
for i in range(0,N):
if a[i] == 0: less than compare N+1
count+=1
equal to compare N

array access N

N array accesses increment N to 2 N

27
2-Sum

count = 0
for i in range(0,N):
for j in range(i+1,N):
if a[i] + a[j] == 0:
count+=1

28
2-Sum

count = 0
for i in range(0,N):
for j in range(i+1,N):
if a[i] + a[j] == 0:
count+=1

1
0 + 1 + 2 + . . . + (N 1) = N (N 1)
2 ⇥
N
=
2

28
In nite Sum

https://www.youtube.com/watch?v=w-I6XTVZXww
29
fi
2-Sum
operation frequency

count = 0 variable declaration N+2

for i in range(0,N):
for j in range(i+1,N): assignment statement N+2
if a[i] + a[j] == 0:
range accesses ½ (N + 1) (N + 2)
count+=1
equal to compare ½ N (N − 1)

array access N (N − 1)

increment ½ N (N − 1) to N (N − 1)

30
2-Sum
operation frequency

count = 0 variable declaration N+2

for i in range(0,N):
for j in range(i+1,N): assignment statement N+2
if a[i] + a[j] == 0:
range accesses ½ (N + 1) (N + 2)
count+=1
equal to compare ½ N (N − 1)
1 array access N (N − 1)
0 + 1 + 2 + . . . + (N 1) = N (N 1)
2 ⇥
=
N increment ½ N (N − 1) to N (N − 1)
2

30
“It is convenient to have a measure of the amount of work involved in a computing process, even
though it be a very crude one. We may count up the number of times that various elementary operations
are applied in the whole process and then given them various weights. We might, for instance, count the
number of additions, subtractions, multiplications, divisions, recording of numbers, and extractions of
figures from tables. In the case of computing with matrices most of the work consists of multiplications
and writing down numbers, and we shall therefore only attempt to count the number of multiplications
and recordings.” — Alan Turing

ROUNDING-OFF ERRORS IN MATRIX PROCESSES

By A. M. TURING
{National Physical Laboratory, Teddington, Middlesex)
[Received 4 November 1947]
SUMMARY
A number of methods of solving sets of linear equations and inverting matrices
are discussed. The theory of the rounding-off errors involved is investigated for
some of the methods. In all cases examined, including the well-known 'Gauss
elimination process', it is found that the errors are normally quite moderate: no
exponential build-up need occur.
Included amongst the methods considered is a generalization of Choleski's method
which appears to have advantages over other known methods both as regards
accuracy and convenience. This method may also be regarded as a rearrangement
of the elimination process.

Downlo
THIS paper contains descriptions of a number of methods for solving sets 31
Simpli cation 1 ”Cost Model“

Use some basic operation as a proxy for

running time.

32
fi
Cost model
count = 0
for i in range(0,N):
for j in range(i+1,N):
if a[i] + a[j] == 0:
count+=1
operation frequency

variable declaration N+2

assignment statement N+2

range accesses ½ (N + 1) (N + 2)

equal to compare ½ N (N − 1)

array access N (N − 1)
cost model = array accesses

increment ½ N (N − 1) to N (N − 1)
(we assume compiler or runtime do not

optimize any array accesses away!)

33
Cost model
count = 0
for i in range(0,N):
for j in range(i+1,N):
if a[i] + a[j] == 0:
count+=1
operation frequency

1 variable declaration N+2

0 + 1 + 2 + . . . + (N 1) = N (N 1)
2 ⇥
=
N assignment statement N+2
2
range accesses ½ (N + 1) (N + 2)

equal to compare ½ N (N − 1)

array access N (N − 1)
cost model = array accesses

increment ½ N (N − 1) to N (N − 1)
(we assume compiler or runtime do not

optimize any array accesses away!)

33
Simpli cation 2 ”Tilde Notation“

Estimate running time (or memory) as a function of input size N.

Ignore lower order terms.

(when N is large, terms are negligible when N is small, we don't care)

34
fi
Tilde Notation

N 3/6

Ex 1: ⅙ N 3 + 20 N + 16 ~ ⅙N3
166,666,667 N 3/6 ! N 2/2 + N /3
Ex 2: ⅙ N 3 + 100 N 4/3 + 56 ~ ⅙N3
166,167,000
Ex 3: ⅙N3 - ½N 2 + ⅓ N ~ ⅙N3
N 1,000
Leading-term approximation

Technical de nition. f(N) ~ g(N) means

35
fi
Tilde Notation

N 3/6

Ex 1: ⅙ N 3 + 20 N + 16 ~ ⅙N3
166,666,667 N 3/6 ! N 2/2 + N /3
Ex 2: ⅙ N 3 + 100 N 4/3 + 56 ~ ⅙N3
166,167,000
Ex 3: ⅙N3 - ½N 2 + ⅓ N ~ ⅙N3
N 1,000
Leading-term approximation

f (N)
Technical de nition. f(N) ~ g(N) means lim = 1
N → ∞ g(N)

35
fi
Tilde Notation
operation frequency tilde notation

variable declaration N+2 ~N

assignment statement N+2 ~N

range accesses ½ (N + 1) (N + 2) ~½N2

equal to compare ½ N (N − 1) ~½N2

array access N (N − 1) ~N2

increment ½ N (N − 1) to N (N − 1) ~ ½ N 2 to ~ N 2

36
2-Sum
Q: Approximately how many array
accesses as a function of input size N ?

count = 0
for i in range(0,N):
for j in range(i+1,N):
if a[i] + a[j] == 0: "inner loop"
count+=1

A: ~ N2 array accesses.
37
2-Sum
Q: Approximately how many array
accesses as a function of input size N ?

count = 0
for i in range(0,N):
for j in range(i+1,N):
if a[i] + a[j] == 0: "inner loop"
count+=1

1
0 + 1 + 2 + . . . + (N 1) = N (N 1)
2 ⇥
N
=
2

A: ~ N2 array accesses.
37
Bottom line:
Use cost model and tilde
notation to simplify counts!

38
3-Sum
Q: Approximately how many array accesses
as a function of input size N ?
count = 0
for i in range(0,N):
for j in range(i+1,N):
for k in range(j+1,N):
if a[i] + a[j] + a[k] == 0: "inner loop"
count+=1

(3)
N N (N − 1) (N − 2) 1 3
= ∼ N
3! 6

A: ~ ½N 3 array accesses.
39
Q: How to estimate a discrete sum?
• A1: Take a discrete mathematics course.
• A2: Replace the sum with an integral, and use calculus
(doesn’t always work)!
N N
1 2
∑ ∫x=1
Ex 1. 1 + 2 + … + N. i∼ x dx ∼ N
i=1
2

N N
1
∫x=1
ik ∼ x k dx ∼ N k+1
Ex 2. 1k + 2k + … + N k. ∑ k + 1
i=1

N N
1 1
∑i ∫x=1 x
Ex 3. 1 + 1/2 + 1/3 + … + 1/N. ∼ dx ∼ ln N
i=1
40
A3: Use Maple, Wolfram Alpha etc.

41
A3: Use Maple, Wolfram Alpha etc.

wolframalpha.com

41
A3: Use Maple, Wolfram Alpha etc.

wolframalpha.com

[wayne:nobel.princeton.edu] > maple15

|\^/| Maple 15 (X86 64 LINUX)
._|\| |/|_. Copyright (c) Maplesoft, a division of Waterloo Maple Inc. 2011
\ MAPLE / All rights reserved. Maple is a trademark of
<____ ____> Waterloo Maple Inc.
| Type ? for help.
> factor(sum(sum(sum(1, k=j+1..N), j = i+1..N), i = 1..N));

N (N - 1) (N - 2)
-----------------
6
41
• In principle, accurate mathematical models are available.
• In practice,
• Formulas can be complicated.
• Advanced mathematics might be required.
• Exact models best left for experts.
• Bottom line: We use approximate models in this course:
T(N) ~ c N .
3

42
Order-of-growth
Classi cations

43
fi
Common order-of-growth
classi cations
De nition. If f(N) ~ c g(N) for some constant c > 0, then the order
of growth of f(N) is g(N).
• Ignores leading coef cient.
• Ignores lower-order terms.

44
fi
fi
fi
Example
count = 0
for i in range(0,N):
for j in range(i+1,N):
for k in range(j+1,N):
if a[i] + a[j] + a[k] == 0:
count+=1

The order of growth of the running time of this code is N 3.

45
time
Common Classes
200T

100T
logarithmic
constant
Good news: The set of functions: 1, log N, N, N log N, N 2, N 3, and 2N
suf ces to describe the order of growth of most common algorithms.
100K 200K 500K
size

log-log plot
512T

exponential

ic
cubi

hm
rat

r
rit
d

ea
qua

lin
lin
64T
time

2T
logarithmic
T
constant

1K 2K 4K 8K size 512K

Typical orders of growth 46

fi
order of
name typical code framework description example T(2N) / T(N)
growth
add two
1 constant a = b + c statement 1
numbers

while N > 1:
log N logarithmic divide in half binary search ~1
N = N // 2 ...

for i in range(0,N): nd the

N linear loop 2
... maximum
divide
N log N linearithmic [see mergesort lecture] mergesort ~2
and conquer
for i in range(0,N):
N2 quadratic for j in range(0,N): double loop check all pairs 4
...
for i in range(0,N):
for j in range(0,N):
N 3 cubic triple loop check all triples 8
for k in range(0,N):
...
exhaustive check all
2N exponential [see combinatorial search lecture] T(N)
search subsets
47
fi
growth problem size solvable in minutes time to process millions of inputs

rate 1970s 1980s 1990s 2000s 1970s 1980s 1990s 2000s

1 "any" "any" "any" "any" "instant" "instant" "instant" "instant"

log N "any" "any" "any" "any" "instant" "instant" "instant" "instant"

tens of hundreds of
N millions billions minutes seconds second "instant"
millions millions

hundreds of hundreds of tens of

N log N millions millions hour minutes seconds
thousands millions seconds

tens of
N2 hundreds thousand thousands decades years months weeks
thousands

N3 hundred hundreds thousand thousands "never" "never" "never" millennia

48
Binary search
• Goal: Given a sorted array and a key, nd index of the key
in the array?
• Idea: Compare key against middle entry.
• Too small, go left.
• Too big, go right.
• Equal, found.

49
fi
Binary search demo

successful search for 33

6 13 14 25 33 43 51 53 64 72 84 93 95 96 97
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14

lo hi

50
Binary search demo

successful search for 33

6 13 14 25 33 43 51 53 64 72 84 93 95 96 97
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14

lo mid hi

50
Binary search demo

successful search for 33

6 13 14 25 33 43 51 53 64 72 84 93 95 96 97
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14

lo hi

51
Binary search demo

successful search for 33

6 13 14 25 33 43 51 53 64 72 84 93 95 96 97
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14

lo mid hi

51
Binary search demo

successful search for 33

6 13 14 25 33 43 51 53 64 72 84 93 95 96 97
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14

lo hi

52
Binary search demo

successful search for 33

6 13 14 25 33 43 51 53 64 72 84 93 95 96 97
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14

lo mid hi

52
Binary search demo

lo = hi
successful search for 33

6 13 14 25 33 43 51 53 64 72 84 93 95 96 97
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14

53
Binary search demo

lo = hi
successful search for 33

6 13 14 25 33 43 51 53 64 72 84 93 95 96 97
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14

mid

return 4

53
Binary search demo

unsuccessful search for 34

6 13 14 25 33 43 51 53 64 72 84 93 95 96 97
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14

lo hi

54
Binary search demo

unsuccessful search for 34

6 13 14 25 33 43 51 53 64 72 84 93 95 96 97
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14

lo mid hi

54
Binary search demo

unsuccessful search for 34

6 13 14 25 33 43 51 53 64 72 84 93 95 96 97
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14

lo hi

55
Binary search demo

unsuccessful search for 34

6 13 14 25 33 43 51 53 64 72 84 93 95 96 97
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14

lo mid hi

55
Binary search demo

unsuccessful search for 34

6 13 14 25 33 43 51 53 64 72 84 93 95 96 97
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14

lo hi

56
Binary search demo

unsuccessful search for 34

6 13 14 25 33 43 51 53 64 72 84 93 95 96 97
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14

lo mid hi

56
Binary search demo

lo = hi
unsuccessful search for 34

6 13 14 25 33 43 51 53 64 72 84 93 95 96 97
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14

57
Binary search demo

lo = hi
unsuccessful search for 34

6 13 14 25 33 43 51 53 64 72 84 93 95 96 97
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14

mid

return -1

57
Binary search implementation

Trivial to implement?

58
Binary search implementation

Trivial to implement?
• First binary search published in 1946.

58
Binary search implementation

Trivial to implement?
• First binary search published in 1946.
• First bug-free one in 1962.

58
Binary search implementation

Trivial to implement?
• First binary search published in 1946.
• First bug-free one in 1962.
• Bug in Java's Arrays.binarySearch() discovered in 2006.

58
Binary search: Python3
def binSearch(a,e):
lo = 0
hi = len(a)-1
while hi >= lo:
mid = lo+(hi-lo)//2
if e > a[mid]:
lo = mid+1
elif e < a[mid]:
hi = mid-1
else:
return mid
return -1

Invariant: If key appears in the array a[], then a[lo] ≤ key ≤ a[hi].
59
Mathematical analysis
Proposition: Binary search uses at most 1 + lg N key compares to search
in a sorted array of size N.

Def.: T(N ) = # key compares to binary search a sorted subarray of

size ≤ N.

Binary search recurrence:

T (N ) ≤ T (N // 2) + 1 for N > 1, with T (1) = 1.

60
Proof Sketch
T (N) ≤ T (N // 2) + 1 [ given ]

61
Proof Sketch
T (N) ≤ T (N // 2) + 1 [ given ]
≤ T (N // 4) + 1 + 1 [ apply recurrence to rst term ]

61
fi
Proof Sketch
T (N) ≤ T (N // 2) + 1 [ given ]
≤ T (N // 4) + 1 + 1 [ apply recurrence to rst term ]
≤ T (N // 8) + 1 + 1 + 1 [ apply recurrence to rst term ]

61
fi
fi
Proof Sketch
T (N) ≤ T (N // 2) + 1 [ given ]
≤ T (N // 4) + 1 + 1 [ apply recurrence to rst term ]
≤ T (N // 8) + 1 + 1 + 1 [ apply recurrence to rst term ]
⋮

61
fi
fi
Proof Sketch
T (N) ≤ T (N // 2) + 1 [ given ]
≤ T (N // 4) + 1 + 1 [ apply recurrence to rst term ]
≤ T (N // 8) + 1 + 1 + 1 [ apply recurrence to rst term ]
⋮
≤ T (N // N) + 1 + 1 + … + 1 [ stop applying, T(1) = 1 ]

61
fi
fi
N 2 log N algorithm for 3-Sum
input

30 -40 -20 -10 40 0 10 5

• Step 1: Sort the N (distinct) numbers.

• Step 2: For each pair of numbers a[i]

and a[j], binary search for -(a[i] + a[j]).

62
N 2 log N algorithm for 3-Sum
input

30 -40 -20 -10 40 0 10 5

sort

-40 -20 -10 0 5 10 30 40

• Step 1: Sort the N (distinct) numbers.

• Step 2: For each pair of numbers a[i]

and a[j], binary search for -(a[i] + a[j]).

62
N 2 log N algorithm for 3-Sum
binary search
sort (-40, -20) 60

-40 -20 -10 0 5 10 30 40 (-40, -10) 50

(-40, 0) 40
(-40, 5) 35
(-40, 10) 30
⋮ ⋮

(-20, -10) 30
• Step 1: Sort the N (distinct) numbers. only count if
⋮ ⋮
a[i] < a[j] < a[k]
• Step 2: For each pair of numbers a[i] (-10, 0) 10 to avoid
and a[j], binary search for -(a[i] + a[j]).
⋮ ⋮ double counting

( 10, 30) -40

( 10, 40) -50
( 30, 40) -70

62
Analysis

• Order of growth is N 2 log N.

• Step 1: N 2 with insertion sort (details later).
• Step 2: N 2 log N with binary search.

Remark: Can achieve N 2 by modifying binary search step.

63
Speed Comparison
N time (seconds) N time (seconds)

1.000 0,1 1.000 0,14

2.000 0,8 2.000 0,18

4.000 6,4 4.000 0,34

8.000 51,1 8.000 0,96

3Sum.py 16.000 3,67

32.000 14,88

64.000 59,16

3Sum-fast.py

64
Theory of algorithms

65
Types of analyses
• Best case
Lower bound on cost

• Worst case
Upper bound on cost

• Average case
Expected cost for random input
Ex 1. Array accesses for brute-force 3-SUM. Ex 2. Compares for binary search.
Best: ~ ½ N3 Best: ~ 1
Average: ~ ½ N3 Average: ~ lg N
Worst: ~ ½ N3 Worst: ~ lg N
66
Types of analyses
• Best case
Lower bound on cost

• Worst case
Upper bound on cost
this course

• Establish “dif culty” of a problem.

• Develop “optimal” algorithms.

• Approach:

• Suppress details in analysis: analyze “to within a constant factor.”

• Eliminate variability in input model: focus on the worst case.

67
fi
Theory of algorithms

• Upper bound:
Performance guarantee of algorithm for any input.

• Lower bound:
Proof that no algorithm can do better.

• Optimal algorithm:
Lower bound = upper bound (to within a constant factor).

68
Commonly-used notations in the theory of
algorithms
notation provides example shorthand for used to

½ N2
asymptotic 10 N 2 classify
Big Theta Θ(N2)
order of growth 5 N 2 + 22 N log N + 3N algorithms
⋮

10 N 2
100 N develop
Big Oh Θ(N2) and smaller O(N2)
22 N log N + 3 N upper bounds
⋮

½N2
N5 develop
Big Omega Θ(N2) and larger Ω(N2)
N 3 + 22 N log N + 3 N lower bounds
⋮
69
Example 1

1-Sum
“Is there a 0 in the array?”

70
• Upper bound: A speci c algorithm.

• Ex. Brute-force algorithm for 1-Sum: Look at every array entry.

• Running time of the optimal algorithm for 1-Sum is O(N).

71
fi
• Upper bound: A speci c algorithm.

• Ex. Brute-force algorithm for 1-Sum: Look at every array entry.

• Running time of the optimal algorithm for 1-Sum is O(N).

• Lower bound: Proof that no algorithm can do better.

• Ex. Have to examine all N entries

(any unexamined one might be 0).

• Running time of the optimal algorithm for 1-Sum is Ω(N).

• Optimal algorithm.

71
fi
• Upper bound: A speci c algorithm.

• Ex. Brute-force algorithm for 1-Sum: Look at every array entry.

• Running time of the optimal algorithm for 1-Sum is O(N).

• Lower bound: Proof that no algorithm can do better.

• Ex. Have to examine all N entries

(any unexamined one might be 0).

• Running time of the optimal algorithm for 1-Sum is Ω(N).

• Optimal algorithm.

• Lower bound equals upper bound (to within a constant factor).

• Brute-force algorithm for 1-Sum is optimal:

its running time is Θ(N).

71
fi
Example 2

3-Sum
72
• Upper bound: A speci c algorithm.

• Ex. Brute-force algorithm for 3-Sum.

• Running time of the optimal algorithm for 3-Sum is O(N3).

73
fi
• Upper bound: A speci c algorithm.

• Ex. Improved algorithm for 3-Sum.

• Running time of the optimal algorithm for 3-Sum is O(N2 log N ).

74
fi
• Upper bound: A speci c algorithm.

• Ex. Improved algorithm for 3-Sum.

• Running time of the optimal algorithm for 3-Sum is O(N2 log N ).

• Lower bound: Proof that no algorithm can do better.

• Ex. Have to examine all N entries to solve 3-Sum.

• Running time of the optimal algorithm for solving 3-Sum is Ω(N ).

75
fi
• Upper bound: A speci c algorithm.

• Ex. Improved algorithm for 3-Sum.

• Running time of the optimal algorithm for 3-Sum is O(N2 log N ).

• Lower bound: Proof that no algorithm can do better.

• Ex. Have to examine all N entries to solve 3-Sum.

• Running time of the optimal algorithm for solving 3-Sum is Ω(N ).

• Open problems:

• Optimal algorithm for 3-Sum? Subquadratic algorithm for 3-Sum?

Quadratic lower bound for 3-Sum?

75
fi
Algorithm Design

76
Algorithm Design
Develop a new Algorithm

Prove a lower bound

76
Algorithm Design
Develop a new Algorithm

Gap?

Prove a lower bound

76
Algorithm Design
Lower the upper bound
Develop a new Algorithm (discover a new algorithm).

Gap?

Prove a lower bound

76
Algorithm Design
Lower the upper bound
Develop a new Algorithm (discover a new algorithm).

Gap?

Raise the lower bound

Prove a lower bound (more dif cult).

76
fi
Golden Age of Algorithm Design

• 1970s
• Steadily decreasing upper bounds for many important problems.
• Many known optimal algorithms.

77
‑
Caveats

• Overly pessimistic to focus on worst case?

• Often we need better than “to within a constant factor”
to predict performance.

78
Commonly-used notations in the theory of
algorithms
notation provides example shorthand for used to

10 N 2
provide
Tilde leading term ~ 10 N 2 10 N 2 + 22 N log N
approximate model
10 N 2 + 2 N + 37

½ N2
asymptotic classify
Big Theta Θ(N2) 10 N 2
order of growth algorithms
5 N 2 + 22 N log N + 3N

10 N 2
develop
Big Oh Θ(N2) and smaller O(N2) 100 N
upper bounds
22 N log N + 3 N

½N2
develop
Big Omega Θ(N2) and larger Ω(N2) N5
lower bounds
N 3+ 22 N log N + 3 N
79
Turning the crank: summary

80
Summary
Empirical analysis.
• Execute program to perform
experiments.
• Assume power law and formulate a
hypothesis for running time.
• Model enables us to make predictions.

81
Summary
Mathematical analysis.
• Analyze algorithm to count frequency of
operations.
• Use tilde notation to simplify analysis.
• Model enables us to explain behavior.

82
Summary
Scienti c method.
• Mathematical model is independent of a
particular system; applies to machines not
yet built.
• Empirical analysis is necessary to validate mathematical
models and to make predictions.

83
fi
R EADING L IST :

2.1 E LEMENTARY S ORTS

h tt p : / / a l g s 4 . c s . p r i n c e t o n . e d u

Reiki Doc - Divine Healing Codes and How To Use Them 2015
100% (2)
Reiki Doc - Divine Healing Codes and How To Use Them 2015
105 pages
AOCS Method Cd12b-92 - Estabilidade Oxidativa
100% (2)
AOCS Method Cd12b-92 - Estabilidade Oxidativa
5 pages
Analysis of Algorithms Into Coursenotes
No ratings yet
Analysis of Algorithms Into Coursenotes
68 pages
14 Analysis of Algorithms
No ratings yet
14 Analysis of Algorithms
68 pages
14 Analysis of Algorithms
No ratings yet
14 Analysis of Algorithms
59 pages
Lecture4 2024
No ratings yet
Lecture4 2024
67 pages
3sum and Improvement
No ratings yet
3sum and Improvement
38 pages
Data Structures U1
No ratings yet
Data Structures U1
88 pages
04 Analysis
No ratings yet
04 Analysis
61 pages
Algo Analysis Lect
No ratings yet
Algo Analysis Lect
34 pages
Chapter 1 - Analysis of Algorithms 2
No ratings yet
Chapter 1 - Analysis of Algorithms 2
44 pages
Alg Chapter2 Part 1
No ratings yet
Alg Chapter2 Part 1
55 pages
Lec 2
No ratings yet
Lec 2
59 pages
DSAIIMidBank
No ratings yet
DSAIIMidBank
323 pages
Algorithm Analysis
No ratings yet
Algorithm Analysis
19 pages
Analysis of Algorithms
No ratings yet
Analysis of Algorithms
43 pages
Lecture 3.1
No ratings yet
Lecture 3.1
29 pages
Algorithms Rosen
No ratings yet
Algorithms Rosen
60 pages
Order of Complexity Analysis
No ratings yet
Order of Complexity Analysis
9 pages
Lec - 3 Final
No ratings yet
Lec - 3 Final
52 pages
DAA Module 1 Power Point-S.Mercy
No ratings yet
DAA Module 1 Power Point-S.Mercy
49 pages
Lecture 2-The Big-Oh Notation
No ratings yet
Lecture 2-The Big-Oh Notation
30 pages
DSAP-Lecture 3 - Algorithm Analysis
No ratings yet
DSAP-Lecture 3 - Algorithm Analysis
35 pages
Design and Analysis of Algorithms: Dr. Muhammad Safysn Spring 2019
No ratings yet
Design and Analysis of Algorithms: Dr. Muhammad Safysn Spring 2019
89 pages
MIT1 204S10 Lec05
No ratings yet
MIT1 204S10 Lec05
13 pages
L2.1 Intro To DSA
No ratings yet
L2.1 Intro To DSA
57 pages
01-algo
No ratings yet
01-algo
95 pages
Daa PPT - Dhaval Bhoi - Ce355
No ratings yet
Daa PPT - Dhaval Bhoi - Ce355
77 pages
Introduction
No ratings yet
Introduction
36 pages
DSweek3 Algo
No ratings yet
DSweek3 Algo
29 pages
1 Introduction
No ratings yet
1 Introduction
104 pages
Time and Space Complexity _ 2023
No ratings yet
Time and Space Complexity _ 2023
47 pages
Big O Notation For Time Complexity of Algorithms
No ratings yet
Big O Notation For Time Complexity of Algorithms
6 pages
Presentation 23953 Content Document 20240906040454PM (2)
No ratings yet
Presentation 23953 Content Document 20240906040454PM (2)
93 pages
European Semiconductor
No ratings yet
European Semiconductor
14 pages
Algorithm Analysis: Prof - Dr.Eng - Ir Taufik Djatna
No ratings yet
Algorithm Analysis: Prof - Dr.Eng - Ir Taufik Djatna
39 pages
ALGORITHM LAB
No ratings yet
ALGORITHM LAB
48 pages
Algorithm Efficiency Analysis
No ratings yet
Algorithm Efficiency Analysis
31 pages
1.4. Analysis of Algorithms
No ratings yet
1.4. Analysis of Algorithms
75 pages
Slide 2
No ratings yet
Slide 2
22 pages
Basic Concept of Data Structures
No ratings yet
Basic Concept of Data Structures
37 pages
Algorithm Analysis
No ratings yet
Algorithm Analysis
51 pages
Algorithms Intro
No ratings yet
Algorithms Intro
86 pages
Lec16-Algorithm-Analysis-09092024-090952pm (1)
No ratings yet
Lec16-Algorithm-Analysis-09092024-090952pm (1)
30 pages
Lec8 PDF
No ratings yet
Lec8 PDF
42 pages
CSE225 Lecture04 AnalysisAlgorithms
No ratings yet
CSE225 Lecture04 AnalysisAlgorithms
39 pages
05 CSE225 Analysis of Algorithms
No ratings yet
05 CSE225 Analysis of Algorithms
32 pages
DAA Unit 1 Notes
No ratings yet
DAA Unit 1 Notes
34 pages
1 2
No ratings yet
1 2
56 pages
L1_AlgoAnalysis
No ratings yet
L1_AlgoAnalysis
57 pages
Data Structures - Lecture 3 - Solution - English
No ratings yet
Data Structures - Lecture 3 - Solution - English
36 pages
Iare DS PPT 0
No ratings yet
Iare DS PPT 0
221 pages
Chapter 1 - Introduction
No ratings yet
Chapter 1 - Introduction
13 pages
DS PPT
No ratings yet
DS PPT
221 pages
Introduction
No ratings yet
Introduction
18 pages
1 TimeComplexity
No ratings yet
1 TimeComplexity
74 pages
Algorithms_Rosen
No ratings yet
Algorithms_Rosen
18 pages
Growth of Functions
100% (1)
Growth of Functions
11 pages
Analysis of Algorithms: CS 302 - Data Structures Section 2.6
No ratings yet
Analysis of Algorithms: CS 302 - Data Structures Section 2.6
48 pages
Natural Computing with Python: Learn to implement genetic and evolutionary algorithms to solve problems in a pythonic way
From Everand
Natural Computing with Python: Learn to implement genetic and evolutionary algorithms to solve problems in a pythonic way
Giancarlo Zaccone
No ratings yet
Neural Networks with Python
From Everand
Neural Networks with Python
Mei Wong
No ratings yet
Mastering matplotlib
From Everand
Mastering matplotlib
Duncan M. McGreggor
No ratings yet
Survey of Digitised Materials in Bengali
No ratings yet
Survey of Digitised Materials in Bengali
18 pages
Diagram 2
100% (3)
Diagram 2
11 pages
Syllabus Macro ENG
No ratings yet
Syllabus Macro ENG
6 pages
Section 1 - Rfds General Information: Umts Frequency
No ratings yet
Section 1 - Rfds General Information: Umts Frequency
24 pages
Research Proposal Hira Saghir 1
No ratings yet
Research Proposal Hira Saghir 1
11 pages
Strata Bound Dolomitization in The Eocene Laki Formation Matyaro Jabal
No ratings yet
Strata Bound Dolomitization in The Eocene Laki Formation Matyaro Jabal
16 pages
ENG 101 Course Outline
No ratings yet
ENG 101 Course Outline
3 pages
Evolve Company Profile 23 - 24 - Compressed
No ratings yet
Evolve Company Profile 23 - 24 - Compressed
20 pages
VNR Vignana Jyothi Institute of Engineering & Technology
No ratings yet
VNR Vignana Jyothi Institute of Engineering & Technology
1 page
Prac-6 Darshan ADA Gtu
No ratings yet
Prac-6 Darshan ADA Gtu
9 pages
Stevenson Kaysee Resume Spring 2017
No ratings yet
Stevenson Kaysee Resume Spring 2017
2 pages
(MindTap Course List) Carlos Coronel, Steven Morris - Database Systems_ Design, Implementation, & Management-Cengage (2023) (1)-109-129
No ratings yet
(MindTap Course List) Carlos Coronel, Steven Morris - Database Systems_ Design, Implementation, & Management-Cengage (2023) (1)-109-129
21 pages
Abrasive Wear Behavior of Boronized AISI 8620 Steel 2008 PDF
No ratings yet
Abrasive Wear Behavior of Boronized AISI 8620 Steel 2008 PDF
7 pages
Measuring Information Technology Performance
No ratings yet
Measuring Information Technology Performance
38 pages
R&R House Rectification Works
No ratings yet
R&R House Rectification Works
41 pages
Suraj Dadmal
No ratings yet
Suraj Dadmal
17 pages
PRC-0008 Current
No ratings yet
PRC-0008 Current
50 pages
Nube Lizer
No ratings yet
Nube Lizer
5 pages
Ad 52090
No ratings yet
Ad 52090
25 pages
Essential Plant Nutrients: Uptake, Use Efficiency and Management
No ratings yet
Essential Plant Nutrients: Uptake, Use Efficiency and Management
2 pages
Choral Speaking Text
100% (3)
Choral Speaking Text
3 pages
DO - s2019 - 021 Policy Guidelines On The K To 12 Program
No ratings yet
DO - s2019 - 021 Policy Guidelines On The K To 12 Program
175 pages
.003 Tokkudu Billa 01 12
67% (45)
.003 Tokkudu Billa 01 12
73 pages
Week 1 Processor
No ratings yet
Week 1 Processor
24 pages
EC105
No ratings yet
EC105
23 pages
Crackens Rebel Operatives WEG40084 PDF
100% (4)
Crackens Rebel Operatives WEG40084 PDF
98 pages
Rudolf Walter Meyer - Leibnitz and The Seventeenth-Century Revolution (Leibniz Und Die Europäische Ordnungskrise, Hamburg, 1948) - Bowes and Bowes (1952)
No ratings yet
Rudolf Walter Meyer - Leibnitz and The Seventeenth-Century Revolution (Leibniz Und Die Europäische Ordnungskrise, Hamburg, 1948) - Bowes and Bowes (1952)
235 pages
Intro To Philosophy PPT 1
No ratings yet
Intro To Philosophy PPT 1
25 pages