Introduction To Algorithms: Order Statistics

Introduction to Algorithms

6.046J/18.401J
LECTURE 6
Order Statistics
• Randomized divide and conquer
• Analysis of expected time
• Worst-case linear-time order statistics
• Analysis

Prof. Erik Demaine

September 28, 2005 Copyright © 2001-5 by Erik D. Demaine and Charles E. Leiserson L6.1
Order statistics
Select the ith smallest of n elements (the
element with rank i).
• i = 1: minimum;
• i = n: maximum;
• i = ⌊(n+1)/2⌋ or ⌈(n+1)/2⌉: median.
Naive algorithm: Sort and index ith element.
Worst-case running time = Θ(n lg n) + Θ(1) = Θ(n lg n),
using merge sort or heapsort (not quicksort, whose worst case is Θ(n²)).
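The naive algorithm can be sketched in a few lines of Python. This is only an illustration: the name `naive_select` is hypothetical, and it relies on Python's built-in `sorted` (an O(n lg n) worst-case sort) in place of the merge sort or heapsort named on the slide.

```python
def naive_select(items, i):
    """Return the i-th smallest element (1-based rank) by sorting a copy."""
    n = len(items)
    if not 1 <= i <= n:
        raise ValueError("rank i must satisfy 1 <= i <= n")
    # Sorting dominates the cost; indexing the sorted copy is constant time.
    return sorted(items)[i - 1]


if __name__ == "__main__":
    data = [6, 10, 13, 5, 8, 3, 2, 11]
    n = len(data)
    print("minimum:", naive_select(data, 1))
    print("maximum:", naive_select(data, n))
    print("lower median:", naive_select(data, (n + 1) // 2))
```

The input list is left unmodified because `sorted` returns a new list.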
Randomized divide-and-conquer algorithm
RAND-SELECT(A, p, q, i)    ⊳ ith smallest of A[p . . q]
    if p = q then return A[p]
    r ← RAND-PARTITION(A, p, q)
    k ← r – p + 1    ⊳ k = rank(A[r])
    if i = k then return A[r]
    if i < k
        then return RAND-SELECT(A, p, r – 1, i)
        else return RAND-SELECT(A, r + 1, q, i – k)
[Figure: A[p . . q] after partitioning around A[r]: the elements in A[p . . r – 1] are ≤ A[r], the elements in A[r + 1 . . q] are ≥ A[r], and A[p . . r] contains k elements.]
Example
Select the i = 7th smallest:
6 10 13 5 8 3 2 11    (pivot: 6; i = 7)
Partition:
2 5 3 6 8 13 10 11    (k = 4)
Select the 7 – 4 = 3rd smallest recursively.
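RAND-SELECT translates fairly directly into Python. The sketch below is one possible rendering, not the lecture's own code: the function names `rand_partition` and `rand_select` are Python spellings chosen here, the array bounds `p` and `q` are 0-based and inclusive while the rank `i` stays 1-based, and the partition scheme (random element swapped to the front, then a single left-to-right pass) is an assumption about how RAND-PARTITION might be implemented.

```python
import random


def rand_partition(a, p, q):
    """Partition a[p..q] (inclusive) around a random pivot; return the pivot's final index."""
    s = random.randint(p, q)          # pivot position, uniform over p..q
    a[p], a[s] = a[s], a[p]           # move the pivot to the front
    pivot = a[p]
    boundary = p                      # a[p+1..boundary] holds elements <= pivot
    for j in range(p + 1, q + 1):
        if a[j] <= pivot:
            boundary += 1
            a[boundary], a[j] = a[j], a[boundary]
    a[p], a[boundary] = a[boundary], a[p]
    return boundary


def rand_select(a, p, q, i):
    """Return the i-th smallest (1-based) element of a[p..q]; reorders a in place."""
    if p == q:
        return a[p]
    r = rand_partition(a, p, q)
    k = r - p + 1                     # rank of the pivot within a[p..q]
    if i == k:
        return a[r]
    if i < k:
        return rand_select(a, p, r - 1, i)
    return rand_select(a, r + 1, q, i - k)


if __name__ == "__main__":
    data = [6, 10, 13, 5, 8, 3, 2, 11]
    print(rand_select(list(data), 0, len(data) - 1, 7))
```

Because the recursion follows the pseudocode literally, an unlucky run can recurse n levels deep; for large inputs an iterative loop over `p`, `q`, and `i` would avoid Python's recursion limit.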
Intuition for analysis
(All our analyses today assume that all elements are distinct.)
Lucky:
T(n) = T(9n/10) + Θ(n)    [$n^{\log_{10/9} 1} = n^0 = 1$]
     = Θ(n)    [CASE 3]
Unlucky:
T(n) = T(n – 1) + Θ(n)    [arithmetic series]
     = Θ(n²)
Worse than sorting!
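Both recurrences can also be unrolled by hand. The derivation below is a sketch under one simplifying assumption that is not on the slide: the Θ(n) partitioning cost is taken to be exactly $an$ for a hypothetical constant $a > 0$, and the base case is ignored.

```latex
% Unlucky case: T(n) = T(n-1) + an unrolls into an arithmetic series.
\[
T(n) = a n + a(n-1) + \dots + a \cdot 1
     = a \sum_{j=1}^{n} j
     = a\,\frac{n(n+1)}{2}
     = \Theta(n^2).
\]
% Lucky case: T(n) = T(9n/10) + an unrolls into a geometric series.
\[
T(n) \le a n \sum_{j=0}^{\infty} \left(\frac{9}{10}\right)^{j}
      = 10\,a n
      = O(n),
\qquad\text{and } T(n) \ge a n, \text{ so } T(n) = \Theta(n).
\]
```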
Analysis of expected time
The analysis follows that of randomized
quicksort, but it’s a little different.
Let T(n) = the random variable for the running
time of RAND-SELECT on an input of size n,
assuming random numbers are independent.
For k = 0, 1, …, n–1, define the indicator random variable
$$
X_k =
\begin{cases}
1 & \text{if PARTITION generates a } k : n-k-1 \text{ split,} \\
0 & \text{otherwise.}
\end{cases}
$$
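The expectation of each indicator follows from the definition. As a short worked step, assuming the pivot is chosen uniformly at random among the n distinct elements (so that each of the n possible splits is equally likely):

```latex
\[
E[X_k] = 0 \cdot \Pr\{X_k = 0\} + 1 \cdot \Pr\{X_k = 1\}
       = \Pr\{X_k = 1\}
       = \frac{1}{n},
\qquad k = 0, 1, \dots, n-1 .
\]
```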
Analysis (continued)
To obtain an upper bound, assume that the ith element always falls in the larger side of the partition:
$$
T(n) =
\begin{cases}
T(\max\{0, n-1\}) + \Theta(n) & \text{if } 0 : n-1 \text{ split,} \\
T(\max\{1, n-2\}) + \Theta(n) & \text{if } 1 : n-2 \text{ split,} \\
\qquad \vdots \\
T(\max\{n-1, 0\}) + \Theta(n) & \text{if } n-1 : 0 \text{ split,}
\end{cases}
\;=\; \sum_{k=0}^{n-1} X_k \bigl( T(\max\{k, n-k-1\}) + \Theta(n) \bigr).
$$
Calculating expectation
$$
\begin{aligned}
E[T(n)] &= E\Bigl[\, \sum_{k=0}^{n-1} X_k \bigl( T(\max\{k, n-k-1\}) + \Theta(n) \bigr) \Bigr]
  && \text{Take expectations of both sides.} \\
&= \sum_{k=0}^{n-1} E\bigl[ X_k \bigl( T(\max\{k, n-k-1\}) + \Theta(n) \bigr) \bigr]
  && \text{Linearity of expectation.} \\
&= \sum_{k=0}^{n-1} E[X_k] \cdot E\bigl[ T(\max\{k, n-k-1\}) + \Theta(n) \bigr]
  && \text{Independence of } X_k \text{ from other random choices.} \\
&= \frac{1}{n} \sum_{k=0}^{n-1} E\bigl[ T(\max\{k, n-k-1\}) \bigr] + \frac{1}{n} \sum_{k=0}^{n-1} \Theta(n)
  && \text{Linearity of expectation; } E[X_k] = 1/n. \\
&\le \frac{2}{n} \sum_{k=\lfloor n/2 \rfloor}^{n-1} E[T(k)] + \Theta(n)
  && \text{Upper terms appear twice.}
\end{aligned}
$$
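The last step ("upper terms appear twice") can be checked by enumerating the values of max{k, n–k–1}. The snippet below is a small illustration; the helper name `larger_side_counts` is hypothetical.

```python
from collections import Counter


def larger_side_counts(n):
    """Count how often each value max(k, n-k-1) occurs as k runs over 0..n-1."""
    return Counter(max(k, n - k - 1) for k in range(n))


if __name__ == "__main__":
    for n in (6, 7):
        counts = larger_side_counts(n)
        print(n, sorted(counts.items()))
```

For n = 6 every value in 3..5 occurs twice; for n = 7 the value 3 occurs once and 4..6 occur twice. In both cases no value below ⌊n/2⌋ appears and no value appears more than twice, which is what justifies bounding the sum by twice the sum over k = ⌊n/2⌋, …, n–1.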
Hairy recurrence
(But not quite as hairy as the quicksort one.)
$$
E[T(n)] \le \frac{2}{n} \sum_{k=\lfloor n/2 \rfloor}^{n-1} E[T(k)] + \Theta(n)
$$
Prove: E[T(n)] ≤ cn for constant c > 0.
• The constant c can be chosen large enough so that E[T(n)] ≤ cn for the base cases.
Use fact: $\sum_{k=\lfloor n/2 \rfloor}^{n-1} k \le \frac{3}{8} n^2$ (exercise).
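The fact is left as an exercise, but it is easy to sanity-check numerically before proving it. The following sketch uses exact integer arithmetic; the function names are hypothetical, and a finite check like this is evidence rather than a proof.

```python
def upper_half_sum(n):
    """Sum of k for k = floor(n/2), ..., n-1."""
    return sum(range(n // 2, n))


def fact_holds(n):
    """Check sum_{k=floor(n/2)}^{n-1} k <= (3/8) n^2 without floating point."""
    return 8 * upper_half_sum(n) <= 3 * n * n


if __name__ == "__main__":
    print(all(fact_holds(n) for n in range(1, 10_000)))
```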
Substitution method
$$
\begin{aligned}
E[T(n)] &\le \frac{2}{n} \sum_{k=\lfloor n/2 \rfloor}^{n-1} ck + \Theta(n)
  && \text{Substitute inductive hypothesis.} \\
&\le \frac{2c}{n} \Bigl( \frac{3}{8} n^2 \Bigr) + \Theta(n)
  && \text{Use fact.} \\
&= cn - \Bigl( \frac{cn}{4} - \Theta(n) \Bigr)
  && \text{Express as desired – residual.} \\
&\le cn,
\end{aligned}
$$
if c is chosen large enough so that cn/4 dominates the Θ(n).
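To see how large c must be, the Θ(n) term can be given an explicit constant. In the worked step below, $a$ is a hypothetical constant (not named in the lecture) such that the Θ(n) term is at most $an$:

```latex
\[
\frac{2c}{n} \cdot \frac{3}{8} n^2 + a n
  = \frac{3}{4} c n + a n
  = c n - \Bigl( \frac{c n}{4} - a n \Bigr)
  \le c n
\quad\Longleftrightarrow\quad
\frac{c}{4} \ge a
\quad\Longleftrightarrow\quad
c \ge 4a .
\]
```

So under this assumption any c ≥ 4a that also covers the base cases completes the induction.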
Summary of randomized order-statistic selection
• Works fast: linear expected time.
• Excellent algorithm in practice.
• But, the worst case is very bad: Θ(n²).
Q. Is there an algorithm that runs in linear time in the worst case?
A. Yes, due to Blum, Floyd, Pratt, Rivest, and Tarjan [1973].
IDEA: Generate a good pivot recursively.
Worst-case linear-time order statistics
SELECT(i, n)
1. Divide the n elements into groups of 5. Find the median of each 5-element group by rote.
2. Recursively SELECT the median x of the ⌊n/5⌋ group medians to be the pivot.
3. Partition around the pivot x. Let k = rank(x).
4. if i = k then return x
   elseif i < k
       then recursively SELECT the ith smallest element in the lower part
       else recursively SELECT the (i–k)th smallest element in the upper part
(Step 4 is the same as in RAND-SELECT.)
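The four steps of SELECT can be sketched in Python as follows. This is an illustrative rendering rather than the lecture's implementation: the name `select` and the list-based partition are choices made here, the elements are assumed distinct (as in the lecture's analysis), inputs of at most 5 elements are handled by sorting directly, and only the ⌊n/5⌋ full groups contribute medians.

```python
def select(items, i):
    """Return the i-th smallest (1-based) of a list of distinct items."""
    a = list(items)
    n = len(a)
    if not 1 <= i <= n:
        raise ValueError("rank i must satisfy 1 <= i <= n")
    if n <= 5:
        return sorted(a)[i - 1]       # base case: constant-size input

    # Step 1: split into floor(n/5) full groups of 5 and take each median.
    medians = [sorted(a[j:j + 5])[2] for j in range(0, 5 * (n // 5), 5)]

    # Step 2: recursively select the median of the group medians as pivot x.
    x = select(medians, (len(medians) + 1) // 2)

    # Step 3: partition around x; k is the rank of x among all n elements.
    lower = [v for v in a if v < x]
    upper = [v for v in a if v > x]
    k = len(lower) + 1

    # Step 4: same three-way decision as in RAND-SELECT.
    if i == k:
        return x
    if i < k:
        return select(lower, i)
    return select(upper, i - k)


if __name__ == "__main__":
    data = [6, 10, 13, 5, 8, 3, 2, 11]
    print(select(data, 7))
```

Building `lower` and `upper` as new lists keeps the sketch short; an in-place partition, as in RAND-SELECT, would avoid the extra memory.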
Choosing the pivot
1. Divide the n elements into groups of 5. Find the median of each 5-element group by rote.
2. Recursively SELECT the median x of the ⌊n/5⌋ group medians to be the pivot.
[Figure: the n elements arranged in groups of 5, with an axis labeled from lesser to greater.]
Analysis (Assume all elements are distinct.)
At least half the group medians are ≤ x, which is at least ⌊⌊n/5⌋/2⌋ = ⌊n/10⌋ group medians.
• Therefore, at least 3⌊n/10⌋ elements are ≤ x.
• Similarly, at least 3⌊n/10⌋ elements are ≥ x.
[Figure: the groups-of-5 diagram, with an axis labeled from lesser to greater.]
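The 3⌊n/10⌋ guarantee can be observed empirically. The sketch below computes the median-of-medians pivot by sorting (which is fine for a check, though not how SELECT finds it) and counts the elements on each side. The helper name `pivot_balance` is hypothetical, and the input is assumed to contain at least 5 distinct elements.

```python
def pivot_balance(items):
    """Return (x, number of elements <= x, number of elements >= x) for the
    median-of-medians pivot x computed over the full groups of 5."""
    a = list(items)
    n = len(a)
    medians = [sorted(a[j:j + 5])[2] for j in range(0, 5 * (n // 5), 5)]
    x = sorted(medians)[(len(medians) - 1) // 2]   # lower median of the medians
    at_most = sum(1 for v in a if v <= x)
    at_least = sum(1 for v in a if v >= x)
    return x, at_most, at_least


if __name__ == "__main__":
    data = [(37 * j) % 211 for j in range(100)]    # 100 distinct values
    x, at_most, at_least = pivot_balance(data)
    print(x, at_most, at_least, 3 * (len(data) // 10))
```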
Minor simplification
• For n ≥ 50, we have 3⌊n/10⌋ ≥ n/4.
• Therefore, for n ≥ 50 the recursive call to SELECT in Step 4 is executed on ≤ 3n/4 elements, since at least n/4 elements lie on the other side of x.
• Thus, the recurrence for running time can assume that Step 4 takes time T(3n/4) in the worst case.
• For n < 50, we know that the worst-case time is T(n) = Θ(1).
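The threshold n ≥ 50 can be confirmed with a short check. The function name `bound_holds` is hypothetical; the inequality 3⌊n/10⌋ ≥ n/4 is multiplied through by 4 so that only integers are compared.

```python
def bound_holds(n):
    """Check 3 * floor(n/10) >= n/4, rewritten as 12 * floor(n/10) >= n."""
    return 12 * (n // 10) >= n


if __name__ == "__main__":
    failures = [n for n in range(1, 1000) if not bound_holds(n)]
    print("largest n where the bound fails:", max(failures))
```

Writing n = 10m + r with 0 ≤ r ≤ 9, the inequality becomes 2m ≥ r, which holds for every r once m ≥ 5, that is, once n ≥ 50.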
Developing the recurrence
SELECT(i, n)    [total: T(n)]
1. Divide the n elements into groups of 5. Find the median of each 5-element group by rote.    [Θ(n)]
2. Recursively SELECT the median x of the ⌊n/5⌋ group medians to be the pivot.    [T(n/5)]
3. Partition around the pivot x. Let k = rank(x).    [Θ(n)]
4. if i = k then return x    [T(3n/4)]
   elseif i < k
       then recursively SELECT the ith smallest element in the lower part
       else recursively SELECT the (i–k)th smallest element in the upper part
Solving the recurrence
$$
T(n) = T\Bigl(\frac{1}{5} n\Bigr) + T\Bigl(\frac{3}{4} n\Bigr) + \Theta(n)
$$
Substitution: T(n) ≤ cn
$$
\begin{aligned}
T(n) &\le \frac{1}{5} cn + \frac{3}{4} cn + \Theta(n) \\
&= \frac{19}{20} cn + \Theta(n) \\
&= cn - \Bigl( \frac{1}{20} cn - \Theta(n) \Bigr) \\
&\le cn,
\end{aligned}
$$
if c is chosen large enough to handle both the Θ(n) and the initial conditions.
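The linear bound can be illustrated by evaluating a concrete instance of the recurrence. The sketch below makes several assumptions that go beyond the slide: the Θ(n) term is taken to be exactly n, the arguments n/5 and 3n/4 are rounded down, and T(n) = 1 for n < 50. Under those assumptions the substitution argument gives T(n) ≤ 20n, which the code checks.

```python
from functools import lru_cache


@lru_cache(maxsize=None)
def t(n):
    """One concrete instance of T(n) = T(n/5) + T(3n/4) + Theta(n)."""
    if n < 50:
        return 1                      # base case: constant work
    return t(n // 5) + t(3 * n // 4) + n


if __name__ == "__main__":
    for n in (50, 1_000, 100_000, 10_000_000):
        print(n, t(n), t(n) / n)
```

The printed ratio T(n)/n stays below 20, matching c = 20 in the substitution: (1/5)(20n) + (3/4)(20n) + n = 20n.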
Conclusions
• Since the work at each level of recursion
is a constant fraction (19/20) smaller, the
work per level is a geometric series
dominated by the linear work at the root.
• In practice, this algorithm runs slowly,
because the constant in front of n is large.
• The randomized algorithm is far more
practical.
Exercise: Why not divide into groups of 3?