Introduction To Algorithms: 6.046J/18.401J/SMA5503

This document discusses algorithms for selecting the ith smallest element from a list of n elements, known as order statistics. It first describes a naive sorting-based algorithm with Θ(n log n) time, then presents a randomized divide-and-conquer algorithm that runs in expected linear time. The document analyzes the expected running time of the randomized algorithm and proves it is O(n). It also briefly mentions a deterministic linear time algorithm.


Introduction to Algorithms

6.046J/18.401J/SMA5503

Lecture 6
Prof. Erik Demaine
Order statistics
Select the ith smallest of n elements (the element with rank i).
• i = 1: minimum;
• i = n: maximum;
• i = ⌊(n+1)/2⌋ or ⌈(n+1)/2⌉: median.
Naive algorithm: Sort and index the ith element.
Worst-case running time = Θ(n lg n) + Θ(1)
= Θ(n lg n),
using merge sort or heapsort (not quicksort).
© 2001 by Charles E. Leiserson Introduction to Algorithms Day 9 L6.2
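The naive algorithm can be sketched in a few lines of Python (a minimal illustration; the function name `naive_select` is ours, not the lecture's):

```python
def naive_select(a, i):
    """Return the ith smallest element of a (1-indexed) by sorting first.

    Sorting costs Theta(n lg n); the index lookup is Theta(1).
    """
    return sorted(a)[i - 1]
```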
Randomized divide-and-conquer algorithm
RAND-SELECT(A, p, q, i)  ⊳ ith smallest of A[p . . q]
  if p = q then return A[p]
  r ← RAND-PARTITION(A, p, q)
  k ← r – p + 1  ⊳ k = rank(A[r])
  if i = k then return A[r]
  if i < k
    then return RAND-SELECT(A, p, r – 1, i)
    else return RAND-SELECT(A, r + 1, q, i – k)
[Figure: after partitioning, elements ≤ A[r] occupy A[p . . r–1], elements ≥ A[r] occupy A[r+1 . . q], and the pivot A[r] has rank k = r – p + 1.]
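RAND-SELECT can be sketched in Python as follows (a minimal sketch, not the lecture's code; `rand_partition` uses a Lomuto-style partition with a random pivot, which satisfies RAND-PARTITION's contract but is not necessarily its exact implementation):

```python
import random

def rand_partition(a, p, q):
    """Partition a[p..q] in place around a randomly chosen pivot.

    Returns the pivot's final index r, so that elements <= a[r] lie in
    a[p..r-1] and elements >= a[r] lie in a[r+1..q].
    """
    s = random.randint(p, q)
    a[s], a[q] = a[q], a[s]          # move the random pivot to the end
    pivot = a[q]
    r = p
    for j in range(p, q):
        if a[j] < pivot:
            a[r], a[j] = a[j], a[r]
            r += 1
    a[r], a[q] = a[q], a[r]          # place the pivot at its final slot
    return r

def rand_select(a, p, q, i):
    """Return the ith smallest element (1-indexed) of a[p..q]."""
    if p == q:
        return a[p]
    r = rand_partition(a, p, q)
    k = r - p + 1                    # rank of the pivot within a[p..q]
    if i == k:
        return a[r]
    if i < k:
        return rand_select(a, p, r - 1, i)
    return rand_select(a, r + 1, q, i - k)
```

On the lecture's example array, `rand_select([6, 10, 13, 5, 8, 3, 2, 11], 0, 7, 7)` returns 11, the 7th smallest.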
Example
Select the i = 7th smallest:
  6  10  13  5  8  3  2  11        i = 7
  pivot = 6
Partition:
  2  5  3  6  8  13  10  11        k = 4
Select the 7 – 4 = 3rd smallest recursively in the upper part.

Intuition for analysis
(All our analyses today assume that all elements are distinct.)
Lucky:
T(n) = T(9n/10) + Θ(n)
     = Θ(n)        CASE 3, since n^(log_{10/9} 1) = n^0 = 1
Unlucky:
T(n) = T(n – 1) + Θ(n)        arithmetic series
     = Θ(n²)
Worse than sorting!
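The unlucky case's arithmetic series can be checked directly (a trivial sketch; the function name `unlucky_cost` is ours):

```python
def unlucky_cost(n):
    """Total cost of T(n) = T(n-1) + n with T(0) = 0: the arithmetic
    series 1 + 2 + ... + n = n(n+1)/2, which is Theta(n^2)."""
    total = 0
    for m in range(1, n + 1):
        total += m
    return total
```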
Analysis of expected time
The analysis follows that of randomized quicksort, but it's a little different.
Let T(n) = the random variable for the running time of RAND-SELECT on an input of size n, assuming random numbers are independent.
For k = 0, 1, …, n–1, define the indicator random variable
X_k = 1 if PARTITION generates a k : n–k–1 split,
      0 otherwise.
Analysis (continued)
To obtain an upper bound, assume that the ith element always falls in the larger side of the partition:
T(n) = T(max{0, n–1}) + Θ(n)    if 0 : n–1 split,
       T(max{1, n–2}) + Θ(n)    if 1 : n–2 split,
       ⋮
       T(max{n–1, 0}) + Θ(n)    if n–1 : 0 split
     = Σ_{k=0}^{n–1} X_k (T(max{k, n–k–1}) + Θ(n)) .
Calculating expectation
E[T(n)] = E[ Σ_{k=0}^{n–1} X_k (T(max{k, n–k–1}) + Θ(n)) ]
Take expectations of both sides.
        = Σ_{k=0}^{n–1} E[ X_k (T(max{k, n–k–1}) + Θ(n)) ]
Linearity of expectation.
        = Σ_{k=0}^{n–1} E[X_k] · E[ T(max{k, n–k–1}) + Θ(n) ]
Independence of X_k from other random choices.
        = (1/n) Σ_{k=0}^{n–1} E[T(max{k, n–k–1})] + (1/n) Σ_{k=0}^{n–1} Θ(n)
Linearity of expectation; E[X_k] = 1/n.
        ≤ (2/n) Σ_{k=⌊n/2⌋}^{n–1} E[T(k)] + Θ(n)
Upper terms appear twice.
Hairy recurrence
(But not quite as hairy as the quicksort one.)
E[T(n)] = (2/n) Σ_{k=⌊n/2⌋}^{n–1} E[T(k)] + Θ(n)
Prove: E[T(n)] ≤ cn for some constant c > 0.
• The constant c can be chosen large enough so that E[T(n)] ≤ cn for the base cases.
Use fact: Σ_{k=⌊n/2⌋}^{n–1} k ≤ (3/8)n²  (exercise).
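The fact can be verified numerically (a quick sanity check, not the closed-form proof the exercise asks for; `upper_half_sum` is our name):

```python
def upper_half_sum(n):
    """Sum of k for k = floor(n/2), ..., n-1."""
    return sum(range(n // 2, n))

# The bound sum <= (3/8) n^2 holds for every n checked.
for n in range(2, 2001):
    assert upper_half_sum(n) <= 3 * n * n / 8
```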
Substitution method
E[T(n)] ≤ (2/n) Σ_{k=⌊n/2⌋}^{n–1} ck + Θ(n)
Substitute inductive hypothesis.
        ≤ (2c/n)·(3/8)n² + Θ(n)
Use fact.
        = cn – (cn/4 – Θ(n))
Express as desired – residual.
        ≤ cn ,
if c is chosen large enough so that cn/4 dominates the Θ(n).
Summary of randomized
order-statistic selection
• Works fast: linear expected time.
• Excellent algorithm in practice.
• But, the worst case is very bad: Θ(n2).
Q. Is there an algorithm that runs in linear
time in the worst case?
A. Yes, due to Blum, Floyd, Pratt, Rivest,
and Tarjan [1973].
IDEA: Generate a good pivot recursively.



Worst-case linear-time order statistics
SELECT(i, n)
1. Divide the n elements into groups of 5. Find the median of each 5-element group by rote.
2. Recursively SELECT the median x of the ⌈n/5⌉ group medians to be the pivot.
3. Partition around the pivot x. Let k = rank(x).
4. if i = k then return x
   elseif i < k
     then recursively SELECT the ith smallest element in the lower part
     else recursively SELECT the (i–k)th smallest element in the upper part
   ⊳ Step 4 is the same as in RAND-SELECT.
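SELECT can be sketched compactly in Python (our sketch, not the lecture's code; for simplicity it builds new lists instead of partitioning in place, which keeps the Θ(n) work per level but differs from the in-place pseudocode above):

```python
def select(a, i):
    """Return the ith smallest (1-indexed) of the distinct elements of a,
    in worst-case linear time via the median-of-medians pivot."""
    if len(a) < 50:                        # base case: Theta(1) for n < 50
        return sorted(a)[i - 1]
    # Step 1: median of each group of 5, found "by rote" (sorting 5 items).
    medians = [sorted(a[j:j + 5])[len(a[j:j + 5]) // 2]
               for j in range(0, len(a), 5)]
    # Step 2: recursively SELECT the median x of the group medians.
    x = select(medians, (len(medians) + 1) // 2)
    # Step 3: partition around the pivot x; k = rank(x).
    lower = [e for e in a if e < x]
    upper = [e for e in a if e > x]
    k = len(lower) + 1
    # Step 4: same case analysis as RAND-SELECT.
    if i == k:
        return x
    if i < k:
        return select(lower, i)
    return select(upper, i - k)
```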
Choosing the pivot
1. Divide the n elements into groups of 5. Find the median of each 5-element group by rote.
2. Recursively SELECT the median x of the ⌈n/5⌉ group medians to be the pivot.
[Figure: the n elements drawn as a grid, one sorted column of 5 per group, with lesser elements above and greater elements below; the group medians form the middle row, and x is the median of those medians.]
Analysis  (Assume all elements are distinct.)
• At least half the group medians are ≤ x, which is at least ⌈⌈n/5⌉/2⌉ = ⌈n/10⌉ group medians.
• Therefore, at least 3⌈n/10⌉ elements are ≤ x.
• Similarly, at least 3⌈n/10⌉ elements are ≥ x.
[Figure: the same grid; each group median ≤ x contributes itself and the two lesser elements of its group, giving the 3⌈n/10⌉ count.]
Minor simplification
• For n ≥ 50, we have 3⌈n/10⌉ ≥ n/4.
• Therefore, for n ≥ 50 the recursive call to SELECT in Step 4 is executed recursively on ≤ 3n/4 elements.
• Thus, the recurrence for the running time can assume that Step 4 takes time T(3n/4) in the worst case.
• For n < 50, we know that the worst-case time is T(n) = Θ(1).

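The simplification can be sanity-checked numerically (a quick check of the stated inequality, not a proof):

```python
import math

# For n >= 50, at least 3*ceil(n/10) elements are excluded on each side
# of x, so the Step-4 recursion touches at most n - 3*ceil(n/10) <= 3n/4
# elements.
for n in range(50, 5001):
    assert 3 * math.ceil(n / 10) >= n / 4
    assert n - 3 * math.ceil(n / 10) <= 3 * n / 4
```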


Developing the recurrence
T(n)      SELECT(i, n)
Θ(n)      1. Divide the n elements into groups of 5. Find
             the median of each 5-element group by rote.
T(n/5)    2. Recursively SELECT the median x of the ⌈n/5⌉
             group medians to be the pivot.
Θ(n)      3. Partition around the pivot x. Let k = rank(x).
T(3n/4)   4. if i = k then return x
             elseif i < k
               then recursively SELECT the ith smallest element in the lower part
               else recursively SELECT the (i–k)th smallest element in the upper part
Solving the recurrence
T(n) = T(n/5) + T(3n/4) + Θ(n)
Substitution, with inductive hypothesis T(n) ≤ cn:
T(n) ≤ (1/5)cn + (3/4)cn + Θ(n)
     = (19/20)cn + Θ(n)
     = cn – ((1/20)cn – Θ(n))
     ≤ cn ,
if c is chosen large enough to handle both the Θ(n) and the initial conditions.
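The recurrence can also be evaluated directly to see the linear growth (an empirical sketch under the assumptions above: cost n per level, base case T(n) = 1 for n < 50, and integer division standing in for the exact subproblem sizes):

```python
from functools import lru_cache

@lru_cache(maxsize=None)
def t(n):
    """Evaluate T(n) = T(n/5) + T(3n/4) + n, with T(n) = 1 for n < 50."""
    if n < 50:
        return 1
    return t(n // 5) + t(3 * n // 4) + n

# T(n)/n stays bounded (below the continuous fixed point c = 20),
# consistent with T(n) = O(n).
ratios = [t(n) / n for n in range(50, 20001, 997)]
assert max(ratios) < 20
```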
Conclusions
• Since the work at each level of recursion
is a constant fraction (19/20) smaller, the
work per level is a geometric series
dominated by the linear work at the root.
• In practice, this algorithm runs slowly,
because the constant in front of n is large.
• The randomized algorithm is far more
practical.
Exercise: Why not divide into groups of 3?
