Lecture 9

Median and Order Statistics

Medians and Order Statistics

Prof. Prateek Vishnoi

August 29, 2024

Definition and Terminology

Let X ⊆ N with |X| = n.
The i-th order statistic of X is the i-th smallest element of the set.
The minimum of X is the first order statistic.
The maximum of X is the n-th order statistic.
The median of X is the ⌊(n + 1)/2⌋-th order statistic of X.
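To make the definitions concrete, here is a minimal Python sketch that reads order statistics off a sorted copy of X. The function names and the sample set are illustrative assumptions, not part of the lecture.

```python
def order_statistic(xs, i):
    """Return the i-th smallest element of xs (1-indexed), by sorting a copy."""
    if not 1 <= i <= len(xs):
        raise ValueError("i must satisfy 1 <= i <= n")
    return sorted(xs)[i - 1]


def median(xs):
    """Return the floor((n + 1) / 2)-th order statistic (the lower median)."""
    return order_statistic(xs, (len(xs) + 1) // 2)


X = [7, 2, 9, 4, 5, 11]             # hypothetical example set, n = 6
print(order_statistic(X, 1))        # minimum: first order statistic
print(order_statistic(X, len(X)))   # maximum: n-th order statistic
print(median(X))                    # floor(7 / 2) = 3, so the 3rd smallest
```

For an even n such as this one, the floor in the definition picks the lower of the two middle elements.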

Find the i-th order statistic of X, where 1 ≤ i ≤ n

First approach
Sort X and return the i-th element.
Time complexity = Θ(n log n)

Second approach
Divide and conquer.
Apply RANDOMISED PARTITION from quicksort to the array, selecting a pivot at random.
When the partition ends, the pivot is in position k, with (k − 1) elements in the left subarray and (n − k) elements in the right subarray.
If k = i, return the pivot.
If k < i, recurse on the right subarray, looking for its (i − k)-th order statistic.
If k > i, recurse on the left subarray, looking for its i-th order statistic.
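The second approach can be sketched in Python as follows. This is one possible rendering, not the lecture's own code: the names `randomized_partition` and `randomized_select`, the Lomuto-style partition, and the iterative loop in place of recursion are all assumptions made for illustration.

```python
import random


def randomized_partition(a, lo, hi):
    """Partition a[lo..hi] around a random pivot; return the pivot's final index."""
    p = random.randint(lo, hi)          # both endpoints are inclusive
    a[p], a[hi] = a[hi], a[p]           # move the pivot to the end
    pivot = a[hi]
    store = lo
    for j in range(lo, hi):
        if a[j] < pivot:
            a[store], a[j] = a[j], a[store]
            store += 1
    a[store], a[hi] = a[hi], a[store]   # place the pivot in its final position
    return store


def randomized_select(xs, i):
    """Return the i-th smallest element of xs (1-indexed)."""
    if not 1 <= i <= len(xs):
        raise ValueError("i must satisfy 1 <= i <= n")
    a = list(xs)                        # work on a copy
    lo, hi = 0, len(a) - 1
    while True:
        q = randomized_partition(a, lo, hi)
        k = q - lo + 1                  # rank of the pivot within a[lo..hi]
        if k == i:
            return a[q]
        if i < k:
            hi = q - 1                  # continue in the left part, same i
        else:
            lo = q + 1                  # continue in the right part
            i -= k                      # the k smallest elements are discarded
```

Calling `randomized_select([7, 2, 9, 4, 5, 11], 3)` should return the same value as sorting the list and taking its third element, whatever pivots are drawn.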

Complexity Analysis

Best Case
The best case occurs when the first pivot selected is the i-th order statistic.
Recurrence relation:

T(n) = O(1) + Θ(n)

Θ(n) is needed for the partitioning.
O(1) is needed to return the element.
Time complexity = Θ(n)

Worst Case
The worst case occurs when every pivot selected divides the array into two parts of size (n − 1) and 0, and the pivot is not the i-th order statistic.
Recurrence relation:

T(n) = T(n − 1) + Θ(n)

Θ(n) is needed for the partitioning.
T(n − 1) is needed for the recursive call.
Time complexity = Θ(n²)
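The quadratic behaviour can be observed directly. The sketch below is an illustration under stated assumptions rather than the lecture's algorithm: it replaces the random pivot with a fixed last-element pivot (the hypothetical `select_last_pivot`) so that the bad split is forced, then counts element comparisons while searching for the minimum of an already sorted input.

```python
def select_last_pivot(xs, i):
    """Selection that always uses the last element as pivot.

    Returns (i-th smallest element, number of element comparisons made).
    """
    a = list(xs)
    lo, hi = 0, len(a) - 1
    comparisons = 0
    while True:
        pivot = a[hi]
        store = lo
        for j in range(lo, hi):
            comparisons += 1
            if a[j] < pivot:
                a[store], a[j] = a[j], a[store]
                store += 1
        a[store], a[hi] = a[hi], a[store]
        k = store - lo + 1              # rank of the pivot within a[lo..hi]
        if k == i:
            return a[store], comparisons
        if i < k:
            hi = store - 1
        else:
            lo = store + 1
            i -= k


n = 200
value, comparisons = select_last_pivot(range(1, n + 1), 1)
# On sorted input the pivot is always the maximum, so each round removes
# only the pivot: (n - 1) + (n - 2) + ... + 1 comparisons in total.
print(value, comparisons, n * (n - 1) // 2)
```

With a random pivot the same sequence of splits is still possible, only unlikely, which is why the worst case of the randomised algorithm is also quadratic.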

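Average Case
Recurrence relation:

T(n) = (n − 1) + T(X)

where X is a random variable with 0 ≤ X ≤ n − 1 (the size of the subarray on which the algorithm recurses).

Taking expectations:

T(n) = (n − 1) + E[T(X)]

Possible splits of the array:

(0, n − 1), (1, n − 2), (2, n − 3), …, (n/2 − 2, n/2 + 1), (n/2 − 1, n/2)

Expected size of the larger subarray ≈ 3n/4

Assume pessimistically that the recursion always continues in the larger part. That part has size at most 3n/4 with probability 1/2 and at most n − 1 otherwise, so

E[T(X)] ≤ (1/2) T(3n/4) + (1/2) T(n − 1) ≤ (1/2) T(3n/4) + (1/2) T(n)

Substituting this bound into the equation above and solving for T(n):

T(n) ≤ T(3n/4) + 2(n − 1) = O(n)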
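The last step, from the recurrence to O(n), can be checked by unrolling it into a geometric series. The derivation below is a sketch that ignores floors and assumes T(c) = O(1) for constant-size inputs; neither assumption is stated on the slides.

```latex
\begin{align*}
T(n) &\le T\!\left(\tfrac{3n}{4}\right) + 2(n - 1) \\
     &\le T\!\left(\tfrac{3n}{4}\right) + 2n \\
     &\le 2n \sum_{j \ge 0} \left(\tfrac{3}{4}\right)^{j} + O(1) \\
     &= \frac{2n}{1 - 3/4} + O(1) \;=\; 8n + O(1) \;=\; O(n).
\end{align*}
```

The constant 8 is an artefact of the loose bound used here, not a tight estimate of the expected number of comparisons.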
Deterministic Algorithm for Selection