Mergesort / Quicksort
Problem of the Day
Give an efficient algorithm to determine whether two sets (of
size m and n) are disjoint. Analyze the complexity of your
algorithm in terms of m and n. Be sure to consider the case
where m is substantially smaller than n.
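One candidate solution, sketched below in C (our own illustration, not part of the slide; the function names are ours): sort the smaller set in O(m log m), then binary-search it for each of the n elements of the larger set, for O((n + m) log m) total time when m ≤ n.

    #include <stdlib.h>

    static int cmp_int(const void *x, const void *y) {
        return (*(const int *)x > *(const int *)y) - (*(const int *)x < *(const int *)y);
    }

    /* Return 1 if sets a (size m) and b (size n) share no element, assuming m <= n.
       Sorting the smaller set costs O(m log m); each of the n lookups costs
       O(log m), so the total is O((n + m) log m). */
    int disjoint(int a[], int m, int b[], int n) {
        qsort(a, m, sizeof(int), cmp_int);
        for (int i = 0; i < n; i++)
            if (bsearch(&b[i], a, m, sizeof(int), cmp_int) != NULL)
                return 0;   /* common element found */
        return 1;
    }

When m is substantially smaller than n, this beats sorting the larger set, since the log factor depends only on m.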
Mergesort
Recursive algorithms are based on reducing large problems
into small ones.
A nice recursive approach to sorting involves partitioning
the elements into two groups, sorting each of the smaller
problems recursively, and then interleaving the two sorted
lists to totally order the elements.
https://upload.wikimedia.org/wikipedia/commons/c/cc/Merge-sort-example-300px.gif
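A minimal C sketch of this idea (our own illustration; names like merge_sort and the int element type are assumptions, not from the lecture): merge_sort splits the array in half, sorts each half recursively, and merge interleaves the two sorted runs through a temporary buffer.

    #include <stdlib.h>
    #include <string.h>

    /* Merge the sorted runs a[lo..mid] and a[mid+1..hi] into sorted order. */
    static void merge(int a[], int lo, int mid, int hi) {
        int *tmp = malloc((hi - lo + 1) * sizeof(int));
        int i = lo, j = mid + 1, k = 0;
        while (i <= mid && j <= hi)          /* take the smaller head element */
            tmp[k++] = (a[i] <= a[j]) ? a[i++] : a[j++];
        while (i <= mid) tmp[k++] = a[i++];  /* copy whichever run remains */
        while (j <= hi)  tmp[k++] = a[j++];
        memcpy(a + lo, tmp, k * sizeof(int));
        free(tmp);
    }

    /* Sort a[lo..hi]: split in half, sort each half, then interleave. */
    void merge_sort(int a[], int lo, int hi) {
        if (lo >= hi) return;                /* 0 or 1 elements: already sorted */
        int mid = lo + (hi - lo) / 2;
        merge_sort(a, lo, mid);
        merge_sort(a, mid + 1, hi);
        merge(a, lo, mid, hi);
    }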
Quicksort Implementation
Quicksort uses the same divide-and-conquer idea, but does its work before the recursive calls: partition the array around a pivot element, then sort the two sides recursively.

Sort(A)
    Quicksort(A, 1, n)

Quicksort(A, low, high)
    if (low < high)
        pivotloc = Partition(A, low, high)
        Quicksort(A, low, pivotloc - 1)
        Quicksort(A, pivotloc + 1, high)

Partition(A, low, high)
    pivot = A[low]
    leftwall = low
    for i = low+1 to high
        if (A[i] < pivot) then
            leftwall = leftwall + 1
            swap(A[i], A[leftwall])
    swap(A[low], A[leftwall])
    return leftwall
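The same routine translated to runnable C (a sketch, using 0-based indexing rather than the slides' 1-based arrays):

    static void swap(int *x, int *y) { int t = *x; *x = *y; *y = t; }

    /* Partition a[low..high] around pivot a[low]; return the pivot's final index. */
    static int partition(int a[], int low, int high) {
        int pivot = a[low];
        int leftwall = low;                      /* boundary of elements < pivot */
        for (int i = low + 1; i <= high; i++)
            if (a[i] < pivot)
                swap(&a[i], &a[++leftwall]);
        swap(&a[low], &a[leftwall]);             /* place pivot between the halves */
        return leftwall;
    }

    void quicksort(int a[], int low, int high) {
        if (low < high) {
            int p = partition(a, low, high);
            quicksort(a, low, p - 1);            /* everything left of the pivot */
            quicksort(a, p + 1, high);           /* everything right of it */
        }
    }

Calling quicksort(a, 0, n-1) sorts an n-element array in place.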
Best Case for Quicksort
Since each element ultimately ends up in the correct position,
the algorithm correctly sorts. But how long does it take?
The best case for divide-and-conquer algorithms comes when
we split the input as evenly as possible. Thus in the best case,
each subproblem is of size n/2.
The partition step on each subproblem is linear in its size.
Thus the total effort in partitioning the 2^k problems of size n/2^k at depth k is O(n).
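In other words, the best case satisfies the standard divide-and-conquer recurrence (a worked statement, not on the slide):

    T(n) = 2 T(n/2) + Θ(n)  ⇒  T(n) = Θ(n lg n)

since the Θ(n) of partitioning work per level is repeated over the lg n levels of halving.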
Best Case Recursion Tree
(figure: best-case recursion tree — the pivot p splits each subproblem in half, giving lg n levels with O(n) total partitioning work per level)
Intuition: The Average Case
Half the time, the pivot element will be from the center half of the sorted array.
Whenever the pivot element is from positions n/4 to 3n/4, the larger remaining subarray contains at most 3n/4 elements.
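Carried one step further (a standard back-of-the-envelope argument, not spelled out on the slide): call such a pivot "good". A good pivot occurs with probability 1/2, and after k good splits the larger subproblem has at most (3/4)^k · n elements. Thus at most

    log_{4/3} n ≈ 2.41 lg n

good splits can occur on any root-to-leaf path, and since roughly every second pivot is good, the expected depth is O(lg n) and the expected total work is O(n lg n).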
Optional: Average-Case Analysis of Quicksort
Each call to Partition costs n − 1 comparisons, and the pivot is equally likely to land in any position p, leaving subproblems of sizes p − 1 and n − p. The expected number of comparisons T(n) therefore satisfies

    T(n) = (2/n) Σ_{p=1}^{n} T(p−1) + n − 1

    nT(n) = 2 Σ_{p=1}^{n} T(p−1) + n(n−1)                multiply by n

    (n−1)T(n−1) = 2 Σ_{p=1}^{n−1} T(p−1) + (n−1)(n−2)    apply to n−1

Subtracting the second equation from the first eliminates the sums:

    nT(n) − (n−1)T(n−1) = 2T(n−1) + 2(n−1)

Rearranging the terms gives us:

    T(n)/(n+1) = T(n−1)/n + 2(n−1)/(n(n+1))

Setting a_n = T(n)/(n+1) and telescoping from a_0 = 0:

    a_n ≈ 2 Σ_{i=1}^{n} 1/(i+1) ≈ 2 ln n

We are really interested in T(n), so

    T(n) = (n+1) a_n ≈ 2(n+1) ln n ≈ 1.38 n lg n
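The 1.38 n lg n estimate is easy to sanity-check empirically. The C sketch below (entirely our own, not from the lecture) counts the comparisons quicksort makes on shuffled arrays and prints the average next to the predicted value; on shuffled input the first element is already a random pivot, so the plain algorithm matches the analysis.

    #include <stdio.h>
    #include <stdlib.h>
    #include <math.h>

    static long comparisons;                 /* total comparisons across all runs */

    static void qsort_count(int a[], int lo, int hi) {
        if (lo >= hi) return;
        int pivot = a[lo], wall = lo;
        for (int i = lo + 1; i <= hi; i++) {
            comparisons++;                   /* one comparison against the pivot */
            if (a[i] < pivot) { ++wall; int t = a[i]; a[i] = a[wall]; a[wall] = t; }
        }
        int t = a[lo]; a[lo] = a[wall]; a[wall] = t;
        qsort_count(a, lo, wall - 1);
        qsort_count(a, wall + 1, hi);
    }

    int main(void) {
        const int n = 10000, trials = 100;
        int *a = malloc(n * sizeof(int));
        srand(1);
        for (int t = 0; t < trials; t++) {
            for (int i = 0; i < n; i++) a[i] = i;
            for (int i = n - 1; i > 0; i--) {   /* Fisher-Yates shuffle */
                int j = rand() % (i + 1);
                int tmp = a[i]; a[i] = a[j]; a[j] = tmp;
            }
            qsort_count(a, 0, n - 1);
        }
        printf("average: %.0f   1.38 n lg n: %.0f\n",
               (double)comparisons / trials, 1.38 * n * log2(n));
        free(a);
        return 0;
    }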
Randomized Quicksort
Suppose you are writing a sorting program to run on data
given to you by your worst enemy. Quicksort is good on
average, but bad on certain worst-case instances.
If you used Quicksort, what kind of data would your enemy
give you to run it on? Exactly the worst-case instance, to
make you look bad.
But suppose you picked the pivot element at random.
Now your enemy cannot design a worst-case instance to give
to you, because no matter which data they give you, you
would have the same probability of picking a good pivot!
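In code, randomizing costs one extra line per call: swap a uniformly chosen element into the pivot slot before partitioning. A sketch, reusing the swap() and partition() routines from the earlier C code:

    #include <stdlib.h>

    /* swap() and partition() as defined in the earlier quicksort sketch */
    void swap(int *x, int *y);
    int partition(int a[], int low, int high);

    /* Randomized quicksort: move a random element into the pivot slot first,
       so no fixed input can force worst-case behavior. */
    void randomized_quicksort(int a[], int low, int high) {
        if (low < high) {
            int r = low + rand() % (high - low + 1);  /* index in [low, high];
                                                         slight modulo bias is fine
                                                         for illustration */
            swap(&a[low], &a[r]);                     /* random element becomes pivot */
            int p = partition(a, low, high);
            randomized_quicksort(a, low, p - 1);
            randomized_quicksort(a, p + 1, high);
        }
    }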
Randomized Guarantees
Randomization is a very important and useful idea. By either
picking a random pivot or scrambling the permutation before
sorting it, we can say:
“With high probability, randomized quicksort runs in
Θ(n lg n) time.”
Where before, all we could say is:
“If you give me random input data, quicksort runs in
expected Θ(n lg n) time.”
Importance of Randomization
Since the time bound now does not depend upon your input
distribution, this means that unless we are extremely unlucky
(as opposed to ill prepared or unpopular) we will certainly get
good performance.
Randomization is a general tool to improve algorithms with
bad worst-case but good average-case complexity.
The worst case is still there, but we almost certainly won't
see it.
Pick a Better Pivot
Having the worst case occur when the input is sorted or
almost sorted is very bad, since nearly-sorted data is exactly
what certain applications will feed the algorithm.
To eliminate this problem, pick a better pivot:
1. Use the middle element of the subarray as pivot.
2. Use a random element of the array as the pivot.
3. Perhaps best of all, take the median of three elements
(first, last, middle) as the pivot, as sketched after this
list. Why should we use the median instead of the mean?
Whichever of these three rules we use, the worst case remains
O(n²).
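A sketch of rule 3, again reusing the earlier swap() and partition() routines (the helper name is ours): order the first, middle, and last elements so the median sits in the pivot slot, then partition as usual.

    /* swap() and partition() as defined in the earlier quicksort sketch */
    void swap(int *x, int *y);
    int partition(int a[], int low, int high);

    /* Median-of-three pivot selection: sort a[low], a[mid], a[high] in place,
       move the median into the pivot slot a[low], then partition as usual. */
    int median_of_three_partition(int a[], int low, int high) {
        int mid = low + (high - low) / 2;
        if (a[mid]  < a[low]) swap(&a[mid],  &a[low]);
        if (a[high] < a[low]) swap(&a[high], &a[low]);
        if (a[high] < a[mid]) swap(&a[high], &a[mid]); /* now a[low]<=a[mid]<=a[high] */
        swap(&a[low], &a[mid]);       /* the median becomes the pivot */
        return partition(a, low, high);
    }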
Is Quicksort really faster than Heapsort?