0% found this document useful (0 votes)

21 views13 pages

Week 3

The document discusses the quicksort algorithm, highlighting its advantages over merge sort, such as in-place sorting and average-case efficiency of O(n log n). It also covers the importance of choosing a good pivot and the implications of using randomized pivots to avoid worst-case scenarios. Additionally, it compares lists and arrays in Python, explaining their implementations and performance characteristics.

Uploaded by

Harshdeep Sharma

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

21 views13 pages

Week 3

Uploaded by

Harshdeep Sharma

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

Week 3

QUICK SORT

Shortcomings of merge sort

Merge needs to create a new list to hold the merged elements

No obvious way to efficiently merge two lists in place

Extra storage can be costly
Inherently recursive

Recursive calls and returns are expensive

Merging happens because elements in the left half need to move to the right half and vice
versa

Consider an input of the form [0,2,4,6,1,3,5,9]

Can we divide the list so that everything on the right?

No need to merge!

Divide and conquer without merging

Suppose the median L is m

Move all values ≤ m to left half of L

Right half has values > m

Recursively sort left and right halves

L is now sorted, no merge!

Recurrence: T(n) = 2T(n/2) - n

Rearrange in a single pass, time O(n)

So T(n) is O(nlogn)

How do we find the median?

Sort and pick up the middle element

But our aim is to sort the list!
Instead pick some value in L - pivot

Split L with respect to the pivot element

Quicksort [C.A.R Hoare]

Choose a pivot element

Typically the first element in the array

Partition L into lower and upper parts with respect to the pivot
Move the pivot between the lower and upper partition
Recursively sort the two partitions

High level view of quicksort

Input list

Identify pivot
Mark lower elements and upper elements
Rearrange the elements as lower-pivot-upper

Recursively sort the lower and upper partitions

Partitioning

Scan the list from left to right

Four segments: Pivot , Lower , Upper , Unclassified
Examine the first unclassified element

If it is larger than the pivot, extend Upper to include this element

If it is less than or equal to the pivot, exchange with the first element in Upper . This
extends Lower and shifts Upper by one position.

Pivot is always the first element

Maintain two indices to mark the end of the Lower and Upper segments
After partitioning, exchange the pivot with the last element of the Lower segment.

def quicksort(L, l, r): # Sort L[l:r]

if (r - l <= 1):
return L
(pivot, lower, upper) = (L[l], l + 1, l + 1)
for i in range(l+1,r):
if L[i] > pivot: # Extend upper segment
upper = upper + 1
else: # Exchange L[i] with start of upper segment
(L[i], L[lower]) = (L[lower], L[i])
# Shift both segments
(lower, upper) = (lower + 1, upper + 1)
# Move pivot between lower and upper
(L[i] L[lower-1]) = (L[lower-1] L[l])
(L[i], L[lower 1]) = (L[lower 1], L[l])
lower = lower - 1
# Recursive calls
quicksort(L,l,lower)
quicksort(L, lower+1,upper)
return L

Summary

Quicksort uses divide and conquer, like merge sort.

By partitioning the list carefully, we avoid a merge step

This allows an in place sort

We can also provide an iterative implementation to avoid the cost of recursive calls
The partitioning strategy described is not the only one used in the literature

Can build the lower and upper segments from opposite ends and meet in the middle
Need to analyze the complexity of quicksort

ANALYSIS OF QUICK SORT

Analysis

Partitioning wrt the pivot takes time O(n)

If the pivot is the median

T(n) = 2T(n/2) + n
T(n) is O(nlogn)

Worst case? Pivot is maximum or minimum

Partitions are of size 0, n - 1

T(n) = T(n - 1) + n
T(n) = n + (n - 1) + ... + 1
T(n) is O(n2 )

Already sorted array, worst case!

However, average case is O(nlogn)

Sorting is a rare situation where we can compute this

Values don't matter, only relative order is important

Analyze behaviour over permutations of {1,2,...,n}
Each input permutation is equally likely
Expected running time is O(nlogn)

Randomizaton
Any fixed choice of pivot allows us to construct worst case input
Instead, choose pivot position randomly at each step
Expected run time is again O(nlogn)

Iterative quicksort

Recursive calls work on disjoint segments

No recombination of results is required

Can explicitly keep track of left and right endpoints of each segment to be sorted.

Quicksort in practice

In practice, quicksort is very fast

Very often the default algorithm used for in-built sort functions

Sorting a column in a spreadsheet

Library sort function in a programming language

Summary

The worst case complexity of quicksort is O(n2 )

However, the average case is O(nlogn)
Randomly choosing the pivot is a good strategy to beat worst case inputs
Quicksort works in-place and can be impleted iteratively
Very fast in practice, and often used for built-in sorting functions

Good example of a situation when the worst case upper bound is pessimistic

CONCLUDING REMARKS ON SORTING ALGORITHMS

Stable Sorting

Often list values are tuples

Rows from a table, with multiple columns / attributes

A list of students, each student entry has a roll number, names, marks, ...
Suppose students have already been sorted by roll number
If we now sort by name, will all students with the same name remain in sorted order with
respect to roll number?
Stability of sorting is crucial in many applications
Sorting on column B should not disturb sorting on column A

The quicksort implementation we described is not stable

Swapping values while partitioning can disturb existing sorted order

Merge Sort is stable if we merge carefully

Do not allow elements from the right to overtake elements on the left
While merging, prefer the left list while breaking ties

Other criteria

Minimizing data movement

Imagine each element is a heavy carton

Reduce the effort of moving values around

Best sorting algorithm?

Quicksort is often the algorithm of choice, despite O(n2 ) worst case

Merge sort is typically used for "external" sorting

Database tables taht are too large to store in memory all at once
Retrieve in parts from the disk and write back
Other O(nlogn) algorithms exist - heapsort
Sometimes hybrid strategies are used

Use divide and conquer for large n

Switch to insertion sort when n becomes small (e.g., n < 16)

DIFFERENCE BETWEEN LISTS AND ARRAYS

Sequences

Two basic ways of storing a seequence of values

Lists
Arrays
Lists

Flexible length
Easy to modify the structure
Values are scattered in memory
Arrays

Fixed size
Allocate a contigous block of memory
Supports random access

Lists
Typically a sequence of nodes
Each node contains a value and points to the next node in the sequence

"Linked" list
Easy to modify

Inserting and deletion is easy via local "plumbing"

Flexible size
Need to follow links to access A[i]

Takes time O(i)

Arrays

Fixed size, declared in advance

Allocate a contiguous block of memory

n times the storage for a single value

"Random" access

Compute offset to A[i] from A[0]

Accessing A[i] takes constant time, independent of i
Inserting and deleting elements is expensive

Expanding and contracting requires moving O(n) elements in the worst

Operations

Exchange A[i] and A[j]

Constant time for arrays

O(n) for lists

Delete A[i] , insert v after A[i]

Constant time for lists if we are already at A[i]

O(n) for arrays

Need to keep implementation in mind when analyzing data structures

For instance, can we use binary search to insert in a sorted sequence?

Either search is slow, or insertion is slow, still O(n)

Summary

Sequences can be stored as lists or arrays

Lists are flexible but accessing an element is O(n)
Arrays support random access but are difficult to expand, contract
Algorithm analysis needs to take into account the underlying implementation.
In Python:

Is the built-in type in Python really a "linked" list?

Numpy library provides arrays - are these faster than lists?

DESIGNING A FLEXIBLE LIST AND OPERATIONS ON THE SAME

Implementing lists in Python

Python class Node

A list is a sequence of nodes

self.value is the stored value

self.next points in the next node

Empty list?

self.value is None

Creating lists

l1 = Node() - empty list

l2 = Node(5) - singleton list
l1.isempty() == True
l2.isempty() == False

class Node:
def __init__(self, v = None):
self.value = v
self.next = None
return
def isempty(self):
if self.value == None:
return True
else:
return False

Appending to a list

Add v to the end of list l

If l is empty, update l.value from None
If at last value, l.next is None

Point next at new node with value v

Otherwise, recursively append to rest of list

def append(self, v):

# append, recursive
if self.isempty():
self.value = v
elif self.next == None:
self.next = Node(v)
else:
self.next.append(v)
return

Iterative implementation

If empty, replace l.value by v

Loop through l.next to end of list
Add v to the end of the list

def appendi(self, v):

# append, iterative
if self.isempty():
self.value = v
return

temp = self
while temp.next != None:
temp = temp.next

temp.next = Node(v)
return

Insert at the start of the list

Want to insert v at head

Create a new node with v
Cannot change where the head points!

Exchange the values v0 , v

Make new node point to head.next
Make head.next point to new node

def insert(self, v):

if self.isempty():
self.value = v
return

newnode = Node(v)

# Exchange values in self and newnode

(self.value, newnode.value) = (newnode.value, self.value)

# Switch links
(self.next, newnode.next) = (newnode, self.next)

return

Delete a value v

Remove first occurence of v

Scan list for first v - look ahead at next node
If next node value is v, bypass it
Cannot bypass the first node in the list

Instead, copy the second node value to head

Bypass second node
Recursive implementation

def delete(self, v):

# delete, recursive
if self.isempty():
return
if self.value == v:
self.value = None
if self.next != None:
self.value = self.next.value
self.next = self.next.next
return
else:
if self.next != None:
self.next.delete(v):
if self.next.value == None:
self.next = None
return

Summary

Use a linked list of nodes to implement a flexible list

Append is easy
Insert requires some care, cannot change where the head points to
When deleting, look one step ahead to bypass the node to be deleted

IMPLEMENTATION OF LISTS IN PYTHON

Lists in Python

Python lists are not implemented as flexible linked lists

Underlying interpretation maps the list to an array

Assign a fixed block when you create a list

Double the size if the list overflows the array
Keep track of the last position of the list in the array

l.append() and l.pop() are constant time, amortised - O(1)

Insertion/deletion require time O(n)
Effectively, Python lists behave more like arrays than lists

Arrays v/s Lists in Python

Arrays are useful for representing matrices

In list notation, these are nested lists
0 1
( )
0 1

that is [[0,1], [1,0]]

Need to be careful when initializing a multidimensional list

zerolist = [0,0,0]
zeromatrix = [zerolist, zerolist, zerolist]

zeromatrix[1][1] = 1
print(zeromatrix)

[[0, 1, 0], [0, 1, 0], [0, 1, 0]]

Mutuability aliases different values

Instead use list comprehension

zeromatrix = [ [0 for i in range(3)] for j in range(3) ]

Numpy Arrays

The Numpy library provides arrays as a basic type

import numpy as np
zeromatrix = np.zeros(shape = (3,3))

Can create an array from any sequence type

newarray = np.array([[0,1],[1,0]])

arange is the equivalent of range for lists

row2 = np.arange(5)

Can operate on amtrix as a whole

C = 3*A + B
C = np.matmul(A,B)
same as C[i,j] = A[i.k].B[k,j]
Very useful for data science

Summary

Python lists are not implemented as flexible linked structures

Instead, allocate an array and double space as needed
Append is cheap, insert is expensive
Arrays can be represented as multidimensional lists,but need to be careful about
mutability, aliasing
Numpy arrays are easier to use

IMPLEMENTATION OF DICTIONARY IN PYTHON

Dictionary

An array/list allows access through positional indices

A dictionary allows access through arbitrary keys

A collection of key-value pairs

Random access - access time is the same for all keys

Implementing a dictionary

The underlying storage is an array

Given an offset i , find A[i] in constant time

Keys have to be mapped to {0,1,..,n-1}

Given an key k , convert it to an offset i

Hash function

h : S -> X maps a set of values S to a small range of integers X = {0,1,...,n-1}

Typically |X| << |S |, so there will be collisions, h(s) = h(s′ ) , s ≠ s′
A good hash function will minimize collisions
SHA-256 is an industry standard hashing function whose range is 256 bits

Use to hash large files - avoid uploading to cloud storage

Hash Table

An array A of size n combined with a hash function h

h maps keys to {0,1,...,n-1}
Ideally, when we create an entry for key k , A[h(k)] will be unused

What if there is already a value at that location?

Dealing with collisions

Open addressing (closed hashing)

Probe a sequence of alternate slots in the same array

Open hashing
Each slot in the array points to a list of values
Insert into the list for the given slot
Dictionary keys in Python must be immutable

If value changes, hash also changes!

Summary

A dictionary is implemented as a hash table

An array plus a hash function

Creating a good hash function is important
Need a strategy to deal with collisions

Open addressing/closed hashing - probe for free space in the array

Open hashing - each slot in the hash table points to a list of key-value pairs
many heuristics/optimizations possible for dea

Sorting and Hashing
100% (1)
Sorting and Hashing
83 pages
Merge and Quick
100% (1)
Merge and Quick
23 pages
Quick Sort
No ratings yet
Quick Sort
18 pages
2.7-Quick Sort
100% (1)
2.7-Quick Sort
16 pages
PM Clinic Dozers Komatsu
100% (1)
PM Clinic Dozers Komatsu
3 pages
Design and Analysis of Algorithm
No ratings yet
Design and Analysis of Algorithm
33 pages
CS3353 Unit5
No ratings yet
CS3353 Unit5
21 pages
Unit 1
No ratings yet
Unit 1
116 pages
01 Road Roller Basic Knowledge (6611E)
0% (1)
01 Road Roller Basic Knowledge (6611E)
16 pages
Algorithms Project Report
No ratings yet
Algorithms Project Report
7 pages
Lecture-3 (DivideAndConquer)
No ratings yet
Lecture-3 (DivideAndConquer)
83 pages
Week-4 Sorting, Dictionaries and Functions
No ratings yet
Week-4 Sorting, Dictionaries and Functions
110 pages
Mergesort: Merge Sort Visualizer
No ratings yet
Mergesort: Merge Sort Visualizer
90 pages
Week 6
No ratings yet
Week 6
39 pages
11 Sorting
No ratings yet
11 Sorting
103 pages
Module 6 Search Sort Hashing
No ratings yet
Module 6 Search Sort Hashing
62 pages
Sorting UNIT 5
No ratings yet
Sorting UNIT 5
66 pages
L11 Sorting&Searching
No ratings yet
L11 Sorting&Searching
61 pages
Lecture 4 - Quicksort
No ratings yet
Lecture 4 - Quicksort
52 pages
Petts, Ann - Shapley, Bernard - On Supervision - Psychoanalytic and Jungian Analytic Perspectives-Karnac (2007)
100% (1)
Petts, Ann - Shapley, Bernard - On Supervision - Psychoanalytic and Jungian Analytic Perspectives-Karnac (2007)
266 pages
Week 02 (Complexity of Sorting Algorithms)
No ratings yet
Week 02 (Complexity of Sorting Algorithms)
62 pages
Dsa CH 2
No ratings yet
Dsa CH 2
50 pages
L9 Sorting
No ratings yet
L9 Sorting
50 pages
Sorting and Searching II
No ratings yet
Sorting and Searching II
34 pages
Quick-Sort Algorithm
No ratings yet
Quick-Sort Algorithm
53 pages
07 Sort2
No ratings yet
07 Sort2
87 pages
C Programming and Data Structures 41394658 2025 06-20-08 20
No ratings yet
C Programming and Data Structures 41394658 2025 06-20-08 20
35 pages
Notes 03 Sorting PDF
No ratings yet
Notes 03 Sorting PDF
126 pages
Lecture 7 - Sorting
No ratings yet
Lecture 7 - Sorting
38 pages
UNIT V Data Structures OU
No ratings yet
UNIT V Data Structures OU
42 pages
PDSA Week 3
No ratings yet
PDSA Week 3
33 pages
s4 Quick Sort
No ratings yet
s4 Quick Sort
27 pages
(Divide and Conquer) - Merge and Quick Sort
No ratings yet
(Divide and Conquer) - Merge and Quick Sort
34 pages
Sortings
No ratings yet
Sortings
92 pages
Sorting
No ratings yet
Sorting
54 pages
DSD Unit 3 Sorting and Searching
No ratings yet
DSD Unit 3 Sorting and Searching
36 pages
CPT212 07 Sorting - Efficient
No ratings yet
CPT212 07 Sorting - Efficient
25 pages
Cse Daa Lab Manual
No ratings yet
Cse Daa Lab Manual
29 pages
Lecture No.45 Data Structures: Dr. Sohail Aslam
No ratings yet
Lecture No.45 Data Structures: Dr. Sohail Aslam
54 pages
Quick Sort
No ratings yet
Quick Sort
19 pages
Lecture 12 Sorting Complete
No ratings yet
Lecture 12 Sorting Complete
32 pages
Sorting Visualizer Daa Lab Project
No ratings yet
Sorting Visualizer Daa Lab Project
19 pages
Sorting Algorithms
No ratings yet
Sorting Algorithms
66 pages
Lab 8
No ratings yet
Lab 8
8 pages
Topic: Searching and Sorting Algorithm: Joy of Python Using Cloud Computing
No ratings yet
Topic: Searching and Sorting Algorithm: Joy of Python Using Cloud Computing
17 pages
DAA03 Quick Sort Stressen
No ratings yet
DAA03 Quick Sort Stressen
35 pages
DS
No ratings yet
DS
13 pages
Merge, Quick, Radix Sort
No ratings yet
Merge, Quick, Radix Sort
9 pages
Lec32 35
No ratings yet
Lec32 35
40 pages
DAA Module 4
No ratings yet
DAA Module 4
11 pages
DS Data Structures PPT-module-6
No ratings yet
DS Data Structures PPT-module-6
13 pages
Dsa Ass 2 by Syed
No ratings yet
Dsa Ass 2 by Syed
11 pages
Compiled By: Dr. Mohammad Omar Alhawarat: Sorting
No ratings yet
Compiled By: Dr. Mohammad Omar Alhawarat: Sorting
52 pages
DSA Lab 5
No ratings yet
DSA Lab 5
11 pages
Quicksort, Mergesort, and Heapsort
No ratings yet
Quicksort, Mergesort, and Heapsort
22 pages
Quick Sort: As The Name Implies, It Is Quick, and It Is The Algorithm Generally Preferred For Sorting
No ratings yet
Quick Sort: As The Name Implies, It Is Quick, and It Is The Algorithm Generally Preferred For Sorting
21 pages
Sorting Techniques
No ratings yet
Sorting Techniques
6 pages
Design & Analysis of Algorithms: Submitted To Prof. Hashim Javed Submitted by Affifa ID: 30 21-Aug-2019
No ratings yet
Design & Analysis of Algorithms: Submitted To Prof. Hashim Javed Submitted by Affifa ID: 30 21-Aug-2019
11 pages
Sorting Method
No ratings yet
Sorting Method
6 pages
H2S Presentation
No ratings yet
H2S Presentation
66 pages
Clarion Dxz838rmp
No ratings yet
Clarion Dxz838rmp
28 pages
Quick Sort
No ratings yet
Quick Sort
4 pages
Action Plan: Department of Education
No ratings yet
Action Plan: Department of Education
3 pages
Print - Udyam Registration Certificate
No ratings yet
Print - Udyam Registration Certificate
2 pages
5.size Oriented and Function Oriented Metrics
No ratings yet
5.size Oriented and Function Oriented Metrics
4 pages
Untitled 2
No ratings yet
Untitled 2
31 pages
Sunlight Dishwashing Liquid Msds
No ratings yet
Sunlight Dishwashing Liquid Msds
12 pages
Ce2304 Nol
No ratings yet
Ce2304 Nol
171 pages
Development Agreement
No ratings yet
Development Agreement
36 pages
ARIBA Supplier Manual
No ratings yet
ARIBA Supplier Manual
23 pages
GL850G Icpdf
No ratings yet
GL850G Icpdf
38 pages
Solutions
No ratings yet
Solutions
30 pages
Intellect OCR To SAP FB60 Integration Proposal
No ratings yet
Intellect OCR To SAP FB60 Integration Proposal
2 pages
Concurrence of Big Data Analytics and Healthcare
No ratings yet
Concurrence of Big Data Analytics and Healthcare
10 pages
1612-Article Text-6168-1-4-20250219
No ratings yet
1612-Article Text-6168-1-4-20250219
20 pages
HTML - Multiple Web Frameset
No ratings yet
HTML - Multiple Web Frameset
8 pages
HV Link Boxe Epp 1665 5-13
No ratings yet
HV Link Boxe Epp 1665 5-13
8 pages
ZEOFREE® 600 - Evonik
No ratings yet
ZEOFREE® 600 - Evonik
2 pages
Datasheet For Steel Grades High Alloy Aquamet 22
No ratings yet
Datasheet For Steel Grades High Alloy Aquamet 22
3 pages
GBV Monthly Work Plan
No ratings yet
GBV Monthly Work Plan
20 pages
Journey Management Plan 3
No ratings yet
Journey Management Plan 3
1 page
C1 Reading Political Manifestos
No ratings yet
C1 Reading Political Manifestos
3 pages
GBC - Group Contract Assignment Guidelines and Rubric 2023 3
No ratings yet
GBC - Group Contract Assignment Guidelines and Rubric 2023 3
4 pages
Determination of MSW Specific Weight
No ratings yet
Determination of MSW Specific Weight
10 pages
EN Checklist ISO Aanvulling Ontwerp 7 - 3 260303
No ratings yet
EN Checklist ISO Aanvulling Ontwerp 7 - 3 260303
3 pages
DCIT 65 Class Activity 1
No ratings yet
DCIT 65 Class Activity 1
2 pages
Quiz No. 2
No ratings yet
Quiz No. 2
1 page
Design And Analysis Of Algorithm
From Everand
Design And Analysis Of Algorithm
Bhupendra Mandloi
No ratings yet
300+ Python Algorithms: Mastering the Art of Problem-Solving
From Everand
300+ Python Algorithms: Mastering the Art of Problem-Solving
Hernando Abella
5/5 (1)
Introduction to Algorithms
From Everand
Introduction to Algorithms
S VASIST
No ratings yet