
Module 5

SORTING TECHNIQUES
AND HASHING
Sorting Techniques
O(n^2): 1. Bubble sort, 2. Insertion sort, 3. Selection sort
O(n log n): 4. Quick sort, 5. Merge sort, 6. Heap sort
Sorting Techniques
The task of rearranging data in some order.

Or, the task of rearranging a set of records based on their key values, when the records are stored in a file.

Sorting Techniques - Terminologies
▪Internal Sort – done entirely within primary memory
▪External Sort – also needs slower external memory
▪Ascending Order – each element is less than or equal to the next
▪Descending Order – each element is greater than or equal to the next
▪Lexicographic Order – dictionary order
▪Collating Sequence – an ordering defined over a set of characters (higher, lower or same order)
Sorting Techniques - Terminologies
Random Order – no particular ordering
Swap: interchanging the data contents between two storage locations
Stable Sort: preserves the relative ordering of equal data values
In-Place Sort: sorts values within the data structure without the help of any external storage
Item: element to be sorted
Sorting Techniques

Sorting
• Internal
  ◦ By comparison – Insertion, Selection, Exchange, Enumeration
  ◦ By distribution
• External

Sorting Techniques

• Internal sorting
  ◦ By comparison
    ▪ Insertion – Straight Insertion, Binary Insertion, Two-way Insertion, List Insertion
    ▪ Selection – Straight Selection, Tree Selection, Heap
    ▪ Exchange – Bubble, Shell, Quick
    ▪ Merge – Simple Merge, Two-way Merge
  ◦ By distribution – Radix, Bucket, Counting
• External sorting
  ◦ Merge – Two-way Merge, Multi-way Merge, Polyphase Merge
Insertion Sort
Idea: like sorting a hand of playing cards
◦ Start with an empty left hand and the cards facing down on
the table.
◦ Remove one card at a time from the table, and insert it into
the correct position in the left hand
◦ compare it with each of the cards already in the hand, from right to left
◦ The cards held in the left hand are sorted
◦ these cards were originally the top cards of the pile on the table

Insertion Sort

To insert 12, we need to make room for it by moving first 36 and then 24.
Insertion Sort
Input array: 5 2 4 6 1 3

At each iteration, the array is divided in two sub-arrays:
• left sub-array – sorted
• right sub-array – unsorted
INSERTION-SORT
Alg.: INSERTION-SORT(A)
for j ← 2 to n
    do key ← A[j]
       i ← j − 1
       while i > 0 and A[i] > key
           do A[i + 1] ← A[i]
              i ← i − 1
       A[i + 1] ← key
Insertion Sort
#include <stdio.h>

int main()
{
    int n, array[1000], c, d, t;

    printf("Enter number of elements\n");
    scanf("%d", &n);
    printf("Enter %d integers\n", n);
    for (c = 0; c < n; c++) {
        scanf("%d", &array[c]);
    }
    /* insert element c into the sorted prefix array[0..c-1] by
       swapping it backwards while it is smaller than its left neighbour */
    for (c = 1; c <= n - 1; c++) {
        d = c;
        while (d > 0 && array[d] < array[d-1]) {
            t = array[d];
            array[d] = array[d-1];
            array[d-1] = t;
            d--;
        }
    }
    printf("Sorted list in ascending order:\n");
    for (c = 0; c <= n - 1; c++) {
        printf("%d\n", array[c]);
    }
    return 0;
}
Selection Sort
Idea:
◦ Find the smallest element in the array
◦ Exchange it with the element in the first position
◦ Find the second smallest element and exchange it with the element in the second
position
◦ Continue until the array is sorted

Example

8 4 6 9 2 3 1
1 4 6 9 2 3 8
1 2 6 9 4 3 8
1 2 3 9 4 6 8
1 2 3 4 9 6 8
1 2 3 4 6 9 8
1 2 3 4 6 8 9
1 2 3 4 6 8 9   (sorted)
Selection Sort
Alg.: SELECTION-SORT(A) 8 4 6 9 2 3 1
n ← length[A]
for j ← 1 to n - 1
do smallest ← j
for i ← j + 1 to n
do if A[i] < A[smallest]
then smallest ← i
exchange A[j] ↔ A[smallest]
Selection Sort
#include <stdio.h>

int main()
{
    int array[100], n, c, d, small, temp;

    printf("Enter number of elements\n");
    scanf("%d", &n);
    printf("Enter %d integers\n", n);
    for (c = 0; c < n; c++)
        scanf("%d", &array[c]);
    for (c = 0; c < (n - 1); c++) {
        small = c;                       /* index of the smallest so far */
        for (d = c + 1; d < n; d++) {
            if (array[small] > array[d])
                small = d;
        }
        if (small != c) {                /* exchange into position c */
            temp = array[c];
            array[c] = array[small];
            array[small] = temp;
        }
    }
    printf("Sorted list in ascending order:\n");
    for (c = 0; c < n; c++)
        printf("%d\n", array[c]);
    return 0;
}
Quicksort
Basic concept: divide and conquer.
Select a pivot and split the data into two groups:
• LEFT group: elements < pivot
• RIGHT group: elements > pivot
Recursively apply Quicksort to the subgroups.
Quicksort Start
Start with all data in an array, and consider it unsorted.
Quicksort Step 1
Step 1: select a pivot (it is arbitrary).

26 33 35 29 19 12 22
(pivot = 26)

We will select the first element, as presented in the original algorithm by C.A.R. Hoare in 1962.
Quicksort Step 2
Step 2: start the process of dividing the data into LEFT and RIGHT groups:

26 33 35 29 19 12 22
(pivot = 26; the left marker starts at 33, the right marker at 22)

The LEFT group will have elements less than the pivot.
The RIGHT group will have elements greater than the pivot.
Use the markers left and right.
Quicksort Step 3
Step 3:
If the left element belongs to the LEFT group, increment the left index.
If the right element belongs to the RIGHT group, decrement the right index.
Exchange when you find elements that belong to the other group.
Quicksort Step 4
Step 4: element 33 belongs to the RIGHT group and element 22 belongs to the LEFT group, so exchange the two elements:

before: 26 33 35 29 19 12 22
after:  26 22 35 29 19 12 33
Quicksort Step 5
Step 5: after the exchange, increment the left marker and decrement the right marker.

26 22 35 29 19 12 33
Quicksort Step 6
Step 6: element 35 belongs to the RIGHT group and element 12 belongs to the LEFT group. Exchange, increment left, and decrement right:

before: 26 22 35 29 19 12 33
after:  26 22 12 29 19 35 33
Quicksort Step 7
Step 7: element 29 belongs to RIGHT and element 19 belongs to LEFT. Exchange, increment left, decrement right:

before: 26 22 12 29 19 35 33
after:  26 22 12 19 29 35 33
(the right marker now lies to the left of the left marker)
Quicksort Step 8
Step 8: when the left and right markers pass each other, we are done with the partition task. Swap the element at the right marker with the pivot:

before: 26 22 12 19 29 35 33
after:  19 22 12  26  29 35 33
        (LEFT)  (pivot) (RIGHT)
Quicksort Step 8a
Step 8a: apply Quicksort over the left and right partitions.

Left partition (19 22 12), pivot 19:
• Step 1: interchange 12 and 22
• Step 2: interchange 12 and 19
• The pivot is now placed in its right location.

Similarly, apply Quicksort on the right partition (29 35 33) with 29 as pivot.
Quicksort Step 9
Step 9: apply Quicksort to the LEFT and RIGHT groups, recursively, and assemble the parts when done:

previous pivot: 26
LEFT: 19 22 12 → 12 19 22        RIGHT: 29 35 33 → 29 33 35

12 19 22 26 29 33 35
A second partition example, with pivot_index = 0:

pivot_index = 0     40 20 10 80 60 50 7 30 100

                    [0] [1] [2] [3] [4] [5] [6] [7] [8]

1. While data[too_big_index] <= data[pivot]
       ++too_big_index
2. While data[too_small_index] > data[pivot]
       --too_small_index
3. If too_big_index < too_small_index
       swap data[too_big_index] and data[too_small_index]
4. While too_small_index > too_big_index, go to 1.
5. Swap data[too_small_index] and data[pivot_index]

Trace (too_big_index scans from the left, too_small_index from the right):

40 20 10 80 60 50  7 30 100    too_big_index stops at 80, too_small_index at 30
40 20 10 30 60 50  7 80 100    after swapping 80 and 30 (step 3)
40 20 10 30  7 50 60 80 100    after swapping 60 and 7 (step 3); the indices then cross
 7 20 10 30 40 50 60 80 100    after step 5: pivot 40 swapped into place, pivot_index = 4
Partition Result

7 20 10 30 40 50 60 80 100

[0] [1] [2] [3] [4] [5] [6] [7] [8]

<= data[pivot] > data[pivot]


Recursion: Quicksort Sub-arrays

7 20 10 30 40 50 60 80 100

[0] [1] [2] [3] [4] [5] [6] [7] [8]

<= data[pivot] > data[pivot]


Quicksort Efficiency
The partitioning of an array into two parts is O(n)

The number of recursive calls to Quicksort depends on how many times we can split the array into two groups; on average this is O(log2 n).

The overall Quicksort efficiency is O(n log2 n).

What is the worst-case efficiency?


Quicksort: Worst Case
Assume first element is chosen as pivot.
Assume we get array that is already in order:

pivot_index = 0     2 4 10 12 13 50 57 63 100

                    [0] [1] [2] [3] [4] [5] [6] [7] [8]

1. While data[too_big_index] <= data[pivot]
       ++too_big_index
2. While data[too_small_index] > data[pivot]
       --too_small_index
3. If too_big_index < too_small_index
       swap data[too_big_index] and data[too_small_index]
4. While too_small_index > too_big_index, go to 1.
5. Swap data[too_small_index] and data[pivot_index]

Every element is greater than the pivot 2, so too_small_index walks all the way back to the pivot position, step 3 never swaps, and step 5 leaves the array unchanged:

2 4 10 12 13 50 57 63 100

<= data[pivot]: (empty)        > data[pivot]: the remaining n − 1 elements
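Each partition step in this case removes only the pivot: the recursion then works on sub-arrays of sizes n − 1, n − 2, …, 1, so the total work is proportional to n + (n − 1) + … + 1 = n(n + 1)/2. The worst-case efficiency of Quicksort is therefore O(n^2).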


#include <stdio.h>

/* quick sort function to sort an integer array */
void quicksort(int array[], int firstIndex, int lastIndex)
{
    int pivotIndex, temp, index1, index2;

    if (firstIndex < lastIndex)
    {
        /* assigning first element index as pivot element */
        pivotIndex = firstIndex;
        index1 = firstIndex;
        index2 = lastIndex;
        /* sorting in ascending order with quick sort */
        while (index1 < index2)
        {
            while (array[index1] <= array[pivotIndex] && index1 < lastIndex)
                index1++;
            while (array[index2] > array[pivotIndex])
                index2--;
            if (index1 < index2)
            {
                /* swapping operation */
                temp = array[index1];
                array[index1] = array[index2];
                array[index2] = temp;
            }
        }
        /* at the end of the partition, swap pivot element with index2 element */
        temp = array[pivotIndex];
        array[pivotIndex] = array[index2];
        array[index2] = temp;
        /* recursive calls for quick sort, with partitioning */
        quicksort(array, firstIndex, index2 - 1);
        quicksort(array, index2 + 1, lastIndex);
    }
}

int main()
{
    int array[100], n, i;

    printf("Enter the number of elements you want to sort: ");
    scanf("%d", &n);
    printf("Enter elements in the list: ");
    for (i = 0; i < n; i++)
        scanf("%d", &array[i]);
    quicksort(array, 0, n - 1);
    printf("Sorted elements: ");
    for (i = 0; i < n; i++)
        printf(" %d", array[i]);
    return 0;
}

Sorting Techniques - Comparison
Merge Sort
Step 1 − If there is only one element in the list, it is already sorted; return.
Step 2 − Divide the list recursively into two halves until it can no longer be divided.
Step 3 − Merge the smaller lists into a new list in sorted order.
Merge Sort
Apply divide-and-conquer to sorting problem
Problem: Given n elements, sort elements into non-
decreasing order
Divide-and-Conquer:
◦ If n=1 terminate (every one-element list is already sorted)
◦ If n>1, partition elements into two or more sub-collections;
sort each; combine into a single sorted list
How do we partition?
Partitioning - Choice 1
First n-1 elements into set A, last element set B
Sort A using this partitioning scheme recursively
◦ B already sorted
Combine A and B using method Insert() (= insertion into
sorted array)
Leads to recursive version of InsertionSort()
◦ Number of comparisons: O(n^2)
◦ Best case = n − 1
◦ Worst case = (n − 1) + (n − 2) + … + 1 = n(n − 1)/2
Partitioning - Choice 2
Put element with largest key in B, remaining elements in
A
Sort A recursively
To combine sorted A and B, append B to sorted A
◦ Use Max() to find largest element → recursive SelectionSort()
◦ Use bubbling process to find and move largest element to right-most position → recursive BubbleSort()
All O(n2)
Partitioning - Choice 3
Let’s try to achieve balanced partitioning
A gets n/2 elements, B gets the remaining half
Sort A and B recursively
Combine sorted A and B using a process called merge, which combines two
sorted lists into one
◦ How? We will see soon
Pseudo-code- Merge Sort
1. procedure mergesort( var a as array )
2. if ( n == 1 ) return a
3. var l1 as array = a[0] ... a[n/2]
4. var l2 as array = a[n/2+1] ... a[n]
5. l1 = mergesort( l1 )
6. l2 = mergesort( l2 )
7. return merge( l1, l2 )
8. end procedure
Pseudo-code- Merge Sort
1. procedure merge( var a as array, var b as array )
2.    var c as array
3.    while ( a and b have elements )
4.       if ( a[0] > b[0] )
5.          add b[0] to the end of c
6.          remove b[0] from b
7.       else
8.          add a[0] to the end of c
9.          remove a[0] from a
10.      end if
11.   end while
12.   while ( a has elements )
13.      add a[0] to the end of c
14.      remove a[0] from a
15.   end while
16.   while ( b has elements )
17.      add b[0] to the end of c
18.      remove b[0] from b
19.   end while
20.   return c
21. end procedure
Example
Partition into lists of size n/2

[10, 4, 6, 3, 8, 2, 5, 7]

[10, 4, 6, 3] [8, 2, 5, 7]

[10, 4] [6, 3] [8, 2] [5, 7]

[4] [10] [3][6] [2][8] [5][7]


Example Cont’d
Merge

[2, 3, 4, 5, 6, 7, 8, 10 ]

[3, 4, 6, 10] [2, 5, 7, 8]

[4, 10] [3, 6] [2, 8] [5, 7]

[4] [10] [3][6] [2][8] [5][7]


Merge Sort
#include <stdio.h>

#define max 10

int a[max] = { 10, 14, 19, 26, 27, 31, 33, 35, 42, 44 };
int b[max];

void merging(int low, int mid, int high) {
    int l1, l2, i;

    /* merge the sorted runs a[low..mid] and a[mid+1..high] into b */
    for (l1 = low, l2 = mid + 1, i = low; l1 <= mid && l2 <= high; i++) {
        if (a[l1] <= a[l2])
            b[i] = a[l1++];
        else
            b[i] = a[l2++];
    }
    while (l1 <= mid)
        b[i++] = a[l1++];
    while (l2 <= high)
        b[i++] = a[l2++];
    /* copy the merged run back into a */
    for (i = low; i <= high; i++)
        a[i] = b[i];
}

void sort(int low, int high) {
    int mid;

    if (low < high) {
        mid = (low + high) / 2;
        sort(low, mid);
        sort(mid + 1, high);
        merging(low, mid, high);
    }
}

int main() {
    int i;

    printf("List before sorting\n");
    for (i = 0; i < max; i++)
        printf("%d ", a[i]);

    sort(0, max - 1);

    printf("\nList after sorting\n");
    for (i = 0; i < max; i++)
        printf("%d ", a[i]);
    return 0;
}
Heap and Heap Sort
A heap is a specialized tree-based data structure that satisfies the heap property:
“If A is a parent node of B then the key (the value) of node A is ordered with
respect to the key of node B with the same ordering applying across the heap”
The Heap Data Structure
Def: A heap is a nearly complete binary tree with the following two properties:
◦ Structural property: all levels are full, except possibly the last one, which is filled from
left to right
◦ Order (heap) property: for any node x
Parent(x) ≥ x

A heap is a binary tree that is filled in order.

Example (max-heap):

        8
       / \
      7   4
     / \
    5   2

From the heap property, it follows that: "The root is the maximum element of the heap!"
Heap and Heap Sort
A binary heap is a complete binary tree which satisfies the heap ordering property.
The ordering can be one of two types:
1. the min-heap property: the value of each node is greater than or equal to the
value of its parent, with the minimum-value element at the root.
2. the max-heap property: the value of each node is less than or equal to the value
of its parent, with the maximum-value element at the root.
Heap and Heap Sort
MinHeap MaxHeap
Heap Types
Max-heaps (largest element at root), have the max-heap property:
◦ for all nodes i, excluding the root:
A[PARENT(i)] ≥ A[i]

Min-heaps (smallest element at root), have the min-heap property:


◦ for all nodes i, excluding the root:
A[PARENT(i)] ≤ A[i]

Heap and Heap Sort
The highest (or lowest) priority element is always stored at the root, hence the
name "heap".
A heap is not a sorted structure and can be regarded as partially ordered.
Since a heap is a complete binary tree, it has the smallest possible height – a heap with N nodes always has O(log N) height.
Heap and Heap Sort
Heap Representations
Linked Structure
Array
Array – more advantageous
1. No wastage of array space – complete binary tree
2. Null entries if any at the tail end only
3. No links needed for parent and descendants
Heap and Heap Sort
Insertion to a heap
◦ The new element is initially appended to the end of the heap
◦ The heap property is repaired by comparing the added element with its
parent and moving the added element up a level
◦ This process is called "percolation up".
◦ The comparison is repeated until the parent is larger than or equal to the
percolating element.
Heap and Heap Sort
MaxHeap - Insertion
Heap and Heap Sort
Algorithm InsertMaxHeap
Input: ITEM, the data to be inserted; N, the number of nodes in the heap
Output: ITEM inserted into the heap tree
DS: array A[1….Size]
Heap and Heap Sort
Steps
1. If (N >= SIZE) then
2.    Print "Heap Tree is saturated"
3.    Exit
4. Else
5.    N = N + 1
6.    A[N] = ITEM
7.    i = N
8.    p = i/2
9.    While (p > 0) and (A[p] < A[i]) do
10.      temp = A[i]
11.      A[i] = A[p]
12.      A[p] = temp
13.      i = p
14.      p = i/2
15.   EndWhile
16. EndIf
17. Stop
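A minimal C sketch of the same percolate-up logic (1-based indices as in the steps above; the function name and SIZE are illustrative, not part of the original algorithm):

#define SIZE 100

/* Insert item into a max-heap stored in A[1..*n]; returns 0 on overflow. */
int insert_max_heap(int A[], int *n, int item)
{
    if (*n >= SIZE - 1)
        return 0;                    /* heap tree is saturated */
    int i = ++(*n);                  /* append at the end of the heap */
    A[i] = item;
    int p = i / 2;                   /* parent index */
    while (p > 0 && A[p] < A[i]) {   /* percolate up */
        int temp = A[i];
        A[i] = A[p];
        A[p] = temp;
        i = p;
        p = i / 2;
    }
    return 1;
}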
Heap and Heap Sort
Delete from MinHeap
The minimum element can be found at the root, which is the first
element of the array.
Remove the root and replace it with the last element of the heap
Then restore the heap property by percolating down.
Heap and Heap Sort

• Delete from MaxHeap
• The maximum element can be found at the root, which is the first element of the array.
• Remove the root and replace it with the last element of the heap.
• Then restore the heap property by percolating down.
Heap and Heap Sort
Algorithm DeleteMaxHeap
Input: A heap tree with elements
Output: ITEM data to be deleted, and remaining tree
after deletion
DS: array A[1….Size]
Heap and Heap Sort
1. If (N = 0) then
2.    Print "Heap Tree is exhausted"
3.    Exit
4. EndIf
5. ITEM = A[1]
6. A[1] = A[N]
7. N = N − 1
8. flag = FALSE, i = 1
9. While (flag = FALSE) and (i < N) do
10.   lchild = 2*i, rchild = 2*i + 1
11.   If (lchild <= N) then
12.      x = A[lchild]
13.   Else
14.      x = −∞
15.   EndIf
16.   If (rchild <= N) then
17.      y = A[rchild]
18.   Else
19.      y = −∞
20.   EndIf
21.   If (A[i] >= x) and (A[i] >= y) then
22.      flag = TRUE
23.   Else
24.      If (x >= y) then                 { larger child is on the left }
25.         swap(A[i], A[lchild])
26.         i = lchild
27.      Else                             { larger child is on the right }
28.         swap(A[i], A[rchild])
29.         i = rchild
30.      EndIf
31.   EndIf
32. EndWhile
33. Stop
Array Representation of Heaps
A heap can be stored as an array A.
◦ Root of tree is A[1]
◦ Left child of A[i] = A[2i]
◦ Right child of A[i] = A[2i + 1]
◦ Parent of A[i] = A[⌊i/2⌋]
◦ Heapsize[A] ≤ length[A]

The elements in the subarray A[⌊n/2⌋ + 1 .. n] are leaves.
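In C, these index relations are one-liners (illustrative helpers, 1-based indexing):

/* 1-based heap stored in an array: index arithmetic */
int parent(int i)      { return i / 2; }     /* floor(i/2) via integer division */
int left_child(int i)  { return 2 * i; }
int right_child(int i) { return 2 * i + 1; }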
Operations on Heaps
Maintain/Restore the max-heap property
◦ MAX-HEAPIFY

Create a max-heap from an unordered array


◦ BUILD-MAX-HEAP

Sort an array in place


◦ HEAPSORT

Priority queues
Maintaining the Heap Property

Suppose a node is smaller than a child, while the left and right sub-trees of i are max-heaps.
To eliminate the violation:
◦ Exchange with larger child
◦ Move down the tree
◦ Continue until node is not smaller than children
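A minimal C sketch of this repair step (the MAX-HEAPIFY operation listed earlier; 1-based indices, n is the heap size, written iteratively rather than recursively):

/* Restore the max-heap property at index i, assuming the left and
   right sub-trees of i are already max-heaps (1-based array). */
void max_heapify(int A[], int n, int i)
{
    for (;;) {
        int l = 2 * i, r = 2 * i + 1;
        int largest = i;
        if (l <= n && A[l] > A[largest]) largest = l;
        if (r <= n && A[r] > A[largest]) largest = r;
        if (largest == i)
            break;                    /* node is not smaller than its children */
        int temp = A[i];              /* exchange with the larger child */
        A[i] = A[largest];
        A[largest] = temp;
        i = largest;                  /* continue down the tree */
    }
}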
Example
MAX-HEAPIFY(A, 2, 10)

A[2] ↔ A[4]: A[2] violates the heap property; exchange it with its larger child A[4]. Now A[4] violates the heap property.
A[4] ↔ A[9]: exchange again; the heap property is restored.


Hash Tables
The implementation of hash tables is called hashing.

Hashing is a technique used for performing insertions,


deletions and searches in constant average time (i.e. O(1))

This data structure, however, is not efficient in operations


that require any ordering information among the elements,
such as findMin, findMax and printing the entire table in
sorted order.
General Idea
The ideal hash table structure is merely an array of some fixed size, containing the items.

A stored item needs to have a data member, called key, that will be used in computing the index
value for the item.

◦ Key could be an integer, a string, etc

◦ e.g. a name or Id that is a part of a large employee structure

The size of the array is TableSize.

The items that are stored in the hash table are indexed by values from 0 to TableSize – 1.

Each key is mapped into some number in the range 0 to TableSize – 1.

The mapping is called a hash function.


Example Hash Table

Items: john 25000, phil 31250, dave 27500, mary 28200

key → Hash Function → index

0:
1:
2:
3: john 25000
4: phil 31250
5:
6: dave 27500
7: mary 28200
8:
9:
Hash Function
Hash function: function that governs the mapping of key
values to indices of hash table is called hash function

The hash function:


◦ must be simple, easy and quick to compute.

◦ must distribute the keys evenly among the cells.

If we know which keys will occur in advance we can write


perfect hash functions, but we don’t.
Hash function
Problems:

Keys may not be numeric.

Number of possible keys is much larger than the space


available in table.

Different keys may map into same location


◦ Hash function is not one-to-one => collision.
◦ If there are too many collisions, the performance of the hash
table will suffer dramatically.
Hash Functions
If the input keys are integers then simply Key mod TableSize is a
general strategy.
◦ Unless key happens to have some undesirable properties. (e.g. all keys end
in 0 and we use mod 10)

If the keys are strings, hash function needs more care.


◦ First convert it into a numeric value.
Some methods
Truncation:
◦ e.g. 123456789 map to a table of 1000 addresses by picking 3
digits of the key.
Folding:
◦ e.g. 123|456|789: add them and take mod.
Key mod N:
◦ N is the size of the table, better if it is prime.
Squaring:
◦ Square the key and then truncate
Radix conversion:
◦ e.g. 1 2 3 4: treat it as base 11, truncate if necessary.
Hash Function 1
Add up the ASCII values of all characters of the key.

int hash(const string &key, int tableSize)
{
    int hashVal = 0;

    for (int i = 0; i < key.length(); i++)
        hashVal += key[i];
    return hashVal % tableSize;
}

• Simple to implement and fast.
• However, if the table size is large, the function does not distribute the keys well.
• e.g. table size = 10000, key length <= 8: the hash function can assume values only between 0 and 1016 (= 127 · 8).
Hash Function 2
Examine only the first 3 characters of the key.

int hash(const string &key, int tableSize)
{
    return (key[0] + 27 * key[1] + 729 * key[2]) % tableSize;
}

• In theory, 26 * 26 * 26 = 17576 different words can be generated. However, English is not random: only 2851 different combinations are possible.
• Thus this function, although easily computable, is also not appropriate if the hash table is reasonably large.
Properties of Good Hash Functions

Must return a number in 0, …, TableSize − 1

Should be efficiently computable – O(1) time

Should not waste space unnecessarily
◦ For every index, there is at least one key that hashes to it
◦ Load factor λ = (number of keys) / TableSize

Should minimize collisions (= different keys hashing to the same index)
Integer Keys
Hash(x) = x % TableSize
Good idea to make TableSize prime. Why?
Integer Keys
Hash(x) = x % TableSize
Good idea to make TableSize prime. Why?
◦ Because keys are typically not randomly distributed, but
usually have some pattern
◦ mostly even
◦ mostly multiples of 10
◦ in general: mostly multiples of some k
◦ If k is a factor of TableSize, then only (TableSize/k) slots will
ever be used!
◦ Since the only factor of a prime number is itself, this phenomenon only hurts in the (rare) case where k = TableSize
Strings as Keys
If keys are strings, can get an integer by adding up ASCII
values of characters in key
for (i=0;i<key.length();i++)
hashVal += key.charAt(i);

Problem 1: What if TableSize is 10,000 and all keys are 8


or less characters long?

Problem 2: What if keys often contain the same


characters (“abc”, “bca”, etc.)?
Hashing Strings
Basic idea: consider the string to be an integer (base 128):
Hash("abc") = ('a'·128^2 + 'b'·128^1 + 'c') % TableSize

Problem: although a char can hold 128 values (7-bit ASCII), only a subset of these values are commonly used (26 letters plus some special characters)
◦ So just use a smaller "base"
◦ Hash("abc") = ('a'·32^2 + 'b'·32^1 + 'c') % TableSize

Making the String Hash Easy to Compute
Horner's Rule

int hash(String s) {
    int h = 0;
    for (int i = s.length() - 1; i >= 0; i--) {
        h = (s.charAt(i) + (h << 5)) % tableSize;   // h << 5 is h * 32
    }
    return h;
}

Advantages:
1. minimum number of multiplications (handled by shifts!)
2. avoids overflow, because it does mods during the computation
Optimal Hash Function
The best hash function would distribute keys as evenly as
possible in the hash table
“Simple uniform hashing”
◦ Maps each key to a (fixed) random number
◦ Idealized gold standard
◦ Simple to analyze
◦ Can be closely approximated by best hash functions
Example:
Hash Table of size 10
Keys: 10, 19, 35, 43, 62, 59, 31, 49, 77, 33
Hash function:
1. Add the 2 digits in the key
2. Take the digit at unit’s place as index
Example:
K  → H(K)        Hash Table
10 → 1           0: 19
19 → 0           1: 10
35 → 8           2:
43 → 7           3: 49
62 → 8           4: 59, 31, 77
59 → 4           5:
31 → 4           6: 33
49 → 3           7: 43
77 → 4           8: 35, 62
33 → 6           9:
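The same toy hash in C (a sketch assuming two-digit keys, as in this example):

/* Add the two digits of the key and take the digit at the unit's place. */
int toy_hash(int key)                 /* assumes 0 <= key <= 99 */
{
    int digit_sum = key / 10 + key % 10;
    return digit_sum % 10;            /* e.g. 59 -> 5 + 9 = 14 -> 4 */
}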
Division Method
Fast hashing method, widely accepted.
1. Choose a number h larger than N (usually a prime number)
2. N is the number of keys in K
3. Hash function H:
   H(k) = k mod h, if indices start from 0
   H(k) = (k mod h) + 1, if indices start from 1
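A C sketch for 0-based indices (h is assumed to be a prime larger than the number of keys, per point 1):

/* Division method: H(k) = k mod h */
int hash_division(int k, int h)
{
    return k % h;                     /* add 1 if indices start from 1 */
}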
MidSquare Method
Widely accepted.
1. The address x is obtained by selecting an appropriate number of bits/digits from the middle of the square of the key value
2. Depends on the size of the hash table
H(k) = x
Limitation: time-consuming computation
Advantage: good results and a uniform distribution of the keys
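A hedged C illustration of the mid-square idea (picking 3 middle digits of the square, and the 4-digit key range, are assumptions made for this example):

/* Mid-square method: square the key, then take digits from the
   middle of the result as the hash address. */
int hash_midsquare(int k)             /* assumes k <= 9999 */
{
    long sq = (long)k * k;            /* up to 8 digits */
    return (int)((sq / 100) % 1000);  /* drop 2 low digits, keep 3 -> 0..999 */
}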


Folding Method
1. Pure Folding

2. Fold Shifting

3. Fold Boundary

Chopping: dividing a key into pieces.

Folding also helps to convert a multi-word key into a single word so that another hash function can work on it.
Folding Method
Key: 3455677234

Partitioning: 003 | 455 |677 | 234

Adding: 003+455+677+234 = 1369

Address: 369 (the carry is ignored)
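A pure-folding sketch in C that matches this example (3-digit pieces; the final mod drops the carry):

/* Split the key into 3-digit pieces, add them, keep 3 digits. */
int hash_fold(long key)               /* e.g. 3455677234 -> 369 */
{
    int sum = 0;
    while (key > 0) {
        sum += key % 1000;            /* take the next 3-digit piece */
        key /= 1000;
    }
    return sum % 1000;                /* the carry is ignored */
}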


Digit Analysis Method
❑Form hash addresses by extracting and/or shifting
the extracted digits/bits of the original key

❑Decision for extraction and rearrangement is based


on some analysis

❑Position and rearrangement methods chosen


should be the same for all keys in the set

❑Eg: Extraction of digits at even position from the key


6732541 and reversing them gives the address 427
Digit Analysis Method
❑Analyzing the digits to be extracted

❑For each criterion, hash addresses are calculated


and then a graph is plotted

❑The criterion which produces the most uniform


distribution is chosen

❑Uniform -> graph with smallest peaks and valleys


Digit Analysis Method
❑Keys: 2234, 3452, 2784

❑Best distribution was seen when digits at second


and third positions were extracted

❑Addresses with no rearrangement are 23, 45, 62

❑Rearranged addresses: 32, 54, 26


Direct Addressing
❑Direct-address tables – the most elementary form of hashing.

❑Assumption – a direct one-to-one correspondence between the keys and the numbers 0, 1, …, m − 1.

❑Keys are not very large.

❑Searching is fast, but there is a cost – the size of the array we need is the size of the largest key.

❑Not very useful if only a few keys are widely distributed.

Collision – problem and resolution
❑When an inserted element hashes to the same value as an already inserted element, we have a collision and need to resolve it.

❑we must have a systematic method for placing the


second item in the hash table. This process is called
collision resolution.
Collision – problem and resolution
❑There are several methods for dealing with this:

1. Separate chaining or open hashing

2. Open addressing or closed hashing


a. Linear Probing

b. Quadratic Probing

c. Double Hashing
Separate Chaining or chaining
The idea is to keep a list of all elements that hash to
the same value.
◦ The array elements are pointers to the first nodes of the
lists.

◦ A new item is inserted to the front of the list.


Example
Keys: 0, 1, 4, 9, 16, 25, 36, 49, 64, 81
hash(key) = key % 10.
0: 0
1: 81 → 1
2:
3:
4: 64 → 4
5: 25
6: 36 → 16
7:
8:
9: 49 → 9
Chaining Technique
Advantages:
• Better space utilization for large items.
• Simple collision handling: searching linked list.
• Overflow: we can store more items than the hash table
size.
• Deletion is quick and easy: deletion from the linked list.

Disadvantages:
• Cost of maintaining linked lists
• Extra storage space for link fields
Algorithm HashChaining

Input: K is the item


Output: if K is found in the hash table then return the pointer to the node which contains the key value; else insert K into the linked list
Data Structure: hash table H storing pointers to the singly linked lists
Algorithm HashChaining – steps
1. Index = HashFunction(K)
2. ptr = H[Index]
3. flag = FALSE
4. While (ptr ≠ NULL) and (flag = FALSE) do
5.    If (ptr->DATA = K) then
6.       flag = TRUE
7.       Return(ptr)
8.       Exit
9.    Else
10.      ptr = ptr->LINK
11.   EndIf
12. EndWhile
13. If (flag = FALSE) then
14.    Print "key does not exist"
15.    If (INSERT) then
16.       InsertEnd_SL(H[Index])
17.    EndIf
18. EndIf
19. Stop
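A minimal C sketch of the same search-or-insert logic (here a new key is inserted at the front of its list, per the earlier slide; the names Node, H and HSIZE are illustrative):

#include <stdlib.h>

#define HSIZE 10

typedef struct node {
    int data;
    struct node *link;
} Node;

Node *H[HSIZE];                       /* table of list heads, all NULL initially */

int hash_chain(int k) { return k % HSIZE; }

/* Return the node holding k; if absent, insert it at the front. */
Node *chain_search_insert(int k)
{
    int index = hash_chain(k);
    for (Node *ptr = H[index]; ptr != NULL; ptr = ptr->link)
        if (ptr->data == k)
            return ptr;               /* key found */
    Node *n = malloc(sizeof *n);      /* key does not exist: insert */
    n->data = k;
    n->link = H[index];
    H[index] = n;
    return n;
}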
Open Addressing
If we have enough contiguous memory to store all the keys (m > N), store the keys in the table itself (e.g., insert 14).
No need to use linked lists anymore.

Basic idea:
◦ Insertion: if a slot is full, try another one, until you find an empty one
◦ Search: follow the same sequence of probes

Search time depends on the length of the probe sequence!
Generalize hash function notation:
A hash function now contains two arguments: (i) the key value, and (ii) the probe number:

h(k, p), p = 0, 1, ..., m−1

Probe sequence: <h(k,0), h(k,1), ..., h(k,m−1)>
◦ Must be a permutation of <0, 1, ..., m−1>
◦ There are m! possible permutations
◦ Good hash functions should be able to produce all m! probe sequences

Example: inserting 14 might follow the probe sequence <1, 5, 9>.
Common Open Addressing Methods
Linear probing
Quadratic probing
Double hashing

Note: none of these methods can generate more than m^2 different probe sequences!
Linear Probing
In linear probing, collisions are resolved by sequentially scanning an array
(with wraparound) until an empty cell is found.
◦ i.e. f is a linear function of i, typically f(i)= i.

Example:
◦ Insert items with keys: 89, 18, 49, 58, 9 into an empty hash table.
◦ Table size is 10.
◦ Hash function is hash(x) = x mod 10.
◦ f(i) = i;
Figure: linear probing hash table after each insertion.
Search and Delete
The Search algorithm follows the same probe sequence as the insert
algorithm.
◦ A search for 58 would involve 4 probes.
◦ A search for 19 would involve 5 probes.

We must use lazy deletion (i.e. marking items as deleted)


◦ Standard deletion (i.e. physically removing the item) cannot be performed.
◦ e.g. remove 89 from hash table.
Clustering Problem
As long as table is big enough, a free cell can always be found, but the time
to do so can get quite large.
Worse, even if the table is relatively empty, blocks of occupied cells start
forming.
This effect is known as primary clustering.
Any key that hashes into the cluster will require several attempts to resolve
the collision, and then it will add to the cluster.
Algorithm LinearProbe
Input: K is the key value to be searched or inserted
Output: Return location if found, else insert K in the table if no overflow
occurs
DS: Hash table H of size HSIZE
Algorithm LinearProbe - Steps
1. flag = FALSE
2. index = HashFunction(K)
3. If (K = H[index]) then
4.    Return index
5.    Exit
6. Else
7.    i = index + 1
8.    While (i ≠ index) and (not flag) do
9.       If (H[i] is empty) then        { free cell: K is not in the table }
10.         If (INSERT) then
11.            H[i] = K
12.         EndIf
13.         flag = TRUE
14.      Else
15.         If (H[i] = K) then
16.            flag = TRUE
17.            Return i
18.            Exit
19.         Else
20.            i = (i mod h) + 1        { next cell, with wraparound }
21.         EndIf
22.      EndIf
23.   EndWhile
24.   If (flag = FALSE) and (i = index) then
25.      Print "The table is full (overflow)"
26.   EndIf
27. EndIf
28. Stop
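A minimal C sketch of the same probe loop (the EMPTY sentinel, the fixed table size and the combined search/insert flag are illustrative assumptions):

#define HSIZE 10
#define EMPTY (-1)                    /* sentinel marking a free cell */

/* Linear probing: search for k; if absent and insert is nonzero,
   place k in the first free cell (with wraparound). Returns the
   index of k, or -1 on "not found" / table overflow. */
int linear_probe(int H[], int k, int insert)
{
    int start = k % HSIZE;
    int i = start;
    do {
        if (H[i] == k)
            return i;                 /* found */
        if (H[i] == EMPTY) {          /* free cell: k is not in the table */
            if (insert) { H[i] = k; return i; }
            return -1;
        }
        i = (i + 1) % HSIZE;          /* scan sequentially, wrap around */
    } while (i != start);
    return -1;                        /* the table is full (overflow) */
}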
Quadratic Probing
Quadratic probing eliminates the primary clustering problem of linear probing.
The collision function is quadratic; the popular choice is f(i) = i^2.
If the hash function evaluates to h and a search in cell h is inconclusive, we try cells h + 1^2, h + 2^2, …, h + i^2:
(H(K) + i^2) mod h, i = 1, 2, 3, …
◦ i.e. it examines cells 1, 4, 9 and so on away from the original probe.
Remember that subsequent probe points are a quadratic number of positions from the original probe point.
Figure: a quadratic probing hash table after each insertion (note that the table size was poorly chosen because it is not a prime number).
Quadratic Probing

Problem:
◦ We may not be sure that we will probe all locations in the table (i.e.
there is no guarantee to find an empty cell if table is more than half full.)
◦ If the hash table size is not prime this problem will be much severe.

However, there is a theorem stating that:


◦ If the table size is prime and load factor is not larger than 0.5, all probes
will be to different locations and an item can always be inserted.
Quadratic Probing
Load Factor

The load factor α of a hash table with n elements is


given by the following formula: α = n / table.length
Some considerations
How efficient is calculating the quadratic probes?
◦ Linear probing is easily implemented; quadratic probing appears to require * and % operations.
◦ However, by use of the following equation, this is overcome:
      Hi = Hi−1 + 2i − 1 (mod M)
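The identity holds because h + i^2 = (h + (i − 1)^2) + (2i − 1): each probe just adds the next odd number. A C sketch:

/* i-th quadratic probe from the (i-1)-th one, with no multiplication by i*i:
   H_i = H_{i-1} + 2i - 1 (mod M) */
int next_quadratic_probe(int prev, int i, int M)
{
    return (prev + 2 * i - 1) % M;
}

Starting from H_0 = h, successive calls yield h + 1, h + 4, h + 9, … (mod M).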
Some Considerations
What happens if load factor gets too high?
◦ Dynamically expand the table as soon as the load factor
reaches 0.5, which is called rehashing.
◦ Always double to a prime number.
◦ When expanding the hash table, reinsert the new table by
using the new hash function.
Rehashing
Rehashing Algorithm:
1. Allocate a new hash table twice the size of the
original table.
2. Reinsert each element of the old table into the new
table (using the hash function).
3. Reference the new table as the hash table.
Analysis of Quadratic Probing
Quadratic probing has not yet been mathematically
analyzed.

Although quadratic probing eliminates primary clustering, elements that hash to the same location will probe the same alternative cells. This is known as secondary clustering.

Techniques that eliminate secondary clustering are


available.
◦ the most popular is double hashing.
Double Hashing
Idea: Spread out the search for an empty slot by using a second hash function
◦ No primary or secondary clustering

h(k,i) = (h1(k) + i h2(k) ) mod m, i=0,1,...

A good choice of Hash2(X) can guarantee that probing does not get "stuck", as long as λ < 1.
◦ Integer keys example:
  Hash2(X) = R − (X mod R)
  where R is a prime smaller than TableSize
Double Hashing

Disadvantage: harder to delete an element


Can generate at most m^2 probe sequences
Double Hashing: Example
h1(k) = k mod 13
h2(k) = 1 + (k mod 11)
h(k, i) = (h1(k) + i·h2(k)) mod 13

Table (size 13): slot 1: 79, slot 4: 69, slot 5: 98, slot 7: 72, slot 11: 50; other slots empty.

Insert key 14:
h(14, 0) = 14 mod 13 = 1                                     (occupied by 79)
h(14, 1) = (h1(14) + h2(14)) mod 13 = (1 + 4) mod 13 = 5     (occupied by 98)
h(14, 2) = (h1(14) + 2·h2(14)) mod 13 = (1 + 8) mod 13 = 9   → 14 is stored in slot 9
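A C sketch matching this example (the table size 13 and the two hash functions are taken from the slide):

#define M 13                             /* table size (prime) */

int h1(int k) { return k % M; }
int h2(int k) { return 1 + (k % 11); }   /* never 0, so the probe advances */

/* i-th probe position for key k; for k = 14 this yields 1, 5, 9, ...
   exactly as traced above. */
int double_hash_probe(int k, int i)
{
    return (h1(k) + i * h2(k)) % M;
}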
Hashing – performance factors
1. Hash function:
   ◦ should distribute the keys and entries evenly throughout the entire table
   ◦ should minimize collisions
2. Collision resolution strategy:
   a. Open Addressing: store the key/entry in a different position
   b. Separate Chaining: chain several keys/entries in the same position
3. Table size:
   a. Too large a table will cause a wastage of memory
   b. Too small a table will cause increased collisions and eventually force rehashing (creating a new hash table of larger size and copying the contents of the current hash table into it)
   c. The size should be appropriate to the hash function used and should typically be a prime number
Applications
Keeping track of customer account information at a
bank
◦ Search through records to check balances and perform
transactions
Keep track of reservations on flights
◦ Search to find empty seats, cancel/modify reservations
Search engine
◦ Looks for all documents containing a given word
Special Case: Dictionaries
Dictionary = data structure that supports mainly two basic
operations: insert a new item and return an item with a given key
Queries: return information about the set S:
◦ Search (S, k)
◦ Minimum (S), Maximum (S)
◦ Successor (S, x), Predecessor (S, x)

Modifying operations: change the set


◦ Insert (S, k)
◦ Delete (S, k) – not very often
