Ds-Module 5 Lecture Notes

This document discusses sorting and searching algorithms. It covers internal and external sorting techniques, including sorting by comparison (insertion sort, selection sort, merge sort) and sorting by distribution (radix sort, counting sort, hashing). Specific sorting algorithms like bubble sort, selection sort, and quicksort are explained through examples. For searching, it discusses sequential and binary search algorithms, analyzing their time complexities. Finally, it introduces hash tables and hashing techniques like division hashing for storing and retrieving data through hash keys.

Uploaded by

Leela Krishna M


MODULE 5

SORTING (Part-I)

Sorting: Sorting means arranging data in a particular order (ascending or descending).

Sorting algorithm: A sorting algorithm specifies the way to arrange data in a particular order.
Sorting techniques:
There are two types of sorting techniques:
• Internal sorting
• External sorting

Internal sorting: Sorting that is performed entirely in the computer's main memory. It is
therefore restricted to sets of data items small enough to fit in memory.
Internal sorting techniques are based on two principles:
• Sorting by comparison
• Sorting by distribution
Sorting by comparison: A data item is compared with the other items in the list in order to
find its place in the sorted list. There are four types:
• Insertion sort
• Exchange sort
• Selection sort
• Merge sort
Sorting by distribution: All items to be sorted are distributed over an auxiliary storage space
and then grouped together to get the sorted list. There are three types:
• Radix
• Counting
• Hashing
Radix: Radix sort is a non-comparative integer sorting algorithm that sorts data with integer
keys by grouping keys by the individual digits which share the same significant position and
value.
Counting: Items are sorted based on their relative counts, that is, on how many items have each
key value.
Hashing: Items are hashed into a list based on a hash function.

BUBBLE SORT:

Bubble sort is a simple comparison-based sorting algorithm in which each pair of adjacent
elements is compared and the elements are swapped if they are not in order.
Example:
Consider the elements 5, 1, 4, 2, 8. After the first pass the list is 1, 4, 2, 5, 8; after the
second pass it is 1, 2, 4, 5, 8, which is sorted.
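
The adjacent-compare-and-swap idea can be sketched in Python as follows. This is a minimal illustration, not the notes' own code: the function name bubble_sort and the early exit when a pass makes no swap are assumptions added here.

```python
def bubble_sort(items):
    """Return a sorted copy of items using bubble sort."""
    a = list(items)                      # work on a copy
    n = len(a)
    for end in range(n - 1, 0, -1):      # after each pass, a[end] is in its final place
        swapped = False
        for i in range(end):
            if a[i] > a[i + 1]:          # adjacent pair out of order
                a[i], a[i + 1] = a[i + 1], a[i]
                swapped = True
        if not swapped:                  # assumption: stop early once a pass makes no swap
            break
    return a


print(bubble_sort([5, 1, 4, 2, 8]))      # [1, 2, 4, 5, 8]
```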

SELECTION SORT:

Selection sort requires n-1 passes to sort an array of n elements. In each pass we search for the
smallest element in the unsorted part of the array and swap it into its proper place.
Straight selection sort:
The following two steps are repeated for i = 1, 2, ..., n-1:
Select: Select the smallest key among the remaining key values k_i, k_(i+1), ..., k_n.
Let the smallest key value be k_j.
Swap: Swap the two key values k_i and k_j.
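
A short Python sketch of the select and swap steps is given below. The function name selection_sort and the sample list are chosen only for illustration, and Python indices start at 0 rather than 1.

```python
def selection_sort(keys):
    """Return a sorted copy of keys using straight selection sort."""
    k = list(keys)
    n = len(k)
    for i in range(n - 1):               # n-1 passes
        j = i                            # Select: index of the smallest remaining key
        for t in range(i + 1, n):
            if k[t] < k[j]:
                j = t
        k[i], k[j] = k[j], k[i]          # Swap: move the smallest key into position i
    return k


print(selection_sort([5, 1, 4, 2, 8]))   # [1, 2, 4, 5, 8]
```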

QUICK SORT:
• Quick sort is a highly efficient sorting algorithm based on partitioning an array of
data into smaller arrays.
• A large array is partitioned into two arrays around a specified value, called the pivot:
one array holds values smaller than the pivot and the other holds values greater than
the pivot.

Quick Sort Pivot Algorithm

Based on our understanding of partitioning in quick sort, we can now write an algorithm
for it, as follows.
Step 1 − Choose the value at the highest index as the pivot
Step 2 − Take two variables, left and right, to point to the two ends of the list excluding the pivot
Step 3 − left points to the lowest index
Step 4 − right points to the highest index (excluding the pivot)
Step 5 − While the value at left is less than the pivot, move left to the right
Step 6 − While the value at right is greater than the pivot, move right to the left
Step 7 − If neither step 5 nor step 6 applies and left < right, swap the values at left and right
Step 8 − If left ≥ right, the point where they met is the pivot's final position
Quick Sort Algorithm
Using the pivot algorithm recursively, we end up with smaller and smaller partitions. Each
partition is then processed by quick sort. We define the recursive algorithm for quick sort as follows −
Step 1 − Make the value at the right-most index the pivot
Step 2 − Partition the array using the pivot value
Step 3 − Quick sort the left partition recursively
Step 4 − Quick sort the right partition recursively
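
The pivot steps and the recursive steps above can be sketched together in Python. This is one possible reading of the steps, not the only one: the names partition and quick_sort, the in-place swapping, and the handling of values equal to the pivot are assumptions made for this sketch.

```python
def partition(a, low, high):
    """Partition a[low..high] around the right-most value; return the pivot's final index."""
    pivot = a[high]
    left, right = low, high - 1
    while True:
        while left <= right and a[left] < pivot:    # move left towards the right
            left += 1
        while right >= left and a[right] > pivot:   # move right towards the left
            right -= 1
        if left >= right:                           # the two markers have met
            break
        a[left], a[right] = a[right], a[left]       # both stopped: swap and continue
        left += 1
        right -= 1
    a[left], a[high] = a[high], a[left]             # put the pivot in its final place
    return left


def quick_sort(a, low=0, high=None):
    """Sort list a in place with quick sort and return it."""
    if high is None:
        high = len(a) - 1
    if low < high:
        p = partition(a, low, high)
        quick_sort(a, low, p - 1)                   # left partition
        quick_sort(a, p + 1, high)                  # right partition
    return a


print(quick_sort([54, 26, 93, 17, 77, 31, 44, 55, 20]))
```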

Example:
Let's consider an array with the values 54, 26, 93, 17, 77, 31, 44, 55, 20.
[Figure: pictorial representation of how quick sort partitions the given array, ending with the
pivot swapped with the value at the right mark.]

Recursively quick sort both the left and right halves. Final sorted list: 17, 20, 26, 31, 44, 54, 55, 77, 93.

MODULE 5 PART II

Searching: Searching is the process of finding a value in a list of values.

List search
There are two types of list search: a) sequential (linear) search and b) binary search.
5.1 Sequential search (linear search):
• Linear search is a very simple search algorithm.
• The search starts from the beginning of the array, compares each element with the
given element, and continues until the desired element is found or the end of the array is
reached.
• Linear search is used for small or unsorted arrays.
Algorithm:
Step 1: LinearSearch(Array A of n elements, Value x)
Step 2: Set i = 1
Step 3: if (A[i] == x) then
           Print “search is successful and x is found at index i”
           stop
Step 4: else
           i = i + 1
           if (i ≤ n) then go to step 3
Step 5: else
           Print “unsuccessful”
           stop
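
The same steps can be written as a short Python function. This is a sketch under two assumptions: indices are 0-based (the steps above count from 1), and -1 is returned for an unsuccessful search instead of printing a message. The sample list is made up for illustration.

```python
def linear_search(a, x):
    """Return the 0-based index of the first element equal to x, or -1 if x is absent."""
    for i, value in enumerate(a):        # compare each element in turn
        if value == x:
            return i                     # search is successful
    return -1                            # end of the array reached: unsuccessful


sample = [7, 3, 9, 4]                    # hypothetical data
print(linear_search(sample, 9))          # 2
print(linear_search(sample, 5))          # -1
```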

5.2 Binary search:

• Binary search is a fast search algorithm with run-time complexity of O(log n).
• The binary search algorithm works on the principle of divide and conquer.
• Binary search can be used only on sorted arrays.
• Searching starts at the middle of the array:
  o Search element = middle element (search successful)
  o Search element < middle element (search in the left sub-list)
  o Search element > middle element (search in the right sub-list)

Pseudocode/Algorithm:
Step 1: BinarySearch(list, key, low, high)
Step 2: if (low ≤ high) then
           mid = floor((low + high) / 2)
Step 3:    if (list[mid] = key) then
              return mid // search success
Step 4:    else if (key < list[mid]) then
              return BinarySearch(list, key, low, mid - 1) // search left sub-list
Step 5:    else return BinarySearch(list, key, mid + 1, high) // search right sub-list
Step 6:    end if
Step 7: else return "not found" // low > high: the sub-list is empty
Step 8: end if
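
A recursive Python sketch of the pseudocode follows. The default arguments and the -1 return value for an unsuccessful search are assumptions added so the function can be called directly. The sample list is simply a sorted list chosen for illustration.

```python
def binary_search(lst, key, low=0, high=None):
    """Return the 0-based index of key in the sorted list lst, or -1 if it is absent."""
    if high is None:
        high = len(lst) - 1
    if low > high:                       # empty sub-list: unsuccessful search
        return -1
    mid = (low + high) // 2              # integer division
    if lst[mid] == key:
        return mid                       # search success
    if key < lst[mid]:
        return binary_search(lst, key, low, mid - 1)    # search left sub-list
    return binary_search(lst, key, mid + 1, high)       # search right sub-list


data = [17, 20, 26, 31, 44, 54, 55, 77, 93]
print(binary_search(data, 44))           # 4
print(binary_search(data, 18))           # -1
```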

Analysing search algorithm:

Analysing search algorithms means determining which algorithm is more efficient for searching.
Sequential search: The search starts from the beginning of the array and compares each
element with the given element until the desired element is found or the end of the
array is reached.

Time complexity of linear search:

Unsuccessful search: O(n)

Successful search:
Best case: the item is in the first location of the array = O(1)
Worst case: the item is in the last location of the array = O(n)
Average case: the item is equally likely to be in any of the n locations, so the average number
of key comparisons is (1 + 2 + … + n)/n = (n + 1)/2 = O(n)

Binary search: In binary search there is no need to search the entire list. If the target
element is greater than the middle value, only the right half of the list is searched; if the
target is less than the middle value, only the left half is searched.

Time complexity of Binary search:

Unsuccessful search: O(log₂ n)

Successful search:
Best case: no. of iterations is 1 = O(1)
Worst case: no. of iterations = O(log₂ n)
Average case: no. of iterations = O(log₂ n)

No. of items (n)    Linear (n)    Binary (log₂ n)
16                  16            4
64                  64            6
256                 256           8
1024                1024          10
16,384              16,384        14
131,072             131,072       17
262,144             262,144       18
524,288             524,288       19

From the above comparison, the binary search algorithm needs far fewer comparisons and is
more efficient than the linear search algorithm, provided the list is sorted.
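
The Binary column of the table is the number of times a list of n items can be halved before a single item remains. The small Python sketch below recomputes that column. The function name halvings is made up for this illustration.

```python
def halvings(n):
    """Count how many times n can be halved (integer division) before reaching 1."""
    steps = 0
    while n > 1:
        n //= 2
        steps += 1
    return steps


for n in (16, 64, 256, 1024, 16384, 131072, 262144, 524288):
    print(n, n, halvings(n))             # items, linear comparisons, binary halvings
```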
HASH TABLES (Part-III)
Hash tables:
• Hashing is the process of indexing and retrieving an element (data) in a data structure so
that the element can be found quickly using its hash key.
• A hash key is a value that gives the index at which the actual data is likely to be stored
in the data structure.
• The data structure used to store the data is called a hash table.
• All data values are inserted into the hash table based on their hash key values.
• The hash key value maps the data to an index in the hash table.
• The hash key is generated for every data item using a hash function.
• That means every entry in the hash table is placed according to the key value generated by
the hash function.

To achieve a good hashing mechanism, it is important to have a good hash function with the
following basic requirements:
1. Easy to compute
2. Uniform distribution
3. Few collisions

STATIC HASHING

Hashing Techniques:
The main idea behind any hashing technique is to find a one-to-one correspondence between a
key value and an index in the hash table where the key value can be placed.
Let K denote the set of key values, I the range of indices, and H the mapping function
from K to I.
The following are the hashing techniques:

1. Division method
2. Mid-square method
3. Folding method
4. Digit analysis method
Division Hash method:

The key k is divided by some number m and the remainder is used as the hash address of k.

h(k) = k mod m

This gives indices in the range 0 to m-1, so the hash table should be of size m.

This hash function distributes keys uniformly if the value of m is chosen carefully (commonly a
prime number).
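
As a small illustration of h(k) = k mod m, the Python sketch below computes the index for a few keys. The table size 13 and the sample keys are hypothetical values chosen for this example, and collisions are not handled.

```python
def division_hash(k, m):
    """Division method: the remainder of k divided by m is the hash address."""
    return k % m


TABLE_SIZE = 13                          # hypothetical table size (a prime)
for key in (54, 26, 17, 77):             # hypothetical keys
    print(key, "->", division_hash(key, TABLE_SIZE))
# 54 -> 2, 26 -> 0, 17 -> 4, 77 -> 12
```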

Folding method:

• The key K is partitioned into a number of parts, each of which has the same length as the
required address, with the possible exception of the last part.
• The parts are then added together, ignoring the final carry, to form an address.
• There are two types of folding method:
  o Fold-shift hashing
  o Fold-boundary hashing
Fold-shift hashing:

The key value is divided into parts whose size matches the size of the required address, and the
parts are added.

Example: K = 123456789 (K is divided into three equal parts, which are added)

123 + 456 + 789 = 1368

Discard the carry digit 1.
Therefore, h(K) = 368

Fold-boundary hashing:

The leftmost and rightmost parts are folded on the fixed boundary between them and the middle
part, that is, their digits are reversed before the parts are added.

Example: K = 123456789 (here, 123 and 789 are reversed)

321 + 456 + 987 = 1764

Discard the carry digit 1.

Therefore, h(K) = 764
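
Both folding variants can be reproduced with a few lines of Python. This is a sketch under assumptions: the key is treated as a decimal string, the address size defaults to 3 digits, and the helper names split_parts, fold_shift and fold_boundary are invented for the illustration.

```python
def split_parts(key, size):
    """Split the decimal digits of key into parts of `size` digits (last part may be shorter)."""
    digits = str(key)
    return [digits[i:i + size] for i in range(0, len(digits), size)]


def fold_shift(key, size=3):
    """Add the parts as they are and drop any carry beyond `size` digits."""
    total = sum(int(p) for p in split_parts(key, size))
    return total % 10 ** size


def fold_boundary(key, size=3):
    """Reverse the leftmost and rightmost parts, add all parts, and drop the carry."""
    parts = split_parts(key, size)
    parts[0] = parts[0][::-1]
    if len(parts) > 1:
        parts[-1] = parts[-1][::-1]
    total = sum(int(p) for p in parts)
    return total % 10 ** size


print(fold_shift(123456789))             # 368
print(fold_boundary(123456789))          # 764
```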

Mid-Square method:

• The key K is multiplied by itself and the address is obtained by selecting an appropriate
number of digits from the middle of the square.
• The number of digits selected depends on the size of the table.
• Example: If K = 123456, then K² = 15241383936.
• If a three-digit address is required, positions 5 to 7 (counting from the left) are chosen,
giving the address 138.
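
The mid-square example can be checked with the Python sketch below. The function name mid_square and its parameters (a 1-based start position counted from the left, and the number of digits to take) are assumptions made for this illustration.

```python
def mid_square(k, start, length):
    """Square k and return `length` digits of the square beginning at 1-based position `start`."""
    digits = str(k * k)
    return int(digits[start - 1:start - 1 + length])


print(123456 * 123456)                   # 15241383936
print(mid_square(123456, 5, 3))          # 138
```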
Digit Analysis method

• In the digit analysis method, the index is formed by extracting and then manipulating
specific digits from the key.
• For example, if our key is 1234567, we might select the digits in positions 2 through 4,
yielding 234.
• The manipulations can then take many forms:
  o Reversing the digits (432)
  o Performing a circular shift to the right (423)
  o Performing a circular shift to the left (342)
  o Swapping each pair of digits (324)
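
The extraction and the four manipulations listed above can be sketched in Python as follows. The helper names are hypothetical, digits are handled as a string, and positions are counted from 1 at the left as in the example.

```python
def extract_digits(key, first, last):
    """Return the digits of key in 1-based positions first..last as a string."""
    return str(key)[first - 1:last]


def reverse(d):
    return d[::-1]


def rotate_right(d):
    return d[-1] + d[:-1]                # last digit moves to the front


def rotate_left(d):
    return d[1:] + d[0]                  # first digit moves to the end


def swap_pairs(d):
    chars = list(d)
    for i in range(0, len(chars) - 1, 2):        # swap positions (0,1), (2,3), ...
        chars[i], chars[i + 1] = chars[i + 1], chars[i]
    return "".join(chars)


d = extract_digits(1234567, 2, 4)
print(d, reverse(d), rotate_right(d), rotate_left(d), swap_pairs(d))
# 234 432 423 342 324
```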

DYNAMIC HASHING:

Motivation for Dynamic Hashing:

Traditional hashing schemes as described in the previous section are not ideal. This follows
from the fact that one must statically allocate a portion of memory to hold the hash table.
This hash table is used to point to the pages used to hold identifiers, or it may actually
hold the identifiers themselves. In either case, if the table is allocated to be as large as
possible, then space can be wasted. If it is allocated to be too small, then when the data
exceed the capacity of the hash table, the entire file must be restructured, a time-consuming
process. The purpose of dynamic hashing (also referred to as extendible hashing) is to retain
the fast retrieval time of conventional hashing while extending the technique so that it can
accommodate dynamically increasing and decreasing file size without penalty.
