0% found this document useful (0 votes)

4 views

Unit 3 Hashing

ADS

Uploaded by

tusharmhans

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views

Unit 3 Hashing

ADS

Uploaded by

tusharmhans

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 23

Unit 3 Hashing

Hashing is a technique that is used to uniquely identify a specific object from a group of similar
objects. Some examples of how hashing is used in our lives include:

• In universities, each student is assigned a unique roll number that can be used to
retrieve information about them.
• In libraries, each book is assigned a unique number that can be used to determine
information about the book, such as its exact position in the library or the users it has
been issued to etc.

In both these examples the students and books were hashed to a unique number.

Assume that you have an object and you want to assign a key to it to make searching easy.
To store the key/value pair, you can use a simple array like a data structure where keys
(integers) can be used directly as an index to store values. However, in cases where the keys
are large and cannot be used directly as an index, you should use hashing.

In hashing, large keys are converted into small keys by using hash functions. The values are
then stored in a data structure called hash table. The idea of hashing is to distribute entries
(key/value pairs) uniformly across an array. Each element is assigned a key (converted key).
By using that key you can access the element in O(1) time. Using the key, the algorithm (hash
function) computes an index that suggests where an entry can be found or inserted.

Hashing is implemented in two steps:

1. An element is converted into an integer by using a hash function. This element can be
used as an index to store the original element, which falls into the hash table.

2. The element is stored in the hash table where it can be quickly retrieved using hashed
key.

hash = hashfunc(key)
index = hash % array_size

In this method, the hash is independent of the array size and it is then reduced to an index (a
number between 0 and array_size − 1) by using the modulo operator (%).

Hash function
A hash function is any function that can be used to map a data set of an arbitrary size to a
data set of a fixed size, which falls into the hash table. The values returned by a hash function
are called hash values, hash codes, hash sums, or simply hashes.

To achieve a good hashing mechanism, It is important to have a good hash function with the
following basic requirements:

1. Easy to compute: It should be easy to compute and must not become an algorithm in
itself.

2. Uniform distribution: It should provide a uniform distribution across the hash table
and should not result in clustering.
3. Less collisions: Collisions occur when pairs of elements are mapped to the same hash
value. These should be avoided.

Note: Irrespective of how good a hash function is, collisions are bound to occur.
Therefore, to maintain the performance of a hash table, it is important to manage
collisions through various collision resolution techniques.

Need for a good hash function

Let us understand the need for a good hash function. Assume that you have to store strings
in the hash table by using the hashing technique {“abcdef”, “bcdefa”, “cdefab” , “defabc” }.

To compute the index for storing the strings, use a hash function that states the following:

The index for a specific string will be equal to the sum of the ASCII values of the characters
modulo 599.

As 599 is a prime number, it will reduce the possibility of indexing different strings
(collisions). It is recommended that you use prime numbers in case of modulo. The ASCII
values of a, b, c, d, e, and f are 97, 98, 99, 100, 101, and 102 respectively. Since all the strings
contain the same characters with different permutations, the sum will 599.

The hash function will compute the same index for all the strings and the strings will be
stored in the hash table in the following format. As the index of all the strings is the same,
you can create a list on that index and insert all the strings in that list.
Here, it will take O(n) time (where n is the number of strings) to access a specific string. This
shows that the hash function is not a good hash function.

Let’s try a different hash function. The index for a specific string will be equal to sum of ASCII
values of characters multiplied by their respective order in the string after which it is modulo
with 2069 (prime number).

String Hash function Index

abcdef (971 + 982 + 993 + 1004 + 1015 + 1026)%2069 38
bcdefa (981 + 992 + 1003 + 1014 + 1025 + 976)%2069 23
cdefab (991 + 1002 + 1013 + 1024 + 975 + 986)%2069 14
defabc (1001 + 1012 + 1023 + 974 + 985 + 996)%2069 11
Hash table
A hash table is a data structure that is used to store keys/value pairs. It uses a hash function
to compute an index into an array in which an element will be inserted or searched. By using
a good hash function, hashing can work well. Under reasonable assumptions, the average time
required to search for an element in a hash table is O(1).

Let us consider string S. You are required to count the frequency of all the characters in this
string.
string S = “ababcd”

The simplest way to do this is to iterate over all the possible characters and count their
frequency one by one. The time complexity of this approach is O(26*N) where N is the size
of the string and there are 26 possible characters.

void countFre(string S)
{
for(char c = ‘a’;c <= ‘z’;++c)
{
int frequency = 0;
for(int i = 0;i < S.length();++i)
if(S[i] == c)
frequency++;
cout << c << ‘ ‘ << frequency << endl;
}
}

Output

a2
b2
c1
d1
e0
f0
…
z0

Let us apply hashing to this problem. Take an array frequency of size 26 and hash the 26
characters with indices of the array by using the hash function. Then, iterate over the string
and increase the value in the frequency at the corresponding index for each character. The
complexity of this approach is O(N) where N is the size of the string.

int Frequency[26];

int hashFunc(char c)
{
return (c - ‘a’);
}

void countFre(string S)
{
for(int i = 0;i < S.length();++i)
{
int index = hashFunc(S[i]);
Frequency[index]++;
}
for(int i = 0;i < 26;++i)
cout << (char)(i+’a’) << ‘ ‘ << Frequency[i] << endl;
}

Output

a2
b2
c1
d1
e0
f0
…
z0

Collision resolution techniques

Separate chaining (open hashing)

Separate chaining is one of the most commonly used collision resolution techniques. It is
usually implemented using linked lists. In separate chaining, each element of the hash table
is a linked list. To store an element in the hash table you must insert it into a specific linked
list. If there is any collision (i.e. two different elements have same hash value) then store both
the elements in the same linked list.

The cost of a lookup is that of scanning the entries of the selected linked list for the required
key. If the distribution of the keys is sufficiently uniform, then the average cost of a lookup
depends only on the average number of keys per linked list. For this reason, chained hash
tables remain effective even when the number of table entries (N) is much higher than the
number of slots.

For separate chaining, the worst-case scenario is when all the entries are inserted into the
same linked list. The lookup procedure may have to scan all its entries, so the worst-case cost
is proportional to the number (N) of entries in the table.

In the following image, CodeMonk and Hashing both hash to the value 2. The linked list at
the index 2 can hold only one entry, therefore, the next entry (in this case Hashing) is linked
(attached) to the entry of CodeMonk.
Implementation of hash tables with separate chaining (open hashing)
Assumption

Hash function will return an integer from 0 to 19.

vector <string> hashTable[20];

int hashTableSize=20;

Insert

void insert(string s)
{
// Compute the index using Hash Function
int index = hashFunc(s);
// Insert the element in the linked list at the particular index
hashTable[index].push_back(s);
}

void search(string s)
{
//Compute the index by using the hash function
int index = hashFunc(s);
//Search the linked list at that specific index
for(int i = 0;i < hashTable[index].size();i++)
{
if(hashTable[index][i] == s)
{
cout << s << " is found!" << endl;
return;
}
}
cout << s << " is not found!" << endl;
}

Linear probing (open addressing or closed hashing)

In open addressing, instead of in linked lists, all entry records are stored in the array itself.
When a new entry has to be inserted, the hash index of the hashed value is computed and
then the array is examined (starting with the hashed index). If the slot at the hashed index is
unoccupied, then the entry record is inserted in slot at the hashed index else it proceeds in
some probe sequence until it finds an unoccupied slot.

The probe sequence is the sequence that is followed while traversing through entries. In
different probe sequences, you can have different intervals between successive entry slots or
probes.

When searching for an entry, the array is scanned in the same sequence until either the target
element is found or an unused slot is found. This indicates that there is no such key in the
table. The name "open addressing" refers to the fact that the location or address of the item
is not determined by its hash value.

Linear probing is when the interval between successive probes is fixed (usually to 1). Let’s
assume that the hashed index for a particular entry is index. The probing sequence for linear
probing will be:

index = index % hashTableSize

index = (index + 1) % hashTableSize
index = (index + 2) % hashTableSize
index = (index + 3) % hashTableSize

and so on…
Hash collision is resolved by open addressing with linear probing.
Since CodeMonk and Hashing are hashed to the same index i.e. 2, store Hashing at 3 as the
interval between successive probes is 1.

Implementation of hash table with linear probing

Assumption

• There are no more than 20 elements in the data set.

• Hash function will return an integer from 0 to 19.
• Data set must have unique elements.

string hashTable[21];
int hashTableSize = 21;

Insert

void insert(string s)
{
//Compute the index using the hash function
int index = hashFunc(s);
//Search for an unused slot and if the index will exceed the hashTableSize then roll back
while(hashTable[index] != "")
index = (index + 1) % hashTableSize;
hashTable[index] = s;
}
Search

void search(string s)
{
//Compute the index using the hash function
int index = hashFunc(s);
//Search for an unused slot and if the index will exceed the hashTableSize then roll
back
while(hashTable[index] != s and hashTable[index] != "")
index = (index + 1) % hashTableSize;
//Check if the element is present in the hash table
if(hashTable[index] == s)
cout << s << " is found!" << endl;
else
cout << s << " is not found!" << endl;
}

Quadratic Probing

Quadratic probing is similar to linear probing and the only difference is the interval between
successive probes or entry slots. Here, when the slot at a hashed index for an entry record is
already occupied, you must start traversing until you find an unoccupied slot. The interval
between slots is computed by adding the successive value of an arbitrary polynomial in the
original hashed index.

Let us assume that the hashed index for an entry is index and at index there is an occupied
slot. The probe sequence will be as follows:

index = index % hashTableSize

index = (index + 12) % hashTableSize
index = (index + 22) % hashTableSize
index = (index + 32) % hashTableSize

and so on…

Implementation of hash table with quadratic probing

Assumption

• There are no more than 20 elements in the data set.

• Hash function will return an integer from 0 to 19.
• Data set must have unique elements.

string hashTable[21];
int hashTableSize = 21;

Insert

void insert(string s)
{
//Compute the index using the hash function
int index = hashFunc(s);
//Search for an unused slot and if the index will exceed the hashTableSize roll back
int h = 1;
while(hashTable[index] != "")
{
index = (index + h*h) % hashTableSize;
h++;
}
hashTable[index] = s;
}

void search(string s)
{
//Compute the index using the Hash Function
int index = hashFunc(s);
//Search for an unused slot and if the index will exceed the hashTableSize roll back
int h = 1;
while(hashTable[index] != s and hashTable[index] != "")
{
index = (index + h*h) % hashTableSize;
h++;
}
//Is the element present in the hash table
if(hashTable[index] == s)
cout << s << " is found!" << endl;
else
cout << s << " is not found!" << endl;
}

Double hashing

Double hashing is similar to linear probing and the only difference is the interval between
successive probes. Here, the interval between probes is computed by using two hash functions.

Let us say that the hashed index for an entry record is an index that is computed by one
hashing function and the slot at that index is already occupied. You must start traversing in
a specific probing sequence to look for an unoccupied slot. The probing sequence will be:

index = (index + 1 * indexH) % hashTableSize;

index = (index + 2 * indexH) % hashTableSize;

and so on…

Here, indexH is the hash value that is computed by another hash function.

Implementation of hash table with double hashing

Assumption
• There are no more than 20 elements in the data set.
• Hash functions will return an integer from 0 to 19.
• Data set must have unique elements.

string hashTable[21];
int hashTableSize = 21;

Insert

void insert(string s)
{
//Compute the index using the hash function1
int index = hashFunc1(s);
int indexH = hashFunc2(s);
//Search for an unused slot and if the index exceeds the hashTableSize roll back
while(hashTable[index] != "")
index = (index + indexH) % hashTableSize;
hashTable[index] = s;
}

void search(string s)
{
//Compute the index using the hash function
int index = hashFunc1(s);
int indexH = hashFunc2(s);
//Search for an unused slot and if the index exceeds the hashTableSize roll back
while(hashTable[index] != s and hashTable[index] != "")
index = (index + indexH) % hashTableSize;
//Is the element present in the hash table
if(hashTable[index] == s)
cout << s << " is found!" << endl;
else
cout << s << " is not found!" << endl;
}

Applications

• Associative arrays: Hash tables are commonly used to implement many types of in-
memory tables. They are used to implement associative arrays (arrays whose indices
are arbitrary strings or other complicated objects).
• Database indexing: Hash tables may also be used as disk-based data structures and
database indices (such as in dbm).
• Caches: Hash tables can be used to implement caches i.e. auxiliary data tables that are
used to speed up the access to data, which is primarily stored in slower media.
• Object representation: Several dynamic languages, such as Perl, Python, JavaScript, and
Ruby use hash tables to implement objects.
• Hash Functions are used in various algorithms to make their computing faster

Collision Resolution Techniques-

• Hashing is a well-known searching technique.

• Collision occurs when hash value of the new key maps to an occupied bucket of the
hash table.
• Collision resolution techniques are classified as-

Open Addressing-

In open addressing,
• Unlikeseparate chaining, all the keys are stored inside the hash table.
• No key is stored outside the hash table.

Techniques used for open addressing are-

• Linear Probing
• QuadraticProbing
• Double Hashing

Operations in Open Addressing-

Let us discuss how operations are performed in open addressing-

Insert Operation-

• Hash function is used to compute the hash value for a key to be inserted.
• Hash value is then used as an index to store the key in the hash table.

In case of collision,
• Probing is performed until an empty bucket is found.
• Once an empty bucket is found, the key is inserted.
• Probing is performed in accordance with the technique used for open addressing.

Search Operation-

To search any particular key,

• Its hash value is obtained using the hash function used.
• Using the hash value, that bucket of the hash table is checked.
• If the required key is found, the key is searched.
• Otherwise, the subsequent buckets are checked until the required key or an empty
bucket is found.
• The empty bucket indicates that the key is not present in the hash table.

Delete Operation-

• The key is first searched and then deleted.

• After deleting the key, that particular bucket is marked as “deleted”.

NOTE-

• During insertion, the buckets marked as “deleted” are treated like any other empty
bucket.
• During searching, the search is not terminated on encountering the bucket marked as
“deleted”.
• The search terminates only after the required key or an empty bucket is found.

Open Addressing Techniques-

Techniques used for open addressing are-

1. Linear Probing-

In linear probing,
• When collision occurs, we linearly probe for the next bucket.
• We keep probing until an empty bucket is found.

Advantage-

• It is easy to compute.

Disadvantage-

• Themain problem with linear probing is clustering.

• Many consecutive elements form groups.
• Then, it takes time to search an element or to find an empty bucket.

Time Complexity-

Worst time to search an element in linear probing is O (table size).

This is because-
• Even if there is only one element present and all other elements are deleted.
• Then, “deleted” markers present in the hash table makes search the entire table.

Challenges in Linear Probing :

1. Primary Clustering: One of the problems with linear probing is Primary

clustering, many consecutive elements form groups and it starts taking time to
find a free slot or to search for an element.
2. Secondary Clustering: Secondary clustering is less severe, two records only have
the same collision chain (Probe Sequence) if their initial position is the same.

2. Quadratic Probing-

In quadratic probing,
• When collision occurs, we probe for i2‘th bucket in ith iteration.
• We keep probing until an empty bucket is found.
let hash(x) be the slot index computed using hash function.
If slot hash(x) % S is full, then we try (hash(x) + 1*1) % S
If (hash(x) + 1*1) % S is also full, then we try (hash(x) + 2*2) % S
If (hash(x) + 2*2) % S is also full, then we try (hash(x) + 3*3) % S
..................................................
..................................................
•
• If collision occurs, i.e. if key is mapped to location i and cell i is already occupied.
• Then, location i, (i+1), (i+4), (i+9), (i+16), (i+25), ………. are searched to find first
empty cell where the key is to be inserted.
Example 1:

Example 2

o Advantage:
▪ This table reduces primary clustering.
o Disadvantage:
▪ Doesn’t ensures that all cells in the table will be examined to find an
empty cell.

3. Double Hashing-

In double hashing,
• We use another hash function hash2(x) and look for i * hash2(x) bucket in ith
iteration.
• It requires more computation time as two hash functions need to be computed.

Comparison of Open Addressing Techniques-

Linear Probing Quadratic Probing Double Hashing

Primary Clustering Yes No No

Secondary Clustering Yes Yes No

Number of Probe
Sequence m m m2
(m = size of table)

Cache performance Best Lies between the two Poor

Conclusions-

• Linear Probing has the best cache performance but suffers from clustering.
• Quadratic probing lies between the two in terms of cache performance and clustering.
• Double caching has poor cache performance but no clustering.

Load Factor (α)-

Load factor (α) is defined as-

In open addressing, the value of load factor always lie between 0 and 1.

This is because-
• In open addressing, all the keys are stored inside the hash table.
• So, size of the table is always greater or at least equal to the number of keys stored in
the table.

PRACTICE PROBLEM BASED ON OPEN ADDRESSING-

Problem-

Using the hash function ‘key mod 7’, insert the following sequence of keys in the hash table-
50, 700, 76, 85, 92, 73 and 101

Use linear probing technique for collision resolution.

Solution-
The given sequence of keys will be inserted in the hash table as-
Step-01:
• Draw an empty hash table.
• For the given hash function, the possible range of hash values is [0, 6].
• So, draw an empty hash table consisting of 7 buckets as-
Step-02:

• Insert the given keys in the hash table one by one.

• The first key to be inserted in the hash table = 50.
• Bucket of the hash table to which key 50 maps = 50 mod 7 = 1.
• So, key 50 will be inserted in bucket-1 of the hash table as-

Step-03:

• The next key to be inserted in the hash table = 700.

• Bucket of the hash table to which key 700 maps = 700 mod 7 = 0.
• So, key 700 will be inserted in bucket-0 of the hash table as-
Step-04:
• The next key to be inserted in the hash table = 76.
• Bucket of the hash table to which key 76 maps = 76 mod 7 = 6.
• So, key 76 will be inserted in bucket-6 of the hash table as-

Step-05:
• The next key to be inserted in the hash table = 85.
• Bucket of the hash table to which key 85 maps = 85 mod 7 = 1.
• Since bucket-1 is already occupied, so collision occurs.
• To handle the collision, linear probing technique keeps probing linearly until an
empty bucket is found.
• The first empty bucket is bucket-2.
• So, key 85 will be inserted in bucket-2 of the hash table as-
Step-06:
• The next key to be inserted in the hash table = 92.

• Bucket of the hash table to which key 92 maps = 92 mod 7 = 1.

• Since bucket-1 is already occupied, so collision occurs.
• To handle the collision, linear probing technique keeps probing linearly until an
empty bucket is found.
• The first empty bucket is bucket-3.
• So, key 92 will be inserted in bucket-3 of the hash table as-

Step-07:
• The next key to be inserted in the hash table = 73.
• Bucket of the hash table to which key 73 maps = 73 mod 7 = 3.
• Since bucket-3 is already occupied, so collision occurs.
• To handle the collision, linear probing technique keeps probing linearly until an
empty bucket is found.
• The first empty bucket is bucket-4.
• So, key 73 will be inserted in bucket-4 of the hash table as-
Step-08:
• The next key to be inserted in the hash table = 101.
• Bucket of the hash table to which key 101 maps = 101 mod 7 = 3.
• Since bucket-3 is already occupied, so collision occurs.
• To handle the collision, linear probing technique keeps probing linearly until an
empty bucket is found.
• The first empty bucket is bucket-5.
• So, key 101 will be inserted in bucket-5 of the hash table as-

Sprint 350 Use Manual
67% (6)
Sprint 350 Use Manual
28 pages
22CS302_LM21
No ratings yet
22CS302_LM21
7 pages
Hashing
No ratings yet
Hashing
11 pages
Hashing and Indexing
No ratings yet
Hashing and Indexing
28 pages
Unit 5 Data Structure
No ratings yet
Unit 5 Data Structure
12 pages
Chapter 5_Hashing _Part1
No ratings yet
Chapter 5_Hashing _Part1
28 pages
ADI Hashing
No ratings yet
ADI Hashing
47 pages
DSA2 Chapter 5 Hashing
No ratings yet
DSA2 Chapter 5 Hashing
44 pages
Hashing in Data Structure
No ratings yet
Hashing in Data Structure
43 pages
DS Module 5 Hashing
No ratings yet
DS Module 5 Hashing
23 pages
DS - Unit 5 - Notes
No ratings yet
DS - Unit 5 - Notes
8 pages
09 Hashtable
No ratings yet
09 Hashtable
53 pages
Intro To Hashing
No ratings yet
Intro To Hashing
10 pages
CH 4
No ratings yet
CH 4
58 pages
Lecture 3.2.1 Hashing
No ratings yet
Lecture 3.2.1 Hashing
17 pages
ds 5 update
No ratings yet
ds 5 update
26 pages
Lec12-Hash-Tables-09092024-090609pm (1)
No ratings yet
Lec12-Hash-Tables-09092024-090609pm (1)
48 pages
11 Hashtable-1
No ratings yet
11 Hashtable-1
48 pages
Hashing Data Structure
No ratings yet
Hashing Data Structure
22 pages
Cse373 10 Hashing
No ratings yet
Cse373 10 Hashing
36 pages
Hashing
No ratings yet
Hashing
44 pages
Hashing
No ratings yet
Hashing
25 pages
Hashing Techniques
No ratings yet
Hashing Techniques
13 pages
DS Module-X
No ratings yet
DS Module-X
74 pages
Hash Tables
No ratings yet
Hash Tables
45 pages
Lab08 - DS - Hash Tables
No ratings yet
Lab08 - DS - Hash Tables
9 pages
8 Hashtables
No ratings yet
8 Hashtables
84 pages
Idst 2016 SA 05 Hashing
No ratings yet
Idst 2016 SA 05 Hashing
68 pages
Lecture 27 - Hashing
No ratings yet
Lecture 27 - Hashing
48 pages
Week 12 Hashing
No ratings yet
Week 12 Hashing
24 pages
Lect Hashing
No ratings yet
Lect Hashing
36 pages
Hashing in Data Structures
No ratings yet
Hashing in Data Structures
27 pages
Hashing - 1
No ratings yet
Hashing - 1
29 pages
Module 5: HASHING: Functions. The Values Are Then Stored in A Data Structure Called Hash Table
No ratings yet
Module 5: HASHING: Functions. The Values Are Then Stored in A Data Structure Called Hash Table
39 pages
Hash Tables: A Detailed Description
No ratings yet
Hash Tables: A Detailed Description
10 pages
Notes of advanced data structures
No ratings yet
Notes of advanced data structures
202 pages
Week 9_Hash Functions and Collision
No ratings yet
Week 9_Hash Functions and Collision
73 pages
UNIT V - Hashing
No ratings yet
UNIT V - Hashing
20 pages
Unit-5
No ratings yet
Unit-5
50 pages
Algo_Lec3
No ratings yet
Algo_Lec3
53 pages
Maps
No ratings yet
Maps
36 pages
Lecture 7 - Hash - Table - Direct - Adreess - Tables - Hash - Tables - Intro - Separate - Chaining
No ratings yet
Lecture 7 - Hash - Table - Direct - Adreess - Tables - Hash - Tables - Intro - Separate - Chaining
77 pages
Hashing
No ratings yet
Hashing
14 pages
Lecture 8 Hashing
No ratings yet
Lecture 8 Hashing
47 pages
Dsa 4
No ratings yet
Dsa 4
55 pages
HASHING
No ratings yet
HASHING
8 pages
Lecture Notes On Hash Tables: 15-122: Principles of Imperative Computation Frank Pfenning, Rob Simmons February 28, 2013
No ratings yet
Lecture Notes On Hash Tables: 15-122: Principles of Imperative Computation Frank Pfenning, Rob Simmons February 28, 2013
7 pages
05 Hashing
No ratings yet
05 Hashing
47 pages
Dsa Lecture 13 Hash Tables
No ratings yet
Dsa Lecture 13 Hash Tables
15 pages
Hashing Algorithms
No ratings yet
Hashing Algorithms
22 pages
Hashing
No ratings yet
Hashing
13 pages
AST20105 Data Structure and Algorithms: Chapter 9 - Hash Table
No ratings yet
AST20105 Data Structure and Algorithms: Chapter 9 - Hash Table
39 pages
Hash
No ratings yet
Hash
7 pages
Hashng Notes SVIMS
No ratings yet
Hashng Notes SVIMS
14 pages
MCA Data Structures With Algorithms 14
No ratings yet
MCA Data Structures With Algorithms 14
12 pages
Hashing
No ratings yet
Hashing
7 pages
10 Hash Table
No ratings yet
10 Hash Table
25 pages
Hash Table
No ratings yet
Hash Table
9 pages
Unit 3.4 Hashing Techniques
No ratings yet
Unit 3.4 Hashing Techniques
7 pages
Hashing
From Everand
Hashing
Prakash Hegade
No ratings yet
300+ Python Algorithms: Mastering the Art of Problem-Solving
From Everand
300+ Python Algorithms: Mastering the Art of Problem-Solving
Hernando Abella
5/5 (1)
DAA_3
No ratings yet
DAA_3
56 pages
Unit-II.pptx
No ratings yet
Unit-II.pptx
55 pages
Unit-V
No ratings yet
Unit-V
91 pages
FINAL UNIT 4 TRANSPORT LAYER
No ratings yet
FINAL UNIT 4 TRANSPORT LAYER
59 pages
UNIT-II
No ratings yet
UNIT-II
69 pages
Updated DAA Unit 1
No ratings yet
Updated DAA Unit 1
59 pages
DAA Unit2
No ratings yet
DAA Unit2
50 pages
CS-403 Software Engineering
No ratings yet
CS-403 Software Engineering
2 pages
Personal Software Process: Mohammed Ahmed Ali
No ratings yet
Personal Software Process: Mohammed Ahmed Ali
16 pages
Lecture Notes 1.3.3
No ratings yet
Lecture Notes 1.3.3
2 pages
Final Fds Ir Templates January
No ratings yet
Final Fds Ir Templates January
12 pages
Practical Work in Geography Ch-6 Gis New
No ratings yet
Practical Work in Geography Ch-6 Gis New
44 pages
Part 2
No ratings yet
Part 2
22 pages
PPOp E
No ratings yet
PPOp E
10 pages
WSN 1
No ratings yet
WSN 1
16 pages
CV-Tarun Kumar
No ratings yet
CV-Tarun Kumar
3 pages
2D2024_2687 Appellants Motion to Disqualify Circuit Judge Patricia Muscarella Due to Conflicts of Interest and to Vacate All Orders Issued in Lower Court
No ratings yet
2D2024_2687 Appellants Motion to Disqualify Circuit Judge Patricia Muscarella Due to Conflicts of Interest and to Vacate All Orders Issued in Lower Court
156 pages
1-Planning and scheduling procedures from A to Z - Planning Engineer
No ratings yet
1-Planning and scheduling procedures from A to Z - Planning Engineer
5 pages
Climate Predictability Tool (CPT) : Ousmane Ndiaye and Simon J. Mason
No ratings yet
Climate Predictability Tool (CPT) : Ousmane Ndiaye and Simon J. Mason
51 pages
Whitepaper: The Evolution of LAN Bypass Technology: Lanner's Generation One To Generation Three Bypass
No ratings yet
Whitepaper: The Evolution of LAN Bypass Technology: Lanner's Generation One To Generation Three Bypass
9 pages
2710 Making The Right Decisions Based On Quality KPI
100% (1)
2710 Making The Right Decisions Based On Quality KPI
56 pages
IPTV
No ratings yet
IPTV
11 pages
Revit - Autodesk Revit Architecture 2012
No ratings yet
Revit - Autodesk Revit Architecture 2012
19 pages
Top 5 attack combinations for Town Hall 10 in Clash of Clans (2024)
No ratings yet
Top 5 attack combinations for Town Hall 10 in Clash of Clans (2024)
1 page
Customer Ageing Logics
No ratings yet
Customer Ageing Logics
4 pages
Sartorius Combics 1 - Combics 2: Service Manual
No ratings yet
Sartorius Combics 1 - Combics 2: Service Manual
92 pages
Abbreviations GSM
No ratings yet
Abbreviations GSM
2 pages
My File
No ratings yet
My File
6 pages
Breadth-First Search (BFS) - Iterative and Recursive-Implementation
No ratings yet
Breadth-First Search (BFS) - Iterative and Recursive-Implementation
7 pages
Median Polish: Purpose
No ratings yet
Median Polish: Purpose
5 pages
USG Risk Assessment Checklist
No ratings yet
USG Risk Assessment Checklist
7 pages
OSCE 10 5 Best Practice Guide
No ratings yet
OSCE 10 5 Best Practice Guide
68 pages
Professional Experience: Avijit Das
No ratings yet
Professional Experience: Avijit Das
2 pages
RLTC Annual Report
No ratings yet
RLTC Annual Report
29 pages
12th History - Kavin Guide - TM 2023
100% (2)
12th History - Kavin Guide - TM 2023
102 pages
Multiple Regression Example (Salary Experience and Score)
No ratings yet
Multiple Regression Example (Salary Experience and Score)
4 pages

Unit 3 Hashing

Uploaded by

Unit 3 Hashing

Uploaded by

Unit 3 Hashing

Hashing is implemented in two steps:

Need for a good hash function

String Hash function Index

Collision resolution techniques

Separate chaining (open hashing)

Hash function will return an integer from 0 to 19.

vector <string> hashTable[20];

Linear probing (open addressing or closed hashing)

index = index % hashTableSize

Implementation of hash table with linear probing

• There are no more than 20 elements in the data set.

index = index % hashTableSize

Implementation of hash table with quadratic probing

• There are no more than 20 elements in the data set.

index = (index + 1 * indexH) % hashTableSize;

Implementation of hash table with double hashing

Collision Resolution Techniques-

Techniques used for open addressing are-

Operations in Open Addressing-

Let us discuss how operations are performed in open addressing-

To search any particular key,

• The key is first searched and then deleted.

Open Addressing Techniques-

Techniques used for open addressing are-

• Themain problem with linear probing is clustering.

Worst time to search an element in linear probing is O (table size).

Challenges in Linear Probing :

1. Primary Clustering: One of the problems with linear probing is Primary

Comparison of Open Addressing Techniques-

Linear Probing Quadratic Probing Double Hashing

Primary Clustering Yes No No

Secondary Clustering Yes Yes No

Cache performance Best Lies between the two Poor

Load Factor (α)-

Load factor (α) is defined as-

PRACTICE PROBLEM BASED ON OPEN ADDRESSING-

Use linear probing technique for collision resolution.

• Insert the given keys in the hash table one by one.

• The next key to be inserted in the hash table = 700.

• Bucket of the hash table to which key 92 maps = 92 mod 7 = 1.

You might also like