Hashing

Hashing Notes

Uploaded by

azharullahkhan1405

Hashing

General Idea
• Hashing supports search in O(1) time in the ideal case.
• The ideal hash table structure is merely an array of some fixed size,
containing the items.
• A stored item needs to have a data member, called key, that will be used in
computing the index value for the item.
• The key could be an integer, a string, etc.
▪ e.g. a name or ID that is part of a large employee structure
• The size of the array is TableSize.
• The items that are stored in the hash table are indexed by values from 0 to
TableSize – 1.
• Each key is mapped into some number in the range 0 to TableSize – 1.
• The mapping is called a hash function.
Example Hash Table
Items (key, value): john 25000, phil 31250, dave 27500, mary 28200
The hash function maps each key to an index in the table:
Index  Item
0
1
2
3      john 25000
4      phil 31250
5
6      dave 27500
7      mary 28200
8
9
Hash Function
• Function H from set of K keys to set of L memory locations
▪ H:K→L
• The hash function:
▪ must be simple to compute.
▪ must distribute the keys evenly among the cells.
• If we knew in advance which keys will occur, we could write a perfect
(collision-free) hash function, but in general we do not.
• Problems:
▪ Keys may not be numeric.
▪ Number of possible keys is much larger than the space available in
table.
• Different keys may map into the same location.
▪ The hash function is not one-to-one => collision.
▪ If there are too many collisions, the performance of the hash table will
degrade.
Some popular hash functions

• Division Method
• Midsquare Method
• Folding Method
Division Method

• h(k) = k mod M
• It is generally best to choose M to be a prime number, because a prime M
makes it more likely that the keys are spread uniformly over the output
range of values.
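
A minimal sketch of the division method, assuming integer keys. The function name division_hash and the sample keys are illustrative and not part of the original notes. It shows how keys that share a factor with M pile up in one cell, while a prime M spreads the same keys out.

```python
def division_hash(key: int, table_size: int) -> int:
    """Division method: h(k) = k mod M, where M is the table size."""
    return key % table_size


if __name__ == "__main__":
    # Hypothetical keys that are all multiples of 10.
    keys = [10, 20, 30, 40, 50, 60]

    # M = 10 shares a factor with every key, so all keys collide in cell 0.
    print([division_hash(k, 10) for k in keys])

    # M = 11 is prime, so the same keys land in distinct cells.
    print([division_hash(k, 11) for k in keys])
```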
Midsquare Method
• Step 1: Square the value of the key, i.e. compute k^2.
• Step 2: Extract the middle r digits of the result obtained in Step 1,
where r is the number of digits needed to address a location in the table.
Example: Calculate the hash value for keys 1234 and 5642 using the
midsquare method. The hash table has 100 memory locations.

Note that the hash table has 100 memory locations whose indices vary from
0 to 99. This means only two digits are needed to map the key to a location
in the hash table, so r = 2.

When k = 1234, k^2 = 1522756, h(k) = 27

When k = 5642, k^2 = 31832164, h(k) = 21

Observe that the 3rd and 4th digits, counting from the right, are chosen.
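
The worked example can be checked with a short sketch. The function name mid_square_hash and the skip parameter (how many low-order decimal digits to drop before taking r digits) are assumptions made for illustration; the notes only fix the choice of the 3rd and 4th digits from the right for a table of 100 locations.

```python
def mid_square_hash(key: int, r: int = 2, skip: int = 2) -> int:
    """Square the key, drop `skip` low-order decimal digits, keep the next r digits."""
    squared = key * key
    return (squared // 10 ** skip) % 10 ** r


if __name__ == "__main__":
    for k in (1234, 5642):
        print(k, k * k, mid_square_hash(k))
```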
Folding Method
• The folding method works in two steps.
• Step 1: Divide the key value into a number of parts. That is, divide k into
parts k1, k2, …, kn, where each part has the same number of digits except
the last part, which may have fewer digits than the other parts.
• Step 2: Add the individual parts, i.e. obtain the sum k1 + k2 + … + kn.
The hash value is produced by ignoring the last carry, if any.
• Note that the number of digits in each part of the key depends on the size
of the hash table. For example, a hash table of size 1000 has 1000
locations. To address these 1000 locations we need at least three digits,
so each part of the key must have three digits, except the last part, which
may have fewer.
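
The two folding steps can be sketched as follows, assuming a non-negative integer key split from the left into decimal chunks. The function name folding_hash and the sample keys are hypothetical; "ignoring the last carry" is implemented here as keeping only the low-order part_digits digits of the sum.

```python
def folding_hash(key: int, part_digits: int) -> int:
    """Fold a non-negative key into parts of `part_digits` digits and add them."""
    digits = str(key)
    # Step 1: split into equal-sized parts; the last part may be shorter.
    parts = [int(digits[i:i + part_digits])
             for i in range(0, len(digits), part_digits)]
    # Step 2: add the parts and ignore any carry beyond `part_digits` digits.
    return sum(parts) % 10 ** part_digits


if __name__ == "__main__":
    # Table of size 100, so each part has two digits.
    print(folding_hash(5678, 2))   # parts 56 and 78, sum 134, carry dropped
    print(folding_hash(34567, 2))  # parts 34, 56 and 7
```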
Collision Resolution

• A collision occurs when the hash function maps two different keys to the
same location.
• Methods for handling collision:
• Open addressing
▪ Linear Probing
▪ Quadratic Probing
▪ Double Hashing
• Chaining
Collision Resolution with Open Addressing
• In an open addressing hashing system, all the data go inside the table.
– Thus, a bigger table is needed.
– Generally the load factor (the fraction of cells that are occupied)
should be below 0.5.
• If a collision occurs, alternative cells are tried until an empty cell is
found.
• More formally:
– Cells h_0(x), h_1(x), h_2(x), … are tried in succession, where
h_i(x) = (hash(x) + f(i)) mod TableSize, with f(0) = 0.
– The function f is the collision resolution strategy.
• There are three common collision resolution strategies:
– Linear probing
– Quadratic probing
– Double hashing
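
The probe formula can be sketched as a generator that takes the strategy f as a parameter. This is only an illustration: the name probe_sequence is made up, and the primary hash is assumed to be hash(x) = x mod TableSize.

```python
from typing import Callable, Iterator


def probe_sequence(key: int, table_size: int,
                   f: Callable[[int], int]) -> Iterator[int]:
    """Yield h_0(x), h_1(x), ... with h_i(x) = (hash(x) + f(i)) mod TableSize."""
    home = key % table_size  # assumed primary hash function
    for i in range(table_size):
        yield (home + f(i)) % table_size


if __name__ == "__main__":
    # f(i) = i gives a linear probe sequence, f(i) = i^2 a quadratic one.
    print(list(probe_sequence(49, 10, lambda i: i))[:4])
    print(list(probe_sequence(49, 10, lambda i: i * i))[:4])
```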
Linear Probing

• In linear probing, collisions are resolved by sequentially scanning an
array (with wraparound) until an empty cell is found.
▪ i.e. f is a linear function of i, typically f(i) = i.
• Example:
▪ Insert items with keys: 89, 18, 49, 58, 9 into an empty hash table.
▪ Table size is 10.
▪ Hash function is hash(x) = x mod 10.
▪ f(i) = i
Figure: linear probing hash table after each insertion.
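
The example insertions can be replayed with a short sketch. The function name linear_probe_insert is an assumption; the keys, table size and hash function are the ones given in the example.

```python
from typing import List, Optional


def linear_probe_insert(table: List[Optional[int]], key: int) -> int:
    """Insert key with f(i) = i and return the index where it was stored."""
    size = len(table)
    home = key % size  # hash(x) = x mod TableSize
    for i in range(size):
        index = (home + i) % size  # scan forward with wraparound
        if table[index] is None:
            table[index] = key
            return index
    raise RuntimeError("hash table is full")


if __name__ == "__main__":
    table: List[Optional[int]] = [None] * 10
    for key in (89, 18, 49, 58, 9):
        index = linear_probe_insert(table, key)
        print(f"insert {key:2d} -> cell {index}: {table}")
```

Running it shows 49, 58 and 9 all being pushed past their home cells into cells 0, 1 and 2.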
Find and Delete

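• The find algorithm follows the same probe sequence as the insert
algorithm.
▪ A find for 58 would involve 4 probes (cells 8, 9, 0, 1).
▪ A find for 19 would involve 5 probes (cells 9, 0, 1, 2, then the empty
cell 3).
• We must use lazy deletion (i.e. marking items as deleted).
▪ Standard deletion (i.e. physically removing the item) cannot be
performed, because emptying a cell cuts the probe sequence of every key
that was stored after colliding at that cell.
▪ e.g. if 89 were physically removed from the hash table, a later find for
49 would reach the now-empty cell 9 and wrongly report that 49 is absent.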
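
A minimal sketch of lazy deletion on a linear probing table, assuming a sentinel object marks deleted cells. The names EMPTY, DELETED, insert, find and delete are illustrative, and insert does not check for duplicate keys.

```python
from typing import List, Optional

EMPTY = None
DELETED = object()  # tombstone: the cell was occupied and later deleted


def insert(table: List[object], key: int) -> int:
    """Store key in the first empty or deleted cell of its probe sequence."""
    size = len(table)
    for i in range(size):
        index = (key % size + i) % size
        if table[index] is EMPTY or table[index] is DELETED:
            table[index] = key
            return index
    raise RuntimeError("hash table is full")


def find(table: List[object], key: int) -> Optional[int]:
    """Return the index of key, or None. Tombstones do not stop the search."""
    size = len(table)
    for i in range(size):
        index = (key % size + i) % size
        slot = table[index]
        if slot is EMPTY:
            return None  # a truly empty cell ends the probe sequence
        if slot is not DELETED and slot == key:
            return index
    return None


def delete(table: List[object], key: int) -> bool:
    """Mark the cell holding key as deleted instead of emptying it."""
    index = find(table, key)
    if index is None:
        return False
    table[index] = DELETED
    return True


if __name__ == "__main__":
    table: List[object] = [EMPTY] * 10
    for key in (89, 18, 49, 58, 9):
        insert(table, key)
    delete(table, 89)
    # 49, 58 and 9 are still reachable through the tombstone in cell 9.
    print(find(table, 49), find(table, 58), find(table, 9), find(table, 89))
```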
Clustering Problem

• As long as the table is big enough, a free cell can always be found, but
the time to do so can get quite large.
• Worse, even if the table is relatively empty, blocks of occupied cells
start forming.
• This effect is known as primary clustering.
• Any key that hashes into the cluster will require several attempts to
resolve the collision, and then it will add to the cluster.
Quadratic Probing

• Quadratic probing eliminates the primary clustering problem of linear
probing.
• The collision function is quadratic.
▪ The popular choice is f(i) = i^2.
• If the hash function evaluates to h and a search in cell h is
inconclusive, we try cells h + 1^2, h + 2^2, …, h + i^2.
▪ i.e. it examines cells 1, 4, 9 and so on away from the original probe.
• Remember that subsequent probe points are a quadratic number of positions
from the original probe point.
Figure: a quadratic probing hash table after each insertion (note that the
table size was poorly chosen because it is not a prime number).
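
A sketch of quadratic probing with f(i) = i^2. It assumes the same keys (89, 18, 49, 58, 9), table size 10 and hash(x) = x mod 10 as the linear probing example; the function name quadratic_probe_insert is made up for illustration.

```python
from typing import List, Optional


def quadratic_probe_insert(table: List[Optional[int]], key: int) -> int:
    """Insert key with f(i) = i^2 and return the index where it was stored."""
    size = len(table)
    home = key % size
    for i in range(size):
        index = (home + i * i) % size  # cells 1, 4, 9, ... away from home
        if table[index] is None:
            table[index] = key
            return index
    # Unlike linear probing, the sequence may miss empty cells entirely.
    raise RuntimeError("no empty cell found along the probe sequence")


if __name__ == "__main__":
    table: List[Optional[int]] = [None] * 10
    for key in (89, 18, 49, 58, 9):
        index = quadratic_probe_insert(table, key)
        print(f"insert {key:2d} -> cell {index}: {table}")
```

Compared with linear probing, 58 and 9 jump to cells 2 and 3 instead of extending the run of occupied cells at 0, 1, 2.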
Double Hashing

• A second hash function is used to drive the collision resolution.
▪ f(i) = i * hash2(x)
• We apply a second hash function to x and probe at distances hash2(x),
2*hash2(x), … and so on.
• The function hash2(x) must never evaluate to zero.
▪ e.g. Let hash2(x) = x mod 9 and try to insert 99 in the previous
example: 99 mod 9 = 0, so every probe lands on the same occupied cell and
the insertion never gets past the collision.
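
A sketch of double hashing. The second hash function used here, hash2(x) = R - (x mod R) with R = 7, is a common textbook choice and an assumption on my part, not something the notes specify; it is picked because it can never return zero. The keys and table size are borrowed from the linear probing example.

```python
from typing import List, Optional


def hash2(key: int, r: int = 7) -> int:
    """Assumed second hash: R - (x mod R), always in the range 1..R."""
    return r - (key % r)


def double_hash_insert(table: List[Optional[int]], key: int) -> int:
    """Insert key with f(i) = i * hash2(x) and return the index used."""
    size = len(table)
    home = key % size
    step = hash2(key)
    for i in range(size):
        index = (home + i * step) % size
        if table[index] is None:
            table[index] = key
            return index
    # With a non-prime table size the sequence may revisit the same few cells.
    raise RuntimeError("no empty cell found along the probe sequence")


if __name__ == "__main__":
    print(99 % 9)  # the bad choice x mod 9 gives a step of zero for key 99
    table: List[Optional[int]] = [None] * 10
    for key in (89, 18, 49, 58, 9):
        index = double_hash_insert(table, key)
        print(f"insert {key:2d} -> cell {index}: {table}")
```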
Chaining
• The idea is to keep a list of all elements that hash
to the same value.
– The array elements are pointers to the first nodes of the
lists.
– A new item is inserted at the front of the list.
• Advantages:
– Better space utilization for large items.
– Simple collision handling: searching linked list.
– Overflow: we can store more items than the hash table
size.
– Deletion is quick and easy: deletion from the linked list.
Example
Keys: 0, 1, 4, 9, 16, 25, 36, 49, 64, 81
hash(key) = key % 10.
Index  Chain
0      0
1      81 -> 1
2
3
4      64 -> 4
5      25
6      36 -> 16
7
8
9      49 -> 9
Operations
• Initialization: all entries are set to NULL
• Find:
– Locate the cell using hash function.
– Sequential search on the linked list in that cell.
• Insertion:
– Locate the cell using hash function.
– (If the item does not exist) insert it as the first item in
the list.
• Deletion:
– Locate the cell using hash function.
– Delete the item from the linked list.
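
The four operations can be sketched with a small singly linked list per cell. The class and method names (Node, ChainedHashTable, chain) are assumptions made for illustration; the keys and hash function come from the chaining example.

```python
from typing import List, Optional


class Node:
    def __init__(self, key: int, next_node: Optional["Node"] = None) -> None:
        self.key = key
        self.next = next_node


class ChainedHashTable:
    def __init__(self, table_size: int = 10) -> None:
        # Initialization: every entry starts as an empty list (None).
        self.heads: List[Optional[Node]] = [None] * table_size

    def _index(self, key: int) -> int:
        return key % len(self.heads)

    def find(self, key: int) -> bool:
        node = self.heads[self._index(key)]
        while node is not None:  # sequential search in the chain
            if node.key == key:
                return True
            node = node.next
        return False

    def insert(self, key: int) -> None:
        if self.find(key):
            return  # item already present
        i = self._index(key)
        self.heads[i] = Node(key, self.heads[i])  # insert at the front

    def delete(self, key: int) -> bool:
        i = self._index(key)
        prev, node = None, self.heads[i]
        while node is not None:
            if node.key == key:
                if prev is None:
                    self.heads[i] = node.next
                else:
                    prev.next = node.next
                return True
            prev, node = node, node.next
        return False

    def chain(self, index: int) -> List[int]:
        """Return the keys stored in one cell, front of the list first."""
        keys, node = [], self.heads[index]
        while node is not None:
            keys.append(node.key)
            node = node.next
        return keys


if __name__ == "__main__":
    table = ChainedHashTable(10)
    for key in (0, 1, 4, 9, 16, 25, 36, 49, 64, 81):
        table.insert(key)
    for i in range(10):
        print(i, table.chain(i))
```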
