0% found this document useful (0 votes)

351 views27 pages

DS Lecture - 6 (Hashing)

The document discusses hashing techniques including hash functions that map keys to table locations, hash tables that store the keys, and strategies for resolving collisions when multiple keys map to the same location such as separate chaining, linear probing, quadratic probing, and double hashing which uses a secondary hash function. Hashing is used for searching, encryption, and other applications by distributing keys evenly throughout a hash table.

Uploaded by

Noman Mirza

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

351 views27 pages

DS Lecture - 6 (Hashing)

Uploaded by

Noman Mirza

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 27

HASHING

Course teacher: Moona Kanwal

1
Hashing
• Mathematical concept
– To define any number as set of numbers in
given interval
– To cut down part of number
– Used in discreet maths, e.g graph theory, set
theory
– Used in Searching technique
– Used in encryption methods

2
Hash Functions and Hash
Tables
• Hashing has 2 major components
– Hash function h
– Hash Table Data Structure of size N
• A hash function h maps keys (a identifying element of record set) to hash value or
hash key which refers to specific location in Hash table
• Example:
h(x) = x mod N
is a hash function for integer keys
• The integer h(x) is called the hash value of key x

3
Hash Functions and Hash Tables
• A hash table data structure is an array or array
type ADTof some fixed size, containing the
keys.
• An array in which records are not stored
consecutively - their place of storage is
calculated using the key and a hash function

hash array
Key index
function

4
• Hashed key: the result of applying a hash function to a
key
• Keys and entries are scattered throughout the array
• Contains the main advantages of both Arrays and Trees
• Mainly the topic of hashing depends upon the two main
factors / parts
(a) Hash Function (b) Collision Resolution
• Table Size is also an factor (miner) in Hashing, which is
0 to tablesize-1.

5
Table Size
• Hash table size
– Should be appropriate for the hash function used

– Too big will waste memory; too small will

increase collisions and may eventually force
rehashing (copying into a larger table)

6
Example
• We design a hash table for a
dictionary storing items 0 ∅
(SSN, Name), where SSN 1 025-612-0001
(social security number) is a 2 981-101-0002
nine-digit positive integer 3 ∅

• The actual data is not stored 4 451-229-0004

…
in hash table
• Pin points the location of 9997 ∅
actual data or set of data 9998 200-751-9998
9999 ∅
• Our hash table uses an array
of size N = 10,000 and the
hash function
h(x) = last four digits of x
7
Hash Function
• The mapping of keys into the table is called Hash
Function

• A hash function,
– Ideally, it should distribute keys and entries evenly
throughout the table
– It should be easy and quick to compute.
– It should minimize collisions, where the position
given by the hash function is already occupied
– It should be applicable to all objects
8
• Different types of hash functions are used for the
mapping of keys into tables.

(a) Division Method

(b) Mid-square Method
(c) Folding Method

9
1. Division Method
• Choose a number m larger than the number n of keys
in k.
• The number m is usually chosen to be a prime no.
• The hash function H is defined as,
H(k) = k(mod m) or H(k) = k(mod m) + 1
• Denotes the remainder, when k is divided by m
• 2nd formula is used when range is from 1 to m.

10
• Example:
Elements are: 3205, 7148, 2345

Table size: 0 – 99 (prime)

m = 97 (prime)

H(3205)= 4, H(7148)=67, H(2345)=17

• For 2nd formula add 1 into the remainders.

11
2. Folding Method
• The key k is partitioned into no. of parts
• Then add these parts together and ignoring the
last carry.
• One can also reverse the first part before adding
(right or left justified. Mostly right)
H(k) = k1 + k2 + ………. + kn

12
• Example:

H(3205)=32+05=37 or H(3250)=32+50=82

H(7148)=71+43=19 or H(7184)=71+84=55

H(2345)=23+45=77 or H(2354)=23+54=68

13
3. Mid-Square Method
• The key k is squared. Then the hash function H is
defined as
H(k) = l
• The l is obtained by deleting the digits from both
ends of K2.

• The same position must be used for all the keys.

14
• Example:
k: 3205 7148 2345
k2: 10272025 51093904 5499025
H(k): 72 93 99

• 4th and 5th digits have been selected. From the

right side.

15
Collision Resolution Strategies
• If two keys map on the same hash table index then we
have a collision.
• As the number of elements in the table increases, the
likelihood of a collision increases - so make the table
as large as practical
• Collisions may still happen, so we need a collision
resolution strategy

16
• Two approaches are used to resolve collisions.
(a) Separate chaining: chain together several keys/entries
in each position.
(b) Open addressing: store the key/entry in a different
position.
• Probing: If the table position given by the hashed
key is already occupied, increase the position by
some amount, until an empty position is found

17
Open Addressing

• Types of open addressing are

1. Linear Probing
2. Quadratic Probing
3. Double Hashing.

18
1. Linear Probing
• Locations are checked from the hash location k to the end
of the table and the element is placed in the first empty
slot
• If the bottom of the table is reached, checking “wraps
around” to the start of the table. Modulus is used for this
purpose
• Thus, if linear probing is used, these routines must
continue down the table until a match or empty location
is found

19
• Linear probing is guaranteed to find a slot for the
insertion if there still an empty slot in the table.
• Even though the hash table size is a prime number is
probably not an appropriate size; the size should be at
least 30% larger than the maximum number of elements
ever to be stored in the table.

• If the load factor is greater than 50% - 70% then the

time to search or to add a record will increase.

20
H(k)=h, h+1, h+2, h+3,……, h+I

• However, linear probing also tends to promote

clustering within the table.

1 2 3 4 5 6 7 8

21
2. Quadratic Probing
• Quadratic probing is a solution to the clustering
problem
– Linear probing adds 1, 2, 3, etc. to the original
hashed key
– Quadratic probing adds 12, 22, 32 etc. to the original
hashed key
• However, whereas linear probing guarantees that all
empty positions will be examined if necessary,
quadratic probing does not

22
• If the table size is prime, this will try approximately
half the table slots.
• More generally, with quadratic probing, insertion may
be impossible if the table is more than half-full!

H(k) = h, h+1, h+4, h+5, h+6,……, h+i2

23
3. Double Hashing
• 2nd hash function H’ is used to resolve the collision.
• Here H’(k) = h’ ≠ m
• Therefore we can search the locations with addresses,
H’(k) = h, h+h’, h+2h’, h+3h’,…….
• If m is prime, then this sequence access all the
locations.

24
Double Hashing
• Double hashing uses a
secondary hash function • Common choice of
d(k) and handles compression map for the
collisions by placing an secondary hash function:
item in the first available d2(k) = k mod q
cell of the series
(h + jd(k)) mod N where
for j = 0, 1, … , N − 1 – q<N
• The secondary hash – q is a prime
function d(k) cannot • The possible values for
have zero values d2(k) are
• The table size N must be 1, 2, … , q
a prime to allow probing
of all the cells
25
Example of Double Hashing
k h (k ) d (k ) Probes
• Consider a hash 18
41
5
2
9
8
5
2

table storing integer 22

44
9
5
10
5
9
5 7

keys that handles 59

32
7
6
10
4
7
6
10 0

31 5 8 5 8
collision with double 73 8 11 8 11

hashing
– N = 13
– h(k) = k mod 13
– d(k) = k mod 7
0 1 2 3 4 5 6 7 8 9 10 11 12
• Insert keys 18, 41,
22, 44, 59, 32, 31,
73, in this order 59 41 183244 8 224411
0 1 2 3 4 5 6 7 8 9 10 11 12
26
Applications of Hashing
• Compilers use hash tables to keep track of declared variables
• A hash table can be used for on-line spelling checkers — if
misspelling detection (rather than correction) is important, an entire
dictionary can be hashed and words checked in constant time
• Game playing programs use hash tables to store seen positions,
thereby saving computation time if the position is encountered
again
• Hash functions can be used to quickly check for inequality — if two
elements hash to different values they must be different

Bmo SL 2022
No ratings yet
Bmo SL 2022
45 pages
Complex Number - Module - Lakshya JEE AIR Recorded 2025
No ratings yet
Complex Number - Module - Lakshya JEE AIR Recorded 2025
74 pages
06 Small Signal Angle Stability
0% (1)
06 Small Signal Angle Stability
160 pages
DSA DSA: Questions For MAANG Interviews Questions For MAANG Interviews
No ratings yet
DSA DSA: Questions For MAANG Interviews Questions For MAANG Interviews
21 pages
DP-900 Handwritten Notes
No ratings yet
DP-900 Handwritten Notes
31 pages
Case Studies
No ratings yet
Case Studies
17 pages
Java Programming Part I
No ratings yet
Java Programming Part I
120 pages
Dynamic Programming Vs Greedy MEthod
100% (1)
Dynamic Programming Vs Greedy MEthod
33 pages
Bit Manipulation Notes by Kapil Yadav
No ratings yet
Bit Manipulation Notes by Kapil Yadav
79 pages
Unit 4 Three-Dimensional Geometric Transformation
100% (1)
Unit 4 Three-Dimensional Geometric Transformation
15 pages
NITK Placement Gyan 2014 PDF
No ratings yet
NITK Placement Gyan 2014 PDF
216 pages
Recurrent Neural Network: Dr. Sukanta Ghosh
100% (1)
Recurrent Neural Network: Dr. Sukanta Ghosh
34 pages
Complex Number PDF
No ratings yet
Complex Number PDF
30 pages
Greedy Algorithm
100% (1)
Greedy Algorithm
18 pages
Collision Resolution Techniques
No ratings yet
Collision Resolution Techniques
15 pages
DSA Cheat Sheet
No ratings yet
DSA Cheat Sheet
4 pages
Data Structure Study Notes For IBPS SO IT Officer - Team MME
No ratings yet
Data Structure Study Notes For IBPS SO IT Officer - Team MME
7 pages
"Doing" Strategy
No ratings yet
"Doing" Strategy
10 pages
Greedy Algorithm
No ratings yet
Greedy Algorithm
34 pages
Lesson Plan in Math Measurement - Compress
No ratings yet
Lesson Plan in Math Measurement - Compress
6 pages
Pre-Placements Checklist
100% (1)
Pre-Placements Checklist
9 pages
Data Structures 2
No ratings yet
Data Structures 2
82 pages
Segmentation
100% (1)
Segmentation
51 pages
Lecture 4: Divide and Conquer: Van Emde Boas Trees
No ratings yet
Lecture 4: Divide and Conquer: Van Emde Boas Trees
7 pages
Binary Tree in Java
No ratings yet
Binary Tree in Java
79 pages
DSA by Shradha Didi & Aman Bhaiya
No ratings yet
DSA by Shradha Didi & Aman Bhaiya
7 pages
Unit-4 Greedy Algorithms
No ratings yet
Unit-4 Greedy Algorithms
71 pages
Collection & Maps Question 1
100% (1)
Collection & Maps Question 1
11 pages
AL1-Project2-123-24 Assessment Learning 1
No ratings yet
AL1-Project2-123-24 Assessment Learning 1
12 pages
SSC Selection Post Phase 9 Syllabus PDF
No ratings yet
SSC Selection Post Phase 9 Syllabus PDF
5 pages
Algorithms All Sortings
No ratings yet
Algorithms All Sortings
91 pages
Unit II Requirements Elicitation
No ratings yet
Unit II Requirements Elicitation
23 pages
Study-Materials CSE 6th Cryptography-Network-Security S.-Dalai
No ratings yet
Study-Materials CSE 6th Cryptography-Network-Security S.-Dalai
63 pages
Final Comprehensive Timetable 2025
No ratings yet
Final Comprehensive Timetable 2025
8 pages
Syllabus Python Masterclass
No ratings yet
Syllabus Python Masterclass
8 pages
Van Emde Boas Tree
No ratings yet
Van Emde Boas Tree
27 pages
For HR Rounds, They Will Test Your Analytical Skills and How You Approach Towards A
No ratings yet
For HR Rounds, They Will Test Your Analytical Skills and How You Approach Towards A
3 pages
Shinozuka 1990 - Stochastic Methods in Wind Engineering PDF
No ratings yet
Shinozuka 1990 - Stochastic Methods in Wind Engineering PDF
15 pages
CIB R&a Banking Junior Analyst Academic Intern FinTech
No ratings yet
CIB R&a Banking Junior Analyst Academic Intern FinTech
2 pages
MA T102 Linear Algebra & Calculus
No ratings yet
MA T102 Linear Algebra & Calculus
4 pages
Data Structures Unit 5
No ratings yet
Data Structures Unit 5
20 pages
Algorithms 2
No ratings yet
Algorithms 2
49 pages
Compiler Design Unit 4
No ratings yet
Compiler Design Unit 4
28 pages
Logic-Ai A4
No ratings yet
Logic-Ai A4
54 pages
Circles in Coordinate Plane
No ratings yet
Circles in Coordinate Plane
12 pages
Resource and Energy Economics: Solving Optimal Timing Problems in Environmental Economics
No ratings yet
Resource and Energy Economics: Solving Optimal Timing Problems in Environmental Economics
8 pages
Algorithm To Become A Good Programmer by Ashish Kedia
No ratings yet
Algorithm To Become A Good Programmer by Ashish Kedia
2 pages
Knuth-Morris-Pratt Algorithm KENT
No ratings yet
Knuth-Morris-Pratt Algorithm KENT
4 pages
1.3 Algorithms and Convergence
No ratings yet
1.3 Algorithms and Convergence
13 pages
Stable/Patient Sort: Bit-Manipulation-To-Solve-Problems-Easily-And-Efficiently
No ratings yet
Stable/Patient Sort: Bit-Manipulation-To-Solve-Problems-Easily-And-Efficiently
2 pages
Study of Van Emde Boas Tree With Application To Dijkstra: Advanced Problem Solving
No ratings yet
Study of Van Emde Boas Tree With Application To Dijkstra: Advanced Problem Solving
16 pages
Reading Material-R1 Rocket Java
No ratings yet
Reading Material-R1 Rocket Java
2 pages
SC QB
No ratings yet
SC QB
24 pages
Week4 Assignment Solution
No ratings yet
Week4 Assignment Solution
2 pages
Pascal's Triangle: Patterns Within The Triangle
No ratings yet
Pascal's Triangle: Patterns Within The Triangle
5 pages
Japanese Math 2001
No ratings yet
Japanese Math 2001
9 pages
Morphological PCB
No ratings yet
Morphological PCB
5 pages
Euclid's Geometry-1
No ratings yet
Euclid's Geometry-1
10 pages
DSP Model Question
No ratings yet
DSP Model Question
4 pages
Hassan Raza Test
No ratings yet
Hassan Raza Test
4 pages
IronFX Case Study
No ratings yet
IronFX Case Study
18 pages
Algorithm Types and Classification
No ratings yet
Algorithm Types and Classification
5 pages
Data Warehousing and Data Mining
No ratings yet
Data Warehousing and Data Mining
7 pages
Ex 4 7 FSC Part1 M Shahid PDF
No ratings yet
Ex 4 7 FSC Part1 M Shahid PDF
3 pages
Transportation Problem Using Stepping Stone Method (Optimal Solution) Calculator
No ratings yet
Transportation Problem Using Stepping Stone Method (Optimal Solution) Calculator
3 pages
Outline and Reading: Tries 4/1/2003 9:02 AM
No ratings yet
Outline and Reading: Tries 4/1/2003 9:02 AM
3 pages
Google Technical Interview Prep - Trello
No ratings yet
Google Technical Interview Prep - Trello
4 pages
241-423 Advanced Data Structures and Algorithms: 9. Queues
No ratings yet
241-423 Advanced Data Structures and Algorithms: 9. Queues
41 pages
Name1: Name2: Name3: Name4: Exercise On Logic. Write All Answers On The Space Provided
No ratings yet
Name1: Name2: Name3: Name4: Exercise On Logic. Write All Answers On The Space Provided
2 pages
Data Structures and Algorithms Made Easy: Narasimha Karumanchi
No ratings yet
Data Structures and Algorithms Made Easy: Narasimha Karumanchi
12 pages
2.7.4 Practice - Modeling - Similarity Theorems (Practice)
No ratings yet
2.7.4 Practice - Modeling - Similarity Theorems (Practice)
4 pages
Klmas CompEng F2010 2603 T2
No ratings yet
Klmas CompEng F2010 2603 T2
5 pages
Van Emde Boas Trees
No ratings yet
Van Emde Boas Trees
5 pages
Java Lab Manual SDES
No ratings yet
Java Lab Manual SDES
41 pages
Mae101 Exercises Guide 1
No ratings yet
Mae101 Exercises Guide 1
6 pages
Dynamic Programming
No ratings yet
Dynamic Programming
110 pages
Actuarial CT3 Probability & Mathematical Statistics Sample Paper 2011 by ActuarialAnswers
No ratings yet
Actuarial CT3 Probability & Mathematical Statistics Sample Paper 2011 by ActuarialAnswers
9 pages
Recursive Mmse Estimation of Wireless Channels Based On Training Data and Structured Correlation Learning
No ratings yet
Recursive Mmse Estimation of Wireless Channels Based On Training Data and Structured Correlation Learning
6 pages
Comparative Analysis of Brute Force and Boyer Moore Algorithms in Word Suggestion Search
No ratings yet
Comparative Analysis of Brute Force and Boyer Moore Algorithms in Word Suggestion Search
5 pages
Bayesian Networks Exercise 2
No ratings yet
Bayesian Networks Exercise 2
6 pages
Priority Queues
No ratings yet
Priority Queues
6 pages
Android Application For Crop Yield Prediction and Crop Disease Detection
No ratings yet
Android Application For Crop Yield Prediction and Crop Disease Detection
4 pages
Logistic Regression Model - A Review
No ratings yet
Logistic Regression Model - A Review
5 pages
Programming & Algorithms
No ratings yet
Programming & Algorithms
5 pages
Android Training Online
No ratings yet
Android Training Online
6 pages
Segmentation and Object Recognition Using Edge Detection Techniques
No ratings yet
Segmentation and Object Recognition Using Edge Detection Techniques
9 pages
Django 1.0 Template Development
From Everand
Django 1.0 Template Development
Scott Newman
No ratings yet
Hebbian Learning: Fundamentals and Applications for Uniting Memory and Learning
From Everand
Hebbian Learning: Fundamentals and Applications for Uniting Memory and Learning
Fouad Sabry
No ratings yet
Google Cloud Platform Complete Self-Assessment Guide
From Everand
Google Cloud Platform Complete Self-Assessment Guide
Gerardus Blokdyk
1/5 (1)

DS Lecture - 6 (Hashing)

Uploaded by

DS Lecture - 6 (Hashing)

Uploaded by

HASHING

Course teacher: Moona Kanwal

– Too big will waste memory; too small will

• The actual data is not stored 4 451-229-0004

(a) Division Method

Table size: 0 – 99 (prime)

H(3205)= 4, H(7148)=67, H(2345)=17

• For 2nd formula add 1 into the remainders.

• The same position must be used for all the keys.

• 4th and 5th digits have been selected. From the

• Types of open addressing are

• If the load factor is greater than 50% - 70% then the

• However, linear probing also tends to promote

H(k) = h, h+1, h+4, h+5, h+6,……, h+i2

table storing integer 22

keys that handles 59

You might also like