0% found this document useful (0 votes)

176 views9 pages

Hash Function

Hash functions map data to hash values called keys that are used to identify and locate the data. Collisions occur when different data maps to the same key, but algorithms aim to minimize collisions. Hash tables use hash functions to map keys to array indexes to efficiently store and retrieve data based on keys. When collisions occur, linked lists or other techniques are used to store data at the same index. Extendible hashing is a dynamic hashing method that uses directories and buckets to store and retrieve data while allowing the hash function and structure to change over time to minimize collisions as more data is added.

Uploaded by

Pham Minh Long

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

176 views9 pages

Hash Function

Uploaded by

Pham Minh Long

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 9

Hash function

Hash function is an algorithm to generate hash values corresponding to each block of data (be it a
string of characters, an object in object-oriented programming, etc.). The hash value acts as a key to
distinguish data blocks, however, people accept the phenomenon of key duplication or collision and
try to improve the algorithm to minimize such collisions. Hash functions are often used in hash tables
to reduce the computational cost of finding a block of data in a set (because comparing hashes is
faster than comparing large blocks of data).

Hash table
In computing, a hash table (hash map) is a data structure that implements an associative array
abstract data type, a structure that can map keys to values. A hash table uses a hash function to
compute an index, also called a hash code, into an array of buckets or slots, from which the desired
value can be found. During lookup, the key is hashed and the resulting hash indicates where the
corresponding value is stored.

Ideally, the hash function will assign each key to a unique bucket, but most hash table designs employ
an imperfect hash function, which might cause hash collisions where the hash function generates the
same index for more than one key. Such collisions are typically accommodated in some way.

In many situations, hash tables turn out to be on average more efficient than search trees or any
other table lookup structure. For this reason, they are widely used in many kinds of computer
software, particularly for associative arrays, database indexing, caches, and sets.
The division method
The division method involves mapping a key k into one of m slots by taking the
remainder of k divided by m as expressed in the hash function

h(k) = k mod m .
For example, if the hash table has size m = 12 and the key is k = 100, then h(k) = 4.

Folding Method in Hashing

Algorithm:
 The folding method is used for creating hash functions starts with the item
being divided into equal-sized pieces i.e., the last piece may not be of equal size.
 The outcome of adding these bits together is the hash value, H(x) = (a + b +c)
mod M, where a, b, and c represent the preconditioned key broken down into
three parts and M is the table size, and mod stands for modulo
 In other words, the sum of three parts of the preconditioned key is divided by
the table size. The remainder is the hash key.
Explanation:
Example 1: The task is to fold the key 123456789 into a Hash Table of ten spaces (0
through 9).
 It is given that the key, say X is 123456789 and the table size (i.e., M = 10).
 Since it can break X into three parts in any order. Let’s divide it evenly.
 Therefore, a = 123, b = 456, c = 789.
 Now, H(x) = (a + b + c) mod M i.e., H(123456789) =(123 + 456 + 789) mod 10 =
1368 mod 10 = 8.
 Hence, 123456789 is inserted into the table at address 8.

Mid-Square hashing
Mid-Square hashing is a hashing technique in which unique keys are generated. In this technique, a
seed value is taken and it is squared. Then, some digits from the middle are extracted. These
extracted digits form a number which is taken as the new seed. This technique can generate keys
with high randomness if a big enough seed value is taken. However, it has a limitation. As the seed
is squared, if a 6-digit number is taken, then the square will have 12-digits. This exceeds the range
of int data type. So, overflow must be taken care of. In case of overflow, use long long int data type
or uses string as multiplication if overflow still occurs. The chances of a collision in mid-square
hashing are low, not obsolete. So, in the chances, if a collision occurs, it is handled using some hash
map.
Example:

Suppose a 4-digit seed is taken. seed = 4765

Hence, square of seed is = 4765 * 4765 = 22705225
Now, from this 8-digit number, any four digits are extracted (Say, the
middle four).
So, the new seed value becomes seed = 7052
Now, square of this new seed is = 7052 * 7052 = 49730704
Again, the same set of 4-digits is extracted.
So, the new seed value becomes seed = 7307

Collision
The problem arises of duplicate (index) cases if the hash algorithm is not good, and
almost no hash algorithm is really perfect to generate a unique key if storing a large
amount of data, to solve the problem. In this topic we use a linked list to store one
more layer of elements for that index.

It can be considered that generating a hash is to create a key Hash and store it in the
array of values where there is a linked list with an element containing our real key,
after we get there we will use the real key to retrieve the value.

Quadratic Probing in Hashing

Hashing is an improvement over Direct Access Table. The idea is to use a hash
function that converts a given phone number or any other key to a smaller
number and uses the small number as the index in a table called a hash table.
Hash Function: A function that converts a given big number to a small
practical integer value. The mapped integer value is used as an index in the
hash table. In simple terms, a hash function maps a big number or string to a
small integer that can be used as an index in the hash table.
In this article, the collision technique, quadratic probing is discussed.
Quadratic Probing: Quadratic probing is an open-addressing scheme where
we look for i2‘th slot in i’th iteration if the given hash value x collides in the
hash table.
How Quadratic Probing is done?
Let hash(x) be the slot index computed using the hash function.

If the slot hash(x) % S is full, then we try (hash(x) + 1*1) % S.

If (hash(x) + 1*1) % S is also full, then we try (hash(x) + 2*2) % S.
If (hash(x) + 2*2) % S is also full, then we try (hash(x) + 3*3) % S.
This process is repeated for all the values of i until an empty slot is found.
For example: Let us consider a simple hash function as “key mod 7” and
sequence of keys as 50, 700, 76, 85, 92, 73, 101.
Coalesced hashing
Coalesced hashing is a collision avoidance technique when there is a fixed sized data. It is a
combination of both Separate chaining and Open addressing . It uses the concept of Open
Addressing(linear probing) to find first empty place for colliding element from the bottom of the
hash table and the concept of Separate Chaining to link the colliding elements to each other
through pointers. The hash function used is h=(key)%(total number of keys). Inside the hash table,
each node has three fields:
 h(key): The value of hash function for a key.
 Data: The key itself.
 Next: The link to the next colliding elements.
The basic operations of Coalesced hashing are:

1. INSERT(key): The insert Operation inserts the key according to the hash value of that key if that
hash value in the table is empty otherwise the key is inserted in first empty place from the
bottom of the hash table and the address of this empty place is mapped in NEXT field of the
previous pointing node of the chain.(Explained in example below).
2. DELETE(Key): The key if present is deleted.Also if the node to be deleted contains the address of
another node in hash table then this address is mapped in the NEXT field of the node pointing to
the node which is to be deleted
3. SEARCH(key): Returns True if key is present, otherwise return False.
The best case complexity of all these operations is O(1) and the worst case complexity is O(n)
where n is the total number of keys.It is better than separate chaining because it inserts the
colliding element in the memory of hash table only instead of creating a new linked list as in
separate chaining.
Illustration:
Example:

n = 10
Input : {20, 35, 16, 40, 45, 25, 32, 37, 22, 55}
Hash function

h(key) = key%10

Hash Buckets
In computing, a hash table [hash map] is a data structure that provides virtually
direct access to objects based on a key [a unique String or Integer]. A hash table uses
a hash function to compute an index into an array of buckets or slots, from which the
desired value can be found. Here are the main features of the key used:

 The key used can be your SSN, your telephone number, account number, etc
 Must have unique keys
 Each key is associated with–mapped to–a value

Hash buckets are used to apportion data items for sorting or lookup purposes. The
aim of this work is to weaken the linked lists so that searching for a specific item can
be accessed within a shorter
Perfect Hash Functions
Consider the table in which the keys are stored using linear probing. Suppose we
delete A4 and then then try to find B4. Because when searching B we hash it to
position 4 and see that this position is empty and conclude that B4 is not found
(which is not true).

In computer science , a perfect hash function h for a set S is a hash function that

maps distinct elements in S to a set of m integers, with no collisions. In mathematical
terms, it is an injective function.
Perfect hash functions may be used to implement a lookup table with constant
worst-case access time.

Extendible Hashing
Extendible Hashing is a dynamic hashing method wherein directories, and buckets
are used to hash data. It is an aggressively flexible method in which the hash
function also experiences dynamic changes.

Main features of Extendible Hashing: The main features in this hashing technique
are:

Directories: The directories store addresses of the buckets in pointers. An id is

assigned to each directory which may change each time when Directory Expansion
takes place.
Buckets: The buckets are used to hash the actual data.
Basic Structure of Extendible Hashing:

requently used terms in Extendible Hashing :

 Directories: These containers store pointers to buckets. Each directory is given a
unique id which may change each time when expansion takes place. The hash
function returns this directory id which is used to navigate to the appropriate
bucket. Number of Directories = 2^Global Depth.
 Buckets: They store the hashed keys. Directories point to buckets. A bucket may
contain more than one pointers to it if its local depth is less than the global
depth.
 Global Depth: It is associated with the Directories. They denote the number of
bits which are used by the hash function to categorize the keys. Global Depth =
Number of bits in directory id.
 Local Depth: It is the same as that of Global Depth except for the fact that Local
Depth is associated with the buckets and not the directories. Local depth in
accordance with the global depth is used to decide the action that to be
performed in case an overflow occurs. Local Depth is always less than or equal
to the Global Depth.
 Bucket Splitting: When the number of elements in a bucket exceeds a particular
size, then the bucket is split into two parts.
 Directory Expansion: Directory Expansion Takes place when a bucket overflows.
Directory Expansion is performed when the local depth of the overflowing
bucket is equal to the global depth.
 Cryptographic Hash Functions

Artificial Intelligence by Michael Negnevitsky
0% (2)
Artificial Intelligence by Michael Negnevitsky
4 pages
C# Cheat Sheet PDF For Your Quick Reference
No ratings yet
C# Cheat Sheet PDF For Your Quick Reference
25 pages
Functional Dependencies and Normalization
No ratings yet
Functional Dependencies and Normalization
7 pages
Object Oriented Programming: Exception Handling in Java
No ratings yet
Object Oriented Programming: Exception Handling in Java
21 pages
Informed Search Algorithms in AI - Javatpoint
No ratings yet
Informed Search Algorithms in AI - Javatpoint
10 pages
Example 3-16: The Hash Table Algorithm
No ratings yet
Example 3-16: The Hash Table Algorithm
5 pages
Queue: Basic Operation On Queue
No ratings yet
Queue: Basic Operation On Queue
14 pages
IO Streams in Java
No ratings yet
IO Streams in Java
50 pages
Class Diagram Tutorial
No ratings yet
Class Diagram Tutorial
8 pages
Query Optimization
No ratings yet
Query Optimization
9 pages
Java - Input-Output
No ratings yet
Java - Input-Output
23 pages
UNIT 3 Queue
No ratings yet
UNIT 3 Queue
17 pages
Package in Java
No ratings yet
Package in Java
22 pages
Stacks: (Infix, Postfix and Prefix Expressions)
No ratings yet
Stacks: (Infix, Postfix and Prefix Expressions)
48 pages
OOP Program Answers
No ratings yet
OOP Program Answers
47 pages
OOPS
No ratings yet
OOPS
31 pages
Stacks and Queue
No ratings yet
Stacks and Queue
17 pages
19.python OOPs Concepts
No ratings yet
19.python OOPs Concepts
28 pages
Gaddis Python 4e Chapter 11
No ratings yet
Gaddis Python 4e Chapter 11
14 pages
Ds Lab Programs
No ratings yet
Ds Lab Programs
30 pages
C, C++ Questions
No ratings yet
C, C++ Questions
39 pages
Arrays 1
No ratings yet
Arrays 1
24 pages
Data Structure Unit 2 Important Questions
No ratings yet
Data Structure Unit 2 Important Questions
48 pages
Binary Search Tree Notes
No ratings yet
Binary Search Tree Notes
7 pages
Lecture 1.7 - Array Traversing Insert Delete Presentation
No ratings yet
Lecture 1.7 - Array Traversing Insert Delete Presentation
38 pages
Complete DSA Notes
No ratings yet
Complete DSA Notes
223 pages
Java String
No ratings yet
Java String
29 pages
Chapter 2 Querry Proccessing
No ratings yet
Chapter 2 Querry Proccessing
7 pages
Unit 2: Role of Lexical Analyzer
No ratings yet
Unit 2: Role of Lexical Analyzer
11 pages
Chap04 Linked List
100% (1)
Chap04 Linked List
41 pages
B+ Tree & B Tree
No ratings yet
B+ Tree & B Tree
38 pages
Exception Handling in Java
No ratings yet
Exception Handling in Java
18 pages
Python Data Types
No ratings yet
Python Data Types
23 pages
Serilization
No ratings yet
Serilization
9 pages
Coding Interview Rules
No ratings yet
Coding Interview Rules
12 pages
Compiler Design Problem Set
No ratings yet
Compiler Design Problem Set
10 pages
Programming Questions
No ratings yet
Programming Questions
16 pages
Hashing
50% (2)
Hashing
43 pages
Graph and Graph Traaversals
No ratings yet
Graph and Graph Traaversals
19 pages
TCS NQT - Coding Sheet by Arsh
No ratings yet
TCS NQT - Coding Sheet by Arsh
4 pages
Cyber Security IMP Points Short Notes
No ratings yet
Cyber Security IMP Points Short Notes
20 pages
1.4 Conversion of Infix To Postfix
No ratings yet
1.4 Conversion of Infix To Postfix
7 pages
Data Structures
No ratings yet
Data Structures
11 pages
Python Full Stack
0% (1)
Python Full Stack
6 pages
CAT Questions
No ratings yet
CAT Questions
2 pages
Scenario Based 3 (SIDHARTH K RA2011003010008)
No ratings yet
Scenario Based 3 (SIDHARTH K RA2011003010008)
6 pages
Time Complexity
No ratings yet
Time Complexity
5 pages
OO PInheritance & Polymorphism
No ratings yet
OO PInheritance & Polymorphism
32 pages
Floyd Warshall Algorithm
No ratings yet
Floyd Warshall Algorithm
19 pages
Capgemini Interview
No ratings yet
Capgemini Interview
13 pages
Unit-7 Transaction Processing
No ratings yet
Unit-7 Transaction Processing
107 pages
Graph Traversal
No ratings yet
Graph Traversal
4 pages
Unit-4 Complete Notes
No ratings yet
Unit-4 Complete Notes
30 pages
Top 30 Array Interview Questions and Answers For Programmers
No ratings yet
Top 30 Array Interview Questions and Answers For Programmers
12 pages
05 NumPy - Arrays and Vectorized Computation
No ratings yet
05 NumPy - Arrays and Vectorized Computation
47 pages
Data Structure
No ratings yet
Data Structure
81 pages
What Is Asymptotic Notation
No ratings yet
What Is Asymptotic Notation
51 pages
Data Structure 18043
No ratings yet
Data Structure 18043
125 pages
Java Reflection Complete Self-Assessment Guide
From Everand
Java Reflection Complete Self-Assessment Guide
Gerardus Blokdyk
No ratings yet
PostgreSQL 9 High Availability Cookbook
From Everand
PostgreSQL 9 High Availability Cookbook
Shaun M. Thomas
5/5 (2)
Excel 2013/2016: Get Your Hands Dirty
From Everand
Excel 2013/2016: Get Your Hands Dirty
Sam Akrasi
No ratings yet
Started On
No ratings yet
Started On
100 pages
Question Bank - BPLCK205B - IA2 - 2024-25
No ratings yet
Question Bank - BPLCK205B - IA2 - 2024-25
2 pages
Mobile Cloud Computing
No ratings yet
Mobile Cloud Computing
23 pages
Dropbox
No ratings yet
Dropbox
4 pages
Git HG Rosetta Stone
No ratings yet
Git HG Rosetta Stone
1 page
Come 301 Dbms Lecture 4 Presentation Slides
No ratings yet
Come 301 Dbms Lecture 4 Presentation Slides
25 pages
Embedded Systems & Iot
No ratings yet
Embedded Systems & Iot
9 pages
Applied Digital Signal Processing 1St Edition Manolakis Solutions Manual Full Chapter PDF
100% (20)
Applied Digital Signal Processing 1St Edition Manolakis Solutions Manual Full Chapter PDF
67 pages
50 Objectives Questions On Digital Electronics
No ratings yet
50 Objectives Questions On Digital Electronics
2 pages
CD Final PDF
No ratings yet
CD Final PDF
140 pages
AI Project
No ratings yet
AI Project
63 pages
Systems Programming - Lab3
No ratings yet
Systems Programming - Lab3
2 pages
KWB
No ratings yet
KWB
29 pages
Axon Body 3 User Manual
No ratings yet
Axon Body 3 User Manual
44 pages
1 Data Sheet Nokia 7368 ISAM ONT G-240W-C
No ratings yet
1 Data Sheet Nokia 7368 ISAM ONT G-240W-C
4 pages
IT Desktop Support
No ratings yet
IT Desktop Support
3 pages
Zaria Poly Computer Troubleshooting II Theory
No ratings yet
Zaria Poly Computer Troubleshooting II Theory
146 pages
Cygnus 850: 2-Wire G.SHDSL Modem Family
No ratings yet
Cygnus 850: 2-Wire G.SHDSL Modem Family
2 pages
CGR Microproject PDF
No ratings yet
CGR Microproject PDF
9 pages
Invitedpaper Aspdac 24
No ratings yet
Invitedpaper Aspdac 24
7 pages
Awake Security vs. Extrahop Reveal (X) : Network Traffic Analysis: Feature Comparison Guide
No ratings yet
Awake Security vs. Extrahop Reveal (X) : Network Traffic Analysis: Feature Comparison Guide
10 pages
CS411 Final Term Solved MCQs File-2 by JUNAID
No ratings yet
CS411 Final Term Solved MCQs File-2 by JUNAID
19 pages
Cartoonifying An Image: T. E. Computer Engineering
No ratings yet
Cartoonifying An Image: T. E. Computer Engineering
52 pages
Synopsys Synplify Pro For Microsemi Edition Attribute Reference Manual, December 2019
No ratings yet
Synopsys Synplify Pro For Microsemi Edition Attribute Reference Manual, December 2019
290 pages
Computer Bane or Boon
No ratings yet
Computer Bane or Boon
2 pages
Advances of Machine Learning in Materials Science: Ideas and Techniques
No ratings yet
Advances of Machine Learning in Materials Science: Ideas and Techniques
40 pages
Design of An Enterprise Networkinfrastructurefor A Company Using Cisco Packet Tracer 4e3f0bc4 C2a3 4963 8671 A6220a27a803
No ratings yet
Design of An Enterprise Networkinfrastructurefor A Company Using Cisco Packet Tracer 4e3f0bc4 C2a3 4963 8671 A6220a27a803
4 pages
Distributed Databases Introduction
100% (1)
Distributed Databases Introduction
16 pages
DAT213a MS400890MX-BS PLR-ZERT EN 0421
No ratings yet
DAT213a MS400890MX-BS PLR-ZERT EN 0421
11 pages

Hash Function

Uploaded by

Hash Function

Uploaded by

Hash function

Folding Method in Hashing

Suppose a 4-digit seed is taken. seed = 4765

Quadratic Probing in Hashing

If the slot hash(x) % S is full, then we try (hash(x) + 1*1) % S.

In computer science , a perfect hash function h for a set S is a hash function that

Directories: The directories store addresses of the buckets in pointers. An id is

requently used terms in Extendible Hashing :

You might also like