0% found this document useful (0 votes)

84 views42 pages

Group 15 Hash Tables

The document discusses hash tables and hashing. It defines a hash table as a data structure used to store information by mapping keys to values. It describes how hashing works by using a hash function to convert a key into an integer index in an array where the associated value can be stored or retrieved. The document discusses factors that affect hash table design like hashing functions, table size, and collision handling schemes like separate chaining and open addressing. It provides examples of operations like insertion and different techniques to resolve collisions.

Uploaded by

reagan oloya

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

84 views42 pages

Group 15 Hash Tables

Uploaded by

reagan oloya

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 42

HASH TABLES

NAMES
REG NO.
TUMITHO STEVEN 20/U/2871/GIT
KITARA DANIEL 20/U/0835/GIK/PS
TUKASHABA DICKENS 20/2475/GIT
HASHING

• A Hash table is data structures used to store

information.
• OR
• Hashing is an array of some fixed size number,
usually a prime number.
INTRODUCTION TO HASH
• Key: Paul
TABLES
0 1 2 3
20 60 9

• Value:Age

• Hash(“key”) = index
• 9 Hash(“Paul”) = 3
• 20 Hash( “Tina”) = 0
• 60 Hash(“Somebody”) = 2
Definition cont’d……….
Hash Table Data Structure : Purpose

• To support insertion, deletion and search in

average - case constant time
• Assumption: Order of elements irrelevant .
• Hash[ “string key”] ==> integer value
HASH TABLE OPERATIONS
INSERT OPERATION
• int y =(“ Dickson”);
0 y is now 2

1 Ben int y = 2;

2 Dickson Dickson
int x = (“Aidah”) ;
3
x is now 5
4
int x = 5;
5 Aidah
int z =(“Ben”);
z now 1
FACTORS AFFECTING
HASH TABLE DESIGN
• Hashing functions
• Table size- its usually fixed at the start
• Collision handling scheme- This normally occur
when two or more keys maps to the same array
index.
HASH FUNCTIONS

• Hash functions :These are mathematical operations

through which data is run in order to be inserted
into or obtained from the hash tables.
• How to define a function to be used
• Use only the data being hashed
• Use all the data being hashed
• Uniformly distribute the data
HASH FUNCTIONS

Division Based Hash Functions

The division method uses the modulus operation (%),
which is actually a form of division both conceptually
and operationally. In the division method, the hash
function is of the form
• h(x) = x % M
Collision
• Collision: when two keys map to the same location in the hash table.
QN How does collision come to happen in hash tables
A collision occurs when two pieces of data, when run through the hash function
they yield the same hash code
int y = (“Dickson”);
y is now 2
int y =2;
Dickson
int x = (“Ayda”)
x is now 2
int x =2
Ayda
Collision cont’d
TECHNIQUES FOR
RESOLVING COLLISION
Separate Chaining
• In separate chaining, the hash table is an array of linked lists,
with all keys that hash to the same location in the same list.
New keys are inserted in the front of the list. In other words,
each hash table entry is a pointer to a list of keys and their
associated data items.
• To insert a key into the table, the hash table index is
computed, and then the list is searched to see if the key is
already in the table. If it is not, it is inserted at the head of the
list.
Chaining cont’d

• In the worst case, this requires a search of the

entire list. On average, half of the list is searched
on each insertion. It is not worth keeping the list in
sorted order if it is short.
Chaining cont’d
Chaining cont’d…….
Open Addressing

• In open addressing, there are no separate lists attached to the

table. All values are in the table itself. When a collision occurs,
the cells of the hash table itself are searched until an empty one is
found. Which cells are searched depends upon the specific
method of open addressing. All variations can be described
generically by a sequence of functions
• h0(x) = h(x) + f(0; x) % M
• h1(x) = h(x) + f(1; x) % M
• :::
• hk(x) = h(x) + f(k; x) % M
Open addressing cont’d

• where hi(x) is the i^th location tested and f(i; x) is a function that
returns some integer value based on the values of i and x. The idea
is that the hash function h(x) is first used to find a location in the
hash table for x. If we are trying to insert x into the table, and the
index h(x) is empty, we insert it there. Otherwise we need to
search for another place in the table into which we can store x. The
function f(i; x), called the collision resolution function, serves that
purpose. We search the locations until either an empty cell is found
or the search returns to a cell previously visited in the sequence.
The function f(i; x) need not depend on both i and x.
Open addressing cont’d
• h(x) + f(0; x) % M
• h(x) + f(1; x) % M
• h(x) + f(2; x) % M
• :::
• h(x) + f(k; x)% M
• To search for an item, the same collision resolution function is used. The
hash function is applied to find the first index. If the key is there, the
search stops. Otherwise, the table is searched until either the item is
found or an empty cell is reached. If an empty cell is reached, it implies
that the item is not in the table. This raises a question about how to delete
items. If an item is deleted,
Open addressing cont’d

then there will be no way to tell whether the search should

stop when it reaches an empty cell,
or just jump over the hole. The way around this problem is to
lazy deletion. In lazy deletion, the cell is marked DELETED.
Only when it is needed again is it re-used. Every cell is
marked as either
• ACTIVE: it has an item in it
• EMPTY: it has no item in it
• DELETED: it had an item that was deleted it can be re-used
Different collision control
methods using open Addressing
• linear probing.
• quadratic probing.
• double hashing.
Linear Probing

• In linear probing, the collision resolution function, f(i; x), is a

linear function that ignores the value of x, i.e.., f(i) = a i + b.
In the simplest case, a = 1 and b = 0, so that f(i; x) = i and
hi(x) = (h(x)+i)%M. In other words, consecutive locations in
the hash table are probed, treating the table like a circular list.
• Example 1. Consider a hash table of size 10 with the simple
division hash function h(x) = x % 10 and suppose we insert
the sequence of keys, 5; 15; 6; 3; 27; 8 In principle, only one
collision should occur: 5 and 15 because they both map to the
location 5.
Linear probing cont’d

• But linear probing causes many more collisions.

After inserting 5, 15 causes a collision. It is placed
in H[6]. Then 6 has a collision at H[6] and is
placed in H[7]. 3 gets placed without a collision,
but 27 collides with 6 and is placed in H[8]. This
causes 8 to collide, and it is placed in H[9]. Figure
shows the state of the hash table after each
insertion.
Linear probing cont’d
Linear probing cont’d
LINEAR PROBING HASH TABLE
AFTER EACH INSERTION

• Hash(89,10) = 9
• Hash(18,10) =8
• Hash(49,10) = 9
• Hash(58,10) = 8
TABLE

0 49 49 49
1 58 58
2 9
3
4
5
6
7
8 18 18 18 18
9 89 89 89 89 89
Quadratic Probing

• Quadratic probing eliminates clustering. In quadratic

probing the collision resolution function is a quadratic
function of i and does not depend on x, namely f(i; x) =
i2. In other words, when a collision occurs, the
successive locations to be probed are at a distance
(modulo table size) of 1; 4; 9; 16; 25; 26; 49, and so on.
The sequence of successive locations is denoted by the
equations
• h0 = h(x)
• hi = (h0 + i2) % M
Quadratic probing cont’d
Double Hashing

• In double hashing, the sequence of probes is a linear sequence with

an increment obtained by applying a second hash function to the
key:
• f(i) = i * hash2(x);
• We search locations hash(x) + i*hash2(x) for i = 1,2,3,. . .
• The choice of the second hash function can be disastrous it should
never evaluate to a factor of the table size, obviously. It should be
relatively prime to table size. It should never evaluate to 0 either.
Choosing
• hash2(x) = R- (x % R)
• it work well if R is a small prime number.
Collision cont’d

Advantages of open addressing over chaining

-No need for linked list structures
Disadvantages of open addressing over chaining
-Slower insertion, May need several attempts to find
an empty slot
-Table needs to be bigger (than chaining-based table)
to achieve average-case constant time performance
Rehashing

• If the hash table gets too full it should be resized. The best
way to resize it is to create a new hash table about twice as
large and hash all of the elements of the hash table into the
new table using its hash function.
• Rehashing is expensive, so it should only be done when
necessary:
• 1. When an insertion fails, or
• 2. When the table gets half full, or
• 3. When the table load factor reaches some predefined value.
Load factor
• Load factor λ of a hash table T is defined as follows:
• N = number of elements in T (“current size”)
• M = size of T(“table size”)
• λ= N/M(“ load factor”)
• i.e., λ is the average length of a chain
• Unsuccessful search time: O(λ)
• -Same for insert time
• Successful search time: O(λ/2)
• Ideally, want λ≤ 1 (not a function of N)
implementation

void clear( )
Resets and empties the hash table.

Object clone( )
Returns a duplicate of the invoking object.

boolean contains(Object value)

Returns true if some value equal to the value exists within the hash table.
Returns false if the value isn't found.
boolean containsKey(Object key)
Returns true if some key equal to the key exists within the hash table. Returns
false if the key isn't found.

boolean containsValue(Object value)

Returns true if some value equal to the value exists within the hash table.
Returns false if the value isn't found.

Enumeration elements( )
Returns an enumeration of the values contained in the hash table.
Object get(Object key)
Returns the object that contains the value associated with the key. If the key
is not in the hash table, a null object is returned.

boolean isEmpty( )
Returns true if the hash table is empty; returns false if it contains at least one
key.

Enumeration keys( )
Returns an enumeration of the keys contained in the hash table.
Object put(Object key, Object value)
Inserts a key and a value into the hash table. Returns null if the key isn't
already in the hash table; returns the previous value associated with the key if
the key is already in the hash table.

void rehash( )
Increases the size of the hash table and rehashes all of its keys.
Object remove(Object key)
Removes the key and its value. Returns the value associated with the key. If
the key is not in the hash table, a null object is returned.

int size( )
Returns the number of entries in the hash table.

String toString( )
Returns the string equivalent of a hash table.
IMPLEMENTATION
• import java.util.*;
• public class HashTable{

• public static void main(String args[]) {

• // Create a hash map
• Hashtable balance = new Hashtable();
• Enumeration names;
• String str;
• bal;
IMPLEMENTATION cont’d
• balance.put(“Dickson", new Double(1.5));
• balance.put(“Aidah", new Double(2.5));
• balance.put(“Mary", new Double(3.5));
• balance.put(“Ben", new Double(4.5));
• balance.put(“Hatimah", new Double(5.5));

• // Show all balances in hash table.

• names = balance.keys();
•
IMPLEMENTATION CONT’d

•
• // Deposit 1,000 into Dickson’s account
• bal = ((Double)balance.get(“Dickson")).doubleValue();
• balance.put(“Dickson", new Double(bal + 1000));
• System.out.println(
• “Dickson's new balance: " + balance.get(“Dickson"));
• }
• }
References

• Guttag,J.V(2015). Introduction to computation and

programming using python.London,UK,Dunches.
• Szymanski,T.G(1985).Hash table reorganization.
Journal of algorithms,6(3),322-335.
• Nimbe,P.,Opoku,M. & Asante,A.(2014).Hash table
collision resolution using a multi dimensional
array.International journal of innovation and
scientific research,9(2),258-267.

Network Flow DAA
No ratings yet
Network Flow DAA
22 pages
Hash Tables
100% (1)
Hash Tables
30 pages
Hash Tables: Dr. Dibakar Saha
No ratings yet
Hash Tables: Dr. Dibakar Saha
26 pages
Unit II
No ratings yet
Unit II
7 pages
Search Algorithms AI Detailed
No ratings yet
Search Algorithms AI Detailed
6 pages
Hash Table
No ratings yet
Hash Table
9 pages
Unit 2 AI
No ratings yet
Unit 2 AI
107 pages
Review Exercises - Solution
No ratings yet
Review Exercises - Solution
17 pages
Dsa Lecture 13 Hash Tables
No ratings yet
Dsa Lecture 13 Hash Tables
15 pages
Topic 6 Hashing
No ratings yet
Topic 6 Hashing
31 pages
UNIT2
No ratings yet
UNIT2
25 pages
3-Solving Problems by Searching-31-07-2024
No ratings yet
3-Solving Problems by Searching-31-07-2024
160 pages
Solving Problems by Searching
No ratings yet
Solving Problems by Searching
71 pages
Assignment 1 Questions
No ratings yet
Assignment 1 Questions
16 pages
Backtracking
No ratings yet
Backtracking
7 pages
TCP2101 Algorithm Design & Analysis: - Hash Tables
No ratings yet
TCP2101 Algorithm Design & Analysis: - Hash Tables
58 pages
11 Hashing
No ratings yet
11 Hashing
60 pages
11 Hash Tables Slides
No ratings yet
11 Hash Tables Slides
34 pages
Hashing
No ratings yet
Hashing
66 pages
20 Hashing
No ratings yet
20 Hashing
47 pages
Hashing Updated
No ratings yet
Hashing Updated
26 pages
Ds 17hashing
No ratings yet
Ds 17hashing
27 pages
2.8. ADS - Collision Resolution-Extendible Hashing-1
No ratings yet
2.8. ADS - Collision Resolution-Extendible Hashing-1
47 pages
Hashing Presentation
No ratings yet
Hashing Presentation
12 pages
World Hash Directory 2008
100% (1)
World Hash Directory 2008
100 pages
MODULE 5 - BCS304 - HASHING - Leftisht Trees - OBST - Notes
No ratings yet
MODULE 5 - BCS304 - HASHING - Leftisht Trees - OBST - Notes
32 pages
SORTING PROGRAMS - Counting + Bucket + Heap
No ratings yet
SORTING PROGRAMS - Counting + Bucket + Heap
27 pages
11-Hashing-Hong Kong
No ratings yet
11-Hashing-Hong Kong
25 pages
The University of Jordan Department of Mathematics: Branch and Cut
No ratings yet
The University of Jordan Department of Mathematics: Branch and Cut
17 pages
University of Science and Technology Chittagong
No ratings yet
University of Science and Technology Chittagong
14 pages
Chapter 4: Modern Cryptography: Cryptographic Hash Functions
No ratings yet
Chapter 4: Modern Cryptography: Cryptographic Hash Functions
4 pages
Lecture 27 - Hashing
No ratings yet
Lecture 27 - Hashing
48 pages
DS Lecture - 6 (Hashing)
No ratings yet
DS Lecture - 6 (Hashing)
26 pages
Unit 1 Hashing
No ratings yet
Unit 1 Hashing
61 pages
ALNS Routing
No ratings yet
ALNS Routing
44 pages
Algo Cha 8
No ratings yet
Algo Cha 8
20 pages
Chapter10 HashTables
No ratings yet
Chapter10 HashTables
49 pages
DS Unit 6
No ratings yet
DS Unit 6
15 pages
CS3301 As02
No ratings yet
CS3301 As02
3 pages
HASHING
No ratings yet
HASHING
16 pages
Metaheuristic
No ratings yet
Metaheuristic
8 pages
L-2005-08-Advance Data Structure Part 1-HS
No ratings yet
L-2005-08-Advance Data Structure Part 1-HS
46 pages
COMP6065 - PPTI02 - R1 - Ok
No ratings yet
COMP6065 - PPTI02 - R1 - Ok
60 pages
Breadth First Search and Depth First Search Algorithms
No ratings yet
Breadth First Search and Depth First Search Algorithms
2 pages
Prof. Amey D.S.Kerkar Computer Engineering Department, Don Bosco College of Engineering Fatorda-Goa
No ratings yet
Prof. Amey D.S.Kerkar Computer Engineering Department, Don Bosco College of Engineering Fatorda-Goa
76 pages
Cse373 10 Hashing
No ratings yet
Cse373 10 Hashing
36 pages
Handout 9 - Hashing
No ratings yet
Handout 9 - Hashing
11 pages
Uninformed Search Algorithms
No ratings yet
Uninformed Search Algorithms
58 pages
Hashing
No ratings yet
Hashing
30 pages
Hashing New
No ratings yet
Hashing New
48 pages
Chap-1 ADS
No ratings yet
Chap-1 ADS
5 pages
Lect Hashing
No ratings yet
Lect Hashing
36 pages
Hash Table PDF
No ratings yet
Hash Table PDF
25 pages
15 HashTables
No ratings yet
15 HashTables
27 pages
ADI Hashing
No ratings yet
ADI Hashing
47 pages
Hashing in Data Structure
No ratings yet
Hashing in Data Structure
43 pages
Hash Table
No ratings yet
Hash Table
26 pages
Ads M Tech Mid 2
No ratings yet
Ads M Tech Mid 2
26 pages
DS Lecture - 6 (Hashing)
No ratings yet
DS Lecture - 6 (Hashing)
32 pages
Hashing Slide
No ratings yet
Hashing Slide
16 pages
Hashing
No ratings yet
Hashing
20 pages
DSAL Manual Assignment 4
No ratings yet
DSAL Manual Assignment 4
6 pages
Graph Traversal: Text Depth-First Search Breadth-First Search
No ratings yet
Graph Traversal: Text Depth-First Search Breadth-First Search
41 pages
Collision
No ratings yet
Collision
24 pages
Cs 218 - Data Structures: Hashing
No ratings yet
Cs 218 - Data Structures: Hashing
18 pages
Hashing Algorithms
No ratings yet
Hashing Algorithms
22 pages
Hashing 1
No ratings yet
Hashing 1
26 pages
Informed Search: Prepared by Dr. Megharani Patil
No ratings yet
Informed Search: Prepared by Dr. Megharani Patil
22 pages
AST20105 Data Structure and Algorithms: Chapter 9 - Hash Table
No ratings yet
AST20105 Data Structure and Algorithms: Chapter 9 - Hash Table
39 pages
Ch7 Hashing
No ratings yet
Ch7 Hashing
12 pages
3 Hashing
No ratings yet
3 Hashing
20 pages
Hashing PDF
No ratings yet
Hashing PDF
56 pages
Bin Packing Problem
No ratings yet
Bin Packing Problem
12 pages
Hash Tables
No ratings yet
Hash Tables
21 pages
Hashing
No ratings yet
Hashing
35 pages
Struktur Data: By: Sri Rezeki Candra Nursari
No ratings yet
Struktur Data: By: Sri Rezeki Candra Nursari
34 pages
L5 HashTables
No ratings yet
L5 HashTables
22 pages
Hashing
No ratings yet
Hashing
56 pages
DSA MK Lect2 PDF
No ratings yet
DSA MK Lect2 PDF
92 pages
Chapter 8 - Searching
No ratings yet
Chapter 8 - Searching
44 pages
Hashing
No ratings yet
Hashing
37 pages
Hashing PPT For Student
No ratings yet
Hashing PPT For Student
53 pages
Hashing
No ratings yet
Hashing
38 pages
20CB603 - Unit 2
No ratings yet
20CB603 - Unit 2
47 pages
Hashing
No ratings yet
Hashing
35 pages
Hashing and Indexing
No ratings yet
Hashing and Indexing
28 pages
Informed Search Algorithms
No ratings yet
Informed Search Algorithms
12 pages
DS Lecture - 6 (Hashing)
No ratings yet
DS Lecture - 6 (Hashing)
27 pages
An Introduction to Linear Algebra and Tensors
From Everand
An Introduction to Linear Algebra and Tensors
M. A. Akivis
1/5 (1)
Hashing
From Everand
Hashing
Prakash Hegade
No ratings yet

Group 15 Hash Tables

Uploaded by

Group 15 Hash Tables

Uploaded by

HASH TABLES

• A Hash table is data structures used to store

• To support insertion, deletion and search in

• Hash functions :These are mathematical operations

Division Based Hash Functions

• In the worst case, this requires a search of the

• In open addressing, there are no separate lists attached to the

then there will be no way to tell whether the search should

• In linear probing, the collision resolution function, f(i; x), is a

• But linear probing causes many more collisions.

• Quadratic probing eliminates clustering. In quadratic

• In double hashing, the sequence of probes is a linear sequence with

Advantages of open addressing over chaining

boolean contains(Object value)

boolean containsValue(Object value)

• public static void main(String args[]) {

• // Show all balances in hash table.

• Guttag,J.V(2015). Introduction to computation and

You might also like