0% found this document useful (0 votes)

46 views5 pages

Lab 3

This document discusses different hashing techniques including division method, multiplication method, universal hashing, perfect hashing, chaining, open addressing, and double hashing. It also discusses issues with hashing like collisions and provides examples of using hashing to find anagrams in a set of words.

Uploaded by

RanaAsh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

46 views5 pages

Lab 3

Uploaded by

RanaAsh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

Cairo University

Faculty of Computers &Information

Department of Computer Science
Course: Algorithms
Lab #5
Hashing

The basic idea behind hashing is to take a field in a record, known as the key, and convert it
through some fixed process to a numeric value, known as the hash key, which represents the
position to either store or find an item in the table. The numeric value will be in the range of 0
to n-1, where n is the maximum number of slots (or buckets) in the table.

The fixed process to convert a key to a hash key is known as a hash function. This function will
be used whenever access to the table is needed.

One common method of determining a hash key is the division method of hashing.

Division Method

 A key is mapped into one of m slots using the function

h(k) = k mod m

the division method is generally a reasonable strategy, unless the key happens to have some
undesirable properties. For example, if m is 10 and all of the keys end in zero.
Good values for m are prime numbers and m should not be a power of 2 and a power of 10.

Multiplication method

The multiplication method for creating a hash function operates in two steps

1. Multiply the key k by a constant A in the range 0 <A<1, and extract the fractional part of kA

2. Multiply this value by m and take the floor of the re

h(k) = [m·(kA)]

Knuth suggests

A= (5–1 )/2 = 0.6180339887 is likely to work well.

E.g.,

m = 10000, k= 123456, and A =(√5−1)/2= 0.618033,

then

h(k) =[ 10000·(123456·0.61803)]

=[10000·(76300.0041151)]

=[10000·0.0041151]

= [41.151]

= 41

The advantage of this method is that the value choice of m is not critical

Universal Hashing

– Select a hash function at random, from a designed class of functions at the

beginning of the execution

H={h(k): U(0,1,..,m-1)}

H is said to be universal if for x=!y|{h ∈ H : h(x) = h(y)}| = |H|/ m.

What is the probability of collision in this case ? It is equal to the probability of choosing a

function h ∈ U such that x!=y -> h(x)=h(y) which is |H|/ m / |H|= 1/m.

With universal hashing the chance of collision between distinct keys k and l is no more than the

1/m chance of collision if locations h(k) and h(l) were randomly and independently chosen from

the set {0, 1, …, m – 1}

Perfect Hashing

Perfect hashing is a technique for building a hash table with no collisions. It is only possible to
build one when we know all of the keys in advance

E.g. if I know the exact keys then it is trivial to produce a perfect hash function

int hash (int n) {

switch (n) {
case 10: return 0;
case 100: return 1;
case 32: return 2;
// ...
default: return -1;
}
}

ISSUES WITH HASHING

 Multiple keys can hash to the same slot

 Design hash functions such that collisions are minimized. But avoiding collisions is
impossible.
 Search will cost o(n) time in the worst case.
 However, all operations can be made to have an expected complexity of O(1).

Chaining
 Store all elements that hash to the same slot in a linked list.
 Store a pointer to the head of the linked list in the hash table slot.
Open Addressing

All elements stored in the hash table itself. When collisions occur,
use a systematic (consistent) procedure to store elements in free slots of the table.
Example of a systematic procedure is to save the key that make collision in
the first empty slot after the slot of the collision

h(k,i) = (h′(k) +i) mod m.

Another way to sharply reduce clustering (collision) is to increment not by a constant (as is
done in linear probing) but, by an amount that depends on the Key. We thus have a
second hashing function, This technique is called double hashing

h(k,i) = (h1(k) +i⋅h2(k)) mod m

Analysis of chaining

Let n be the number of keys in the table, and let m be the number of slots.

Define the load factor of T to be

α = n/m
= average number of keys per slot

The expected time for an unsuccessful search for a record with a given key is = Θ(1 + α).

Practice:

Given a set of words, we need to find the anagram words and display each category alone using
chaining method(linked list) and using linear probing

An anagram is a word or phrase formed by reordering the letters of another word or phrase.
Here is a list of words such that the words on each line are anagrams of each other:
barde, ardeb, bread, debar, beard, bared

Hint
The important thing is to make the key for your hash function unique.
The idea is to sort the word where the word is sorted by letter, so "car" => "acr". All anagrams
will have the same "sorted word".

Lecture 27 - Hashing
No ratings yet
Lecture 27 - Hashing
48 pages
Hashing
No ratings yet
Hashing
30 pages
Hashing: John Erol Evangelista
No ratings yet
Hashing: John Erol Evangelista
38 pages
Hashing
No ratings yet
Hashing
23 pages
ADI Hashing
No ratings yet
ADI Hashing
47 pages
Hashing
No ratings yet
Hashing
48 pages
Lecture 8 Hashing
No ratings yet
Lecture 8 Hashing
47 pages
HASHING
No ratings yet
HASHING
8 pages
Hashing
No ratings yet
Hashing
34 pages
Hashing Algorithms
No ratings yet
Hashing Algorithms
22 pages
HAshing (Satish Sir)
No ratings yet
HAshing (Satish Sir)
52 pages
UNIT V - Hashing
No ratings yet
UNIT V - Hashing
20 pages
CH 4
No ratings yet
CH 4
58 pages
Unit-5 2
No ratings yet
Unit-5 2
9 pages
Hashing
No ratings yet
Hashing
30 pages
What Is Hashing
No ratings yet
What Is Hashing
11 pages
Hashing and Indexing
No ratings yet
Hashing and Indexing
28 pages
Overview of Hash Tables
No ratings yet
Overview of Hash Tables
4 pages
Dsa 4
No ratings yet
Dsa 4
55 pages
Cse373 10 Hashing
No ratings yet
Cse373 10 Hashing
36 pages
Hashing
No ratings yet
Hashing
44 pages
Lect Hashing
No ratings yet
Lect Hashing
36 pages
Hashing
No ratings yet
Hashing
56 pages
Module 5
No ratings yet
Module 5
33 pages
Hash Tables - : Structure
No ratings yet
Hash Tables - : Structure
21 pages
Modue 5
No ratings yet
Modue 5
10 pages
Ads-Unit I-Hashing
No ratings yet
Ads-Unit I-Hashing
14 pages
Module 6 DSA 24
No ratings yet
Module 6 DSA 24
64 pages
3 Hashing
No ratings yet
3 Hashing
20 pages
DS Lecture - 6 (Hashing)
No ratings yet
DS Lecture - 6 (Hashing)
27 pages
DS Lecture 01.1 Fall-24-35
No ratings yet
DS Lecture 01.1 Fall-24-35
20 pages
Unit 1 Hashing
No ratings yet
Unit 1 Hashing
61 pages
Hashing Data Structure
No ratings yet
Hashing Data Structure
22 pages
2,2 Hashing
No ratings yet
2,2 Hashing
30 pages
Unit 5 Session 5 Hashing
No ratings yet
Unit 5 Session 5 Hashing
20 pages
Hashing
No ratings yet
Hashing
23 pages
05 Hashing
No ratings yet
05 Hashing
47 pages
Hashing
No ratings yet
Hashing
7 pages
HASHING
No ratings yet
HASHING
63 pages
Lecture 7 - Hash - Table - Direct - Adreess - Tables - Hash - Tables - Intro - Separate - Chaining
No ratings yet
Lecture 7 - Hash - Table - Direct - Adreess - Tables - Hash - Tables - Intro - Separate - Chaining
77 pages
Hashing
No ratings yet
Hashing
20 pages
Hashing Methods
No ratings yet
Hashing Methods
20 pages
Chapter 5 - Hashing - Part1
No ratings yet
Chapter 5 - Hashing - Part1
28 pages
L5 HashTables
No ratings yet
L5 HashTables
22 pages
Hash Table
No ratings yet
Hash Table
26 pages
Exp 5 - Dsa Lab File
No ratings yet
Exp 5 - Dsa Lab File
10 pages
SORTING PROGRAMS - Counting + Bucket + Heap
No ratings yet
SORTING PROGRAMS - Counting + Bucket + Heap
27 pages
Analysis of Algorithms CS 477/677: Hashing Instructor: George Bebis
No ratings yet
Analysis of Algorithms CS 477/677: Hashing Instructor: George Bebis
53 pages
Hash Function
No ratings yet
Hash Function
9 pages
Analysis of Algorithms CS 477/677: Hashing Instructor: George Bebis
No ratings yet
Analysis of Algorithms CS 477/677: Hashing Instructor: George Bebis
53 pages
Hashing PPT For Student
No ratings yet
Hashing PPT For Student
53 pages
Unit 5
No ratings yet
Unit 5
50 pages
Hashing
No ratings yet
Hashing
20 pages
Unit 5 Data Structure
No ratings yet
Unit 5 Data Structure
12 pages
Algo Cha 8
No ratings yet
Algo Cha 8
20 pages
Week13 1
No ratings yet
Week13 1
16 pages
Dat Astruc T Hashing Rep
No ratings yet
Dat Astruc T Hashing Rep
13 pages
DS Lecture - 6 (Hashing)
No ratings yet
DS Lecture - 6 (Hashing)
26 pages

Lab 3

Uploaded by

Lab 3

Uploaded by

Cairo University

Faculty of Computers &Information

 A key is mapped into one of m slots using the function

2. Multiply this value by m and take the floor of the re

A= (5–1 )/2 = 0.6180339887 is likely to work well.

m = 10000, k= 123456, and A =(√5−1)/2= 0.618033,

– Select a hash function at random, from a designed class of functions at the

H is said to be universal if for x=!y|{h ∈ H : h(x) = h(y)}| = |H|/ m.

the set {0, 1, …, m – 1}

int hash (int n) {

ISSUES WITH HASHING

 Multiple keys can hash to the same slot

h(k,i) = (h′(k) +i) mod m.

h(k,i) = (h1(k) +i⋅h2(k)) mod m

Define the load factor of T to be

You might also like