9 DictionaryandHashing-1

Uploaded by

Rohan Work

DICTIONARY (MAP) AND HASHING

J.Govindarajan, Asst.Prof. (S.G) , CSE

Map

• A map is an abstract data type designed to efficiently store and retrieve values based upon a uniquely identifying search key for each.
• A map stores key-value pairs (k,v), which are called entries.
• Also known as an associative array: an entry’s key serves as the index, and the key need not be numeric.
Map : Applications

• A university’s information system relies on some form of student ID as a key.
• The domain-name system (DNS) maps a host name, such as www.wiley.com, to an Internet-Protocol (IP) address, such as 208.215.179.146.
• A social media site typically relies on a (nonnumeric) username as a key.
• A company’s customer base may be stored as a map.
• A computer graphics system may map a color name to RGB numbers.

Multimap ADT

• Allows multiple entries to have the same key.
• A multimap may contain entries (k,v) and (k,v′) having the same key.
Dictionary
• Models a searchable collection of key-element items
  – Multiple items with the same key are allowed
• Main operations include insertion, searching, and deletion
• Applications
  – Telephone directory
  – Mapping student information to roll numbers
Dictionary : ADT

• find(k): if the dictionary has an item with key k, returns the position of this item; otherwise, returns a null position.
• insertItem(k, o): inserts item o with key k into the dictionary.
• removeElement(k): removes the item with key k from the dictionary. An exception is raised if there is no such element.
• Other functions
  – size(), isEmpty()
  – keys(), elements()
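
The operations above can be sketched in Java as an unordered, list-based dictionary. This is only an illustration under assumptions that are not in the slides: the class name ListDictionary, the use of a list index as the "position", -1 as the null position, and removeElement returning the removed element are all hypothetical choices.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.NoSuchElementException;

// Hypothetical unordered dictionary: items are kept in insertion order,
// and keys are compared only with equals().
class ListDictionary<K, V> {
    private final List<K> keys = new ArrayList<>();
    private final List<V> elements = new ArrayList<>();

    // Returns the index of the first item with key k, or -1 as a "null position".
    int find(K k) {
        return keys.indexOf(k);
    }

    // Duplicate keys are allowed, so the new item is simply appended.
    void insertItem(K k, V o) {
        keys.add(k);
        elements.add(o);
    }

    // Removes the first item with key k and returns its element.
    V removeElement(K k) {
        int pos = find(k);
        if (pos < 0) {
            throw new NoSuchElementException("no item with key " + k);
        }
        keys.remove(pos);
        return elements.remove(pos);
    }

    V elementAt(int pos) { return elements.get(pos); }
    int size() { return keys.size(); }
    boolean isEmpty() { return keys.isEmpty(); }
}
```

Every operation except insertItem scans the list, so this sketch takes O(n) time per search; it is meant to show the interface, not an efficient implementation.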
Dictionary
• Types
  – Ordered dictionaries: a total order relation is defined on the keys
  – Unordered dictionaries: no order relation is assumed on the keys; only equality testing between keys is used
• Associative stores
  – When keys are unique, keys are like addresses to the location where the element is stored
Dictionary : Using Direct Addressing

• The item with key k is stored in slot k

• Applicable when the universe of possible keys is small and the keys are unique
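
A minimal sketch of direct addressing, assuming integer keys drawn from the range 0 to universeSize - 1. The class and method names are hypothetical and chosen only for illustration.

```java
// Hypothetical direct-address table for integer keys in 0..universeSize-1.
class DirectAddressTable<V> {
    private final Object[] slots;   // slots[k] holds the value for key k, or null

    DirectAddressTable(int universeSize) {
        slots = new Object[universeSize];
    }

    // Each operation touches exactly one slot, so it runs in O(1) time.
    void insert(int k, V value) { slots[k] = value; }

    @SuppressWarnings("unchecked")
    V find(int k) { return (V) slots[k]; }

    void remove(int k) { slots[k] = null; }
}
```

The array is allocated for the whole universe of keys up front, regardless of how many keys are actually stored.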
Dictionary : Using Direct Addressing - not suitable

• If the universe U is large, storing a table T of size |U| may be impractical, or even impossible, given the memory available on a typical computer.
• The set K of keys actually stored may be so small relative to U that most of the space allocated for T would be wasted.
Dictionary : Using Hashing

• A hash function maps each key to an index in the table:

index = hash(key)
Hashing : An Example

Telephone directory
Hashing Function

key k → integer (hash code) → index
Hash Code Maps

• Bit representation as an integer
  – For a 64-bit key, combine in some way the high-order and low-order portions to form a 32-bit hash code, for example by summing the two halves
• Polynomial hash codes
• Cyclic-shift hash codes
  – A variant of the polynomial hash code that replaces multiplication by a with a cyclic shift of a partial sum by a certain number of bits
HashCode

• Memory address
  – Interpret the memory address of the key object as an integer
• Casting to an integer
  – E.g., Float.floatToIntBits(x) in Java
  – Suitable for keys whose bit length is no greater than that of an integer
• Summing the components
  – View the binary representation of an object x as an n-tuple (x0, x1, . . . , xn−1) of 32-bit integers, and add the components together
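Summing the Components

• Example: summing the ASCII codes of the characters

STOP - 83 + 84 + 79 + 80 = 326

POTS - 80 + 79 + 84 + 83 = 326

• The sum ignores the order of the characters, so any two strings made of the same letters collide.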
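
The collision between STOP and POTS can be reproduced with a few lines of Java. This is a small sketch; the class name SumHash is hypothetical.

```java
class SumHash {
    // Adds up the character codes of s; the order of the characters is ignored.
    static int hashCode(String s) {
        int h = 0;
        for (int i = 0; i < s.length(); i++) {
            h += s.charAt(i);
        }
        return h;
    }

    public static void main(String[] args) {
        System.out.println(hashCode("STOP")); // 326
        System.out.println(hashCode("POTS")); // 326, the same as STOP
    }
}
```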
Polynomial Accumulation

• Hash code: $x_0 a^{n-1} + x_1 a^{n-2} + \cdots + x_{n-2} a + x_{n-1}$
• Choosing a = 33, 37, 39, or 41 gives at most 6 collisions on a vocabulary of 50,000 words.
• By Horner’s rule, the polynomial can be computed as

$x_{n-1} + a(x_{n-2} + a(x_{n-3} + \cdots + a(x_2 + a(x_1 + a x_0)) \cdots))$

Example

• $x_{n-1} + a(x_{n-2} + a(x_{n-3} + \cdots + a(x_2 + a(x_1 + a x_0)) \cdots))$

• STOP (if a = 33)

80 + 33*(79 + 33*(84 + 33*83)) = 3076934

• POTS (if a = 33)

83 + 33*(84 + 33*(79 + 33*80)) = 2963846
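
A short Java sketch of the polynomial hash code, evaluated exactly as the Horner form is written, with no multiplication outside the outermost parenthesis. Under that reading, STOP and POTS hash to 3076934 and 2963846 for a = 33. The class name PolynomialHash is hypothetical.

```java
class PolynomialHash {
    // Evaluates x0*a^(n-1) + x1*a^(n-2) + ... + x(n-1) by Horner's rule.
    // int arithmetic wraps around on overflow, which is acceptable for a hash code.
    static int hashCode(String s, int a) {
        int h = 0;
        for (int i = 0; i < s.length(); i++) {
            h = a * h + s.charAt(i);
        }
        return h;
    }

    public static void main(String[] args) {
        System.out.println(hashCode("STOP", 33)); // 3076934
        System.out.println(hashCode("POTS", 33)); // 2963846
    }
}
```

Unlike the sum of components, the result depends on character positions, so the two anagrams no longer collide.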
Cyclic-Shift Hash Codes
• Example:
static int hashCode(String s) {
    int h = 0;
    for (int i = 0; i < s.length(); i++) {
        h = (h << 5) | (h >>> 27);  // 5-bit cyclic shift of the running sum
        h += (int) s.charAt(i);     // add in next character
    }
    return h;
}

• Example results:

STOP - 2808368

POTS - 2705107
Hash Code Exercise

What would be a good hash code for a vehicle identification number that is a string of numbers and letters of the form "9X9XX99X9XX999999", where a "9" represents a digit and an "X" represents a letter?
Answer:

• Summing components, polynomial hash codes, or cyclic-shift hash codes would all be appropriate.
• The key consists of 6 letters and 11 digits, which can be broken into pieces: two groups of three letters, two groups of four digits, and one group of three digits. This gives five numbers, or components, (x0, x1, x2, x3, x4), whose maximum values are 17575 ($26^3 - 1$), 17575 ($26^3 - 1$), 9999, 9999, and 999, respectively.
• For the size of our hash table, we can choose a prime number near 20000 (for example, N = 19997).

Answer (continued)

If we use summing of components, the hash function would be
$(x_0 + x_1 + x_2 + x_3 + x_4) \bmod N$
For a polynomial hash code, the hash function is
$(x_4 a^4 + x_3 a^3 + x_2 a^2 + x_1 a + x_0) \bmod N$
which can be calculated using Horner's rule as
$(x_0 + a(x_1 + a(x_2 + a(x_3 + a x_4)))) \bmod N$
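
One possible Java rendering of this answer is sketched below. It assumes upper-case letters read as base-26 numbers (A = 0, ..., Z = 25) and a polynomial base of 33; the class name VinHash, the helper names, and the base are illustrative choices, not part of the exercise.

```java
class VinHash {
    static final int N = 19997;  // table size suggested in the answer
    static final int A = 33;     // assumed polynomial base (not fixed by the exercise)

    // Splits a key of the form 9X9XX99X9XX999999 into five components:
    // two 3-letter groups (read in base 26) and digit groups of 4, 4 and 3 digits.
    static int[] components(String vin) {
        StringBuilder letters = new StringBuilder();
        StringBuilder digits = new StringBuilder();
        for (char c : vin.toCharArray()) {
            if (Character.isDigit(c)) digits.append(c); else letters.append(c);
        }
        return new int[] {
            base26(letters.substring(0, 3)),
            base26(letters.substring(3, 6)),
            Integer.parseInt(digits.substring(0, 4)),
            Integer.parseInt(digits.substring(4, 8)),
            Integer.parseInt(digits.substring(8, 11))
        };
    }

    // Reads upper-case letters as a base-26 number: AAA -> 0, ZZZ -> 17575.
    static int base26(String s) {
        int v = 0;
        for (int i = 0; i < s.length(); i++) v = 26 * v + (s.charAt(i) - 'A');
        return v;
    }

    // (x0 + x1 + x2 + x3 + x4) mod N
    static int sumHash(String vin) {
        long sum = 0;
        for (int x : components(vin)) sum += x;
        return (int) (sum % N);
    }

    // (x4*a^4 + x3*a^3 + x2*a^2 + x1*a + x0) mod N, by Horner's rule.
    // Reducing mod N at every step keeps the intermediate values small.
    static int polynomialHash(String vin) {
        int[] x = components(vin);
        long h = 0;
        for (int i = x.length - 1; i >= 0; i--) h = (h * A + x[i]) % N;
        return (int) h;
    }
}
```

For the largest possible key, "9Z9ZZ99Z9ZZ999999", components returns 17575, 17575, 9999, 9999 and 999, matching the maximum values listed in the answer.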
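Compression Functions

• Division method

i mod N

• MAD (Multiply-Add-and-Divide) method

[(ai + b) mod p] mod N,

where N is the size of the bucket array, p is a prime number larger than N, and a and b are integers chosen at random from the interval [0, p − 1], with a > 0.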
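
Both compression functions can be sketched in Java as follows. The class name and parameter names are hypothetical, and the caller is assumed to supply a prime p larger than N together with suitable a and b.

```java
class Compression {
    // Division method: floorMod keeps the result in 0..N-1 even for negative hash codes.
    static int division(int i, int N) {
        return Math.floorMod(i, N);
    }

    // MAD method: [(a*i + b) mod p] mod N, using long arithmetic to avoid int overflow.
    static int mad(int i, long a, long b, long p, int N) {
        return (int) (Math.floorMod(a * i + b, p) % N);
    }
}
```

Math.floorMod is used instead of the % operator because Java hash codes can be negative, and % would then produce a negative index.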

Collision-Handling Schemes

• Separate chaining

• Open addressing
- linear probing
- quadratic probing
- double hashing
- probing with a pseudorandom number generator
Separate Chaining
• Each bucket A[j] holds its own secondary container (a chain) of the entries whose keys hash to j.
• With a good hash function, the core map operations run in O(⌈n/N⌉) expected time, where λ = n/N is called the load factor of the hash table.
• As long as λ is O(1), the core operations on the hash table run in O(1) expected time.

Figure: hash table of size 13, storing 10 entries
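
A compact Java sketch of a map with separate chaining follows. It assumes a fixed number of buckets and uses the key's own hashCode() with the division method; the class name ChainHashMap and its methods are hypothetical, and no resizing is done.

```java
import java.util.ArrayList;
import java.util.LinkedList;
import java.util.List;

// Hypothetical map with separate chaining: bucket j holds a list of all
// entries whose keys compress to index j.
class ChainHashMap<K, V> {
    private static final class Entry<K, V> {
        final K key;
        V value;
        Entry(K key, V value) { this.key = key; this.value = value; }
    }

    private final List<List<Entry<K, V>>> buckets = new ArrayList<>();
    private int n = 0;  // number of entries stored

    ChainHashMap(int capacity) {
        for (int j = 0; j < capacity; j++) buckets.add(new LinkedList<>());
    }

    private List<Entry<K, V>> bucketFor(K key) {
        return buckets.get(Math.floorMod(key.hashCode(), buckets.size()));
    }

    V get(K key) {
        for (Entry<K, V> e : bucketFor(key)) {
            if (e.key.equals(key)) return e.value;
        }
        return null;
    }

    // Returns the previous value for key, or null if the key was absent.
    V put(K key, V value) {
        List<Entry<K, V>> bucket = bucketFor(key);
        for (Entry<K, V> e : bucket) {
            if (e.key.equals(key)) {
                V old = e.value;
                e.value = value;
                return old;
            }
        }
        bucket.add(new Entry<>(key, value));
        n++;
        return null;
    }

    // Removes the entry for key and returns its value, or null if absent.
    V remove(K key) {
        List<Entry<K, V>> bucket = bucketFor(key);
        for (Entry<K, V> e : bucket) {
            if (e.key.equals(key)) {
                bucket.remove(e);   // safe: we return before the iterator advances
                n--;
                return e.value;
            }
        }
        return null;
    }

    int size() { return n; }
    double loadFactor() { return (double) n / buckets.size(); }
}
```

Each operation only scans the one chain selected by the hash, which is why the running time depends on the load factor rather than on n directly.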
Open Addressing

• This approach saves space because no auxiliary structures are employed.
• Open addressing requires that the load factor is always at most 1 and that entries are stored directly in the cells of the bucket array itself.
Linear Probing

Example hash function: h(k) = k mod 11.

Finding an empty bucket: if A[j] is occupied, where j = h(k), next try A[(j+1) mod N], then A[(j+2) mod N], and so on.
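Linear Probing

• The get, put, and remove operations must be modified to handle deletions.
• Replace a deleted entry with a special “defunct” sentinel object, to avoid ending a later search early at an emptied cell.
• A search for k must skip over defunct cells; put should remember a defunct location encountered during the search for k, since that cell can be reused for the new entry.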
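
The handling of the defunct sentinel can be sketched in Java as below. The sketch assumes integer keys, string values, h(k) = k mod N and a fixed capacity; the class name ProbeHashMap and its helpers are hypothetical.

```java
// Hypothetical open-addressing table with linear probing, h(k) = k mod N.
class ProbeHashMap {
    private static final Object DEFUNCT = new Object();  // marks a removed entry
    private final Object[] keys;    // null = never used, DEFUNCT = removed, else an Integer key
    private final String[] values;

    ProbeHashMap(int capacity) {
        keys = new Object[capacity];
        values = new String[capacity];
    }

    private int h(int k) { return Math.floorMod(k, keys.length); }

    // Index of the cell holding k, or -1 if k is absent.
    private int findIndex(int k) {
        int j = h(k);
        for (int i = 0; i < keys.length; i++) {
            int idx = (j + i) % keys.length;
            Object cell = keys[idx];
            if (cell == null) return -1;   // never-used cell: k cannot be further on
            if (cell != DEFUNCT && (Integer) cell == k) return idx;
        }
        return -1;
    }

    String get(int k) {
        int idx = findIndex(k);
        return idx < 0 ? null : values[idx];
    }

    // Returns the previous value for k, or null if k was absent.
    String put(int k, String v) {
        int j = h(k);
        int avail = -1;                    // first reusable cell seen during the search
        for (int i = 0; i < keys.length; i++) {
            int idx = (j + i) % keys.length;
            Object cell = keys[idx];
            if (cell == null) {            // end of the cluster: k is not in the table
                if (avail == -1) avail = idx;
                break;
            }
            if (cell == DEFUNCT) {
                if (avail == -1) avail = idx;   // remember it, but keep searching for k
            } else if ((Integer) cell == k) {
                String old = values[idx];
                values[idx] = v;
                return old;
            }
        }
        if (avail == -1) throw new IllegalStateException("table is full");
        keys[avail] = k;                   // autoboxed to Integer
        values[avail] = v;
        return null;
    }

    // Marks the cell as defunct instead of emptying it.
    String remove(int k) {
        int idx = findIndex(k);
        if (idx < 0) return null;
        String old = values[idx];
        keys[idx] = DEFUNCT;
        values[idx] = null;
        return old;
    }
}
```

With capacity 11, the keys 13, 24 and 35 all hash to index 2 and occupy cells 2, 3 and 4. After remove(24), get(35) still succeeds because the search steps over the defunct cell 3, and a later put of another key hashing to 2 reuses that cell.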
Quadratic probing

Finding an empty bucket: try

A[(h(k) + f(i)) mod N], for i = 0, 1, 2, . . ., where $f(i) = i^2$

• Complicates the removal operation.
• Avoids the kinds of clustering patterns that occur with linear probing.
• However, it creates its own kind of clustering, called secondary clustering.
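
The probe sequence of quadratic probing can be listed with a small Java helper. The class and method names are hypothetical, and the home index h = 3 with N = 11 in the usage note is only an example.

```java
class QuadraticProbe {
    // First `count` cells examined for a key with home index h in a table of size N:
    // (h + i*i) mod N for i = 0, 1, 2, ...
    static int[] probeSequence(int h, int N, int count) {
        int[] seq = new int[count];
        for (int i = 0; i < count; i++) {
            seq[i] = (int) ((h + (long) i * i) % N);
        }
        return seq;
    }
}
```

For h = 3 and N = 11, the first six probes are 3, 4, 7, 1, 8, 6. The seventh probe, (3 + 36) mod 11 = 6, revisits a cell that was already tried, which illustrates why quadratic probing does not necessarily reach every cell.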
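Double hashing

• Uses a secondary hash function, h′. If A[h(k)] is already occupied, next try

A[(h(k) + f(i)) mod N], for i = 1, 2, 3, . . ., where f(i) = i · h′(k)

• h′(k) must never evaluate to zero. A common choice is h′(k) = q − (k mod q), for some q < N (typically a prime).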

Double Hashing example

• h(k) = k mod 13 and h′(k) = 8 − (k mod 8)

• h(k) = k mod 7 and h′(k) = 5 − (k mod 5)

In each case, the cells tried after A[h(k)] are A[(h(k) + f(i)) mod N], for i = 1, 2, 3, . . ., where f(i) = i · h′(k)
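
Insertion with double hashing can be sketched in Java using the first pair of functions above, h(k) = k mod 13 and h′(k) = 8 − (k mod 8). The class name, the insert helper, and the keys in the usage note are hypothetical; keys are assumed to be non-negative.

```java
class DoubleHashing {
    static final int N = 13;  // table size from the first example
    static final int Q = 8;   // constant of the secondary hash function

    static int h(int k) { return k % N; }
    static int h2(int k) { return Q - (k % Q); }   // always in 1..Q, never zero

    // Inserts a non-negative key k into table (null = empty cell)
    // and returns the index of the cell that was used.
    static int insert(Integer[] table, int k) {
        for (int i = 0; i < N; i++) {
            int idx = (h(k) + i * h2(k)) % N;
            if (table[idx] == null) {
                table[idx] = k;
                return idx;
            }
        }
        throw new IllegalStateException("no free cell found for " + k);
    }
}
```

Inserting 18, 41, 22 and 44 into an empty table places them at indices 5, 2, 9 and 0. Key 44 first collides with 18 at index 5, then with 22 at index (5 + 4) mod 13 = 9, and finally lands at (5 + 8) mod 13 = 0, since h′(44) = 8 − 4 = 4.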
Exercise

• Draw the 11-entry hash table that results from using the hash function, h(i) = (3i+5) mod 11, to hash the keys 12, 44, 13, 88, 23, 94, 11, 39, 20, 16, and 5, assuming collisions are handled by chaining.
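
A hand-drawn table for this exercise can be checked with a short Java program. The sketch below assumes each chain keeps keys in insertion order; the class name ChainingExercise is hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

class ChainingExercise {
    // Builds the 11 chains for h(i) = (3i + 5) mod 11, appending keys in the given order.
    static List<List<Integer>> build(int[] keys) {
        List<List<Integer>> table = new ArrayList<>();
        for (int j = 0; j < 11; j++) table.add(new ArrayList<>());
        for (int k : keys) table.get((3 * k + 5) % 11).add(k);
        return table;
    }

    public static void main(String[] args) {
        int[] keys = {12, 44, 13, 88, 23, 94, 11, 39, 20, 16, 5};
        List<List<Integer>> table = build(keys);
        for (int j = 0; j < table.size(); j++) {
            System.out.println(j + ": " + table.get(j));
        }
    }
}
```

Running main prints one line per bucket index, with the chain of keys stored there.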
