
Hash Table

Hash tables provide average-case constant-time insertion, deletion, and search by using a hash function to map keys to array indices. Collisions occur when different keys hash to the same index; separate chaining resolves them by storing the colliding keys in a linked list at that index. The document uses an array to count integer frequencies, considers appropriate data structures for associating names with phone numbers, and reviews how separate chaining works by inserting example keys into a hash table and linking collided keys into buckets.

Uploaded by

Ram C. Gudavalli


Motivating Hash Tables

For a dictionary with n (key, value) pairs:

                        insert      find        delete
Unsorted linked list    O(1)        O(n)        O(n)
Unsorted array          O(1)        O(n)        O(n)
Sorted linked list      O(n)        O(n)        O(n)
Sorted array            O(n)        O(log n)    O(n)
Balanced tree           O(log n)    O(log n)    O(log n)
Magic array             O(1)        O(1)        O(1)

Sufficient magic:
Use key to compute array index for an item in O(1) time [doable]
Have a different index for every item [magic]

11/3/16

Motivating Hash Tables

Let's say you are tasked with counting the frequency of integers in a text file. You are guaranteed that only the integers 0 through 100 will occur:
For example: 5, 7, 8, 9, 9, 5, 0, 0, 1, 12
Result (value, count): 0 2, 1 1, 5 2, 7 1, ...
What structure is appropriate?
Tree?
List?
Array?
[Figure: an array indexed by the integer value, each cell holding that value's count]
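A minimal sketch of the array option for the counting task above, in Python. The function name count_frequencies and the max_value parameter are illustrative assumptions, not part of the slides. Because every value is guaranteed to lie in 0 through 100, the value itself can serve as the array index.

```python
def count_frequencies(values, max_value=100):
    """Count occurrences of integers known to lie in 0..max_value."""
    counts = [0] * (max_value + 1)  # one slot per possible value
    for v in values:
        counts[v] += 1  # the value is its own index, so each update is O(1)
    return counts


counts = count_frequencies([5, 7, 8, 9, 9, 5, 0, 0, 1, 12])

# Report only the values that actually occurred.
for value, count in enumerate(counts):
    if count:
        print(value, count)
```

This works only because the key space is small and known in advance, which is exactly what stops being true for keys such as names.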

Motivating Hash Tables

Now what if we want to associate a name with a phone number?
Suppose keys are (first, last) names
How big is the key space?

Maybe we only care about students

Hash Tables

Aim for constant-time (i.e., O(1)) find, insert, and delete
On average, under some often-reasonable assumptions

A hash table is an array of some fixed size

Basic idea:
[Figure: key space (e.g., integers, strings) -> hash function: index = h(key) -> hash table with indices 0 to TableSize - 1]

Hash Tables vs. Balanced Trees

In terms of a Dictionary ADT for just insert, find, delete, hash tables and balanced trees are just different data structures
Hash tables: O(1) on average (assuming we follow good practices)
Balanced trees: O(log n) worst-case

Constant-time is better, right?

Yes, but you need hashing to behave (must avoid collisions)
Yes, but findMin, findMax, predecessor, and successor go from O(log n) to O(n), printSorted from O(n) to O(n log n)
Why your textbook considers this to be a different ADT
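The cost of order-based operations can be sketched in Python as follows. This is an illustration under stated assumptions: the phone_book data is hypothetical, Python's dict plays the hash table, and a sorted list maintained with the standard bisect module stands in for a balanced tree (its inserts are O(n), so it models only the ordered queries, not a real tree's insert cost).

```python
import bisect

phone_book = {"mia": 5551, "zed": 5552, "abe": 5553}  # hypothetical data

# Hash table: keys are not kept in sorted order, so order queries must look at every key.
smallest = min(phone_book)      # O(n) scan
in_order = sorted(phone_book)   # O(n log n) sort before printing in order

# Sorted stand-in for a balanced tree: order is maintained as keys arrive.
sorted_keys = []
for name in phone_book:
    bisect.insort(sorted_keys, name)

# Successor of "abe": binary search, O(log n) comparisons.
position = bisect.bisect_right(sorted_keys, "abe")
successor = sorted_keys[position] if position < len(sorted_keys) else None
```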

Hash Tables
There are m possible keys (m typically large, even infinite)
We expect our table to have only n items
n is much less than m (often written n << m)
Many dictionaries have this property
Compiler: All possible identifiers allowed by the language vs. those used in some file of one program
Database: All possible student names vs. students enrolled
AI: All possible chess-board configurations vs. those considered by the current player

Hash functions
An ideal hash function:
Fast to compute
Rarely hashes two used keys to the same index
Often impossible in theory but easy in practice
Will handle collisions later

[Figure: key space (e.g., integers, strings) -> hash function: index = h(key) -> hash table with indices 0 to TableSize - 1]

Simple Integer Hash Functions

key space K = integers
TableSize = 7
h(K) = K % 7
Insert: 7, 18, 41

[Figure: empty table with indices 0 to 6]
Simple Integer Hash Functions

key space K = integers
TableSize = 10
h(K) = ??
Insert: 7, 18, 41, 34
What happens when we insert 44?

[Figure: table with indices 0 to 9, with 7 and 18 shown in the table]
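A small Python sketch of what happens with 44, assuming the hash function is h(K) = K % 10 (the slide leaves h(K) as a question). The collisions list is an illustrative device for recording which insert found its slot occupied; it is not a collision-resolution strategy.

```python
TABLE_SIZE = 10


def h(key):
    """Assumed hash function for this example: h(K) = K % TableSize."""
    return key % TABLE_SIZE


table = [None] * TABLE_SIZE
collisions = []  # (new key, key already in the slot, index)

for key in (7, 18, 41, 34, 44):
    index = h(key)
    if table[index] is None:
        table[index] = key
    else:
        collisions.append((key, table[index], index))

print(collisions)
```

Under this assumption 34 and 44 both hash to index 4, so inserting 44 finds the slot already taken.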

Aside: Properties of Mod

To keep hashed values within the size of the
table, we will generally do:

h(K) = function(K) % TableSize

(In the previous examples, function(K) = K.)

Useful properties of mod:

(a + b) % c = [(a % c) + (b % c)] % c
(a * b) % c = [(a % c) * (b % c)] % c
a % c = b % c implies (a - b) % c = 0
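The properties can be spot-checked numerically. The sketch below uses Python, where % returns a non-negative result for a positive modulus; other languages may give negative remainders for negative operands. The helper names check_mod_properties and sum_mod are illustrative. sum_mod shows the practical use of the first property: reducing at every step keeps intermediate values small.

```python
import itertools


def check_mod_properties(a, b, c):
    """Assert the three mod properties for one choice of a, b, c (c > 0)."""
    assert (a + b) % c == ((a % c) + (b % c)) % c
    assert (a * b) % c == ((a % c) * (b % c)) % c
    if a % c == b % c:
        assert (a - b) % c == 0


# Exhaustive check over a small range of values.
for a, b in itertools.product(range(-20, 21), repeat=2):
    for c in range(1, 12):
        check_mod_properties(a, b, c)


def sum_mod(values, c):
    """Sum of values modulo c, reduced at every step."""
    total = 0
    for v in values:
        total = (total + v % c) % c
    return total
```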

Designing Hash Functions

Often based on modular hashing:

h(K) = f(K) % P
P is typically the TableSize
P is often chosen to be prime:
Reduces likelihood of collisions due to patterns in data
Is useful for guarantees on certain hashing strategies (as we'll see)

Equivalent objects MUST hash to the same location
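To illustrate why a prime P can help with patterned data, here is a hedged Python sketch. The keys (multiples of 10) are a hypothetical worst case for TableSize = 10, and bucket_counts is an illustrative helper, with f(K) = K assumed.

```python
def bucket_counts(keys, table_size):
    """Count how many keys land at each index under h(K) = K % table_size."""
    counts = [0] * table_size
    for k in keys:
        counts[k % table_size] += 1
    return counts


keys = range(0, 100, 10)  # hypothetical patterned data: 0, 10, 20, ..., 90

print(bucket_counts(keys, 10))  # all ten keys share one index
print(bucket_counts(keys, 11))  # prime size: at most one key per index
```

With TableSize = 10 the keys share a common factor with the table size and pile up at index 0; with the prime 11 the same keys land at ten distinct indices. This is one example, not a guarantee for every data set.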

Some String Hash Functions

key space = strings
K = $s_0 s_1 s_2 \ldots s_{m-1}$ (where the $s_i$ are chars: $s_i \in [0, 128]$)

1. h(K) = $s_0$ % TableSize
   H("batman") = H("ballgame")

2. h(K) = $\left(\sum_{i=0}^{m-1} s_i\right)$ % TableSize
   H("spot") = H("pots")

3. h(K) = $\left(\sum_{i=0}^{m-1} s_i \cdot 37^i\right)$ % TableSize
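The three functions can be sketched in Python as follows. The function names are illustrative, ord is used to obtain each character's code, and the first function assumes a non-empty string. The table size 101 in the usage lines is an arbitrary choice for demonstration.

```python
def h_first_char(s, table_size):
    """Function 1: only the first character contributes."""
    return ord(s[0]) % table_size


def h_char_sum(s, table_size):
    """Function 2: sum of character codes; ignores character order."""
    return sum(ord(ch) for ch in s) % table_size


def h_polynomial(s, table_size):
    """Function 3: character i is weighted by 37**i, so order matters."""
    return sum(ord(ch) * 37 ** i for i, ch in enumerate(s)) % table_size


TABLE_SIZE = 101  # arbitrary table size for this demonstration

same_first = h_first_char("batman", TABLE_SIZE) == h_first_char("ballgame", TABLE_SIZE)
same_sum = h_char_sum("spot", TABLE_SIZE) == h_char_sum("pots", TABLE_SIZE)
same_poly = h_polynomial("spot", TABLE_SIZE) == h_polynomial("pots", TABLE_SIZE)
```

The first two comparisons are collisions for any table size, since the strings share a first character and are anagrams respectively. The positional weights in the third function separate "spot" from "pots" for this table size, although collisions remain possible in general.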

What to hash?
We will focus on the two most common things to hash: ints and strings
For objects with several fields, usually best to have most of the identifying fields contribute to the hash to avoid collisions
Example:
class Person {
String first; String middle; String last;
Date birthdate;
}
An inherent trade-off: hashing-time vs. collision-avoidance
Bad idea(?): Use only first name
Good idea(?): Use only middle initial? Combination of fields?
Admittedly, what-to-hash-with is often unprincipled
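One way to combine fields, sketched in Python rather than the Java-style notation of the slide. This is an assumption-laden illustration, not the slides' method: it hashes a tuple of all four fields with Python's built-in hash, and defines equality over the same fields so that equivalent objects hash to the same value. The sample names and date are hypothetical.

```python
from datetime import date


class Person:
    def __init__(self, first, middle, last, birthdate):
        self.first = first
        self.middle = middle
        self.last = last
        self.birthdate = birthdate

    def _key(self):
        # Every identifying field contributes to both equality and the hash.
        return (self.first, self.middle, self.last, self.birthdate)

    def __eq__(self, other):
        return isinstance(other, Person) and self._key() == other._key()

    def __hash__(self):
        return hash(self._key())


a = Person("Ann", "B", "Lee", date(2000, 1, 1))
b = Person("Ann", "B", "Lee", date(2000, 1, 1))
c = Person("Ann", "C", "Lee", date(2000, 1, 1))
people = {a, b, c}  # a and b are equivalent, so the set keeps one of them
```

Hashing all four fields costs more per call than hashing the first name alone, which is the hashing-time vs. collision-avoidance trade-off noted above.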

Deep Breath
Recap


Hash Tables: Review

Aim for constant-time (i.e., O(1)) find, insert, and delete
On average, under some reasonable assumptions

A hash table is an array of some fixed size
But growable, as we'll see

[Figure: the client passes a key of type E to the hash table library, which maps E to an int, maps the int to a table-index, and applies collision resolution on a collision; the hash table has indices up to TableSize - 1]

Collision resolution
Collision:
When two keys map to the same location in the hash table
We try to avoid it, but the number of possible keys exceeds the table size
So hash tables should support collision resolution
Ideas?

Separate Chaining
Chaining:
All keys that map to the same table location are kept in a list (a.k.a. a chain or bucket)
As easy as it sounds

Example:
insert 10, 22, 107, 12, 42 with mod hashing and TableSize = 10

Table after all five inserts (each new key goes at the front of its chain; / marks the end of a chain or an empty bucket):

0: 10 /
1: /
2: 42 -> 12 -> 22 /
3: /
4: /
5: /
6: /
7: 107 /
8: /
9: /
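The chaining example can be reproduced with a minimal Python sketch. The class name ChainedHashTable and its methods are illustrative assumptions. Python lists stand in for linked lists here, and new keys are placed at the front of a chain, matching the order in which the slides show 22, 12, and 42 sharing index 2.

```python
class ChainedHashTable:
    """Separate chaining for integer keys with h(K) = K % TableSize."""

    def __init__(self, table_size=10):
        self.buckets = [[] for _ in range(table_size)]

    def _index(self, key):
        return key % len(self.buckets)

    def insert(self, key):
        chain = self.buckets[self._index(key)]
        if key not in chain:      # ignore duplicates
            chain.insert(0, key)  # new keys go at the front of the chain

    def find(self, key):
        return key in self.buckets[self._index(key)]

    def delete(self, key):
        chain = self.buckets[self._index(key)]
        if key in chain:
            chain.remove(key)
            return True
        return False


table = ChainedHashTable(table_size=10)
for key in (10, 22, 107, 12, 42):
    table.insert(key)

for index, chain in enumerate(table.buckets):
    print(index, chain)
```

Every operation touches only the one chain selected by the hash, so its cost depends on that chain's length rather than on the total number of keys.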

More rigorous chaining analysis

Definition: The load factor, $\lambda$, of a hash table is

$\lambda = \dfrac{N}{\text{TableSize}}$, where N is the number of elements

Under chaining, the average number of elements per bucket is $\lambda$
So if some inserts are followed by random finds, then on average:
Each unsuccessful find compares against $\lambda$ items

So we like to keep $\lambda$ fairly low (e.g., 1 or 1.5 or 2) for chaining
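A short Python sketch of the load factor, applied to the chaining example with keys 10, 22, 107, 12, 42 and TableSize = 10. The function names are illustrative, and the second function assumes an unsuccessful find is equally likely to land on any index.

```python
def load_factor(buckets):
    """Number of stored elements divided by TableSize."""
    n = sum(len(chain) for chain in buckets)
    return n / len(buckets)


def avg_unsuccessful_comparisons(buckets):
    """An unsuccessful find walks the entire chain at its index.

    Averaging chain lengths over all indices (assumed equally likely)
    gives the expected number of comparisons.
    """
    return sum(len(chain) for chain in buckets) / len(buckets)


# State of the table after inserting 10, 22, 107, 12, 42.
buckets = [[10], [], [42, 12, 22], [], [], [], [], [107], [], []]

print(load_factor(buckets))
print(avg_unsuccessful_comparisons(buckets))
```

The two functions compute the same quantity by construction, which is the point of the analysis: under chaining, the average chain length is the load factor.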
