CS 332: Algorithms: Hash Tables

This document provides an overview of hash tables and skip lists. It discusses implementing skip lists for homework 4 which involves building skip lists, evaluating their performance, and optionally comparing them to randomly built binary search trees. It then reviews hash tables and hash functions, covering direct addressing, collisions, open addressing, chaining, analysis of chaining, choices of hash functions like division and multiplication methods, and universal hashing to prevent worst case scenarios from adversaries.

Uploaded by

Anonymous niE5VQOH


CS 332: Algorithms

Hash Tables

Homework 4
Programming assignment:
Implement Skip Lists
Evaluate performance
Extra credit: compare performance to randomly built BST
Due Mar 9 (Fri before Spring Break)
Will post later today

Review: Skip Lists

The basic idea:
[Figure: a skip list drawn with three levels, labeled level 1 through level 3]

Keep a doubly-linked list of elements

Min, max, successor, predecessor: O(1) time
Delete is O(1) time, Insert is O(1)+Search time

During insert, add each level-i element to level i+1 with probability p (e.g., p = 1/2 or p = 1/4)
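
The promotion rule above can be sketched in a few lines. This is only an illustration, not the homework solution: the function name `random_level` and the `max_level` cap are assumptions introduced here, and the slides do not prescribe either.

```python
import random

def random_level(p=0.5, max_level=32, rng=random):
    """Return how many levels a newly inserted element occupies (at least 1).

    max_level is a hypothetical safety cap, not something the slides specify.
    """
    level = 1
    # An element already on level i is also added to level i+1 with probability p.
    while level < max_level and rng.random() < p:
        level += 1
    return level

# Rough sanity check: with p = 1/2, about half of the elements reach level 2.
rng = random.Random(0)
levels = [random_level(0.5, rng=rng) for _ in range(10_000)]
share_at_least_2 = sum(lvl >= 2 for lvl in levels) / len(levels)
```

With p = 1/4 fewer elements are promoted, so the upper levels are sparser and use less storage, at the cost of longer walks on each level.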

Summary: Skip Lists

O(1) expected time for most operations
O(lg n) expected time for insert
O(n^2) time worst case
But random, so no particular order of insertion evokes worst-case behavior

O(n) expected storage requirements
Easy to code

Review: Hash Tables

Hash table:
Given a table T and a record x, with key (= symbol) and satellite data, we need to support:

Insert (T, x)
Delete (T, x)
Search (T, x)

We want these to be fast, but don't care about sorting the records
In this discussion we consider all keys to be (possibly large) natural numbers

Review: Direct Addressing

Suppose:
The range of keys is 0..m-1
Keys are distinct

The idea:
Set up an array T[0..m-1] in which
T[i] = x if x ∈ T and key[x] = i
T[i] = NULL otherwise

This is called a direct-address table

Operations take O(1) time!
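
A minimal sketch of a direct-address table, assuming distinct integer keys in 0..m-1 as stated above. The class and method names are illustrative, and Python's `None` stands in for NULL.

```python
class DirectAddressTable:
    """Direct addressing: the key itself is the array index."""

    def __init__(self, m):
        # One slot per possible key; initializing them all costs O(m).
        self.slots = [None] * m

    def insert(self, key, data):
        self.slots[key] = (key, data)   # O(1)

    def delete(self, key):
        self.slots[key] = None          # O(1)

    def search(self, key):
        return self.slots[key]          # O(1); None means "not present"

table = DirectAddressTable(8)
table.insert(3, "three")
```

Every operation is a single array access, which is where the O(1) bound comes from. The price is one slot for every possible key, whether or not it is ever used.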

Review: The Problem With Direct Addressing
Direct addressing works well when the range m of keys is relatively small

But what if the keys are 32-bit integers?
Problem 1: direct-address table will have 2^32 entries, more than 4 billion

Problem 2: even if memory is not an issue, the time to
initialize the elements to NULL may be
Solution: map keys to smaller range 0..m-1
This mapping is called a hash function

Hash Functions
Next problem: collision
[Figure: a hash function h maps keys k1..k5 from the actual key set K, a subset of the universe of keys U, into slots 0..m-1 of table T; k2 and k5 collide because h(k2) = h(k5)]

Resolving Collisions
How can we solve the problem of collisions?
Solution 1: chaining
Solution 2: open addressing

Open Addressing
Basic idea (details in Section 12.4):
To insert: if slot is full, try another slot, ..., until an open slot is found (probing)
To search, follow same sequence of probes as would be used when inserting the element
If reach element with correct key, return it
If reach a NULL pointer, element is not in table

Good for fixed sets (adding but no deletion)

Example: spell checking

Table needn't be much bigger than n
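
The insert and search rules above can be sketched as follows. The slides do not fix a probe sequence, so this example assumes linear probing (try the next slot, wrapping around) and uses `key % m` as the starting slot; both choices, like the class name, are illustrative.

```python
class OpenAddressTable:
    """Open addressing with linear probing (an assumed probe sequence)."""

    def __init__(self, m):
        self.m = m
        self.slots = [None] * m

    def _probes(self, key):
        # Probe sequence: start at key mod m, then step one slot at a time.
        start = key % self.m
        for i in range(self.m):
            yield (start + i) % self.m

    def insert(self, key, data):
        for j in self._probes(key):
            if self.slots[j] is None or self.slots[j][0] == key:
                self.slots[j] = (key, data)
                return j
        raise OverflowError("table is full")

    def search(self, key):
        # Follow the same probes an insert of this key would have used.
        for j in self._probes(key):
            if self.slots[j] is None:
                return None              # reached an empty slot: not in table
            if self.slots[j][0] == key:
                return self.slots[j][1]  # reached the element with this key
        return None

words = OpenAddressTable(7)
words.insert(10, "ten")        # 10 mod 7 = 3, lands in slot 3
words.insert(17, "seventeen")  # 17 mod 7 = 3 is taken, so it probes on to slot 4
```

There is no delete method here, which matches the "adding but no deletion" use case: simply emptying a slot would cut the probe sequence short for keys stored after it.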

Chaining
Chaining puts elements that hash to the same slot in a linked list:
[Figure: keys k1, k3, k4, k7, k8 from the actual key set K, within the universe of keys U]

Chaining
How do we insert an element?

Chaining
How do we delete an element?
Do we need a doubly-linked list for efficient delete?

Chaining
How do we search for an element with a given key?
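
One possible answer to the three questions above, as a hedged sketch. It uses `collections.deque` as a stand-in for the linked list in each slot and `key % m` as the hash function; both are assumptions made for illustration, as are the class and method names.

```python
from collections import deque

class ChainedHashTable:
    """Chaining: each slot holds a list of the elements that hash to it."""

    def __init__(self, m):
        self.m = m
        self.slots = [deque() for _ in range(m)]

    def _chain(self, key):
        return self.slots[key % self.m]

    def insert(self, key, data):
        # Insert at the head of the chain: O(1), assuming key is not present.
        self._chain(key).appendleft((key, data))

    def search(self, key):
        # Walk the chain for this slot: time proportional to its length.
        for k, data in self._chain(key):
            if k == key:
                return data
        return None

    def delete(self, key):
        # Deleting by key needs a search first. Given a pointer to the node
        # itself, a doubly-linked list would allow O(1) removal instead.
        chain = self._chain(key)
        for item in chain:
            if item[0] == key:
                chain.remove(item)
                return True
        return False

t = ChainedHashTable(4)
t.insert(1, "a")
t.insert(5, "b")   # 5 mod 4 = 1, so it shares a chain with key 1
```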

Analysis of Chaining
Assume simple uniform hashing: each key in table is equally likely to be hashed to any slot

Given n keys and m slots in the table, the load factor α = n/m = average # keys per slot
What will be the average cost of an unsuccessful search for a key? A: O(1+α)
What will be the average cost of a successful search?


Analysis of Chaining Continued

So the cost of searching = O(1 + α)
If the number of keys n is proportional to the number of slots in the table, what is α?
A: α = O(1)

In other words, we can make the expected cost of searching constant if we make α constant
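
A small simulation can make the 1 + α figure concrete. This is a sketch under the simple uniform hashing assumption: each key is thrown into a uniformly random slot, and an unsuccessful search is charged one step for computing the slot plus one step per element in that chain. The function name and cost model are illustrative.

```python
import random

def average_unsuccessful_cost(n, m, seed=0):
    """Average cost of an unsuccessful search over all m slots."""
    rng = random.Random(seed)
    chain_lengths = [0] * m
    for _ in range(n):
        # Simple uniform hashing: every slot is equally likely.
        chain_lengths[rng.randrange(m)] += 1
    # An unsuccessful search hashes to a slot, then walks that whole chain.
    return sum(1 + length for length in chain_lengths) / m

n, m = 2000, 1000
alpha = n / m
cost = average_unsuccessful_cost(n, m)
```

Because the chain lengths always sum to n, the average over slots comes out to 1 + n/m no matter how the keys happen to fall. Individual chains can still be much longer than α, which is why the bound is an average and not a worst case.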

Choosing A Hash Function

Clearly choosing the hash function well is crucial
What will a worst-case hash function do?
What will be the time to search in this case?

What are desirable features of the hash function?
Should distribute keys uniformly into slots
Should not depend on patterns in the data

Hash Functions: The Division Method
h(k) = k mod m
In words: hash k into a table with m slots using the slot given by the remainder of k divided by m

What happens to elements with adjacent values of k?
What happens if m is a power of 2 (say 2^P)?
What if m is a power of 10?
Upshot: pick table size m = prime number not too close to a power of 2 (or 10)
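
To see why a power of 2 is a poor table size, here is a small illustration. The keys are made up for this example: they all end in the same eight zero bits, the kind of pattern that real data such as aligned addresses can show.

```python
def h_div(k, m):
    """Division method: slot = remainder of k divided by m."""
    return k % m

# Hypothetical keys that differ only in their high bits (low 8 bits are zero).
keys = [0x123400, 0xABCD00, 0x0F0F00, 0x777700]

slots_pow2 = {h_div(k, 256) for k in keys}   # m = 2^8 keeps only the low 8 bits
slots_prime = {h_div(k, 251) for k in keys}  # m = 251, a prime near 256
```

With m = 256 every key lands in slot 0, because k mod 2^P simply keeps the low P bits of k. With the prime 251 the high bits influence the result and the four keys spread over four different slots.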

Hash Functions: The Multiplication Method
For a constant A, 0 < A < 1:
h(k) = ⌊m (kA - ⌊kA⌋)⌋

The term kA - ⌊kA⌋ is the fractional part of kA
Choose m = 2^P
Choose A not too close to 0 or 1
Knuth: Good choice for A = (√5 - 1)/2
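
A direct transcription of the formula above, as a sketch. It uses floating-point arithmetic for readability, which is an assumption of this example; for very large keys the fractional part loses precision, and practical implementations typically use fixed-width integer arithmetic instead.

```python
import math

A = (math.sqrt(5) - 1) / 2   # Knuth's suggested constant, about 0.618

def h_mul(k, m, a=A):
    """Multiplication method: floor(m * fractional part of k*a)."""
    frac = (k * a) % 1.0      # kA - floor(kA), the fractional part of kA
    return math.floor(m * frac)

m = 2 ** 10
slots = [h_mul(k, m) for k in range(1, 6)]
```

Adjacent keys 1 through 5 end up in widely separated slots, in contrast to the division method, where adjacent keys map to adjacent slots.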

Hash Functions: Worst Case Scenario
Scenario:
You are given an assignment to implement hashing
You will self-grade in pairs, testing and grading your partner's implementation
In a blatant violation of the honor code, your partner:
Analyzes your hash function
Picks a sequence of worst-case keys, causing your implementation to take O(n) time to search

What's an honest CS student to do?

Hash Functions: Universal Hashing
As before, when attempting to foil a malicious adversary: randomize the algorithm

Universal hashing: pick a hash function randomly in a way that is independent of the keys that are actually going to be stored
Guarantees good performance on average, no matter what keys adversary chooses
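
The slides do not name a specific family of hash functions at this point. As one commonly used example, the sketch below assumes the family h(k) = ((a·k + b) mod p) mod m, with p a prime larger than any key and a, b drawn at random when the table is created. The prime `P`, the function name, and the adversarial key set are all illustrative choices.

```python
import random

P = 2**61 - 1   # a Mersenne prime; keys are assumed to be smaller than P

def make_universal_hash(m, rng=random):
    """Pick h(k) = ((a*k + b) mod P) mod m with random a and b."""
    a = rng.randrange(1, P)   # a is nonzero
    b = rng.randrange(0, P)
    def h(k):
        return ((a * k + b) % P) % m
    return h

m = 101
# Keys chosen by an adversary who assumes the table uses h(k) = k mod 101:
adversarial = [m * i for i in range(50)]
fixed_slots = {k % m for k in adversarial}   # every key lands in slot 0

h = make_universal_hash(m, random.Random(1))
random_slots = {h(k) for k in adversarial}
```

Since a and b are chosen after the adversary has seen the code but before any keys are stored, studying the source no longer reveals which keys collide. The guarantee is about the average over the random choice of function, not about any one fixed function.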

The End
