CS 240 Tutorial 9 Notes
Dictionary/Associative array: An abstract data type which holds a collection of (key, value) pairs, where
each key appears at most once.
This allows one to store a value in the associative array using a key as an index. The value can later
be extracted if one knows the key it was stored under.
Typical operations are
insert(k, v): add value v, associated to key k
delete(k): remove item associated to key k
search(k): return value associated to key k (if one exists)
For simplicity, keys are usually assumed to be integers. (If not, we can always map the keys to integers
first.) Also, say the maximum key is M.
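The three ADT operations map directly onto Python's built-in dict (itself a hash table under the hood); a quick illustration:

```python
# The dictionary ADT operations, illustrated with Python's built-in dict.
d = {}
d[42] = "answer"        # insert(42, "answer")
print(d.get(42))        # search(42) -> answer
del d[42]               # delete(42)
print(d.get(42))        # search(42) -> None (key no longer present)
```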
Question: How would one go about implementing an associative array?
                                       Insert                        Search                      Delete
Unsorted array/linked list             O(1) (add to end)             O(n) (brute force)          O(n) (brute force)
Sorted array                           O(n) (shifting)               O(log n) (binary search)    O(n) (shifting)
Balanced search tree (e.g., AVL)       O(log n) (search + rotation)  O(log n) (max height)       O(log n)
Direct addressing                      O(1)                          O(1)                        O(1)
  (array A of size M, store v at A[k])
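The direct-addressing row can be sketched in a few lines (the class and variable names here are illustrative, not from the course code):

```python
# A minimal direct-addressing sketch: an array A of size M, storing value v at A[k].
# Every operation is a single array access, hence O(1).
class DirectAddressTable:
    def __init__(self, M):
        self.A = [None] * M          # one slot per possible key 0..M-1

    def insert(self, k, v):
        self.A[k] = v                # write directly at index k

    def search(self, k):
        return self.A[k]             # read directly; None if absent

    def delete(self, k):
        self.A[k] = None             # clear the slot

t = DirectAddressTable(10)
t.insert(3, "apple")
print(t.search(3))   # apple
```

The obvious drawback: the table uses Θ(M) space even when it holds only a handful of keys, which motivates hashing.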
As a running example, take a hash table A with |A| = 5 slots and hash function

    h1(w) = (sum_{i=1}^{k} ascii(w_i)) mod 5,

where w = w_1 w_2 ... w_k. A secondary hash function (used later for double hashing) is

    h2(w) = 1 + ((sum_{i=1}^{k} ascii(w_i) * 3^(k-i)) mod 4).

For the words to be inserted:

    h1(aardvark) = 844 mod 5 = 4
    h1(aback)    = 498 mod 5 = 3
    h1(abacus)   = 623 mod 5 = 3
    h1(abaft)    = 510 mod 5 = 0
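The ASCII sums can be reproduced with a short script (h1 here is an assumed name for the first hash function):

```python
# Sketch of the first hash function: h1(w) = (sum of ASCII codes of w) mod 5.
def h1(w):
    return sum(ord(c) for c in w) % 5

for w in ["aardvark", "aback", "abacus", "abaft"]:
    print(w, sum(ord(c) for c in w), h1(w))
# aardvark 844 4
# aback 498 3
# abacus 623 3
# abaft 510 0
```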
Using chaining:

insert aardvark: h1 = 4; aardvark goes to A[4].
insert aback:    h1 = 3; aback goes to A[3].
insert abacus:   h1 = 3; prepended to the chain, so A[3] holds abacus -> aback.
insert abaft:    h1 = 0; abaft goes to A[0].

Final table: A[0]: abaft, A[1]: (empty), A[2]: (empty), A[3]: abacus -> aback, A[4]: aardvark.
Note: Values are inserted at the start of the linked list. This keeps insertion at O(1) cost.
However, in the worst case all items hash to the same location, and search/delete cost O(n).
But if the hash function is chosen properly this behaviour is unlikely. Assuming each hash value is equally
likely to occur, search/delete cost O(1 + n/|A|) in the average case. If we take |A| ∈ Θ(n) then this is O(1).
This makes sense intuitively: if you want to store n items in A, you probably want |A| ∈ Θ(n) to avoid
excessive chaining.
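A minimal chaining table over the running example can be sketched as follows (class name assumed; Python lists stand in for the linked lists, with new items prepended so insertion stays O(1)):

```python
# Chaining hash table over |A| = 5 slots, using the ASCII-sum hash h1.
def h1(w):
    return sum(ord(c) for c in w) % 5

class ChainedHashTable:
    def __init__(self, size=5):
        self.A = [[] for _ in range(size)]   # one chain per slot

    def insert(self, key, value):
        self.A[h1(key)].insert(0, (key, value))   # prepend: O(1)

    def search(self, key):
        for k, v in self.A[h1(key)]:              # scan the chain: O(chain length)
            if k == key:
                return v
        return None

t = ChainedHashTable()
for w in ["aardvark", "aback", "abacus", "abaft"]:
    t.insert(w, len(w))
print([[k for k, _ in chain] for chain in t.A])
# [['abaft'], [], [], ['abacus', 'aback'], ['aardvark']]
```

Note that abacus ends up in front of aback in slot 3, matching the trace above.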
Using linear probing:

insert aardvark: h1 = 4; A[4] is free, so aardvark goes to A[4].
insert aback:    h1 = 3; A[3] is free, so aback goes to A[3].
insert abacus:   h1 = 3; A[3] and A[4] are taken, so probing wraps around and abacus goes to A[0].
insert abaft:    h1 = 0; A[0] is taken, so abaft goes to A[1].

Final table: A[0]: abacus, A[1]: abaft, A[2]: (empty), A[3]: aback, A[4]: aardvark.
Note: Now insert/delete/search are all O(n) in the worst case. When the hash table is mostly empty this
behaviour is unlikely, but as the table fills up (its load factor increases) it becomes more and more likely.
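The open-addressing scheme traced above is linear probing: on a collision, step to the next slot (wrapping around) until a free one is found. A sketch of insertion:

```python
# Linear-probing insertion into a fixed-size open-addressing table.
def h1(w):
    return sum(ord(c) for c in w) % 5

def lp_insert(A, key):
    i = h1(key)
    while A[i] is not None:      # slot taken: step to the next index, wrapping
        i = (i + 1) % len(A)
    A[i] = key

A = [None] * 5
for w in ["aardvark", "aback", "abacus", "abaft"]:
    lp_insert(A, w)
print(A)   # ['abacus', 'abaft', None, 'aback', 'aardvark']
```

(This sketch assumes the table never completely fills; a real implementation would track the load factor and resize.)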
Using double hashing:
Note that

h2(aardvark) = 1 + ((97*3^7 + 97*3^6 + 114*3^5 + 100*3^4 + 118*3^3 + 97*3^2 + 114*3 + 107) mod 4)
             = 1 + (323162 mod 4) = 1 + 2 = 3
h2(aback)    = 1 + ((97*3^4 + 98*3^3 + 97*3^2 + 99*3 + 107) mod 4)
             = 1 + (11780 mod 4) = 1 + 0 = 1
h2(abacus)   = 1 + ((97*3^5 + 98*3^4 + 97*3^3 + 99*3^2 + 117*3 + 115) mod 4)
             = 1 + (35485 mod 4) = 1 + 1 = 2
h2(abaft)    = 1 + ((97*3^4 + 98*3^3 + 97*3^2 + 102*3 + 116) mod 4)
             = 1 + (11798 mod 4) = 1 + 2 = 3
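These h2 values can be checked with a short script (function name assumed):

```python
# Secondary hash: h2(w) = 1 + ((sum of ascii(w_i) * 3^(k-i)) mod 4), for i = 1..k.
# The "+1" guarantees a nonzero jump amount, here in {1, 2, 3, 4}.
def h2(w):
    k = len(w)
    total = sum(ord(c) * 3 ** (k - 1 - idx) for idx, c in enumerate(w))
    return 1 + total % 4

for w in ["aardvark", "aback", "abacus", "abaft"]:
    print(w, h2(w))
# aardvark 3
# aback 1
# abacus 2
# abaft 3
```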
insert aardvark: h1 = 4, jump amount 3; A[4] is free, so aardvark goes to A[4].
insert aback:    h1 = 3, jump amount 1; A[3] is free, so aback goes to A[3].
insert abacus:   h1 = 3, jump amount 2; A[3] is taken, try (3 + 2) mod 5 = 0: free, so abacus goes to A[0].
insert abaft:    h1 = 0, jump amount 3; A[0] is taken, try (0 + 3) mod 5 = 3: taken, try (3 + 3) mod 5 = 1: free, so abaft goes to A[1].

Final table: A[0]: abacus, A[1]: abaft, A[2]: (empty), A[3]: aback, A[4]: aardvark.
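The double-hashing insertions can be sketched with the two hash functions defined earlier (helper names assumed):

```python
# Double-hashing insertion: on a collision at h1(key), repeatedly jump by
# h2(key) slots (mod |A|) until a free slot is found.
def h1(w):
    return sum(ord(c) for c in w) % 5

def h2(w):
    k = len(w)
    return 1 + sum(ord(c) * 3 ** (k - 1 - i) for i, c in enumerate(w)) % 4

def dh_insert(A, key):
    i, jump = h1(key), h2(key)
    while A[i] is not None:
        i = (i + jump) % len(A)   # each word probes with its own jump amount
    A[i] = key

A = [None] * 5
for w in ["aardvark", "aback", "abacus", "abaft"]:
    dh_insert(A, w)
print(A)   # ['abacus', 'abaft', None, 'aback', 'aardvark']
```

On this small example the final table happens to match linear probing, but the probe sequences differ (abaft jumps 0 -> 3 -> 1 rather than stepping 0 -> 1).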
Note: When the jump amount is 1, double hashing is identical to linear probing. Also, the jump amount
should never be 0, or no alternate positions will ever be checked. In general, if the jump amount evenly
divides the array size, not all alternate positions will be checked. (Making the array size prime and the jump
amount smaller than |A| guards against this.)
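The divisibility caveat is easy to demonstrate: with a table size of 6 and a jump amount of 2, probing from slot 0 only ever visits {0, 2, 4}, so half the table is unreachable; with a prime size of 5 the same jump visits every slot. A sketch (helper name assumed):

```python
# Follow a probe sequence until it revisits a slot, collecting the slots seen.
def probe_sequence(size, start, jump):
    seen, i = [], start
    while i not in seen:
        seen.append(i)
        i = (i + jump) % size
    return seen

print(probe_sequence(6, 0, 2))   # [0, 2, 4] -- only 3 of 6 slots reachable
print(probe_sequence(5, 0, 2))   # [0, 2, 4, 1, 3] -- all 5 slots reachable
```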