Hash Table
This section concerns the Hash Table data structure and the design of its hash functions. The
hash table is an improved version of the Direct-Access Table, offering better operational
efficiency.
Direct-Access Table
Formal Def:
Suppose m distinct keys are taken from the universe 𝒰 ⊆ {0, 1, ..., n}; set up an array Τ[0, 1, ..., n]
in which the record with key k is stored in slot Τ[k]. Every position in array Τ is called a slot.
Such a table offers Θ(1) access time, but if the range of keys is too large, e.g. 64-bit numbers
with about 1.8 × 10^19 distinct possible keys, the direct-access table becomes prohibitive in
memory usage. It is a poor choice in the context of Internet-scale data. Thus, the Hash Table is
introduced.
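The idea can be sketched in a few lines of Python (the class name and interface below are my
own, for illustration only): every possible key gets its own slot, which is what makes both the
Θ(1) access time and the Θ(|𝒰|) space cost visible.

```python
# A minimal direct-access table sketch. Keys are small non-negative
# integers; the table is sized to the whole universe, so space is
# Θ(|U|) even when only a few keys are actually stored.

class DirectAccessTable:
    def __init__(self, universe_size):
        self.slots = [None] * universe_size  # one slot per possible key

    def insert(self, key, value):
        self.slots[key] = value              # Θ(1): direct indexing

    def lookup(self, key):
        return self.slots[key]               # Θ(1)

    def delete(self, key):
        self.slots[key] = None               # Θ(1)
```

With a universe of 64-bit integers, `universe_size` would have to be 2^64, which is exactly the
memory blow-up described above.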
Design Idea
Hash Table is an extended array structure, dynamically resizable on demand, that associates
keys with values. It is designed to combine the good qualities of the List and Array data
structures.
List requires only Θ(|𝒮|) space, but lookup takes Θ(|𝒮|) time as well.
Array requires only Ο(1) lookup time, but Θ(|𝒰|) space.
Hash Table, optimally, possesses both the small storage space of Ο(|𝒮|) and the swift entry
access of Ο(1).
In order to support the three distinct operations of a Hash Table: INSERTION of a new record,
DELETION of an existing record, and LOOKUP for a specified record, the Hash Table is built in
several steps:
1. decide n, the number of "buckets" for storing entries (n might be dynamically adjusted
later as the table grows or shrinks);
2. choose a hash function 𝒽: 𝒰 -> {0, 1, ..., n-1} that maps each key to a bucket;
3. store the record with key k in bucket Τ[𝒽(k)].
Note: if the record does not exist, INSERTION of a new entry will complete but DELETE will
fail; if the record already exists, INSERTION will override the existing value and DELETE will
succeed.
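These semantics can be observed directly with Python's built-in dict, which is itself a hash
table:

```python
d = {}
d["k"] = 1        # INSERTION of a new record completes
d["k"] = 2        # INSERTION again overrides the existing value
assert d["k"] == 2

del d["k"]        # DELETE succeeds because the record exists
try:
    del d["k"]    # DELETE of a missing record fails
except KeyError:
    pass          # Python signals the failure with KeyError
```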
What if the "bucket" location is already occupied when a new entry is inserted? In other words,
a hash collision happens. Generally, there are two approaches to resolving this situation:
(separate) chaining and open addressing.
Hash Collision
Note: hash collisions always happen; no "perfect" hash function can avoid collisions for all
inputs, since whenever the number of possible keys exceeds the number of buckets, the
pigeonhole principle forces some distinct keys to share a slot.
Load Factor
Define the load factor (α) as a hashing performance metric for hash table Τ:
α = n / m, where n is the total number of stored elements and m is the number of buckets in the
hash table. The value of α can be greater than 1, equal to 1, or less than 1; when α is far larger
than 1, the hash table is overloaded and needs to expand itself via the techniques introduced in
table doubling.
The average-case performance of hashing depends on how well the hash function 𝒽 distributes
the n keys among the m slots. Assume simple uniform hashing: the bucket each element hashes
into is independent of the other elements, and each of the m slots is equally likely to be chosen.
For ј = 0, 1, ..., m-1, denote the length of the list Τ[ј] by nј, so that n = n0 + n1 + ... + nm-1. The
expected value of nј is E[nј] = n/m = α.
The following sections build on this assumption.
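A quick empirical sketch of this expectation (the modular hash and the parameters below are
my own choices, for illustration): hash n random keys into m buckets and check that the average
bucket length equals α.

```python
# Empirical check of E[n_j] = n/m = α under (approximately) uniform
# hashing. The bucket index is taken as k mod m, an illustrative choice.
import random

m, n = 64, 1024
buckets = [0] * m
for _ in range(n):
    k = random.getrandbits(32)   # a random 32-bit key
    buckets[k % m] += 1

alpha = n / m
average = sum(buckets) / m
assert average == alpha          # total is n, so the mean is exactly n/m
```

Individual bucket lengths fluctuate around α, but their average is n/m by construction; simple
uniform hashing is what justifies using α as the *expected* length of any single bucket.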
(Separate) Chaining
In the chaining method, all entries with the same hash value are chained together in a linked list.
Specifically, each bucket of the hash table either holds a pointer to the head of the linked list
(singly or doubly linked) of all entries hashed to that bucket, or holds only NIL if the bucket is
empty.
LOOKUP costs Θ(1 + α) on average in the chaining version of a Hash Table. INSERTION takes
worst-case Ο(1) time (inserting at the head of the list), while DELETION might take up to Ο(n)
if all entries hash into one bucket, since the list must be traversed to find the entry; with a
doubly linked list and a pointer to the entry, DELETION itself speeds up to Ο(1).
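A minimal chaining table might look like the following sketch (Python lists stand in for the
linked lists, and the class name is my own):

```python
# A minimal (separate) chaining hash table. Each bucket is a list of
# (key, value) pairs; all colliding entries live in the same bucket.

class ChainedHashTable:
    def __init__(self, m=8):
        self.m = m
        self.buckets = [[] for _ in range(m)]

    def _index(self, key):
        return hash(key) % self.m          # which bucket the key chains into

    def insert(self, key, value):
        bucket = self.buckets[self._index(key)]
        for i, (k, _) in enumerate(bucket):
            if k == key:
                bucket[i] = (key, value)   # existing record: override value
                return
        bucket.append((key, value))        # new record: O(1) append

    def lookup(self, key):
        for k, v in self.buckets[self._index(key)]:
            if k == key:                   # expected Θ(1 + α) scan
                return v
        return None

    def delete(self, key):
        bucket = self.buckets[self._index(key)]
        for i, (k, _) in enumerate(bucket):
            if k == key:
                bucket.pop(i)
                return True
        return False                       # record did not exist: DELETE fails
```

The scan inside each bucket is exactly where the Θ(1 + α) lookup cost comes from: one hash
evaluation plus an expected α comparisons.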
Open Addressing
Although it is possible for the (separate) chaining method to have a number of buckets smaller
than the number of stored elements (α > 1), open addressing requires α ≤ 1. In other words, all
elements are stored directly in the buckets, one entry per bucket. While it is possible for such a
hash table to fill up all of its buckets, open addressing saves the memory otherwise occupied by
pointers; that memory can be spent on more buckets, reducing hash collisions and improving
search speed.
In order to INSERT an element into the hash table, open addressing requires successive bucket
examinations, each termed a probe, until an empty bucket is located.
Probe sequence: to make the INSERTION operation succeed, for each key k there is a length-m
probe sequence
⟨𝒽(k, 0), 𝒽(k, 1), ..., 𝒽(k, m-1)⟩,
a permutation of ⟨0, 1, ..., m-1⟩, wherein each probe position falls within the range of buckets.
There are three major probing techniques, which produce the probe sequence in different
manners.
Note: when performing the DELETE operation, the designated entry should be labeled "deleted"
(a tombstone) rather than simply emptied, to prevent later LOOKUP and DELETE failures. (e.g.
suppose k2 was inserted after probing past the occupied slot of k1; if k1's slot is reset to empty,
a later search for k2 stops at that empty slot without knowing k2 lies beyond it, so further
LOOKUP and DELETE operations on k2 will fail.)
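The tombstone trick can be sketched as follows (linear probing, defined in the next subsection,
is used as the probe strategy here; the names and the simplified insert logic are my own):

```python
# Open-addressing table with "deleted" tombstones.
DELETED = object()   # tombstone marker, distinct from an empty slot

class OpenAddressTable:
    def __init__(self, m=8):
        self.m = m
        self.slots = [None] * m   # None means "never used"

    def _probe(self, key):
        h = hash(key) % self.m
        for i in range(self.m):   # linear probe sequence (h + i) mod m
            yield (h + i) % self.m

    def insert(self, key, value):
        # Simplified: reuses the first empty/tombstone slot. A full
        # implementation would keep probing to avoid duplicates of a
        # key that sits beyond a tombstone.
        for j in self._probe(key):
            s = self.slots[j]
            if s is None or s is DELETED or s[0] == key:
                self.slots[j] = (key, value)
                return
        raise RuntimeError("hash table overflow")

    def lookup(self, key):
        for j in self._probe(key):
            s = self.slots[j]
            if s is None:                      # truly empty: key absent
                return None
            if s is not DELETED and s[0] == key:
                return s[1]
        return None

    def delete(self, key):
        for j in self._probe(key):
            s = self.slots[j]
            if s is None:
                return False
            if s is not DELETED and s[0] == key:
                self.slots[j] = DELETED        # tombstone, never None
                return True
        return False
```

Because `delete` leaves a tombstone instead of `None`, a later `lookup` keeps probing past it,
so keys inserted beyond the deleted slot remain reachable.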
Linear Probing
Given an auxiliary hash function 𝒽': 𝒰 -> {0, 1, ..., m-1}, linear probing uses:
𝒽(k, i) = (𝒽'(k) + i) mod m, for i = 0, 1, ..., m-1,
so that INSERTION first probes bucket Τ[𝒽'(k)], then Τ[𝒽'(k) + 1], ..., wrapping around up to
Τ[𝒽'(k) - 1]. To be noted, the initial probe position determines the entire sequence, so only m
distinct probe sequences exist.
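A short sketch of the linear probe sequence, assuming the illustrative auxiliary hash
𝒽'(k) = k mod m:

```python
# Linear probing: h(k, i) = (h'(k) + i) mod m, with h'(k) = k % m here.
def linear_probe_sequence(k, m):
    h = k % m                             # auxiliary hash h'(k)
    return [(h + i) % m for i in range(m)]

seq = linear_probe_sequence(7, 8)
assert seq == [7, 0, 1, 2, 3, 4, 5, 6]    # wraps around to h'(k) - 1
assert sorted(seq) == list(range(8))      # a permutation of all buckets
```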
Quadratic Probing
Under the same premise as linear probing, quadratic probing adopts a better strategy:
𝒽(k, i) = (𝒽'(k) + c1·i + c2·i²) mod m, for i = 0, 1, ..., m-1,
where c1 and c2 (c2 ≠ 0) are auxiliary constants. Instead of occupying large runs of adjacent
buckets as linear probing does, quadratic probing leads to a more dispersed distribution of
elements.
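One well-known workable parameter choice (an assumption here, not taken from the text) is
c1 = c2 = 1/2 with m a power of two, which still visits every bucket exactly once:

```python
# Quadratic probing with c1 = c2 = 1/2, i.e. offsets i*(i+1)/2
# (triangular numbers). This visits all buckets when m is a power of two.
def quadratic_probe_sequence(k, m):
    h = k % m                   # illustrative auxiliary hash h'(k)
    return [(h + (i * i + i) // 2) % m for i in range(m)]

seq = quadratic_probe_sequence(3, 8)
assert sorted(seq) == list(range(8))   # still a permutation of all buckets
```

Not every (c1, c2, m) combination covers the whole table, which is why such special cases are
used in practice.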
Double Hashing
This is known as the best strategy of the three open addressing methods; it uses:
𝒽(k, i) = (𝒽1(k) + i·𝒽2(k)) mod m, for i = 0, 1, ..., m-1,
wherein both 𝒽1 and 𝒽2 are auxiliary hash functions; in order for the probe sequence to cover
the whole hash table, the value of 𝒽2(k) has to be relatively prime to the hash table size m.
Double hashing yields Θ(m²) distinct probe sequences, one per (𝒽1(k), 𝒽2(k)) pair,
approximating ideal uniform hashing, under which each key is equally likely to receive any of
the m! probe-sequence permutations.
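A sketch of double hashing with a prime table size (the two auxiliary hash functions below are
common textbook-style choices, assumed here for illustration): choosing m prime makes every
𝒽2(k) in {1, ..., m-1} relatively prime to m, so each probe sequence is a full permutation.

```python
# Double hashing: h(k, i) = (h1(k) + i * h2(k)) mod m.
# With m prime, h2(k) in {1, ..., m-1} is always coprime to m,
# so the sequence visits every bucket exactly once.
def double_hash_sequence(k, m):
    h1 = k % m                   # first auxiliary hash
    h2 = 1 + (k % (m - 1))       # second auxiliary hash, in {1, ..., m-1}
    return [(h1 + i * h2) % m for i in range(m)]

m = 11                           # prime table size
seq = double_hash_sequence(123, m)
assert sorted(seq) == list(range(m))   # a permutation of all buckets
```

Unlike linear probing, both the starting slot and the step size depend on the key, which is what
multiplies the number of distinct probe sequences from m up to Θ(m²).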