0% found this document useful (0 votes)

14 views

Lecture 6 - Searching

Searching

Uploaded by

clintsimiyu004

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views

Lecture 6 - Searching

Searching

Uploaded by

clintsimiyu004

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 41

Searching: Hash tables and Binary

search trees

Juliet Moso
Department of Computer Science
SOME TERMINOLOGY
Ancestor of a node: any node on the path from the root to
that node
Descendant of a node: any node on a path from the node to
the last node in the path
Level (depth) of a node: number of edges in the path from the
root to that node
Height of a tree: number of levels
BINARY SEARCH TREES (BST)
Binary Search Tree Property:
The value stored at a node is greater than the value stored at its left child
and less than the value stored at its right child
Thus, the value stored at the root of a subtree is greater than any value in
its left subtree and less than any value in its right subtree!!
SEARCHING A BST
(1) Start at the root
(2) Compare the value of the item you are searching
for with the value stored at the root
(3) If the values are equal, then item found;
otherwise, if it is a leaf node, then not found
(4) If it is less than the value stored at the root, then
search the left subtree
(5) If it is greater than the value stored at the root,
then search the right subtree
(6) Repeat steps 2-6 for the root of the subtree
chosen in the previous step 4 or 5
NUMBER OF NODES
Recursive implementation
#nodes in a tree = #nodes in left subtree + #nodes in right
subtree + 1
What is the size factor?
Number of nodes in the tree we are examining
What is the base case?
The tree is empty
What is the general case?
CountNodes(Left(tree)) + CountNodes(Right(tree)) + 1
NUMBER OF NODES
Let’s consider the first few steps:
BST OPERATIONS: RETRIEVE ITEM
What is the size of the problem?
Number of nodes in the tree we are examining

What is the base case(s)?

1. When the key is found
2. The tree is empty (key was not found)

What is the general case?

Search in the left or right subtrees
BST OPERATIONS: RETRIEVE ITEM
BST OPERATIONS: INSERT ITEM
What is the size of the problem?
Number of nodes in the tree we are examining
What is the base case(s)?
The tree is empty
What is the general case?
Choose the left or right subtree

• Use the binary search tree property to insert the new item
at the correct place
BST OPERATIONS: INSERT ITEM
BST OPERATIONS: INSERT ITEM
Insert 11
DOES THE ORDER OF INSERTING
ELEMENTS INTO A TREE MATTER?
Yes, certain orders produce very unbalanced trees!!
Unbalanced trees are not desirable because search
time increases!!
There are advanced tree structures (e.g.,"red-black
trees") which guarantee balanced trees
Does the
order of
inserting
elements
into a tree
matter?
BST OPERATIONS: DELETE ITEM
What is the size of the problem?
Number of nodes in the tree we are examining
What is the base case(s)?
Key to be deleted was found
What is the general case?
Choose the left or right subtree

First, find the item; then, delete it

Important: binary search tree property must be preserved!!
We need to consider three different cases:
(1) Deleting a leaf
(2) Deleting a node with only one child
(3) Deleting a node with two children
DELETING A LEAF
DELETING A NODE WITH ONLY ONE
CHILD
DELETING A NODE WITH TWO
CHILDREN

Find predecessor (it is the rightmost node in the left subtree)

Replace the data of the node to be deleted with predecessor's data
Delete predecessor node
TREE TRAVERSALS
There are mainly three ways to traverse a tree:
1) Inorder Traversal
2) Postorder Traversal
3) Preorder Traversal
TreeWalk(x)
TreeWalk(left[x]);
print(x);
TreeWalk(right[x]);
• Prints elements in sorted (increasing) order
• This is called an Inorder Traversal: print left, then root, then right
• Preorder Traversal: print root, then left, then right
• Postorder Traversal: print left, then right, then root
INORDER TRAVERSAL: A E H J M T Y

Visit second
tree

‘J’

‘E’ ‘T’

‘A’ ‘H’ ‘M’ ‘Y’

Visit left subtree first Visit right subtree last

POSTORDER TRAVERSAL: A H E M Y TJ

Visit last
tree

‘J’

‘E’ ‘T’

‘A’ ‘H’ ‘M’ ‘Y’

Visit left subtree first Visit right subtree second

PREORDER TRAVERSAL: J E A H T M Y
Visit first

tree

‘J’

‘E’ ‘T’

‘A’ ‘H’ ‘M’ ‘Y’

Visit left subtree second Visit right subtree last

WHAT IS A HASH TABLE ?
Hash tables are an array-based method for
implementing a Dictionary
The simplest kind of hash table is an array of
records.
This example has 701 records.

[0] [1] [2] [3] [4] [5] [ 700]

...

An array of records
WHAT IS A HASH TABLE ? [4]

Each record has a special field,

called its key.
In this example, the key is a long Number 506643548
integer field called Number
The number might be a person's
identification number, and the
rest of the record has
information about the person. [5]

[0] [1] [2] [3] [4] [ 700]

...
WHAT IS A HASH TABLE ?
When a hash table is in use, some spots contain
valid records, and other spots are "empty".
The empty spots are identified by a special key.
For example, if all our identification numbers are
positive, then we could use 0 as the Number that
indicates an empty spot.

[0] [1] [2] [3] [4] [5] [ 700]

Number 281942902 Number 233667136 Number 506643548 Number 155778322

...
INSERTING A NEW RECORD
In order to insert a new record, the Number 580625685
key must somehow be converted
to an array index between 0 and
700.
The conversion process is called
hashing
The index is called the hash value
of the key.
[0] [1] [2] [3] [4] [5] [ 700]
Number 281942902 Number 233667136 Number 506643548 Number 155778322

...
INSERTING A NEW RECORD
Typical way to create a hash value: Number 580625685
Take the key mod 701 (which could
be anywhere from 0 to 700).

(Number mod 701)

3
What is (580625685 mod 701) ?

[0] [1] [2] [3] [4] [5] [ 700]

Number 281942902 Number 233667136 Number 506643548 Number 155778322

...
INSERTING A NEW RECORD
The hash value is used for the Number 580625685
location of the new record.

So, this new item will be

placed at location [3] of the
array.
[3]

[0] [1] [2] [3] [4] [5] [ 700]

Number 281942902 Number 233667136 Number 506643548 Number 155778322

...
COLLISIONS
Sometimes, two different records Number 701466868
might end up with the same hash
value.
Here is another new record to
insert, with a hash value of 2.
My hash
value is [2].

[0] [1] [2] [3] [4] [5] [ 700]

Number 281942902 Number 233667136 Number 580625685 Number 506643548 Number 155778322

...
COLLISION RESOLUTION
If, when an element is inserted, it hashes to the
same value as an already inserted element, then we
have a collision and need to resolve it.

There are several methods for dealing with this:

• Separate chaining
• Open addressing
• Linear Probing
• Quadratic Probing
• Double Hashing
SEPARATE CHAINING
The idea is to keep a list of all elements that hash
to the same value.
• The array elements are pointers to the first nodes of the
lists.
• A new item is inserted to the front of the list.
Advantages:
• Better space utilization for large items.
• Simple collision handling: searching linked list.
• Overflow: we can store more items than the hash table
size.
• Deletion is quick and easy: deletion from the linked list.
SEPARATE CHAINING: EXAMPLE
Keys: 0, 1, 4, 9, 16, 25, 36, 49, 64, 81
hash(key) = key mod 10.
0 0

1 81 1
2

4
64 4
5
25
6
36 16

9
49 9
OPEN ADDRESSING
Separate chaining has the disadvantage of using linked
lists.
• Requires the implementation of a second data structure.

In an open addressing hashing system, all the data go

inside the table.
• Thus, a bigger table is needed.
• If a collision occurs, alternative cells are tried until an
empty cell is found.
There are three common collision resolution strategies:
• Linear Probing
• Quadratic probing
• Double hashing
OPEN ADDRESSING
This is called a collision, because Number 701466868
there is already another valid
record at [2].

When a collision
occurs,
move forward until you
find an empty spot.

[0] [1] [2] [3] [4] [5] [ 700]

Number 281942902 Number 233667136 Number 580625685 Number 506643548 Number 155778322

...
OPEN ADDRESSING
The new record is always placed in the first available
empty spot, after the hash value.

The new record goes

in the empty spot.

[0] [1] [2] [3] [4] [5] [ 700]

Number 281942902 Number 233667136 Number 580625685 Number 506643548 Number 701466868 Number 155778322

...
SEARCHING FOR A KEY
The data that's attached to a key Number 701466868
can be found fairly quickly.
Start by computing the hash value,
which is 2 in this case.
Then check location 2.
If location 2 has a different key than My hash
value is [2].
the one you are looking for, then
move forward
Not me.
[0] [1] [2] [3] [4] [5] [ 700]
Number 281942902 Number 233667136 Number 580625685 Number 506643548 Number 701466868
Number 155778322

...
SEARCHING FOR A KEY
Number 701466868

Keep moving forward until you

find the key, or you reach an
empty spot.
My hash
value is [2].
Not me.
[0] [1] [2] [3] [4] [5] [ 700]
Number 281942902 Number 233667136 Number 580625685 Number 506643548 Number 701466868 Number 155778322

...
SEARCHING FOR A KEY
Number 701466868

Keep moving forward until you

find the key, or you reach an
empty spot.
My hash
value is [2].
Not me.

[0] [1] [2] [3] [4] [5] [ 700]

Number 281942902 Number 233667136 Number 580625685 Number 506643548 Number 701466868 Number 155778322

...
SEARCHING FOR A KEY
Number 701466868
Keep moving forward until you
find the key, or you reach an
empty spot.

My hash
value is [2].
Yes!

[0] [1] [2] [3] [4] [5] [ 700]

Number 281942902 Number 233667136 Number 580625685 Number 506643548 Number 701466868 Number 155778322

...
SEARCHING FOR A KEY
When the item is found, the Number 701466868
information can be copied to the
necessary location.
What happens if a search reaches
an empty spot?
It can halt and indicate that
the key was not in the hash My hash
table. Yes! value is [2].

[0] [1] [2] [3] [4] [5] [ 700]

Number 281942902 Number 233667136 Number 580625685 Number 506643548 Number 701466868 Number 155778322

...
DELETING A RECORD
Records may also be deleted from a hash table.
But the location must not be left as an ordinary
"empty spot" since that could interfere with searches.
(Remember that a search can stop when it reaches
an empty spot.)

Please
delete me.
[0] [1] [2] [3] [4] [5] [ 700]
Number 281942902 Number 233667136 Number 580625685 Number 506643548 Number 701466868 Number 155778322

...
DELETING A RECORD
The location must be marked in some special way so
that a search can tell that the spot used to have
something in it.
In any case, a search can not stop when it reaches "a
location that used to have something here".
A search can only stop when it reaches a true empty
spot.

[0] [1] [2] [3] [4] [5] [ 700]

Number 281942902 Number 233667136 Number 580625685 Number 701466868 Number 155778322

...

CS214 DS2022 Lec 14 - Hashing
No ratings yet
CS214 DS2022 Lec 14 - Hashing
31 pages
Hash 2
No ratings yet
Hash 2
38 pages
Hash Tables
No ratings yet
Hash Tables
37 pages
Module V Unit 2 Hashing
No ratings yet
Module V Unit 2 Hashing
41 pages
Lecture 5 - Hash Table and BST
No ratings yet
Lecture 5 - Hash Table and BST
15 pages
Java 11
No ratings yet
Java 11
32 pages
DataStructures Cheatsheet Zero To Mastery V1.01
No ratings yet
DataStructures Cheatsheet Zero To Mastery V1.01
39 pages
AVL Tree Deletion
100% (1)
AVL Tree Deletion
42 pages
Module-6 Searching Techniques
No ratings yet
Module-6 Searching Techniques
44 pages
Search and Sort Algorithm
No ratings yet
Search and Sort Algorithm
37 pages
Algorithms (OBF) Dummies - SPARK
No ratings yet
Algorithms (OBF) Dummies - SPARK
29 pages
DSA Chapter 08 (Searching)
No ratings yet
DSA Chapter 08 (Searching)
65 pages
Hashing ClassNotes
No ratings yet
Hashing ClassNotes
8 pages
Hashing PDF
No ratings yet
Hashing PDF
65 pages
Ds Impp
No ratings yet
Ds Impp
22 pages
Lect Hashing
No ratings yet
Lect Hashing
36 pages
06 Hashing
No ratings yet
06 Hashing
6 pages
05 Hashing
No ratings yet
05 Hashing
47 pages
Lecture 5 - Hash Table and BST
No ratings yet
Lecture 5 - Hash Table and BST
14 pages
Hashing in Data Structure
No ratings yet
Hashing in Data Structure
43 pages
Hashing RPK
No ratings yet
Hashing RPK
61 pages
Week 3
No ratings yet
Week 3
29 pages
DSA MK Lect2 PDF
No ratings yet
DSA MK Lect2 PDF
92 pages
Day3.2 DS2 HashTablesHeaps
No ratings yet
Day3.2 DS2 HashTablesHeaps
61 pages
Hash Tables
No ratings yet
Hash Tables
35 pages
Theory PDF
No ratings yet
Theory PDF
18 pages
Searching 2
No ratings yet
Searching 2
64 pages
Search vs. Hashing
No ratings yet
Search vs. Hashing
55 pages
CH 4
No ratings yet
CH 4
58 pages
210 Maps PDF
No ratings yet
210 Maps PDF
39 pages
Hashing
No ratings yet
Hashing
16 pages
ADS Unit 3
No ratings yet
ADS Unit 3
14 pages
Indexing
No ratings yet
Indexing
77 pages
Topic 1: Hashing - Introduction: Hashing Is A Method of Storing and Retrieving Data From A Database Efficiently
No ratings yet
Topic 1: Hashing - Introduction: Hashing Is A Method of Storing and Retrieving Data From A Database Efficiently
31 pages
Hashing
No ratings yet
Hashing
20 pages
Hashing
No ratings yet
Hashing
23 pages
Hashing
No ratings yet
Hashing
44 pages
Hash Table Data Structure
No ratings yet
Hash Table Data Structure
34 pages
Design and Analysis of Algorithm
No ratings yet
Design and Analysis of Algorithm
182 pages
Chapter 1-3 algorithm analysis
No ratings yet
Chapter 1-3 algorithm analysis
181 pages
B Tree, B Plus and Graph
No ratings yet
B Tree, B Plus and Graph
38 pages
3 Hashing
No ratings yet
3 Hashing
20 pages
Lec12-Hash-Tables-09092024-090609pm (1)
No ratings yet
Lec12-Hash-Tables-09092024-090609pm (1)
48 pages
36 BST Remove Hashing
No ratings yet
36 BST Remove Hashing
7 pages
Lecture 09 - Searching (Updated)
No ratings yet
Lecture 09 - Searching (Updated)
68 pages
19hashing
No ratings yet
19hashing
44 pages
Tutorial 10 Indexing
No ratings yet
Tutorial 10 Indexing
36 pages
Indexing and Hashing: Solutions To Practice Exercises
No ratings yet
Indexing and Hashing: Solutions To Practice Exercises
11 pages
Assignment (DS)
No ratings yet
Assignment (DS)
8 pages
L-2005-08-Advance Data Structure Part 1-HS
No ratings yet
L-2005-08-Advance Data Structure Part 1-HS
46 pages
Hashing PPT For Student
No ratings yet
Hashing PPT For Student
53 pages
Hashing
No ratings yet
Hashing
42 pages
22csc22 Cat-3.1 - Answer Key
No ratings yet
22csc22 Cat-3.1 - Answer Key
22 pages
Hashing Refers To The Process of Generating A Fixed-Size Output From An Input of Variable Size
No ratings yet
Hashing Refers To The Process of Generating A Fixed-Size Output From An Input of Variable Size
10 pages
Hashing and Graphs
No ratings yet
Hashing and Graphs
28 pages
Lecture 27 - Hashing
No ratings yet
Lecture 27 - Hashing
48 pages
chap-1 ADS
No ratings yet
chap-1 ADS
5 pages
Hashing
No ratings yet
Hashing
34 pages
Fast mental calculation tricks
From Everand
Fast mental calculation tricks
EasyMath
No ratings yet
Hashing
From Everand
Hashing
Prakash Hegade
No ratings yet
Heaps, Heap Sort, and Priority Queues
No ratings yet
Heaps, Heap Sort, and Priority Queues
35 pages
Package Rpart': R Topics Documented
No ratings yet
Package Rpart': R Topics Documented
34 pages
Interview Algorithemdocx
No ratings yet
Interview Algorithemdocx
34 pages
Delta Xmcqs
No ratings yet
Delta Xmcqs
5 pages
Minimax Root Value by A Given Amount. Additionally, For The Critical Position in A Game, We Can
No ratings yet
Minimax Root Value by A Given Amount. Additionally, For The Critical Position in A Game, We Can
1 page
DS, C, C++, Aptitude, Unix, RDBMS, SQL, CN, Os
No ratings yet
DS, C, C++, Aptitude, Unix, RDBMS, SQL, CN, Os
219 pages
Cse Syllabus R 2009
No ratings yet
Cse Syllabus R 2009
87 pages
Buy ebook Data Structures and Applications: A Simple and Systematic Approach Padma Reddy cheap price
100% (2)
Buy ebook Data Structures and Applications: A Simple and Systematic Approach Padma Reddy cheap price
41 pages
Exercises B+Tree
100% (1)
Exercises B+Tree
9 pages
IOI Training Week 7 Advanced Data Structures: 1.1 Square-Root (SQRT) Decomposition
No ratings yet
IOI Training Week 7 Advanced Data Structures: 1.1 Square-Root (SQRT) Decomposition
6 pages
A Framework For The Automated Drawing: Structure Diagrams Data
No ratings yet
A Framework For The Automated Drawing: Structure Diagrams Data
15 pages
Data Structures - QuestionBank
No ratings yet
Data Structures - QuestionBank
4 pages
Mining Frequent Patterns Without Candidate Generation
No ratings yet
Mining Frequent Patterns Without Candidate Generation
12 pages
Simple Programming Problems
No ratings yet
Simple Programming Problems
6 pages
BSC CS III Semester
No ratings yet
BSC CS III Semester
2 pages
Dbms Query Evaluation
No ratings yet
Dbms Query Evaluation
28 pages
Computer Science and Information Technology scqp09
0% (1)
Computer Science and Information Technology scqp09
3 pages
30 Most Asked Coding Questions
No ratings yet
30 Most Asked Coding Questions
19 pages
Quantitative Analysis For Management Ch03
No ratings yet
Quantitative Analysis For Management Ch03
77 pages
85 COMPUTER SCIENCE TECHNOLOGY 4th
No ratings yet
85 COMPUTER SCIENCE TECHNOLOGY 4th
43 pages
C Questions and Answer
No ratings yet
C Questions and Answer
215 pages
Solutions For HW10-CS 6033 Fall 2023
No ratings yet
Solutions For HW10-CS 6033 Fall 2023
10 pages
Developing SCRABBLE Game
No ratings yet
Developing SCRABBLE Game
73 pages
University of Chakwal: Department of Computer Science & Information Technology
No ratings yet
University of Chakwal: Department of Computer Science & Information Technology
3 pages
Cs33 - Data Structures Questions and Answers
88% (43)
Cs33 - Data Structures Questions and Answers
26 pages
Unit-4 Tree Tt Bt Bst
No ratings yet
Unit-4 Tree Tt Bt Bst
63 pages
Decision Tree
No ratings yet
Decision Tree
2 pages
Introduction To Star-Ccm+: Features
No ratings yet
Introduction To Star-Ccm+: Features
29 pages
Course Plan (2)
No ratings yet
Course Plan (2)
3 pages
ANU MCA Syllabus
No ratings yet
ANU MCA Syllabus
148 pages

Lecture 6 - Searching

Uploaded by

Lecture 6 - Searching

Uploaded by

Searching: Hash tables and Binary

What is the base case(s)?

What is the general case?

First, find the item; then, delete it

Find predecessor (it is the rightmost node in the left subtree)

‘A’ ‘H’ ‘M’ ‘Y’

Visit left subtree first Visit right subtree last

‘A’ ‘H’ ‘M’ ‘Y’

Visit left subtree first Visit right subtree second

‘A’ ‘H’ ‘M’ ‘Y’

Visit left subtree second Visit right subtree last

[0] [1] [2] [3] [4] [5] [ 700]

Each record has a special field,

[0] [1] [2] [3] [4] [ 700]

[0] [1] [2] [3] [4] [5] [ 700]

(Number mod 701)

[0] [1] [2] [3] [4] [5] [ 700]

So, this new item will be

[0] [1] [2] [3] [4] [5] [ 700]

[0] [1] [2] [3] [4] [5] [ 700]

There are several methods for dealing with this:

In an open addressing hashing system, all the data go

[0] [1] [2] [3] [4] [5] [ 700]

The new record goes

[0] [1] [2] [3] [4] [5] [ 700]

Keep moving forward until you

Keep moving forward until you

[0] [1] [2] [3] [4] [5] [ 700]

[0] [1] [2] [3] [4] [5] [ 700]

[0] [1] [2] [3] [4] [5] [ 700]

[0] [1] [2] [3] [4] [5] [ 700]

You might also like