Chapter 1 Part II

Query algorithms reduce relational operations to file scan operations on physical file structures. There are different access paths for each relational operation, and query engines have specialized algorithms for different operation and access path combinations. Examples of algorithms discussed include linear search, binary search, search using indexes, nested-loop join, index nested-loop join, sort-merge join, and hash join. Query optimization aims to find the most efficient evaluation plan to minimize query execution time.

Uploaded by

yordanosgetahun887

QUERY ALGORITHMS

Queries are ultimately reduced to a number of file
scan operations on the underlying physical file
structures.
For each relational operation, there can exist several
different access paths to the particular records
needed.
The query execution engine can have a multitude of
specialized algorithms designed to process particular
relational operation and access path combinations.
We will look at some examples of algorithms for both
the select and join operations.
Selection Algorithms
The select operation must search through the
data files for records that meet the selection
criteria. The following are some examples of
simple (one-attribute) selection algorithms:
Linear search
Every record in the file is read and compared
with the selection criteria. The execution cost for
searching on a non-key attribute is br, where br
is the number of blocks in the file representing
relation r. On a key attribute, the search can stop
at the first match, so the average cost is br / 2,
with a worst case of br.
Binary search on primary key
Search using a primary index on equality
Search using a primary index on comparison
Search using a secondary index on equality
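The linear search cost described above can be illustrated with a minimal sketch. It assumes a file is modeled as a list of blocks, each block being a list of records; the helper names (`make_file`, `linear_search`) and the sample attributes `id` and `dept` are hypothetical and not part of the original notes.

```python
def make_file(n_records, records_per_block):
    """Build a toy 'file': a list of blocks, each a list of records (assumed layout)."""
    records = [{"id": i, "dept": i % 3} for i in range(n_records)]
    return [records[i:i + records_per_block]
            for i in range(0, n_records, records_per_block)]


def linear_search(blocks, predicate, unique=False):
    """Scan the blocks in order and return (matching records, blocks read).

    With unique=True (search on a key attribute) the scan stops at the
    first match, which is why the average cost drops to about br / 2.
    """
    matches = []
    blocks_read = 0
    for block in blocks:
        blocks_read += 1
        for record in block:
            if predicate(record):
                matches.append(record)
                if unique:
                    return matches, blocks_read
    return matches, blocks_read


blocks = make_file(12, 3)  # br = 4 blocks

# Non-key attribute: every block must be read, so the cost is br.
dept_matches, dept_cost = linear_search(blocks, lambda rec: rec["dept"] == 1)

# Key attribute: the scan stops as soon as the record is found.
id_matches, id_cost = linear_search(blocks, lambda rec: rec["id"] == 4, unique=True)

print(dept_cost, id_cost)
```

In this toy file the non-key search reads all four blocks, while the key search stops in the second block.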
Join Algorithms
Like selection, the join operation can be
implemented in a variety of ways. In terms of
disk accesses, join operations can be very
expensive, so implementing and using
efficient join algorithms is critical to
minimizing a query’s execution time. The
following are four well-known types of join
algorithms:
Nested-Loop Join
This algorithm consists of an inner for loop nested
within an outer for loop. To illustrate this
algorithm, we will use the following notations:
r, s   Relations r and s
tr     Tuple (record) in relation r
ts     Tuple (record) in relation s
nr     Number of records in relation r
ns     Number of records in relation s
br     Number of blocks with records in relation r
bs     Number of blocks with records in relation s
Here is a sample pseudocode listing for
joining the two relations r and s using a
nested for loop:
for each tuple tr in r
    for each tuple ts in s
        if join condition is true for (tr, ts)
            add tr + ts to the result
Each record in the outer relation r is scanned once, and each
record in the inner relation s is scanned nr times, resulting
in nr * ns total record scans. If only one block of each
relation can fit into memory, then the cost (number of block
accesses) is nr * bs + br. If all blocks in both relations can fit
into memory, then the cost is br + bs. If all of the blocks in
relation s (the inner relation) can fit into memory, then the
cost is the same as when both relations fit in memory:
br + bs.
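To make the nr * bs + br figure concrete, here is a minimal sketch of the nested-loop join that counts block reads under the assumption that only one block of each relation fits in memory. The relations, the join attribute `dept_id`, and the function name are hypothetical illustrations.

```python
def nested_loop_join(r_blocks, s_blocks, condition):
    """Tuple-at-a-time nested-loop join; returns (result, block_reads).

    Models the case where only one block of each relation fits in memory:
    every block of s is re-read for each tuple tr of r.
    """
    result = []
    block_reads = 0
    for r_block in r_blocks:
        block_reads += 1                  # one read per block of r  -> br
        for tr in r_block:
            for s_block in s_blocks:
                block_reads += 1          # bs reads per tuple of r  -> nr * bs
                for ts in s_block:
                    if condition(tr, ts):
                        result.append({**tr, **ts})
    return result, block_reads


# Outer relation r: nr = 4 records in br = 2 blocks.
r_blocks = [
    [{"emp": 0, "dept_id": 0}, {"emp": 1, "dept_id": 1}],
    [{"emp": 2, "dept_id": 2}, {"emp": 3, "dept_id": 0}],
]
# Inner relation s: ns = 6 records in bs = 3 blocks.
s_blocks = [
    [{"dept_id": 0, "name": "d0"}, {"dept_id": 1, "name": "d1"}],
    [{"dept_id": 2, "name": "d2"}, {"dept_id": 3, "name": "d3"}],
    [{"dept_id": 4, "name": "d4"}, {"dept_id": 5, "name": "d5"}],
]

joined, reads = nested_loop_join(
    r_blocks, s_blocks, lambda tr, ts: tr["dept_id"] == ts["dept_id"]
)
print(len(joined), reads)  # 4 joined records, 4 * 3 + 2 = 14 block reads
```

If all of s were held in memory instead, each of its blocks would be read only once and the count would fall to br + bs = 5.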
Index Nested-Loop Join
This algorithm is the same as the nested-loop
join, except that an index on the inner
relation’s (s) join attribute is used instead of a
data-file scan on s. Each index lookup in
the inner loop is essentially an equality
selection on s using one of the selection
algorithms. Let c be the cost of one lookup;
then the worst-case cost for joining r and s is
br + nr * c.
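The br + nr * c cost can be seen in a small sketch in which a Python dictionary stands in for the index on s. This is an assumption made for illustration: a real index would be a disk-based structure such as a B+-tree, and c would depend on its height. The names `build_index` and `index_nested_loop_join` are hypothetical.

```python
from collections import defaultdict


def build_index(s_records, key):
    """Stand-in for an index on s's join attribute (an in-memory dict here)."""
    index = defaultdict(list)
    for ts in s_records:
        index[ts[key]].append(ts)
    return index


def index_nested_loop_join(r_blocks, s_index, key):
    """Returns (result, r_block_reads, index_lookups).

    r is scanned once (br block reads) and the index is probed once
    per tuple of r (nr lookups, each costing c in the formula).
    """
    result = []
    r_block_reads = 0
    index_lookups = 0
    for r_block in r_blocks:
        r_block_reads += 1
        for tr in r_block:
            index_lookups += 1
            for ts in s_index.get(tr[key], []):
                result.append({**tr, **ts})
    return result, r_block_reads, index_lookups


r_blocks = [
    [{"emp": 0, "dept_id": 0}, {"emp": 1, "dept_id": 1}],
    [{"emp": 2, "dept_id": 2}, {"emp": 3, "dept_id": 0}],
]
s_records = [{"dept_id": d, "name": f"d{d}"} for d in range(6)]

s_index = build_index(s_records, "dept_id")
joined, r_reads, lookups = index_nested_loop_join(r_blocks, s_index, "dept_id")
print(len(joined), r_reads, lookups)
```

Here br = 2 and nr = 4, so the total cost would be 2 + 4 * c for whatever lookup cost c the chosen index has.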
Sort-Merge Join
This algorithm can be used to perform natural
joins and equi-joins and requires that each
relation (r and s) be sorted on the common
attributes between them (R ∩ S).
Once both relations are sorted, each record in r
and s is scanned only once, giving a worst-case
and best-case cost of br + bs. If a relation is not
already sorted, the cost of sorting it must be added.
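The merge phase can be sketched as follows, assuming both inputs are already sorted on the join attribute and are small enough to hold as in-memory lists. The function name `merge_join` and the attribute names `k`, `a`, and `b` are hypothetical.

```python
def merge_join(r_sorted, s_sorted, key):
    """Merge phase of a sort-merge equi-join over two lists sorted on `key`."""
    result = []
    i = j = 0
    while i < len(r_sorted) and j < len(s_sorted):
        rk, sk = r_sorted[i][key], s_sorted[j][key]
        if rk < sk:
            i += 1                      # advance r: its key is too small
        elif rk > sk:
            j += 1                      # advance s: its key is too small
        else:
            # Find the run of s records sharing this key value.
            j_end = j
            while j_end < len(s_sorted) and s_sorted[j_end][key] == rk:
                j_end += 1
            # Pair every r record with this key against that run.
            while i < len(r_sorted) and r_sorted[i][key] == rk:
                for ts in s_sorted[j:j_end]:
                    result.append({**r_sorted[i], **ts})
                i += 1
            j = j_end
    return result


r_sorted = [{"k": 1, "a": "r1"}, {"k": 2, "a": "r2"},
            {"k": 2, "a": "r3"}, {"k": 4, "a": "r4"}]
s_sorted = [{"k": 2, "b": "s1"}, {"k": 2, "b": "s2"},
            {"k": 3, "b": "s3"}, {"k": 4, "b": "s4"}]

merged = merge_join(r_sorted, s_sorted, "k")
print([(t["a"], t["b"]) for t in merged])
```

Both cursors only move forward, which is what keeps the block cost at br + bs when the relations are stored in sorted order.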
Hash Join
Like the sort-merge join, the hash join
algorithm can be used to perform natural joins
and equi-joins.
The hash join uses two hash table file
structures (one for each relation) to
partition each relation’s records into sets
containing identical hash values on the join
attributes.
Each relation is scanned and its corresponding
hash table on the join attribute values is built.
Note that collisions may occur, so a partition
may contain records with different join attribute
values that happen to hash to the same value.
After the two hash tables are built, for each
pair of matching partitions in the hash tables, an
in-memory hash index of the smaller relation’s (the
build relation’s) records is built and a nested-loop
join is performed against the corresponding
records in the other relation, writing each
joined record to the result.
Note that the above works only if enough
memory is available to hold the hash index
and the records in any partition of the build
relation. If not, then a process known as
recursive partitioning is performed.
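The partition and probe steps described above can be sketched as below. This is a simplified in-memory illustration: the partitions are Python lists rather than files on disk, the number of partitions is an arbitrary assumption, and recursive partitioning is not implemented. The function names are hypothetical.

```python
from collections import defaultdict


def partition(records, key, n_partitions):
    """Split records into partitions by the hash of the join attribute."""
    parts = [[] for _ in range(n_partitions)]
    for rec in records:
        parts[hash(rec[key]) % n_partitions].append(rec)
    return parts


def hash_join(build, probe, key, n_partitions=4):
    """Partitioned hash equi-join; `build` should be the smaller relation."""
    result = []
    build_parts = partition(build, key, n_partitions)
    probe_parts = partition(probe, key, n_partitions)
    # Only partitions with the same number can contain matching records.
    for b_part, p_part in zip(build_parts, probe_parts):
        # In-memory hash index on the build partition. Comparing the actual
        # key values handles collisions inside a partition.
        index = defaultdict(list)
        for tb in b_part:
            index[tb[key]].append(tb)
        for tp in p_part:
            for tb in index.get(tp[key], []):
                result.append({**tb, **tp})
    return result


build = [{"k": i, "b": f"b{i}"} for i in range(5)]
probe = [{"k": i % 7, "p": f"p{i}"} for i in range(10)]

joined = hash_join(build, probe, "k")
print(len(joined))
```

Partitioning both relations with the same hash function is what allows each build partition to be joined against a single probe partition instead of the whole relation.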
QUERY OPTIMIZATION
The function of a DBMS’s query optimization
engine is to find an evaluation plan that
reduces the overall execution cost of a query.
We have seen in the previous sections that the
costs of performing particular operations
such as select and join can vary quite
dramatically. As an example, consider two
relations r and s, with the following
characteristics:
In heuristic-based optimization,
mathematical rules are applied to the
components of the query to generate an
evaluation plan that, theoretically, will
result in a lower execution time.
Typically, these components are the
data elements within an internal data
structure, such as a query tree, that the
query parser has generated from a higher-level
representation of the query (e.g.,
SQL).
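The text does not name a specific rule, so the sketch below assumes one widely taught heuristic: apply selections before joins. It compares two equivalent plans on toy data and counts record comparisons as a rough stand-in for execution cost. The relations, attribute names, and the `join` helper are hypothetical.

```python
def join(left, right, key):
    """Naive nested-loop equi-join; returns (rows, comparisons made)."""
    rows = []
    comparisons = 0
    for lt in left:
        for rt in right:
            comparisons += 1
            if lt[key] == rt[key]:
                rows.append({**lt, **rt})
    return rows, comparisons


vehicles = [{"vid": i, "make": "Delorean" if i % 5 == 0 else "Other"}
            for i in range(20)]
owners = [{"vid": i, "owner": f"o{i}"} for i in range(20)]

# Plan A: join first, then apply the selection to the joined rows.
joined_all, cost_a = join(vehicles, owners, "vid")
plan_a = [t for t in joined_all if t["make"] == "Delorean"]

# Plan B: apply the selection first, then join the smaller input.
selected = [v for v in vehicles if v["make"] == "Delorean"]
plan_b, cost_b = join(selected, owners, "vid")

print(plan_a == plan_b, cost_a, cost_b)
```

Both plans return the same rows, but the second one joins 4 vehicles instead of 20, which is the kind of saving a rewrite rule on the query tree is meant to capture.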
Another way of optimizing a query is
semantic-based query optimization. In many cases, the
data within and between relations contain
“rules” and patterns that are based upon
“real-world” situations that the DBMS does
not “know” about. For example, vehicles like
the Delorean were not made after 1990, so a
query like “Retrieve all vehicles with make
equal to Delorean and year > 2000” will
produce zero records. Injecting these types of
semantic rules into a DBMS can thus further
reduce a query’s execution time.
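The Delorean example can be sketched as a check that runs before any data is scanned. The rule table `MAX_YEAR_BY_MAKE` and both function names are hypothetical; a real DBMS would derive such rules from declared constraints rather than a hard-coded dictionary.

```python
# Hypothetical semantic rule: the latest year in which each make was built.
MAX_YEAR_BY_MAKE = {"Delorean": 1990}


def provably_empty(make, year_greater_than):
    """True if 'make = X AND year > Y' cannot match any record under the rules."""
    max_year = MAX_YEAR_BY_MAKE.get(make)
    return max_year is not None and year_greater_than >= max_year


def run_query(vehicles, make, year_greater_than):
    """Returns (matching records, records scanned)."""
    if provably_empty(make, year_greater_than):
        return [], 0                      # answered without touching the data
    matches = []
    scanned = 0
    for v in vehicles:
        scanned += 1
        if v["make"] == make and v["year"] > year_greater_than:
            matches.append(v)
    return matches, scanned


vehicles = [{"make": "Delorean", "year": 1982}, {"make": "Ford", "year": 2005}]

print(run_query(vehicles, "Delorean", 2000))  # rule applies: no scan needed
print(run_query(vehicles, "Ford", 2000))      # no rule: full scan
```

The saving comes from skipping the scan entirely when the predicate contradicts a known rule; queries that no rule covers run as usual.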
