0% found this document useful (0 votes)

7 views16 pages

CSE 444: Database Internals: Section 4: Query Optimizer

The document discusses the query optimization process in databases, focusing on estimating the cost of query plans and providing examples of physical query plans. It includes a detailed breakdown of a sample query involving students, books, and checkouts, demonstrating different optimization strategies and their costs. Additionally, it covers the Sellinger optimization example with sailors, boats, and reserves, highlighting the use of indexes and various join methods to improve query performance.

Uploaded by

wahodbsa

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views16 pages

CSE 444: Database Internals: Section 4: Query Optimizer

Uploaded by

wahodbsa

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 16

CSE 444: Database Internals

Section 4:
Query Optimizer
Plan for Today
• Problem 1A, 1B: Estimating cost of a plan
– You try to compute the cost for 5 mins
– We go over the solution together

• Problem 2: Sellinger Optimizer

– We will do it together
1. Estimating Cost of a given plan
Student (sid, name, age, address)
Book(bid, title, author)
Checkout(sid, bid, date)

Query:
SELECT S.name
FROM Student S, Book B, Checkout C
WHERE S.sid = C.sid
AND B.bid = C.bid
AND B.author = 'Olden Fames'
AND S.age > 12
AND S.age < 20
S(sid,name,age,addr)
B(bid,title,author)
C(sid,bid,date)
Assumptions
• Student: S, Book: B, Checkout: C

• Sid, bid foreign key in C referencing S and B resp.

• There are 10,000 Student records stored on 1,000 pages.
• There are 50,000 Book records stored on 5,000 pages.
• There are 300,000 Checkout records stored on 15,000
pages.
• There are 500 different authors.
• Student ages range from 7 to 24.
S(sid,name,age,addr) T(S)=10,000 B(S)=1,000 V(B,author) = 500
B(bid,title,author) T(B)=50,000 B(B)=5,000 7 <= age <= 24
C(sid,bid,date) T(C)=300,000 B(C)=15,000

Physical Query Plan – 1A

(On the fly) d name
Q. Compute
1. the cost and
(On the fly) c 12<age<20 Ʌ author = ‘Olden Fames’
cardinality in steps
(a) to (d)
(Tuple-based nested loop 2. the total cost
B inner) b
bid Assumptions:
• Data is not sorted on any
attributes
(Block-nested loop, a • Outer relation fits in memory
S outer, C inner) sid

Student S Checkout C Book B

(File scan) (File scan) (File scan)
5
S(sid,name,age,addr) T(S)=10,000 B(S)=1,000 V(B,author) = 500
B(bid,title,author) T(B)=50,000 B(B)=5,000 7 <= age <= 24
C(sid,bid,date) T(C)=300,000 B(C)=15,000
(a) B(S) + B(S) * B(C)
Solution – 1A = 1000 + 1000 * 15000
= 15,001,000

(On the fly) d name cardinality = 300,000

(foreign key join, output
pipelined to next join)
Also, applying the formula, join
(On the fly) c 12<age<20 Ʌ author = ‘Olden Fames’ size=T(S) * T(C)/max (V(S, sid), V(C, sid) )
= T(S) since V(S, sid) > = V(C, sid) and
T(S) = V(S, sid)
(Tuple-based nested loop (b) T(S C) * B(B)
B inner) b = 300,000 * 5,000 = 15 * 108
bid
cardinality = 300,000
(foreign key join, don’t need
scanning for outer relation)
(Block-nested loop, a (c, d) cost 0 (on the fly)
S outer, C inner) sid Book B Cardinality:
(File scan) 300,000 * 1/500 * 7/18
= 234 (approx)
(assuming uniformity and
Student S Checkout C independence)
(File scan) (File scan) Total cost = 1,515,001,000
Final cardinality = 234 (approx)
6
S(sid,name,age,addr) T(S)=10,000 B(S)=1,000 V(B,author) = 500
B(bid,title,author) T(B)=50,000 B(B)=5,000 7 <= age <= 24
C(sid,bid,date) T(C)=300,000 B(C)=15,000

Physical Query Plan – 1B

(On the fly) g name

Q. Compute
(On the fly)
f 12<age<20 1. the cost and
cardinality in steps
(Block nested loop e (a) to (g)
S inner) sid 2. the total cost
Assumptions:
d sid (On the fly) • Unclustered B+tree index on
B.author
(Indexed-nested loop, • Clustered B+tree index on
C.bid
B outer, C inner) c • All index pages are in memory
bid • Unlimited memory
(On the fly) b bid
a author = ‘Olden Fames’ Checkout C Student S
(Index scan) (File scan)
Book B
(Index scan) 7
S(sid,name,age,addr) T(S)=10,000 B(S)=1,000 V(B,author) = 500
B(bid,title,author): Un. B+ on author T(B)=50,000 B(B)=5,000 7 <= age <= 24
C(sid,bid,date): Cl. B+ on bid T(C)=300,000 B(C)=15,000
(a) T(B) / V(B, author)
Solution – 1B cost = 50,000/500 = 100
cardinality = 100
(unclustered)

(On the fly) g name

(b) Cost 0, cardinality 100
(c)
(On the fly)
f 12<age<20 i. one index lookup per outer B tuple
ii. 1 book has 6 checkouts (uniformity)
iii. # C tuples per page = T(C)/B(C) = 20
(Block nested loop e iv. 6 tuples fit in at most 2 consecutive pages
(clustered) – could assume 1 page as well
S inner) sid Cost <= 100 * 2= 200
cardinality = 100 * 6 = 600
d sid (On the fly) (= 100 * T(C)/ MAX(100, V(C, bid)) assuming
V(C, bid) = V(B, bid) = T(B) = 50,000)
(d) Cost 0, cardinality 600
(Indexed-nested loop,
(e) Outer relation is already in memory,
B outer, C inner) c Student S need to scan S relation
bid
cost B(S) = 1000
(File scan) Cardinality = 600
(On the fly) b bid
(f) Cost = 0
a author = ‘Olden Fames’ Checkout C Cardinality = 600 * 7/18 = 234 (approx)

(Index scan) (d) Cost 0, cardinality 234

Book B Total cost = 1300 (compare with 1,515,001,000 in 1A!)
(Index scan) Final cardinality = 234 (approx) (same as 1A!) 8
2. Sellinger Optimization Example
Sailors (sid, sname, srating, age)
Boats(bid, bname, color)
Reserves(sid, bid, date, rname)

Query:
SELECT S.sid, R.rname
FROM Sailors S, Boats B, Reserves R
WHERE S.sid = R.sid
AND B.bid = R.bid
AND B.color = red
Example is from the Ramakrishnan book
S (sid, sname, srating, age)
B (bid, bname, color)
R (sid, bid, date, rname)
Available Indexes
• Sailors: S, Boats: B, Reserves: R

• Sid, bid foreign key in R referencing S and B resp.

• Sailors
– Unclustered B+ tree index on sid
– Unclustered hash index on sid
• Boats
– Unclustered B+ tree index on color
– Unclustered hash index on color
• Reserves
– Unclustered B+ tree on sid
– Clustered B+ tree on bid
S (sid, sname, srating, age): B+tree - sid, hash index - sid SELECT S.sid, R.rname
B (bid, bname, color) : B+tree - color, hash index - color WHERE S.sid = R.sid
R (sid, bid, date, rname) : B+tree - sid, Clustered B+tree - bid B.bid = R.bid, B.color = red

First Pass
• Where to start?
– How to access each relation, assuming it would be the first
relation being read
– File scan is also available!
• Sailors?
– No selection matching an index, use File Scan (no overhead)
• Reserves?
– Same as Sailors
• Boats?
– Hash index on color, matches B.color = red
– B+ tree also matches the predicate, but hash index is cheaper
• B+ tree would be cheaper for range queries
S (sid, sname, srating, age): 1. B+tree - sid, 2. hash index - sid SELECT S.sid, R.rname
B (bid, bname, color) : 1. B+tree - color, 2. hash index - color WHERE S.sid = R.sid
R (sid, bid, date, rname) : 1. B+tree - sid, 2. Clustered B+tree - bid B.bid = R.bid, B.color = red

Second Pass
• What next?
– For each of the plan in Pass 1 taken as outer, consider joining
another relation as inner
• What are the combinations? How many new options?
Outer Inner OPTION 1 OPTION 2 OPTION 3
R (file scan) B (B+-color) (hash color) (File scan)
R (file scan) S (B+-sid) (hash sid) ,,
S (file scan) B (B+-color) (hash color) ,,
S (file scan) R (B+-sid) (Cl. B+ bid) ,,
B (hash index) R (B+-sid) (Cl. B+ bid ,,
B (hash index) S (B+-sid) (hash sid) ,,
S (sid, sname, srating, age): 1. B+tree - sid, 2. hash index - sid SELECT S.sid, R.rname
B (bid, bname, color) : 1. B+tree - color, 2. hash index - color WHERE S.sid = R.sid
R (sid, bid, date, rname) : 1. B+tree - sid, 2. Clustered B+tree - bid B.bid = R.bid, B.color = red

Second Pass
• Which outer-inner combinations can be discarded?
– B, S and S, B: Cartesian product!

Outer Inner OPTION 1 OPTION 2 OPTION 3

R (file scan) B (B+-color) (hash color) (File scan)
R (file scan) S (B+-sid) (hash sid) ,,
S (file scan) B (B+-color) (hash color) ,,
S (file scan) R (B+-sid) (Cl. B+ bid) ,,
B (hash index) S (B+-sid) (hash sid) ,,
B (hash index) R (B+-sid) (Cl. B+ bid): ,,

OPTION 3 is not shown on next slide,

expected to be more expensive
S (sid, sname, srating, age): 1. B+tree - sid, 2. hash index - sid SELECT S.sid, R.rname
B (bid, bname, color) : 1. B+tree - color, 2. hash index - color WHERE S.sid = R.sid
R (sid, bid, date, rname) : 1. B+tree - sid, 2. Clustered B+tree - bid B.bid = R.bid, B.color = red

Outer Inner OPTION 1 OPTION 2

R (file scan) S (B+-sid) Slower than (hash sid): likely to be faster
hash-index 2A. Index nested loop join
(need Sailor tuples matching 2B Sort Merge based join: (no index
S.sid = value, where value is sorted on sid, need to sort, output
comes from an outer R tuple) sorted by sid, retained if cheaper)

R (file scan) B (B+-color) Not useful (hash color) Consider all methods, select
those tuples where B.color = red using the
color index (note: no index on bid)
S (file scan) R (B+-sid) Consider all (Cl. B+ bid) Not useful
methods
B (hash R (B+-sid) Not useful (Cl. B+ bid)
index) 2A. Index nested loop join
(no H. I. on bid)
2B. Sort-merge join
Keep the least cost plan between (clustered, index sorted on bid,
• (R, S) and (S, R) produces outputs in sorted order by
• (R, B) and (B, R) bid, retained if cheaper)
S (sid, sname, srating, age): 1. B+tree - sid, 2. hash index - sid SELECT S.sid, R.rname
B (bid, bname, color) : 1. B+tree - color, 2. hash index - color WHERE S.sid = R.sid
R (sid, bid, date, rname) : 1. B+tree - sid, 2. Clustered B+tree - bid B.bid = R.bid, B.color = red

Third Pass
• Join with the third relation
• For each option retained in Pass 2, join with the third
relation
• E.g.
– Boats (B+tree on color) – sort-merged-join – Reserves
(B+tree on bid)
– Join the result with Sailors (B+ tree on sid) using sort-merge-
join
• Need to sort (B join R) by sid, was sorted on bid before
• Outputs tuples sorted by sid
• Not useful here, but will be useful if we had GROUP BY on sid
• In general, a higher cost “interesting” plans may be retained (e.g.
sort operator at root, grouping attribute in group by query later, join
attriute in a later join)
Tomorrow, Lecture 12
• Pseudocode for Sellinger Optimization as a
dynamic programming
• Complexity of the algorithm

Compact Max USER GUIDE
No ratings yet
Compact Max USER GUIDE
14 pages
CSE 444 Practice Problems
No ratings yet
CSE 444 Practice Problems
13 pages
L10-Query Evaluaion
No ratings yet
L10-Query Evaluaion
50 pages
Session - 10 Querying
No ratings yet
Session - 10 Querying
36 pages
QEII
No ratings yet
QEII
44 pages
Overview of Query Evaluation: R&G Chapter 12
No ratings yet
Overview of Query Evaluation: R&G Chapter 12
30 pages
Lecture11 Query Processing
No ratings yet
Lecture11 Query Processing
37 pages
CSE 444 Practice Problems
No ratings yet
CSE 444 Practice Problems
8 pages
Hash Tables and Query Execution: March 1st, 2004
No ratings yet
Hash Tables and Query Execution: March 1st, 2004
32 pages
13 QP1
No ratings yet
13 QP1
33 pages
Final Review
No ratings yet
Final Review
96 pages
Course08 - RelEval
No ratings yet
Course08 - RelEval
22 pages
Evaluation of Relational Operations: Chapter 14, Part A (Joins)
No ratings yet
Evaluation of Relational Operations: Chapter 14, Part A (Joins)
6 pages
Ch12-Query Processing
No ratings yet
Ch12-Query Processing
34 pages
7-Query Processing
No ratings yet
7-Query Processing
47 pages
QueryProcess Optim
No ratings yet
QueryProcess Optim
60 pages
Query Processing
No ratings yet
Query Processing
77 pages
Unit IV Part II
No ratings yet
Unit IV Part II
37 pages
05 Optimization
No ratings yet
05 Optimization
58 pages
DBMS R19 Unit Iv
No ratings yet
DBMS R19 Unit Iv
25 pages
Introduction To Database Systems CSE 344: Basic Query Evaluation and Indexes
No ratings yet
Introduction To Database Systems CSE 344: Basic Query Evaluation and Indexes
38 pages
ADBMS TypicalQueryOptimizer
No ratings yet
ADBMS TypicalQueryOptimizer
30 pages
Relational Query Optimization: CS186 R & G Chapters 12/15
No ratings yet
Relational Query Optimization: CS186 R & G Chapters 12/15
51 pages
Midterm 13w2
No ratings yet
Midterm 13w2
8 pages
05 QueryProcessing LecW4 Feb7 22
No ratings yet
05 QueryProcessing LecW4 Feb7 22
55 pages
Query Processing and Optimization
No ratings yet
Query Processing and Optimization
45 pages
Practice Optimizer Blank
No ratings yet
Practice Optimizer Blank
7 pages
BCS Topic
No ratings yet
BCS Topic
66 pages
Query Processing
No ratings yet
Query Processing
39 pages
CAS CS 460/660 Introduction To Database Systems Query Optimization
No ratings yet
CAS CS 460/660 Introduction To Database Systems Query Optimization
20 pages
DBMS UNIT 4 Part 1
No ratings yet
DBMS UNIT 4 Part 1
15 pages
Lesson 06
No ratings yet
Lesson 06
44 pages
Query Processing + Optimization: Outline: Operator Evaluation Strategies
No ratings yet
Query Processing + Optimization: Outline: Operator Evaluation Strategies
53 pages
HW 3 Sol
No ratings yet
HW 3 Sol
8 pages
Final 15
No ratings yet
Final 15
7 pages
Relational Query Optimization: Plan: Tree of R.A. Ops, With Choice of Alg For Each Op
No ratings yet
Relational Query Optimization: Plan: Tree of R.A. Ops, With Choice of Alg For Each Op
7 pages
Database Modeling - notes-VI
No ratings yet
Database Modeling - notes-VI
8 pages
Correction of Final Exam 24-25
No ratings yet
Correction of Final Exam 24-25
5 pages
Advance Database Management System: Unit - 2 .Query Processing and Optimization
No ratings yet
Advance Database Management System: Unit - 2 .Query Processing and Optimization
38 pages
Plan Cost
No ratings yet
Plan Cost
37 pages
3 Join Optimization
No ratings yet
3 Join Optimization
32 pages
DBMS 10 Joins v2
No ratings yet
DBMS 10 Joins v2
38 pages
CH 13 Updated
No ratings yet
CH 13 Updated
30 pages
Q Evaluation
No ratings yet
Q Evaluation
17 pages
Optimization
No ratings yet
Optimization
17 pages
Access Path Selection in A Relation Database Management System
No ratings yet
Access Path Selection in A Relation Database Management System
13 pages
DBMS Unit5 Lecture1
No ratings yet
DBMS Unit5 Lecture1
22 pages
Lecture Notes
No ratings yet
Lecture Notes
96 pages
Database Management Systems Practice Problem Set: Query Evaluation, Optimization
No ratings yet
Database Management Systems Practice Problem Set: Query Evaluation, Optimization
3 pages
10 Qo343435154tertweretwgstwgw4
No ratings yet
10 Qo343435154tertweretwgstwgw4
46 pages
Query Optimization
No ratings yet
Query Optimization
20 pages
QueryProcessing Sorting
No ratings yet
QueryProcessing Sorting
44 pages
Advanced Database Systems Lecture Notes
No ratings yet
Advanced Database Systems Lecture Notes
79 pages
Relational Query Optimization: Warih Maharani, ST.,MT
No ratings yet
Relational Query Optimization: Warih Maharani, ST.,MT
39 pages
Database Tuning: Database Tuning Describes A Group of Activities Used To Optimize and Homogenize The
No ratings yet
Database Tuning: Database Tuning Describes A Group of Activities Used To Optimize and Homogenize The
20 pages
Database Technology Query Processing: Heiko Paulheim
No ratings yet
Database Technology Query Processing: Heiko Paulheim
60 pages
Physical Database Design: University of California, Berkeley School of Information
No ratings yet
Physical Database Design: University of California, Berkeley School of Information
71 pages
Assignment2 PDF
No ratings yet
Assignment2 PDF
8 pages
Query Execution
No ratings yet
Query Execution
87 pages
Relational Algebra Optimization
No ratings yet
Relational Algebra Optimization
24 pages
De Moiver's Theorem (Trigonometry) Mathematics Question Bank
From Everand
De Moiver's Theorem (Trigonometry) Mathematics Question Bank
Mohmmad Khaja Shareef
No ratings yet
Base Plate, Pedestal and Footing
No ratings yet
Base Plate, Pedestal and Footing
8 pages
16 Daftar Pustaka
No ratings yet
16 Daftar Pustaka
3 pages
Receipt
No ratings yet
Receipt
1 page
Pakistan Broadcasting Corporation Headquarters Islamabad
No ratings yet
Pakistan Broadcasting Corporation Headquarters Islamabad
3 pages
Despiece Block WA500 4.11
No ratings yet
Despiece Block WA500 4.11
1 page
Energies 10 00336 v2 PDF
No ratings yet
Energies 10 00336 v2 PDF
19 pages
ID Pengaruh Pengawasan Kepala Sekolah Terha
No ratings yet
ID Pengaruh Pengawasan Kepala Sekolah Terha
8 pages
Audience Survey: Ethiopia 2011
80% (5)
Audience Survey: Ethiopia 2011
44 pages
DADF-G1 Install PDF
No ratings yet
DADF-G1 Install PDF
9 pages
Application For Repatriation and Citizenship
No ratings yet
Application For Repatriation and Citizenship
4 pages
Linux Manual
100% (1)
Linux Manual
55 pages
Resources: Artificial Intelligence in The Sorting of Municipal Waste As An Enabler of The Circular Economy
No ratings yet
Resources: Artificial Intelligence in The Sorting of Municipal Waste As An Enabler of The Circular Economy
9 pages
Lila Poonawalla Foundation - Complete
No ratings yet
Lila Poonawalla Foundation - Complete
60 pages
Sas Oracle Loading Methods
No ratings yet
Sas Oracle Loading Methods
9 pages
Betoniera Parti
No ratings yet
Betoniera Parti
3 pages
Checking Summary: Customer Service Information
100% (1)
Checking Summary: Customer Service Information
4 pages
Performance of Bored Piles Constructed Using Polymer Fluids: Lessons From European Experience
No ratings yet
Performance of Bored Piles Constructed Using Polymer Fluids: Lessons From European Experience
9 pages
Booting Up Kali Linux GUI in AWS
No ratings yet
Booting Up Kali Linux GUI in AWS
25 pages
SAP Fiori Elements Usage Guide - August 2019
No ratings yet
SAP Fiori Elements Usage Guide - August 2019
29 pages
Quotation: Onyx Energies (Private) Limited
No ratings yet
Quotation: Onyx Energies (Private) Limited
2 pages
Sa206 - Biedaalt1
No ratings yet
Sa206 - Biedaalt1
10 pages
SL 763733
No ratings yet
SL 763733
8 pages
WITchar - 2025 - 1
No ratings yet
WITchar - 2025 - 1
1 page
Marioff HI-FOG Land Applications: Approvals Matrix
No ratings yet
Marioff HI-FOG Land Applications: Approvals Matrix
2 pages
ARRL QST Antenna - And.tower - Safety. (2001)
No ratings yet
ARRL QST Antenna - And.tower - Safety. (2001)
1 page
Welding Shop Assignment
100% (1)
Welding Shop Assignment
15 pages
Ener Chi Ionization
No ratings yet
Ener Chi Ionization
2 pages
Characterization And Properties Of Green-Emitting Β-Sialon:Eu2+ Powder Phosphors For White Light-Emitting Diodes
No ratings yet
Characterization And Properties Of Green-Emitting Β-Sialon:Eu2+ Powder Phosphors For White Light-Emitting Diodes
4 pages
Nomination For Nari Shakti Puruskars'-2025
No ratings yet
Nomination For Nari Shakti Puruskars'-2025
1 page

CSE 444: Database Internals: Section 4: Query Optimizer

Uploaded by

CSE 444: Database Internals: Section 4: Query Optimizer

Uploaded by

CSE 444: Database Internals

• Problem 2: Sellinger Optimizer

• Sid, bid foreign key in C referencing S and B resp.

Physical Query Plan – 1A

Student S Checkout C Book B

(On the fly) d name cardinality = 300,000

Physical Query Plan – 1B

(On the fly) g name

(Index scan) (d) Cost 0, cardinality 234

• Sid, bid foreign key in R referencing S and B resp.

Outer Inner OPTION 1 OPTION 2 OPTION 3

OPTION 3 is not shown on next slide,

Outer Inner OPTION 1 OPTION 2

You might also like