DBMS Chapter 7

The document discusses query optimization in database systems. It describes the basic steps in query processing as parsing and translation, optimization, and evaluation. The optimization step chooses the most efficient execution plan from semantically equivalent options. Query cost estimation is used to select the lowest-cost plan based on database statistics. Relational algebra transformation rules are applied to generate equivalent expressions to optimize queries. Operator trees are used to represent relational algebra expressions graphically.

Uploaded by

Nabin Shrestha

Chapter 7

Query Optimization

Query Processing
Query processing refers to the range of activities involved in extracting data from a database.
The basic steps involved in processing a query are:
1. Parsing and translation
2. Optimization
3. Evaluation

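Fig: Steps in Query Processing
(Query → Parser and Translator → Relational Algebra Expression → Optimizer → Execution Plan → Evaluation
Engine, which returns the answer to the query. The optimizer consults Statistics from the Data Dictionary;
the evaluation engine reads Data from the Database.)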

1. Parsing and Translation

The first step in any query processing system is to translate a given query into its internal form. This
translation process is similar to the work performed by the parser of a compiler. In generating the internal
form of the query, the parser checks the syntax of the user's query, verifying that the query is formulated
according to the syntax rules of the query language. The query is then translated into relational algebra.
2. Optimization
A relational algebra expression may have many equivalent expressions.
E.g. σ_{balance<2500}(∏_{balance}(account)) is equivalent to ∏_{balance}(σ_{balance<2500}(account)).
Each relational algebra operation can be executed by one of several different algorithms. An evaluation
plan is one possible way of executing a query: an expression together with the algorithm chosen for each
of its operations. Query optimization is the process of choosing, among all equivalent expressions, the one
with the cheapest possible evaluation plan. Cost is estimated using statistical information from the
database catalog, such as the number of tuples in each relation and the size of the tuples.
3. Evaluation
The query execution engine takes a query evaluation plan, executes that plan, and returns the answer to the
query.
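
The equivalence used in step 2 can be checked on a small relation. The following is a minimal Python sketch, not part of any DBMS: the rows of `account` and the helper names `select` and `project` are made up for illustration, and relations are modelled as lists of dictionaries.

```python
# Toy 'account' relation; the rows are invented for this illustration.
account = [
    {"account_no": "A-101", "branch": "ktm", "balance": 500},
    {"account_no": "A-102", "branch": "pkr", "balance": 3000},
    {"account_no": "A-103", "branch": "ktm", "balance": 2400},
    {"account_no": "A-104", "branch": "pkr", "balance": 500},
]

def select(rows, predicate):
    """Selection (sigma): keep the rows that satisfy the predicate."""
    return [row for row in rows if predicate(row)]

def project(rows, attrs):
    """Projection (pi): keep the listed attributes and drop duplicate tuples."""
    result = []
    for row in rows:
        reduced = {a: row[a] for a in attrs}
        if reduced not in result:
            result.append(reduced)
    return result

def low_balance(row):
    return row["balance"] < 2500

# sigma_{balance<2500}(pi_{balance}(account))
plan_a = select(project(account, ["balance"]), low_balance)
# pi_{balance}(sigma_{balance<2500}(account))
plan_b = project(select(account, low_balance), ["balance"])

print(plan_a == plan_b)  # both plans produce the same relation
```

Both orders give the same answer, but they do different amounts of work on the way: the first projects all four tuples before filtering, while the second filters first and projects only the survivors.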

Query Cost Estimation

Compiled By: Mohan Bhandari

Each query can be translated into a number of semantically equivalent plans. The question is which of these
alternatives is the most efficient evaluation plan to select for execution. To answer it, the cost of every
alternative must be estimated, and the plan with the lowest cost is selected. Since a database resides on
disk, the cost of reading from and writing to disk often dominates the cost of processing a query.
To choose a strategy based on reliable information, database systems may store statistics (metadata) for
each relation R. These statistics include the number of tuples in the relation, the size of the tuples, etc.
Cost is generally measured as the total elapsed time for answering a query. Many factors contribute to this
time cost, among them disk accesses, CPU time, and network communication.
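
As a rough illustration of how such statistics feed a cost estimate, the sketch below counts disk block transfers only. Everything in it is an assumption made for the example: the block size, the catalog numbers in `STATS`, and the choice of a block nested-loop join, whose worst-case cost is commonly estimated as b_outer + b_outer × b_inner block transfers.

```python
import math

BLOCK_SIZE = 4096  # bytes per disk block (assumed value)

# Catalog statistics per relation (made-up numbers).
STATS = {
    "account":   {"n_tuples": 10000, "tuple_size": 40},
    "depositor": {"n_tuples": 5000,  "tuple_size": 30},
}

def blocks(relation):
    """Number of disk blocks the relation occupies, derived from the statistics."""
    stats = STATS[relation]
    tuples_per_block = BLOCK_SIZE // stats["tuple_size"]
    return math.ceil(stats["n_tuples"] / tuples_per_block)

def block_nested_loop_cost(outer, inner):
    """Worst-case block transfers: read the outer relation once,
    and the whole inner relation once per block of the outer relation."""
    return blocks(outer) + blocks(outer) * blocks(inner)

# Two candidate plans for the same join, differing only in the outer relation.
plans = {
    "account as outer": block_nested_loop_cost("account", "depositor"),
    "depositor as outer": block_nested_loop_cost("depositor", "account"),
}
best_plan = min(plans, key=plans.get)

print(plans)
print("cheapest:", best_plan)
```

With these numbers the smaller relation is the cheaper choice for the outer loop. A real optimizer uses a richer cost model, but the principle is the same: estimate every alternative from the stored statistics and keep the minimum.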

Equivalence / Transformation Rules

Two relational algebra expressions are said to be equivalent if they produce the same result on every
database instance. By applying equivalence rules, which are defined over the basic relational algebra
operators, we can formulate many equivalent expressions for a single query. If R, S and T are relations and
C, C1, C2, ..., Cn are conditions, then the equivalence rules are:
1. Commutativity of binary operators
R ∪ S ≡ S ∪ R    R ∩ S ≡ S ∩ R    R × S ≡ S × R    R ⋈ S ≡ S ⋈ R
2. Associativity of binary operators
(R ∪ S) ∪ T ≡ R ∪ (S ∪ T)    (R ∩ S) ∩ T ≡ R ∩ (S ∩ T)    R ⋈ (S ⋈ T) ≡ (R ⋈ S) ⋈ T
3. Commuting projection with a binary operator
∏_C(R × S) ≡ ∏_A(R) × ∏_B(S), where C = A ∪ B such that the attributes in A are from relation R and the attributes in B are from relation S.
The same holds for the join operator, provided that the join attributes are included in A and B.
4. Commuting selection with a binary operator
a. σ_C(R ⋈ S) ≡ σ_C(R) ⋈ S, if the attributes involved in condition C are all from relation R
b. σ_C(R ⋈ S) ≡ R ⋈ σ_C(S), if the attributes involved in condition C are all from relation S
c. σ_C(R ⋈ S) ≡ σ_A(R) ⋈ σ_B(S), where C = A ∧ B such that condition A involves only attributes of R and condition B involves only attributes of S.
5. Commuting selection and projection
∏_X(σ_C(R)) ≡ σ_C(∏_X(R)), if condition C involves only attributes in X. The rule can be applied in either direction.
6. Idempotence of unary operators
a. Combine cascaded selections
σ_C1(σ_C2(R)) ≡ σ_{C1 ∧ C2}(R)
b. Combine cascaded projections
∏_X(∏_Y(R)) ≡ ∏_X(R), if X is a subset of Y.
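
Several of these rules can be spot-checked on toy data. The Python sketch below is illustrative only: the relations, their attribute names, and the helpers `natural_join`, `select` and `same` are assumptions made for the example, and a check on one instance is evidence, not a proof of equivalence.

```python
def natural_join(r, s):
    """Natural join: combine rows that agree on every shared attribute."""
    out = []
    for a in r:
        for b in s:
            shared = set(a) & set(b)
            if all(a[k] == b[k] for k in shared):
                out.append({**a, **b})
    return out

def select(rows, predicate):
    """Selection: keep the rows that satisfy the predicate."""
    return [row for row in rows if predicate(row)]

def same(r, s):
    """Compare two relations as sets of tuples; row order is irrelevant."""
    def canon(rows):
        return {tuple(sorted(row.items())) for row in rows}
    return canon(r) == canon(s)

# Made-up sample relations.
account = [
    {"account_no": "A-101", "balance": 500},
    {"account_no": "A-102", "balance": 3000},
    {"account_no": "A-103", "balance": 1500},
]
depositor = [
    {"customer_name": "Asha", "account_no": "A-101"},
    {"customer_name": "Bikram", "account_no": "A-102"},
    {"customer_name": "Chandra", "account_no": "A-103"},
]

def c1(row):
    return row["balance"] > 1000

def c2(row):
    return row["balance"] < 2000

# Rule 1: R join S == S join R
rule_1 = same(natural_join(account, depositor), natural_join(depositor, account))
# Rule 4a: select over a join == join with the selection pushed into R
rule_4a = same(select(natural_join(account, depositor), c1),
               natural_join(select(account, c1), depositor))
# Rule 6a: cascaded selections == one selection with the conjunction
rule_6a = same(select(select(account, c2), c1),
               select(account, lambda row: c1(row) and c2(row)))

print(rule_1, rule_4a, rule_6a)
```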

Example:
Suppose we have the relational algebra expression below.
- ∏_{customer-name}(σ_{branch-city='ktm' ∧ balance>1000}(branch ⋈ account ⋈ depositor))
Using rule 4a, with R = branch ⋈ account and S = depositor, we obtain the equivalent expression
- ∏_{customer-name}(σ_{branch-city='ktm' ∧ balance>1000}(branch ⋈ account) ⋈ depositor)
Using rule 4c, we obtain another equivalent expression
- ∏_{customer-name}((σ_{branch-city='ktm'}(branch) ⋈ σ_{balance>1000}(account)) ⋈ depositor)

Operator Tree
A relational algebra query can be represented graphically by an operator tree. An operator tree is a tree
in which each leaf node is a relation stored in the database and each non-leaf node is an intermediate
relation produced by a relational algebra operator. The sequence of operations is directed from the leaves
to the root, which represents the answer to the query. For example, pushing a selection below a join
(when condition C involves only attributes of E1) transforms the tree as follows:

      σ_C                       ⋈
       |                       /  \
       ⋈           ≡        σ_C    E2
      /  \                   |
    E1    E2                 E1


Example:
Suppose we have the relational algebra expression below.
1. ∏_{student-name}(σ_{course-name='DBMS'}(Student ⋈ Registration ⋈ Course))
The initial operator tree for the above relational expression is as below.

        ∏_{student-name}
               |
     σ_{course-name='DBMS'}
               |
               ⋈
             /   \
      Student     ⋈
                /   \
    Registration     Course
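
An operator tree maps naturally onto a recursive data structure that is evaluated from the leaves up to the root. The sketch below is one possible Python representation, not a prescribed one: the `Node` class, the `DATABASE` contents, and the attributes `sid` and `cid` are hypothetical, since the notes do not give the schemas of Student, Registration and Course.

```python
from dataclasses import dataclass, field

@dataclass
class Node:
    op: str                      # "rel", "join", "select" or "project"
    arg: object = None           # relation name, predicate, or attribute list
    children: list = field(default_factory=list)

# Made-up stored relations (the leaves of the tree read from here).
DATABASE = {
    "Student": [{"sid": 1, "student-name": "Asha"},
                {"sid": 2, "student-name": "Bikram"}],
    "Registration": [{"sid": 1, "cid": 10}, {"sid": 2, "cid": 20}],
    "Course": [{"cid": 10, "course-name": "DBMS"},
               {"cid": 20, "course-name": "OS"}],
}

def evaluate(node):
    """Evaluate the children first, then apply this node's operator."""
    inputs = [evaluate(child) for child in node.children]
    if node.op == "rel":
        return DATABASE[node.arg]
    if node.op == "join":        # natural join on the shared attributes
        left, right = inputs
        return [{**a, **b} for a in left for b in right
                if all(a[k] == b[k] for k in set(a) & set(b))]
    if node.op == "select":
        return [row for row in inputs[0] if node.arg(row)]
    if node.op == "project":
        result = []
        for row in inputs[0]:
            reduced = {k: row[k] for k in node.arg}
            if reduced not in result:
                result.append(reduced)
        return result
    raise ValueError(f"unknown operator: {node.op}")

# The initial operator tree of the example above.
tree = Node("project", ["student-name"], [
    Node("select", lambda row: row["course-name"] == "DBMS", [
        Node("join", children=[
            Node("rel", "Student"),
            Node("join", children=[Node("rel", "Registration"),
                                   Node("rel", "Course")]),
        ]),
    ]),
])

answer = evaluate(tree)
print(answer)
```

The root's value is the answer to the query; every inner call to `evaluate` produces one of the intermediate relations that the optimizer tries to keep small.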

Query Optimization
Query optimization is the process of selecting the most efficient query execution plan from among the many
strategies possible for processing a query. The selected plan is the one that minimizes an objective cost
function. The query optimizer is a very important component of a database system because the efficiency of
the system depends on the performance of the optimizer.
Steps of optimization
1. Create an initial operator (expression) tree.
2. Move select operations down the tree so that they are executed as early as possible.
3. Apply the more restrictive select operations first.
4. Replace Cartesian products by joins.
5. Create new projections wherever needed.
6. Adjust the rest of the tree accordingly.
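
Step 2 can be sketched as a small tree rewrite. The Python code below is a simplified illustration under stated assumptions: trees are nested tuples, every selection condition mentions exactly one attribute, and the attribute sets in `SCHEMA` are hypothetical (the notes only mention city, balance and customer-name). It is not how any particular optimizer is implemented.

```python
# Hypothetical schemas: relation name -> set of attribute names.
SCHEMA = {
    "Branch": {"branch-name", "city"},
    "Account": {"account-no", "branch-name", "balance"},
    "Depositor": {"customer-name", "account-no"},
}

# Tree shapes: ("rel", name), ("join", left, right), ("select", cond, child)
# where cond = (attribute, text of the condition).

def attrs(tree):
    """Attributes available in the output of a subtree."""
    if tree[0] == "rel":
        return SCHEMA[tree[1]]
    if tree[0] == "select":
        return attrs(tree[2])
    return attrs(tree[1]) | attrs(tree[2])      # join

def sink(cond, tree):
    """Place one selection as low in the tree as its attribute allows."""
    if tree[0] == "join":
        _, left, right = tree
        if cond[0] in attrs(left):
            return ("join", sink(cond, left), right)
        if cond[0] in attrs(right):
            return ("join", left, sink(cond, right))
    if tree[0] == "select":                     # selections commute
        return ("select", tree[1], sink(cond, tree[2]))
    return ("select", cond, tree)

def push_down(tree):
    """Rewrite a whole tree so that every selection sits as low as possible."""
    if tree[0] == "rel":
        return tree
    if tree[0] == "join":
        return ("join", push_down(tree[1]), push_down(tree[2]))
    return sink(tree[1], push_down(tree[2]))    # select

joins = ("join", ("join", ("rel", "Branch"), ("rel", "Account")),
         ("rel", "Depositor"))
initial = ("select", ("city", "city = 'Pkr'"),
           ("select", ("balance", "balance > 1000"), joins))

optimized = push_down(initial)
print(optimized)
```

The conjunction city = 'Pkr' ∧ balance > 1000 is written here as two cascaded selections (rule 6a read from right to left), so that each part can travel to the relation that owns its attribute.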
Example 1:
The following query retrieves the names of customers of branches in the city Pokhara whose balance is
greater than 1000.
Select customer-name from Branch, Account, Depositor where city = 'Pkr' and balance > 1000
There are a number of evaluation plans by which the above query can be processed:
1. Join relations Branch and Account, join the result with Depositor, and then do the restrictions.
2. Join relations Branch and Account, do the restrictions, and then join the result with Depositor.
3. Do the restrictions, join relations Branch and Account, and join the result with Depositor.
The query optimizer estimates the cost of each plan and chooses the cheapest way to process the query.
Let us consider the following algebraic expression:
∏_{customer-name}(σ_{city='Pkr' ∧ balance>1000}(Branch ⋈ Account ⋈ Depositor))


The initial operator tree is:

        ∏_{customer-name}
               |
 σ_{city='Pkr' ∧ balance>1000}
               |
               ⋈
             /   \
            ⋈     Depositor
          /   \
     Branch    Account

The final tree after multiple transformations is:

           ∏_{customer-name}
                  |
                  ⋈
                /   \
               ⋈     Depositor
            /     \
 σ_{city='Pkr'}   σ_{balance>1000}
        |                |
     Branch           Account
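
Why the final tree is preferable can be seen by counting the tuples in the intermediate results of plan 1 (restrict last) and plan 3 (restrict first). The Python sketch below uses made-up rows and hypothetical attribute names (`branch-name`, `account-no`); intermediate result size is only a stand-in for cost, not a full cost model.

```python
def natural_join(r, s):
    """Natural join: combine rows that agree on every shared attribute."""
    return [{**a, **b} for a in r for b in s
            if all(a[k] == b[k] for k in set(a) & set(b))]

# Made-up sample data.
branch = [{"branch-name": "b1", "city": "Pkr"},
          {"branch-name": "b2", "city": "Ktm"}]
account = [{"account-no": "a1", "branch-name": "b1", "balance": 500},
           {"account-no": "a2", "branch-name": "b1", "balance": 2000},
           {"account-no": "a3", "branch-name": "b2", "balance": 3000},
           {"account-no": "a4", "branch-name": "b2", "balance": 100}]
depositor = [{"customer-name": "Asha", "account-no": "a1"},
             {"customer-name": "Bikram", "account-no": "a2"},
             {"customer-name": "Chandra", "account-no": "a3"},
             {"customer-name": "Dipak", "account-no": "a4"}]

# Plan 1: join everything, then restrict.
ba = natural_join(branch, account)
bad = natural_join(ba, depositor)
plan1_rows = [row for row in bad
              if row["city"] == "Pkr" and row["balance"] > 1000]
plan1_sizes = [len(ba), len(bad)]

# Plan 3: restrict first, then join.
pkr_branches = [row for row in branch if row["city"] == "Pkr"]
rich_accounts = [row for row in account if row["balance"] > 1000]
ba_small = natural_join(pkr_branches, rich_accounts)
bad_small = natural_join(ba_small, depositor)
plan3_sizes = [len(ba_small), len(bad_small)]

plan1_names = sorted(row["customer-name"] for row in plan1_rows)
plan3_names = sorted(row["customer-name"] for row in bad_small)

print(plan1_names, plan1_sizes)
print(plan3_names, plan3_sizes)
```

Both plans return the same customer, but plan 3 carries far fewer tuples through its joins, which is the intuition behind moving selections down the tree.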

Example 2:
Suppose we are given the following table definitions, with certain records in each table.
PROJ(PNO, PNAME, BUDGET)
EMP(ENO, ENAME, TITLE)
ASG(ENO, PNO, DUR)
Write the SQL statement and the RA expression for: "Find the names of employees other than Ram Thapa who
worked on the CAD/CAM project for either 1 or 2 years". Construct the initial operator tree and the final,
efficient operator tree obtained by applying the transformation rules.
SQL:
select ENAME from EMP, ASG, PROJ where EMP.ENO = ASG.ENO and ASG.PNO = PROJ.PNO and
ENAME != 'Ram Thapa' and PNAME = 'CAD/CAM' and (DUR = 1 or DUR = 2)
RA:
∏_{ENAME}(σ_{ENAME≠'Ram Thapa' ∧ PNAME='CAD/CAM' ∧ (DUR=1 ∨ DUR=2)}(PROJ ⋈ (EMP ⋈ ASG)))
Initial operator tree

                    ∏_{ENAME}                                    (Project)
                        |
 σ_{ENAME≠'Ram Thapa' ∧ PNAME='CAD/CAM' ∧ (DUR=1 ∨ DUR=2)}       (Select)
                        |
                        ⋈                                        (Join)
                      /   \
                  PROJ     ⋈
                         /   \
                      EMP     ASG

Final operator tree (a more efficient query evaluation tree, since more selective operations are performed
first)

                   ∏_{ENAME}
                       |
                       ⋈
                  /         \
 σ_{PNAME='CAD/CAM'}          ⋈
          |               /       \
        PROJ   σ_{DUR=1 ∨ DUR=2}   σ_{ENAME≠'Ram Thapa'}
                      |                   |
                     ASG                 EMP
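
The SQL statement of Example 2 can be tried out with Python's built-in `sqlite3` module. The sketch below is an illustration only: the inserted rows are made up, and the second query is a hand-written SQL counterpart of the final operator tree, with each selection placed in a subquery next to its relation. SQLite's own optimizer remains free to reorder either form internally.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    create table PROJ (PNO integer, PNAME text, BUDGET integer);
    create table EMP  (ENO integer, ENAME text, TITLE text);
    create table ASG  (ENO integer, PNO integer, DUR integer);
""")
# Made-up sample records.
conn.executemany("insert into PROJ values (?, ?, ?)",
                 [(10, "CAD/CAM", 100000), (20, "Payroll", 50000)])
conn.executemany("insert into EMP values (?, ?, ?)",
                 [(1, "Ram Thapa", "Engineer"),
                  (2, "Sita Rai", "Analyst"),
                  (3, "Hari Gurung", "Engineer")])
conn.executemany("insert into ASG values (?, ?, ?)",
                 [(1, 10, 1), (2, 10, 2), (3, 10, 3), (2, 20, 1)])

# The query as written in the example (restrictions applied over the joins).
initial_sql = """
    select ENAME from EMP, ASG, PROJ
    where EMP.ENO = ASG.ENO and ASG.PNO = PROJ.PNO
      and ENAME != 'Ram Thapa' and PNAME = 'CAD/CAM'
      and (DUR = 1 or DUR = 2)
"""

# Counterpart of the final tree: each selection sits directly on its relation.
pushed_sql = """
    select E.ENAME
    from (select * from EMP where ENAME != 'Ram Thapa') as E
    join (select * from ASG where DUR = 1 or DUR = 2) as A on E.ENO = A.ENO
    join (select * from PROJ where PNAME = 'CAD/CAM') as P on A.PNO = P.PNO
"""

initial_rows = conn.execute(initial_sql).fetchall()
pushed_rows = conn.execute(pushed_sql).fetchall()
conn.close()

print(initial_rows, pushed_rows)
```

Both formulations return the same employee names on this data, which is what the equivalence rules guarantee for any data.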
