Query tuning

Viet-Trung Tran
SoICT

9/27/21 Database Tuning

What is query tuning

• Rewrite query to run faster

• First thing to do if query is slow
• Other tuning approaches related to query
• Adding indexes
• Changing the schema (3NF, 4NF, etc.)
• Modifying transaction length

1. Overview

• What is query processing

• Phases of query processing
• Parser
• Optimizer
1.1. What is query processing

• The entire process, or set of activities, involved in retrieving data from the database
• Translation of the SQL query into low-level instructions (usually relational algebra)
• Query optimization to save resources, based on cost estimation or evaluation of the query
• Query execution to extract the data from the database
1.2. Phases of query processing

[Figure: phases of query processing. SQL → Parser → Query plan → Optimizer → Optimized execution plan → Code generator → Code for executing]

1.3. Parser

• Scans the query, splits it into individual tokens, and checks the query for correctness
• Does it contain the right keywords?
• Does it conform to the syntax?
• Does it contain valid tables and attributes?
• Output: Query plan
• E.g.
• Input: SELECT balance FROM account WHERE balance < 2500
• Output: Relational algebra expression
• But the expression is not unique
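To illustrate why the parser output is not unique, here is a minimal Python sketch that evaluates the example query in two ways: selection before projection, and projection before selection. The toy `account` rows, the `account_no` attribute, and the helper names `select` and `project` are assumptions made for illustration only.

```python
# Toy relation standing in for the account table (values are made up).
account = [
    {"account_no": "A-101", "balance": 500},
    {"account_no": "A-102", "balance": 2500},
    {"account_no": "A-103", "balance": 900},
]

def select(rows, predicate):
    """Selection (sigma): keep the rows that satisfy the predicate."""
    return [row for row in rows if predicate(row)]

def project(rows, attrs):
    """Projection (pi): keep only the listed attributes (duplicates kept, as in SQL)."""
    return [{a: row[a] for a in attrs} for row in rows]

def low_balance(row):
    return row["balance"] < 2500

# Expression 1: select first, then project.
plan1 = project(select(account, low_balance), ["balance"])
# Expression 2: project first, then select.
plan2 = select(project(account, ["balance"]), low_balance)

print(plan1 == plan2)  # both expressions return the same tuples
```

Both expressions answer the same SQL query, so the optimizer is free to pick whichever it expects to be cheaper.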
1.4. Optimizer

• Input: RA expression

• Output: Query execution plan

• Query execution plan = query plan + the algorithms for executing the RA operations
• Aims to choose the cheapest execution plan out of the possible ones
• Step 1: Equivalence transformation
• Step 2: Annotation for the algorithm of the RA expression
• Step 3: Cost estimation for different query execution plans
2. Understanding optimizer

• Choose the cheapest execution plan out of the possible ones

• Step 1: Equivalence transformation
• Step 2: Annotation for the algorithmic execution of the RA expression
• Step 3: Cost estimation for different query execution plans
2.1. Step 1: Equivalence transformation

• RA expressions are equivalent if they generate the same set of tuples on every database instance
• Equivalence rules:
• Transform one relational algebra expression into an equivalent one
• Similar to numeric algebra: a + b = b + a, a(b + c) = ab + ac, etc.
• Why produce equivalent expressions?
• equivalent algebraic expressions give the same result
• but usually the execution time varies significantly
2.1. Step 1: Equivalence transformation

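• Equivalence transformation rules

• (1) Conjunctive selection operations can be deconstructed into a sequence of individual selections; cascade of $\sigma$
• $\sigma_{\theta_1 \wedge \theta_2}(E) = \sigma_{\theta_1}(\sigma_{\theta_2}(E))$
• (2) Selection operations are commutative
• $\sigma_{\theta_1}(\sigma_{\theta_2}(E)) = \sigma_{\theta_2}(\sigma_{\theta_1}(E))$
• (3) Only the final operation in a sequence of projection operations is needed; cascade of $\Pi$
• $\Pi_{L_1}(\Pi_{L_2}(\dots(\Pi_{L_n}(E))\dots)) = \Pi_{L_1}(E)$
• (4) Selections can be combined with Cartesian products and theta joins
• $\sigma_{\theta_1}(E_1 \times E_2) = E_1 \bowtie_{\theta_1} E_2$
• $\sigma_{\theta_1}(E_1 \bowtie_{\theta_2} E_2) = E_1 \bowtie_{\theta_1 \wedge \theta_2} E_2$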
2.1. Step 1: Equivalence transformation

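• Equivalence transformation rules

• (5) Theta join operations are commutative
• $E_1 \bowtie_{\theta} E_2 = E_2 \bowtie_{\theta} E_1$
• (6) Natural join operations are associative
• $E_1 \bowtie (E_2 \bowtie E_3) = (E_1 \bowtie E_2) \bowtie E_3$
• Theta joins are associative in the following manner, where $\theta_2$ involves attributes from $E_2$ and $E_3$ only
• $(E_1 \bowtie_{\theta_1} E_2) \bowtie_{\theta_2 \wedge \theta_3} E_3 = E_1 \bowtie_{\theta_1 \wedge \theta_3} (E_2 \bowtie_{\theta_2} E_3)$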
2.1. Step 1: Equivalence transformation

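• Equivalence transformation rules

• (7) Selection distributes over joins in the following ways
• If predicate $\theta_1$ involves attributes of $E_1$ only
• $\sigma_{\theta_1}(E_1 \bowtie_{\theta_2} E_2) = \sigma_{\theta_1}(E_1) \bowtie_{\theta_2} E_2$
• If predicate $\theta_1$ involves only attributes of $E_1$ and $\theta_2$ involves only attributes of $E_2$ (a consequence of rules 7 and 1)
• $\sigma_{\theta_1 \wedge \theta_2}(E_1 \bowtie_{\theta_3} E_2) = \sigma_{\theta_1}(E_1) \bowtie_{\theta_3} \sigma_{\theta_2}(E_2)$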
2.1. Step 1: Equivalence transformation

• Equivalence transformation rules

• (8) Projection distributes over join as follows
• $\Pi_{L_1 \cup L_2}(E_1 \bowtie_{\theta} E_2) = \Pi_{L_1}(E_1) \bowtie_{\theta} \Pi_{L_2}(E_2)$
• If $\theta$ involves attributes in $L_1 \cup L_2$ only and $L_i$ contains attributes of $E_i$
• (9) The set operations union and intersection are commutative
• $E_1 \cup E_2 = E_2 \cup E_1$
• $E_1 \cap E_2 = E_2 \cap E_1$
• (10) The union and intersection are associative
• $(E_1 \cup E_2) \cup E_3 = E_1 \cup (E_2 \cup E_3)$
2.1. Step 1: Equivalence transformation

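• Equivalence transformation rules

• (11) The selection operation distributes over union, intersection, and set-difference
• $\sigma_{\theta}(E_1 \cup E_2) = \sigma_{\theta}(E_1) \cup \sigma_{\theta}(E_2)$
• $\sigma_{\theta}(E_1 \cap E_2) = \sigma_{\theta}(E_1) \cap \sigma_{\theta}(E_2)$
• $\sigma_{\theta}(E_1 - E_2) = \sigma_{\theta}(E_1) - \sigma_{\theta}(E_2)$
• (12) The project operation distributes over the union
• $\Pi_{L}(E_1 \cup E_2) = \Pi_{L}(E_1) \cup \Pi_{L}(E_2)$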
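As a concrete check of rule (7), the following Python sketch pushes a selection below a theta join and compares both the result and the number of tuple pairs the join has to examine. The tiny relations, the renamed attribute `tdept`, the location values, and the helper `theta_join` are assumptions made for illustration; this is not how a real optimizer represents plans.

```python
from itertools import product

def theta_join(left, right, theta):
    """Theta join: concatenate every pair of tuples that satisfies theta."""
    return [{**l, **r} for l, r in product(left, right) if theta(l, r)]

# Made-up sample data; tdept is renamed to avoid clashing with dept.
employee = [
    {"ssnum": 1, "dept": "IT"},
    {"ssnum": 2, "dept": "HR"},
    {"ssnum": 3, "dept": "IT"},
]
techdept = [
    {"tdept": "IT", "location": "building A"},
    {"tdept": "R&D", "location": "building B"},
]

def selection(e):
    """Selection predicate: uses attributes of employee only."""
    return e["ssnum"] > 1

def join_condition(e, t):
    """Join predicate."""
    return e["dept"] == t["tdept"]

# Left-hand side of rule (7): join first, then select.
lhs = [row for row in theta_join(employee, techdept, join_condition) if selection(row)]
pairs_lhs = len(employee) * len(techdept)

# Right-hand side of rule (7): select first, then join the smaller input.
filtered = [e for e in employee if selection(e)]
rhs = theta_join(filtered, techdept, join_condition)
pairs_rhs = len(filtered) * len(techdept)

print(lhs == rhs, pairs_lhs, pairs_rhs)
```

The two sides return the same tuples, but the pushed-down version examines fewer pairs, which is why "perform selections early" is a common heuristic.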
2.2. Step 2: Execution algorithms of RA operations

• Algebra expression is not a query execution plan.

• Additional decisions required:
• which indexes to use, for example, for joins and selects?
• which algorithms to use, for example, sort-merge vs. hash join?
• materialize intermediate results or pipeline them?
2.2. Step 2: Execution algorithms of RA operations

• Basic Operators
• One-pass operators:
• Scan
• Select
• Project
• Multi-pass operators:
• Join
• Various implementations
• Handling of larger-than-memory sources
• Aggregation, union, etc.
2.2. Step 2: Execution algorithms of RA operations

• 1-Pass Operators: Scanning a Table

• Sequential scan: read through blocks of table
• Index scan: retrieve tuples in index order
2.2. Step 2: Execution algorithms of RA operations

• Nested-loop JOIN

for each tuple tr in r {
    for each tuple ts in s {
        if (tr and ts satisfy the join condition) {
            add tuple tr x ts to the result set
        }
    }
}

• No index needed
• Works with any type of join condition
• Expensive: O(n^2)
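The pseudocode above translates almost line by line into Python. This is a minimal in-memory sketch, assuming relations are lists of tuples and the join condition is an arbitrary function; the name `nested_loop_join` and the sample data are illustrative.

```python
def nested_loop_join(r, s, condition):
    """Compare every tuple of r with every tuple of s: |r| * |s| comparisons."""
    result = []
    for tr in r:
        for ts in s:
            if condition(tr, ts):
                result.append(tr + ts)  # concatenate the two tuples
    return result

r = [(1, "a"), (2, "b")]
s = [(2, "x"), (3, "y"), (2, "z")]

# Equality join on the first attribute.
equi = nested_loop_join(r, s, lambda tr, ts: tr[0] == ts[0])
# Any other condition works too, for example an inequality join.
less = nested_loop_join(r, s, lambda tr, ts: tr[0] > ts[0])

print(equi)
print(less)
```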
2.2. Step 2: Execution algorithms of RA operations

• Single-loop JOIN (Index-based)

• Sort-merge JOIN
• Requires data physically sorted by the join attributes: merge and join the sorted files, reading them sequentially a block at a time
• Maintain two file pointers
• While tuple at R < tuple at S, advance R (and vice versa)
• While tuples match, output all possible pairings
• Very efficient for presorted data. Otherwise, may require a sort (adds cost + delay)
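The two-pointer merge described above can be sketched in Python as follows. This is an in-memory simplification that assumes tuples instead of disk blocks and sorts both inputs up front; the function name `sort_merge_join` and the key-extractor parameters are illustrative.

```python
def sort_merge_join(r, s, key_r, key_s):
    """Equality join: sort both inputs on the join key, then merge them."""
    r = sorted(r, key=key_r)  # skip this step if the data is already sorted
    s = sorted(s, key=key_s)
    result = []
    i = j = 0
    while i < len(r) and j < len(s):
        kr, ks = key_r(r[i]), key_s(s[j])
        if kr < ks:
            i += 1            # advance R
        elif kr > ks:
            j += 1            # advance S
        else:
            # Find the run of S tuples that share this key value.
            j_end = j
            while j_end < len(s) and key_s(s[j_end]) == kr:
                j_end += 1
            # Output all pairings between the matching runs of R and S.
            while i < len(r) and key_r(r[i]) == kr:
                for ts in s[j:j_end]:
                    result.append(r[i] + ts)
                i += 1
            j = j_end
    return result

r = [(3, "c"), (1, "a"), (2, "b"), (2, "bb")]
s = [(2, "x"), (4, "w"), (2, "z"), (1, "y")]
joined = sort_merge_join(r, s, key_r=lambda t: t[0], key_s=lambda t: t[0])
print(joined)
```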
2.2. Step 2: Execution algorithms of RA operations

• Partition-hash JOIN
• Hash two relations on join attributes
• Join buckets accordingly
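A partition-hash join can be sketched in Python as below: both relations are partitioned with the same hash function, and only buckets with the same number are joined with each other. This is an in-memory illustration; the names `partition` and `partition_hash_join` and the default of four buckets are assumptions, and a real system would write the partitions to disk.

```python
from collections import defaultdict

def partition(rel, key, n_buckets):
    """Distribute the tuples over buckets by hashing the join attribute."""
    buckets = defaultdict(list)
    for t in rel:
        buckets[hash(key(t)) % n_buckets].append(t)
    return buckets

def partition_hash_join(r, s, key_r, key_s, n_buckets=4):
    r_parts = partition(r, key_r, n_buckets)
    s_parts = partition(s, key_s, n_buckets)
    result = []
    for b, r_bucket in r_parts.items():
        # Build an in-memory lookup table on the R bucket ...
        table = defaultdict(list)
        for tr in r_bucket:
            table[key_r(tr)].append(tr)
        # ... and probe it with the S tuples from the same bucket.
        for ts in s_parts.get(b, []):
            for tr in table.get(key_s(ts), []):
                result.append(tr + ts)
    return result

r = [(1, "a"), (2, "b"), (5, "e")]
s = [(2, "x"), (5, "y"), (2, "z"), (7, "w")]
joined = partition_hash_join(r, s, key_r=lambda t: t[0], key_s=lambda t: t[0])
print(sorted(joined))
```

Tuples with equal join values always land in the same bucket number, so no matches are lost by joining bucket by bucket. The technique only applies to equality joins.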
2.2. Step 2: Execution algorithms of RA operations

• Execution Strategy: Materialization vs. Pipelining

• Execution strategy defines how to walk the query execution plan
• Materialization
• Pipelining

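[Figure: example query execution plan. Root: Join (PressRel.Symbol = EastCoast.CoSymbol). Its inputs are a Join (PressRel.Symbol = Clients.Symbol) and a Project (CoSymbol). The lower join reads Scan PressRel and a Select (Client = "Atkins") over Scan Clients; the projection reads Scan EastCoast.]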
2.2. Step 2: Execution algorithms of RA operations

• Materialization
• Performs the innermost (leaf-level) operations of the query execution plan first
• The intermediate result of each operation is materialized into a temporary relation and becomes the input for subsequent operations.
• The cost of materialization is the sum of the costs of the individual operations plus the cost of writing the intermediate results to disk
• Lots of temporary files, lots of I/O.
2.2. Step 2: Execution algorithms of RA operations

• Pipelining
• Operations form a queue, and results are passed from one operation to another as they are calculated
• Pipelining restructures the individual operation algorithms so that they take streams of tuples as both input and output.
• Limitation
• algorithms that require sorting can only use pipelining if the input is already sorted beforehand
• since sorting by nature cannot be performed until all tuples to be sorted are known.
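Python generators give a convenient way to sketch pipelining: each operator pulls tuples from its input one at a time and passes results on immediately. The operator names, the `rows_read` counter, and the generated `employee` rows below are assumptions for illustration only.

```python
from itertools import islice

rows_read = {"n": 0}  # counts how many tuples the scan has produced

def scan(table):
    for row in table:
        rows_read["n"] += 1
        yield row

def select(rows, predicate):
    for row in rows:
        if predicate(row):
            yield row

def project(rows, attrs):
    for row in rows:
        yield tuple(row[a] for a in attrs)

# 1000 made-up employees, alternating between two departments.
employee = [{"ssnum": i, "dept": "IT" if i % 2 else "HR"} for i in range(1, 1001)]

def is_it(row):
    return row["dept"] == "IT"

# Pipelined plan: nothing is computed until a consumer asks for tuples.
pipeline = project(select(scan(employee), is_it), ["ssnum"])
first_two = list(islice(pipeline, 2))
read_for_first_two = rows_read["n"]

# A sort breaks the pipeline: it must consume its whole input
# before it can emit the first tuple.
by_ssnum_desc = sorted(select(scan(employee), is_it), key=lambda row: -row["ssnum"])
read_after_sort = rows_read["n"]

print(first_two, read_for_first_two, read_after_sort)
```

The first two results are available after reading only three tuples, while the sorted variant reads all 1000 tuples before returning anything. A materializing plan would instead build a full list after every operator.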
2.3. Step 3: Cost estimation

• Each relational algebra expression can result in many query execution plans
• Some query execution plans may be better than others
• Goal: find the fastest one
• The cost is just an estimate under certain assumptions
• A huge number of query plans may exist
2.3. Step 3: Cost estimation

• Cost estimation factors

• Catalog information: database maintains statistics about relations
• Ex.
• number of tuples per relation
• number of blocks on disk per relation
• number of distinct values per attribute
• histogram of values per attribute
• Problems
• cost can only be estimated
• updating statistics is expensive, thus they are often out of date
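A common textbook way to use such statistics is to estimate the result size of an equality selection as the number of tuples divided by the number of distinct values, assuming values are uniformly distributed. The sketch below illustrates that idea; the `RelationStats` class and all numbers in it are made up, and real optimizers use more refined models such as histograms.

```python
from dataclasses import dataclass

@dataclass
class RelationStats:
    """Catalog statistics for one relation (hypothetical structure)."""
    n_tuples: int    # number of tuples in the relation
    n_blocks: int    # number of disk blocks occupied by the relation
    distinct: dict   # attribute name -> number of distinct values

def estimate_equality_rows(stats, attr):
    """Estimated result size of 'attr = constant', assuming a uniform distribution."""
    return stats.n_tuples / stats.distinct[attr]

# Made-up statistics for the Employee relation.
employee_stats = RelationStats(
    n_tuples=100_000,
    n_blocks=2_000,
    distinct={"ssnum": 100_000, "dept": 50},
)

rows_for_dept = estimate_equality_rows(employee_stats, "dept")
rows_for_ssnum = estimate_equality_rows(employee_stats, "ssnum")
print(rows_for_dept, rows_for_ssnum)
```

If the statistics are out of date, or the values are heavily skewed, these estimates can be far off, which is one reason why the optimizer may pick a poor plan.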
2.3. Step 3: Cost estimation

• Choosing the cheapest query plan

• Problem:
• Estimating the cost of all possible plans is too expensive.
• Solutions:
• pruning: stop evaluating a plan early
• heuristics: do not evaluate all plans
• Real databases use a combination of both:
• Apply heuristics to choose promising query plans.
• Choose the cheapest plan among the promising plans using pruning.
• Examples of heuristics:
• perform selections as early as possible
• perform projections early
• avoid Cartesian products
2.3. Step 3: Cost estimation

• Heuristic rules
• Break apart conjunctive selections into a sequence of simple selections
• Move σ as far down the query tree as possible
• Replace σ-× pairs by ⋈
• Break apart Π and move it as far down the tree as possible
• Perform the joins with the smallest expected result first
Remark

• Query processing is the entire process, or set of activities, involved in retrieving data from the database
• Parser
• Optimizer
• Code generator
• Query optimizer
• Step 1: Equivalence transformation
• Step 2: Annotation for the algorithm of the RA expression
• Step 3: Cost estimation for different query execution plans
Why query tuning? Why is the query optimizer not enough?

• Optimizers are not perfect:

• transformations produce only a subset of all possible query plans
• only a subset of possible annotations might be considered
• cost of query plans can only be estimated
• Query Tuning: Make life easier for your query optimizer!

Figure out problematic queries

• Which queries should be rewritten?

• Rewrite queries that run “too slow”
• How to find these queries?
• the query issues far too many disk accesses, for example, a point query scans an entire table
• you look at the query plan and see that relevant indexes are not used

Overview of query tuning

• avoid DISTINCTs
• subqueries often inefficient
• temporary tables might help
• use clustering indexes for joins
• HAVING vs. WHERE
• use views with care
• system peculiarities: OR and order in FROM clause

Testbed scenario

• Employee(ssnum, name, manager, dept, salary, numfriends)

• clustering index on ssnum
• non-clustering index on name
• non-clustering index on dept
• keys: ssnum, name
• Student(ssnum, name, course, grade)
• clustering index on ssnum
• non-clustering index on name
• keys: ssnum, name
• Techdept(dept, manager, location)
• clustering index on dept
• key: dept
• a manager may manage many departments
• a location may contain many departments

DISTINCT

• How can DISTINCT hurt?

• DISTINCT forces a sort or other overhead.
• If not necessary, it should be avoided.
• Query: Find employees who work in the information systems department.
• SELECT DISTINCT ssnum
  FROM Employee
  WHERE dept = 'information systems'
• DISTINCT not necessary:
• ssnum is a key of Employee, so it is also a key of any subset of Employee.
• Note: Since an index is defined on ssnum, there is likely to be no overhead in this particular example.
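The claim that DISTINCT is redundant here can be checked on a small test database. The sketch below uses Python's built-in sqlite3 module with an in-memory database; SQLite and the sample rows are assumptions chosen for convenience, not the system used in the slides.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE Employee (
        ssnum INTEGER PRIMARY KEY,   -- key: values are unique
        name TEXT UNIQUE, manager TEXT, dept TEXT,
        salary INTEGER, numfriends INTEGER
    );
    INSERT INTO Employee VALUES
        (1, 'Ann', 'Zoe', 'information systems', 52000, 4),
        (2, 'Bob', 'Zoe', 'information systems', 48000, 2),
        (3, 'Cid', 'Yan', 'acquisitions', 61000, 7);
""")

with_distinct = conn.execute(
    "SELECT DISTINCT ssnum FROM Employee WHERE dept = 'information systems'"
).fetchall()
without_distinct = conn.execute(
    "SELECT ssnum FROM Employee WHERE dept = 'information systems'"
).fetchall()

# Because ssnum is a key, the plain query cannot return duplicates.
print(sorted(with_distinct) == sorted(without_distinct))
```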

Non-Correlated Subqueries
• Many systems handle subqueries inefficiently.
• Non-correlated: attributes of the outer query are not used in the inner query.
• Query:
• SELECT ssnum
  FROM Employee
  WHERE dept IN (SELECT dept FROM Techdept)
• May lead to inefficient evaluation:
• check for each employee whether they are in Techdept
• index on Employee.dept not used!
• Equivalent query (equivalent because dept is a key of Techdept, so the join produces no duplicates):
• SELECT ssnum
  FROM Employee, Techdept
  WHERE Employee.dept = Techdept.dept
• Efficient evaluation:
• look up employees for each dept in Techdept
• use index on Employee.dept
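The following sketch checks on a toy SQLite database that the subquery form and the join form return the same employees. SQLite, the sample rows, and the index name `idx_employee_dept` are assumptions for illustration; whether the rewrite is actually faster depends on the system and should be verified with its query plan tool.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE Employee (
        ssnum INTEGER PRIMARY KEY, name TEXT UNIQUE, manager TEXT,
        dept TEXT, salary INTEGER, numfriends INTEGER
    );
    CREATE INDEX idx_employee_dept ON Employee(dept);
    CREATE TABLE Techdept (
        dept TEXT PRIMARY KEY,       -- key: each department appears once
        manager TEXT, location TEXT
    );
    INSERT INTO Employee VALUES
        (1, 'Ann', 'Zoe', 'information systems', 52000, 4),
        (2, 'Bob', 'Zoe', 'research', 48000, 2),
        (3, 'Cid', 'Yan', 'acquisitions', 61000, 7);
    INSERT INTO Techdept VALUES
        ('information systems', 'Zoe', 'building A'),
        ('research', 'Zoe', 'building B');
""")

with_subquery = conn.execute("""
    SELECT ssnum FROM Employee
    WHERE dept IN (SELECT dept FROM Techdept)
""").fetchall()

with_join = conn.execute("""
    SELECT ssnum FROM Employee, Techdept
    WHERE Employee.dept = Techdept.dept
""").fetchall()

print(sorted(with_subquery) == sorted(with_join))
```

If Techdept.dept were not a key, the join could return an employee several times, and the two queries would no longer be equivalent.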

Temporary tables

• Temporary tables can hurt in the following ways:

• force operations to be performed in a suboptimal order (the optimizer often does a very good job!)
• in some systems, creating temporary tables causes a catalog update, a possible concurrency control bottleneck
• system may miss an opportunity to use an index
• Temporary tables are good:
• to rewrite complicated correlated subqueries
• to avoid ORDER BYs and scans in specific cases (see example)

Ex. Unnecessary temp table

• Query: Find all IT department employees who earn more than 40000.
• SELECT * INTO Temp
  FROM Employee
  WHERE salary > 40000

  SELECT ssnum
  FROM Temp
  WHERE Temp.dept = 'IT'
• Inefficient SQL:
• index on dept cannot be used
• overhead to create the Temp table (materialization vs. pipelining)
• Efficient SQL:
• SELECT ssnum
  FROM Employee
  WHERE Employee.dept = 'IT'
  AND salary > 40000
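The two variants can be compared on a toy SQLite database. SQLite does not support `SELECT ... INTO`, so this sketch uses `CREATE TEMP TABLE ... AS SELECT` instead, and the temporary table is named `HighEarners` rather than `Temp`; both substitutions, like the sample rows, are assumptions for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE Employee (
        ssnum INTEGER PRIMARY KEY, name TEXT UNIQUE, manager TEXT,
        dept TEXT, salary INTEGER, numfriends INTEGER
    );
    CREATE INDEX idx_employee_dept ON Employee(dept);
    INSERT INTO Employee VALUES
        (1, 'Ann', 'Zoe', 'IT', 52000, 4),
        (2, 'Bob', 'Zoe', 'IT', 38000, 2),
        (3, 'Cid', 'Yan', 'HR', 61000, 7);
""")

# Variant 1: materialize an intermediate result first.
# The temporary table has no index on dept.
conn.execute("""
    CREATE TEMP TABLE HighEarners AS
    SELECT * FROM Employee WHERE salary > 40000
""")
via_temp = conn.execute(
    "SELECT ssnum FROM HighEarners WHERE dept = 'IT'"
).fetchall()

# Variant 2: a single query, so the optimizer sees both conditions
# and may use the index on Employee.dept.
direct = conn.execute("""
    SELECT ssnum FROM Employee
    WHERE Employee.dept = 'IT' AND salary > 40000
""").fetchall()

print(via_temp == direct)
```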

Joins: Use clustering indexes and numeric values

• Query: Find all students who are also employees.

• Inefficient SQL:
• SELECT Employee.ssnum
  FROM Employee, Student
  WHERE Employee.name = Student.name
• Efficient SQL:
• SELECT Employee.ssnum
  FROM Employee, Student
  WHERE Employee.ssnum = Student.ssnum
• Benefits:
• Join on two clustering indexes allows merge join (fast!).
• Numeric equality is evaluated faster than string equality.

Don’t use HAVING where WHERE is enough

• Query: Find the average salary of the IT department.

• Inefficient SQL:
• SELECT AVG(salary) AS avgsalary, dept
  FROM Employee
  GROUP BY dept
  HAVING dept = 'IT'
• Problem: May first compute the average for the employees of all departments.
• Efficient SQL: Compute the average only for the relevant employees.
• SELECT AVG(salary) AS avgsalary, dept
  FROM Employee
  WHERE dept = 'IT'
  GROUP BY dept
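A quick check on a toy SQLite database confirms that both formulations return the same result, so the WHERE version is a safe rewrite here. SQLite and the sample rows are assumptions for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE Employee (
        ssnum INTEGER PRIMARY KEY, name TEXT UNIQUE, manager TEXT,
        dept TEXT, salary INTEGER, numfriends INTEGER
    );
    INSERT INTO Employee VALUES
        (1, 'Ann', 'Zoe', 'IT', 52000, 4),
        (2, 'Bob', 'Zoe', 'IT', 38000, 2),
        (3, 'Cid', 'Yan', 'HR', 61000, 7);
""")

# Filter after grouping: every department may be aggregated first.
with_having = conn.execute("""
    SELECT AVG(salary) AS avgsalary, dept
    FROM Employee
    GROUP BY dept
    HAVING dept = 'IT'
""").fetchall()

# Filter before grouping: only IT rows reach the aggregation.
with_where = conn.execute("""
    SELECT AVG(salary) AS avgsalary, dept
    FROM Employee
    WHERE dept = 'IT'
    GROUP BY dept
""").fetchall()

print(with_having == with_where)
```

HAVING remains necessary when the condition refers to an aggregate, for example `HAVING AVG(salary) > 40000`.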

Use views with care

• Views: macros for queries

• queries look simpler
• but are never faster and sometimes slower
• Creating a view:
• CREATE VIEW Techlocation
  AS SELECT ssnum, Techdept.dept, location
  FROM Employee, Techdept
  WHERE Employee.dept = Techdept.dept
• Using the view:
• SELECT location
  FROM Techlocation
  WHERE ssnum = 452354786
• System expands the view and executes:
• SELECT location
  FROM Employee, Techdept
  WHERE Employee.dept = Techdept.dept
  AND ssnum = 452354786

• Query: Get the department name for the employee with social security number 452354786 (who works in a technical department).
• Example of inefficient SQL:
• SELECT dept
  FROM Techlocation
  WHERE ssnum = 452354786
• This SQL expands to:
• SELECT Techdept.dept
  FROM Employee, Techdept
  WHERE Employee.dept = Techdept.dept
  AND ssnum = 452354786
• But there is a more efficient SQL query (no join!) that does the same thing:
• SELECT dept
  FROM Employee
  WHERE ssnum = 452354786
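The sketch below recreates the view on a toy SQLite database and compares the query through the view with the direct query on Employee. SQLite, the sample rows, the explicit column aliases in the view, and the helper functions `dept_via_view` and `dept_direct` are assumptions for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE Employee (
        ssnum INTEGER PRIMARY KEY, name TEXT UNIQUE, manager TEXT,
        dept TEXT, salary INTEGER, numfriends INTEGER
    );
    CREATE TABLE Techdept (dept TEXT PRIMARY KEY, manager TEXT, location TEXT);
    INSERT INTO Employee VALUES
        (452354786, 'Ann', 'Zoe', 'IT', 52000, 4),
        (2, 'Bob', 'Yan', 'sales', 38000, 2);
    INSERT INTO Techdept VALUES ('IT', 'Zoe', 'building A');

    CREATE VIEW Techlocation AS
    SELECT ssnum AS ssnum, Techdept.dept AS dept, location AS location
    FROM Employee, Techdept
    WHERE Employee.dept = Techdept.dept;
""")

def dept_via_view(ssnum):
    """Goes through the view, so the join with Techdept is executed."""
    return conn.execute(
        "SELECT dept FROM Techlocation WHERE ssnum = ?", (ssnum,)
    ).fetchall()

def dept_direct(ssnum):
    """Reads Employee only, no join."""
    return conn.execute(
        "SELECT dept FROM Employee WHERE ssnum = ?", (ssnum,)
    ).fetchall()

print(dept_via_view(452354786), dept_direct(452354786))
print(dept_via_view(2), dept_direct(2))
```

The two queries agree only for employees who work in a technical department, which is the condition stated in the query description. For other employees the view returns no row, so the rewrite is valid only when that condition is known to hold.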

System peculiarity: Indexes and OR

• Some systems never use indexes when conditions are OR-connected.
• Query: Find employees with name Smith or who are in the acquisitions department.
• SELECT Employee.ssnum
  FROM Employee
  WHERE Employee.name = 'Smith'
  OR Employee.dept = 'acquisitions'
• Fix: use UNION instead of OR
• SELECT Employee.ssnum
  FROM Employee
  WHERE Employee.name = 'Smith'
  UNION
  SELECT Employee.ssnum
  FROM Employee
  WHERE Employee.dept = 'acquisitions'
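The rewrite can be checked on a toy SQLite database. SQLite, the sample rows, and the index name are assumptions for illustration; whether a given system actually ignores indexes for OR conditions has to be confirmed with its own query plan tool.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE Employee (
        ssnum INTEGER PRIMARY KEY, name TEXT UNIQUE, manager TEXT,
        dept TEXT, salary INTEGER, numfriends INTEGER
    );
    CREATE INDEX idx_employee_dept ON Employee(dept);
    INSERT INTO Employee VALUES
        (1, 'Smith', 'Zoe', 'acquisitions', 52000, 4),
        (2, 'Jones', 'Zoe', 'acquisitions', 38000, 2),
        (3, 'Lee', 'Yan', 'IT', 61000, 7);
""")

with_or = conn.execute("""
    SELECT Employee.ssnum FROM Employee
    WHERE Employee.name = 'Smith'
       OR Employee.dept = 'acquisitions'
""").fetchall()

# Each branch has a single condition that can use its own index.
# UNION removes duplicates, so Smith (who matches both branches) appears once.
with_union = conn.execute("""
    SELECT Employee.ssnum FROM Employee
    WHERE Employee.name = 'Smith'
    UNION
    SELECT Employee.ssnum FROM Employee
    WHERE Employee.dept = 'acquisitions'
""").fetchall()

print(sorted(with_or) == sorted(with_union))
```

The rewrite preserves the result here because ssnum is a key, so removing duplicates with UNION cannot drop a row that the OR query would return.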
