Cost Estimation For Query Optimization

dfzfbd

Uploaded by

abhive106

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF or read online on Scribd

0% found this document useful (0 votes)

53 views14 pages

Cost Estimation For Query Optimization

dfzfbd

Uploaded by

abhive106

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF or read online on Scribd

You are on page 1/ 14

Cost Estimation in Query OptimizationCost Estimation in Query Optimization * The main aim of query optimization is to choose the most efficient way of implementing the relational algebra operations at the lowest possible cost. * The query optimizer should not depend solely on heuristic rules, but, it should also estimate the cost of executing the different strategies and find out the strategy with the minimum cost estimate.* The cost functions used in query optimization are estimates and not exact cost functions. * The cost of an operation is heavily dependent on its ‘selectivity, that |s, the proportion of select operation(s) that forms the output. * In general the differentyalgorithmsyarewsuitable for low or high selectivity queries. * In order for query optimizer to choose suitable algorithm for an operation an estimate of the cost of executing that algorithm must be provided* The cost of an algorithm is depend of a cardinality of its input. * To estimate the cost of different query execution strategies, the query tree is viewed as containing a series of basic operations which are linked in order to perform the query. * It is also important to know the expected cardinality of an operation’s output because this forms the input to the next operation.Cost Components of Query Execution The cost of executing the query includes the following components: — Access cost to secondary storage. — Storage cost. — Computation cost. — Memory uses cost. — Communication cost.Importance of Access cost Out of the above five cost components, the most important is the secondary storage access cost. The emphasis of the cost minimization depends onthe size and type of database applications. For example in smaller database the emphasis is on the minimizingwcomputinguicost as because most of the data in the files involve in the query can be completely store in the main memory. For largé database, the main emphasis is on* For distributed database, the communication cost is minimized as because many sites are involved for the data transfer. * To estimate the cost of various execution strategies, we must keep track of any information that is needed for the cost function. >This information may be stored in database — catalog, where it is accessed by the query optimizer.Information in system Catalogue The number of tuples in relation as R [nTuples(R)]. The average record size in relation R. The number of blocks required to store relation R as [nBlocks(R)]. The blocking factors in relation R (that is the number of tuples of R that fit into one block) as [bFactor(R)]. Primary access method for each file. Primary access attributes for each file. The number of level of each multilevel index | (primary, secondary or clustering) as [nLevelsA(|)].The number of first level index blocks as [nBlocksA (I)]. The number of distinct values that are appear for attribute A in relation R as [nDistinctA(R)]. The minimum and maximum possible values for attribute A in relation R as [minA(R), maxA(R)]. The selectivity of an attribute, which is the fraction of records satisfying an equality condition on the attribute. The selection cardinality of given attribute Ain relation Ras [SCA(R)]. The selection cardinality is the average number of tuples that satisfied an equality condition on attribute A.Cost functions for SELECT Operation * Linear Search: — [nBlocks(R)/2], if the record is found. — [nBlocks(R)], if no record satisfied the condition. * Binary Search : 2 [log2(nBlocks(R))], if equality condition is on key attribute, because SCA(R) = 1 in this case. o [log2(nBlocks(R))] + [SCA(R)/bFactor(R)] — 1, otherwise.* Equity condition on Primary key — [nLevelA(1) + 1] * Equity condition on Non-Primary key :- — [nLevelA(|) + 1] + [nBlocks(R)/2]Cost functions for JOIN Operation * Join operation is the most time consuming operation to process. * An estimate for the size (number of tuples) of the file that results after the JOIN operation is required to develop reasonably accurate cost functions for JOIN operations. * The JOIN operations define the relation containing tuples that satisfy a specific predicate F from the Cartesian product of two relations R and S.Different strategies for JOIN operations Strategies Cost Estimation Block nested-loop JOIN a) nBlocks(R) + (nBlocks(R) * nBlocks(S)) If the buffer has only one block b) nBlocks(R) + [ nBlocks(S) * ( nBlocks(R)/(nBuffer-2) ) ] If (nBuffer-2) blocks is there for R cc) nBlocks(R) + nBlocks(S) Ifall blocks of R can be read into database buffer Indexed nested-loop a) nBlocks(R) + nTuples(R) * (nLevel,(l) + 1) JOIN Ifjoin attribute Ain Sis a primary key b) nBlocks(R) + nTuples(R) * (nLevel,(l) + [SC,(R) / bFactor(R) } ) If clustering index | is on attribute A.Different strategies for JOIN operations Sort-merge JOIN a) nBlocks(R) *[ logenBlocks(R) | + nBlocks(S) * [ lognBlocks(R) ] For Sort b) nBlocks(R) +nBlocks(S) For Merge Hash JOIN a) 3(nBlocks(R) + nBlocks(S)) If Hash index is in memory b) 2(nBlocks(R) + nBlocks(S}) * [log (nBlocks(S)) - 1] + nBlocks(R) + nBlocks(S) Otherwise

Advanced Database Systems Lecture Notes
No ratings yet
Advanced Database Systems Lecture Notes
79 pages
7-Query Processing
No ratings yet
7-Query Processing
47 pages
1.3 PPT - Measure of Query Cost
100% (1)
1.3 PPT - Measure of Query Cost
42 pages
Introduction To Query Processing
No ratings yet
Introduction To Query Processing
21 pages
05 QueryProcessing LecW4 Feb7 22
No ratings yet
05 QueryProcessing LecW4 Feb7 22
55 pages
Query Processing and Optimisation - Intr
No ratings yet
Query Processing and Optimisation - Intr
41 pages
QueryProcess Optim
No ratings yet
QueryProcess Optim
60 pages
20 Cost Based Optimization Annotated
No ratings yet
20 Cost Based Optimization Annotated
52 pages
Lecture Notes
No ratings yet
Lecture Notes
96 pages
CH 7 Query Optimizations
No ratings yet
CH 7 Query Optimizations
48 pages
05 Optimization
No ratings yet
05 Optimization
58 pages
Query Optimization
No ratings yet
Query Optimization
20 pages
Query Optimization
No ratings yet
Query Optimization
7 pages
ADBMS Assignment
No ratings yet
ADBMS Assignment
19 pages
Session - 10 Querying
No ratings yet
Session - 10 Querying
36 pages
ADB Slides 4
No ratings yet
ADB Slides 4
47 pages
Measures of Query Cost
No ratings yet
Measures of Query Cost
15 pages
Chapter 12 - 2
No ratings yet
Chapter 12 - 2
38 pages
Database Technology Query Processing: Heiko Paulheim
No ratings yet
Database Technology Query Processing: Heiko Paulheim
60 pages
QueryOptimization Siao
No ratings yet
QueryOptimization Siao
24 pages
Unit 4
No ratings yet
Unit 4
24 pages
15 QueryOptimization
No ratings yet
15 QueryOptimization
24 pages
Query Processing Concepts
No ratings yet
Query Processing Concepts
99 pages
QEII
No ratings yet
QEII
44 pages
UNIT 4 Query Processing and Different Types of Databases
No ratings yet
UNIT 4 Query Processing and Different Types of Databases
13 pages
Heuristic-Based Query Optimization
No ratings yet
Heuristic-Based Query Optimization
6 pages
Lesson 05
No ratings yet
Lesson 05
29 pages
CH 13 Updated
No ratings yet
CH 13 Updated
30 pages
DBMS Unit5 Lecture1
No ratings yet
DBMS Unit5 Lecture1
22 pages
Overview Ioannidis Chapter
No ratings yet
Overview Ioannidis Chapter
3 pages
11 Query Evaluations
No ratings yet
11 Query Evaluations
17 pages
Unit-2 Query Processing and Optimization, Query Equivalence, Join Strategies
No ratings yet
Unit-2 Query Processing and Optimization, Query Equivalence, Join Strategies
38 pages
Chapter 13: Query Processing: Database System Concepts, 5th Ed
No ratings yet
Chapter 13: Query Processing: Database System Concepts, 5th Ed
55 pages
Dbms Seminar
No ratings yet
Dbms Seminar
24 pages
Lecture11 Query Processing
No ratings yet
Lecture11 Query Processing
37 pages
13 QP1
No ratings yet
13 QP1
33 pages
Query Processing and Query Optimization Techniques
No ratings yet
Query Processing and Query Optimization Techniques
20 pages
Unit 1
No ratings yet
Unit 1
23 pages
Adbms Unit 2
No ratings yet
Adbms Unit 2
137 pages
1.6 PPT - Query Optimization
No ratings yet
1.6 PPT - Query Optimization
53 pages
Query Processing
No ratings yet
Query Processing
39 pages
Overview of Query Evaluation: R&G Chapter 12
No ratings yet
Overview of Query Evaluation: R&G Chapter 12
30 pages
Database Modeling - notes-VI
No ratings yet
Database Modeling - notes-VI
8 pages
Chapter 13: Query Processing
No ratings yet
Chapter 13: Query Processing
55 pages
DBMS R19 Unit Iv
No ratings yet
DBMS R19 Unit Iv
25 pages
Q Evaluation
No ratings yet
Q Evaluation
17 pages
ADBMS TypicalQueryOptimizer
No ratings yet
ADBMS TypicalQueryOptimizer
30 pages
Advance Database Management System: Unit - 2 .Query Processing and Optimization
No ratings yet
Advance Database Management System: Unit - 2 .Query Processing and Optimization
38 pages
Unit IV Part II
No ratings yet
Unit IV Part II
37 pages
Query Processing and Optimization
No ratings yet
Query Processing and Optimization
127 pages
Rdbms Assignment
No ratings yet
Rdbms Assignment
12 pages
Relational Query Optimization: Warih Maharani, ST.,MT
No ratings yet
Relational Query Optimization: Warih Maharani, ST.,MT
39 pages
DBMS
No ratings yet
DBMS
24 pages
3 Query Processing and Optimization-1
No ratings yet
3 Query Processing and Optimization-1
18 pages
Query Processing and Optimization
No ratings yet
Query Processing and Optimization
28 pages
Ch12-Query Processing
No ratings yet
Ch12-Query Processing
34 pages
Query Proc Notes
No ratings yet
Query Proc Notes
10 pages
Measures of Query Cost
No ratings yet
Measures of Query Cost
15 pages

Cost Estimation For Query Optimization

Uploaded by

Cost Estimation For Query Optimization

Uploaded by

You might also like