
Distributed Query Optimization Techniques

The document discusses query optimization in distributed database systems. It describes the distributed query processing architecture where queries are optimized globally and locally. The global optimizer maps global queries to local queries by fragmenting tables across sites. It generates an execution plan to minimize data transfer. Local queries are then optimized and merged to return the overall result. Distributed query optimization aims to find an optimal solution by optimally using resources, performing query trading, and reducing the solution space.

Uploaded by

kirosmhret97


Query Optimization in Distributed Systems

This chapter discusses query optimization in distributed database systems.

Distributed Query Processing Architecture

In a distributed database system, processing a query involves optimization at both the
global and the local level. The query enters the database system at the client or controlling site.
Here, the user is validated, and the query is checked, translated, and optimized at a global level.

The architecture can be represented as −

[Figure: distributed query processing architecture]

Mapping Global Queries into Local Queries

The process of mapping global queries to local ones can be realized as follows −

The tables required in a global query have fragments distributed across multiple sites.
The local databases have information only about local data. The controlling site uses the
global data dictionary to gather information about the distribution and reconstructs the
global view from the fragments.

If there is no replication, the global optimizer runs local queries at the sites where the
fragments are stored. If there is replication, the global optimizer selects the site based
on communication cost, workload, and server speed.

The global optimizer generates a distributed execution plan so that the least amount of data
is transferred across the sites. The plan states the location of the fragments, the order in
which the query steps need to be executed, and the processes involved in transferring
intermediate results.

The local queries are optimized by the local database servers. Finally, the local query
results are merged through a union operation in the case of horizontal fragments and
a join operation in the case of vertical fragments.
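
The replica choice and the final merge step described above can be sketched in Python. This is only an illustration: the site names, the cost figures, and the scoring formula in `estimated_cost` are hypothetical, and a real optimizer would use its own cost model built from the global data dictionary.

```python
from dataclasses import dataclass


@dataclass
class Replica:
    site: str
    comm_cost: float  # cost of shipping results from this site (hypothetical units)
    workload: float   # current load, from 0.0 (idle) to 1.0 (saturated)
    speed: float      # relative server speed, higher is faster


def pick_replica(replicas):
    """Pick the replica with the lowest estimated cost (assumed formula)."""
    def estimated_cost(r):
        # A busier or slower server inflates the processing share of the cost.
        return r.comm_cost + (1.0 + r.workload) / r.speed
    return min(replicas, key=estimated_cost)


def merge_horizontal(fragment_results):
    """Horizontal fragments hold different rows: merge them with a union."""
    merged = []
    for rows in fragment_results:
        merged.extend(rows)
    return merged


def merge_vertical(left_rows, right_rows, key):
    """Vertical fragments hold different columns: merge them with a join on the key."""
    right_by_key = {row[key]: row for row in right_rows}
    return [{**row, **right_by_key[row[key]]}
            for row in left_rows if row[key] in right_by_key]


replicas = [
    Replica("SiteA", comm_cost=5.0, workload=0.9, speed=1.0),
    Replica("SiteB", comm_cost=6.0, workload=0.1, speed=4.0),
]
chosen = pick_replica(replicas)
print(chosen.site)
```

In this example the second site wins despite its higher communication cost, because it is both faster and less loaded.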

For example, let us consider that the following Project schema is horizontally fragmented
according to City, the cities being New Delhi, Kolkata and Hyderabad.

PROJECT (PId, City, Department, Status)

Suppose there is a query to retrieve details of all projects whose status is “Ongoing”.

The global query will be −

$$\sigma_{status = \text{"ongoing"}}(PROJECT)$$

Query in New Delhi’s server will be −

$$\sigma_{status = \text{"ongoing"}}(NewD\_PROJECT)$$

Query in Kolkata’s server will be −

$$\sigma_{status = \text{"ongoing"}}(Kol\_PROJECT)$$

Query in Hyderabad’s server will be −

$$\sigma_{status = \text{"ongoing"}}(Hyd\_PROJECT)$$

In order to get the overall result, we need to union the results of the three queries as follows −

$$\sigma_{status = \text{"ongoing"}}(NewD\_PROJECT) \cup \sigma_{status = \text{"ongoing"}}(Kol\_PROJECT) \cup \sigma_{status = \text{"ongoing"}}(Hyd\_PROJECT)$$
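
The same example can be traced in a few lines of Python. The sample rows below are invented for illustration; only the schema and the three fragment names come from the example above.

```python
# Hypothetical sample rows for each horizontal fragment of PROJECT.
fragments = {
    "NewD_PROJECT": [
        {"PId": "P1", "City": "New Delhi", "Department": "Civil", "Status": "Ongoing"},
        {"PId": "P2", "City": "New Delhi", "Department": "IT", "Status": "Completed"},
    ],
    "Kol_PROJECT": [
        {"PId": "P3", "City": "Kolkata", "Department": "IT", "Status": "Ongoing"},
    ],
    "Hyd_PROJECT": [
        {"PId": "P4", "City": "Hyderabad", "Department": "HR", "Status": "Planned"},
        {"PId": "P5", "City": "Hyderabad", "Department": "Civil", "Status": "Ongoing"},
    ],
}


def select_ongoing(rows):
    """Local query: the selection status = "ongoing" applied to one fragment."""
    return [row for row in rows if row["Status"].lower() == "ongoing"]


# Each site runs the selection on its own fragment.
local_results = [select_ongoing(rows) for rows in fragments.values()]

# The controlling site takes the union of the local results.
global_result = [row for result in local_results for row in result]
ongoing_ids = sorted(row["PId"] for row in global_result)
print(ongoing_ids)
```

Because the fragments partition PROJECT by City, the union of the local selections returns the same rows as running the selection on the whole table.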

Distributed Query Optimization

Distributed query optimization requires the evaluation of a large number of query trees, each of
which produces the required results of a query. This is primarily due to the presence of a large
amount of replicated and fragmented data. Hence, the target is to find a good, near-optimal
solution at reasonable cost rather than to search exhaustively for the single best solution.

The main issues for distributed query optimization are −

Optimal utilization of resources in the distributed system.

Query trading.

Reduction of solution space of the query.

Optimal Utilization of Resources in the Distributed System

A distributed system has a number of database servers at its various sites to perform the
operations pertaining to a query. The following are the approaches for optimal resource utilization −

Operation Shipping − In operation shipping, the operation is run at the site where the data is
stored and not at the client site. The results are then transferred to the client site. This is
appropriate for operations where the operands are available at the same site. Example: Select
and Project operations.

Data Shipping − In data shipping, the data fragments are transferred to the database server,
where the operations are executed. This is used in operations where the operands are distributed
at different sites. This is also appropriate in systems where the communication costs are low,
and local processors are much slower than the client server.

Hybrid Shipping − This is a combination of data and operation shipping. Here, data fragments are
transferred to the high-speed processors, where the operation runs. The results are then sent to
the client site.
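
The trade-off between operation shipping and data shipping can be made concrete with a toy cost model. The sketch below is only an assumption for illustration: the per-row cost parameters and the linear formulas are hypothetical, not taken from any particular system.

```python
def operation_shipping_cost(input_rows, result_rows,
                            server_cost_per_row, transfer_cost_per_row):
    """Run the operation where the data is stored, then ship only the result."""
    return input_rows * server_cost_per_row + result_rows * transfer_cost_per_row


def data_shipping_cost(input_rows, receiver_cost_per_row, transfer_cost_per_row):
    """Ship the whole fragment, then run the operation at the receiving site."""
    return input_rows * transfer_cost_per_row + input_rows * receiver_cost_per_row


def choose_strategy(input_rows, result_rows, server_cost_per_row,
                    receiver_cost_per_row, transfer_cost_per_row):
    """Return the name of the cheaper strategy under this toy model."""
    op = operation_shipping_cost(input_rows, result_rows,
                                 server_cost_per_row, transfer_cost_per_row)
    data = data_shipping_cost(input_rows, receiver_cost_per_row,
                              transfer_cost_per_row)
    return "operation shipping" if op <= data else "data shipping"


# A selective query over an expensive network favours operation shipping.
expensive_network = choose_strategy(10_000, 100, server_cost_per_row=1.0,
                                    receiver_cost_per_row=1.0,
                                    transfer_cost_per_row=5.0)

# A cheap network and a slow local processor favour data shipping.
slow_local_server = choose_strategy(10_000, 100, server_cost_per_row=10.0,
                                    receiver_cost_per_row=1.0,
                                    transfer_cost_per_row=0.01)
print(expensive_network, "/", slow_local_server)
```

Hybrid shipping corresponds to making this choice per operation, so that each step of a plan runs wherever its estimated cost is lowest.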

Query Trading

In the query trading algorithm for distributed database systems, the controlling/client site for a
distributed query is called the buyer, and the sites where the local queries execute are called
sellers. The buyer formulates a number of alternatives for choosing sellers and for reconstructing
the global results. The target of the buyer is to achieve the optimal cost.

The algorithm starts with the buyer assigning sub-queries to the seller sites. The optimal plan is
created from the locally optimized query plans proposed by the sellers, combined with the
communication cost of reconstructing the final result. Once the global optimal plan is
formulated, the query is executed.
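
A single round of this buyer and seller exchange might look like the following Python sketch. It is a deliberate simplification: the sub-query names, site names, offer values, and the rule of adding a fixed per-site communication cost are all hypothetical, and a full query trading algorithm may negotiate over several rounds.

```python
def choose_offers(offers, comm_cost):
    """For every sub-query, pick the seller whose offer plus communication cost is lowest.

    offers:    {subquery: {seller: cost of the seller's local plan}}
    comm_cost: {seller: cost of shipping that seller's result to the buyer}
    """
    plan = {}
    for subquery, bids in offers.items():
        totals = {seller: cost + comm_cost[seller] for seller, cost in bids.items()}
        best_seller = min(totals, key=totals.get)
        plan[subquery] = (best_seller, totals[best_seller])
    return plan


# Offers returned by the sellers for two sub-queries (hypothetical numbers).
offers = {
    "q_NewD": {"Site1": 4.0, "Site2": 6.0},
    "q_Kol": {"Site2": 3.0, "Site3": 2.5},
}
comm_cost = {"Site1": 1.0, "Site2": 0.5, "Site3": 2.0}

plan = choose_offers(offers, comm_cost)
total_cost = sum(cost for _, cost in plan.values())
print(plan, total_cost)
```

Note that Site3 makes the cheapest local offer for the second sub-query but still loses, because the buyer weighs each offer together with the cost of bringing the result back.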

Reduction of Solution Space of the Query

Finding an optimal solution generally involves reducing the solution space so that the cost of
query execution and data transfer is reduced. This can be achieved through a set of heuristic
rules, just as in centralized systems.

The following are some of the rules −

Perform selection and projection operations as early as possible. This reduces the data
flow over the communication network.

Simplify operations on horizontal fragments by eliminating selection conditions that are
not relevant to a particular site.

In the case of join and union operations involving fragments located at multiple sites,
transfer the fragmented data to the site where most of the data is present and perform
the operation there.

Use a semi-join operation to qualify the tuples that are to be joined. This reduces the amount of
data transferred, which in turn reduces the communication cost.

Merge the common leaves and sub-trees in a distributed query tree.
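
The semi-join rule is the least obvious of these, so here is a minimal Python sketch of it. The two relations, their rows, and the function name `semi_join_transfer` are hypothetical; the point is only to count how many tuples cross the network compared with shipping a whole relation.

```python
def semi_join_transfer(left_rows, right_rows, key):
    """Join two relations stored at different sites using a semi-join.

    Returns the joined rows, the number of key values shipped to the right
    site, and the number of qualified tuples shipped back.
    """
    # Step 1: the left site ships only its distinct join-key values.
    left_keys = {row[key] for row in left_rows}
    # Step 2: the right site keeps only the tuples that will find a join partner.
    qualified = [row for row in right_rows if row[key] in left_keys]
    # Step 3: the qualified tuples are shipped back and joined at the left site.
    joined = [{**l, **r} for l in left_rows for r in qualified if l[key] == r[key]]
    return joined, len(left_keys), len(qualified)


# PROJECT rows at one site, hypothetical ASSIGNMENT rows at another.
projects = [
    {"PId": "P1", "Status": "Ongoing"},
    {"PId": "P2", "Status": "Ongoing"},
]
assignments = [
    {"EmpId": "E1", "PId": "P1"},
    {"EmpId": "E2", "PId": "P1"},
    {"EmpId": "E3", "PId": "P9"},
    {"EmpId": "E4", "PId": "P7"},
    {"EmpId": "E5", "PId": "P2"},
]

joined, keys_sent, tuples_returned = semi_join_transfer(projects, assignments, "PId")
print(len(joined), keys_sent, tuples_returned)
```

Here two key values go out and three tuples come back, instead of all five assignment tuples being shipped. The saving grows with the share of tuples that have no join partner.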
