Multidatabase Query
Processing
by:
Kay Kay Chetty
Jason Eversole
Table of Contents
Issues in Multidatabase Query Processing
Multidatabase Query Processing Architecture
Query Rewriting Using Views
Query Optimization and Execution
Query Translation and Execution
Conclusion
2
Introduction
This chapter will focus on query processing that provides interoperability
among a set of DBMSs.
This is only one part of the more general interoperability problem.
Distributed applications pose major requirements regarding the databases
they access, in particular the ability to access legacy data as well as newly
developed databases.
So, providing integrated access to multiple distributed databases and other
heterogeneous data sources has become a major focus.
3
Issues in Multidatabase Query Processing
The autonomy of the component DBMSs poses problems. So, we define this
autonomy along three main dimensions.
Communication: This means that a component DBMS communicates with others at
its own discretion. It may terminate its services at any time.
Design: This may restrict the availability and accuracy of cost information that is
needed for query optimization.
Execution: It is difficult to apply some of the query optimization strategies
discussed previously. Problems arise because communication with
component DBMSs occurs at a high level of the DBMS API.
4
Issues in Multidatabase Query Processing
In distributed DBMSs, query processors have to deal only with data distribution
across multiple sites.
In a distributed multidatabase environment, data is distributed not only across
sites but also across multiple databases.
This difference increases the number of parties involved in processing a
query, from two in the distributed DBMS to three in the distributed
multidatabase environment.
5
Multidatabase Query Processing
Architecture
Mediator/Wrapper Architecture
Each component database has a wrapper that exports source schema, data and
query processing capabilities
A mediator centralizes information from the wrappers in a unified view of available
data, which is stored in a global data dictionary
The mediator also performs query processing, using the wrappers to access the
component DBMSs.
The data model used by the mediator can be:
Relational
Object-Oriented
Semi-Structured
6
Multidatabase Query Processing
Architecture
Mediator/Wrapper Architecture has several advantages
The specialized components allow various concerns of different kinds of users to be
handled separately
Mediators typically specialize in a related set of component databases with similar
data
These lead to a flexible and extensible distributed system
It allows seamless integration of different data stored in very different
components
7
Multidatabase Query Processing
Architecture
Three main layers are involved in multidatabase query processing
The first two layers map input query into an optimized distributed query execution
plan (QEP)
They also perform the functions of query rewriting, query optimization and some
query processing
The third layer performs query translation and execution using the wrappers
8
Query Rewriting Using Views
Query rewriting reformulates the input query that is expressed on global
relations into one on local relations
Since the query is expressed on the global schema, it must be rewritten using views
Rewriting techniques differ depending on the database integration approach
used:
Global-as-view (GAV)
Local-as-view (LAV)
Datalog, a logic-based database language, is used for query rewriting using
views
9
Datalog Terminology
Datalog can be viewed as an in-line version of domain relational calculus
Conjunctive queries (select-project-join queries):
Q(T) :- R1(T1), …, Rn(Tn)
Q(T) is the head of the query and denotes the result relation
R1(T1), … , Rn(Tn) are the subgoals in the body and denote database relations
Q and R1, … , Rn are predicate names and correspond to relation names
T, T1, … , Tn refer to relation tuples and contain variables or constants
10
Datalog Terminology
Consider relations EMP(ENO, ENAME, TITLE, CITY) and ASG(ENO, PNO, DUR)
Assume ENO is the primary key of EMP and (ENO, PNO) is of ASG
SQL Query:
SELECT ENO, TITLE, PNO
FROM EMP, ASG
WHERE EMP.ENO = ASG.ENO AND (TITLE = 'Programmer' OR DUR = 24)
Query in Datalog:
Q(ENO, TITLE, PNO) :- EMP(ENO, ENAME, "Programmer", CITY),
                      ASG(ENO, PNO, DUR)
Q(ENO, TITLE, PNO) :- EMP(ENO, ENAME, TITLE, CITY),
                      ASG(ENO, PNO, 24)
11
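As a cross-check, the union of the two Datalog rules above can be evaluated directly over small in-memory relations. A minimal sketch; the tuple values below are invented for illustration.

```python
# EMP(ENO, ENAME, TITLE, CITY) and ASG(ENO, PNO, DUR), with made-up contents.
EMP = [
    ("E1", "J. Doe", "Programmer", "Paris"),
    ("E2", "M. Smith", "Analyst", "London"),
]
ASG = [
    ("E1", "P1", 12),
    ("E2", "P2", 24),
]

def rule1():
    # Q(ENO, TITLE, PNO) :- EMP(ENO, ENAME, "Programmer", CITY), ASG(ENO, PNO, DUR)
    return {(e, t, p)
            for (e, _name, t, _city) in EMP if t == "Programmer"
            for (e2, p, _dur) in ASG if e2 == e}

def rule2():
    # Q(ENO, TITLE, PNO) :- EMP(ENO, ENAME, TITLE, CITY), ASG(ENO, PNO, 24)
    return {(e, t, p)
            for (e, _name, t, _city) in EMP
            for (e2, p, dur) in ASG if e2 == e and dur == 24}

# The answer to Q is the union of the two conjunctive queries.
Q = rule1() | rule2()
print(sorted(Q))  # [('E1', 'Programmer', 'P1'), ('E2', 'Analyst', 'P2')]
```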
Rewriting in GAV
In this approach, the global schema is expressed in terms of the data sources
Each global relation is defined as a view over the local relations
The rewriting technique that will be used is called unfolding
It replaces each global relation invoked in the query with its corresponding view
This is done by applying the view definition rules to the query and producing
a union of conjunctive queries
Unfolding can however generate redundant queries that need to be
eliminated
12
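A minimal sketch of GAV unfolding, under the simplifying assumption that variable renaming is ignored: each global subgoal is replaced by the body of its view over local relations. The view definition below is hypothetical.

```python
# Hypothetical GAV view definitions: global relation name -> its body,
# a list of subgoals over local relations.
VIEWS = {
    "EMPASG": [["db1.EMP", "ENO", "ENAME", "CITY"],
               ["db2.ASG", "ENO", "PNO", "DUR"]],
}

def unfold(query_subgoals):
    """Replace each global subgoal with its view body (local subgoals).
    Subgoals whose predicate is not a global relation pass through."""
    unfolded = []
    for subgoal in query_subgoals:
        name = subgoal[0]
        if name in VIEWS:
            unfolded.extend(VIEWS[name])   # substitute the view definition
        else:
            unfolded.append(subgoal)       # already a local relation
    return unfolded

query = [["EMPASG", "ENO", "ENAME", "CITY", "PNO", "DUR"]]
print(unfold(query))
```

A full unfolding would also unify the query's variables with those of the view head; this sketch shows only the substitution step.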
Rewriting in LAV
The global schema is expressed independent of the local databases
Each local relation is defined as a view over the global relations
This enables considerable flexibility for defining local relations
Algorithms for rewriting a query using views essentially try to reduce the
number of rewrites that must be considered.
Three algorithms:
I. The bucket algorithm
II. The inverse rule algorithm
III. The MiniCon algorithm
13
Rewriting in LAV
The bucket algorithm considers each predicate of the query independently to
select only the views that are relevant to that predicate.
Given a query Q, the algorithm proceeds in two steps
One: a bucket b is built for each subgoal q of Q that is not a comparison
predicate
b holds the heads of the views that are relevant to answer q
To determine whether a view V should be in b, there must be a mapping that
unifies q with a subgoal v in view V
Two: for each view V in the Cartesian product of the non-empty buckets
14
Rewriting in LAV
The algorithm produces a conjunctive query and checks whether it is
contained in Q
If it is contained in Q, it is kept to represent one way to answer part of Q
from V.
So, this rewritten query is a union of conjunctive queries.
15
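The two steps above can be sketched in a few lines. The view names and their predicates are hypothetical, and a simple predicate-name match stands in for the unification test; each element of the Cartesian product is only a candidate rewriting that must still be checked for containment in Q.

```python
from itertools import product

# Hypothetical views: view name -> set of predicates in the view body.
VIEWS = {
    "V1": {"EMP"},
    "V2": {"ASG"},
    "V3": {"EMP", "ASG"},
}

def build_buckets(query_subgoals):
    """Step one: one bucket per non-comparison subgoal of Q, holding the
    views relevant to that subgoal (name match stands in for unification)."""
    return {q: [v for v, preds in VIEWS.items() if q in preds]
            for q in query_subgoals}

def candidate_rewritings(buckets):
    """Step two: Cartesian product of the non-empty buckets; each tuple is a
    conjunctive query to be checked for containment in Q."""
    non_empty = [b for b in buckets.values() if b]
    return list(product(*non_empty))

buckets = build_buckets(["EMP", "ASG"])
print(buckets)                      # {'EMP': ['V1', 'V3'], 'ASG': ['V2', 'V3']}
print(candidate_rewritings(buckets))
```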
Rewriting in LAV
The MiniCon algorithm addresses the limitations of the bucket algorithm by
considering the query globally and examining how each predicate in the
query interacts with the views.
One: views are selected that contain subgoals corresponding to the subgoals
of query Q
Finds a mapping that unifies a subgoal q of Q with a subgoal v in view V
It then considers the join predicates in Q and finds the minimum set of
additional subgoals of Q
These must be mapped to subgoals in V
This set of subgoals of Q is captured by a MiniCon description (MCD) associated
with V
16
Rewriting in LAV
Two: a rewritten query is produced by combining the different MCDs.
Because of how the MCDs were created, the resulting rewritings are
guaranteed to be contained in the original query.
17
Query Optimization and Execution
There are three main problems of query optimization in multidatabase
systems
I. Heterogeneous Cost Modeling
II. Heterogeneous Query Optimization
III. Adaptive Query Processing
18
Heterogeneous Cost Modeling
The definition of the global cost function, and the associated problem of
obtaining cost-related information from component DBMSs, is the most studied
of the three problems.
We are primarily interested in determining the cost of the lower levels of a
query execution tree
These correspond to the parts of the query executed at component DBMSs
Three approaches:
I. Black Box Approach
II. Customized Approach
III. Dynamic Approach
19
Black Box Approach
This approach treats each component DBMS as a black box, running test
queries against it to determine the necessary cost information
Cost functions are expressed logically rather than on the basis of physical
characteristics.
The cost function for component DBMSs is expressed as:
Cost = initialization cost + cost to find qualifying tuples + cost to process
selected tuples
20
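The cost formula above can be sketched as a simple linear function. The coefficient values here are hypothetical; in the black-box approach they would be determined by running test queries against the component DBMS and fitting the observed times.

```python
def black_box_cost(card, selected,
                   init_cost=0.1, find_cost=0.002, process_cost=0.005):
    """Cost = initialization cost
            + cost to find qualifying tuples (per tuple scanned)
            + cost to process selected tuples (per tuple returned).
    Coefficients are illustrative, not calibrated values."""
    return init_cost + find_cost * card + process_cost * selected

# E.g., a scan over 10,000 tuples returning 500 of them:
print(round(black_box_cost(card=10_000, selected=500), 3))
```

The coefficients differ per operator and per component DBMS, which is exactly the calibration difficulty the next slide discusses.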
Black Box Approach
The individual terms of this formula will differ for different operators.
The difficulty is in determining the coefficients of the formula, since they
change with different component DBMSs.
The major drawback of this approach is that the cost model is common for all
component DBMSs and may not capture their individual specifics.
So, it might fail to accurately estimate the cost of a query executed at a
component DBMS that exhibits unforeseen behavior.
21
Customized Approach
Uses previous knowledge about the component DBMSs and their external
characteristics to subjectively determine the cost information
The basis is that the query processors of the component DBMSs are too
different to be represented by a unique cost model
Assumes that the ability to accurately estimate the cost of local subqueries
will improve global optimization.
Provides a framework to integrate the component DBMSs' cost models into the
mediator query optimizer.
Extends the wrapper interface such that the mediator gets some specific cost
information from each of the wrappers
The wrapper developer is free to provide a cost model, partially or entirely
22
Customized Approach
The challenge is then to integrate this cost description into the mediator
query optimizer.
Two main solutions:
One: Provide the logic within the wrapper to compute three cost estimates
I. The time to initiate the query process and receive the first result item
(reset_cost)
II. The time to get the next item (advance_cost)
III. The result cardinality
This makes the total query cost:
Total_access_cost = reset_cost + (cardinality - 1) * advance_cost
23
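The total-cost formula above is a direct computation: the first result item costs reset_cost and each of the remaining (cardinality - 1) items costs advance_cost. A one-line sketch with illustrative values:

```python
def total_access_cost(reset_cost, advance_cost, cardinality):
    """Wrapper cost estimate: first item via reset, the rest via advance."""
    return reset_cost + (cardinality - 1) * advance_cost

# E.g., reset_cost = 50 ms, advance_cost = 2 ms, 1,000 result tuples:
print(total_access_cost(50, 2, 1000))  # 50 + 999 * 2 = 2048 (ms)
```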
Customized Approach
Two: Use a hierarchical generic cost model
This is where each node represents a cost rule that associates a query pattern
with a cost function for various cost parameters.
The five cost rules are:
I. Default-scope rules
II. Wrapper-scope rules
III. Collection-scope rules
IV. Predicate-scope rules
V. Query-specific rules
24
Customized Approach
25
Dynamic Approach
This monitors the run-time behavior of component DBMSs and dynamically
collects the cost information
In most cases execution environment factors are constantly changing
Three classes of environmental factors based on dynamicity:
I. First class (frequently changing, every second to every minute) are CPU load, I/O
throughput and available memory
II. Second class (slowly changing, every hour to every day) are DBMS configuration
parameters, physical data organization on disk and database schema.
III. Third class (almost stable, every month to every year) are DBMS type, database
location, and CPU speed
26
Dynamic Approach
One approach is to extend the sampling method and consider user queries as
new samples for environments where network connection, data storage and
available memory change over time.
Query response time is measured to adjust the cost model parameters at run
time for subsequent queries.
This avoids the overhead of processing queries periodically
However, it still requires heavy computation to solve the cost model
equations and does not guarantee that the cost model’s precision improves
over time
27
Dynamic Approach
A qualitative approach is a better solution.
It defines a system contention level as the combined effect of frequently
changing factors on query cost.
System contention level is divided into discrete categories:
High
Medium
Low
No system contention
28
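A minimal sketch of such a multi-category cost model. The thresholds, the choice of CPU load as the measured factor, and the per-level coefficients are all hypothetical; in practice each category's coefficients would be calibrated with probing queries.

```python
def contention_level(cpu_load):
    """Map a measured CPU load in [0, 1] to a discrete contention category
    (thresholds are illustrative)."""
    if cpu_load < 0.25:
        return "no contention"
    if cpu_load < 0.5:
        return "low"
    if cpu_load < 0.75:
        return "medium"
    return "high"

# One set of hypothetical calibrated cost-per-tuple coefficients per category.
COST_PER_TUPLE = {"no contention": 1.0, "low": 1.5, "medium": 2.5, "high": 4.0}

def estimate_cost(cardinality, cpu_load):
    """Pick the cost model for the current contention level, then estimate."""
    return cardinality * COST_PER_TUPLE[contention_level(cpu_load)]

print(estimate_cost(1000, cpu_load=0.6))  # medium contention -> 2500.0
```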
Dynamic Approach
These categories allow for defining a multi-category cost model that
provides accurate cost estimates while the dynamic factors vary.
Cost model is initially calibrated using probing queries
Current system contention level is computed over time
Assumes query executions are short, so environmental factors remain roughly
constant during query execution
29
Heterogeneous Query Optimization
One component DBMS may support only simple select operations while another may
support complex queries involving join and aggregate.
Depending on how the wrappers export such capabilities, query processing at the
mediator level can be more or less complex.
I. Query-based:
The wrappers support the same query capability,
e.g., a subset of SQL, which is translated to the capability of the component DBMS.
II. Operator-based:
The wrappers export the capabilities of the component DBMSs through compositions of
relational operators.
Thus, there is more flexibility in defining the level of functionality between the mediator
and the wrappers; any functionality that is not supported by component DBMSs (e.g., join)
will need to be implemented at the mediator.
30
Query-based Approach
Since the component DBMSs appear homogeneous to the mediator, one approach is to
use a distributed cost-based query optimization algorithm with a heterogeneous
cost model
Extensions are needed to convert the distributed execution plan into
subqueries to be executed by the component DBMSs
Hybrid two-step optimization technique
1. At compile time, a static plan is produced by a centralized cost-based query
optimizer
2. At startup time, an execution plan is produced by carrying out site selection
and allocating the subqueries to the sites.
31
The cost-based query optimizer first generates a left-linear join tree, and then
converts it to a bushy tree.
A hybrid algorithm concurrently performs a bottom-up and top-down sweep of the
left-linear join execution tree, transforming it, step by step, into a bushy
tree.
The algorithm maintains two upper anchor nodes (UANs)
The bottom UAN (UANB)
Is set to the grandparent of the leftmost leaf node
The top UAN (UANT)
Is set to the root
For each UAN the algorithm selects a lower anchor node (LAN)
32
The LAN is chosen such that its right child subtree’s
response time is close to the corresponding UAN’s right
child subtree’s response time.
This helps in keeping the transformed bushy tree balanced,
which reduces the response time.
33
1. Choose UANB if the response time of its left child subtree is smaller than that
of UANT's subtree; otherwise choose UANT.
2. If the response times are the same, choose the one with the more unbalanced
child subtree.
At the end of each transformation, UANB and UANT are adjusted.
3. The algorithm terminates when UANB = UANT, since no further transformations
are possible.
The result is a balanced join execution tree, whose response time is reduced
due to the parallel execution of the joins.
34
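The UAN selection rules above can be sketched as a small decision function. The parameter names (response times of the left child subtrees, and an imbalance measure per candidate) are assumptions introduced for illustration.

```python
def choose_uan(rt_left_B, rt_left_T, imbalance_B, imbalance_T):
    """Return 'B' (bottom UAN) or 'T' (top UAN) per the selection rules:
    rule 1: pick the UAN whose left child subtree has the smaller response
            time; rule 2: on a tie, pick the one with the more unbalanced
            child subtree."""
    if rt_left_B < rt_left_T:   # rule 1: UANB's left subtree is faster
        return "B"
    if rt_left_B > rt_left_T:   # rule 1: UANT's left subtree is faster
        return "T"
    # rule 2: equal response times -> more unbalanced child subtree wins
    return "B" if imbalance_B >= imbalance_T else "T"

print(choose_uan(10, 20, 0.3, 0.5))  # 'B': faster left subtree at the bottom
print(choose_uan(15, 15, 0.3, 0.5))  # 'T': tie, top UAN is more unbalanced
```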
Operator-based Approach
Expressing the capabilities of the component DBMSs through relational
operators allows tight integration of query processing between mediator and
wrappers.
In this approach, the capabilities of the component DBMSs are expressed by
the wrappers as planning functions that can be directly called by a
centralized query optimizer.
Consider the following SQL query submitted to mediator m:
SELECT ENAME, PNAME, DUR
FROM EMPASG
WHERE CITY = "Paris" AND DUR > 24
In the GAV approach, the global view EMPASG(ENAME, CITY, PNAME, DUR) is
defined as:
EMPASG = (db1.EMP ⋈ db2.ASG) ∪ db3.EMPASG
35
• The operator-based approach has been successfully used in DISCO,
a multi-DBMS designed to access multiple databases over the web
• It is used to model non-relational data sources such as web sites
36
1. Search space generation:
The query is decomposed into a number of QEPs, which constitute the search
space for query optimization.
The search space is generated using a traditional search strategy such as
dynamic programming.
2. QEP decomposition
Each QEP is decomposed into a forest of n wrapper QEPs and a composition QEP.
Each wrapper QEP is the largest part of the initial QEP that can be entirely
executed by the wrapper.
Operators that cannot be performed by a wrapper are moved up to the
composition QEP.
The composition QEP combines the results of the wrapper QEPs into the final
answer, typically through unions and joins of the intermediate results produced
by the wrappers.
37
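The QEP decomposition step can be sketched over a toy plan representation (a nested tuple of operator and children; the operator names and wrapper capabilities below are hypothetical). Maximal subtrees the wrapper can execute become wrapper QEPs; unsupported operators are moved up into the composition QEP.

```python
# Hypothetical capabilities of one wrapper: it cannot perform joins.
WRAPPER_SUPPORTS = {"scan", "select"}

def executable(node):
    """True if the whole subtree can run inside the wrapper."""
    if isinstance(node, str):            # a base relation stored at the source
        return True
    op, *children = node
    return op in WRAPPER_SUPPORTS and all(executable(c) for c in children)

def decompose(plan, wrapper_qeps):
    """Move maximal wrapper-executable subtrees into wrapper_qeps and
    return the composition QEP that combines their results."""
    if executable(plan):
        wrapper_qeps.append(plan)
        return ("wrapper_result", len(wrapper_qeps) - 1)
    op, *children = plan
    return (op, *(decompose(c, wrapper_qeps) for c in children))

plan = ("join",
        ("select", ("scan", "db1.EMP")),
        ("scan", "db2.ASG"))
wrapper_qeps = []
composition = decompose(plan, wrapper_qeps)
print(wrapper_qeps)   # the two subtrees executed entirely by the wrapper
print(composition)    # the join is performed at the mediator
```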
Eddy Approach:
Eddy is a general framework for adaptive query processing. It was developed in
the context of the Telegraph project with the goal of running queries on large
volumes of online data with unpredictable input rates and fluctuations in the
running environment.
A QEP can be modeled as a tuple Q = <D, P, C>, where
D is a set of data sources,
P is a set of query predicates with associated algorithms, and
C is a set of ordering constraints that must be followed during execution
38
Query example
Assume the second join can only be an index join over T, and
p is an expensive predicate
Under these assumptions, the QEP is defined as shown in the following figure
39
A Query Execution Plan with
Eddy.
The flexibility of choosing the currently available data source is obtained by
relaxing the fixed order of predicates in a QEP. In Eddy, there is no fixed QEP and
each tuple follows its own path through predicates according to the constraints in the
plan and its own history of predicate evaluation.
40
Query Translation and Execution
Query translation and execution is performed by the wrappers using the component
DBMSs. A wrapper encapsulates the details of one or more component databases,
each supported by the same DBMS (or file system). It also exports to the mediator
the component DBMS capabilities and cost functions in a common interface. One
of the major practical uses of wrappers has been to allow an SQL-based DBMS to
access non-SQL databases.
Wrapper interfaces
41
The main function of a wrapper is conversion between the common interface and
the DBMS-dependent interface.
First, the wrapper translates the input query from the common interface into the
DBMS-dependent interface.
Second, the wrapper must translate the results into the common interface format so
that they can be returned to the mediator for integration.
In addition, the wrapper can execute operations that are not supported by the
component DBMS.
42
Consider the relation EMP(ENO, ENAME, CITY) stored in a very simple component
database, on server ComponentDB, built with Unix text files. Each
EMP tuple can then be stored as a line in a file, e.g., with the attributes
separated by ':'. In SQL/MED, the definition of the local schema for this
relation, together with the mapping to a Unix file, can be declared as a
foreign table with the following statement:
CREATE FOREIGN TABLE EMP (
  ENO INTEGER,
  ENAME VARCHAR(30),
  CITY VARCHAR(20)
) SERVER ComponentDB
OPTIONS (Filename '/usr/EngDB/emp.txt', Delimiter ':')
Then, the mediator can send the wrapper supporting access to this relation
SQL statements. For instance, the query:
43
SELECT ENAME FROM EMP
can be translated by the wrapper using the following Unix shell command to
extract the relevant attribute:
cut -d: -f2 /usr/EngDB/emp.txt
Additional processing, e.g., for type conversion, can then be done using
programming code.
1. Wrappers are mostly used for read-only queries, which makes query
translation and wrapper construction relatively easy.
2. Wrapper construction typically relies on CASE tools with reusable components
to generate most of the wrapper code.
44
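What the wrapper does for SELECT ENAME FROM EMP can be sketched directly: read the ':'-delimited file mapped by the foreign-table definition and project the second field, mirroring the `cut -d: -f2` command above. The helper name is an assumption, and an in-memory file stands in for /usr/EngDB/emp.txt.

```python
import io

def select_ename(emp_file):
    """Project attribute 2 (ENAME) from ':'-delimited EMP tuples,
    like `cut -d: -f2` on the mapped file."""
    return [line.rstrip("\n").split(":")[1] for line in emp_file]

# Simulate the Unix file's contents (invented tuples) with an in-memory file:
emp_txt = io.StringIO("E1:J. Doe:Paris\nE2:M. Smith:London\n")
print(select_ename(emp_txt))  # ['J. Doe', 'M. Smith']
```

Type conversion (e.g., ENO to INTEGER) would be additional wrapper code, as the slide notes.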
Problem:
The main problem of updating through a wrapper is to guarantee
component database consistency by rejecting all updates that violate
integrity constraints, whether they are explicit or implicit.
Solution:
A software engineering solution to this problem uses a CASE tool with
reverse engineering techniques to identify, within application code, the
implicit integrity constraints, which are then translated into validation code
in the wrappers.
45
Conclusion
Query processing in multidatabase systems is significantly more complex than
in tightly-integrated and homogeneous distributed DBMSs.
In addition to being distributed, component databases may be autonomous,
have different database languages and query processing capabilities, and
exhibit varying behavior.
In particular, component databases may range from full-fledged SQL
databases to very simple data sources
(e.g., text files).
46