Lecture 8 - Distributed Databases
Lecture 8 - Distributed Databases
Distributed Databases
• Distributed Database
Site 2
Site 3
1
10/13/2014
• All sites use same DBMS product. • Sites may run different DBMS products, with
possibly different underlying data models.
• Occurs when sites have implemented their own
• Much easier to design and manage. databases and integration is considered later.
• Translations required to allow for:
7 8
QUERY
• Client-Server
– Client chips query to Query • Expect DDBMS to have at least the functionality
client client
of a DBMS.
single site. All query
processing at server Server Server Server
9 10
2
10/13/2014
13 14
Fragmentation…
Fragmentation
• Definition and allocation of fragments carried • Quantitative information may include:
out strategically to achieve: – frequency with which an application is run;
– Locality of Reference. – site from which an application is run;
– Improved Reliability and Availability. – performance criteria for transactions and
– Improved Performance. applications.
– Balanced Storage Capacities and Costs. • Qualitative information may include
– Minimal Communication Costs. transactions that are executed by application,
type of access (read or write), and predicates
• Involves analyzing most important applications,
of read operations.
based on quantitative/qualitative information.
15 16
17 18
3
10/13/2014
Data Allocation…
Why Fragment?
• Complete Replication • Usage
– Consists of maintaining complete copy of – Applications work with views rather than entire
database at each site. relations.
• Selective Replication • Efficiency
– Combination of partitioning, replication, and – Data is stored close to where it is most
centralization. frequently used.
– Data that is not needed by local applications is
not stored.
19 20
21 22
Rationale - Distribution
• Location Transparency - Users do not need to be aware
of at what sites data is located. Simplifies user programs
and interface activities. Data can migrate from site to site
without invalidating any of those programs or activities.
Data may be migrated around the network in response to
changing usage or performance requirements
• Fragmentation Transparency - Users do not need to be
aware of how data is fragmented.
• Replication Transparency - Users should not need to be
aware of how data is replicated. Replication is desirable
because performance is better if applications can operate
on local copies and availability is better so long as at least
one copy remains available for retrieval purposes.
23