0% found this document useful (0 votes)

28 views21 pages

Overview of Distributed Databases

A distributed database system consists of multiple interconnected sites where users can access data stored at any location, with each site managing its own local database and users. The system aims to achieve twelve objectives, including local autonomy, continuous operation, and replication independence, while addressing challenges such as query processing and update propagation. Key features include location independence, fragmentation independence, and the ability to operate across various hardware and software platforms.

Uploaded by

ompatel4624

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

28 views21 pages

Overview of Distributed Databases

Uploaded by

ompatel4624

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Distributed Database

16-1
Introduction

16-2
Distributed Database System
▪ A system involving multiple sites connected together via communication
network.
▪ User at any site can access data stored at any site.
▪ Each site is a database system in its own right: its own local database,
local users, local DBMS, local DC manager.
Communi
User cation
manager
DBM
S
datab
ase

Communication
Network

Fig 16.1: A typical distributed database system

Wei-Pang Yang, Information Management, NDHU 16-3
The Twelve Objectives

16-4
The Twelve Objectives
1. Local Autonomy
• all operations at a given site are controlled by that site, should not
depend on other sites.
• local data is locally owned and managed.
• Not wholly achievable => sites should be autonomous to the
maximum extend possible.
2. No Reliance on a Central Site
• all sites must be treated as equals.
• the central site may be bottleneck.
3. Continuous Operation
• Reliability
• Availability
• Never require the system to be shutdown to perform some function:
e.g. add a new site.
Wei-Pang Yang, Information Management, NDHU 16-5
The Twelve Objectives (cont.)
4. Location Independence ( Location Transparency )
• user should not need to know at which site the data is stored, but should be
able to behave as if the entire database were stored at their own local site.
• a request for some remote data => system should find the data
automatically.
C
• Advantages A
<1> Simplify user programs and activities
<e.g.> SELECT S# B
FROM S
AT SITE A
WHERE SNAME = 'John'

<2> allow data to be moved from one site to another at any time without
invalidating any program or activities.

Wei-Pang Yang, Information Management, NDHU 16-6

The Twelve Objectives (cont.)
5. Fragmentation Independence ( Fragmentation Transparency )
• Data Fragmentation
• a given local object can be divided up into pieces (fragments) for
physical storage purpose.
<e.g.> user perception
EMP EMP# DEPT# SALARY
E1 DX 45K
E2 DY 40K
E3 DZ 50K
E4 DY 63K
E5 DZ 40K

EMP# DEPT# SALARY EMP# DEPT# SALARY London

New York E1 DX 45K E2 DY 40K fragment
fragment E3 DZ 50K E4 DY 63K
E5 DZ 40K

physical storage physical storage

New York London Fig. 16.2: An example
of fragmentation.
Wei-Pang Yang, Information Management, NDHU 16-7
The Twelve Objectives (cont.)
• Data Fragmentation
• A fragmentation can be any subrelation derivable via restriction and
projection (with primary key).
• Advantage: data can stored at the location where it is most frequently
used.
• Fragmentation independence
• user should be able to behave as if the relations were not fragmented
at all.
• one reasons why relational technology is suitable for DBMS.
• user should be presented with a view of data.
=> system must support updates against join and union views.
• Advantages
(1) simplify user program and activity.
(2) allow data to be re-fragmented at any time.

Wei-Pang Yang, Information Management, NDHU 16-8

The Twelve Objectives (cont.)
6. Replication Independence ( Replication Transparency )
• Data Replication
USER PERCEPTION
New York fragment London fragment
EMP EMP# DEPT# SALARY EMP# DEPT# SALARY EMP# DEPT# SALARY

E1 DX E1 DX 45K E2 DY 40K
45K
E2 DY E3 DZ 50K E4 DY 63K
40K
E3 DZ 50K E5 DZ 40K
E4 DY 63K copy
E5 DZ 40K replica of London fragment replica of New York fragment
EMP# DEPT# SALARY EMP# DEPT# SALARY
E2 DY 40K E1 DX 45K
E4 DY 63K E3 DZ 50K
E5 DZ 40K

physical storage physical storage

New York London

Fig. 16.3: An example of replication.

Wei-Pang Yang, Information Management, NDHU 16-9

The Twelve Objectives (cont.)
• Data Replication
• A given fragment of relation can be represented at the physical level
by many distinct copies of the same object at many distinct sites.
• Unit of replication: fragment (may not a complete relation)
• Advantage: better performance and availability
• Disadvantage: update propagation problem.
• Replication Independence

• User should be able to behave as if the data is not replicated at all.

• Advantages
(1) simplify user programs and activities.
(2) allow replicas to be created and destroyed dynamically.

Wei-Pang Yang, Information Management, NDHU 16-10

The Twelve Objectives (cont.)
7. Distributed Query Processing
• To execute single query at different location, does not able to
satisfy transparent request. So, query optimization is crucial and
performed transparently by DDBMS.
8. Distributed Transaction Management
• Transaction is able to update data at different sites transparently,
but control of recovery and concurrency is achieved by using
agents.
9. Hardware Independence: It should be possible for DDBMS
to run on different hardware platforms. Like IBM, DEC, HP,
PC, ...

Wei-Pang Yang, Information Management, NDHU 16-11

The Twelve Objectives (cont.)
10. Operating System Independence: It should be possible for
DDBMS to run on different Operating system platforms. Like
VMS, UNIX, ...
11. Network Independence: The DDBMS system is able to run
on any network platform. Like BITNET, INTERNET,
ARPANET, ...
12. DBMS Independence: Relational, hierarchical, network, ...
• The system must support any vendor of the database product.
• distributed system may be heterogeneous.

Wei-Pang Yang, Information Management, NDHU 16-12

Problems of Distributed Database
Systems

16-13
Basic Point: Network are slow !
Basic point: network are slow !

Overriding Objective : minimize the number and

volume of messages.

Give rise to the following problem

• Query Processing
• Update Propagation
• Concurrency
• Recovery
• Catalog Management
..
.
Wei-Pang Yang, Information Management, NDHU 16-14
Query Processing: Example
• Query Optimization is more important in a distributed
system.
• Example (Date, Vol.2 p.303)
S ( S#, CITY )
• Database: 10,000 tuples, stored at site A.
P ( P#, COLOR) 100,000 tuples, stored at site B.
SP ( S#, P# ) 1,000,000 tuples, stored at site A.

Assume each tuple is 100 bits long.

Site A: S SP Site B: P

Wei-Pang Yang, Information Management, NDHU 16-15

Query Processing: Example (cont.)
• Query: "Select S# for London suppliers of Red Parts"
SELECT S.S# site A site B
FROM S, P, SP S, SP P
WHERE S.CITY = "London"
AND S.S# = SP.S# S SP
AND SP.P# = P.P#
AND P.COLOR = 'Red'
• Estimates
# of Red Parts = 10
# of Shipments by London Supplier = 100,000
• Communication Assumption :
Data Rate = 10,000 bits per second
Access Delay = 1 second
• T[i] = total communication time for strategy i
= total access delay + total data volume / data rate
= (# of messages * 1 sec) + (total # of bits / 10,000 ) sec.

Wei-Pang Yang, Information Management, NDHU 16-16

Query Processing: Example (cont.)
site A site B
• Strategy 1 S, SP P
1. Join S and SP at site A
2. Select tuples from ( S SP ) for which city is 'London'
( 100,000 tuples )
3. For each of those tuple, check site B to see if the part is
red. (2 messages: 1 query, 1 response)
T[1] = ( 100,000 * 2 ) * 1 = 2.3 days
• Strategy 2
Move relations S and SP to site B and process the query at B.
T[2] = 2+(10,000+1,000,000)*100/10,000 = 28 hours
• Strategy 3
Move relation P to site A and process the query at A
T[3] =1+(100,000*100) /10,000 = 16.7 min

Wei-Pang Yang, Information Management, NDHU 16-17

Query Processing: Example (cont.)
• Strategy 4
1. Select tuples from P where color is red. (10 tuples)
2. Check site A to see if there exists a shipment relating the part to
a London Supplier. ( 2*10 messages )
T[4] = 2*10*1 = 20 sec site A site B
S, SP P
• Strategy 5
1. Select tuples from P where color is red (10 tuples)
2. Move the result to site A and complete the processing at A.
T[5] = 1 + ( 10*100) / 100,000 = 1.01 sec

• Note: Each of the five strategies represents a plausible

solution, but the variation in communication time is
enormous.
Wei-Pang Yang, Information Management, NDHU 16-18
Query Processing: Semijoin
• Semijoin: (used in SDD - 1) Ref. p.529 [18.15]
A B p.626 [21.26]

site A site B
<e.g.> S SP
• Database :
S: 1,000 tuples, at site A S'
SP' S#
SP: 2,000 tuples, at site B
# of tuples in S where S.S#=SP.S#: 100,
length of a S tuple: 100 bit
length of a SP tuple: 100 bit
length of the S# field: 10 bit

• Regular Join:
<1> Ship S to site B ( 1000 * 100 bits )
<2> Join S and SP at site B
communication time = 1 + 1000*100/10000 = 11 sec
Wei-Pang Yang, Information Management, NDHU 16-19
Query Processing: Semijoin (cont.)
• Semijoin
<1> site B: step 1. Project SP on S# (get SP')
site A site B step 2. ship to site A
S SP <2> site A: step 3. Join the projection of SP' on S# with S
step 4. The result S‘, ship to site B
S' <3> site B: step 5. Join S' with SP
SP' S#
communication time = 1+10*2000/10000+1+100*100/10000
= 1+2+1+1= 5 sec

Site A Site B
S SP SP'
S' S1
S# S# P#
Join S4
# = 100 # = 1,000 #=2,000
S' ... # =< 2,000
S921
100 bits
100 bits
10 bits

Wei-Pang Yang, Information Management, NDHU 16-20

Update Propagation
▪ Basic problem with data replication
• An update to any given logical data object must be propagated to all
stored copies of that object.
• some sites may be unavailable (because of site or network failure) at the
time of the update
=> Data is less available !
▪ A possible Solution: Primary Copy (used in distributed INGRES)
• one copy of each object is designated as the primary copy.
• primary copies of different objects will generally be at different sites.
• Update Operation
1. Complete as soon as the primary copy has been updated.
2. Control is returned and the transaction can continue execute.
3. The site holding the primary copy broadcasts the update to all other sites.
• Further Problem: violation of the local autonomy objective.
Wei-Pang Yang, Information Management, NDHU 16-21

Unit-V: Database Management System
No ratings yet
Unit-V: Database Management System
5 pages
Distributed Database
100% (1)
Distributed Database
24 pages
Lecture 1 Advance Database Systems Concepts
No ratings yet
Lecture 1 Advance Database Systems Concepts
54 pages
Fragmentation and Replication in Databases
No ratings yet
Fragmentation and Replication in Databases
24 pages
Chhanda Ray - Distributed Database Systems (2009, Pearson Education) - Libgen - Li
No ratings yet
Chhanda Ray - Distributed Database Systems (2009, Pearson Education) - Libgen - Li
325 pages
Chapter 7 - Distributed Database System
No ratings yet
Chapter 7 - Distributed Database System
27 pages
Distributed Databases
No ratings yet
Distributed Databases
58 pages
Distributed Database Fundamentals
No ratings yet
Distributed Database Fundamentals
36 pages
Advanced Database Systems
No ratings yet
Advanced Database Systems
16 pages
ch6 Distributed Database
No ratings yet
ch6 Distributed Database
35 pages
Distributed Database Systems-Chhanda Ray
No ratings yet
Distributed Database Systems-Chhanda Ray
271 pages
Advanced Database Chapter 6 and 7
No ratings yet
Advanced Database Chapter 6 and 7
30 pages
Chapter-7 Distributed Database Systems
No ratings yet
Chapter-7 Distributed Database Systems
40 pages
Unit V
No ratings yet
Unit V
22 pages
Understanding Distributed Databases Concepts
No ratings yet
Understanding Distributed Databases Concepts
56 pages
Adb CH 4
No ratings yet
Adb CH 4
14 pages
Unit 1 - Scsa3008 - Distributed Database and Information
No ratings yet
Unit 1 - Scsa3008 - Distributed Database and Information
23 pages
Enterprise Systems: Distributed Databases and Systems - DT211 4
No ratings yet
Enterprise Systems: Distributed Databases and Systems - DT211 4
25 pages
Distributed Databases Overview
No ratings yet
Distributed Databases Overview
33 pages
7-Distributed DB
No ratings yet
7-Distributed DB
37 pages
Distributed Databases: by Chien-Pin Hsu CS157B Section 1 Nov 11, 2004
No ratings yet
Distributed Databases: by Chien-Pin Hsu CS157B Section 1 Nov 11, 2004
24 pages
Client-Server Architectures in DDBs
No ratings yet
Client-Server Architectures in DDBs
73 pages
3 Distribution Design
No ratings yet
3 Distribution Design
110 pages
Chapter 4 - Distributed Database System
No ratings yet
Chapter 4 - Distributed Database System
52 pages
Distributed Database Systems Guide
No ratings yet
Distributed Database Systems Guide
5 pages
Chapter - 7 Distributed Database System
No ratings yet
Chapter - 7 Distributed Database System
29 pages
Distributed Database Systems Guide
No ratings yet
Distributed Database Systems Guide
25 pages
Database Design for IT Professionals
No ratings yet
Database Design for IT Professionals
5 pages
Chapter 6
No ratings yet
Chapter 6
45 pages
17 DatabaseArchitectures
No ratings yet
17 DatabaseArchitectures
41 pages
Distributed Database Concepts
No ratings yet
Distributed Database Concepts
52 pages
Unit 5 Notes
No ratings yet
Unit 5 Notes
30 pages
26 Distributed Dbms Nosql
No ratings yet
26 Distributed Dbms Nosql
45 pages
7 Distributed DB
No ratings yet
7 Distributed DB
38 pages
RD TH
No ratings yet
RD TH
4 pages
Module 2
No ratings yet
Module 2
62 pages
QueryProcessing Lect 3
No ratings yet
QueryProcessing Lect 3
26 pages
Unit-2 - Distributed Database System
No ratings yet
Unit-2 - Distributed Database System
7 pages
DBMS-Unit 5
No ratings yet
DBMS-Unit 5
27 pages
Week 12 - Distributed Databases
No ratings yet
Week 12 - Distributed Databases
37 pages
Distributed Database Management Overview
No ratings yet
Distributed Database Management Overview
10 pages
DDBS Unit 1
No ratings yet
DDBS Unit 1
11 pages
Distributed Database: Source
No ratings yet
Distributed Database: Source
19 pages
Distributed Database Design Guide
No ratings yet
Distributed Database Design Guide
52 pages
07 DistributedDataManagement
No ratings yet
07 DistributedDataManagement
44 pages
02 DistributedDataManagement
No ratings yet
02 DistributedDataManagement
37 pages
Unit 1
No ratings yet
Unit 1
28 pages
Advanced Distributed Databases
No ratings yet
Advanced Distributed Databases
8 pages
Distributed Database Systems
No ratings yet
Distributed Database Systems
50 pages
Iii. Current Trends: Distributed Databases and DBMSS: Concepts and Design
No ratings yet
Iii. Current Trends: Distributed Databases and DBMSS: Concepts and Design
32 pages
Synchronization: Performed. Pecialized
No ratings yet
Synchronization: Performed. Pecialized
13 pages
DDB Unit 1-5
No ratings yet
DDB Unit 1-5
190 pages
Distributed Database Frank Chinembiri and Florence-2
No ratings yet
Distributed Database Frank Chinembiri and Florence-2
42 pages
Distributed Databases: Not Just A Client/server System
No ratings yet
Distributed Databases: Not Just A Client/server System
43 pages
R PPR 2
No ratings yet
R PPR 2
6 pages
Distributed Databases and Client-Server Architectures
No ratings yet
Distributed Databases and Client-Server Architectures
60 pages
UNIT 1 Security
No ratings yet
UNIT 1 Security
29 pages
UNIT-1-Transaction and Concurrency
No ratings yet
UNIT-1-Transaction and Concurrency
31 pages
UNIT 1 Recovery
No ratings yet
UNIT 1 Recovery
22 pages
ADBMS Unit1 2
No ratings yet
ADBMS Unit1 2
54 pages
Networking Commands
No ratings yet
Networking Commands
6 pages
Assignment of Unit - 1
No ratings yet
Assignment of Unit - 1
1 page
Crimping
No ratings yet
Crimping
9 pages
IP Address Allocation
No ratings yet
IP Address Allocation
8 pages
LAN Creation in Work Group
No ratings yet
LAN Creation in Work Group
5 pages
8 Different Types of Servers in Computer Networks
No ratings yet
8 Different Types of Servers in Computer Networks
8 pages
The Van Gogh Deception Deron R. Hicks No Waiting Time
No ratings yet
The Van Gogh Deception Deron R. Hicks No Waiting Time
117 pages
Kashmir Bespoke Holiday Itinerary for 14
No ratings yet
Kashmir Bespoke Holiday Itinerary for 14
12 pages
Tree Cutting Investigation at Kalang-Kalang
No ratings yet
Tree Cutting Investigation at Kalang-Kalang
4 pages
Lab Manual Web Engineering
No ratings yet
Lab Manual Web Engineering
44 pages
F4-Eat, F4e-Iii - MT
0% (1)
F4-Eat, F4e-Iii - MT
125 pages
Raul Gomez Jattin
No ratings yet
Raul Gomez Jattin
2 pages
Comprehensive Guide to Leave Regulations
No ratings yet
Comprehensive Guide to Leave Regulations
41 pages
National Institute of Technology, Srinagar Department of Electrical Engineering
No ratings yet
National Institute of Technology, Srinagar Department of Electrical Engineering
2 pages
HDR Assertiveness Finding Your Voice 2025
No ratings yet
HDR Assertiveness Finding Your Voice 2025
39 pages
AI Report
No ratings yet
AI Report
22 pages
Brgy Resolution Als Realignment - NAHAPUNAN YDF
No ratings yet
Brgy Resolution Als Realignment - NAHAPUNAN YDF
3 pages
Corporate Securities: Ma. Pamela Z. Sepulveda Bsa 4 Financial Management 102 AUGUST 23, 2018
No ratings yet
Corporate Securities: Ma. Pamela Z. Sepulveda Bsa 4 Financial Management 102 AUGUST 23, 2018
51 pages
Queer Marxism in Taiwan - 240224 - 115453
No ratings yet
Queer Marxism in Taiwan - 240224 - 115453
24 pages
BYJU'S Employee Leave Policy
No ratings yet
BYJU'S Employee Leave Policy
10 pages
Sports Development Funding 2016 17 Impact
No ratings yet
Sports Development Funding 2016 17 Impact
2 pages
Urban Goods Movement Policy Guide
100% (1)
Urban Goods Movement Policy Guide
60 pages
Biography of Hazrat Umar Farooq (RA)
No ratings yet
Biography of Hazrat Umar Farooq (RA)
5 pages
Dena Owner Manual
No ratings yet
Dena Owner Manual
162 pages
HSE Questions
No ratings yet
HSE Questions
6 pages
Foreign Descriptions of Muscovy - An Analytic Bibliography of Prim
No ratings yet
Foreign Descriptions of Muscovy - An Analytic Bibliography of Prim
173 pages
Mutual Fund Accounting Guidelines
100% (1)
Mutual Fund Accounting Guidelines
1 page
Travels in European Turkey, 1850
100% (1)
Travels in European Turkey, 1850
506 pages
Mechanical Engineering Project Thesis PDF
100% (2)
Mechanical Engineering Project Thesis PDF
6 pages
M2.2 Lesson 2 NOUN - ADJECTIVE AGREEMENT
No ratings yet
M2.2 Lesson 2 NOUN - ADJECTIVE AGREEMENT
2 pages
Geetanjali-Hostel Brochure
No ratings yet
Geetanjali-Hostel Brochure
28 pages
Job Application Letter: Date: 20/08/2021
No ratings yet
Job Application Letter: Date: 20/08/2021
4 pages
Reuters News Writing Guide
No ratings yet
Reuters News Writing Guide
5 pages
Lukacs The Road To The Blum Theses
No ratings yet
Lukacs The Road To The Blum Theses
40 pages
Edgar Allan Poe: Short Stories Collection
No ratings yet
Edgar Allan Poe: Short Stories Collection
45 pages
Creating A Microclimate Box For Metal Storage
100% (1)
Creating A Microclimate Box For Metal Storage
3 pages

Overview of Distributed Databases

Uploaded by

Overview of Distributed Databases

Uploaded by

Distributed Database

Fig 16.1: A typical distributed database system

Wei-Pang Yang, Information Management, NDHU 16-6

EMP# DEPT# SALARY EMP# DEPT# SALARY London

physical storage physical storage

Wei-Pang Yang, Information Management, NDHU 16-8

physical storage physical storage

Fig. 16.3: An example of replication.

Wei-Pang Yang, Information Management, NDHU 16-9

• User should be able to behave as if the data is not replicated at all.

Wei-Pang Yang, Information Management, NDHU 16-10

Wei-Pang Yang, Information Management, NDHU 16-11

Wei-Pang Yang, Information Management, NDHU 16-12

Overriding Objective : minimize the number and

Give rise to the following problem

Assume each tuple is 100 bits long.

Wei-Pang Yang, Information Management, NDHU 16-15

Wei-Pang Yang, Information Management, NDHU 16-16

Wei-Pang Yang, Information Management, NDHU 16-17

• Note: Each of the five strategies represents a plausible

Wei-Pang Yang, Information Management, NDHU 16-20

You might also like