0% found this document useful (0 votes)

395 views58 pages

Chapter - 7 Distributed Database System

Here are the fragments accessed: New York fragment of EMPLOYEE Atlanta fragment of EMPLOYEE Miami fragment of EMPLOYEE The results are integrated transparently. - User need not know about fragments. - DBMS handles fragmentation details. - Highest level of transparency. - Most complex for DBMS to implement. - Slowest performance. - Not commonly supported. Distribution Transparency • Case 2: DB Supports Location Transparency SELECT * FROM EMP WHERE LOCATION = 'New York'

Uploaded by

dawod

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

395 views58 pages

Chapter - 7 Distributed Database System

Uploaded by

dawod

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 58

Chapter - 7

Distributed Databases system

1
Outline
1 Distributed Database Concepts

2 Data Fragmentation, Replication and Allocation

3 Types of Distributed Database Systems

4 Query Processing

5 Concurrency Control and Recovery

6 3-Tier Client-Server Architecture

Distributed Database Concepts
It can be defined as
 A distributed database (DDB) is a collection of multiple
logically related database distributed over a computer
network.
 A distributed database management system is a software
system that manages a distributed database while
making the distribution transparent to the user.
Advantages DDS
1. Management of distributed data with different levels of
transparency:
– Distribution transparency:
• This refers to the physical placement of data (files, relations,
etc.) is not known to the user.

Site 5
Site 1

Site 4 Communications neteork

Site 3 Site 2
Examples
The EMPLOYEE, PROJECT, and WORKS_ON tables may be fragmented
horizontally and stored with possible replication as shown below.
EMPLOYEES - All
PROJECTS - All
WORKS_ON - All
EMPLOYEES - New York
Chicago PROJECTS - All
(headquarters) WORKS_ON - New York Employees

EMPLOYEES - San Francisco and LA New York

PROJECTS - San Francisco
WORKS_ON - San Francisco Employees

San Francisco Communications neteork

Los Angeles Atlanta

EMPLOYEES - LA EMPLOYEES - Atlanta
PROJECTS - LA and San Francisco PROJECTS - Atlanta
WORKS_ON - LA Employees WORKS_ON - Atlanta Employees
Advantages DDS
− Network transparency: Users do not have to worry about operational details of
the network.
− Location transparency: refers to freedom of issuing command from any location
without affecting its working.
− Naming transparency: allows access to any named object (files, relations, etc.)
from any location.
− Replication transparency: Allows to store copies of a data at multiple sites.

• This is done to minimize access time to the required data.

− Fragmentation transparency: Allows to segment a relation horizontally

(create a subset of tuples of a relation) or vertically (create a subset of columns of a

relation).
Advantages DDS
2. Increase reliability and availability:
− Reliability refers to system live time, that is, system is running efficiently most
of the time.
− Availability is the probability that the system is continuously available (usable
or accessible) during a time interval.
− A distributed database system has multiple nodes (computers) and if one fails
then others are available to do the job.
3. Improved performance:
− A DDBMS fragments the database to keep data closer to where it is needed
most.
− This reduces data management (access and modification) time significantly.
4. Easier expansion (scalability):
− Allows new nodes (computers) to be added anytime without chaining the entire
configuration.
Disadvantages DDS
– Complexity

– Cost

– Security

– Integrity control more difficult

– Lack of standards

– Lack of experience

– Database design more complex

Types of Distributed Database Systems
Homogeneous
• All sites of the database system have identical setup, i.e., same database
system software.
• The underlying operating systems can be a mixture of Linux, Window,
Unix, etc.
• For example, all sites run Oracle or DB2, or Sybase or some other database
system.
Window
Site 5 Unix
Advantages Oracle Site 1
 Easy to use Oracle
 Easy to mange Window
Site 4 Communications
 Easy to Design neteork
Disadvantages
 Difficult for most organizations to Oracle

force a homogeneous environment Site 3 Site 2

Linux Oracle Linux Oracle
Heterogeneous
 Different data center may run different DBMS products, with possibly different underlying data models.

 Translations required to allow for:

o Different hardware.
o Different DBMS products.
o Different hardware and different DBMS products.

Object Unix Relational

Oriented Site 5 Unix
Site 1
Hierarchical
Window
Site 4 Communications
network

Network
Object DBMS
Oriented Site 3 Site 2 Relational
Linux Linux
Heterogeneous
 Advantages
 Huge data can be stored in one Global center from different data center
 Remote access is done using the global schema.
 Different DBMSs may be used at each node

 Disadvantages
 Difficult to mange
 Difficult to design.

.
Federated Database Management Systems

• A federated database system (FDBS) is a collection of cooperating

database systems that are autonomous and possibly heterogeneous.
• Differences in data models: Relational, Objected oriented,
hierarchical, network, etc.
• Differences in constraints: Each site may have their own data
accessing and processing constraints.
• Differences in query language: Some site may use SQL, some may
use SQL-89, some may use SQL-92, and so on.

Multidatabase system (MDBS): A distributed DBMS in which each site

maintains complete autonomy.
Distributed Processing and Distributed Database
Centralized Database Management System
Fully Distributed Database Management System
DDBMS Components
Computer workstations
 To form the network system.
Network hardware and software
 Components that reside in each workstation.
Communications media
 Carry the data from one workstation to another.
Transaction processor (TP)
 Receives and Processes the application’s data requests.
Data processor (DP)
 Stores and Retrieves data located at the site.
 Also Known as data manager (DM).
Distributed Database System Components
DDBMS protocol
• determines how the DDBMS will:
– Interface with the network to transport data and commands
between DPs and TPs.
– Synchronize all data received from DPs (TP side) and route
retrieved data to the appropriate TPs (DP side).
– Ensure common database functions in a distributed system --
security, concurrency control, backup, and recovery.
Levels of Data & Process Distribution

• Single-Site Processing, Single-Site Data (SPSD)

– All processing is done on a single CPU or host computer.
– All DBMS are stored on the host computer’s local disk.
– The DBMS is accessed by dumb terminals.
– Typical of most mainframe and minicomputer DBMSs.
– Typical of the 1st generation of single-user microcomputer database.
Non distributed (Centralized) DBMS
Levels of Data & Process Distribution

Multiple-Site Processing, Single-Site Data (MPSD)

− Typically, MPSD requires a network file server on which
conventional applications are accessed through a LAN.
− A variation of the MPSD approach is known as a
client/server architecture.

Figure 6.7
Levels of Data & Process Distribution

 Multiple-Site Processing, Multiple-Site Data (MPMD)

– Fully distributed DBMS with support for multiple DPs and
TPs at multiple sites.
– Homogeneous DDMS
 Integrate only one type of centralized DBMS over the network.

– Heterogeneous DDBMS
 Integrate different types of centralized DBMSs over a network.
Distributed DB Transparency

 DDBMS transparency features have the common property of

allowing the end users to think that he is the database’s only
user.
– Distribution transparency
– Transaction transparency
– Failure transparency
– Performance transparency
– Heterogeneity transparency
Distribution Transparency
• Distribution transparency allows us to manage a physically
dispersed database as though it were a centralized database.
• Three Levels of Distribution Transparency
– Fragmentation transparency
– Location transparency
– Local mapping transparency

Table 6.2
Distribution Transparency
• Example :
Employee data (EMPLOYEE) are distributed over three locations: New York,
Atlanta, and Miami.
Depending on the level of distribution transparency support, three different cases of
queries are possible:

Figure 6.9 Fragment Locations

Distribution Transparency
• Case 1: DB Supports Fragmentation Transparency
SELECT * FROM EMPLOYEE WHERE EMP_DOB < '01-JAN-1940';

• Case 2: DB Supports Location Transparency

SELECT * FROM E1 WHERE EMP_DOB < '01-JAN-1940';
UNION
SELECT * FROM E2 WHERE EMP_DOC < '01-JAN-1940';
UNION
SELECT * FROM E3 WHERE EMP_DOC < '01-JAN-1940';

• Case 3: DB Supports Local Mapping Transparency

SELECT * FROM E1 NODE NY WHERE EMP_DOB < '01-JAN-1940';
UNION
SELECT * FROM E2 NODE ATL WHERE EMP_DOB < '01-JAN-1940';
UNION
SELECT * FROM E3 NODE MIA WHERE EMP_DOB < '01-JAN-1940';
Transaction Transparency
• Transaction transparency - ensures that database transactions

will maintain the database’s integrity and consistency.

• Related Concepts:
– Remote Requests

– Remote Transactions

– Distributed Transactions

– Distributed Requests
A Remote Request
 Allows us to access data to be processed by a single remote database
processor.
A Remote Transaction
 Composed of several requests, may access data at only a single
site.
A Distributed Transaction

 Allows a transaction to reference several different (local or

remote) DP sites.
A Distributed Request
 Reference data from several remote DP sites.
 Allows a single request to reference a physically partitioned table.

Example2:
Distributed Request
Transaction Transparency

 Two-Phase Commit Protocol

 The two-phase commit protocol requires a

 DO-UNDO-REDO protocol and

 write-ahead protocol.
 The DO-UNDO-REDO protocol is used by the DP to roll back
and/or roll forward transactions with the help of the system’s
transaction log entries.
Transaction Transparency
 Two-Phase Commit Protocol

 DO performs the operation and records the “before” and “after” values
in the transaction log.
 UNDO reverses an operation, using the log entries written by the DO
portion of the sequence.
 REDO redoes an operation, using the log entries written by DO
portion of the sequence.

– The write-ahead protocol forces the log entry to be written to permanent

storage before the actual operation takes place.
Two-Phase Commit Protocol
• Two-phase commit protocol defines the operations between two
nodes;
• Coordinator and

• Subordinates or cohorts - one or more

Two-Phase Commit Protocol
• The protocol is implemented in two phases:
• Phase 1: Preparation

• The coordinator sends a PREPARE TO COMMIT message to all

subordinates.
• The subordinates receive the message, write the transaction log
using the write-ahead protocol, and send an acknowledgement
message to the coordinator.
• The coordinator makes sure that all nodes are ready to commit, or
it aborts the transaction.
Two-Phase Commit Protocol
– Phase 2: The Final Commit

– The coordinator broadcasts a COMMIT message to all

subordinates and waits for the replies.

– Each subordinate receives the COMMIT message then updates

the database, using the DO protocol.
– The subordinates reply with a COMMITTED or NOT COMMITTED
message to the coordinator.
 If one or more subordinates uncommitted, the coordinator sends
an ABORT message, thereby forcing them to UNDO all
changes.
Query Optimization

• The objective of a query optimization routine is to minimize the

total cost associated with the execution of a request.
• The costs associated with a request are a function of the:

– Access time (I/O) cost - involved in accessing the physical

data stored on disk.
– Communication cost - associated with the transmission of
data among nodes in distributed database systems.
– CPU time cost - associated with the processing overhead of
managing distributed transactions.
Performance Transparency and
Query Optimization

• Query optimization must provide distribution transparency as well

as replica transparency.

• Replica transparency refers to the DDBMSs ability to hide the

existence of multiple copies of data from the user.

• Query optimization algorithms are based on two principles:

• Selection of the optimum execution order

• Selection of sites to be accessed to minimize communication

costs
Performance Transparency and
Query Optimization
• Operation Modes of Query Optimization

– Automatic query optimization means that the DDBMS finds

the most cost-effective access path without user intervention.
– Manual query optimization requires that the optimization be
selected and scheduled by the end user or programmer.

• Timing of Query Optimization

– Static query optimization takes place at compilation time.

– Dynamic query optimization takes place at execution time.
Performance Transparency and
Query Optimization
• Optimization Techniques Information -

– Statistically based query optimization

• uses statistical information about the database.

– Rule-based query optimization algorithm

• based on a set of user-defined rules to determine the best

query access strategy.
Distributed Database Design

 The design of a distributed database introduces three new issues:

– How to partition the database into fragments.
– Which fragments to replicate.
– Where to locate those fragments and replicas.
Data Fragmentation
 Data fragmentation allows us to break a single object
into two or more segments or fragments.
 Three Types of Fragmentation Strategies:

 Horizontal fragmentation

 Vertical fragmentation

 Mixed fragmentation
Data Fragmentation
 Horizontal Fragmentation - Consists of a subset of the tuples
of a relation.
 Fragment represents the equivalent of a SELECT statement, with
the WHERE clause on a single attribute.
Data Fragmentation
 Vertical fragment Consists of a subset of the attributes of a
relation.
 Equivalent to the PROJECT statement.
Data Fragmentation

 Mixed fragment - Consists of a horizontal

fragment that is subsequently vertically
fragmented, or a vertical fragment that is
then horizontally fragmented.
 A mixed fragment is defined using the
Selection and Projection operations of the
relational algebra.
Data Replication

 Data replication refers to the storage of data copies at multiple

sites served by a computer network.
– Enhance data availability and response time, reducing
communication and total query costs.
Data Replication
• Mutual Consistency Rule
– All copies of data fragments be identical.
– DDBMS must ensure that a database update is performed at all
sites where replicas exist.
• Replication Conditions
– Fully Replicated database stores multiple copies of all database
fragments at multiple sites.
– Partially Replicated database stores multiple copies of some
database fragments at multiple sites.
• Factors for Data Replication Decision
– Database Size
– Usage Frequency
Data Allocation
 Data allocation describes the processing of deciding where to locate
data.
 Data Allocation Strategies
– Centralized
The entire database is stored at one site.
– Partitioned
The database is divided into several disjoint parts (fragments) and
stored at several sites.
– Replicated
Copies of one or more database fragments are stored at several
sites.
• Data allocation algorithms

• Data allocation algorithm take into consideration a variety of

factors:

– Performance and data availability goals

– Size, number of rows, the number of relations that an entity

maintains with other entities.

– Types of transactions to be applied to the database, the

attributes accessed by each of those transactions.
Database system architectures
 Parallel versus Distributed Architectures

– There are two main types of multiprocessor system architectures :

■ Shared memory (tightly coupled) architecture. Multiple processors share secondary
(disk) storage and also share primary memory.
– Shared disk (loosely coupled) architecture. Multiple processors share secondary
(disk) storage but each has their own primary memory.
– Shared nothing(parallel processing (MPP)) architecture - multiple processor
architecture in which each processor is part of a complete system, with its own memory
and disk storage.
Some different database system architectures.

 Parallel database architectures:

(a) shared memory;
(b) shared disk;
(c) shared nothing.
 Centralized database architecture

 A truly distributed database architecture.

Shared nothing architecture
Centralized database
Distributed database

Site 1
Client/Server vs. DDBMS
• Client/server architecture refers to the way in which computers
interact to form a system.
Reference architecture for a DDBMS
Questions ?

Unit - 1 DDB
No ratings yet
Unit - 1 DDB
34 pages
Post-Quiz Basisoftesting
100% (4)
Post-Quiz Basisoftesting
3 pages
Distributed Database Concepts
No ratings yet
Distributed Database Concepts
52 pages
Diff 590 Lesson Plan Whats My Word Blending Activity
No ratings yet
Diff 590 Lesson Plan Whats My Word Blending Activity
3 pages
Chapter - 7 Distributed Database System
100% (1)
Chapter - 7 Distributed Database System
54 pages
Distributed Database Systems (DDBS)
No ratings yet
Distributed Database Systems (DDBS)
30 pages
Chapter-7 Distributed Database Systems
No ratings yet
Chapter-7 Distributed Database Systems
40 pages
Chapter - 6 Distributed Database System
No ratings yet
Chapter - 6 Distributed Database System
50 pages
Distributed Database Design
88% (8)
Distributed Database Design
85 pages
IM Ch12 Distributed DBMS Ed12
No ratings yet
IM Ch12 Distributed DBMS Ed12
14 pages
Distributed Database Systems: January 2002
No ratings yet
Distributed Database Systems: January 2002
25 pages
Distributed Transactions Management
100% (3)
Distributed Transactions Management
28 pages
Advanced Database Chapter 6 and 7
No ratings yet
Advanced Database Chapter 6 and 7
30 pages
ADS Chapter 4 Concurrency Control Techniques
No ratings yet
ADS Chapter 4 Concurrency Control Techniques
36 pages
Cse V Database Management Systems
No ratings yet
Cse V Database Management Systems
82 pages
Object Oriented Databases
No ratings yet
Object Oriented Databases
12 pages
DDBMS MCQ - 1
No ratings yet
DDBMS MCQ - 1
10 pages
Unit-1 DDBMS Architecture
No ratings yet
Unit-1 DDBMS Architecture
14 pages
Chap-2-Database Security and Authorization
100% (1)
Chap-2-Database Security and Authorization
38 pages
Fundamentals of DBS - CH - 2
No ratings yet
Fundamentals of DBS - CH - 2
28 pages
Ch#22 TRANSACTION - MANAGEMENT
No ratings yet
Ch#22 TRANSACTION - MANAGEMENT
80 pages
Fundamentals of Database Management System
No ratings yet
Fundamentals of Database Management System
4 pages
Distributed Database: Source
No ratings yet
Distributed Database: Source
19 pages
DBMS Question Bank
100% (2)
DBMS Question Bank
13 pages
Chapter 1 - Query Processing and Optimization
No ratings yet
Chapter 1 - Query Processing and Optimization
62 pages
CH 10 Questions
No ratings yet
CH 10 Questions
5 pages
Dbms Lab File
100% (1)
Dbms Lab File
30 pages
Cs9152 DBT Unit I Notes
100% (1)
Cs9152 DBT Unit I Notes
53 pages
DBMS - LAB Manual
No ratings yet
DBMS - LAB Manual
22 pages
Distributed Databases
No ratings yet
Distributed Databases
39 pages
Chapter 1 Query Processing and Optimization
No ratings yet
Chapter 1 Query Processing and Optimization
129 pages
Distributed Catalog Management
100% (1)
Distributed Catalog Management
12 pages
Distributed Database Management Notes - 1
100% (11)
Distributed Database Management Notes - 1
21 pages
Data Recovery Presentation
No ratings yet
Data Recovery Presentation
8 pages
Chapter 5 - Database Management System
No ratings yet
Chapter 5 - Database Management System
30 pages
Chapter 4 Distributed Database Systems
No ratings yet
Chapter 4 Distributed Database Systems
69 pages
What Is Relational Model
No ratings yet
What Is Relational Model
7 pages
Unit I - Relational Database Part - A (2 Marks) : 1. Define Database Management System
No ratings yet
Unit I - Relational Database Part - A (2 Marks) : 1. Define Database Management System
27 pages
FDB For Exit Exam
No ratings yet
FDB For Exit Exam
284 pages
DBMS Questions Answers
100% (1)
DBMS Questions Answers
64 pages
7 Query Localization
No ratings yet
7 Query Localization
27 pages
Distributed Database System
No ratings yet
Distributed Database System
100 pages
Lab Manual 1 Database Systems
100% (1)
Lab Manual 1 Database Systems
7 pages
ADBMS Sem 1 Mumbai University (MSC - CS)
No ratings yet
ADBMS Sem 1 Mumbai University (MSC - CS)
39 pages
Chapter 8 Database Administration and Security
100% (1)
Chapter 8 Database Administration and Security
21 pages
Unit-3 Part 1 Normalization
No ratings yet
Unit-3 Part 1 Normalization
31 pages
004 Database Management System
No ratings yet
004 Database Management System
20 pages
Database System Development Lifecycle: From Chapter 10, "Database Systems" by Connolly and Begg (5 Edn, 2010)
No ratings yet
Database System Development Lifecycle: From Chapter 10, "Database Systems" by Connolly and Begg (5 Edn, 2010)
36 pages
Distributed Database System (KCA045)
No ratings yet
Distributed Database System (KCA045)
9 pages
Database Administration and Management
No ratings yet
Database Administration and Management
16 pages
Database Managment System
No ratings yet
Database Managment System
18 pages
Chapter - 7 Distributed Database System
0% (1)
Chapter - 7 Distributed Database System
54 pages
10-Distributed Databases Lecturer 3 Best
No ratings yet
10-Distributed Databases Lecturer 3 Best
55 pages
DDS Lecture 2
0% (1)
DDS Lecture 2
38 pages
Distributed Database Management Systems
No ratings yet
Distributed Database Management Systems
23 pages
Distributed Databases
No ratings yet
Distributed Databases
24 pages
DDB Unit 1-5
No ratings yet
DDB Unit 1-5
190 pages
Topic 7 DDBMS
No ratings yet
Topic 7 DDBMS
28 pages
Distributed Database Management Systems
No ratings yet
Distributed Database Management Systems
73 pages
Adbms Chapter 7 Ddbms
No ratings yet
Adbms Chapter 7 Ddbms
73 pages
Distributed DBMS (Good)
No ratings yet
Distributed DBMS (Good)
58 pages
Database And Computer Management: SERIES 1, #3
From Everand
Database And Computer Management: SERIES 1, #3
Elias Mutegi
No ratings yet
Advanced Database Systems Chapter One Query Processing & Optimization
No ratings yet
Advanced Database Systems Chapter One Query Processing & Optimization
22 pages
Ass 3
No ratings yet
Ass 3
1 page
Lab Manual
No ratings yet
Lab Manual
44 pages
Assignment Ex
No ratings yet
Assignment Ex
1 page
Chapter - 6 Database Security and Authorization
50% (2)
Chapter - 6 Database Security and Authorization
25 pages
Chapter - 3 TRANSACTION PROCESSING
No ratings yet
Chapter - 3 TRANSACTION PROCESSING
51 pages
Chapter - 2 Query Processing
No ratings yet
Chapter - 2 Query Processing
63 pages
Oxygen Therapy. Methods of Oxygenation
No ratings yet
Oxygen Therapy. Methods of Oxygenation
74 pages
Chapter 3 REGULAR EXPRESSION
No ratings yet
Chapter 3 REGULAR EXPRESSION
28 pages
Chapter 1 Introduction To The Theory of Computation
No ratings yet
Chapter 1 Introduction To The Theory of Computation
70 pages
TAB1
No ratings yet
TAB1
120 pages
C # Introduction
No ratings yet
C # Introduction
2 pages
Telephone Etiquette
No ratings yet
Telephone Etiquette
15 pages
GK Questions For Class 4, 5 & 6: Age Group: 9-12 Years
No ratings yet
GK Questions For Class 4, 5 & 6: Age Group: 9-12 Years
2 pages
Full Text 01
No ratings yet
Full Text 01
91 pages
MATLAB Course - Part 1
No ratings yet
MATLAB Course - Part 1
72 pages
Vula - STA1000S, 2024 - Tests & Quizzes
No ratings yet
Vula - STA1000S, 2024 - Tests & Quizzes
2 pages
G10 Analytical Listening
No ratings yet
G10 Analytical Listening
27 pages
Chapter 7: Great Books in The Philippines: 4.0 Intended Learning Outcomes
No ratings yet
Chapter 7: Great Books in The Philippines: 4.0 Intended Learning Outcomes
15 pages
My Movies Assignment
No ratings yet
My Movies Assignment
1 page
ACTIVITY in Eng
No ratings yet
ACTIVITY in Eng
11 pages
Trisomy 22
No ratings yet
Trisomy 22
2 pages
The Empty Drum
No ratings yet
The Empty Drum
1 page
Baby Name MeaningNumerologyGenderAdd To Fav
No ratings yet
Baby Name MeaningNumerologyGenderAdd To Fav
53 pages
01 Sets & Relations - PMD
No ratings yet
01 Sets & Relations - PMD
3 pages
Second-Person Narrative - Wikipedia, The Free Encyclopedia
No ratings yet
Second-Person Narrative - Wikipedia, The Free Encyclopedia
5 pages
Assignment MS Office
No ratings yet
Assignment MS Office
3 pages
Facilitate Learning Sessions
100% (1)
Facilitate Learning Sessions
87 pages
SS2 SECOND TERM Computer Science Notebook
No ratings yet
SS2 SECOND TERM Computer Science Notebook
38 pages
Tamil Sec 2024-25
No ratings yet
Tamil Sec 2024-25
13 pages
Sermon Shaped For Serving God
No ratings yet
Sermon Shaped For Serving God
5 pages
Figure of Speech 2
No ratings yet
Figure of Speech 2
9 pages
Mapa Mental. GA4-240202501-AA1-EV02
50% (2)
Mapa Mental. GA4-240202501-AA1-EV02
2 pages
DC - AC Converter
No ratings yet
DC - AC Converter
10 pages
Guru Gobind Singh Ji Marg Details
No ratings yet
Guru Gobind Singh Ji Marg Details
18 pages
Ephemeral City Cheap Print and Urban Culture in Renaissance Venice Rosa Salzberg Instant Download
100% (1)
Ephemeral City Cheap Print and Urban Culture in Renaissance Venice Rosa Salzberg Instant Download
52 pages
Atanua Guia Uso Basics
No ratings yet
Atanua Guia Uso Basics
7 pages
Literal Rule or Plain Meaning Rule
No ratings yet
Literal Rule or Plain Meaning Rule
5 pages

Chapter - 7 Distributed Database System

Uploaded by

Chapter - 7 Distributed Database System

Uploaded by

Chapter - 7

Distributed Databases system

2 Data Fragmentation, Replication and Allocation

3 Types of Distributed Database Systems

5 Concurrency Control and Recovery

6 3-Tier Client-Server Architecture

Site 4 Communications neteork

EMPLOYEES - San Francisco and LA New York

San Francisco Communications neteork

Los Angeles Atlanta

• This is done to minimize access time to the required data.

− Fragmentation transparency: Allows to segment a relation horizontally

– Integrity control more difficult

– Database design more complex

force a homogeneous environment Site 3 Site 2

 Translations required to allow for:

Object Unix Relational

• A federated database system (FDBS) is a collection of cooperating

Multidatabase system (MDBS): A distributed DBMS in which each site

• Single-Site Processing, Single-Site Data (SPSD)

Multiple-Site Processing, Single-Site Data (MPSD)

 Multiple-Site Processing, Multiple-Site Data (MPMD)

 DDBMS transparency features have the common property of

Figure 6.9 Fragment Locations

• Case 2: DB Supports Location Transparency

• Case 3: DB Supports Local Mapping Transparency

will maintain the database’s integrity and consistency.

 Allows a transaction to reference several different (local or

 Two-Phase Commit Protocol

 The two-phase commit protocol requires a

– The write-ahead protocol forces the log entry to be written to permanent

• Subordinates or cohorts - one or more

• The coordinator sends a PREPARE TO COMMIT message to all

– The coordinator broadcasts a COMMIT message to all

– Each subordinate receives the COMMIT message then updates

• The objective of a query optimization routine is to minimize the

– Access time (I/O) cost - involved in accessing the physical

• Query optimization must provide distribution transparency as well

• Replica transparency refers to the DDBMSs ability to hide the

• Query optimization algorithms are based on two principles:

• Selection of the optimum execution order

• Selection of sites to be accessed to minimize communication

– Automatic query optimization means that the DDBMS finds

• Timing of Query Optimization

– Static query optimization takes place at compilation time.

– Statistically based query optimization

• uses statistical information about the database.

– Rule-based query optimization algorithm

• based on a set of user-defined rules to determine the best

 The design of a distributed database introduces three new issues:

 Mixed fragment - Consists of a horizontal

 Data replication refers to the storage of data copies at multiple

• Data allocation algorithm take into consideration a variety of

– Performance and data availability goals

– Size, number of rows, the number of relations that an entity

– Types of transactions to be applied to the database, the

– There are two main types of multiprocessor system architectures :

 Parallel database architectures:

 A truly distributed database architecture.

You might also like