Lecture 8 - Distributed Database Management Systems

Distributed Database Management Systems (DDBMS)

Muhammad Hamiz Mohd Radzi

Objectives
✘ Describe the distributed database (DDB), the DDBMS, distributed processing, and the shared-memory, shared-disk and shared-nothing architectures of a parallel DBMS
✘ Explain the advantages and disadvantages of a DDBMS
✘ Describe homogeneous and heterogeneous DDBMSs and the Multi Database System (MDBS)
✘ Explain the functions and reference architecture of a DDBMS and an MDBS, and the components of the DDBMS architecture
✘ Explain the allocation strategies of Distributed Relational Database Design (DDD): centralized, fragmented, complete replication and partial replication
✘ Explain horizontal, vertical, mixed and derived fragmentation, together with their correctness rules
✘ Describe the distribution, transaction, performance and DBMS transparencies
✘ Describe fragmentation, location, local mapping and naming transparency within Distribution Transparency
DBMS Approach
✘ DB is located at the server
✘ Processing is split between server and client
✘ Less data traffic on the network
[Figure: application programs 1 to 3 (each with data semantics) access the database through the DBMS, which provides description, manipulation and control]
Centralized Database (Distributed Processing)
✘ A database system which resides at one of the nodes of a network of computers.
[Figure: Sites 1 to 5 connected by a communication network]
Problems with Centralized DB
✗ Performance degrades as the number of remote sites grows
✗ High cost of maintaining a large centralized DB
✗ Reliability problems with a single, central site
✗ The site with the database can become a bottleneck
✗ Limited data availability: if the site with the database goes down, no data can be accessed
Concept of DDBMS
✘ The DDBMS was introduced to overcome these problems of the centralized DBMS.
✘ Distributed Database: A logically interrelated collection of shared data (and a description of this data), physically distributed over a computer network.
✘ Distributed DBMS (DDBMS): Software system that permits the management of the distributed database and makes the distribution transparent to users.
✘ Collection of logically-related shared data.
✘ Data split into fragments.
✘ Fragments may be replicated.
✘ Fragments/replicas allocated to sites.
✘ Sites linked by a communications network.
✘ Data at each site is under control of a DBMS.
✘ DBMSs handle local applications autonomously.
✘ Each DBMS participates in at least one global application.
Parallel DBMS
✘ A DBMS running across multiple processors and disks
designed to execute operations in parallel, whenever
possible, to improve performance.

✘ The main architectures are:
✗ Shared memory
✗ Shared disk
✗ Shared nothing
[Figure: parallel database architectures: (a) shared memory, (b) shared disk, (c) shared nothing]
Advantages & Disadvantages of DDBMS
Types of DDBMS
✘ Homogeneous:
✗ All sites use the same DBMS product.
✗ Much easier to design and manage.
✗ Provides incremental growth and allows increased performance.
✘ Heterogeneous:
✗ Sites may run different DBMS products, with possibly different underlying data models.
✗ Occurs when sites have implemented their own databases and integration is considered later.
✗ Translations are required to allow for:
✗ Different hardware.
✗ Different DBMS products.
✗ Different hardware and different DBMS products.

Multi Database Systems (MDBS)
✘ DDBMS in which each site maintains complete autonomy.
✘ DBMS that resides transparently on top of existing database and file systems and presents a single database to its users.
✘ Allows users to access and share data without requiring physical database integration.
✘ Two kinds: unfederated MDBS (no local users) and federated MDBS.

Functions & Architecture of DDBMS
✘ Functions: A DDBMS is expected to have at least the functionality of a centralized DBMS.
✘ It should also provide the following functionality:
✗ Extended communication services.
✗ Extended Data Dictionary.
✗ Distributed query processing.
✗ Extended concurrency control.
✗ Extended recovery services.
✘ Global Conceptual Schema (GCS): Logical description of the whole database, which contains definitions of entities, relationships, constraints, security, and integrity information.
✘ Fragmentation schema: A description of how the data is to be logically partitioned.
✘ Allocation schema: A description of where the data is to be located, taking account of any replication.
✘ Local schemas: Each local DBMS has its own set of schemas.
Reference Architecture for DDBMS
✘ Because DDBMSs are so diverse, there is no accepted architecture equivalent to the ANSI/SPARC 3-level architecture.
✘ A reference architecture consists of:
✗ Set of global external schemas.
✗ Global conceptual schema (GCS).
✗ Fragmentation schema and allocation schema.
✗ Set of schemas for each local DBMS, conforming to the 3-level ANSI/SPARC architecture.
[Figure: reference architecture for a DDBMS]
Reference Architecture for FMDBS
✘ In a DDBMS, the GCS is the union of all local conceptual schemas.
✘ In a federated MDBS (FMDBS), the GCS is a subset of the local conceptual schemas (LCSs), consisting of the data that each local system agrees to share.
✘ The GCS of a tightly coupled system involves integration of either parts of the LCSs or the local external schemas.
✘ An FMDBS with no GCS is called loosely coupled.
[Figure: reference architecture for a tightly coupled FMDBS]
Components of DDBMS Architecture
✘ Global System Catalog (GSC): Holds information such as the fragmentation, replication, and allocation schemas.
✘ Local DBMS (LDBMS): Controls the local data at each site that has a database.
✘ Data Communications (DC): Software that enables all sites to communicate with each other.
Distributed Relational Database Design
✘ Data fragmentation: how to partition the database into fragments
✘ Data replication: which fragments to replicate
✘ Data allocation: where to locate those fragments and replicas

Fragmentation
✘ Definition and allocation of fragments are carried out strategically to achieve:
✗ Locality of Reference.
✗ Improved Reliability and Availability.
✗ Improved Performance.
✗ Balanced Storage Capacities and Costs.
✗ Minimal Communication Costs.
✘ Involves analyzing the most important applications, based on quantitative/qualitative information.
Data Allocation
✘ Centralized: Consists of a single database and DBMS stored at one site, with users distributed across the network.
✘ Partitioned: Database partitioned into disjoint fragments, each fragment assigned to one site.
✘ Complete Replication: Consists of maintaining a complete copy of the database at each site.
✘ Selective Replication: Combination of partitioning, replication, and centralization.
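The four strategies can be told apart by looking at which sites hold which fragments. The following is a minimal Python sketch under that reading; the function `classify`, the fragment names `F1`/`F2` and the site names `S1` to `S3` are hypothetical and serve only as an illustration.

```python
def classify(allocation, all_sites):
    """Name the allocation strategy for a fragment -> set-of-sites mapping."""
    used = set().union(*allocation.values())
    if len(used) == 1:
        return "centralized"            # everything lives at a single site
    if all(sites == all_sites for sites in allocation.values()):
        return "complete replication"   # every site holds every fragment
    if all(len(sites) == 1 for sites in allocation.values()):
        return "partitioned"            # exactly one copy of each fragment
    return "selective replication"      # a mix of the above

sites = {"S1", "S2", "S3"}
print(classify({"F1": {"S1"}, "F2": {"S1"}}, sites))
print(classify({"F1": {"S1"}, "F2": {"S2"}}, sites))
print(classify({"F1": sites, "F2": sites}, sites))
print(classify({"F1": {"S1", "S2"}, "F2": {"S3"}}, sites))
```

In a real design the mapping would come from the allocation schema in the global system catalog rather than from a hand-written dictionary.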
Reasons for Fragmentation
✘ Usage: Applications work with views rather than entire relations.
✘ Efficiency: Data is stored close to where it is most frequently used.
✘ Parallelism: With fragments as the unit of distribution, a transaction can be divided into several subqueries that operate on fragments.
✘ Security: Data not required by local applications is not stored locally, and so is not available to unauthorized users.
Types of Fragmentation
✘ Four types of fragmentation:
✗ Horizontal
✗ Vertical
✗ Mixed
✗ Derived
✘ The other possibility is no fragmentation: if a relation is small and not updated frequently, it may be better not to fragment it.
[Figures: horizontal and vertical fragmentation; mixed fragmentation]
Horizontal Fragmentation
✘ Consists of a subset of the tuples of a relation.
✘ Defined using the Selection operation of relational algebra:
σ_p(R)
✘ Assuming that there are only two property types, Flat and House, the horizontal fragmentation of PropertyForRent by property type can be obtained as follows:
P1 = σ_{type='House'}(PropertyForRent)
P2 = σ_{type='Flat'}(PropertyForRent)
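The selection above can be sketched in Python, assuming the PropertyForRent relation is held in memory as a list of dicts. The sample rows, the attribute names other than `type`, and the helper `select` are hypothetical and only illustrate the idea.

```python
# Hypothetical sample rows; only the 'type' attribute matters here.
property_for_rent = [
    {"propertyNo": "PA14", "type": "House", "rent": 650},
    {"propertyNo": "PL94", "type": "Flat", "rent": 400},
    {"propertyNo": "PG4", "type": "Flat", "rent": 350},
]

def select(relation, predicate):
    """Relational selection: keep the tuples that satisfy the predicate."""
    return [row for row in relation if predicate(row)]

# One horizontal fragment per property type.
p1 = select(property_for_rent, lambda r: r["type"] == "House")
p2 = select(property_for_rent, lambda r: r["type"] == "Flat")

# Horizontal fragments are put back together with a union.
reconstructed = p1 + p2
print(len(p1), len(p2), len(reconstructed))
```

Because the two predicates cover every property type and never overlap, each row lands in exactly one fragment.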
Vertical Fragmentation
✘ Consists of a subset of attributes of a relation.
✘ Defined using the Projection operation of relational algebra:
∏_{a1, ..., an}(R)
✘ For example:
S1 = ∏_{staffNo, position, sex, DOB, salary}(Staff)
S2 = ∏_{staffNo, fName, lName, branchNo}(Staff)
✘ Determined by establishing the affinity of one attribute to another.
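A minimal Python sketch of the Staff example, assuming the relation is a list of dicts. The two sample rows and the helpers `project` and `natural_join` are hypothetical; the attribute names follow the projection lists above.

```python
# Hypothetical Staff rows; attribute names follow the example above.
staff = [
    {"staffNo": "S1", "fName": "Ann", "lName": "Lee", "position": "Manager",
     "sex": "F", "DOB": "1980-01-01", "salary": 30000, "branchNo": "B1"},
    {"staffNo": "S2", "fName": "Bob", "lName": "Tan", "position": "Assistant",
     "sex": "M", "DOB": "1990-05-05", "salary": 12000, "branchNo": "B2"},
]

def project(relation, attributes):
    """Relational projection: keep only the named attributes of every tuple."""
    return [{a: row[a] for a in attributes} for row in relation]

def natural_join(left, right, key):
    """Join two fragments on a shared key attribute."""
    index = {row[key]: row for row in right}
    return [{**l, **index[l[key]]} for l in left if l[key] in index]

s1 = project(staff, ["staffNo", "position", "sex", "DOB", "salary"])
s2 = project(staff, ["staffNo", "fName", "lName", "branchNo"])

# Both fragments repeat the primary key staffNo, so a join rebuilds Staff.
rebuilt = natural_join(s1, s2, "staffNo")
print(rebuilt == staff)
```

The key attribute is the only one stored twice; without it the two fragments could not be matched up again.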
Mixed Fragmentation
✘ Consists of a horizontal fragment that is vertically fragmented, or a vertical fragment that is horizontally fragmented.
✘ Defined using the Selection and Projection operations of relational algebra:
σ_p(∏_{a1, ..., an}(R))
or
∏_{a1, ..., an}(σ_p(R))
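Both orders of composition can be sketched in Python. The Staff rows, the branch predicate and the helpers below are hypothetical; the sketch also assumes that the selection predicate only uses attributes that survive the projection, which is what lets the two orders agree here.

```python
# Hypothetical Staff rows used only for illustration.
staff = [
    {"staffNo": "S1", "position": "Manager", "salary": 30000, "branchNo": "B1"},
    {"staffNo": "S2", "position": "Assistant", "salary": 12000, "branchNo": "B2"},
    {"staffNo": "S3", "position": "Assistant", "salary": 9000, "branchNo": "B1"},
]

def select(relation, predicate):
    """Relational selection: keep the tuples that satisfy the predicate."""
    return [row for row in relation if predicate(row)]

def project(relation, attributes):
    """Relational projection: keep only the named attributes of every tuple."""
    return [{a: row[a] for a in attributes} for row in relation]

# Vertical fragment first, then split it horizontally by branch ...
vertical = project(staff, ["staffNo", "position", "branchNo"])
mixed_a = select(vertical, lambda r: r["branchNo"] == "B1")

# ... or horizontal fragment first, then project it.
horizontal = select(staff, lambda r: r["branchNo"] == "B1")
mixed_b = project(horizontal, ["staffNo", "position", "branchNo"])

print(mixed_a == mixed_b)
```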
Derived Horizontal Fragmentation
✘ A horizontal fragment that is based on the horizontal fragmentation of a parent relation.
✘ Ensures that fragments that are frequently joined together are at the same site.
✘ Defined using the Semijoin operation of relational algebra:
R_i = R ⋉_F S_i, 1 ≤ i ≤ w
where w is the number of horizontal fragments defined on the parent relation S and F is the join attribute.
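The semijoin can be sketched in Python as "keep the child tuples that have a match in the parent fragment". The relations, the attribute `branchNo` and the function `semijoin` are hypothetical and chosen only to keep the example small.

```python
def semijoin(r, s, attribute):
    """R semijoin S on one attribute: tuples of R that match some tuple of S."""
    keys = {row[attribute] for row in s}
    return [row for row in r if row[attribute] in keys]

# Hypothetical parent relation S, already split into two horizontal fragments.
s_fragments = [
    [{"branchNo": "B1", "city": "London"}],
    [{"branchNo": "B2", "city": "Glasgow"}],
]

# Hypothetical child relation R that references S through branchNo.
r = [
    {"staffNo": "S1", "branchNo": "B1"},
    {"staffNo": "S2", "branchNo": "B2"},
    {"staffNo": "S3", "branchNo": "B1"},
]

# One derived fragment of R per parent fragment of S.
r_fragments = [semijoin(r, s_i, "branchNo") for s_i in s_fragments]
print([len(f) for f in r_fragments])
```

Each derived fragment can then be allocated to the same site as its parent fragment, so the join between them needs no network traffic.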
Case study
✘ Suppose that we have these tables in our database.
[Figure: case-study tables, not reproduced in the text]
Question 1
✘ Do a horizontal (row) fragmentation based on:
✗ PROJ1: projects with budget less than $200,000
✗ PROJ2: projects with budget greater than or equal to $200,000
By using RA:
PROJ1 = σ_{BUDGET<200K}(PROJ)
PROJ2 = σ_{BUDGET>=200K}(PROJ)
Reconstruction:
PROJ1 ∪ PROJ2
Question 2
✘ Do a vertical (column) fragmentation based on:
✗ PROJ3: information about project budgets
✗ PROJ4: information about project names and their locations
By using RA:
PROJ3 = ∏_{PNO, BUDGET}(PROJ)
PROJ4 = ∏_{PNO, NAME, LOC}(PROJ)
Reconstruction:
PROJ3 ⨝_{PNO} PROJ4

Question 3
✘ Do a mixed fragmentation based on:
✗ PROJ1&3: information about project budgets, for projects with a budget less than $200,000
✗ PROJ1&4: information about project names and their locations, for projects with a budget less than $200,000
✗ PROJ2&3: information about project budgets, for projects with a budget greater than or equal to $200,000
✗ PROJ2&4: information about project names and their locations, for projects with a budget greater than or equal to $200,000
By using RA:
PROJ1&3 = ∏_{PNO, BUDGET}(σ_{BUDGET<200K}(PROJ))
PROJ1&4 = ∏_{PNO, NAME, LOC}(σ_{BUDGET<200K}(PROJ))
PROJ2&3 = ∏_{PNO, BUDGET}(σ_{BUDGET>=200K}(PROJ))
PROJ2&4 = ∏_{PNO, NAME, LOC}(σ_{BUDGET>=200K}(PROJ))
Reconstruction:
(PROJ1&3 ⨝_{PNO} PROJ1&4) ∪ (PROJ2&3 ⨝_{PNO} PROJ2&4)
Question 4
✘ Do a horizontal fragmentation based on:
✗ PAY1: salary less than $30,000
✗ PAY2: salary greater than or equal to $30,000
By using RA:
PAY1 = σ_{SALARY<30K}(PAY)
PAY2 = σ_{SALARY>=30K}(PAY)
Reconstruction:
PAY1 ∪ PAY2
Question 5
✘ Identify which table is a child table of the PAY table.
✗ Answer: EMPLOYEE

Question 6
✘ Do a derived fragmentation of the EMPLOYEE (EMP) table.
✗ EMP1: employees with salary less than $30,000
✗ EMP2: employees with salary greater than or equal to $30,000
By using RA:
EMP1 = EMP ⋉_{TITLE} PAY1
EMP2 = EMP ⋉_{TITLE} PAY2

No Fragmentation
✘ A final strategy is not to fragment a relation.
✘ For example, the Branch relation contains only a small number of tuples and is not updated very frequently.
✘ Hence, it is better to leave the table whole, because fragmenting it brings no benefit.
Correctness of Fragmentation
Completeness
✘ If relation R is decomposed into fragments R1, R2, ..., Rn, each data item that can be found in R must appear in at least one fragment.
Reconstruction
✘ It must be possible to define a relational operation that will reconstruct R from the fragments.
✘ Reconstruction is the Union operation for horizontal fragmentation and the Join operation for vertical fragmentation.
Disjointness
✘ If data item di appears in fragment Ri, then it should not appear in any other fragment.
✘ Exception: vertical fragmentation, where primary key attributes must be repeated to allow reconstruction.
✘ For horizontal fragmentation, data item is a tuple.
✘ For vertical fragmentation, data item is an attribute.
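For a horizontal fragmentation, the three rules can be checked mechanically. The sketch below is a minimal Python illustration, assuming tuples are stored in sets; the PROJ rows and the three checker functions are hypothetical and not part of any DBMS.

```python
# Hypothetical PROJ rows as (PNO, BUDGET) tuples; values are illustrative only.
proj = {("P1", 150_000), ("P2", 135_000), ("P3", 250_000), ("P4", 310_000)}

proj1 = {t for t in proj if t[1] < 200_000}
proj2 = {t for t in proj if t[1] >= 200_000}
fragments = [proj1, proj2]

def is_complete(relation, fragments):
    """Completeness: every tuple of the relation is in at least one fragment."""
    return all(any(t in f for f in fragments) for t in relation)

def is_reconstructible(relation, fragments):
    """Reconstruction: the union of horizontal fragments equals the relation."""
    return set().union(*fragments) == relation

def is_disjoint(fragments):
    """Disjointness: no tuple appears in more than one fragment."""
    total = sum(len(f) for f in fragments)
    return total == len(set().union(*fragments))

print(is_complete(proj, fragments),
      is_reconstructible(proj, fragments),
      is_disjoint(fragments))
```

A boundary mistake in the predicates, such as using > where >= is intended, would leave a tuple with a budget of exactly 200,000 in no fragment, and the completeness check would catch it.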
Transparencies in a DDBMS
✘ Distribution Transparency
✗ Fragmentation Transparency
✗ Location Transparency
✗ Replication Transparency
✗ Local Mapping Transparency
✗ Naming Transparency
✘ Transaction Transparency
✘ Performance Transparency
✘ DBMS Transparency
Distribution Transparency
✘ Allows management of a physically dispersed database as though it were a centralized database
✘ Supported by a distributed data dictionary (DDD), which contains the description of the entire database as seen by the DBA
✗ The DDD is itself distributed and replicated at the network nodes
✘ Three levels of distribution transparency are recognized:
✗ Fragmentation transparency: the user does not need to know that the database is partitioned; neither fragment names nor fragment locations are needed
✗ Location transparency: the fragment name, but not its location, is required
✗ Local mapping transparency: the user must specify both the fragment name and its location
[Table: a summary of transparency features]
Distribution Transparency
✘ The EMPLOYEE table is divided among three locations (no replication)
✘ Suppose an employee wants to find all employees with a birthdate prior to January 1, 1940
✗ Fragmentation transparency:
SELECT * FROM EMPLOYEE WHERE EMP_DOB < '01-JAN-1940';
✗ Location transparency:
SELECT * FROM E1 WHERE EMP_DOB < '01-JAN-1940' UNION
SELECT * FROM E2 … UNION SELECT * FROM E3 …;
✗ Local mapping transparency:
SELECT * FROM E1 NODE NY WHERE EMP_DOB < '01-JAN-1940'
UNION SELECT * FROM E2 NODE ATL … UNION SELECT * FROM E3 NODE MIA …;
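The three levels can be imitated on a single machine with Python's `sqlite3` module. This is only a sketch under loud assumptions: three attached in-memory SQLite databases stand in for the NY, ATL and MIA nodes, a temporary view plays the role of the global EMPLOYEE relation, the rows are invented, and dates are stored as ISO strings instead of the '01-JAN-1940' format. SQLite has no `NODE` keyword, so the schema prefix (`ny.e1`) takes its place.

```python
import sqlite3

# Assumption: three attached in-memory databases stand in for the sites
# NY, ATL and MIA; a real DDBMS would reach them over a network.
conn = sqlite3.connect(":memory:")
for node, fragment in (("ny", "e1"), ("atl", "e2"), ("mia", "e3")):
    conn.execute(f"ATTACH DATABASE ':memory:' AS {node}")
    conn.execute(f"CREATE TABLE {node}.{fragment} (emp_name TEXT, emp_dob TEXT)")

# Hypothetical rows; dates are ISO strings so that < compares chronologically.
conn.execute("INSERT INTO ny.e1 VALUES ('Ana', '1935-03-02')")
conn.execute("INSERT INTO atl.e2 VALUES ('Ben', '1952-07-19')")
conn.execute("INSERT INTO mia.e3 VALUES ('Cal', '1939-12-31')")

# Local mapping transparency: the query names each fragment and its node.
local_mapping = conn.execute(
    "SELECT * FROM ny.e1 WHERE emp_dob < '1940-01-01' "
    "UNION SELECT * FROM atl.e2 WHERE emp_dob < '1940-01-01' "
    "UNION SELECT * FROM mia.e3 WHERE emp_dob < '1940-01-01'"
).fetchall()

# Location transparency: fragment names only; SQLite finds the database.
location = conn.execute(
    "SELECT * FROM e1 WHERE emp_dob < '1940-01-01' "
    "UNION SELECT * FROM e2 WHERE emp_dob < '1940-01-01' "
    "UNION SELECT * FROM e3 WHERE emp_dob < '1940-01-01'"
).fetchall()

# Fragmentation transparency: a view hides both fragments and nodes.
conn.execute(
    "CREATE TEMP VIEW employee AS "
    "SELECT * FROM ny.e1 UNION ALL SELECT * FROM atl.e2 "
    "UNION ALL SELECT * FROM mia.e3"
)
transparent = conn.execute(
    "SELECT * FROM employee WHERE emp_dob < '1940-01-01'"
).fetchall()

print(sorted(local_mapping) == sorted(location) == sorted(transparent))
```

All three queries return the same employees; what changes is how much of the distribution the person writing the query has to know.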
Naming Transparency
✘ Each item in a DDB must have a unique name.
✘ The DDBMS must ensure that no two sites create a database object with the same name.
✘ One solution is to create a central name server. However, this results in:
✗ loss of some local autonomy;
✗ the central site becoming a possible bottleneck;
✗ low availability: if the central site fails, the remaining sites cannot create any new objects.
Replication Transparency
✘ With replication transparency, the user is unaware of the replication of fragments.
Transaction Transparency
✘ Ensures that database transactions will maintain the distributed database's integrity and consistency
✘ A DDBMS transaction can update data stored in many different computers connected in a network
✗ Transaction transparency ensures that the transaction will be completed only if all database sites involved in the transaction complete their part of the transaction
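The all-or-nothing rule above can be sketched as a coordinator that first asks every site whether it can apply its part, and commits only when all of them agree. The slides do not name a protocol; the `Site` class and `run_transaction` function below are a hypothetical, much simplified sketch in the spirit of two-phase commit, with no logging, timeouts or recovery.

```python
class Site:
    """Hypothetical participant holding a local value and a pending update."""

    def __init__(self, name, value, can_commit=True):
        self.name = name
        self.value = value
        self.pending = None
        self.can_commit = can_commit  # simulates a local failure when False

    def prepare(self, new_value):
        """Stage the update and vote on whether it can be committed."""
        self.pending = new_value
        return self.can_commit

    def commit(self):
        """Make the staged update permanent."""
        self.value, self.pending = self.pending, None

    def rollback(self):
        """Discard the staged update."""
        self.pending = None


def run_transaction(sites, new_value):
    """Commit at every site, or at none of them."""
    votes = [site.prepare(new_value) for site in sites]
    if all(votes):
        for site in sites:
            site.commit()
        return True
    for site in sites:
        site.rollback()
    return False


sites = [Site("A", 1), Site("B", 1), Site("C", 1, can_commit=False)]
ok = run_transaction(sites, 2)
print(ok, [s.value for s in sites])
```

Because site C cannot complete its part, sites A and B keep their old values as well, which is the behaviour the user of a transparent DDBMS relies on.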
Performance Transparency
✘ Performance transparency allows the system to perform as if it were a centralized DBMS.
✘ No performance degradation due to use of a network or platform differences
DBMS Transparency
✘ DBMS transparency hides the knowledge that the local DBMSs may be different, and is therefore only applicable to heterogeneous DDBMSs.
✘ It is one of the most difficult transparencies to provide as a generalization.
References
✘ Thomas Connolly and Carolyn Begg, Database Systems: A Practical Approach to Design, Implementation, and Management, 6th Edition, Pearson, 2015, ISBN: 978-01329432
