Distributed DBMSArchitecture Part I
Distributed DBMSArchitecture Part I
Architecture
Architecture
• Goal:
– present the issues that need to be addressed at
design
– present a framework within which the design and
implementation issues can be discussed
• The ISO/OSI 7-layered reference model
for computer networks
Standardization
Reference Model
– A conceptual framework whose purpose is to divide
standardization work into manageable pieces and to show at a
general level how these pieces are related to one another.
A reference model can be described according to three
different approaches:
• Based on components
• Based on functions
• Based on data
DBMS STANDARDIZATION
• Based on components.
The components of the system are defined together with the
interrelationships between components. A DBMS consists of a
number of components, each of which provides some functionality.
• Based on functions.
The different classes of users are identified and the functions that
the system will perform for each class are defined. The system
specifications within this category typically specify a hierarchical
structure for the user classes. The ISO/OSI architecture fall in this
category.
DBMS STANDARDIZATION
• Based on data.
The different types of data are identified, and an architectural
framework is specified which defines the functional units that will
realize or use data according to these different views. This approach
(also referred as the data logical approach) is claimed to be the
preferable choice for standardization activities.
DBMS STANDARDIZATION
ANSI / SPARC ARCHITECTURE
The ANSI / SPARC architecture is claimed to be based
on the data organization. It recognizes three views of
data:
the external view, which is that of the user, who might be
a programmer; the internal view, that of the system or
machine; and the conceptual view, that of the enterprise.
DBMS STANDARDIZATION
ANSI / SPARC ARCHITECTURE
DBMS STANDARDIZATION
ANSI / SPARC ARCHITECTURE
• At the lowest level of the architecture is the internal view,
which deals with the physical definition and organization
of data.
INTERNAL_REL EMPL [
INDEX ON E# CALL EMINX
FIELD = {
HEADER : BYTE(1)
E# : BYTE(9)
ENAME : BYTE(15)
TIT : BYTE(10)
}
]
Ms. Mariam Nosheen CS- 212 Distributed Database Systems 12
DDBMS Architecture
DBMS STANDARDIZATION
ANSI / SPARC ARCHITECTURE
15
Ms. Mariam Nosheen CS- 212 Distributed Database Systems
DDBMS Architecture
DBMS STANDARDIZATION
ANSI / SPARC ARCHITECTURE
DBMS STANDARDIZATION
ANSI / SPARC ARCHITECTURE
• Two more users:
– Application programmer
– System programmer
• Two user classes:
– Casual user
• Retrieve database and possible update
• Added in external schema
– Novice user
• Typically have no knowledge of data base
• Example (banking machine)
Autonomy
• Distribution of control (and not data) - the degree of
independence
– The local operations of the individual DBMSs are not
affected by their participation in the multidatabase system
– The manner in which individual DBMSs process queries
and optimize them should not be affected by the
execution of global queries
– System consistency should not be compromised when
individual DBMSs join or leave the multidatabase system
Autonomy
• On the other hand specifies the dimension
of autonomy as:
• Design autonomy: Ability of a component DBMS
to decide on issues related to its own design.
• Communication autonomy: Ability of a
component DBMS to decide whether and how to
communicate with other DBMSs.
• Execution autonomy: Ability of a component
DBMS to execute local operations in any manner it
wants to.
Autonomy
• Possibilities:
– Tight integration – a single-image of the entire
database is available to any user who wants to share
the information, which may reside in multiple
databases.
– Semiautonomous system – consist of DBMSs that
can operate independently, but have decided to
participate in a federation to make their local data
sharable.
– Total isolation – the individual systems are stand-
alone DBMSs, which know neither of the existence of
other DBMSs nor how to communicate with them.
Two alternatives:
– client / server distribution
– peer-to-peer distribution (full distribution)
Peer-to-peer distribution.
There is no distinction of client machines versus servers. Each
machine has full DBMS functionality and can communicate with
other machines to execute queries and transactions.
H0 - homogeneous systems
H1 - heterogeneous systems
D0 - no distribution
D1 - client / server systems
D2 - peer-to-peer systems
Client/server
Autonomy
Multi-DBMS
Federated DBMS
Heterogeneity
30
Ms. Mariam Nosheen CS- 212 Distributed Database Systems
DDBMS Architecture
Distributed DBMS
• Distributed database requires distributed
DBMS
• Functions of a distributed DBMS:
– Locate data with a distributed data dictionary
– Determine location from which to retrieve data and
process query components
– DBMS translation between nodes with different local
DBMSs (using middleware)
– Data consistency (via multiphase commit protocols)
– Global primary key control
– Scalability
– Security, concurrency, query optimization, failure
recovery
Ms. Mariam Nosheen CS- 212 Distributed Database Systems 38
DDBMS Architecture
39
Ms. Mariam Nosheen CS- 212 Distributed Database Systems
DDBMS Architecture
3 5
4
Local transaction – all
data stored locally
41
Ms. Mariam Nosheen CS- 212 Distributed Database Systems
DDBMS Architecture
2
3
1
7 6
8
4