Distributed Database Management System
Distributed Database Management System
A DDBMS governs the storage and processing of logically related data over interconnected
computer systems in which both data and processing functions are distributed among several
sites. A DBMS must have at least the following to be classified as distributed.
DDBMS COMPONENTS
Computer workstations (sites or nodes) that form the network system. The distributed
database management system must be independent of the computer system hardware.
Network hardware and software components that reside in each work station. The
network components allow all sites to interact and exchange data. Because the
components are likely to be supplied by different vendors, it is desirable to ensure that
distributed database functions can be run on multiple platforms.
Communication media that carry the data from one workstation to the other. The
DDBMS must use communication media independent i.e it should be able to support
several types of communication media.
The transaction processor (TP). It’s the software component found in each computer
that requests data. The transaction processor receives and processes the application
data requests. It’s also called the application processor (AP) or the transaction
Manager (TM)
The data processor (DP). It’s the software component residing on each computer that
stores and retrieves data located at the site. The DP is also known as the Data Manager
(DM).
The communication among TP and DP is made possible through a specific set of rules
/protocols used by the DDBMS.
Interface with the network to transport data and commands between data processors
(DP) and transaction processors (TP).
Synchronize all data received from DP(TP side) and route retrieved data to the
appropriate TP (DP side)
Ensures common database functions in a distributed system. Such functions include
security, concurrency control, backup and recovery.
A distributed database stores logically related database over two or more physically
independent sites which are connected through a network. In a distributed database, a
database is composed of parts known as fragments. The database fragments are located at
different sites and can be replicated among various sites. Each database fragment is
managed by its local processes.
Current database systems can be classified on the basis of how process distribution and
data distribution are supported. For instance, a DBMS may store data in a single site
(centralized DB) or in multiple sites (distributed DB) and may support data processing at a
single site or at multiple sites. The different types of Data and Process distribution
methods are as follows:
All processing is done on a single CPU or host computer (main frame midrange) and all
the data are stored on the host computer’s local disk. Processing cannot be done on the
end users side of the system. The DBMS is located on the host computer which is
accessed by dumb terminals connected to it.
The TP and DP are embedded within the DBMS loacted on a single computer. The DBMS
usually runs under a time sharing, multitasking operating system which allows several
processes to run concurrently on a host CPU accessing a single DP. All data storage and
data processing are handled by a single host computer
Multiple site processing, single site data(MPSD)
Multiple processes run on different computers sharing a single data repository. The MPSD
require a network file server running conventional applications that are accessed through a
LAN.
In an MPSD:
The TP on each work station acts only as a redirector to route all network data
requests to the file server.
The enduser sees the fileserver as just another hard disk. Because only the data
storage are is handled by the file server computer. The MPSD offers only very
limited distributed processing capabilities.
All data selection, search and update functions take place at the workstation thus
requiring that the entire files travel through the network for processing at the work
stations. Such a requirement increases network traffic, slow response time and
increased communication costs.
The end user must make a direct reference to the file server in order to access
remote data. All record- and file-locking activities are done at the end-user
location.
Describes a fully distributed database management system which support for multiple
data processors and transaction processors at multiple sites. Depending on the level of
support the DDBMS can either be homogeneous or heterogeneous.
Homogeneous DDBMS integrate only one type of centralised DBMS over a network.
Therefore the same DBMS will be running on different machines.
Heterogeneous DDBMS intergrate different types of DBMS over a network . it
supports different DBMS that may even support different data models (relational,
object orinted, hierarchical e.t.c)
Homogeneous DDBMSs integrate only one type of DBMS over a network. Thus, the same
DBMS will be running on different server platforms (single processor server, multiprocessor
server, server farms, or server blades).
In contrast, heterogeneous DDBMSs integrate different types of centralized DBMSs over a
network. A fully heterogeneous DDBMS will support different DBMSs that may even
support different data models (relational, hierarchical, or network) running under different
computer systems, such as mainframes and PCs.