Distributed Database Management Systems
Distributed Database Management Systems
Lecture 10
10
Evolution of DDBMS
Decentralized database management systems (DDBMS)
Interconnected computer systems Data/processing functions reside on multiple sites
10
DDBMS Advantages
Data located near site with greatest demand Faster data access Faster data processing Growth facilitation Improved communications Reduced operating costs User-friendly interface Less danger of single-point failure Processor independence
3
10
DDBMS Disadvantages
Complexity of management and control Security Lack of standards Increased storage requirements Greater difficulty in managing data environment Increased training costs
10
Distributed Processing
Shares databases logical processing among physically, networked independent sites
Figure 10.1
5
10
Distributed Database
Stores logically related database over physically independent sites
Figure 10.2
6
10
Distributed database
Requires distributed processing
Both
Require a network to connect components
7
10
Functions of DDBMS
Application/end user interface Validation to analyze data requests Transformation to determine request components Query optimization to find the best access strategy Mapping to determine the data location I/O interface to read or write data Formatting to prepare the data for presentation Security to provide data privacy Backup and recovery DB Administration Concurrency Control Transaction Management
8
10
Centralized Database
Figure 10.3
9
10
Figure 10.4
10
10
DDBMS Components
Computer workstations Network hardware and software components Communications media Transaction processor (TP)
Also called application manager (AP) or transaction manager (TM)
11
10
Figure 10.5
12
10
DDBMS Protocols
Interface with network to transport data and commands between DPs and TPs Synchronize data received from DPs and route to appropriate TPs Ensure common database functions
Security Concurrency control Backup and recovery
13
10
Table 10.1
14
10
15
10
Figure 10.6
16
10
Figure 10.7
17
10
Heterogeneous
Integrate different types of centralized DBMSs over a network
18
10
Figure 10.8
19
10
Distributed DB Transparency
Allows end users to feel like only database user Hides complexities of distributed database Transparency features
Distribution Transaction Failure Performance
Heterogeneity
20
10
Distribution Transparency
Allows management of a physically dispersed database as though it were centralized Three Levels
Fragmentation transparency Location transparency Local mapping transparency
Table 10.2
21
10
Transaction Transparency
Ensures transactions maintain integrity and consistency Completed only if all involved database sites complete their part of the transaction Management mechanisms
Remote request Remote transaction Distributed transaction Distributed request
22
10
Remote Request
Figure 10.10
23
10
Remote Transaction
Figure 10.11
24
10
Distributed Transaction
Figure 10.12
25
10
Distributed Requests
Figure 10.13
26
10
Figure 10.14
27
10
28
10
Phases
Preparation
Coordinator sends message to all subordinates Confirms all are ready to commit or abort
Final Commit
Ensures all subordinates have committed or aborted
29
10
Dynamic or static optimization Statistically based vs. rule-based query optimization algorithms
30
10
Fragments to replicate
Storage of data copies at multiple sites Fully, partially, unreplicated databases
Data allocation
Where to locate data Centralized, partitioned, replicated
31
10
10
Client/Server Disadvantages
Creates more complex environment with different platforms Increased number of users and sites creates security problems Training issues become more complex and expensive
33