Distributed Database Overview

A distributed database consists of data stored across independent computer systems, each running its own DBMS. The distributed nature of the database is hidden from users through transparency features. Benefits include scalability, reliability, efficiency and data sharing. Key challenges include distributed query optimisation, update propagation, concurrency control and catalog management, all of which arise from the distribution of data and its possible replication.

Uploaded by

Archana Singh

Distributed database overview

A distributed database can be defined as a collection of data whose different parts are under the control of separate DBMSs running on independent computer systems. All the computers are interconnected, and each system has autonomous processing capability serving local applications. Each system also participates in the execution of one or more global applications, that is, applications which require data from more than one site. The distributed nature of the database is hidden from users, and this transparency manifests itself in a number of ways. Although there are a number of advantages to using a distributed DBMS, there are also a number of problems and implementation issues. Finally, data in a distributed DBMS can be partitioned, replicated, or both.

Source: http://www.compapp.dcu.ie/databases/f449.html

Distributed database transparency

A distributed DBMS should provide a number of features which make the distributed nature of the DBMS transparent to the user. These include the following:

Location transparency
Replication transparency
Performance transparency
Transaction transparency
Catalog transparency

The distributed database should look like a centralised system to its users. The problems specific to distribution are confined to the internal level.

Distributed database advantages

There are a number of advantages to using a distributed DBMS. These include the following:

Capacity and incremental growth
Reliability and availability
Efficiency and flexibility
Sharing

Distributed database issues

There are a number of issues or problems which are peculiar to a distributed database and these require novel solutions. These include the following:

Distributed query optimisation
Distributed update propagation
Distributed concurrency control
Distributed catalog management

Distributed query optimisation

In a distributed database, optimisation of queries by the DBMS itself is critical to the efficient performance of the overall system. Query optimisation must take into account the extra communication cost of moving data from site to site, but it can also exploit replication by executing a query against whichever copies of the data are closest. It is therefore a more complex operation than query optimisation in a centralised database.
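The trade-off between local processing and communication cost can be sketched as follows. This is a minimal illustration, not a real optimiser: the `Replica` class, the site names and the per-row cost figures are all assumptions made up for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Replica:
    site: str             # site holding this copy (hypothetical names)
    rows: int             # rows the query must read from this copy
    transfer_cost: float  # assumed cost of shipping one row to the query site


def cheapest_replica(replicas, local_cost_per_row=1.0):
    """Return the replica with the lowest estimated total cost."""
    def total_cost(replica):
        processing = replica.rows * local_cost_per_row
        communication = replica.rows * replica.transfer_cost
        return processing + communication

    return min(replicas, key=total_cost)


replicas = [
    Replica("dublin", rows=1000, transfer_cost=0.0),  # local copy, no shipping
    Replica("cork", rows=1000, transfer_cost=2.5),
    Replica("galway", rows=1000, transfer_cost=4.0),
]
best = cheapest_replica(replicas)
print(best.site)  # dublin
```

A real optimiser would also weigh join order, intermediate result sizes and site load; the sketch only shows why communication cost has to enter the estimate at all.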

Distributed catalog management

In addition to the information held in the system catalog of a centralised DBMS, the entries in a distributed database catalog must specify the site or sites at which each piece of data is stored. This extra information is needed because data may be partitioned and replicated. There are a number of approaches to implementing a distributed database catalog.

Centralised - Keep one master copy of the catalog
Fully replicated - Keep one copy of the catalog at each site
Partitioned - Partition and replicate the catalog as usage patterns demand
Centralised/partitioned - Combination of the above
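The extra location information a distributed catalog carries can be sketched as a mapping from each fragment to the sites that store it. The `DistributedCatalog` class and the fragment and site names below are hypothetical, chosen only to illustrate the idea.

```python
from collections import defaultdict


class DistributedCatalog:
    """Toy catalog: records which sites hold a copy of each fragment."""

    def __init__(self):
        self._sites = defaultdict(set)

    def register(self, fragment, site):
        # Called whenever a fragment is stored (or replicated) at a site.
        self._sites[fragment].add(site)

    def sites_for(self, fragment):
        # .get avoids creating an empty entry for unknown fragments.
        return sorted(self._sites.get(fragment, set()))


catalog = DistributedCatalog()
catalog.register("EMPLOYEE_1", "site_a")  # fragment names are made up
catalog.register("EMPLOYEE_1", "site_b")  # a replicated fragment
catalog.register("EMPLOYEE_2", "site_c")  # a fragment with a single copy

print(catalog.sites_for("EMPLOYEE_1"))  # ['site_a', 'site_b']
```

Under the centralised approach one such structure would live at a single site; under the fully replicated approach every site would hold its own copy of it.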

Distributed update propagation

Update propagation in a distributed database is problematic because replication means there may be more than one copy of a piece of data, and partitioning means the data may be split across sites. Any update performed by any user must be propagated to all copies throughout the database. The use of snapshots is one technique for implementing this.
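The difference between propagating an update to every copy immediately and refreshing a snapshot later can be sketched as below. The `Site` and `Snapshot` classes, the key format and the salary figures are assumptions for illustration; a snapshot is modelled here simply as a read-only copy that is brought up to date on demand.

```python
class Site:
    """A site holding a copy of some data items."""

    def __init__(self, name):
        self.name = name
        self.data = {}


def propagate_update(sites, key, value):
    """Eagerly apply an update to every site holding a copy of the item."""
    for site in sites:
        site.data[key] = value


class Snapshot:
    """Read-only copy of a master site, refreshed only when asked."""

    def __init__(self, master):
        self.master = master
        self.data = dict(master.data)

    def refresh(self):
        self.data = dict(self.master.data)


# Eager propagation: both copies change together.
site_a, site_b = Site("site_a"), Site("site_b")
propagate_update([site_a, site_b], "salary:42", 30000)

# Snapshot: the copy is stale until the next refresh.
snap = Snapshot(site_a)
site_a.data["salary:42"] = 32000
stale_value = snap.data["salary:42"]   # still the old value
snap.refresh()
fresh_value = snap.data["salary:42"]   # now matches the master
```

The snapshot trades freshness for cheaper updates: readers of the snapshot may see old data between refreshes.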

Query optimisation overview

Query optimisation is essential if a DBMS is to achieve acceptable performance and efficiency. Relational systems have an advantage here: relational expressions are at a sufficiently high level that automatic optimisation is feasible in the first place. In non-relational systems, user requests are low level and optimisation must be done manually by the user; the system cannot help. Hence systems which implement optimisation have several advantages over systems that do not. The optimisation process itself involves several stages, one of which is choosing how to implement the relational operators. A different approach to query optimisation, called semantic optimisation, has also been suggested. This technique may be used in combination with the other optimisation techniques and uses constraints specified on the database schema. Consider the SQL query:
SELECT E.LNAME
FROM EMPLOYEE E, EMPLOYEE M
WHERE E.SUPERSSN = M.SSN AND E.SALARY > M.SALARY

This query retrieves the names of employees who earn more than their supervisors. Suppose the database schema carries a constraint stating that no employee can earn more than their supervisor. If the semantic query optimiser checks for the existence of this constraint, it need not execute the query at all, since the result must be empty. This may save considerable time if the checking for constraints can be done efficiently; however, searching through many constraints to find those applicable to a given query can itself be quite time consuming.
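The constraint check described above can be sketched as follows. The encoding is an assumption made for this example: each predicate is a (left, operator, right) triple, the schema constraint is stored in the same form, and only the salary comparison of the query is modelled. The row returned by `execute` is a placeholder, not real data.

```python
# Hypothetical encoding of the schema constraint
# "no employee earns more than their supervisor".
CONSTRAINTS = {("E.SALARY", "<=", "M.SALARY")}

# Each comparison operator mapped to its logical negation.
NEGATION = {">": "<=", "<=": ">", "<": ">=", ">=": "<"}


def contradicts_constraint(predicate, constraints):
    """True if a constraint asserts the exact negation of the predicate."""
    left, op, right = predicate
    return (left, NEGATION[op], right) in constraints


def run_query(predicates, constraints, execute):
    """Skip execution when some predicate can never be satisfied."""
    for predicate in predicates:
        if predicate[1] in NEGATION and contradicts_constraint(predicate, constraints):
            return []  # the result is known to be empty
    return execute()


calls = []


def execute():
    calls.append("executed")      # record that the database was actually hit
    return ["placeholder row"]    # made-up result for the sketch


result = run_query([("E.SALARY", ">", "M.SALARY")], CONSTRAINTS, execute)
print(result, calls)  # [] []
```

The lookup here is a single set membership test; the cost the text warns about appears when many constraints must be matched against arbitrary query predicates.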

Distributed concurrency control

Concurrency control in distributed databases can be done in several ways. Locking and timestamping are two techniques which can be used, but timestamping is generally preferred. The problems of concurrency control are more severe in a distributed DBMS than in a centralised DBMS because data may be replicated and partitioned. If a user wants exclusive access to a piece of data, for example to perform an update or a read, the DBMS must be able to guarantee that access, which is difficult when copies of the data exist at several sites in the distributed database.
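Why replication makes exclusive access harder can be seen in a small locking sketch: a transaction must obtain the lock on every copy, and must give up any locks already taken if one copy is held by someone else. The `ReplicaLock` class, the site names and the transaction names are hypothetical, and the all-or-nothing policy is just one possible choice.

```python
class ReplicaLock:
    """Exclusive lock guarding one copy of a data item at one site."""

    def __init__(self, site):
        self.site = site
        self.holder = None

    def try_acquire(self, txn):
        if self.holder is None or self.holder == txn:
            self.holder = txn
            return True
        return False

    def release(self, txn):
        if self.holder == txn:
            self.holder = None


def lock_all_copies(locks, txn):
    """Acquire every copy's lock, or release what was taken and fail."""
    acquired = []
    for lock in locks:
        if lock.try_acquire(txn):
            acquired.append(lock)
        else:
            for held in acquired:
                held.release(txn)
            return False
    return True


locks = [ReplicaLock(site) for site in ("site_a", "site_b", "site_c")]
locks[2].try_acquire("T2")                   # another transaction holds one copy
first_try = lock_all_copies(locks, "T1")     # fails and backs out
locks[2].release("T2")
second_try = lock_all_copies(locks, "T1")    # succeeds on every copy
```

In a real system each acquisition is a network round trip, which is part of the extra cost the text refers to.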

Timestamping
Timestamping is a method of concurrency control in which every transaction is given a timestamp, a unique date/time/site combination, and the database management system uses one of a number of protocols to schedule transactions which require access to the same piece of data. While more complex to implement than locking, timestamping prevents deadlock from arising in the first place: conflicting transactions are restarted rather than made to wait for one another.
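One such protocol, basic timestamp ordering, can be sketched as below. This is a simplified illustration under stated assumptions: timestamps are (counter, site) pairs drawn from a single shared counter rather than real clocks, and the `Item`, `Aborted`, `read` and `write` names are invented for the example.

```python
import itertools


class Aborted(Exception):
    """Raised when a transaction arrives too late and must restart."""


class Item:
    """A data item with the timestamps of its latest read and write."""

    def __init__(self, value=None):
        self.value = value
        self.read_ts = (0, "")
        self.write_ts = (0, "")


_counter = itertools.count(1)  # assumption: one shared logical clock


def new_timestamp(site_id):
    """(counter, site) pairs are unique and totally ordered."""
    return (next(_counter), site_id)


def read(item, ts):
    if ts < item.write_ts:           # a younger transaction already wrote
        raise Aborted("read too late")
    item.read_ts = max(item.read_ts, ts)
    return item.value


def write(item, ts, value):
    if ts < item.read_ts or ts < item.write_ts:
        raise Aborted("write too late")
    item.value = value
    item.write_ts = ts


x = Item(100)
t1 = new_timestamp("A")   # older transaction
t2 = new_timestamp("B")   # younger transaction
read(x, t2)               # the younger transaction reads first
try:
    write(x, t1, 50)      # the older transaction now writes too late
    outcome = "written"
except Aborted:
    outcome = "aborted"
print(outcome)  # aborted
```

No transaction ever waits for another here; a late operation is rejected and its transaction restarts with a new timestamp, which is why no deadlock can form.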
