0% found this document useful (0 votes)
36 views6 pages

Assignment 01

The document discusses a distributed database management system (DDBMS). Key points: 1. A DDBMS contains a single logical database that is divided into fragments stored across multiple computers connected by a network. Each computer can process local and global requests. 2. A DDBMS has characteristics like shared data fragments, replication of fragments across sites, and each site is controlled by a local DBMS while also participating in global applications. 3. Data can be stored across sites through replication of entire relations or fragmentation of relations into horizontal or vertical partitions stored at different sites.

Uploaded by

Dil Nawaz
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
36 views6 pages

Assignment 01

The document discusses a distributed database management system (DDBMS). Key points: 1. A DDBMS contains a single logical database that is divided into fragments stored across multiple computers connected by a network. Each computer can process local and global requests. 2. A DDBMS has characteristics like shared data fragments, replication of fragments across sites, and each site is controlled by a local DBMS while also participating in global applications. 3. Data can be stored across sites through replication of entire relations or fragmentation of relations into horizontal or vertical partitions stored at different sites.

Uploaded by

Dil Nawaz
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 6

Name : Umar Ali

Roll Number : 522

Class # : BS (CS) – 6th Semester

Assignment # : 01

Submitted to : Sir Tariq

Subject : “CS-512 Distributed


Database”
Q1. Distributed Database Management System:
A Distributed Database Management System (DDBMS) contains a single logical
database that is divided into a number of fragments. Every fragment gets stored
on one or more computers under the control of a separate DBMS, with the
computers connected by a communications network. Each position is capable of
independently process every user's requests that require access to local data
(i.e., each position of the distributed system has some basic degree of local
autonomy) and is also able to process data stored on other computers within the
network. Users access the distributed database via applications that are
classified as those which do not need data from other sites (local applications);
and also those that do need data from other sites (global applications). You will
be requiring a DDBMS to have at least one global application. A DDBMS,
therefore, has the following characteristics:

 a collection of logically related shared data


 n the data is split into a number of fragments
 fragments may be replicated
 fragments/replicas are allocated to sites
 the sites are linked by a communications network
 the data at each site is under the control of a DBMS
 the DBMS at each site can handle local applications, autonomously
 each DBMS participates in at least one global application

A distributed database is basically a database that is not limited to one system, it


is spread over different sites, i.e, on multiple computers or over a network of
computers. A distributed database system is located on various sited that don’t
share physical components. This maybe required when a particular database
needs to be accessed by various users globally. It needs to be managed such
that for the users it looks like one single database.
Types of (DDBS):

1. Homogeneous Database:

In a homogeneous database, all different sites store database identically. The


operating system, database management system and the data structures used –
all are same at all sites. Hence, they’re easy to manage.

2. Heterogeneous Database:

In a heterogeneous distributed database, different sites can use different schema


and software that can lead to problems in query processing and transactions.
Also, a particular site might be completely unaware of the other sites. Different
computers may use a different operating system, different database application.
They may even use different data models for the database. Hence, translations
are required for different sites to communicate.

Distributed Data Storage-


There are 2 ways in which data can be stored on different sites. These are:
1. Replication
In this approach, the entire relation is stored redundantly at 2 or more sites. If the
entire database is available at all sites, it is a fully redundant database. Hence, in
replication, systems maintain copies of data.
This is advantageous as it increases the availability of data at different sites.
Also, now query requests can be processed in parallel.
However, it has certain disadvantages as well. Data needs to be constantly
updated. Any change made at one site needs to be recorded at every site that
relation is stored or else it may lead to inconsistency. This is a lot of overhead.
Also, concurrency control becomes way more complex as concurrent access
now needs to be checked over a number of sites.

2. Fragmentation
In this approach, the relations are fragmented (i.e., they’re divided into smaller
parts) and each of the fragments is stored in different sites where they’re
required. It must be made sure that the fragments are such that they can be used
to reconstruct the original relation (i.e, there isn’t any loss of data).
Fragmentation is advantageous as it doesn’t create copies of data, consistency is
not a problem.
Fragmentation of relations can be done in two ways:

 Horizontal fragmentation – Splitting by rows – The relation is fragmented


into groups of tuples so that each tuple is assigned to at least one
fragment.

 Vertical fragmentation – Splitting by columns – The schema of the relation


is divided into smaller schemas. Each fragment must contain a common
candidate key so as to ensure lossless join.
The distributed database management system contains the data in multiple locations.
That can be in different systems in the same place or across different geographical
locations.
As shown in the below example −

The database is divided into multiple locations and stores the data in Site1, Site2,Site3
and Site4.
The advantages and disadvantages of Distributed database management systems are
as follows −

Advantages of DDBMS-
 The database is easier to expand as it is already spread across multiple systems
and it is not too complicated to add a system.
 The distributed database can have the data arranged according to different levels
of transparency i.e data with different transparency levels can be stored at
different locations.
 The database can be stored according to the departmental information in an
organisation. In that case, it is easier for a organisational hierarchical access.
 there were a natural catastrophe such as fire or an earthquake all the data would
not be destroyed it is stored at different locations.
 It is cheaper to create a network of systems containing a part of the database.
This database can also be easily increased or decreased.
 Even if some of the data nodes go offline, the rest of the database can continue
its normal functions.
Disadvantages of DDBMS-
 The distributed database is quite complex and it is difficult to make sure that a
user gets a uniform view of the database because it is spread across multiple
locations.
 This database is more expensive as it is complex and hence, difficult to maintain.
 It is difficult to provide security in a distributed database as the database needs
to be secured at all the locations it is stored. Moreover, the infrastructure
connecting all the nodes in a distributed database also needs to be secured.
 It is difficult to maintain data integrity in the distributed database because of its
nature. There can also be data redundancy in the database as it is stored at
multiple locations.
 The distributed database is complicated and it is difficult to find people with the
necessary experience who can manage and maintain it.

You might also like