Distributed Database System
Distributed Database System
System
Introduction
There are 2 ways in which data can be stored on different sites. These
are:
1. Replication –
In this approach, the entire relationship is stored redundantly at 2 or
more sites. If the entire database is available at all sites, it is a fully
redundant database. Hence, in replication, systems maintain copies of
data.
This is advantageous as it increases the availability of data at different
sites. Also, now query requests can be processed in parallel.
However, it has certain disadvantages as well. Data needs to be
constantly updated. Any change made at one site needs to be recorded
at every site that relation is stored or else it may lead to inconsistency.
This is a lot of overhead. Also, concurrency control becomes way more
complex as concurrent access now needs to be checked over a number
of sites.
Distributed Data Storage
2. Fragmentation –
In this approach, the relations are fragmented (i.e., they’re divided into
smaller parts) and each of the fragments is stored in different sites where
they’re required. It must be made sure that the fragments are such that they
can be used to reconstruct the original relation (i.e, there isn’t any loss of
data).
Fragmentation is advantageous as it doesn’t create copies of data,
consistency is not a problem.