DBMS
Distributed Data Storage
• Replication
– System maintains multiple copies of data, stored at different
sites, for faster retrieval and fault tolerance.
• Fragmentation
– Relation is partitioned into several fragments stored at distinct
sites.
Advantages of Replication
– Availability: failure of a site containing relation r does not
result in unavailability of r if replicas exist.
– Parallelism: queries on r may be processed by several
nodes in parallel.
– Reduced data transfer: relation r is available locally at
each site containing a replica of r.
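The availability point can be illustrated with a minimal sketch. The `Site` class and the `read_relation` helper are hypothetical names introduced here for illustration; they model each site as an in-memory copy of relation r, not a real DBMS interface.

```python
class Site:
    """Hypothetical site holding a full replica of relation r."""

    def __init__(self, name, rows, up=True):
        self.name = name
        self.rows = list(rows)
        self.up = up

    def read(self):
        if not self.up:
            raise ConnectionError(f"site {self.name} is down")
        return list(self.rows)


def read_relation(sites):
    """Return (site name, rows) from the first replica that answers."""
    for site in sites:
        try:
            return site.name, site.read()
        except ConnectionError:
            continue  # this site failed; try the next replica
    raise RuntimeError("no replica of r is available")


r = [(1, "Alice"), (2, "Bob")]
# Site A has failed, but r is still readable from the replicas at B and C.
replicas = [Site("A", r, up=False), Site("B", r), Site("C", r)]
served_by, rows = read_relation(replicas)
print(served_by, rows)
```

With a single copy at site A the read would fail; with replicas, the query is simply served by another site.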
Disadvantages of Replication
– Increased cost of updates: each replica of relation r must be
updated.
– Increased complexity of concurrency control: concurrent
updates to distinct replicas may lead to inconsistent data
unless special concurrency control mechanisms are
implemented.
One solution: choose one copy as the primary copy and apply
concurrency control operations on the primary copy.
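A rough sketch of the primary copy idea, under simplifying assumptions: `ReplicatedItem` is a hypothetical class, all sites live in one process, and a `threading.Lock` stands in for the lock held at the primary site. A real system would also have to handle network messages and primary failure.

```python
import threading


class ReplicatedItem:
    """Hypothetical replicated data item with a designated primary copy."""

    def __init__(self, value, sites):
        self.copies = {site: value for site in sites}
        self.primary = sites[0]        # assumption: the first site is the primary
        self._lock = threading.Lock()  # stands in for the lock at the primary site

    def update(self, fn):
        # Every writer must acquire the primary's lock first, so updates are
        # serialized and all replicas see them in the same order.
        with self._lock:
            new_value = fn(self.copies[self.primary])
            for site in self.copies:   # each replica must be updated
                self.copies[site] = new_value
            return new_value


balance = ReplicatedItem(100, ["S1", "S2", "S3"])
threads = [
    threading.Thread(target=balance.update, args=(lambda v: v + 10,))
    for _ in range(5)
]
for t in threads:
    t.start()
for t in threads:
    t.join()
print(balance.copies)
```

The loop over `self.copies` also makes the update cost visible: one logical write turns into one write per replica.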
Data Fragmentation
Advantages of Fragmentation
Horizontal:
– allows parallel processing on fragments of a
relation
– allows a relation to be split so that tuples are
located where they are most frequently accessed
Vertical:
– allows tuples to be split so that each part of the tuple
is stored where it is most frequently accessed
– tuple-id attribute allows efficient joining of vertical
fragments
– allows parallel processing on a relation
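Both kinds of fragmentation can be sketched on a small in-memory relation. The `account` data, the attribute names, and the three helper functions are illustrative assumptions, not taken from the slides. The sketch shows that horizontal fragments are recombined by union, and vertical fragments by a join on the tuple-id attribute.

```python
# Hypothetical relation; tuple_id is the attribute added for vertical fragmentation.
account = [
    {"tuple_id": 1, "branch": "Hillside", "account_no": "A-305", "balance": 500},
    {"tuple_id": 2, "branch": "Hillside", "account_no": "A-226", "balance": 336},
    {"tuple_id": 3, "branch": "Valleyview", "account_no": "A-177", "balance": 205},
]


def horizontal_fragment(relation, predicate):
    """Select whole tuples: each fragment holds a subset of the rows."""
    return [row for row in relation if predicate(row)]


def vertical_fragment(relation, attrs):
    """Project columns: each fragment keeps tuple_id plus some attributes."""
    return [{a: row[a] for a in ("tuple_id", *attrs)} for row in relation]


def join_on_tuple_id(left, right):
    """Rebuild full tuples by matching fragments on tuple_id."""
    right_by_id = {row["tuple_id"]: row for row in right}
    return [{**l, **right_by_id[l["tuple_id"]]} for l in left]


# Horizontal: store each branch's tuples at the site that uses them most.
hillside = horizontal_fragment(account, lambda r: r["branch"] == "Hillside")
valleyview = horizontal_fragment(account, lambda r: r["branch"] == "Valleyview")
rebuilt_h = sorted(hillside + valleyview, key=lambda r: r["tuple_id"])

# Vertical: split the attributes across two sites, then rejoin on tuple_id.
deposit1 = vertical_fragment(account, ["branch", "account_no"])
deposit2 = vertical_fragment(account, ["balance"])
rebuilt_v = join_on_tuple_id(deposit1, deposit2)

print(rebuilt_h == account, rebuilt_v == account)
```

The dictionary lookup in `join_on_tuple_id` is what makes the join cheap: each left tuple finds its partner by tuple-id directly instead of scanning the other fragment.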
That’s all for now…