RDBMS - Module5 - Distributed and Parallel DB

Uploaded by

shiniii

CHAPTER 7

Distributed and Parallel Databases

7.1 DISTRIBUTED DATABASES

(GGSIPU, 2011; MDU, Dec 2009, May 2009-10, KU)
A distributed database is a collection of multiple databases that are stored on several computers at various locations and connected to one another through a computer network.
A user sees a distributed database as a single database located on a single computer. The user has no idea that the data being accessed may actually be located at some other site. A distributed database management system is a set of programs that uses a client-server architecture to process information requests.

7.1.1 Types of Distributed Databases

There are two types of distributed databases:

(i) Homogeneous distributed databases: The databases stored at the various geographical regions all run identical database software.

(ii) Heterogeneous distributed databases: The databases stored at the various geographical regions run different database software. For example, one site may be running an Oracle database while another runs a DB2 database.

7.1.2 Design of Distributed Database

The different techniques used for designing distributed databases are:

1. Data Fragmentation: In this technique, a decision is made regarding which portion of the database is to be stored at which location. A relation is broken into different fragments, which are physically stored across various sites. The various ways of fragmenting a relation are:
(a) Horizontal Fragmentation: A relation R is partitioned into many relations, where each new relation consists of some of the tuples of relation R. These new relations are distributed across various sites.

Example: Consider the following student relation

Roll No   Name       Branch   Marks
1         Ashu       CSE      95
2         Binoy      CSE      84
3         Himanshu   IT       79
4         Naina      CSE      70
5         Rashmi     IT       65

Fig. 7.1: Student Relation

This relation can be partitioned according to the branch field of a student, i.e. as follows:

$\text{Student\_Frag1} = \sigma_{\text{Branch} = \text{'CSE'}}(\text{Student})$
$\text{Student\_Frag2} = \sigma_{\text{Branch} = \text{'IT'}}(\text{Student})$

Student_Frag1
Roll No   Name       Branch   Marks
1         Ashu       CSE      95
2         Binoy      CSE      84
4         Naina      CSE      70

Student_Frag2
Roll No   Name       Branch   Marks
3         Himanshu   IT       79
5         Rashmi     IT       65

Fig. 7.2: Horizontal Fragmentation
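The horizontal fragmentation above can be sketched in a few lines of Python. This is only an illustration: the dictionary keys and the helper name `select` are assumptions made for the sketch, and the rows are the ones shown in Fig. 7.1.

```python
# Student relation from Fig. 7.1, one dict per tuple (key names are illustrative).
STUDENT = [
    {"roll_no": 1, "name": "Ashu", "branch": "CSE", "marks": 95},
    {"roll_no": 2, "name": "Binoy", "branch": "CSE", "marks": 84},
    {"roll_no": 3, "name": "Himanshu", "branch": "IT", "marks": 79},
    {"roll_no": 4, "name": "Naina", "branch": "CSE", "marks": 70},
    {"roll_no": 5, "name": "Rashmi", "branch": "IT", "marks": 65},
]


def select(relation, predicate):
    """Relational selection: keep only the tuples that satisfy the predicate."""
    return [row for row in relation if predicate(row)]


# Each horizontal fragment holds whole tuples of one branch.
student_frag1 = select(STUDENT, lambda row: row["branch"] == "CSE")
student_frag2 = select(STUDENT, lambda row: row["branch"] == "IT")

# The original relation is recovered as the union of the fragments.
reconstructed = sorted(student_frag1 + student_frag2, key=lambda row: row["roll_no"])
```

Because the two predicates are disjoint and together cover every branch value in the relation, each tuple lands in exactly one fragment and the union loses nothing.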

(b) Vertical Fragmentation: A relation R is partitioned into many relations, where each new relation consists of only certain attributes of relation R. An additional attribute Tuple_Id is added, which specifies the logical or physical address of a tuple.

Example: The student relation is partitioned into two new relations; one relation contains the RollNo and Marks fields, while the other contains the Name and Branch fields of a student.

$\text{Student\_Vfrag1} = \pi_{\text{RollNo},\, \text{Marks},\, \text{Tuple\_Id}}(\text{Student})$
$\text{Student\_Vfrag2} = \pi_{\text{Name},\, \text{Branch},\, \text{Tuple\_Id}}(\text{Student})$

Student_Vfrag1
Roll No   Marks   Tuple_Id
1         95      1
2         84      2
3         79      3
4         70      4
5         65      5

Student_Vfrag2
Name       Branch   Tuple_Id
Ashu       CSE      1
Binoy      CSE      2
Himanshu   IT       3
Naina      CSE      4
Rashmi     IT       5

Fig. 7.3: Vertical Fragmentation
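A minimal sketch of vertical fragmentation follows, showing why the Tuple_Id attribute matters: it is the key on which the fragments are joined back together. The key names and the helpers `project` and `join_on_tuple_id` are assumptions made for this illustration.

```python
# Student relation from Fig. 7.1 (key names are illustrative).
STUDENT = [
    {"roll_no": 1, "name": "Ashu", "branch": "CSE", "marks": 95},
    {"roll_no": 2, "name": "Binoy", "branch": "CSE", "marks": 84},
    {"roll_no": 3, "name": "Himanshu", "branch": "IT", "marks": 79},
    {"roll_no": 4, "name": "Naina", "branch": "CSE", "marks": 70},
    {"roll_no": 5, "name": "Rashmi", "branch": "IT", "marks": 65},
]


def project(relation, attributes):
    """Relational projection: keep only the named attributes of each tuple."""
    return [{name: row[name] for name in attributes} for row in relation]


def join_on_tuple_id(left, right):
    """Join two vertical fragments on their shared tuple_id attribute."""
    right_by_id = {row["tuple_id"]: row for row in right}
    return [{**row, **right_by_id[row["tuple_id"]]} for row in left]


# Tag every tuple with an identifier before splitting the attributes.
with_ids = [dict(row, tuple_id=i) for i, row in enumerate(STUDENT, start=1)]

student_vfrag1 = project(with_ids, ["roll_no", "marks", "tuple_id"])
student_vfrag2 = project(with_ids, ["name", "branch", "tuple_id"])

# Joining on tuple_id and dropping it again gives back the original relation.
rebuilt = project(
    join_on_tuple_id(student_vfrag1, student_vfrag2),
    ["roll_no", "name", "branch", "marks"],
)
```

Without the shared identifier there would be no reliable way to tell which (RollNo, Marks) pair belongs to which (Name, Branch) pair.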
(c) Mixed Fragmentation: In this type of fragmentation, a relation is first partitioned horizontally and then the new relation obtained is further partitioned vertically, or a relation is first partitioned vertically and then the new relation obtained is further partitioned horizontally.

Example:

$\text{Stud} = \pi_{\text{RollNo},\, \text{Name}}(\sigma_{\text{Branch} = \text{'CSE'}}(\text{Student}))$

Stud
RollNo   Name
1        Ashu
2        Binoy
4        Naina

Fig. 7.4: Mixed Fragmentation
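Mixed fragmentation is simply a composition of the two earlier operations. The sketch below applies the horizontal step (selection on branch) and then the vertical step (projection onto roll number and name); the key names and helper names are assumptions for illustration.

```python
# Student relation from Fig. 7.1 (key names are illustrative).
STUDENT = [
    {"roll_no": 1, "name": "Ashu", "branch": "CSE", "marks": 95},
    {"roll_no": 2, "name": "Binoy", "branch": "CSE", "marks": 84},
    {"roll_no": 3, "name": "Himanshu", "branch": "IT", "marks": 79},
    {"roll_no": 4, "name": "Naina", "branch": "CSE", "marks": 70},
    {"roll_no": 5, "name": "Rashmi", "branch": "IT", "marks": 65},
]


def select(relation, predicate):
    """Horizontal step: keep the tuples that satisfy the predicate."""
    return [row for row in relation if predicate(row)]


def project(relation, attributes):
    """Vertical step: keep only the named attributes."""
    return [{name: row[name] for name in attributes} for row in relation]


# Horizontal fragmentation first, then vertical fragmentation of the result.
cse_students = select(STUDENT, lambda row: row["branch"] == "CSE")
stud = project(cse_students, ["roll_no", "name"])
```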

2. Data Replication: It refers to maintaining more than one copy of the data at several different sites, i.e. identical replicas of a relation are stored at more than one site.

Two types of data replication are:
(a) Fully Replicated Database: A copy of the entire database is stored at more than one site.

(b) Partially Replicated Database: Only some portion of the database is replicated at other sites.

3. Data Allocation: Data allocation is the strategy by which one decides how to place data at different sites. In the centralised strategy, the data and the DBMS are stored at a single site, and users at different sites access this data through a network. Another strategy is to partition the data and store the partitions at different sites, or to keep different copies of the same data at several sites.
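The allocation strategies described above can be pictured as different shapes of a catalog that maps each fragment to the sites holding a copy of it. The following is a rough sketch only; the catalog layout, the function names and the site and fragment names are all hypothetical.

```python
def centralised(fragments, site):
    """Centralised strategy: every fragment lives at one single site."""
    return {fragment: {site} for fragment in fragments}


def partitioned(fragments, sites):
    """Partitioned strategy: each fragment is stored at exactly one site,
    with fragments handed out to the sites in turn."""
    return {
        fragment: {sites[i % len(sites)]} for i, fragment in enumerate(fragments)
    }


def fully_replicated(fragments, sites):
    """Full replication: every site holds a copy of every fragment."""
    return {fragment: set(sites) for fragment in fragments}


def sites_holding(catalog, fragment):
    """Look up where a fragment can be read from."""
    return sorted(catalog[fragment])


FRAGMENTS = ["Student_Frag1", "Student_Frag2"]  # hypothetical fragment names
SITES = ["Site 1", "Site 2", "Site 3"]          # hypothetical site names

central_catalog = centralised(FRAGMENTS, "Site 1")
partition_catalog = partitioned(FRAGMENTS, SITES)
replica_catalog = fully_replicated(FRAGMENTS, SITES)
```

A partially replicated database would sit between the last two: some fragments mapped to several sites, others to just one.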

7.1.3 Architecture of Distributed Database

The following three architectures are used in distributed databases.

1. Shared Nothing Architecture: Every computer located at the various sites has its own local database. All these computers are connected via a network, but none shares its database with the others.

[Figure: Sites 1 to 4 connected by a network, each with its own local database]

Fig. 7.5: Shared Nothing Architecture

2. Centralised Database: Each and every computer located at the various sites is connected through a network and shares a common database.

[Figure: Sites 1 to 4 all connected to one centralised database]

Fig. 7.6: Centralised Database Architecture

3. Truly Distributed Database: Each and every computer located at the various sites, and connected through a network, has its own local database. However, all these databases are shared.

[Figure: Sites 1 to 4 connected to one another through a network]

Fig. 7.7: Truly Distributed Architecture

7.2 PARALLEL DATABASES

In parallel databases, multiple processors work in parallel to perform various operations concurrently. For example, one CPU might be loading data while another is executing a query at the same time.

7.2.1 Architecture of Parallel Databases

(MDU, Dec 2009, May 2009, 2010, 2011, KU)
The three most popular architectures of parallel databases are:

1. Shared Memory Architecture: As the name suggests, all the processors and disks share a common memory. All the processors, disks and memory are connected through a communication network. A processor may also have a local cache so that referencing of shared memory is avoided whenever possible. Processors communicate with each other through memory writes.

[Figure: processors, disks and shared memory connected through an interconnection network]

Fig. 7.8: Shared Memory Architecture of Parallel Database

Advantages:
(a) Data access is fast, as processors communicate through memory writes.
(b) Low communication overhead.
Disadvantages:
(a) Cache coherency: If an update is made to shared memory, it must also be reflected in the local caches.
(b) The architecture is not scalable beyond 32 or 64 processors.
2. Shared Disk Architecture: In this architecture there are multiple processors, and each processor has its own private memory, but they all share some common disks via an interconnection network.

[Figure: processors, each with its own memory, connected to shared disks through an interconnection network]

Fig. 7.9: Shared Disk Architecture of Parallel Database

Advantages:
(a) Since each processor has its own memory, the memory bus is not a bottleneck.
(b) If one processor or memory fails, another can take over.
(c) Load balancing is easy.
Disadvantages:
(a) Problems of scalability: as the number of processors increases, the number of disk accesses also increases, and the interconnection to the disks becomes a bottleneck.
(b) As more processors are added, the existing processors slow down because of increased contention for memory access and network bandwidth.
3. Shared Nothing Architecture: Every processor connected to the interconnection network has its own individual memory and disk. All communication is done through a high-speed communication network.

[Figure: processors, each with its own memory and disk, connected through an interconnection network]

Fig. 7.10: Shared Nothing Architecture of Parallel Database

Advantages:
(a) Better scalability: no sharing of resources minimises contention among processors.
(b) High speed: queries are executed at individual nodes, so only queries requiring access to a non-local disk, and result relations, pass through the network.
(c) Supports a large number of processors.
Disadvantages:
(a) Communication costs are higher.
(b) Load balancing is difficult.
(c) The cost of non-local disk access is higher than in the shared architectures.
(d) Since there is no sharing of disks and data, if one processor fails its data becomes inaccessible to the other processors.
4. Hierarchical Architecture

[Figure: groups of processors, memory and disks connected through an interconnection network]

Fig. 7.11: Hierarchical Architecture of Parallel Database

It is a combination of the shared memory, shared disk and shared nothing architectures. At the top level, the system can be seen as a shared nothing system. Each node, in turn, is a shared memory system; alternatively, each node within the system can be a shared disk system.

Advantages:
(a) Higher performance: higher speed-up and scale-up can be attained with a larger number of CPUs.
(b) Flexibility: more nodes can be added or removed easily.
(c) A single system can serve many users.
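The terms speed-up and scale-up are usually defined as ratios of execution times. The following uses the common textbook definitions; the symbols are notation introduced here for illustration: $T_S(Q)$ is the time a task $Q$ takes on the smaller system, $T_L(Q)$ is the time the same task takes on a system with $N$ times the resources, and $Q_N$ is a task $N$ times larger than $Q$.

```latex
\[
\text{speed-up} = \frac{T_S(Q)}{T_L(Q)},
\qquad
\text{scale-up} = \frac{T_S(Q)}{T_L(Q_N)}
\]
```

Speed-up is called linear when it equals $N$, and scale-up is called linear when it equals 1.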
7.2.2 Query Parallelism
Query parallelism refers to executing multiple queries in parallel, or to decomposing a single query into various parts so that they can all be executed in parallel.
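As a rough sketch of decomposing one query into parts, the example below computes an average mark by letting each worker produce a partial sum and count for its own partition and then combining the partial results. The function names and the way the marks are split are assumptions for illustration, and Python threads merely stand in for the separate processors of a parallel database.

```python
from concurrent.futures import ThreadPoolExecutor


def partial_sum_count(partition):
    """Work done independently on one partition: a partial sum and a count."""
    return sum(partition), len(partition)


def parallel_average(partitions):
    """Run the partial step on every partition concurrently, then combine."""
    with ThreadPoolExecutor(max_workers=len(partitions)) as pool:
        partials = list(pool.map(partial_sum_count, partitions))
    total = sum(part_sum for part_sum, _ in partials)
    count = sum(part_count for _, part_count in partials)
    return total / count


# Marks from the student relation, split across three hypothetical disks.
partitions = [[95, 84], [79, 70], [65]]
average = parallel_average(partitions)
```

The average is decomposed into sum and count because those two can be merged across partitions, whereas per-partition averages cannot simply be averaged again when the partitions differ in size.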
Techniques to achieve query parallelism are:

1. Input/output parallelism: A relation is partitioned and kept on multiple disks to reduce the retrieval time. Each partition is then processed in parallel, and the results are finally combined. The various strategies to partition a relation are:
(a) Hash partitioning: Every tuple of a relation is hashed on some partitioning attribute of the relation. If the hash function returns value i, then this tuple is kept on disk i.

(b) Round robin partitioning: The ith tuple of the relation is kept on disk number $D_{i \bmod n}$, where n is the number of disks. So all tuples are evenly distributed across the disks.

(c) Range partitioning: Distributes a contiguous range of attribute values to each disk. For example, range partitioning with three disks numbered
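The three partitioning strategies can be compared side by side with a small sketch. Everything specific here is an assumption for illustration: the function names, the use of CRC32 as the hash function, the choice of marks as the partitioning attribute, and the range boundaries 70 and 85.

```python
import zlib
from bisect import bisect_right


def hash_disk(key, n_disks):
    """Hash partitioning: hash the partitioning attribute to pick a disk.
    CRC32 is used only because it gives the same value on every run."""
    return zlib.crc32(str(key).encode("utf-8")) % n_disks


def round_robin_disk(i, n_disks):
    """Round robin partitioning: the ith tuple goes to disk i mod n."""
    return i % n_disks


def range_disk(value, boundaries):
    """Range partitioning: values below boundaries[0] go to disk 0, values
    from boundaries[0] up to (not including) boundaries[1] go to disk 1,
    and so on."""
    return bisect_right(boundaries, value)


marks = [95, 84, 79, 70, 65]  # partitioning attribute values, in tuple order
n_disks = 3
BOUNDARIES = [70, 85]         # hypothetical split points for three disks

hash_layout = [hash_disk(m, n_disks) for m in marks]
round_robin_layout = [round_robin_disk(i, n_disks) for i in range(len(marks))]
range_layout = [range_disk(m, BOUNDARIES) for m in marks]
```

With these assumed boundaries, round robin spreads the five tuples as evenly as possible, while range partitioning places three of them on the middle disk, which shows how range partitioning can become skewed when values cluster.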
