Adv DBMS-Unit 2

Uploaded by

Kevin Francis

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

58 views15 pages

Adv DBMS-Unit 2

Uploaded by

Kevin Francis

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 15

Advanced DBMS

Unit-2
Parallel Databases
• Introduction
• I/O Parallelism
• Interquery Parallelism
• Intraquery Parallelism
• Intraoperation Parallelism
• Interoperation Parallelism
• Introduction to Distributed Databases
Introduction

• Parallel machines are becoming quite common and affordable

• Prices of microprocessors, memory and disks have dropped sharply
• Recent desktop computers feature multiple processors and this trend is
projected to accelerate
• Databases are growing increasingly large
• large volumes of transaction data are collected and stored for later analysis.
• multimedia objects like images are increasingly stored in databases
• Large-scale parallel database systems increasingly used for:
• storing large volumes of data
• processing time-consuming decision-support queries
• providing high throughput for transaction processing
Parallelism in Databases
• Data can be partitioned across multiple disks for parallel I/O.
• Individual relational operations (e.g., sort, join, aggregation) can be executed in
parallel
• data can be partitioned and each processor can work independently on its
own partition.
• Queries are expressed in high level language (SQL, translated to relational
algebra)
• makes parallelization easier.
• Different queries can be run in parallel with each other. Concurrency control
takes care of conflicts.
• Thus, databases naturally lend themselves to parallelism.
I/O Parallelism
• Reduce the time required to retrieve relations from disk by partitioning
• The relations on multiple disks.
• Horizontal partitioning – tuples of a relation are divided among many
disks such that each tuple resides on one disk.
• Partitioning techniques (number of disks = n):
Round-robin:
Send the I th tuple inserted in the relation to disk i mod n.
Hash partitioning:
• Choose one or more attributes as the partitioning attributes.
• Choose hash function h with range 0…n - 1
• Let i denote result of hash function h applied to the partitioning
attribute value of a tuple. Send tuple to disk i.
I/O Parallelism (Cont.)
• Partitioning techniques (cont.):
• Range partitioning:
• Choose an attribute as the partitioning attribute.
• A partitioning vector [vo, v1, ..., vn-2] is chosen.
• Let v be the partitioning attribute value of a tuple. Tuples such that vi
 vi+1 go to disk I + 1. Tuples with v < v0 go to disk 0 and tuples with v
 vn-2 go to disk n-1.
E.g., with a partitioning vector [5,11], a tuple with partitioning
attribute value of 2 will go to disk 0, a tuple with value 8 will go to
disk 1, while a tuple with value 20 will go to disk2.
Interquery Parallelism
• Queries/transactions execute in parallel with one another.
• Increases transaction throughput; used primarily to scale up a
transaction processing system to support a larger number of
transactions per second.
• Easiest form of parallelism to support, particularly in a shared-memory
parallel database, because even sequential database systems support
concurrent processing.
• More complicated to implement on shared-disk or shared-nothing
architectures
• Locking and logging must be coordinated by passing messages
between processors.
• Data in a local buffer may have been updated at another processor.
• Cache-coherency has to be maintained — reads and writes of data
in buffer must find latest version of data.
Intraquery Parallelism
• Execution of a single query in parallel on multiple processors/disks; important for
speeding up long-running queries.
• Two complementary forms of intraquery parallelism:
• Intraoperation Parallelism – parallelize the execution of each individual
operation in the query.
• Interoperation Parallelism – execute the different operations in a query
expression in parallel.
the first form scales better with increasing parallelism because
the number of tuples processed by each operation is typically more than the
number of operations in a query.
Introduction to Distributed
Databases
• A distributed database is basically a database that is not
limited to one system, it is spread over different sites, i.e, on
multiple computers or over a network of computers.
• A distributed database system is located on various sites that
don’t share physical components. This may be required when
a particular database needs to be accessed by various users
globally. It needs to be managed such that for the users it
looks like one single database.
Distributed Databases-Types
Homogeneous Database:
In a homogeneous database, all different sites store database
identically. The operating system, database management
system, and the data structures used – all are the same at all
sites. Hence, they’re easy to manage.
Distributed Databases-Types
Heterogeneous Database:
In a heterogeneous distributed database, different sites can use
different schema and software that can lead to problems in
query processing and transactions. Also, a particular site might
be completely unaware of the other sites. Different computers
may use a different operating system, different database
application. They may even use different data models for the
database. Hence, translations are required for different sites to
communicate.
Distributed Databases-Data
Storage
1. Replication –
In this approach, the entire relationship is stored redundantly
at 2 or more sites. If the entire database is available at all
sites, it is a fully redundant database. Hence, in replication,
systems maintain copies of data.
Distributed Databases-Data
Storage
This is advantageous as it increases the availability of data at
different sites. Also, now query requests can be processed in
parallel.
However, it has certain disadvantages as well. Data needs to
be constantly updated. Any change made at one site needs to
be recorded at every site that relation is stored or else it may
lead to inconsistency. This is a lot of overhead. Also,
concurrency control becomes way more complex as
concurrent access now needs to be checked over a number of
sites.
Distributed Databases-Data
Storage
2. Fragmentation –
In this approach, the relations are fragmented (i.e., they’re
divided into smaller parts) and each of the fragments is
stored in different sites where they’re required. It must be
made sure that the fragments are such that they can be used
to reconstruct the original relation (i.e, there isn’t any loss of
data).
Fragmentation is advantageous as it doesn’t create copies of
data, consistency is not a problem.
Distributed Databases-Data
Storage
Fragmentation of relations can be done in two ways:
Horizontal fragmentation – Splitting by rows –
The relation is fragmented into groups of tuples so that each
tuple is assigned to at least one fragment.
Vertical fragmentation – Splitting by columns –
The schema of the relation is divided into smaller schemas.
Each fragment must contain a common candidate key so as
to ensure a lossless join.

ADBMS Chapter 5
No ratings yet
ADBMS Chapter 5
14 pages
ch6 Distributed Database
No ratings yet
ch6 Distributed Database
25 pages
26 Distributed Dbms Nosql
No ratings yet
26 Distributed Dbms Nosql
45 pages
Week 2 Parallel and Distributed Database
No ratings yet
Week 2 Parallel and Distributed Database
7 pages
DBMS-Relational Data Model
100% (2)
DBMS-Relational Data Model
73 pages
Parallel and Distributed Databases in DBMS
No ratings yet
Parallel and Distributed Databases in DBMS
31 pages
Data Communication Basics CH 7
No ratings yet
Data Communication Basics CH 7
27 pages
17 DatabaseArchitectures
No ratings yet
17 DatabaseArchitectures
41 pages
Distributed Databases and Client-Server Architectures
No ratings yet
Distributed Databases and Client-Server Architectures
60 pages
Unit 5 Parallel and Distributed Databases
No ratings yet
Unit 5 Parallel and Distributed Databases
22 pages
Chapter - 7 Distributed Database System
No ratings yet
Chapter - 7 Distributed Database System
29 pages
Ddis U1-3
No ratings yet
Ddis U1-3
40 pages
Unit V NoSQL Databases
No ratings yet
Unit V NoSQL Databases
124 pages
7-Distributed DB
No ratings yet
7-Distributed DB
37 pages
Unit 2-DBP
No ratings yet
Unit 2-DBP
44 pages
Module III
No ratings yet
Module III
132 pages
Chapter 4 - Distributed Database System
No ratings yet
Chapter 4 - Distributed Database System
52 pages
Types of Distributed Data Base System - 49724
No ratings yet
Types of Distributed Data Base System - 49724
37 pages
Elective-I Advanced Database Management Systems: Unit Ii
100% (1)
Elective-I Advanced Database Management Systems: Unit Ii
141 pages
Second Unit ADBMS
No ratings yet
Second Unit ADBMS
53 pages
Unit 5
No ratings yet
Unit 5
28 pages
DB Unit-2
No ratings yet
DB Unit-2
27 pages
DBMS
No ratings yet
DBMS
17 pages
Chapter 4 Distributed Databases
No ratings yet
Chapter 4 Distributed Databases
36 pages
Distributed Databases: Daniel Marcous
No ratings yet
Distributed Databases: Daniel Marcous
41 pages
Chapter 7 - Distributed Database System
No ratings yet
Chapter 7 - Distributed Database System
27 pages
Sayan Ghosh 26900123054 Distributed Database System Cse 6TH Sem
No ratings yet
Sayan Ghosh 26900123054 Distributed Database System Cse 6TH Sem
11 pages
Distributed Databases: by Allyson Moran
No ratings yet
Distributed Databases: by Allyson Moran
37 pages
Unit-2 - Distributed Database System
No ratings yet
Unit-2 - Distributed Database System
7 pages
Ss2 Data Processing 2nd Term
0% (1)
Ss2 Data Processing 2nd Term
33 pages
7 Distributed DB
No ratings yet
7 Distributed DB
38 pages
Unit 2 DDMS
No ratings yet
Unit 2 DDMS
26 pages
Unit V
No ratings yet
Unit V
22 pages
ADTHEORY1
No ratings yet
ADTHEORY1
15 pages
Unit - 2 (1) DBMS
No ratings yet
Unit - 2 (1) DBMS
25 pages
M.C.a. (Sem - IV) Paper - IV - Adavanced Database Techniques
No ratings yet
M.C.a. (Sem - IV) Paper - IV - Adavanced Database Techniques
114 pages
ADBMS Parallel and Distributed Databases
No ratings yet
ADBMS Parallel and Distributed Databases
98 pages
Advanced Database Chapter 6 and 7
No ratings yet
Advanced Database Chapter 6 and 7
30 pages
Introduction To DBMS
No ratings yet
Introduction To DBMS
37 pages
Fundamentals of Database Systems: (Parallel and Distributed Databases)
No ratings yet
Fundamentals of Database Systems: (Parallel and Distributed Databases)
46 pages
Database
No ratings yet
Database
6 pages
Adbms
No ratings yet
Adbms
70 pages
Distributed Databases: Benefits and Issues To Be Considered
No ratings yet
Distributed Databases: Benefits and Issues To Be Considered
25 pages
Chapter 4 Bing
No ratings yet
Chapter 4 Bing
5 pages
Distributed Database System
No ratings yet
Distributed Database System
4 pages
Distributed Database: Database Storage Devices CPU Database Management System Computers Network
No ratings yet
Distributed Database: Database Storage Devices CPU Database Management System Computers Network
9 pages
Distributed Databases: by Chien-Pin Hsu CS157B Section 1 Nov 11, 2004
No ratings yet
Distributed Databases: by Chien-Pin Hsu CS157B Section 1 Nov 11, 2004
24 pages
Distributed DB
No ratings yet
Distributed DB
16 pages
DBMS-Unit 5
No ratings yet
DBMS-Unit 5
27 pages
ParallelDBs PDF
No ratings yet
ParallelDBs PDF
23 pages
Parallel & Distributed Databases: C S 5 6 1 - S P R I N G 2 0 1 2 Wpi, Mohamed Eltabakh
No ratings yet
Parallel & Distributed Databases: C S 5 6 1 - S P R I N G 2 0 1 2 Wpi, Mohamed Eltabakh
23 pages
Enterprise Systems: Distributed Databases and Systems - DT211 4
No ratings yet
Enterprise Systems: Distributed Databases and Systems - DT211 4
25 pages
DDB Slides
No ratings yet
DDB Slides
30 pages
Tybca Recent Trends in It Chpter 1
No ratings yet
Tybca Recent Trends in It Chpter 1
16 pages
Unit No.4 Parallel Database
No ratings yet
Unit No.4 Parallel Database
32 pages
Distributed Database Concepts
No ratings yet
Distributed Database Concepts
52 pages
Introduction To Parallel Databases
No ratings yet
Introduction To Parallel Databases
24 pages
Distributed Database
100% (1)
Distributed Database
24 pages
Distributed Database Vs Conventional Database
50% (2)
Distributed Database Vs Conventional Database
4 pages
Dbms Notes For All Units
No ratings yet
Dbms Notes For All Units
73 pages
Distributed Database Systems: January 2002
No ratings yet
Distributed Database Systems: January 2002
25 pages
A Distributed Database Management System ('DDBMS') Is A Software System
No ratings yet
A Distributed Database Management System ('DDBMS') Is A Software System
5 pages
Most Frequently Asked SQL Interview Questions
No ratings yet
Most Frequently Asked SQL Interview Questions
43 pages
Unit - 1 - Database Concepts
No ratings yet
Unit - 1 - Database Concepts
6 pages
2nd Year NEP Syllabus
No ratings yet
2nd Year NEP Syllabus
30 pages
Technology Guide 3: Information Technology For Management 4 Edition Turban, Mclean, Wetherbe John Wiley & Sons, Inc
No ratings yet
Technology Guide 3: Information Technology For Management 4 Edition Turban, Mclean, Wetherbe John Wiley & Sons, Inc
29 pages
FIT ACADEMY Business Analytics With Power BI
No ratings yet
FIT ACADEMY Business Analytics With Power BI
71 pages
CS3492 Database Management Systems Two Mark Questions 1
No ratings yet
CS3492 Database Management Systems Two Mark Questions 1
43 pages
IS222 S12018 FE Sample Answers
100% (1)
IS222 S12018 FE Sample Answers
18 pages
DBMS HandBook
No ratings yet
DBMS HandBook
27 pages
Unit Ii
No ratings yet
Unit Ii
45 pages
S18 CS5002NP CW2 17031949 Aashish Parajuli
No ratings yet
S18 CS5002NP CW2 17031949 Aashish Parajuli
55 pages
XML Programming With SQL/XML and Xquery: Facto Standard For Retrieving and Exchanging
No ratings yet
XML Programming With SQL/XML and Xquery: Facto Standard For Retrieving and Exchanging
24 pages
20 Computer Science ANU 2020-21
No ratings yet
20 Computer Science ANU 2020-21
56 pages
Database Objective Type Questions
No ratings yet
Database Objective Type Questions
12 pages
Gatepass - Aspdotnet - Full Doucment
No ratings yet
Gatepass - Aspdotnet - Full Doucment
59 pages
All Model Exam
No ratings yet
All Model Exam
13 pages
Structured Query Language
No ratings yet
Structured Query Language
29 pages
Unit-3 Notes
No ratings yet
Unit-3 Notes
21 pages
Est
No ratings yet
Est
39 pages
Brief History of The Relational Model
No ratings yet
Brief History of The Relational Model
5 pages
CSC 313 - Last-Note - OS
No ratings yet
CSC 313 - Last-Note - OS
25 pages
Oral Questions and Answers For Dbms Mysql Mongodb Nosql
No ratings yet
Oral Questions and Answers For Dbms Mysql Mongodb Nosql
10 pages
Lecture 2
No ratings yet
Lecture 2
37 pages
hcs219 Assignment 2
No ratings yet
hcs219 Assignment 2
5 pages
B.C.A. Syllabus
No ratings yet
B.C.A. Syllabus
27 pages
2025 DM4ML Assign1
No ratings yet
2025 DM4ML Assign1
6 pages
SQL Server Question Paper - 1
No ratings yet
SQL Server Question Paper - 1
3 pages
Redacted Resume
No ratings yet
Redacted Resume
1 page
Breaking the Availability Barrier Ii: Achieving Century Uptimes with Active/Active Systems
From Everand
Breaking the Availability Barrier Ii: Achieving Century Uptimes with Active/Active Systems
Dr. Bruce Holenstein
No ratings yet

Adv DBMS-Unit 2

Uploaded by

Adv DBMS-Unit 2

Uploaded by

Advanced DBMS

• Parallel machines are becoming quite common and affordable

You might also like