Parallel Databases

Parallel databases are increasingly common as the cost of hardware has decreased. Large databases require parallelism for storage, queries, and throughput. There are different types of parallelism including interquery, intraquery, interoperation, and intraoperation parallelism. Data can be partitioned horizontally or vertically across multiple disks for parallel input/output and queries can utilize various parallelization techniques. Issues in parallel database design include parallel data loading, resilience to failures, and redundancy.

Introduction
• Parallel machines are becoming quite common and affordable
  • Prices of microprocessors, memory and disks have dropped sharply
  • Recent desktop computers feature multiple processors and this trend is projected to accelerate
• Databases are growing increasingly large
  • Large volumes of transaction data are collected and stored for later analysis
  • Multimedia objects like images are increasingly stored in databases
• Large-scale parallel database systems are increasingly used for:
  • storing large volumes of data
  • processing time-consuming decision-support queries
  • providing high throughput for transaction processing

Parallelism in Databases
• Data can be partitioned across multiple disks for parallel I/O.
• Individual relational operations (e.g., sort, join, aggregation) can be executed in parallel.
• Queries are expressed in a high-level language (SQL, translated to relational algebra)
  • This makes parallelization easier.
• Different queries can be run in parallel with each other. Concurrency control takes care of conflicts.

Partitioning
• Types of partitioning
  • Horizontal partitioning: the tuples of a relation are divided among many disks such that each tuple resides on one disk.
  • Vertical partitioning: the schema of a relation is divided among many disks such that the data fields of each tuple are split and stored on multiple disks.
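
The two types of partitioning can be sketched in a few lines of Python. This is only an illustration: the relation, its schema (id, name, balance), and the function names are hypothetical, and repeating the key column in every vertical fragment is an assumption made here so the tuples can be reassembled, not something the slides specify.

```python
# Hypothetical relation with schema (id, name, balance); not from the slides.
SCHEMA = ("id", "name", "balance")
ROWS = [(1, "Ann", 500), (2, "Bob", 120), (3, "Cy", 900), (4, "Di", 75)]

def horizontal_partition(rows, n):
    """Split whole tuples across n disks; each tuple lands on exactly one disk."""
    disks = [[] for _ in range(n)]
    for i, row in enumerate(rows):
        disks[i % n].append(row)
    return disks

def vertical_partition(rows, schema, column_groups, key="id"):
    """Split the schema into column groups, one fragment (disk) per group.

    Assumption: the key column is repeated in every fragment so the
    original tuples can be rebuilt by joining the fragments on the key.
    """
    key_idx = schema.index(key)
    fragments = []
    for group in column_groups:
        idxs = [schema.index(c) for c in group]
        fragments.append(
            [(row[key_idx],) + tuple(row[j] for j in idxs) for row in rows]
        )
    return fragments

h = horizontal_partition(ROWS, 2)   # two disks, whole tuples on each
v = vertical_partition(ROWS, SCHEMA, [["name"], ["balance"]])  # two column fragments
```

In the horizontal case every disk holds complete tuples. In the vertical case every disk holds all tuples but only some of their fields.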

Partitioning
• Partitioning techniques (number of disks = n):
• Round-robin:
  • Send the i-th tuple inserted in the relation to disk i mod n.
• Hash partitioning:
  • Choose one or more attributes as the partitioning attributes.
  • Choose a hash function h with range 0 … n - 1.
  • Let i denote the result of hash function h applied to the partitioning attribute value of a tuple. Send the tuple to disk i.
• Range partitioning:
  • Choose an attribute as the partitioning attribute.
  • A partitioning vector [v_0, v_1, ..., v_{n-2}] is chosen.
  • Let v be the partitioning attribute value of a tuple. Tuples with v_i ≤ v < v_{i+1} go to disk i + 1. Tuples with v < v_0 go to disk 0 and tuples with v ≥ v_{n-2} go to disk n - 1.
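
The three techniques above each reduce to a function from a tuple (or its insertion position) to a disk number. The sketch below is one possible rendering. The function names are hypothetical, and using CRC32 as the hash function h is an assumption chosen only because it is stable across runs.

```python
import zlib
from bisect import bisect_right

def round_robin_disk(i, n):
    """Disk for the i-th inserted tuple: i mod n."""
    return i % n

def hash_disk(value, n):
    """Disk for a partitioning-attribute value, using a hash with range 0..n-1.

    Assumption: CRC32 of the value's repr stands in for the hash function h.
    """
    return zlib.crc32(repr(value).encode("utf-8")) % n

def range_disk(v, vector):
    """Disk for value v given a sorted partitioning vector [v_0, ..., v_{n-2}].

    v < v_0 goes to disk 0, v_i <= v < v_{i+1} goes to disk i + 1,
    and v >= v_{n-2} goes to disk n - 1. bisect_right counts how many
    vector entries are <= v, which is exactly that disk number.
    """
    return bisect_right(vector, v)

vector = [10, 20]  # n = 3 disks
placements = [range_disk(v, vector) for v in (5, 10, 15, 20, 99)]
```

With the vector [10, 20], values below 10 land on disk 0, values from 10 up to but not including 20 land on disk 1, and values of 20 or more land on disk 2.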

Interquery Parallelism
• Queries/transactions execute in parallel with one another.
• Increases transaction throughput; used primarily to scale up a transaction processing system to support a larger number of transactions per second.
• Easiest form of parallelism to support, particularly in a shared-memory parallel database, because even sequential database systems support concurrent processing.

Intraquery Parallelism
• Execution of a single query in parallel on multiple processors/disks; important for speeding up long-running queries.
• Two complementary forms of intraquery parallelism:
  • Intraoperation parallelism: parallelize the execution of each individual operation in the query.
  • Interoperation parallelism: execute the different operations in a query expression in parallel.
• Intraoperation parallelism scales better with increasing parallelism because the number of tuples processed by each operation is typically much larger than the number of operations in a query.
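
As a rough illustration of intraoperation parallelism, the sketch below runs a single aggregation (a sum) over a partitioned relation: each worker aggregates its own partition and the partial results are combined at the end. The names `local_sum` and `parallel_sum` are hypothetical. A thread pool is used only to show the structure; in CPython, threads will not speed up CPU-bound work, and a real parallel database would run each partition on its own processor.

```python
from concurrent.futures import ThreadPoolExecutor

def local_sum(partition):
    """Aggregate one partition independently of the others."""
    return sum(partition)

def parallel_sum(partitions):
    """Run the same operation on every partition, then combine the partials."""
    with ThreadPoolExecutor(max_workers=len(partitions)) as pool:
        partials = list(pool.map(local_sum, partitions))
    return sum(partials)

# One relation, already horizontally partitioned across three disks.
partitions = [[1, 2, 3], [4, 5], [6]]
total = parallel_sum(partitions)
```

The degree of parallelism here is bounded by the number of partitions (and so by the amount of data), not by the number of operations in the query.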

Interoperator Parallelism
• Pipelined parallelism
  • Consider a join of four relations
    • r1 ⋈ r2 ⋈ r3 ⋈ r4
  • Set up a pipeline that computes the three joins in parallel
    • Let P1 be assigned the computation of temp1 = r1 ⋈ r2
    • And P2 be assigned the computation of temp2 = temp1 ⋈ r3
    • And P3 be assigned the computation of temp2 ⋈ r4
  • Each of these operations can execute in parallel, sending the result tuples it computes to the next operation even as it is computing further results
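
The pipeline above can be imitated with Python generators: each stage hands a result tuple to the next stage as soon as it is produced, without materializing temp1 or temp2. This is a single-threaded sketch of the data flow only; in a parallel system each stage would run on its own processor (P1, P2, P3). The relations, attribute names, and the `pipelined_join` helper are hypothetical.

```python
def pipelined_join(left_stream, right_rel, key_left, key_right):
    """Hash join that consumes the left input one tuple at a time.

    The right relation is indexed up front; each left tuple is probed
    and matching results are yielded immediately to the next stage.
    """
    index = {}
    for t in right_rel:
        index.setdefault(t[key_right], []).append(t)
    for l in left_stream:
        for r in index.get(l[key_left], []):
            yield {**l, **r}

# Hypothetical relations, represented as lists of dicts.
r1 = [{"a": 1, "x": "p"}, {"a": 2, "x": "q"}]
r2 = [{"a": 1, "b": 10}, {"a": 2, "b": 20}]
r3 = [{"b": 10, "c": 100}, {"b": 20, "c": 200}]
r4 = [{"c": 100, "y": "u"}]

temp1 = pipelined_join(iter(r1), r2, "a", "a")     # stage for P1
temp2 = pipelined_join(temp1, r3, "b", "b")        # stage for P2
result = list(pipelined_join(temp2, r4, "c", "c")) # stage for P3
```

Nothing is computed until the final stage starts pulling tuples, and each tuple of r1 flows through all three joins before the next one is read.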

Independent Parallelism
• Independent parallelism
  • Consider a join of four relations
    • r1 ⋈ r2 ⋈ r3 ⋈ r4
  • Let P1 be assigned the computation of temp1 = r1 ⋈ r2
  • And P2 be assigned the computation of temp2 = r3 ⋈ r4
  • And P3 be assigned the computation of temp1 ⋈ temp2
  • P1 and P2 can work independently in parallel
  • P3 has to wait for input from P1 and P2
    • Can pipeline output of P1 and P2 to P3, combining independent parallelism and pipelined parallelism
  • Does not provide a high degree of parallelism
    • useful with a lower degree of parallelism.
    • less useful in a highly parallel system.
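
A minimal sketch of the plan above, assuming all four relations share a join attribute `k` (a simplification made here so one helper can serve all three joins). The two independent joins are submitted to a thread pool, and the final join waits for both. The `hash_join` helper and the sample data are hypothetical, and threads stand in for the processors P1 and P2.

```python
from concurrent.futures import ThreadPoolExecutor

def hash_join(left, right, key):
    """Simple in-memory hash join on a shared attribute."""
    index = {}
    for t in right:
        index.setdefault(t[key], []).append(t)
    return [{**l, **r} for l in left for r in index.get(l[key], [])]

r1 = [{"k": 1, "a": "x"}, {"k": 2, "a": "y"}]
r2 = [{"k": 1, "b": "p"}, {"k": 2, "b": "q"}]
r3 = [{"k": 1, "c": "m"}]
r4 = [{"k": 1, "d": "n"}, {"k": 3, "d": "o"}]

with ThreadPoolExecutor(max_workers=2) as pool:
    f1 = pool.submit(hash_join, r1, r2, "k")  # P1: temp1 = r1 join r2
    f2 = pool.submit(hash_join, r3, r4, "k")  # P2: temp2 = r3 join r4
    temp1, temp2 = f1.result(), f2.result()   # P3 must wait for both

result = hash_join(temp1, temp2, "k")         # P3: temp1 join temp2
```

Only two tasks can ever run at the same time in this plan, which is why independent parallelism alone gives a low degree of parallelism.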

Design of Parallel Systems
Some issues in the design of parallel systems:
• Parallel loading of data from external sources is needed in order to handle large volumes of incoming data.
• Resilience to failure of some processors or disks.
  • The probability of some disk or processor failing is higher in a parallel system.
  • Operation (perhaps with degraded performance) should be possible in spite of failure.
  • Redundancy is achieved by storing an extra copy of every data item at another processor.
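
Two of the points above can be made concrete with a small sketch. The first function shows why failure is more likely in a parallel system, under the assumption (not stated in the slides) that components fail independently with the same probability p. The second shows one simple way to place the extra copy: on the next processor in sequence. That placement rule is only an illustrative choice, and both function names are hypothetical.

```python
def prob_any_failure(p, n):
    """Probability that at least one of n components fails.

    Assumes independent failures, each with probability p.
    """
    return 1 - (1 - p) ** n

def replica_placement(item_id, n):
    """Return (primary, backup) processor numbers for a data item.

    Assumption: the backup copy is stored on the next processor,
    wrapping around, so the two copies differ whenever n >= 2.
    """
    primary = item_id % n
    backup = (primary + 1) % n
    return primary, backup

single = prob_any_failure(0.01, 1)     # one disk
cluster = prob_any_failure(0.01, 100)  # one hundred disks
placement = replica_placement(7, 4)
```

With p = 0.01, a single component fails with probability 0.01, while a system of 100 such components sees at least one failure with probability above 0.6, which is why the extra copy matters.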
End of Chapter
