
Chapter 21

Parallel Databases

Solutions to Practice Exercises


21.1 If there are few tuples in the queried range, then each query can be processed
quickly on a single disk. Different queries can then execute in parallel on
different disks, without the overhead of initiating each query on multiple disks.
On the other hand, if there are many tuples in the queried range, each query
takes a long time to execute, as there is no parallelism within its execution. Also,
some of the disks can become hot-spots, further increasing response time.
Hybrid range partitioning, in which small ranges (a few blocks each) are
partitioned in a round-robin fashion, provides the benefits of range partitioning
without its drawbacks.
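The scheme can be sketched in a few lines; the function name, block size, and
modular round-robin rule below are illustrative assumptions, since the exercise
does not fix a concrete layout.

```python
def hybrid_range_partition(sorted_keys, keys_per_block, n_disks):
    """Split a sorted key sequence into small range blocks and assign
    the blocks to disks in round-robin fashion (hybrid range partitioning)."""
    blocks = [sorted_keys[i:i + keys_per_block]
              for i in range(0, len(sorted_keys), keys_per_block)]
    placement = {d: [] for d in range(n_disks)}
    for b, block in enumerate(blocks):
        # consecutive ranges land on different disks, so a small range query
        # touches one disk while a large one is spread over all disks
        placement[b % n_disks].append((block[0], block[-1]))
    return placement

layout = hybrid_range_partition(list(range(1, 25)), keys_per_block=4, n_disks=3)
# disk 0 holds ranges (1, 4) and (13, 16); disk 1 holds (5, 8) and (17, 20); ...
```

A query over the narrow range 1-4 hits only disk 0, while a query over 1-24
is spread evenly over all three disks.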
21.2 a. When there are many small queries, inter-query parallelism gives good
throughput. Parallelizing each of these small queries would increase the
initiation overhead, without any significant reduction in response time.
b. With a few large queries, intra-query parallelism is essential to get fast
response times. Given that there are a large number of processors and disks,
only intra-operation parallelism can take advantage of the parallel hardware,
for queries typically have few operations, but each one needs to
process a large number of tuples.
21.3 a. The speed-up obtained by parallelizing the operations would be offset by
the data transfer overhead, as each tuple produced by an operator would
have to be transferred to its consumer, which is running on a different
processor.
b. In a shared-memory architecture, transferring the tuples is very efficient.
So the above argument does not hold to any significant degree.


c. Even if two operations are independent, it may be that they both supply
their outputs to a common third operator. In that case, running all three on
the same processor may be better than transferring tuples across
processors.
21.4 Relation r is partitioned into n partitions, r0, r1, ..., rn−1, and s is also
partitioned into n partitions, s0, s1, ..., sn−1. The partitions are replicated and
assigned to processors as shown below, where processor Pi,j joins ri with sj.

          s0        s1        s2        s3       ...      sn−1

r0       P0,0      P0,1
r1       P1,0      P1,1      P1,2
r2                 P2,1      P2,2      P2,3
 .                            .         .         .
 .                            .         .         .
rn−1                                         Pn−1,n−2   Pn−1,n−1
Each fragment is replicated on at most 3 processors, unlike in the general case,
where it is replicated on n processors. The number of processors required is
now approximately 3n, instead of n^2 in the general case. Therefore, given the
same number of processors, we can partition the relations into more fragments
with this optimization, thus making each local join faster.
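The banded assignment above can be verified with a short sketch (the function
name and the band width of 1 are assumptions matching the diagram):

```python
def banded_assignment(n):
    """Processors P(i, j) used when each fragment is replicated only on
    the diagonal band |i - j| <= 1, as in the diagram above."""
    return [(i, j) for i in range(n) for j in range(n) if abs(i - j) <= 1]

procs = banded_assignment(10)
# 3n - 2 processors instead of n^2: 28 rather than 100 for n = 10
assert len(procs) == 3 * 10 - 2
# each fragment r_i appears on at most 3 processors
for i in range(10):
    assert sum(1 for (a, _) in procs if a == i) <= 3
```

The exact count is 3n − 2 (the corner rows have only two processors), which
is approximately 3n as stated above.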
21.5 a. A partitioning vector which gives 5 partitions with 20 tuples in each
partition is: [21, 31, 51, 76]. The 5 partitions obtained are 1-20, 21-30, 31-50,
51-75 and 76-100. The assumption made in arriving at this partitioning
vector is that within a histogram range, each value is equally likely.
b. Let the histogram ranges be called h1, h2, ..., hh, and the partitions
p1, p2, ..., pp. Let the frequencies of the histogram ranges be n1, n2, ..., nh.
Each partition should contain N/p tuples, where N = Σ(i=1 to h) ni.
To construct the load-balanced partitioning vector, we need to determine
the value of the k1-th tuple, the value of the k2-th tuple, and so on, where
k1 = N/p, k2 = 2N/p, etc., until kp−1. The partitioning vector will then be
[k1, k2, ..., kp−1]. The value of the ki-th tuple is determined as follows. First
determine the histogram range hj in which it falls. Assuming all values in
a range are equally likely, the ki-th value will be

      sj + (ej − sj) ∗ (kij / nj)

where
      sj : the first value in hj
      ej : the last value in hj
      kij : ki − Σ(l=1 to j−1) nl
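The construction in part (b) can be sketched as follows. The example histogram
at the end is hypothetical, since the exercise's actual histogram is not reproduced
here.

```python
def partitioning_vector(ranges, freqs, p):
    """Build a load-balanced partitioning vector from a histogram.

    ranges: list of (s_j, e_j) value bounds of each histogram range h_j
    freqs:  list of tuple counts n_j for each range
    p:      desired number of partitions
    Assumes values are uniformly distributed within each range.
    """
    N = sum(freqs)
    vector = []
    for i in range(1, p):
        k_i = i * N / p                      # position of the k_i-th tuple
        seen = 0                             # tuples in ranges before h_j
        for (s_j, e_j), n_j in zip(ranges, freqs):
            if seen + n_j >= k_i:            # the k_i-th tuple falls in h_j
                k_ij = k_i - seen
                vector.append(s_j + (e_j - s_j) * k_ij / n_j)
                break
            seen += n_j
    return vector

# hypothetical skewed histogram: 80 tuples in 0-40, 20 tuples in 40-100
vec = partitioning_vector([(0, 40), (40, 100)], [80, 20], p=5)
# → [10.0, 20.0, 30.0, 40.0]: the dense range 0-40 is split finely
```

Note how the four dividers all fall inside the dense range, so each of the five
partitions receives exactly 20 tuples.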

21.6 a. The copies of the data items at a processor should be partitioned across
multiple other processors, rather than stored on a single processor, for the
following reasons:
• To better distribute the work that should have been done by the failed
processor among the remaining processors.
• Even when there is no failure, this technique can to some extent deal
with hot-spots created by read-only transactions.
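One way to realize the placement in part (a) is to scatter a processor's backup
copies round-robin over all the remaining processors; this is a sketch, and the
modular assignment rule is an illustrative assumption.

```python
def backup_placement(n_items, owner, n_procs):
    """Spread backup copies of the owner's data items across all other
    processors round-robin, so that if the owner fails, its workload is
    shared by every survivor rather than absorbed by one processor."""
    others = [p for p in range(n_procs) if p != owner]
    return {item: others[item % len(others)] for item in range(n_items)}

placement = backup_placement(n_items=9, owner=0, n_procs=4)
# the 9 items are spread over processors 1, 2 and 3, three items each
```

If processor 0 fails, each surviving processor takes over only a third of its
work; read-only transactions can likewise be routed to the copies to relieve a
hot-spot.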
b. RAID level 1 itself stores an extra copy of each data item (mirroring). Thus
this is similar to mirroring performed by the database itself, except that the
database system does not have to bother about the details of performing
the mirroring. It just issues the write to the RAID system, which
automatically performs the mirroring.
RAID level 5 is less expensive than mirroring in terms of disk-space
requirements, but writes are more expensive, and rebuilding a crashed disk
is more expensive.
