0% found this document useful (0 votes)

12 views4 pages

Unit-2 - Query Processing in Distributed DBMS

Uploaded by

dineshprj9

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views4 pages

Unit-2 - Query Processing in Distributed DBMS

Uploaded by

dineshprj9

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Query Processing in Distributed DBMS


 Query processing in a distributed database management system
requires the transmission of data between the computers in a
network. A distribution strategy for a query is the ordering of data
transmissions and local data processing in a database system.
 Generally, a query in Distributed DBMS requires data from
multiple sites, and this need for data from different sites is called
the transmission of data that causes communication costs.
 Query processing in DBMS is different from query processing in
centralized DBMS due to the communication cost of data transfer
over the network.

The process used to retrieve data from a database is called query

processing.
Several processes are involved in query processing to retrieve data from
the database. The actions to be taken are:

 Costs (Transfer of data) of Distributed Query processing

 Using Semi join in Distributed Query processing

Costs (Transfer of Data) of Distributed Query Processing

In Distributed Query processing, the data transfer cost of distributed
query processing means the cost of transferring intermediate files to
other sites for processing and therefore the cost of transferring the
ultimate result files to the location where that result is required.

A user sends a query to site S1, which requires data from its own and
also from another site S2. Now, there are three strategies to process this
query which are given below:

1. We can transfer the data from S2 to S1 and then process the query
2. We can transfer the data from S1 to S2 and then process the query
3. We can transfer the data from S1 and S2 to S3 and then process the
query. So the choice depends on various factors like the size of
relations and the results, the communication cost between different
sites, and at which the site result will be utilized.

Commonly, the data transfer cost is calculated in terms of the size of

the messages. By using the below formula, we can calculate the data
transfer cost:

Data transfer cost = C * Size

Where C refers to the cost per byte of data transferring and Size is the
no. of bytes transmitted.

Example: Consider the following table EMPLOYEE and

DEPARTMENT.
Site1: EMPLOYEE
EID NAME SALARY DID
EID- 10 bytes
SALARY- 20 bytes
DID- 10 bytes
Name- 20 bytes
Total records- 1000
Record Size- 60 bytes

Site2: DEPARTMENT
DID DNAME
DID- 10 bytes
DName- 20 bytes
Total records- 50
Record Size- 30 bytes
Example:
1. Find the name of employees and their department names.
Also, find the amount of data transfer to execute this query when
the query is submitted to Site 3.

Answer: Considering the query is submitted at site 3 and neither of the

two relations is an EMPLOYEE and the DEPARTMENT not available
at site 3. So, to execute this query, we have three strategies:

 Transfer both the tables that are EMPLOYEE and

DEPARTMENT at SITE 3 then join the tables there. The total
cost in this is 1000 * 60 + 50 * 30 = 60,000 + 1500 =
61500 bytes.
 Transfer the table EMPLOYEE to SITE 2, join the table at SITE
2 and then transfer the result at SITE 3. The total cost in this
is 60 * 1000 + 60 * 1000 = 120000 bytes since we have to
transfer 1000 tuples having NAME and DNAME from site 1,
 Transfer the table DEPARTMENT to SITE 1, join the table at
SITE 2 join the table at site1 and then transfer the result at site3.
The total cost is 30 * 50 + 60 * 1000 = 61500 bytes since we
have to transfer 1000 tuples having NAME and DNAME from
site 1 to site 3 which is 60 bytes each.

Using Semi-Join in Distributed Query Processing

 The semi-join operation is used in distributed query processing to

reduce the number of tuples in a table before transmitting it to
another site.
 This reduction in the number of tuples reduces the number and
the total size of the transmission ultimately reducing the total cost
of data transfer.

Let’s say that we have two tables R1, R2 on Site S1, and S2. Now, we
will forward the joining column of one table say R1 to the site where
the other table say R2 is located.
This column is joined with R2 at that site.
The decision whether to reduce R1 or R2 can only be made after
comparing the advantages of reducing R1 with that of reducing R2.
Thus, semi-join is a well-organized solution to reduce the transfer of
data in distributed query processing.

Example: Find the amount of data transferred to execute the

same query given in the above example using a semi-join
operation.

Answer: The following strategy can be used to execute the query.

 Select all (or Project) the attributes of the EMPLOYEE table at site
1 and then transfer them to site 3. For this, we will transfer NAME,
DID(EMPLOYEE) and the size is 30 * 1000 = 30000 bytes.
 Transfer the table DEPARTMENT to site 3 and join the projected
attributes of EMPLOYEE with this table. The size of the
DEPARTMENT table is 30 * 50 = 1500

Applying the above scheme, the amount of data transferred to execute
the query will be 30000 + 1500 = 31500 bytes.

Advanced Database Systems: Chapter 3:query Processing and Evaluation
100% (1)
Advanced Database Systems: Chapter 3:query Processing and Evaluation
36 pages
5 1 GL GL SUL 00004 WellWizard Server 7.0 Manual
No ratings yet
5 1 GL GL SUL 00004 WellWizard Server 7.0 Manual
264 pages
ADBMS Notes
67% (3)
ADBMS Notes
48 pages
Vu Lec 30
No ratings yet
Vu Lec 30
28 pages
8 Query Optimization
No ratings yet
8 Query Optimization
53 pages
Implications of A Distributed Environment Part 2
No ratings yet
Implications of A Distributed Environment Part 2
38 pages
Final DBMS Unit 7
No ratings yet
Final DBMS Unit 7
48 pages
Query
No ratings yet
Query
13 pages
Interview Informatica-2
No ratings yet
Interview Informatica-2
90 pages
What Is Centralized Database?
No ratings yet
What Is Centralized Database?
8 pages
Data Communication Basics CH 7
No ratings yet
Data Communication Basics CH 7
27 pages
6 - Query Processing Updated
No ratings yet
6 - Query Processing Updated
24 pages
Ch6-Introduction To Distributed Database
No ratings yet
Ch6-Introduction To Distributed Database
22 pages
CSE 453 Slide 3
No ratings yet
CSE 453 Slide 3
72 pages
Distributed Database Management Notes - 3
86% (7)
Distributed Database Management Notes - 3
48 pages
Principles of Clinic Sabir Multani
No ratings yet
Principles of Clinic Sabir Multani
136 pages
Vu Lec 35
No ratings yet
Vu Lec 35
42 pages
08 Query Processing Strategies and Optimization
No ratings yet
08 Query Processing Strategies and Optimization
32 pages
Distributed Database Distributed Database Management Systems Management Systems
No ratings yet
Distributed Database Distributed Database Management Systems Management Systems
33 pages
7-Distributed DB
No ratings yet
7-Distributed DB
37 pages
Query Optimization
No ratings yet
Query Optimization
29 pages
CHS 8 - 3rd Q - Module 3
No ratings yet
CHS 8 - 3rd Q - Module 3
12 pages
17 Query Processing PDF
No ratings yet
17 Query Processing PDF
23 pages
Chapter 5
No ratings yet
Chapter 5
45 pages
Queryoptimization Examples
No ratings yet
Queryoptimization Examples
26 pages
DDBMS-Chapter-4-SE-LectureNote (Version 1)
No ratings yet
DDBMS-Chapter-4-SE-LectureNote (Version 1)
11 pages
CH 13 Updated
No ratings yet
CH 13 Updated
30 pages
7 Distributed DB
No ratings yet
7 Distributed DB
38 pages
Database Tuning: Database Tuning Describes A Group of Activities Used To Optimize and Homogenize The
No ratings yet
Database Tuning: Database Tuning Describes A Group of Activities Used To Optimize and Homogenize The
20 pages
Query Processing in Distributed Database
No ratings yet
Query Processing in Distributed Database
20 pages
Chapter 7 - Distributed Database System
No ratings yet
Chapter 7 - Distributed Database System
27 pages
Ddbms
No ratings yet
Ddbms
10 pages
UNIT 4 Query Processing and Different Types of Databases
No ratings yet
UNIT 4 Query Processing and Different Types of Databases
13 pages
Rahul Chugh Adbms Asiignment 2
No ratings yet
Rahul Chugh Adbms Asiignment 2
10 pages
Query Processing
No ratings yet
Query Processing
39 pages
Query Processing
No ratings yet
Query Processing
6 pages
DBMS
No ratings yet
DBMS
24 pages
Advanced Database
No ratings yet
Advanced Database
47 pages
DBMS Ip
No ratings yet
DBMS Ip
10 pages
QueryProcessing Lect 3
No ratings yet
QueryProcessing Lect 3
26 pages
ch6 Distributed Database
No ratings yet
ch6 Distributed Database
35 pages
Ite1003 Database-Management-Systems Eth 1.0 37 Ite1003 21
0% (1)
Ite1003 Database-Management-Systems Eth 1.0 37 Ite1003 21
5 pages
IJCA Joins Semi Joins
No ratings yet
IJCA Joins Semi Joins
5 pages
UT 1 QB Solution
No ratings yet
UT 1 QB Solution
4 pages
Subjects Offered: 1ST Trimester, Academic Year 2015-2016
No ratings yet
Subjects Offered: 1ST Trimester, Academic Year 2015-2016
22 pages
Advance Database Management System: Unit - 2 .Query Processing and Optimization
No ratings yet
Advance Database Management System: Unit - 2 .Query Processing and Optimization
38 pages
Distributed Query Processing Using Different Semijoin Operations
No ratings yet
Distributed Query Processing Using Different Semijoin Operations
26 pages
CE3155 Introduction To ETABS (Multi-Storey)
100% (1)
CE3155 Introduction To ETABS (Multi-Storey)
42 pages
Introduction To Database Management Systems CS470
No ratings yet
Introduction To Database Management Systems CS470
11 pages
DBMS Unit 4
No ratings yet
DBMS Unit 4
9 pages
DDBS Unit 2
No ratings yet
DDBS Unit 2
7 pages
Distributed Databases
No ratings yet
Distributed Databases
58 pages
10 DistQueryOptimization
No ratings yet
10 DistQueryOptimization
14 pages
Olap Exp05
No ratings yet
Olap Exp05
10 pages
Dahua SPP Price List Q2 - 2024 - June - 17062024
No ratings yet
Dahua SPP Price List Q2 - 2024 - June - 17062024
23 pages
Consider An Example Relation 1. Create 3 Fragments of Approximately Equal Size by Horizontal Fragmentation. Describe The
No ratings yet
Consider An Example Relation 1. Create 3 Fragments of Approximately Equal Size by Horizontal Fragmentation. Describe The
4 pages
DDS Unit - 2
No ratings yet
DDS Unit - 2
7 pages
Ericsson Nokia Mapping
No ratings yet
Ericsson Nokia Mapping
90 pages
Ad Bms Notes
No ratings yet
Ad Bms Notes
44 pages
Sample Questions On QUIZ-3
No ratings yet
Sample Questions On QUIZ-3
3 pages
Efficient Join On DBMS
No ratings yet
Efficient Join On DBMS
3 pages
Adbmsasign
No ratings yet
Adbmsasign
3 pages
RD TH
No ratings yet
RD TH
4 pages
4 6028372524222383733
No ratings yet
4 6028372524222383733
11 pages
Standard Operating Procedure Template
71% (7)
Standard Operating Procedure Template
12 pages
Bank Statement
No ratings yet
Bank Statement
7 pages
EN 10305-2 E195 E235 E355 Welded Cold Drawn Precision Steel Tubes
No ratings yet
EN 10305-2 E195 E235 E355 Welded Cold Drawn Precision Steel Tubes
5 pages
Plesk
No ratings yet
Plesk
16 pages
Draft 4 - Types of Merchant Fraud For PAs
No ratings yet
Draft 4 - Types of Merchant Fraud For PAs
3 pages
Integrated Motion On Ethernet - RM
No ratings yet
Integrated Motion On Ethernet - RM
432 pages
Data Analytic Application in Mof - Why and How To
100% (1)
Data Analytic Application in Mof - Why and How To
46 pages
AssessmentCenterReport 163
No ratings yet
AssessmentCenterReport 163
31 pages
Design Approaches and Performance Analysis of Electric Vehicle Using MATLAB Simulink
No ratings yet
Design Approaches and Performance Analysis of Electric Vehicle Using MATLAB Simulink
6 pages
IIT Ropar UG Handbook 2021 15.9.21
No ratings yet
IIT Ropar UG Handbook 2021 15.9.21
62 pages
Viessmann Vitodens 050 W Service Instructions
No ratings yet
Viessmann Vitodens 050 W Service Instructions
100 pages
Data Mining Concepts & Techniques
No ratings yet
Data Mining Concepts & Techniques
147 pages
RUIDE Disteo 23 USER MANUAL 180313 (A5)
No ratings yet
RUIDE Disteo 23 USER MANUAL 180313 (A5)
25 pages
The Transformation of Historical Research in The Digital Age
No ratings yet
The Transformation of Historical Research in The Digital Age
86 pages
m07500362 XXXXXXXX 0en
No ratings yet
m07500362 XXXXXXXX 0en
296 pages
Sheet Metal Forming: MIT 2.008x
No ratings yet
Sheet Metal Forming: MIT 2.008x
48 pages
LTE - FMA Version Update Log
No ratings yet
LTE - FMA Version Update Log
15 pages
Contact The Social Security Administration
No ratings yet
Contact The Social Security Administration
10 pages
Idt Test-B
No ratings yet
Idt Test-B
2 pages
Mini-Lesson 6 Social Responsibility and Empathy Mini-Lesson
No ratings yet
Mini-Lesson 6 Social Responsibility and Empathy Mini-Lesson
3 pages
E5111E
No ratings yet
E5111E
3 pages
Circular On Enhancements To QFM Framework PDF
No ratings yet
Circular On Enhancements To QFM Framework PDF
3 pages
Virtual Reality in The Army - Kashikasingh - A2305220439
No ratings yet
Virtual Reality in The Army - Kashikasingh - A2305220439
10 pages
AP Computer Science Principles: Student-Crafted Practice Tests For Excellence
From Everand
AP Computer Science Principles: Student-Crafted Practice Tests For Excellence
Sama Alshatali
No ratings yet
Dial Plan and Call Routing Demystified On Cisco Collaboration Technologies: Cisco Unified Communication Manager
From Everand
Dial Plan and Call Routing Demystified On Cisco Collaboration Technologies: Cisco Unified Communication Manager
Redouane MEDDANE
No ratings yet
Apache Cassandra Administrator Associate - Exam Practice Tests
From Everand
Apache Cassandra Administrator Associate - Exam Practice Tests
Cristian Scutaru
No ratings yet

Unit-2 - Query Processing in Distributed DBMS

Uploaded by

Unit-2 - Query Processing in Distributed DBMS

Uploaded by

Query Processing in Distributed DBMS

The process used to retrieve data from a database is called query

 Costs (Transfer of data) of Distributed Query processing

Costs (Transfer of Data) of Distributed Query Processing

Commonly, the data transfer cost is calculated in terms of the size of

Data transfer cost = C * Size

Example: Consider the following table EMPLOYEE and

Answer: Considering the query is submitted at site 3 and neither of the

 Transfer both the tables that are EMPLOYEE and

Using Semi-Join in Distributed Query Processing

 The semi-join operation is used in distributed query processing to

Example: Find the amount of data transferred to execute the

Answer: The following strategy can be used to execute the query.

You might also like