Setting The Degree of Parallelism: Figure C-4

The document discusses hash joins in Oracle database. It explains that a hash join partitions relations into buckets using a hash function on the join attribute. The smaller relation is partitioned first in the build phase, then the larger relation is probed to find matching tuples based on the join attribute. This reduces the number of comparisons needed compared to a nested loop join. The document provides an example of a hash join between Employee and Department tables to further illustrate the process.

Uploaded by Gaurav Mandke

After determining the partitioning requirement for each operation in the

execution plan, the query coordinator determines the order in which the
operations must be performed. With this information, the query coordinator
determines the data flow of the statement. Figure C-4 illustrates the data flow of
the following query:
SELECT dname, MAX(sal), AVG(sal)
FROM emp, dept
WHERE emp.deptno = dept.deptno
GROUP BY dname;

Setting the Degree of Parallelism


The parallel execution coordinator may enlist two or more of the instance's parallel
execution servers to process a SQL statement. The number of parallel execution
servers associated with a single operation is known as the degree of parallelism.

The degree of parallelism is specified in the following ways:

 At the statement level


o With hints
o With the PARALLEL clause
 At the table level in the table's definition
 At the index level in the index's definition
 By default based on the number of CPUs

The following example shows a statement that sets the degree of parallelism to 4 on a
table:
ALTER TABLE emp PARALLEL 4;

This next example sets the degree of parallelism on an index (with no number given, the default degree is used):


ALTER INDEX iemp PARALLEL;

This last example uses a hint to set the degree of parallelism to 4 for a query:


SELECT /*+ PARALLEL(emp,4) */ COUNT(*) FROM emp;

Operations That Can Be Parallelized

The Oracle server can use parallel execution for any of these operations:

 Table scan
 Nested loop join
 Sort merge join
 Hash join
 "Not in"
 Group by
 Select distinct
 Union and union all
 Aggregation
 PL/SQL functions called from SQL
 Order by
 Create table as select
 Create index
 Rebuild index
 Rebuild index partition
 Move partition
 Split partition
 Update
 Delete
 Insert ... select
 Enable constraint (the table scan is parallelized)
 Star transformation
 Cube
 Rollup

Hash Join
There are several variants of hash joins, such as the Simple Hash Join, Partitioned Hash Join, and Hybrid Hash Join.

Simple Hash Join:


Input: The tables to be joined, say r and s, and the joining attributes r.A and s.B of tables r and s.
Output: The joined relation.

Steps:
1. Choose a hash function whose result maps each join-attribute value to one of an identified range of values (or
buckets, say bucket 0, bucket 1, …, bucket 9 for example).

2. Identify the smaller of the two relations to be joined, say r and s.

3. Partition the smaller relation, say r, into the identified buckets using the join attribute of r. That is, r is partitioned
into the available buckets as r0, r1, r2, …, r9. This step is called the Building Phase (sometimes the Partitioning Phase).

4. Partition the relation s into the identified buckets using the join attribute of s. That is, s is partitioned into the
available buckets as s0, s1, s2, …, s9. This step is called the Probing Phase, as the records of s probe for
matching records in the appropriate buckets.

5. Finally, the results are collected from the different buckets and returned as the join result.
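The five steps above can be sketched as a short Python program. This is only an illustrative, in-memory version; the table and attribute names follow the Employee/Department example used in this document, and the bucket count of 3 is an arbitrary choice.

```python
def simple_hash_join(build_rows, probe_rows, build_key, probe_key, n_buckets=3):
    """Join two lists of dict rows on build_key = probe_key."""
    h = lambda v: v % n_buckets                     # step 1: hash function

    # Step 3 (building phase): partition the smaller relation into buckets.
    buckets = [[] for _ in range(n_buckets)]
    for row in build_rows:
        buckets[h(row[build_key])].append(row)

    # Step 4 (probing phase): each probe row searches only its own bucket.
    result = []
    for row in probe_rows:
        for match in buckets[h(row[probe_key])]:
            if match[build_key] == row[probe_key]:  # confirm equality
                result.append({**match, **row})
    return result

dept = [{"DeptNo": 1, "DName": "Design"},
        {"DeptNo": 2, "DName": "Production"},
        {"DeptNo": 3, "DName": "Administration"}]
emp = [{"EmpID": "E101", "EName": "Kumar", "DeptNo": 2},
       {"EmpID": "E110", "EName": "Morgan", "DeptNo": 3}]

joined = simple_hash_join(dept, emp, "DeptNo", "DeptNo")
```

Note that the probe still confirms equality inside the bucket, since different join values can hash to the same bucket.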

How does simple Hash join work?


Example:
Consider the following two table instances for tables Employee(EmpID, EName, DeptNo) and Department(DeptNo, DName, DLocation):
EmpID EName DeptNo
E101 Kumar 2
E103 Wess 1
E107 Terry 1
E102 Gowri 2
E110 Morgan 3
E111 Sachin 3
Table 1 - Employee
DeptNo DName          DLocation
1      Design         Chennai
2      Production     Chennai
3      Administration Bangalore
Table 2 – Department

Step 1: Hash function looks like the one given below;


h(Hash_key) = Hash_key_value mod n

where n is the number of partitions (or buckets). Let us choose the DeptNo attribute as hash key and partition the relation
into 3 partitions such as 0, 1, and 2. Then, our hash function will look like as follows;
h(DeptNo) = DeptNo mod 3

Step 2: The Department table is the smaller table.

Step 3: Partition Department using the hash function. The first record of Department will go into Bucket 1, second record into
Bucket 2, and third record into Bucket 0.

DeptNo DName          DLocation  Bucket (DeptNo mod 3)
1      Design         Chennai    1
2      Production     Chennai    2
3      Administration Bangalore  0
Figure 1 - Hashing of Department table

Step 4: Now partition the larger relation Employee using the same hash function. That is use the following hash function;
h(DeptNo) = DeptNo mod 3
The figure given below shows the partitioning of Employee table into 3 buckets.

EmpID EName  DeptNo  Bucket (DeptNo mod 3)
E101  Kumar  2       2
E103  Wess   1       1
E107  Terry  1       1
E102  Gowri  2       2
E110  Morgan 3       0
E111  Sachin 3       0

Figure 2 - Hashing of Employee table

After successful partitioning, our hash buckets will look like the figure given below; in this figure, the first table shows the 0th partition
of the Department table, and the second one shows the 0th partition of the Employee table.
Figure 3 - Bucket status after hashing of all records of the tables to be joined

Carefully observe the data stored in every bucket. Bucket 0 stores the records of the zeroth partitions of both tables, where the
joining attribute DeptNo has the same value, i.e., 3. The same holds for the other partitions. Hence, at this stage the
join is trivial. This is why this phase of the joining technique is named the Probing phase: the records of the probe relation search for
the matching joining attribute values of the other relation and get joined.

The major advantage is the reduced number of comparisons. If we join with a conventional join technique, the comparison requires
6 records X 3 records = 18 comparisons
When we use a hash join, we need only
(1 record X 2 records in Bucket 0) + (1 X 2 in Bucket 1) + (1 X 2 in Bucket 2) = 6 comparisons

Points to note:
1. Only Equi-joins or Natural Joins can be performed using a hash join.
2. The simple hash join assumes that one of the relations is small, i.e., that the whole relation can fit into
memory.
3. The smaller relation is chosen for the building phase.

4. Only a single pass through each table is required; that is, each relation is scanned exactly once.

5. The hash function should be chosen so that it does not cause skew.

Hash Join Optimization


Beginning with MySQL 8.0.18, MySQL employs a hash join for any query for which each join has an equi-join
condition and uses no indexes, such as this one:

SELECT *
FROM t1
JOIN t2
ON t1.c1=t2.c1;

The hash join is used for queries involving multiple joins as well, as long as at least one join condition for each pair of
tables is an equi-join, like the query shown here:

SELECT *
FROM t1
JOIN t2
ON (t1.c1 = t2.c1 AND t1.c2 < t2.c2)
JOIN t3
ON (t2.c1 = t3.c1);
A hash join cannot be used if any pair of joined tables does not have at least one equi-join condition, as can be seen
here:

mysql> EXPLAIN FORMAT=TREE


-> SELECT *
-> FROM t1
-> JOIN t2
-> ON (t1.c1 = t2.c1)
-> JOIN t3
-> ON (t2.c1 < t3.c1)\G
*************************** 1. row ***************************
EXPLAIN: <not executable by iterator executor>

Build – first input, used to build a hash table.


Probe – second input used to check on a hash table.
Hash Table – an array of slots.
Hash Bucket – a linked list anchored to a slot.
Partition – a group of buckets.
Hash – the hash function applied to the joining value.
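A hypothetical sketch tying these terms together, with Python lists standing in for the linked-list buckets; the slot count and row shapes here are invented for illustration:

```python
N_SLOTS = 4
hash_table = [[] for _ in range(N_SLOTS)]   # Hash Table: an array of slots

def hash_fn(join_value):                    # Hash: applied to the joining value
    return hash(join_value) % N_SLOTS

def build(rows, key):                       # Build input fills the table
    for row in rows:
        # Hash Bucket: the list anchored to the slot for this hash value
        hash_table[hash_fn(row[key])].append(row)

def probe(row, key):                        # Probe input checks one slot
    return [r for r in hash_table[hash_fn(row[key])] if r[key] == row[key]]

build([{"c1": 1}, {"c1": 2}, {"c1": 5}], "c1")
matches = probe({"c1": 5}, "c1")
```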

Producer or Consumer Operations


Operations that require the output of other operations are known as consumer operations. In Figure 8-1,
the GROUP BY SORT operation is the consumer of the HASH JOIN operation because GROUP BY SORT requires
the HASH JOIN output.

Consumer operations can begin consuming rows as soon as the producer operations have produced rows.
In Example 8-2, while the parallel execution servers are producing rows in the FULL SCAN of the sales table, another
set of parallel execution servers can begin to perform the HASH JOIN operation to consume the rows.

Introduction to Nested Loop Joins

In a nutshell, the Nested Loop Join uses one joining table as the outer input table and the
other as the inner input table. The Nested Loop Join gets a row from the outer table and searches
for matching rows in the inner table; this process continues until all rows of the outer table have been
probed against the inner table.
It is typically used where the table sizes are very small.
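The description above can be sketched as follows (an illustrative Python version; the row and column names are invented for the example):

```python
def nested_loop_join(outer, inner, key):
    result = []
    for o in outer:                 # get a row from the outer input table
        for i in inner:             # scan the inner table for each outer row
            if o[key] == i[key]:
                result.append({**o, **i})
    return result

rows = nested_loop_join([{"deptno": 1, "dname": "Design"}],
                        [{"deptno": 1, "ename": "Wess"},
                         {"deptno": 2, "ename": "Kumar"}],
                        "deptno")
```

The inner table is scanned once per outer row, which is why this plan suits small tables (or an indexed inner table, which replaces the inner scan with an index lookup).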

hash join vs. nested loops join

The major difference between a hash join and a nested loops join is the use of a full-
table scan with the hash join.

We may see the physical join implementations with names like nested loops, sort
merge and hash join.

 Hash joins - In a hash join, the Oracle database does a full-scan of the driving table,
builds a RAM hash table, and then probes for matching rows in the other table.  For
certain types of SQL, the hash join will execute faster than a nested loop join, but
the hash join uses more RAM resources. 
 Nested loops join - The nested loops table join is one of the original table join plans
and it remains the most common. In a nested loops join, we have two tables: a driving
table and a secondary table. The rows are usually accessed from a driving table index
range scan, and the driving table result set is then nested within a probe of the second
table, normally using an index range scan method.

Some queries will perform faster with NESTED LOOPS joins, some with HASH joins, while others
favor sort-merge joins. It is difficult to predict a priori which join technique will be fastest.

A hash join is an operation that performs a full-table scans on the smaller of the two
tables (the driving table) and then builds a hash table in RAM memory. The hash
table is then used to retrieve the rows in the larger table.

Fragment-and-Replicate Join

 Partitioning is not possible for some join conditions
o e.g., non-equijoin conditions, such as r.A > s.B.
 For such joins, where partitioning is not applicable, parallelization can
be accomplished by the fragment-and-replicate technique.
 Special case – asymmetric fragment-and-replicate:
o One of the relations, say r, is partitioned; any partitioning technique
can be used.
o The other relation, s, is replicated across all the processors.
o Processor Pi then locally computes the join of ri with all of s using
any join technique.
 General case: reduces the sizes of the relations at each processor.
o r is partitioned into n partitions, r0, r1, ..., rn-1; s is partitioned into m
partitions, s0, s1, ..., sm-1.
o Any partitioning technique may be used.
o There must be at least m * n processors.
o Label the processors P0,0, P0,1, ..., P0,m-1, P1,0, ..., Pn-1,m-1.
o Pi,j computes the join of ri with sj. In order to do so, ri is replicated to
Pi,0, Pi,1, ..., Pi,m-1, while si is replicated to P0,i, P1,i, ..., Pn-1,i.
o Any join technique can be used at each processor Pi,j.
 Both versions of fragment-and-replicate work with any join condition,
since every tuple in r can be tested with every tuple in s.
 Usually has a higher cost than partitioning, since one of the
relations (for asymmetric fragment-and-replicate) or both relations
(for general fragment-and-replicate) have to be replicated.
 Sometimes asymmetric fragment-and-replicate is preferable even
though partitioning could be used.
o E.g., say s is small and r is large, and already partitioned. It may be
cheaper to replicate s across all processors, rather than repartition r
and s on the join attributes.

Fragment and Replicate Join - a parallel joining technique

Fragment and Replicate Join


Introduction
All of us know about different types of joins in terms of the nature of the join conditions and the operators used for join. They
are Equi-join and Non-Equi Join. Equi-join is performed through checking an equality condition between different joining
attributes of different tables. Non-equi join is performed through checking an inequality condition between joining attributes.
Equi-join is of the form,
SELECT columns FROM list_of_tables WHERE table1.column = table2.column;
whereas, Non-equi join is of the form,
SELECT columns FROM list_of_tables WHERE table1.column < table2.column;
(Or, any other operators >, <>, <=, >= etc. in the place of < in the above example)

We discussed Partitioned Join in the previous post, where we partitioned the relational tables to be joined into
equal partitions and performed the join on individual partitions locally at every processor. Partitioning the relations on the
joining attribute and joining them works only for joins that involve equality conditions.
Clearly, joining the tables by partitioning will work only for Equi-joins or natural joins. For inequality joins, partitioning will not
work. Consider a join condition as given below;

r ⋈r.a > s.b s
In this non-equi join condition, the tuples (records) of r must be joined with the records of s wherever the value
of attribute r.a is greater than s.b. In other words, every record of r may join with some records of s and vice versa. That is, all
records of one relation must be compared against records of the other relation. For a clear example, see the Non-equi join
post.
What does fragment and replicate mean?
Fragment means partitioning a table either horizontally or vertically (horizontal and vertical fragmentation). Replicate means
duplicating the relation, i.e., generating identical copies of a table. This join is performed by fragmenting and replicating the
tables to be joined.

Asymmetric Fragment and Replicate Join


(How does Asymmetric Fragment and Replicate Join work?)
It is a variant of Fragment and Replicate join. It works as follows;
1. The system fragments table r into n fragments r0, r1, r2, …, rn-1, where r is one of the tables to be joined and
n represents the number of processors. Any partitioning technique (round-robin, hash, or range partitioning) could be used to
partition the relation.
2. The system replicates the other table, say s, to the n processors. That is, the system generates n copies of s and sends them to the n
processors.
3. Now we have r0 and s in processor P0, r1 and s in processor P1, r2 and s in processor P2, …, rn-1 and s in processor Pn-1.
Each processor Pi performs the join locally on ri and s.
Figure 1 given below shows the process of Asymmetric Fragment-and-Replicate join (it may not be the most appropriate example,
but it clearly shows the process);
Figure 1 - process of Asymmetric Fragment and Replicate Parallel Join

Points to Note:
1. Non-equi joins can be performed in parallel.
2. If one of the relations to be joined is already partitioned across the n processors, this technique is best suited, because we only need
to replicate the other relation.
3. Unlike in a Partitioned Join, any partitioning technique can be used.
4. If one of the relations to be joined is very small, the technique performs better.
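A minimal single-process simulation of the steps above, where Python list slots stand in for processors; the relation contents and the non-equi condition r.a > s.b are assumptions made for illustration:

```python
n = 3                                        # number of "processors" (assumed)
r = [{"a": v} for v in (1, 4, 7, 10)]
s = [{"b": v} for v in (2, 5)]

fragments = [r[i::n] for i in range(n)]      # step 1: round-robin partition r
replicas = [list(s) for _ in range(n)]       # step 2: replicate s to every Pi

def local_join(ri, si):
    # Step 3: Pi joins its fragment ri with the full copy of s using any
    # local join technique; a nested loop here, since the condition is a
    # non-equi join (r.a > s.b) that hash partitioning cannot handle.
    return [(x["a"], y["b"]) for x in ri for y in si if x["a"] > y["b"]]

result = [pair for i in range(n)
          for pair in local_join(fragments[i], replicas[i])]
```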

Fragment and Replicate Join


(How does Fragment and Replicate Join work?)
It is the general case of the Asymmetric Fragment-and-Replicate join technique. The asymmetric technique is best suited if one of
the relations to be joined is small enough to fit into memory. If both relations to be joined are large and the join is
non-equi, then we need to use the Fragment-and-Replicate Join. It works as follows;
1. The system fragments table r into m fragments r0, r1, r2, …, rm-1, and s into n fragments s0, s1, s2, …, sn-1.
Any partitioning technique (round-robin, hash, or range partitioning) could be used to partition the relations.

2. The values for m and n are chosen based on the availability of processors. That is, we need at least m*n processors to
perform the join.
3. Now we have to distribute all the partitions of r and s to the available processors. And, remember that we need to
compare every tuple of one relation with every tuple of the other relation. That is, the records of partition r0 should be
compared with all partitions of s, and the records of partition s0 should be compared with all partitions of r. This
must be done for all the partitions of r and s as mentioned above. Hence, the data distribution is done as follows;
                a. As we need m*n processors, let us assume that we have processors P0,0, P0,1, …, P0,n-1, P1,0, P1,1, …, Pm-1,n-1.
Thus, processor Pi,j performs the join of ri with sj.
                b. To ensure the comparison of every partition of r with every partition of s, we replicate ri to the
processors Pi,0, Pi,1, Pi,2, …, Pi,n-1, where 0, 1, 2, …, n-1 are the partitions of s. This replication ensures the comparison of every
ri with the complete s.
                c. To ensure the comparison of every partition of s with every partition of r, we replicate sj to the
processors P0,j, P1,j, P2,j, …, Pm-1,j, where 0, 1, 2, …, m-1 are the partitions of r. This replication ensures the comparison of every
sj with the complete r.
4. Pi,j computes the join locally to produce the join result.
Figure 2 given below shows the process of the general case Fragment-and-Replicate join (it may not be the most appropriate
example, but it clearly shows the process);

Figure 2 - process of general case Fragment-and-Replicate Join

Points to Note:

1. Asymmetric Fragment-and-Replicate join is the special case of the general Fragment-and-Replicate join where n or m is 1,
i.e., where one of the relations is not partitioned.

2. When compared to the asymmetric technique, the general Fragment-and-Replicate join reduces the size of the tables at every processor.

3. Any partitioning technique can be used, and any joining technique can be used as well.

4. The Fragment-and-Replicate technique suits both Equi-joins and Non-equi joins.

5. It involves a higher cost, since one or both relations must be replicated.
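The processor-grid distribution described in steps 3a-3c can be sketched as follows; the values of m and n are arbitrary choices for illustration:

```python
m, n = 2, 3                                   # needs m*n = 6 processors

# Step 3a: processor P(i,j) receives fragment r_i and fragment s_j.
assignment = {(i, j): (f"r{i}", f"s{j}") for i in range(m) for j in range(n)}

# Step 3b: r_i is replicated to P(i,0), ..., P(i,n-1)  (row i of the grid).
r_targets = {f"r{i}": [(i, j) for j in range(n)] for i in range(m)}

# Step 3c: s_j is replicated to P(0,j), ..., P(m-1,j)  (column j of the grid).
s_targets = {f"s{j}": [(i, j) for i in range(m)] for j in range(n)}

# Every fragment of r meets every fragment of s exactly once.
pairs = set(assignment.values())
```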


Sort Merge Joins and Parallel Query

Sort merge join

A sort merge join is used to join two independent data sources. It performs better than a
nested loop join when the volume of data in the tables is large, but in general not as well as a hash join.

It performs better than a hash join when the join condition columns are already sorted or
no sorting is required.

The sort merge operation is well suited for parallel query because a merge join
always performs full-table scans against the tables. Sort merge joins are generally
best for queries that produce very large result sets, such as daily reports and table
detail summary queries. Here we see a simple query that has been formed to
perform a sort merge using parallel query against both tables.
select /*+ use_merge(e,b) parallel(e, 4) parallel(b, 4) */
   e.ename,
   hiredate,
   b.comm
from
   emp e,
   bonus b
where
   e.ename = b.ename
;

Suppose two salespeople attend a conference and each collect over 100 business
cards from potential new customers. They now each have a pile of cards in random
order, and they want to see how many cards are duplicated in both piles. The
salespeople alphabetize their piles, and then they call off names one at a time.
Because both piles of cards have been sorted, it becomes much easier to find the
names that appear in both piles. This example describes a SORT-MERGE join.

In a SORT-MERGE join, Oracle sorts the first row source by its join columns, sorts the
second row source by its join columns, and then merges the sorted row sources
together. As matches are found, they are put into the result set. SORT-MERGE joins
can be effective when lack of data selectivity or useful indexes render a NESTED
LOOPS join inefficient, or when both of the row sources are quite large (greater than
5 percent of the blocks accessed).

However, SORT-MERGE joins can be used only for equijoins (WHERE D.deptno =
E.deptno, as opposed to WHERE D.deptno >= E.deptno). SORT-MERGE joins require
temporary segments for sorting (if SORT_AREA_SIZE or the automatic memory
parameters like MEMORY_TARGET are set too small). This can lead to extra memory
utilization and/or extra disk I/O in the temporary tablespace. Table 1
below illustrates the method of executing the query shown next when a SORT-
MERGE join is performed.
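The sorted-pile idea described above can be sketched as a two-cursor merge in Python; this is only an illustrative version, and the emp/bonus-style rows are invented:

```python
def sort_merge_join(left, right, key):
    # Sort both row sources on the join column, then merge with two cursors.
    left = sorted(left, key=lambda r: r[key])
    right = sorted(right, key=lambda r: r[key])
    i = j = 0
    result = []
    while i < len(left) and j < len(right):
        if left[i][key] < right[j][key]:
            i += 1
        elif left[i][key] > right[j][key]:
            j += 1
        else:
            # Emit all right rows matching this left key, then advance left.
            k = j
            while k < len(right) and right[k][key] == left[i][key]:
                result.append({**left[i], **right[k]})
                k += 1
            i += 1
    return result

rows = sort_merge_join([{"ename": "Allen", "hiredate": "1981"},
                        {"ename": "Ward", "hiredate": "1982"}],
                       [{"ename": "Ward", "comm": 500}],
                       "ename")
```

The merge step relies on both inputs being sorted, which is also why this method only supports equijoins: the cursors advance past a key as soon as the values stop matching.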
Pipelined Parallelism and Independent Parallelism

Interoperation Parallelism
Interoperation parallelism is about executing different operations of a query in parallel. A single query may
involve multiple operations at once. We may exploit parallelism to achieve better
performance for such queries. Consider the example query given below;
SELECT AVG(Salary) FROM Employee GROUP BY Dept_Id;
It involves two operations: grouping and aggregation. For
executing this query,
We need to group all the employee records based on the attribute Dept_Id
first.
Then, for every group, we can apply the AVG aggregate function to get the final
result.
We can use the Interoperation parallelism concept to parallelize these two operations.
[Note: Intra-operation parallelism is about executing a single operation of a query using multiple
processors in parallel]
The following are the variants by which we can achieve Interoperation
Parallelism;
1. Pipelined Parallelism
2. Independent Parallelism

1. Pipelined Parallelism
In Pipelined Parallelism, the idea is that the result produced by one operation is consumed
directly by the next operation in the pipeline. For example, consider the following operation;
r1 ⋈ r2 ⋈ r3 ⋈ r4
The above expression shows a natural join operation that joins four tables.
This operation can be pipelined as follows;
Perform temp1 ← r1 ⋈ r2 at processor P1 and send the result temp1 to processor P2 to
perform temp2 ← temp1 ⋈ r3, and send the result temp2 to processor P3 to perform
result ← temp2 ⋈ r4. The advantage is that we do not need to store the intermediate
results; instead, the result produced at one processor can be consumed
directly by the next. Hence, we would start receiving result tuples well before P1
completes the join assigned to it.
Disadvantages:
1. Pipelined parallelism is not a good choice if the degree of parallelism is high.
2. It is useful with a small number of processors.
3. Not all operations can be pipelined. For example, consider the query given in the
first section. Here, you need to finish grouping at least one department's employees before
the output can be passed to the aggregate operation at the next processor.
4. We cannot expect full speedup.
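The r1 ⋈ r2 ⋈ r3 ⋈ r4 pipeline described above can be sketched with Python generators, where each stage consumes the previous stage's tuples as they are produced; the tiny tables and the simplified streaming join are assumptions made for illustration:

```python
def join(left, right_table, key):
    """Streaming nested-loop join: yields joined tuples one at a time,
    so the next stage can consume them before this stage finishes."""
    for l in left:
        for r in right_table:
            if l[key] == r[key]:
                yield {**l, **r}

r1 = [{"k": 1, "a": "x"}]
r2 = [{"k": 1, "b": "y"}]
r3 = [{"k": 1, "c": "z"}]
r4 = [{"k": 1, "d": "w"}]

# P1's output feeds P2, whose output feeds P3; no intermediate storage.
temp1 = join(iter(r1), r2, "k")      # P1: temp1 <- r1 join r2
temp2 = join(temp1, r3, "k")         # P2: temp2 <- temp1 join r3
result = list(join(temp2, r4, "k"))  # P3: result <- temp2 join r4
```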

2. Independent Parallelism:
Operations that do not depend on each other can be executed in parallel at
different processors. This is called Independent Parallelism.
For example, in the expression r1 ⋈ r2 ⋈ r3 ⋈ r4, the portion r1 ⋈ r2 can be done on
one processor, and r3 ⋈ r4 can be performed on another. Both results can
be pipelined into a third processor to get the final result.
Disadvantages:
Does not work well in case of a high degree of parallelism.
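A minimal sketch of independent parallelism: r1 ⋈ r2 and r3 ⋈ r4 have no dependency, so two workers can compute them at the same time before a final join combines the results. Threads stand in for separate processors here, and the tiny tables are invented for illustration.

```python
from concurrent.futures import ThreadPoolExecutor

def join(left, right, key):
    return [{**l, **r} for l in left for r in right if l[key] == r[key]]

r1 = [{"k": 1, "a": 1}]
r2 = [{"k": 1, "b": 2}]
r3 = [{"k": 1, "c": 3}]
r4 = [{"k": 1, "d": 4}]

with ThreadPoolExecutor(max_workers=2) as pool:
    f12 = pool.submit(join, r1, r2, "k")    # independent operation 1
    f34 = pool.submit(join, r3, r4, "k")    # independent operation 2
    # Combine the two independent results in a final join.
    result = join(f12.result(), f34.result(), "k")
```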
