0% found this document useful (0 votes)

39 views36 pages

PostgresChina2018 刘东明 PostgreSQL并行查询

The document discusses parallel query in PostgreSQL. It covers parallel features introduced over PostgreSQL versions 9.4 to 11, including background workers, dynamic shared memory, executor nodes for parallel operations. It then discusses how parallel databases can improve performance by parallelizing queries across multiple CPUs. Finally, it explains some key parallel queries in PostgreSQL like parallel sequential scan, parallel index scan, parallel bitmap heap scan, and parallel hash join.

Uploaded by

Thoa Nhu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

39 views36 pages

PostgresChina2018 刘东明 PostgreSQL并行查询

Uploaded by

Thoa Nhu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 36

2018PostgreSQL中国技术大会

PostgreSQL Parallel Query

Liu Dongming
[email protected]
Alibaba Cloud Database Technology
2018年PostgreSQL中国技术大会

About Me
• Liu Dongming (刘东明)
• DRDS / PostgreSQL
• Alibaba Cloud PostgreSQL Group
2018年PostgreSQL中国技术大会

Parallel Features
• PostgreSQL 9.4, 9.5 [2014, 2015]
– Backgound workers
– Dynamic shared memory(DSM)
– Shared memory queues

• PostgreSQL 9.6 [2016]

– Executor nodes：Gather，Parallel Seq Scan，Partial Aggregate，Finalize Aggregate

• PostgreSQL 10 [2017]
– Partitions
– Executor nodes：Gather Merge，Parallel Index Scan，Parallel Bitmap Heap Scan

• PostgreSQL 11 [2018]
– Executor nodes：Parallel Append，Parallel Hash Join
– Planner：Partition-wise joins
– Parallel Create Index
2018年PostgreSQL中国技术大会

The Free Lunch Is Over

https://www.karlrupp.net/2018/02/42-years-of-microprocessor-trend-data/
https://en.wikipedia.org/wiki/Herb_Sutter#The_Free_Lunch_Is_Over
2018年PostgreSQL中国技术大会

Parallel Database System

Parallel Database Systems: The Future of High Performance Database Systems 1992
Authors: David Dewitt and Jim Gray

Why Parallel Databases?

• Relational Data Model – Relational queries are ideal candidates for parallelization
• Multiprocessor systems using inexpensive microprocessors provide more power and scalability than
expensive mainframe counterparts

• Shared-memory – All processors have equal access to a global memory and all disks
• Shared-disk – Each processor has its own private memory, but has equal access to all disks
• Shared-nothing – Each processor has its own private memory and disk(s)
2018年PostgreSQL中国技术大会

For Example

SELECT COUNT(*)
FROM people
WHERE inpgconn2018 = 'Y';
2018年PostgreSQL中国技术大会

EXPLAIN ANALYZE SELECT COUNT(*) FROM people WHERE atpgconn2018 = 'Y';

Aggregate (cost=169324.73..169324.74 rows=1 width=8) (actual time=983.729..983.730 rows=1 loops=1)

-> Seq Scan on people (cost=0.00..169307.23 rows=7001 width=0) (actual time=981.723..983.051 rows=9999 loops=1)
Filter: (atpgconn2018 = 'Y'::bpchar)
Rows Removed by Filter: 9990001
Planning Time: 0.066 ms
Execution Time: 983.760 ms max_parallel_workers_per_gather = 0

Finalize Aggregate (cost=97389.77..97389.78 rows=1 width=8) (actual time=384.848..384.848 rows=1 loops=1)

-> Gather (cost=97389.55..97389.76 rows=2 width=8) (actual time=384.708..386.486 rows=3 loops=1)
Workers Planned: 2
Workers Launched: 2
-> Partial Aggregate (cost=96389.55..96389.56 rows=1 width=8) (actual time=379.597..379.597 rows=1 loops=3)
-> Parallel Seq Scan on people (cost=0.00..96382.26 rows=2917 width=0)
(actual time=378.831..379.341 rows=3333 loops=3)
Filter: (atpgconn2018 = 'Y'::bpchar)
Rows Removed by Filter: 3330000
Planning Time: 0.063 ms
Execution Time: 386.532 ms max_parallel_workers_per_gather = 2
2018年PostgreSQL中国技术大会

Parallel Plan
Worker(W)：each worker runs a Finalize
copy of the plan fragment beneath of Aggregate
the Gather node.

Leader(L)：leader runs the Gather node

and the plan fragment on top of the Gather
Gather，may also run the plan fragment
beneath of the Gather node.
Partial Partial Partial
Aggregate Aggregate Aggregate

Parallel Seq Parallel Seq Parallel Seq

Scan Scan Scan

W L W
2018年PostgreSQL中国技术大会

PostgreSQL Query Architecture

Parser

Rewriter

Planner

Executor

Processes Memory IPC IO

Infrastructure for parallelism
2018年PostgreSQL中国技术大会

Background worker processes

postgres
(server process)
fork
fork kill
background
postgres worker
Client
(backend process) background
worker
2018年PostgreSQL中国技术大会

Dynamic Shared Memory

• Traditionaly, PostgreSQL has a Shared Memory
fixed-size shared memory mapped
at the same address in all processes,
inherited from the postmaster Buffer Pool
process.

• For parallel query execution, dynamic

shared memory segments are used;
they are extra shared memory, mapped
at an arbitrary address in each backend, L L
and unmapped at the end of the query. DSM
W

W
2018年PostgreSQL中国技术大会

IPC and Message Propagation

L
Shared memory queues (shm_mq) for
control messages and tuples .

If the background worker generates an

tuple error tuple error
ERROR, WARNING, or other message, it queue queue queue queue
DSM
can send that message to the master,
and the master can receive it.

W W
How parallel queries are executed?
2018年PostgreSQL中国技术大会

parallel-aware node
Node with Parallel prefix can be called parallel-aware operators.
Parallel-oblivious node is one where the node is unaware that it is part of a parallel plan.

Parallel
Parallel Seq Parallel Index Parallel Hash
Bitmap Heap
Scan Scan Join
Scan

Bitmap Heap
Seq Scan Index Scan Hash Join
Scan
2018年PostgreSQL中国技术大会

Parallel Seq Scan

L W W

8kb 8kb 8kb 8kb ...

How to allocate work for workers and leader?

Block-By-Block，each process advances a shared next block pointer to choose a block to scan.
2018年PostgreSQL中国技术大会

Parallel Index Scan

• Parallel index scans are supported only
for btree indexes

• Each process advances a shared next

block pointer to choose an index block
and will scan and return all tuples
referenced by that block

next
L W W

8kb 8kb 8kb ...

2018年PostgreSQL中国技术大会

Parallel Bitmap Heap Scan

Parallel
• Similar to Parallel Seq Scan, but scan only pages that were found Bitmap Heap
to potentially contain interesting tuples Scan

• The bitmap is currently built by a single processes; only the

actual Parallel Bitmap Heap Scan is parallel-aware Bitmap Index
Scan
2018年PostgreSQL中国技术大会

Nest Loop Join

The inner side is always non-
parallel. Although it is executed in
full, this is efficient if the inner side
Gather
is an index scan.

Nest Loop Nest Loop Nest Loop

Join Join Join

Parallel Seq Parallel Seq Parallel Seq

Index Scan Index Scan Index Scan
Scan Scan Scan
2018年PostgreSQL中国技术大会

Merge Join
The inner side is always a non-
parallel plan and therefore
executed in full. Gather

Merge Join Merge Join Merge Join

Parallel Seq Parallel Seq Parallel Seq

Index Scan Index Scan Index Scan
Scan Scan Scan
2018年PostgreSQL中国技术大会

The merge join may be inefficient,

Merge Join
especially if a sort must be performed,
because the work and resulting data
are duplicated in every cooperating Gather
process.

Merge Join Merge Join Merge Join

Parallel Seq Parallel Seq Parallel Seq

Sort Sort Sort
Scan Scan Scan

Seq Scan Seq Scan Seq Scan

2018年PostgreSQL中国技术大会

Hash Join
The inner side is executed in full by
every cooperating process to build
identical copies of the hash table.
This may be inefficient if the hash Gather
table is large or the plan is
expensive.

Hash Join Hash Join Hash Join

Parallel Seq Parallel Seq Parallel Seq

Scan Scan Scan
Hash Hash Hash
HashTable HashTable HashTable

Seq Scan Seq Scan Seq Scan

2018年PostgreSQL中国技术大会

Parallel Hash Join

The parallel hash divides the work of
building a shared hash table over the
cooperating processes.
Gather

Parallel Hash Parallel Hash Parallel Hash

Join Join Join

Parallel Seq Parallel Seq Parallel Seq

Scan Scan Scan
Parallel Hash Parallel Hash Parallel Hash
HashTable

Parallel Seq Parallel Seq Parallel Seq

Scan Scan Scan
2018年PostgreSQL中国技术大会

Execution time of different hash join

hash
Seq Scan on inner Seq Scan on outter
table

hash Parallel Seq

Seq Scan on inner
table Scan on outter
hash Parallel Seq
Seq Scan on inner
table Scan on outter
hash Parallel Seq
Seq Scan on inner Hash Join
table Scan on outter

Parallel Seq Parallel Seq

Scan on inner Scan on outter
hash table

Parallel Seq Parallel Seq

Scan on inner Scan on outter
Parallel Seq Scan Parallel Seq
on inner Scan on outter Parallel Hash Join
2018年PostgreSQL中国技术大会

Partition-wise join
Divide and conquer for joins between partitioned table.

Append

Nest Loop
Merge Join Hash Join
Join

ta_p0 tb_p0 ta_p1 tb_p1 ta_p2 tb_p2

2018年PostgreSQL中国技术大会

Parallel Append

Parallel
Parallel
Parallel
Append
Append
Append

Nest Loop
Merge Join Hash Join
Join

ta_p0 tb_p0 ta_p1 tb_p1 ta_p2 tb_p2

How parallel queries are planned?
2018年PostgreSQL中国技术大会

Cost-base Planner
①
① Think of all ways we could execute a query
② Estimate the runtime of each path, than
path path path path ...
choose the cheapest path
③ Convert path into a plan ready for execution
②

• For parallel query, introduce parallel-aware chepest path

node and partial paths
③
• For partial paths, generate
Gather/GatherMerge on top of them plan for
execution
2018年PostgreSQL中国技术大会

Parallel path

Gather

Nest Loop

VS
Join
Partial Nest
Loop Join

Seq Scan Index Scan

Partial Seq
Index Scan
Scan
2018年PostgreSQL中国技术大会

Rule-based parallel degree

2018年PostgreSQL中国技术大会

Costs
• SET parallel_setup_cost = 1000
– Cost of setting up shared memory for parallelism, and launching
workers.
– Discourage parallel query for short queries

• SET parallel_tuple_cost = 0.1

– Cost of CPU time to pass a tuple from worker to leader process
– Discourage parallel query if large amouts of results have to be sent
back
2018年PostgreSQL中国技术大会

Parallelism cannot be used in the following cases

• Query writes any data or locks any database rows
• CTE(with...)
• FULL OUTER JOINs
• SERIALIZABLE transaction isolation
• Use functions marked PARALLEL UNSAFE
• DECLARE CURSOR
2018年PostgreSQL中国技术大会

Future work
• More operators support parallelism, such as sort
• Dynamic repartitioning
• Cost-based planning of parallel degree?
2018年PostgreSQL中国技术大会

References
• https://speakerdeck.com/macdice/parallelism-in-postgresql-11

• https://www.postgresql.org/docs/11/parallel-plans.html#PARALLEL-JOINS

• http://rhaas.blogspot.com/2013/10/parallelism-progress.html

• http://ashutoshpg.blogspot.com/2017/12/partition-wise-joins-divide-and-conquer.html

• http://www.gotw.ca/publications/concurrency-ddj.htm

• https://www.enterprisedb.com/blog/parallel-hash-postgresql

• https://write-skew.blogspot.com/2018/01/parallel-hash-for-postgresql.html

• http://amitkapila16.blogspot.com/2015/11/parallel-sequential-scans-in-play.html

• https://blog.2ndquadrant.com/parallel-aggregate/
Welcome to Alibaba Cloud
Database Technology Group.

[email protected]
Thanks

Postgresql Benchmark
No ratings yet
Postgresql Benchmark
36 pages
Distributed PostgreSQL
No ratings yet
Distributed PostgreSQL
118 pages
50 46 Pgcon2008 Problem
No ratings yet
50 46 Pgcon2008 Problem
36 pages
PostgreSQL - Identifying Slow Queries and Fixing Them
No ratings yet
PostgreSQL - Identifying Slow Queries and Fixing Them
40 pages
Parallel Query Processing in PostgreSQL
No ratings yet
Parallel Query Processing in PostgreSQL
15 pages
Postgresql Performance Tuning: Ruce Omjian
No ratings yet
Postgresql Performance Tuning: Ruce Omjian
61 pages
PostgreSQL Distributed Architectures and Best Practices
No ratings yet
PostgreSQL Distributed Architectures and Best Practices
42 pages
A Deep Dive Into PostgreSQL Indexing
No ratings yet
A Deep Dive Into PostgreSQL Indexing
29 pages
ADC Theory
No ratings yet
ADC Theory
7 pages
PostgreSQL Internals Notes Compilation
100% (1)
PostgreSQL Internals Notes Compilation
18 pages
Building A Scalable Time-Series Database Using Postgres: Mike Freedman
No ratings yet
Building A Scalable Time-Series Database Using Postgres: Mike Freedman
45 pages
Zafin Learn Session - PostgreSQL Performance For Application Developers
No ratings yet
Zafin Learn Session - PostgreSQL Performance For Application Developers
58 pages
Postgres Topic
No ratings yet
Postgres Topic
116 pages
PostgreSQL Configuration For Humans
No ratings yet
PostgreSQL Configuration For Humans
38 pages
CS 631 - Implementation Techniques For Relational Database Systems
No ratings yet
CS 631 - Implementation Techniques For Relational Database Systems
4 pages
Postgresql Course Material
No ratings yet
Postgresql Course Material
205 pages
Deep Dive Into Postgresql Statistics Pgconf Us 2016 160413073045
No ratings yet
Deep Dive Into Postgresql Statistics Pgconf Us 2016 160413073045
54 pages
Lecture 10: Parallel Query Evaluation: CS 838: Foundations of Data Management Spring 2016
No ratings yet
Lecture 10: Parallel Query Evaluation: CS 838: Foundations of Data Management Spring 2016
4 pages
Lecture 1 Parallel Databases
No ratings yet
Lecture 1 Parallel Databases
30 pages
Psycopg 2010 Stuttgart
No ratings yet
Psycopg 2010 Stuttgart
44 pages
Query Optimization
No ratings yet
Query Optimization
17 pages
The Best of Bruce's Postgres Slides: Ruce Omjian
No ratings yet
The Best of Bruce's Postgres Slides: Ruce Omjian
26 pages
Postgres Pro Vs EDB - v16
No ratings yet
Postgres Pro Vs EDB - v16
7 pages
PostgreSQL Performance Tuning
100% (9)
PostgreSQL Performance Tuning
63 pages
Postgresql
No ratings yet
Postgresql
56 pages
postgresql安装
No ratings yet
postgresql安装
4 pages
ADBMS Parallel and Distributed Databases
No ratings yet
ADBMS Parallel and Distributed Databases
98 pages
Accidentaldbalinuxcon 130102190320 Phpapp02
No ratings yet
Accidentaldbalinuxcon 130102190320 Phpapp02
61 pages
Parallel Database: Architecture For Parallel Databases. Parallel Query Evaluation Parallelizing Individual Operations
No ratings yet
Parallel Database: Architecture For Parallel Databases. Parallel Query Evaluation Parallelizing Individual Operations
27 pages
PostgreSQL IQ
No ratings yet
PostgreSQL IQ
27 pages
Interview Questions Postgres
No ratings yet
Interview Questions Postgres
18 pages
Core Extensions For Postgresql Performance Tuning
No ratings yet
Core Extensions For Postgresql Performance Tuning
4 pages
The Internals of PostgreSQL - Chapter 1 Database Cluster, Databases, and Tables
No ratings yet
The Internals of PostgreSQL - Chapter 1 Database Cluster, Databases, and Tables
10 pages
PGSQL Oracle
No ratings yet
PGSQL Oracle
9 pages
PostgreSQL Cheatsheet
No ratings yet
PostgreSQL Cheatsheet
2 pages
PostgreSQL Interview Questions Top Answers
No ratings yet
PostgreSQL Interview Questions Top Answers
8 pages
Postgrre
No ratings yet
Postgrre
14 pages
Internal Pics
No ratings yet
Internal Pics
72 pages
Postgresql Query Optimization: Step by Step Techniques
No ratings yet
Postgresql Query Optimization: Step by Step Techniques
50 pages
12 Algorithms For System Design Interviews
No ratings yet
12 Algorithms For System Design Interviews
8 pages
GPU Accelerated Databases, Speeding Up Database Time Series Analysis Using OpenCL
No ratings yet
GPU Accelerated Databases, Speeding Up Database Time Series Analysis Using OpenCL
29 pages
PostgreSQL Essentials v16 Student
No ratings yet
PostgreSQL Essentials v16 Student
400 pages
Content CheatsheetPostgreSQL
No ratings yet
Content CheatsheetPostgreSQL
3 pages
Intraquery Parallelism Intraoperation Parallelism Interoperation Parallelism Design of Parallel Systems
No ratings yet
Intraquery Parallelism Intraoperation Parallelism Interoperation Parallelism Design of Parallel Systems
29 pages
Para Distr Query Processing Notes
No ratings yet
Para Distr Query Processing Notes
7 pages
PostgreSQL Internals Through Pictures
100% (3)
PostgreSQL Internals Through Pictures
72 pages
Postgres Monitoring Queries All
No ratings yet
Postgres Monitoring Queries All
16 pages
Foundations PostgreSQL Administration 13
100% (1)
Foundations PostgreSQL Administration 13
307 pages
Pganalyze Effective Indexing in Postgres
No ratings yet
Pganalyze Effective Indexing in Postgres
29 pages
2-Performance Issues in PostgreSQL SLRU - How We Are Optimizing It by Dilip 2
No ratings yet
2-Performance Issues in PostgreSQL SLRU - How We Are Optimizing It by Dilip 2
21 pages
Introduction Postgre SQLAdministration V11
No ratings yet
Introduction Postgre SQLAdministration V11
274 pages
135 - PGCon 2009 - Aster v6
No ratings yet
135 - PGCon 2009 - Aster v6
18 pages
Parallel Execution in Oracle
No ratings yet
Parallel Execution in Oracle
17 pages
To Paralelel or Not
No ratings yet
To Paralelel or Not
62 pages
Recent Postgresql Optimizer Improvements: Tom Lane Postgresql - Red Hat Edition Group Red Hat, Inc
No ratings yet
Recent Postgresql Optimizer Improvements: Tom Lane Postgresql - Red Hat Edition Group Red Hat, Inc
34 pages
Postgresql 95231693412781986
No ratings yet
Postgresql 95231693412781986
218 pages
03-PostgreSQL-Database Admin Overview
No ratings yet
03-PostgreSQL-Database Admin Overview
32 pages
MCQ For Postgres 1
No ratings yet
MCQ For Postgres 1
4 pages
Admin Workshop
No ratings yet
Admin Workshop
117 pages
Dummy
No ratings yet
Dummy
25 pages
A Interview Faq's - 1
No ratings yet
A Interview Faq's - 1
25 pages
SWL Server Interview Questions Part 2
No ratings yet
SWL Server Interview Questions Part 2
12 pages
Intro To Databases Worksheet I. Terms/concepts Associated With The Lesson
No ratings yet
Intro To Databases Worksheet I. Terms/concepts Associated With The Lesson
2 pages
Practical Book For Mysql
No ratings yet
Practical Book For Mysql
45 pages
FALL 2024 DB Session 1-1
No ratings yet
FALL 2024 DB Session 1-1
22 pages
BDA Unit-4
No ratings yet
BDA Unit-4
12 pages
DBMS Tutorial
No ratings yet
DBMS Tutorial
171 pages
Resolving Common Oracle Wait Events Using The Wait Interface
No ratings yet
Resolving Common Oracle Wait Events Using The Wait Interface
14 pages
Amazing MySQL Interview Preparation
No ratings yet
Amazing MySQL Interview Preparation
15 pages
IP Assigment Edited
No ratings yet
IP Assigment Edited
20 pages
Oracle GoldenGate Best Practices - Configuring Oracle GoldenGate For Teradata Databases V5a ID1323119.1-1
No ratings yet
Oracle GoldenGate Best Practices - Configuring Oracle GoldenGate For Teradata Databases V5a ID1323119.1-1
43 pages
Cours Excel
No ratings yet
Cours Excel
37 pages
Clover Intv QSTN
No ratings yet
Clover Intv QSTN
10 pages
Mar, 2024 - Dumpsactual C-ABAPD-2309 PDF Dumps and C-ABAPD-2309 Exam Questions (Q34-Q49)
No ratings yet
Mar, 2024 - Dumpsactual C-ABAPD-2309 PDF Dumps and C-ABAPD-2309 Exam Questions (Q34-Q49)
14 pages
Relational Databases and Microsoft Access
100% (2)
Relational Databases and Microsoft Access
211 pages
Dbmss
No ratings yet
Dbmss
14 pages
ADI Hashing
No ratings yet
ADI Hashing
47 pages
Presentation14 Physical Database Design
No ratings yet
Presentation14 Physical Database Design
21 pages
Command Syntax of Create Table
No ratings yet
Command Syntax of Create Table
5 pages
DBMS 1st Unit Notes
No ratings yet
DBMS 1st Unit Notes
20 pages
Pyomo Online Documentation 5.1.1
No ratings yet
Pyomo Online Documentation 5.1.1
113 pages
SQL Theory & Query
No ratings yet
SQL Theory & Query
23 pages
Subqueries With ANY, IN, or SOME: ANY True True ANY s1 t1 s1 s1 t2 IN Any IN Any Some ANY
No ratings yet
Subqueries With ANY, IN, or SOME: ANY True True ANY s1 t1 s1 s1 t2 IN Any IN Any Some ANY
7 pages
DB2 Problem Determination Using Db2top Utility
100% (3)
DB2 Problem Determination Using Db2top Utility
40 pages
Altibase 7.1.0 GettingStarted Eng PDF
No ratings yet
Altibase 7.1.0 GettingStarted Eng PDF
84 pages
All Tables Details
No ratings yet
All Tables Details
18 pages
Implement - Column-Family Stores
No ratings yet
Implement - Column-Family Stores
37 pages
Troubleshooting PostgreSQL - Sample Chapter
100% (1)
Troubleshooting PostgreSQL - Sample Chapter
15 pages
Transbase Release Notes Version 6 8 1 English
No ratings yet
Transbase Release Notes Version 6 8 1 English
37 pages

PostgresChina2018 刘东明 PostgreSQL并行查询

Uploaded by

PostgresChina2018 刘东明 PostgreSQL并行查询

Uploaded by

2018PostgreSQL中国技术大会

PostgreSQL Parallel Query

• PostgreSQL 9.6 [2016]

The Free Lunch Is Over

Parallel Database System

Why Parallel Databases?

EXPLAIN ANALYZE SELECT COUNT(*) FROM people WHERE atpgconn2018 = 'Y';

Aggregate (cost=169324.73..169324.74 rows=1 width=8) (actual time=983.729..983.730 rows=1 loops=1)

Finalize Aggregate (cost=97389.77..97389.78 rows=1 width=8) (actual time=384.848..384.848 rows=1 loops=1)

Leader(L)：leader runs the Gather node

Parallel Seq Parallel Seq Parallel Seq

PostgreSQL Query Architecture

Processes Memory IPC IO

Background worker processes

Dynamic Shared Memory

• For parallel query execution, dynamic

IPC and Message Propagation

If the background worker generates an

Parallel Seq Scan

8kb 8kb 8kb 8kb ...

How to allocate work for workers and leader?

Parallel Index Scan

• Each process advances a shared next

8kb 8kb 8kb ...

Parallel Bitmap Heap Scan

• The bitmap is currently built by a single processes; only the

Nest Loop Join

Nest Loop Nest Loop Nest Loop

Parallel Seq Parallel Seq Parallel Seq

Merge Join Merge Join Merge Join

Parallel Seq Parallel Seq Parallel Seq

The merge join may be inefficient,

Merge Join Merge Join Merge Join

Parallel Seq Parallel Seq Parallel Seq

Seq Scan Seq Scan Seq Scan

Hash Join Hash Join Hash Join

Parallel Seq Parallel Seq Parallel Seq

Seq Scan Seq Scan Seq Scan

Parallel Hash Join

Parallel Hash Parallel Hash Parallel Hash

Parallel Seq Parallel Seq Parallel Seq

Parallel Seq Parallel Seq Parallel Seq

Execution time of different hash join

hash Parallel Seq

Parallel Seq Parallel Seq

Parallel Seq Parallel Seq

ta_p0 tb_p0 ta_p1 tb_p1 ta_p2 tb_p2

ta_p0 tb_p0 ta_p1 tb_p1 ta_p2 tb_p2

• For parallel query, introduce parallel-aware chepest path

Seq Scan Index Scan

Rule-based parallel degree

• SET parallel_tuple_cost = 0.1

Parallelism cannot be used in the following cases

You might also like