0% found this document useful (0 votes)

99 views18 pages

Guide To PostgreSQL Table Partitioning - by Rasiksuhail - Medium

Uploaded by

Adrian Rangel

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

99 views18 pages

Guide To PostgreSQL Table Partitioning - by Rasiksuhail - Medium

Uploaded by

Adrian Rangel

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 18

Guide to PostgreSQL Table

Partitioning
Rasiksuhail · Follow
7 min read · Aug 7, 2023

208 4

Photo by Caspar Camille Rubin on Unsplash

PostgreSQL is a powerful open-source relational database management
system that offers various advanced features for managing large and
complex datasets. One such feature is table partitioning, which allows you
to divide a large table into smaller, more manageable pieces called
partitions.

This guide will explore the concept of table partitioning in PostgreSQL and
discuss how it can be leveraged to improve query performance and manage data
efficiently.

What is Table Partitioning?

Table partitioning is a database design technique used to divide a large table
into smaller, more manageable chunks called partitions. Each partition is
essentially a separate table that stores a subset of the original data. This
technique can significantly improve query performance and data
management for large datasets.

Partitioning can be done based on one or more columns, such as a date

column or a range of values. For example, you can partition a table based
on the date of the records, where each partition represents data for a
specific date range. When querying the data, PostgreSQL can quickly
eliminate partitions that are not relevant to the query, resulting in faster
query execution.

Benefits of Table Partitioning

1. Improved Query Performance: Partitioning allows the database to
quickly narrow down the data to a specific partition, reducing the
amount of data that needs to be scanned during queries. This results in
faster query execution times, especially for large datasets.

2. Easier Data Management: With table partitioning, you can easily

manage large datasets by splitting them into smaller, more manageable
partitions. This can simplify tasks such as data archiving, data purging,
and backup and restore operations.

3. Enhanced Data Loading and Indexing: When loading data into a

partitioned table, the process can be parallelized, leading to faster data
ingestion. Additionally, indexes on partitioned tables can be more
efficient, as they only need to cover a smaller subset of data.

4. Cost-Effective Storage: Partitioning allows you to store older or less

frequently accessed data on cheaper storage media, while keeping
frequently accessed data on faster storage devices.

Partitioning Methods in PostgreSQL

PostgreSQL offers various partitioning methods, including:

Range Partitioning

List Partitioning

Hash Partitioning

Lets look at each partitioning methods

Range Partitioning
Range partitioning is a type of table partitioning where data is divided into
partitions based on a specified range of values in a column. This is
particularly useful when dealing with time-series data or any data that has a
natural order. Each partition represents a distinct range of values, and data
falling within that range is stored in that partition. Range partitioning
allows for efficient retrieval of data within specific ranges, leading to
improved query performance.

Let’s consider an example of a sales table with the following structure:

CREATE TABLE sales (

sale_id SERIAL,
sale_date DATE NOT NULL,
product_id INT,
quantity INT,
amount NUMERIC,
PRIMARY KEY (sale_id, sale_date)
) PARTITION BY RANGE (sale_date);

To create a range-partitioned table for this sales data based on the sale_date
column, we need to follow these steps:

Create Partitions

We’ll create individual tables to represent each partition, each covering a

specific range of dates. For demonstration purposes, we’ll create three
partitions: “sales_january,” “sales_february,” and “sales_march.”

CREATE TABLE sales_january PARTITION OF sales

FOR VALUES FROM ('2023-01-01') TO ('2023-02-01');

CREATE TABLE sales_february PARTITION OF sales

FOR VALUES FROM ('2023-02-01') TO ('2023-03-01');

CREATE TABLE sales_march PARTITION OF sales

FOR VALUES FROM ('2023-03-01') TO ('2023-04-01');

Set Up Constraints

We need to define constraints on each partition to ensure that data is

correctly routed to the appropriate partition. In this example, we will use
CHECK constraints on the sale_date column for each partition:

ALTER TABLE sales_january ADD CONSTRAINT sales_january_check

CHECK (sale_date >= '2023-01-01' AND sale_date < '2023-02-01');

ALTER TABLE sales_february ADD CONSTRAINT sales_february_check

CHECK (sale_date >= '2023-02-01' AND sale_date < '2023-03-01');

ALTER TABLE sales_march ADD CONSTRAINT sales_march_check

CHECK (sale_date >= '2023-03-01' AND sale_date < '2023-04-01');

Insert Data into Partitions

Now, we can insert data into the sales table, and PostgreSQL will
automatically route the data to the appropriate partition based on the
sale_date:
INSERT INTO sales (sale_date, product_id, quantity, amount)
VALUES ('2023-01-15', 101, 5, 100.00);

INSERT INTO sales (sale_date, product_id, quantity, amount)

VALUES ('2023-02-20', 102, 10, 200.00);

INSERT INTO sales (sale_date, product_id, quantity, amount)

VALUES ('2023-03-10', 103, 8, 150.00);

Querying Data from Partitions

When querying data, PostgreSQL will automatically access only the relevant
partitions based on the WHERE clause.

-- Retrieve sales data for January

SELECT * FROM sales WHERE sale_date >= '2023-01-01' AND sale_date < '2023-02-01';

-- Retrieve sales data for February

SELECT * FROM sales WHERE sale_date >= '2023-02-01' AND sale_date < '2023-03-01';

-- Retrieve sales data for March

SELECT * FROM sales WHERE sale_date >= '2023-03-01' AND sale_date < '2023-04-01';

These queries will only access the appropriate partitions, resulting in

improved query performance.

List Partitioning in PostgreSQL

List partitioning is another type of table partitioning in PostgreSQL, where
data is divided into partitions based on specific values in a column. Unlike
range partitioning, which uses a range of values, list partitioning allows you
to define specific values for each partition. This partitioning technique is
useful when data can be categorized into distinct, non-overlapping sets.

Let’s consider an example of a products table with the following structure:

CREATE TABLE products (

product_id SERIAL PRIMARY KEY,
category TEXT,
product_name TEXT,
price NUMERIC
) partition by list(category);

To create a list-partitioned table for this products data based on the category
column, we need to follow these steps:

Create Partitions

We’ll create individual tables to represent each partition, with each

partition covering a specific category of products. For demonstration
purposes, we’ll create three partitions: “electronics,” “clothing,” and
“furniture.”

CREATE TABLE electronics PARTITION OF products

FOR VALUES IN ('Electronics');

CREATE TABLE clothing PARTITION OF products

FOR VALUES IN ('Clothing');
CREATE TABLE furniture PARTITION OF products
FOR VALUES IN ('Furniture');

Set Up Constraints

Since list partitioning is based on specific values, we don’t need CHECK

constraints. However, we need to set up the partitions correctly by adding
rows to the appropriate tables.

Insert Data into Partitions

Now, we can insert data into the products table, and PostgreSQL will
automatically route the data to the appropriate partition based on the
category.

INSERT INTO products (category, product_name, price)

VALUES ('Electronics', 'Smartphone', 500.00);

INSERT INTO products (category, product_name, price)

VALUES ('Clothing', 'T-Shirt', 25.00);

INSERT INTO products (category, product_name, price)

VALUES ('Furniture', 'Sofa', 800.00);

Querying Data from Partitions

When querying data, PostgreSQL will automatically access only the relevant
partition based on the WHERE clause.
-- Retrieve electronics products
SELECT * FROM products WHERE category = 'Electronics';

-- Retrieve clothing products

SELECT * FROM products WHERE category = 'Clothing';|

-- Retrieve furniture products

SELECT * FROM products WHERE category = 'Furniture';

List partitioning in PostgreSQL is a valuable technique for managing and

querying data based on specific values in a column. By dividing data into
partitions based on categories or other distinct sets, list partitioning allows
for faster data retrieval and improved data management

Hash Partitioning in PostgreSQL

Hash partitioning is a type of table partitioning in PostgreSQL, where data is
divided into partitions based on the hash value of a specified column.
Unlike range or list partitioning, which uses specific values or ranges, hash
partitioning uses a hash function to distribute data uniformly across
partitions. This partitioning technique is useful when you want to evenly
distribute data across partitions to achieve load balancing.

Let’s consider an example of an orders table with the following structure:

CREATE TABLE orders (

order_id SERIAL PRIMARY KEY,
order_date DATE,
customer_id INT,
total_amount NUMERIC
) partition by hash(customer_id);
To create a hash-partitioned table for this orders data based on the
customer_id column, we need to follow these steps:

Create Partitions

Search
We’ll create individual tables to represent each Write
partition,Sign up
with Sign in
each
partition covering a specific range of hash values. For demonstration
purposes, let’s create three partitions.

CREATE TABLE orders_1 PARTITION OF orders

FOR VALUES WITH (MODULUS 3, REMAINDER 0);

CREATE TABLE orders_2 PARTITION OF orders

FOR VALUES WITH (MODULUS 3, REMAINDER 1);

CREATE TABLE orders_3 PARTITION OF orders

FOR VALUES WITH (MODULUS 3, REMAINDER 2);

In this example, we use the HASH() function to specify that the data should
be partitioned based on the hash value of the customer_id column. We use
MODULUS and REMAINDER to specify the number of partitions (3 in this case)
and the remainder value for each partition.

Insert Data into Partitions

Now, we can insert data into the orders table, and PostgreSQL will
automatically route the data to the appropriate partition based on the hash
value of the customer_id :

INSERT INTO orders (order_date, customer_id, total_amount)

VALUES ('2023-01-15', 101, 500.00);

INSERT INTO orders (order_date, customer_id, total_amount)

VALUES ('2023-02-20', 102, 600.00);

INSERT INTO orders (order_date, customer_id, total_amount)

VALUES ('2023-03-10', 103, 700.00);

Querying Data from Partitions

When querying data, PostgreSQL will automatically access the appropriate

partition based on the hash value of the customer_id .

-- Retrieve orders for customer_id 101

SELECT * FROM orders WHERE customer_id = 101;

-- Retrieve orders for customer_id 102

SELECT * FROM orders WHERE customer_id = 102;

-- Retrieve orders for customer_id 103

SELECT * FROM orders WHERE customer_id = 103;

Hash partitioning in PostgreSQL is a useful technique for distributing data

evenly across partitions based on the hash value of a specified column. By
leveraging hash functions to uniformly distribute data, hash partitioning
achieves load balancing and improves query performance.

Partition makes querying faster.

PostgreSQL table partitioning is a powerful feature that can significantly

enhance the performance and management of large datasets. By dividing
data into smaller partitions, you can optimize query performance, simplify
data management, and achieve more efficient data loading and indexing.
When designing a partitioning strategy, consider your data and query
patterns to choose the most appropriate partitioning method. With the right
implementation, table partitioning can be a game-changer for handling
massive amounts of data in PostgreSQL.

Start Partitioning !

Explore my other blogs as well:

Data Engineering 2023: Unlocking the Power of Data for

Businesses
In the fast-paced world of data, businesses are constantly seeking
innovative ways to harness the power of their data…
medium.com

Data Debt: The Silent Killer of Data-Driven Organizations

Big Data Debt
medium.com
Thanks for Reading !

I post about Data , AI , Startups , Leadership, Writing & Culture.

Stay Tuned for my next blog !!

Sql Postgres Database Data Engineering Query Optimization

Written by Rasiksuhail Follow

717 Followers

Exploring Data!!

More from Rasiksuhail

Rasiksuhail Rasiksuhail

Orchestrating dbt with Airflow: A Orchestrating dbt with Airflow: A

Step by Step Guide to Automatin… Step by Step Guide to Automatin…
Data Pipelines
In today’s —Part
data-driven world,I organizations Data Pipelines
In today’s —Part
data-centric II
landscape,
rely heavily on automated data pipelines to… organizations heavily rely on automated dat…
process, transform, and analyze vast pipelines to manage vast data volumes. dbt
amounts
Jul 23, 2023of… 200 3 (data build… 124
Jul 28, 2023 3

Rasiksuhail Rasiksuhail

Introduction to dbt—Step by Step Venturing into Complex

Guide for Beginners Techniques with dbt: Dynamic…
Data Build Tool Data Selection,
DBT (Data Lineage,
Build Tool) and
is an open-source
Governance
command-line tool that helps data analysts…
and engineers transform and manage their
Apr 3, 2023 123 data pipelines…
Apr 24, 2023 86 1
See all from Rasiksuhail

Recommended from Medium

João Salgado Code Geass

Partitioning your PostgreSQL table Optimizing PostgreSQL Queries:

I recently had to deal with a PostgreSQL table From 300 Seconds to 2 Seconds…
with a very large number of rows in a heavil… for Billions
Handling data of Records
in the billions is challenging
used database (read and write). The and often pushes the limits of traditional…
database… relational databases. PostgreSQL, while
May 6 4 robust…
Aug 26 21

Lists
ChatGPT data science and AI
21 stories · 776 saves 40 stories · 229 saves

Natural Language Processing Staff Picks

1670 stories · 1250 saves 722 stories · 1266 saves
Vishal Barvaliya Mehdi Lotfinejad

IN vs EXISTS in SQL 10 Command-Line Utilities in

When you’re working with SQL, you’ll often PostgreSQL
find yourself needing to filter data based on…
values in other tables. Two common ways to
do this…
Aug 13 202 2 Aug 17 19

Zach Quinn in Learning SQL Dylan Smith in Javarevisited

How I Reduced My Query’s Run Interview: How to Check Whether a

Time From 30 Min. To 30 Sec. In 1… Username Exists Among One…
Hour
The query optimization steps a senior data Billion Users?
My articles are open to everyone; non-
engineer took to reduce the process time of… member readers can read the full article by…
a query processing 1 billion+ rows. clicking this link.
Mar 13 780 18 Aug 18 1.1K 30

See more recommendations

Help Status About Careers Press Blog Privacy Terms Text to speech Teams

SQL Server Partitioning
100% (2)
SQL Server Partitioning
20 pages
Oracle Partitioning Interview Questions and Answers
0% (1)
Oracle Partitioning Interview Questions and Answers
3 pages
2 Partitioning+QC+Done
No ratings yet
2 Partitioning+QC+Done
74 pages
Partitioning in Oracle 9i
100% (8)
Partitioning in Oracle 9i
19 pages
Partitioning Shines in Postgresql 11: Amit Langote, NTT Oss Center Pgconf - Asia, Tokyo Dec 11, 2018
No ratings yet
Partitioning Shines in Postgresql 11: Amit Langote, NTT Oss Center Pgconf - Asia, Tokyo Dec 11, 2018
48 pages
18 Partitioned Tables and Indexes: Introduction To Partitioning
No ratings yet
18 Partitioned Tables and Indexes: Introduction To Partitioning
84 pages
Oracle Partitioning For Developers
No ratings yet
Oracle Partitioning For Developers
70 pages
Learn How To Partition in Oracle 9i Release 2: Title Slide
No ratings yet
Learn How To Partition in Oracle 9i Release 2: Title Slide
31 pages
Chapt 23
No ratings yet
Chapt 23
30 pages
Parallel Databases
No ratings yet
Parallel Databases
19 pages
A Comprehensive Guide To Oracle Partitioning With Samples
No ratings yet
A Comprehensive Guide To Oracle Partitioning With Samples
36 pages
Partitioning
No ratings yet
Partitioning
224 pages
PracticalPartitioning v2
No ratings yet
PracticalPartitioning v2
76 pages
Intro To Cassandra For Developers
No ratings yet
Intro To Cassandra For Developers
61 pages
5 Partitioning
No ratings yet
5 Partitioning
23 pages
Getting To Know The Ins and Outs of Oracle Partitioning in Oracle Database 11g
No ratings yet
Getting To Know The Ins and Outs of Oracle Partitioning in Oracle Database 11g
48 pages
Oracle Partitioning in Oracle Database 11g
No ratings yet
Oracle Partitioning in Oracle Database 11g
47 pages
Table Partitioning:: Secret Weapon For Big Data Problems
No ratings yet
Table Partitioning:: Secret Weapon For Big Data Problems
46 pages
Things You Always Wanted To Know About Oracle Partitioning
No ratings yet
Things You Always Wanted To Know About Oracle Partitioning
43 pages
Oracle 12c Partitioned and Subpartitioned Tables
No ratings yet
Oracle 12c Partitioned and Subpartitioned Tables
24 pages
Table Partitioning: Creating Partition Tables
No ratings yet
Table Partitioning: Creating Partition Tables
8 pages
Teradata PPI
No ratings yet
Teradata PPI
14 pages
Zafin Learn Session - PostgreSQL Performance For Application Developers
No ratings yet
Zafin Learn Session - PostgreSQL Performance For Application Developers
58 pages
C3 - Code Table Partition - 04 - 10 - 2023
No ratings yet
C3 - Code Table Partition - 04 - 10 - 2023
6 pages
Partition Table
No ratings yet
Partition Table
5 pages
Oracle Optimization Tutorial - Partitioning
No ratings yet
Oracle Optimization Tutorial - Partitioning
5 pages
Oracle Partitioned Tables
No ratings yet
Oracle Partitioned Tables
38 pages
Postgresql Question
No ratings yet
Postgresql Question
12 pages
Major Features: Postgres 10: Ruce Omjian
No ratings yet
Major Features: Postgres 10: Ruce Omjian
20 pages
Oracle 11g Partitioning
No ratings yet
Oracle 11g Partitioning
11 pages
Oracle Performance Tuning - Oracle Partitioning - Introduction
No ratings yet
Oracle Performance Tuning - Oracle Partitioning - Introduction
57 pages
11g Partitioning Features Part2
No ratings yet
11g Partitioning Features Part2
4 pages
Creating Partition Table in ODOO 17
No ratings yet
Creating Partition Table in ODOO 17
6 pages
Lab5 Partitioning2
No ratings yet
Lab5 Partitioning2
5 pages
Lab 04
No ratings yet
Lab 04
4 pages
Oracle Partitioning
No ratings yet
Oracle Partitioning
6 pages
Postgres Partitioning
No ratings yet
Postgres Partitioning
6 pages
Partitioning in Oracle 1728042170
No ratings yet
Partitioning in Oracle 1728042170
12 pages
Partitions: Creating A Range-Partitioned Table
No ratings yet
Partitions: Creating A Range-Partitioned Table
3 pages
Oracle Partitions by Fayyaz Ahmed
No ratings yet
Oracle Partitions by Fayyaz Ahmed
7 pages
Lab4 Partitioning
No ratings yet
Lab4 Partitioning
2 pages
Oracle Partitioning - Enhance Performance & Data Management-1
No ratings yet
Oracle Partitioning - Enhance Performance & Data Management-1
6 pages
Performance Tuning - Partitioning
No ratings yet
Performance Tuning - Partitioning
11 pages
Partitioned Tables and Indexes: Introduction To Partitioning
No ratings yet
Partitioned Tables and Indexes: Introduction To Partitioning
18 pages
12-02-2024
No ratings yet
12-02-2024
4 pages
Partition Types
No ratings yet
Partition Types
4 pages
3 RD Unit Partioning
No ratings yet
3 RD Unit Partioning
3 pages
How To Partition PostgreSQL Database
No ratings yet
How To Partition PostgreSQL Database
8 pages
ADB25 Lab 5
No ratings yet
ADB25 Lab 5
6 pages
Oracle (2PM) 2
No ratings yet
Oracle (2PM) 2
3 pages
Partitioning in Oracle
No ratings yet
Partitioning in Oracle
5 pages
Table Partitioning in SQL Server
No ratings yet
Table Partitioning in SQL Server
11 pages
Partitioning For Database Performance
No ratings yet
Partitioning For Database Performance
3 pages
Basics of Partitioning
100% (1)
Basics of Partitioning
2 pages
CDA C2 R 074 en File 68.en
No ratings yet
CDA C2 R 074 en File 68.en
3 pages
Dbms 2 Syllabus
No ratings yet
Dbms 2 Syllabus
15 pages
Cursors and Triggers
No ratings yet
Cursors and Triggers
5 pages
Microsoft SQL Server
No ratings yet
Microsoft SQL Server
111 pages
,,,old New Tables
No ratings yet
,,,old New Tables
2 pages
Relational Model Concepts
No ratings yet
Relational Model Concepts
5 pages
DDM Unit 2
No ratings yet
DDM Unit 2
23 pages
Sap HR Abap
No ratings yet
Sap HR Abap
140 pages
PB 1 IP Answer Key 2024
No ratings yet
PB 1 IP Answer Key 2024
6 pages
DBMS All Notes
No ratings yet
DBMS All Notes
58 pages
DBMS-Logical Database Design and The Relational Model
No ratings yet
DBMS-Logical Database Design and The Relational Model
52 pages
IP.21 Learning Path
No ratings yet
IP.21 Learning Path
1 page
OLAP
No ratings yet
OLAP
8 pages
Full End To End Information On S/4 Hana Database
No ratings yet
Full End To End Information On S/4 Hana Database
160 pages
(FREE PDF Sample) (Ebook) High Performance PostgreSQL For Rails (Beta) : Reliable, Scalable, Maintainable Database Applications by Andrew Atkinson ISBN 9798888650387, 8888650385 Ebooks
100% (1)
(FREE PDF Sample) (Ebook) High Performance PostgreSQL For Rails (Beta) : Reliable, Scalable, Maintainable Database Applications by Andrew Atkinson ISBN 9798888650387, 8888650385 Ebooks
76 pages
DBMS - Module 4
No ratings yet
DBMS - Module 4
25 pages
Index For Practical Record-12a
No ratings yet
Index For Practical Record-12a
3 pages
XMLType Datatype in Oracle9i
No ratings yet
XMLType Datatype in Oracle9i
52 pages
Some Useful SQL Commands
No ratings yet
Some Useful SQL Commands
9 pages
Chapter - Database Concept-2 Pu
No ratings yet
Chapter - Database Concept-2 Pu
8 pages
Unit-1 RDBMS
No ratings yet
Unit-1 RDBMS
24 pages
Itu 07301 2020-2021 Se 2021 PDF
0% (1)
Itu 07301 2020-2021 Se 2021 PDF
3 pages
DBS-C01-S02-B-03-Relational Databases
No ratings yet
DBS-C01-S02-B-03-Relational Databases
3 pages
SQL Query Assignement
No ratings yet
SQL Query Assignement
7 pages
Er HW
No ratings yet
Er HW
2 pages
Lab 11
No ratings yet
Lab 11
2 pages
Advanced Database Concepts
No ratings yet
Advanced Database Concepts
16 pages
4 Non-Procedural Access
No ratings yet
4 Non-Procedural Access
9 pages
What Is Oracle Database ?: 2. Explain Oracle Grid Architecture?
No ratings yet
What Is Oracle Database ?: 2. Explain Oracle Grid Architecture?
4 pages
Department of Information Sciences and Technologies
No ratings yet
Department of Information Sciences and Technologies
12 pages
ETL Test
No ratings yet
ETL Test
4 pages