0% found this document useful (0 votes)

190 views29 pages

A Deep Dive Into PostgreSQL Indexing

1) The document discusses PostgreSQL indexes, including how they work, why they are used, and how to create different types of indexes. 2) Indexes provide an entry point to locate table rows faster than a sequential scan. Common index types include B-tree and bitmap indexes. 3) Indexes can be created on single columns, expressions, and partially on a table where a condition is true. Index creation locks or doesn't lock the table depending on options used.

Uploaded by

Thomas Justin

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

190 views29 pages

A Deep Dive Into PostgreSQL Indexing

Uploaded by

Thomas Justin

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 29

Deep Dive Into PostgreSQL Indexes

Ibrar Ahmed
Senior Database Architect - Percona LLC
May 2019
Table Characteristics

• Rows / Tuples stored in a table

• Every table in PostgreSQL has physical disk file(s)

postgres=# CREATE TABLE foo(id int, name text);

postgres=# SELECT relfilenode FROM pg_class WHERE relname LIKE 'foo’;
relfilenode
-------------
16384

• The physical files on disk can be seen in the PostgreSQL $PGDATA directory.

$ ls -lrt $PGDATA/base/13680/16384
-rw------- 1 vagrant vagrant 0 Apr 29 11:48 $PGDATA/base/13680/16384

• Tuple stored in a table does not have any order

2
Selecting Data 1/2
• Select whole table, must be a sequential scan.

• Select table’s rows where id is 5432, it should not be a sequential scan.

EXPLAIN SELECT name FROM bar;

QUERY PLAN
----------------------------------------------------------------------------
Seq Scan on bar (cost=0.00..163693.05 rows=9999905 width=11

EXPLAIN SELECT name FROM bar WHERE id = 5432;

QUERY PLAN
----------------------------------------------------------------------------
Gather (cost=1000.00..116776.94 rows=1 width=11)
Workers Planned: 2
-> Parallel Seq Scan on bar (cost=0.00..115776.84 rows=1 width=11)
Filter: (id = 5432)

3
Selecting Data 2/2
CREATE TABLE foo(id INTEGER, name TEXT); Page 0/N Tuple - 1
Tuple - 2

INSERT INTO foo VALUES(1, 'Alex'); Tuple - 3

INSERT INTO foo VALUES(2, 'Bob');

Tuple - n

Tuple - 1
Page 1/N Tuple - 2
Tuple - 3
SELECT ctid, * FROM foo;
ctid | id | name
-------+----+------ Tuple - n H
Tuple - 1
Page 2/N E
(0,1) | 1 | Alex Tuple - 2
A
Tuple - 3
(0,2) | 2 | Bob P
(2 rows)
Tuple - n

• How to select the data from the HEAP?

• Need to scan each and every page and look for the Page N/N
Tuple - 1
Tuple - 2
tuple in the page Tuple - 3

Cost? Tuple - n

4
PostgreSQL Indexes
https://www.postgresql.org/docs/current/indexes.html

5
Why Index?
• Indexes are entry points for tables
• Index used to locate the tuples in the table
• The sole reason to have an index is performance
• Index is stored separately from the table’s main storage (PostgreSQL Heap)
• More storage required to store the index along with original table
postgres=# EXPLAIN SELECT name FROM bar WHERE id = 5432;
QUERY PLAN
----------------------------------------------------------------------------
Seq Scan on bar (cost=0.00..159235.00 rows=38216 width=32)
Filter: (id = 5432)

postgres=# CREATE INDEX bar_idx ON bar(id);

postgres=# EXPLAIN SELECT name FROM bar WHERE id = 5432;

QUERY PLAN
----------------------------------------------------------------------------
Bitmap Heap Scan on bar (cost=939.93..64313.02 rows=50000 width=32)
Recheck Cond: (id = 5432)
-> Bitmap Index Scan on bar_idx (cost=0.00..927.43 rows=50000 width=0)
Index Cond: (id = 5432)

6
Index

• PostgreSQL standard way to create a index

(https://www.postgresql.org/docs/current/sql-createindex.html)

postgres=# CREATE INDEX idx_btree ON bar(id);

postgres=# SELECT relfilenode FROM pg_class WHERE relname LIKE ‘idx_btree’;

relfilenode
-------------
16425

• PostgreSQL index has its own file on disk.

The physical file on disk can be seen in the PostgreSQL $PGDATA directory.

$ ls -lrt $PGDATA/13680/16425
-rw-------1 vagrant vagrant 1073741824 Apr 29 13:05 $PGDATA/base/13680/16425

7
Creating Index 1/2

• Index based on single column of the table

postgres=# CREATE INDEX bar_idx ON bar(id);

postgres=# EXPLAIN SELECT name FROM bar WHERE id = 5432;

QUERY PLAN
----------------------------------------------------------------------
Bitmap Heap Scan on bar (cost=939.93..64313.02 rows=50000 width=32)
Recheck Cond: (id = 5432)
-> Bitmap Index Scan on bar_idx (cost=0.00..927.43 rows=50000 width=0)
Index Cond: (id = 5432)

8
Creating Index 2/2
PostgreSQL locks the table when creating index

CREATE INDEX idx_btree ON bar USING BTREE(id);

CREATE INDEX
Time: 12303.172 ms (00:12.303)

CONCURRENTLY option creates the index without locking the table

CREATE INDEX CONCURRENTLY idx_btree ON bar USING BTREE(id);

CREATE INDEX
Time: 23025.372 ms (00:23.025)

9
Expression Index 1/2
EXPLAIN SELECT * FROM bar WHERE lower(name) LIKE 'Text1';
QUERY PLAN
-------------------------------------------------------------
Seq Scan on bar (cost=0.00..213694.00 rows=50000 width=40)
Filter: (lower((name)::text) ~~ 'Text1'::text)

CREATE INDEX idx_exp ON bar (lower(name));

EXPLAIN SELECT * FROM bar WHERE lower(name) LIKE 'Text1';

QUERY PLAN
-----------------------------------------------------------------------------
Bitmap Heap Scan on bar (cost=1159.93..64658.02 rows=50000 width=40)
Filter: (lower((name)::text) ~~ 'Text1'::text)
-> Bitmap Index Scan on idx_exp (cost=0.00..1147.43 rows=50000 width=0)
Index Cond: (lower((name)::text) = 'Text1'::text)

1
0
Expression Index 2/2
postgres=# EXPLAIN SELECT * FROM bar WHERE (dt + (INTERVAL '2 days')) < now();
QUERY PLAN
---------------------------------------------------------------
Seq Scan on bar (cost=0.00..238694.00 rows=3333333 width=40)
Filter: ((dt + '2 days'::interval) < now())

postgres=# CREATE INDEX idx_math_exp ON bar((dt + (INTERVAL '2 days')));

postgres=# EXPLAIN SELECT * FROM bar WHERE (dt + (INTERVAL '2 days')) < now();
QUERY PLAN
-------------------------------------------------------------------------------------
Bitmap Heap Scan on bar (cost=62449.77..184477.10 rows=3333333 width=40)
Recheck Cond: ((dt + '2 days'::interval) < now())
-> Bitmap Index Scan on idx_math_exp (cost=0.00..61616.43 rows=3333333 width=0)
Index Cond: ((dt + '2 days'::interval) < now())

1
1
Partial Index
Index Partial Index
CREATE INDEX idx_full ON bar(id); CREATE INDEX idx_part ON bar(id) where id < 10000;

EXPLAIN SELECT * FROM bar EXPLAIN SELECT * FROM bar

WHERE id < 1000 WHERE id < 1000

AND name LIKE 'text1000’; AND name LIKE 'text1000’;

QUERY PLAN QUERY PLAN

------------------------------------------------------------------------ -----------------------------------------------------------------------
--
Bitmap Heap Scan on bar (cost=199.44..113893.44 rows=16667 width=40)
Bitmap Heap Scan on bar (cost=61568.60..175262.59 rows=16667 width=40)
Recheck Cond: (id < 1000)
Recheck Cond: (id < 1000)
Filter:
Q: What will happen when we query ((name)::text
where id >1000? ~~ 'text1000'::text)
Filter: ((name)::text ~~ 'text1000'::text)
-> Bitmap Index Scan on idx_part (cost=0.00..195.28 rows=3333333
-> Bitmap Index Scan on idx_full (cost=0.00..61564.43 rows=3333333 width=0)
width=0)
A: Answer is simple, this index won’t selected.
Index Cond: (id < 1000)
Index Cond: (id < 1000)

SELECT pg_size_pretty(pg_total_relation_size('idx_part'));
SELECT pg_size_pretty(pg_total_relation_size('idx_full'));
pg_size_pretty
pg_size_pretty
----------------
----------------
240 kB
214 MB
(1 row)
(1 row)

12
Index Types
https://www.postgresql.org/docs/current/indexes-types.html

13
B-Tree Index 1/2
• What is a B-Tree index? Wikipedia: (https://en.wikipedia.org/wiki/Self-
• Supported Operators balancing_binary_search_tree)
• Less than < In computer science, a self-balancing (or height-balanced) binary search tree
• Less than equal to <=
• Equal = is any node-based binary search tree that automatically keeps its height
• Greater than equal to >= small in the face of arbitrary item insertions and deletions.
• Greater than >

CREATE INDEX idx_btree ON foo USING BTREE (name);

postgres=# EXPLAIN ANALYZE SELECT * FROM foo WHERE name = 'text%';

QUERY PLAN
----------------------------------------------------------------------------------------------------------------
Index Scan using idx_btree on foo (cost=0.43..8.45 rows=1 width=19) (actual time=0.015..0.015 rows=0 loops=1)
Index Cond: ((name)::text = 'text%'::text)
Planning Time: 0.105 ms
Execution Time: 0.031 ms
(4 rows)

14
B-Tree Index 2/2
CREATE TABLE foo(id INTEGER, name TEXT); Page 0/N Tuple - 1
Tuple - 2
INSERT INTO foo VALUES(1, 'Alex'); Tuple - 3

INSERT INTO foo VALUES(2, 'Bob');

-------+------
(0,1) | Alex Tuple - 1
(0,2) | Bob Page N/N Tuple - 2

(2,2) | Alex Tuple - 3

Tuple - n

EXPLAIN ANALYZE SELECT * FROM bar WHERE name = 'text%';

QUERY PLAN
Index Scan using idx_hash on bar (cost=0.43..8.45 rows=1 width=19) (actual time=0.023..0.023
rows=0 loops=1)
Index Cond: ((name)::text = 'text%'::text)
Planning Time: 0.080 ms
Execution Time: 0.041 ms
(4 rows)

16
BRIN Index 1/2

• BRIN is a “Block Range Index”

• Used when columns have some correlation with their physical location in the table
• Space optimized because BRIN index contains only three items
• Page number
• Min value of column
• Max value of column

CREATE INDEX idx_btree ON bar USING BTREE (date);

CREATE INDEX idx_hash ON bar USING HASH (date);

CREATE INDEX idx_brin ON bar USING BRIN (date);

17
BRIN Index 2/2
Sequential Scan BRIN Index

postgres=# EXPLAIN ANALYZE SELECT * postgres=# EXPLAIN ANALYZE SELECT *

FROM bar FROM bar
WHERE dt > '2022-09-28’ WHERE dt > '2022-09-28’
AND dt < '2022-10-28'; AND dt < '2022-10-28';
QUERY PLAN QUERY PLAN
----------------------------------------------------- ------------------------------------------------------
Seq Scan on bar (cost=0.00..2235285.00 rows=1 Bitmap Heap Scan on bar (cost=92.03..61271.08 rows=1
width=27) width=27) (actual time=1.720..4.186 rows=29 loops=1)
(actual time=0.139..7397.090 rows=29 Recheck Cond: ((dt > '2022-09-28 00:00:00’)
loops=1) AND (dt < '2022-10-28 00:00:00'))
Filter: ((dt > '2022-09-28 00:00:00) Rows Removed by Index Recheck: 18716
AND (dt < '2022-10-28 00:00:00)) Heap Blocks: lossy=128
Rows Removed by Filter: 99999971 -> Bitmap Index Scan on idx_brin
Planning Time: 0.114 ms (cost=0.00..92.03 rows=17406 width=0)
Execution Time: 7397.107 ms (actual time=1.456..1.456 rows=1280 loops=1)
(5 rows) Index Cond: ((dt > '2022-09-28 00:00:00’)
AND (dt < '2022-10-28 00:00:00'))
Planning Time: 0.130 ms
Execution Time: 4.233 ms
(8 rows)

18
GIN Index 1/2
• Generalized Inverted Index
• GIN is to handle where we need to index composite values
• Slow while creating the index because it needs to scan the document up front

postgres=# SELECT DISTINCT name, dt FROM bar LIMIT 5;

name | dt
---------------------------------------------------------------------------+------------
{"name": "Alex", "phone": ["333-333-333", "222-222-222", "111-111-111"]} | 2019-05-13
{"name": "Bob", "phone": ["333-333-444", "222-222-444", "111-111-444"]} | 2019-05-14
{"name": "John", "phone": ["333-3333", "777-7777", "555-5555"]} | 2019-05-15
{"name": "David", "phone": ["333-333-555", "222-222-555", "111-111-555"]} | 2019-05-16
(4 rows)

19
GIN Index 2/2
• Generalized Inverted Index
• GIN is to handle where we need to index composite values
• Slow while creating index because it needs to scan the document up front
CREATE INDEX idx_gin ON bar USING GIN (name);

postgres=# EXPLAIN ANALYZE SELECT * FROM bar postgres=# EXPLAIN ANALYZE SELECT * FROM bar
WHERE name @> '{"name": "Alex"}’; WHERE name @> '{"name": "Alex"}';
QUERY PLAN QUERY PLAN
----------------------------------------------------- -----------------------------------------------------
Seq Scan on bar (cost=0.00..108309.34 rows=3499 Bitmap Heap Scan on bar (cost=679.00..13395.57
width=96) (actual time=396.019..1050.143 rows=1000000 rows=4000 width=96) (actual time=91.110..445.112
loops=1) rows=1000000
Even if you create a BTREE index, it won’t be loops=1)
considered.
Filter: (name @> '{"name": Because
"Alex"}'::jsonb)
it does not know the individual element in (name
Recheck Cond: value. @> '{"name": "Alex"}'::jsonb)
Rows Removed by Filter: 3000000 Heap Blocks: exact=16394
Planning Time: 0.107 ms -> Bitmap Index Scan on
Execution Time: 1079.861 ms idx_gin (cost=0.00..678.00 rows=4000 width=0)
(actual time=89.033..89.033 rows=1000000 loops=1)
Index Cond: (name @> '{"name":
"Alex"}'::jsonb)
Planning Time: 0.168 ms
Execution Time: 475.447 ms
20
GiST Index

• Generalized Search Tree

• A GiST index is lossy

• Tree-structured access method

21
Where and What?

• B-Tree: Use this index for most of the queries and different data types

• Hash: Used for equality operators

• BRIN: For really large sequentially lineup datasets

• GIN: Used for documents and arrays

• GiST: Used for full text search

22
Index Only Scans

• Index is stored separately from the table’s main storage (PostgreSQL Heap)

• Query needs to scan both the index and the heap

• Index Only Scans only used when all the columns in the query part of the index

• In this case PostgreSQL fetches data from index only

23
Index Only Scans
CREATE INDEX idx_btree_ios ON bar (id,name);

EXPLAIN SELECT id, name, dt FROM bar WHERE id > 100000 AND id <100010;
QUERY PLAN
Index Scan using idx_btree_ios on bar (cost=0.56..99.20 rows=25 width=19)
Index Cond: ((id > 100000) AND (id < 100010))
(2 rows)

EXPLAIN SELECT id, name FROM bar WHERE id > 100000 AND id <100010;
QUERY PLAN
Index Only Scan using idx_btree_ios on bar (cost=0.56..99.20 rows=25 width=15)
Index Cond: ((id > 100000) AND (id < 100010))
(2 rows)

24
Duplicate Indexes
SELECT indrelid::regclass relname,
indexrelid::regclass indexname, indkey
FROM pg_index
GROUP BY relname,indexname,indkey;
relname | indexname | indkey
--------------------------+-----------------------------------------------+---------
pg_index | pg_index_indexrelid_index | 1
pg_toast.pg_toast_2615 | pg_toast.pg_toast_2615_index | 1 2
pg_constraint | pg_constraint_conparentid_index | 11

SELECT indrelid::regclass relname, indkey, amname

FROM pg_index i, pg_opclass o, pg_am a
WHERE o.oid = ALL (indclass)
AND a.oid = o.opcmethod
GROUP BY relname, indclass, amname, indkey
HAVING count(*) > 1;
relname | indkey | amname
---------+--------+--------
bar | 2 | btree
(1 row)

25
Unused Indexes
SELECT relname, indexrelname, idx_scan
FROM pg_catalog.pg_stat_user_indexes;

relname | indexrelname | idx_scan

---------+---------------+----------
foo | idx_foo_date | 0
bar | idx_btree | 0
bar | idx_btree_id | 0
bar | idx_btree_name| 6
bar | idx_brin_brin | 4
(7 rows)

26
?
“Poor leaders rarely ask questions of
themselves or others. Good leaders, on
the other hand, ask many questions.
Great leaders ask the great questions.”

Michael Marquardt author of

Leading with Questions

27
Thank You to Our Sponsors
Rate My Session

PostgreSQL Internals Notes Compilation
100% (1)
PostgreSQL Internals Notes Compilation
18 pages
Q-Ans All Competitive Exam Guide Ebook by Education For Assam (BIJAY KOCH)
No ratings yet
Q-Ans All Competitive Exam Guide Ebook by Education For Assam (BIJAY KOCH)
49 pages
Cook P. Fundamentals of HTML, SVG, CSS and JavaScript For Data Visual. 2022
No ratings yet
Cook P. Fundamentals of HTML, SVG, CSS and JavaScript For Data Visual. 2022
87 pages
EDB Postgres Advanced Server Guide v11
No ratings yet
EDB Postgres Advanced Server Guide v11
329 pages
Patroni
100% (1)
Patroni
137 pages
Postgres Topic
No ratings yet
Postgres Topic
116 pages
Introduction To PL PGSQL Development
No ratings yet
Introduction To PL PGSQL Development
145 pages
Equnix PostgreSQL Query Tuning
100% (2)
Equnix PostgreSQL Query Tuning
45 pages
MongoDB University - PreExamDBA
No ratings yet
MongoDB University - PreExamDBA
64 pages
Mastering Postgresql Administration: B M, E DB April, 2009
No ratings yet
Mastering Postgresql Administration: B M, E DB April, 2009
111 pages
Talend Open Studio For Data Integration: User Guide
No ratings yet
Talend Open Studio For Data Integration: User Guide
452 pages
EDB High Availability Scalability v1.0
No ratings yet
EDB High Availability Scalability v1.0
23 pages
Inside PostgreSQL Shared Memory
100% (3)
Inside PostgreSQL Shared Memory
25 pages
205 Oracle To Postgres Migration
100% (2)
205 Oracle To Postgres Migration
58 pages
PostgreSQL Backups The Modern Way
No ratings yet
PostgreSQL Backups The Modern Way
50 pages
EnterpriseDB PostgreSQL Exercises
No ratings yet
EnterpriseDB PostgreSQL Exercises
29 pages
The Magic of Tuning in PostgreSQL
No ratings yet
The Magic of Tuning in PostgreSQL
15 pages
SS1123 - D2T - Apache Cassandra Overview PDF
100% (1)
SS1123 - D2T - Apache Cassandra Overview PDF
45 pages
Postgresql Management and Automation With Clustercontrol
50% (2)
Postgresql Management and Automation With Clustercontrol
42 pages
Os Practical
No ratings yet
Os Practical
23 pages
Icom IC-T90A Instruction Manual
100% (1)
Icom IC-T90A Instruction Manual
100 pages
Koe088 Natural Language Processing 2023 24
No ratings yet
Koe088 Natural Language Processing 2023 24
2 pages
Photography Proposal Example
No ratings yet
Photography Proposal Example
7 pages
Cassandra - An Introduction
100% (1)
Cassandra - An Introduction
35 pages
Pganalyze - Best Practices For Optimizing Postgres Query Performance
100% (1)
Pganalyze - Best Practices For Optimizing Postgres Query Performance
26 pages
Introduction To Cassandra
No ratings yet
Introduction To Cassandra
37 pages
Backup Strategies
100% (1)
Backup Strategies
44 pages
CB116-Lab-Workbook (6.x)
No ratings yet
CB116-Lab-Workbook (6.x)
28 pages
HTML CSS Bootstrap
No ratings yet
HTML CSS Bootstrap
80 pages
F&G Devices Inspection and Test Plan
No ratings yet
F&G Devices Inspection and Test Plan
3 pages
Everything You Need To Know About PostgreSQL EXPLAIN
No ratings yet
Everything You Need To Know About PostgreSQL EXPLAIN
44 pages
Mongodb Vs Couchbase Architecture WP PDF
No ratings yet
Mongodb Vs Couchbase Architecture WP PDF
45 pages
pgpool-II - Streaming Replication
No ratings yet
pgpool-II - Streaming Replication
41 pages
Regular-Lab-Book-Upto 3 RD Experiment
No ratings yet
Regular-Lab-Book-Upto 3 RD Experiment
32 pages
MongoDB Performance Best Practices
No ratings yet
MongoDB Performance Best Practices
15 pages
Oracle Migration 2 Post GR Esq L
No ratings yet
Oracle Migration 2 Post GR Esq L
13 pages
Daa Unit-4
No ratings yet
Daa Unit-4
31 pages
Netezza Questions
100% (1)
Netezza Questions
22 pages
Cassandra DBA
No ratings yet
Cassandra DBA
5 pages
Percona XtraDB Cluster 5.7
No ratings yet
Percona XtraDB Cluster 5.7
92 pages
Overview of Photonic Layer Functional Elements V4go
No ratings yet
Overview of Photonic Layer Functional Elements V4go
142 pages
Santu CV Job Final (07!01!25)
No ratings yet
Santu CV Job Final (07!01!25)
10 pages
Hotkeys Meshmixer
No ratings yet
Hotkeys Meshmixer
5 pages
Mongo DB Exercise
No ratings yet
Mongo DB Exercise
45 pages
Viewsonic-Manuals N3235w-1M SM 1a
No ratings yet
Viewsonic-Manuals N3235w-1M SM 1a
100 pages
EDB Event 22062017 EDB Vs Oracle
No ratings yet
EDB Event 22062017 EDB Vs Oracle
63 pages
Postgrre
No ratings yet
Postgrre
14 pages
Unidad de Corte 5510
No ratings yet
Unidad de Corte 5510
20 pages
Administration PGSQL
No ratings yet
Administration PGSQL
109 pages
Pgpool-II 1st Steps
No ratings yet
Pgpool-II 1st Steps
12 pages
Apache Cassandra
No ratings yet
Apache Cassandra
7 pages
Tuning Your PostgreSQL Server
No ratings yet
Tuning Your PostgreSQL Server
12 pages
PostgreSQL Database Administration
No ratings yet
PostgreSQL Database Administration
1 page
Pgpool-II For Beginners
No ratings yet
Pgpool-II For Beginners
12 pages
MySQL Performance Tuning - MySQL 8 Query Performance Tuning - A Systematic Method For Improving Execution Speeds
No ratings yet
MySQL Performance Tuning - MySQL 8 Query Performance Tuning - A Systematic Method For Improving Execution Speeds
3 pages
MySQL Perf Tuning Best Practices
No ratings yet
MySQL Perf Tuning Best Practices
30 pages
MS-Word Assignment
No ratings yet
MS-Word Assignment
13 pages
VSS ppt.1-2
No ratings yet
VSS ppt.1-2
13 pages
Tuning Linux For MongoDB
No ratings yet
Tuning Linux For MongoDB
26 pages
Crunchy Postgresql High-Availability Suite Keeps Critical Applications Running
No ratings yet
Crunchy Postgresql High-Availability Suite Keeps Critical Applications Running
2 pages
Cyber Security Notes
No ratings yet
Cyber Security Notes
15 pages
PostgreSQL Functions by Example
No ratings yet
PostgreSQL Functions by Example
41 pages
I PPR Extracted
No ratings yet
I PPR Extracted
6 pages
A Performance Comparison of SQL and NoSQL Databases
No ratings yet
A Performance Comparison of SQL and NoSQL Databases
5 pages
Appendix 1: Apmoption Apm3Rdpar Csotpm
No ratings yet
Appendix 1: Apmoption Apm3Rdpar Csotpm
8 pages
Trellix Insights: Key Benefits
No ratings yet
Trellix Insights: Key Benefits
8 pages
Exam C1000 - 085 IBM Netezza Performance Server V11.x Administrator
No ratings yet
Exam C1000 - 085 IBM Netezza Performance Server V11.x Administrator
3 pages
SAP BASIS Transaction Codes User Administration
No ratings yet
SAP BASIS Transaction Codes User Administration
3 pages
PGSQL SQLs
No ratings yet
PGSQL SQLs
9 pages
Lel Tender Specs
No ratings yet
Lel Tender Specs
6 pages
GMP 11 Good Measurement Practice For Assignment and Adjustment of Calibration Intervals For Laboratory Standards
No ratings yet
GMP 11 Good Measurement Practice For Assignment and Adjustment of Calibration Intervals For Laboratory Standards
10 pages
PNZ Series
No ratings yet
PNZ Series
2 pages
Ite 1 Reviewer
No ratings yet
Ite 1 Reviewer
4 pages
Mysql Dba Qa
No ratings yet
Mysql Dba Qa
4 pages
Cassandra Installation Review
No ratings yet
Cassandra Installation Review
6 pages
PGSQL CheatSheet Mysql2psql
No ratings yet
PGSQL CheatSheet Mysql2psql
7 pages
Clapingo Android Internship Assignment
No ratings yet
Clapingo Android Internship Assignment
5 pages
Roach 1
No ratings yet
Roach 1
2 pages
Benzara MBA 2024 MAIT
No ratings yet
Benzara MBA 2024 MAIT
3 pages
Social Entrepreneurship: Assignment 1: Social Enterprise and Entrepreneur Desicrew Solutions and Saloni Malhotra
No ratings yet
Social Entrepreneurship: Assignment 1: Social Enterprise and Entrepreneur Desicrew Solutions and Saloni Malhotra
3 pages
Resume-Pruthiraj Swain LinkedIn PDF
No ratings yet
Resume-Pruthiraj Swain LinkedIn PDF
3 pages
PostgreSQL Administration TOC
No ratings yet
PostgreSQL Administration TOC
4 pages
IBM InfoSphere Replication Server and Data Event Publisher
From Everand
IBM InfoSphere Replication Server and Data Event Publisher
Pav Kumar-Chatterjee
No ratings yet
Microsoft AZURE® AZ-104 Administrator Practice Tests
From Everand
Microsoft AZURE® AZ-104 Administrator Practice Tests
iCertify Training
No ratings yet
DRBD-Cookbook: How to create your own cluster solution, without SAN or NAS!
From Everand
DRBD-Cookbook: How to create your own cluster solution, without SAN or NAS!
Joerg Christian Seubert
No ratings yet
Instant Pentaho Data Integration Kitchen
From Everand
Instant Pentaho Data Integration Kitchen
Sergio Ramazzina
No ratings yet
Oracle Exadata Complete Self-Assessment Guide
From Everand
Oracle Exadata Complete Self-Assessment Guide
Gerardus Blokdyk
No ratings yet
Kubernetes A Complete Guide
From Everand
Kubernetes A Complete Guide
Gerardus Blokdyk
No ratings yet
SnapLogic Second Edition
From Everand
SnapLogic Second Edition
Gerardus Blokdyk
No ratings yet
Oracle Data Guard A Clear and Concise Reference
From Everand
Oracle Data Guard A Clear and Concise Reference
Gerardus Blokdyk
No ratings yet

A Deep Dive Into PostgreSQL Indexing

Uploaded by

A Deep Dive Into PostgreSQL Indexing

Uploaded by

Deep Dive Into PostgreSQL Indexes

• Rows / Tuples stored in a table

postgres=# CREATE TABLE foo(id int, name text);

• Tuple stored in a table does not have any order

• Select table’s rows where id is 5432, it should not be a sequential scan.

EXPLAIN SELECT name FROM bar;

EXPLAIN SELECT name FROM bar WHERE id = 5432;

INSERT INTO foo VALUES(1, 'Alex'); Tuple - 3

INSERT INTO foo VALUES(2, 'Bob');

• How to select the data from the HEAP?

postgres=# CREATE INDEX bar_idx ON bar(id);

postgres=# EXPLAIN SELECT name FROM bar WHERE id = 5432;

• PostgreSQL standard way to create a index

postgres=# CREATE INDEX idx_btree ON bar(id);

postgres=# SELECT relfilenode FROM pg_class WHERE relname LIKE ‘idx_btree’;

• PostgreSQL index has its own file on disk.

• Index based on single column of the table

postgres=# CREATE INDEX bar_idx ON bar(id);

postgres=# EXPLAIN SELECT name FROM bar WHERE id = 5432;

CREATE INDEX idx_btree ON bar USING BTREE(id);

CONCURRENTLY option creates the index without locking the table

CREATE INDEX CONCURRENTLY idx_btree ON bar USING BTREE(id);

CREATE INDEX idx_exp ON bar (lower(name));

EXPLAIN SELECT * FROM bar WHERE lower(name) LIKE 'Text1';

postgres=# CREATE INDEX idx_math_exp ON bar((dt + (INTERVAL '2 days')));

EXPLAIN SELECT * FROM bar EXPLAIN SELECT * FROM bar

WHERE id < 1000 WHERE id < 1000

AND name LIKE 'text1000’; AND name LIKE 'text1000’;

QUERY PLAN QUERY PLAN

CREATE INDEX idx_btree ON foo USING BTREE (name);

postgres=# EXPLAIN ANALYZE SELECT * FROM foo WHERE name = 'text%';

INSERT INTO foo VALUES(2, 'Bob');

(2,2) | Alex Tuple - 3

EXPLAIN ANALYZE SELECT * FROM bar WHERE name = 'text%';

• BRIN is a “Block Range Index”

CREATE INDEX idx_btree ON bar USING BTREE (date);

CREATE INDEX idx_hash ON bar USING HASH (date);

CREATE INDEX idx_brin ON bar USING BRIN (date);

postgres=# EXPLAIN ANALYZE SELECT * postgres=# EXPLAIN ANALYZE SELECT *

postgres=# SELECT DISTINCT name, dt FROM bar LIMIT 5;

• Generalized Search Tree

• A GiST index is lossy

• Tree-structured access method

• Hash: Used for equality operators

• BRIN: For really large sequentially lineup datasets

• GIN: Used for documents and arrays

• GiST: Used for full text search

• Query needs to scan both the index and the heap

• In this case PostgreSQL fetches data from index only

SELECT indrelid::regclass relname, indkey, amname

relname | indexrelname | idx_scan

Michael Marquardt author of

You might also like