0% found this document useful (0 votes)

29 views3 pages

Cache Hit Ratio

The document discusses how to check database performance using PostgreSQL. It recommends checking the cache hit rate, which should be around 99%, and examining index usage. Large tables without indexes being used for queries may indicate performance issues. The example shows checking an events table, finding it had no indexes used, adding an index concurrently, and seeing a query using that table become much faster as a result.

Uploaded by

gurukarthick_dba

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

29 views3 pages

Cache Hit Ratio

Uploaded by

gurukarthick_dba

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

For many application developers their database is a black box.

Data goes in, comes back out and in

between there developers hope its a pretty short time span. Without becoming a DBA there’s a few pieces
of data that most application developers can easily grok which will help them understand if their database is
performing adequately. This post will provide some quick tips that allow you to determine whether your
database performance is slowing down your app, and if so what you can do about it.

Understanding your Cache and its Hit Rate

The typical rule for most applications is that only a fraction of its data is regularly accessed. As with many
other things data can tend to follow the 80/20 rule with 20% of your data accounting for 80% of the reads
and often times its higher than this. Postgres itself actually tracks access patterns of your data and will on
its own keep frequently accessed data in cache. Generally you want your database to have a cache hit rate
of about 99%. You can find your cache hit rate with:

SELECT
sum(heap_blks_read) as heap_read,
sum(heap_blks_hit) as heap_hit,
sum(heap_blks_hit) / (sum(heap_blks_hit) + sum(heap_blks_read)) as ratio
FROM
pg_statio_user_tables;

We can see in this dataclip that the cache rate for Heroku Postgres is 99.99%. If you find yourself with a
ratio significantly lower than 99% then you likely want to consider increasing the cache available to your
database, you can do this on Heroku Postgres by performing a fast database changeover or on something
like EC2 by performing a dump/restore to a larger instance size.

Understanding Index Usage

The other primary piece for improving performance is indexes. Several frameworks will add indexes on
your primary keys, though if you’re searching on other fields or joining heavily you may need to manually
add such indexes.

Indexes are most valuable across large tables as well. While accessing data from cache is faster than disk,
even data within memory can be slow if Postgres must parse through hundreds of thousands of rows to
identify if they meet a certain condition. To generate a list of your tables in your database with the largest
ones first and the percentage of time which they use an index you can run:

SELECT
relname,
100 * idx_scan / (seq_scan + idx_scan) percent_of_times_index_used,
n_live_tup rows_in_table
FROM
pg_stat_user_tables
WHERE
seq_scan + idx_scan > 0
ORDER BY
n_live_tup DESC;

While there is no perfect answer, if you’re not somewhere around 99% on any table over 10,000 rows you
may want to consider adding an index. When examining where to add an index you should look at what
kind of queries you’re running. Generally you’ll want to add indexes where you’re looking up by some other
id or on values that you’re commonly filtering on such as created_at fields.

Pro tip: If you’re adding an index on a production database use CREATE INDEX CONCURRENTLY to have it
build your index in the background and not hold a lock on your table. The limitation to creating
indexesconcurrently is they can typically take 2-3 times longer to create and can’t be run within a
transaction. Though for any large production site these trade-offs are worth the trade-off in experience to
your end users.

Heroku Dashboard as an Example

Looking at a real world example of the recently launched Heroku dashboard, we can run this query and see
our results:

# SELECT relname, 100 * idx_scan / (seq_scan + idx_scan) percent_of_times_index_used, n_live_tup rows_in_table

FROM pg_stat_user_tables ORDER BY n_live_tup DESC;
relname | percent_of_times_index_used | rows_in_table
---------------------+-----------------------------+---------------
events | 0| 669917
app_infos_user_info | 0| 198218
app_infos | 50 | 175640
user_info | 3| 46718
rollouts | 0| 34078
favorites | 0| 3059
schema_migrations | 0| 2
authorizations | 0| 0
delayed_jobs | 23 | 0

From this we can wee the events table which has around 700,000 rows has no indexes that have been
used. From here you could investigate within my application and see some of the common queries that are
used, one example is pulling the events for this blog post which you are reaching. You can see
yourexecution plan by running an EXPLAIN ANALYZE which gives you can get a better idea of the
performance of a specific query:
EXPLAIN ANALYZE SELECT * FROM events WHERE app_info_id = 7559; QUERY
PLAN
-------------------------------------------------------------------
Seq Scan on events (cost=0.00..63749.03 rows=38 width=688) (actual time=2.538..660.785 rows=89 loops=1)
Filter: (app_info_id = 7559)
Total runtime: 660.885 ms

Given there’s a sequential scan across all that data this is an area we can optimize with an index. We can
add our index concurrently to prevent locking on that table and then see how performance is:
CREATE INDEX CONCURRENTLY idx_events_app_info_id ON events(app_info_id);
EXPLAIN ANALYZE SELECT * FROM events WHERE app_info_id = 7559;

----------------------------------------------------------------------
Index Scan using idx_events_app_info_id on events (cost=0.00..23.40 rows=38 width=688) (actual time=0.021..0.115
rows=89 loops=1)
Index Cond: (app_info_id = 7559)
Total runtime: 0.200 ms

While we can see the obvious improvement in this single query we can examine the results in New
Relic and see that we’ve significantly reduced our time spent in the database with the addition of this and a
few other indexes:

NewRelicGraph

Index Cache Hit Rate

Finally to combine the two if you’re interested in how many of your indexes are within
your cache you can run:

SELECT
sum(idx_blks_read) as idx_read,
sum(idx_blks_hit) as idx_hit,
(sum(idx_blks_hit) - sum(idx_blks_read)) / sum(idx_blks_hit) as ratio
FROM
pg_statio_user_indexes;

Generally, you should also expect this to be in the 99% similar to your regular cache
hit rate.

The DynamoDB Book
100% (1)
The DynamoDB Book
448 pages
Designing Data Intensive Applications
25% (4)
Designing Data Intensive Applications
61 pages
Write Your Own Adventure Programs B
100% (1)
Write Your Own Adventure Programs B
52 pages
Odoo Performance
100% (4)
Odoo Performance
44 pages
Database Caching Strategies Using Redis
No ratings yet
Database Caching Strategies Using Redis
22 pages
Interpret Statspack Report
No ratings yet
Interpret Statspack Report
9 pages
Zafin Learn Session - PostgreSQL Performance For Application Developers
No ratings yet
Zafin Learn Session - PostgreSQL Performance For Application Developers
58 pages
Sqlfordevscom Next Level Database Techniques For Developers 37 40
No ratings yet
Sqlfordevscom Next Level Database Techniques For Developers 37 40
4 pages
Performance Tuning PostgreSQL
No ratings yet
Performance Tuning PostgreSQL
25 pages
TopDev - High Performance and Scalability Database Design - V2.1
No ratings yet
TopDev - High Performance and Scalability Database Design - V2.1
52 pages
Query Optimization
No ratings yet
Query Optimization
9 pages
Postgresql Query Optimization: Step by Step Techniques
No ratings yet
Postgresql Query Optimization: Step by Step Techniques
50 pages
Lab 3
No ratings yet
Lab 3
3 pages
Database Performance Optimization. Andrey Avtomonov
100% (1)
Database Performance Optimization. Andrey Avtomonov
26 pages
Why Postgresql For Analytics Infrastructure (DW) ?: Huy Nguyen Cto, Cofounder - Holistics - Io
No ratings yet
Why Postgresql For Analytics Infrastructure (DW) ?: Huy Nguyen Cto, Cofounder - Holistics - Io
50 pages
Oracle E-Biz Performance
No ratings yet
Oracle E-Biz Performance
32 pages
Oracle SQL High Performance Tuning: Guy Harrison Director, R&D Melbourne
100% (1)
Oracle SQL High Performance Tuning: Guy Harrison Director, R&D Melbourne
56 pages
Enkitec RealWorldExadata
No ratings yet
Enkitec RealWorldExadata
38 pages
Accidentaldbalinuxcon 130102190320 Phpapp02
No ratings yet
Accidentaldbalinuxcon 130102190320 Phpapp02
61 pages
Database - Design
No ratings yet
Database - Design
9 pages
PostgreSQL Advanced CheatSheet 1731972672
No ratings yet
PostgreSQL Advanced CheatSheet 1731972672
10 pages
Logical IO Vs Physical IO Vs Consistent Gets
No ratings yet
Logical IO Vs Physical IO Vs Consistent Gets
11 pages
PgConf 2016 EU
No ratings yet
PgConf 2016 EU
25 pages
Cassandra Data Modeling Best Practices
No ratings yet
Cassandra Data Modeling Best Practices
57 pages
Equnix PostgreSQL Query Tuning
100% (2)
Equnix PostgreSQL Query Tuning
45 pages
PostgreSQL - Performance Analysis & Tuning
No ratings yet
PostgreSQL - Performance Analysis & Tuning
3 pages
Overview - Explain - Measuring Performance - Disk Architectures - Indexes - Join Algorithms (CTD.)
No ratings yet
Overview - Explain - Measuring Performance - Disk Architectures - Indexes - Join Algorithms (CTD.)
69 pages
Pganalyze Best Practices For Optimizing Postgres Query Performance
No ratings yet
Pganalyze Best Practices For Optimizing Postgres Query Performance
26 pages
Top 10, No - Make That 11, Things About Oracle Database 11g Release 1
No ratings yet
Top 10, No - Make That 11, Things About Oracle Database 11g Release 1
81 pages
Pganalyze Effective Indexing in Postgres
No ratings yet
Pganalyze Effective Indexing in Postgres
29 pages
SQL Tuning
No ratings yet
SQL Tuning
27 pages
Finding Database Bottlenecks
No ratings yet
Finding Database Bottlenecks
4 pages
Oracle Database Performance Tuning: Presented By-Rahul Gaikwad
No ratings yet
Oracle Database Performance Tuning: Presented By-Rahul Gaikwad
42 pages
Partitioning With Oracle 11G: Bert Scalzo, Domain Expert, Oracle Solutions
No ratings yet
Partitioning With Oracle 11G: Bert Scalzo, Domain Expert, Oracle Solutions
45 pages
DBMS Complete Presentation Detailed
No ratings yet
DBMS Complete Presentation Detailed
13 pages
Query Optimization
No ratings yet
Query Optimization
17 pages
Scaling Twitter 12758
No ratings yet
Scaling Twitter 12758
56 pages
Deep Dive Into Postgresql Statistics Pgconf Us 2016 160413073045
No ratings yet
Deep Dive Into Postgresql Statistics Pgconf Us 2016 160413073045
54 pages
Awrrpt 1 87965 87969
No ratings yet
Awrrpt 1 87965 87969
328 pages
Checklist
No ratings yet
Checklist
2 pages
Dynamo DB Insights
No ratings yet
Dynamo DB Insights
17 pages
Postgresql Interview Questions - Postgresql Intereview Questions With Answers
No ratings yet
Postgresql Interview Questions - Postgresql Intereview Questions With Answers
10 pages
Pganalyze - Best Practices For Optimizing Postgres Query Performance
100% (1)
Pganalyze - Best Practices For Optimizing Postgres Query Performance
26 pages
Sqlfordevscom Next Level Database Techniques For Developers Pages 5 10
No ratings yet
Sqlfordevscom Next Level Database Techniques For Developers Pages 5 10
6 pages
Sqlfordevscom Next Level Database Techniques For Developers PDF
No ratings yet
Sqlfordevscom Next Level Database Techniques For Developers PDF
50 pages
Index
No ratings yet
Index
23 pages
Database Performance
No ratings yet
Database Performance
3 pages
Application Tuning
No ratings yet
Application Tuning
11 pages
Reading Statspack Report
100% (1)
Reading Statspack Report
24 pages
Pandoc
No ratings yet
Pandoc
16 pages
Quick Oracle 9i Performance Tuning Tips & Scripts
No ratings yet
Quick Oracle 9i Performance Tuning Tips & Scripts
7 pages
SQL and PostgreSQL The Complete Developer's Guide
No ratings yet
SQL and PostgreSQL The Complete Developer's Guide
5 pages
Performance Tuning The Mysql Server: Ligaya Turmelle Mysql Support Engineer
No ratings yet
Performance Tuning The Mysql Server: Ligaya Turmelle Mysql Support Engineer
34 pages
2014-Db-Franck Pachot-Interpreting Awr Reports Straight To The Goal-Manuskript
No ratings yet
2014-Db-Franck Pachot-Interpreting Awr Reports Straight To The Goal-Manuskript
11 pages
Tuning With AWR
100% (1)
Tuning With AWR
29 pages
Table of Contents
No ratings yet
Table of Contents
7 pages
Join-Fu: The Art of SQL - ZendCon 2008
100% (2)
Join-Fu: The Art of SQL - ZendCon 2008
48 pages
Show My Homework Spelling Test
100% (1)
Show My Homework Spelling Test
8 pages
Non Disclosure Agreement - Web-Portal
100% (1)
Non Disclosure Agreement - Web-Portal
3 pages
Pci Micro Project On Election System DDNHR
No ratings yet
Pci Micro Project On Election System DDNHR
18 pages
How To Install Let's Encrypt On Windows Server 2019
No ratings yet
How To Install Let's Encrypt On Windows Server 2019
19 pages
ICT Special Curri CG Q1 Intro To ICT MELCS 1
No ratings yet
ICT Special Curri CG Q1 Intro To ICT MELCS 1
2 pages
Djhendry CV 11-15
No ratings yet
Djhendry CV 11-15
2 pages
Script Freebtc
64% (14)
Script Freebtc
2 pages
Beta Bank Case Organization
No ratings yet
Beta Bank Case Organization
11 pages
Part B Unit 3 CH 4,5
No ratings yet
Part B Unit 3 CH 4,5
2 pages
Actualpdf: Unlimited Lifetime Access To 5000+ Certification Actual Exams PDF
No ratings yet
Actualpdf: Unlimited Lifetime Access To 5000+ Certification Actual Exams PDF
28 pages
Usability Engineering IRCTC UI
No ratings yet
Usability Engineering IRCTC UI
12 pages
SG 4 VPN
No ratings yet
SG 4 VPN
2 pages
Presenting DeepSeek-Coder
No ratings yet
Presenting DeepSeek-Coder
2 pages
SMP Gateway and SEL Relays
No ratings yet
SMP Gateway and SEL Relays
6 pages
SPM Unitwise Imp Questions
No ratings yet
SPM Unitwise Imp Questions
4 pages
Test Hall Ticket 1101 02444 231218 0008: Registration Number
No ratings yet
Test Hall Ticket 1101 02444 231218 0008: Registration Number
1 page
Ecografo M7 - Especificaciones
No ratings yet
Ecografo M7 - Especificaciones
2 pages
BEREKET Database Design Basics
No ratings yet
BEREKET Database Design Basics
7 pages
Impact CAD Brochure - English
No ratings yet
Impact CAD Brochure - English
6 pages
AP® Computer Science AB Syllabus Course Overview (C1)
No ratings yet
AP® Computer Science AB Syllabus Course Overview (C1)
8 pages
Kisssoft Training Cylindrical Gear Design, Analysis and Optimization
No ratings yet
Kisssoft Training Cylindrical Gear Design, Analysis and Optimization
4 pages
GIT-4th Lesson-Note
No ratings yet
GIT-4th Lesson-Note
14 pages
10 VHDL Concurrent Statements
No ratings yet
10 VHDL Concurrent Statements
12 pages
Letter To Field Office SD-WAN
No ratings yet
Letter To Field Office SD-WAN
126 pages
Chapter 01: Introduction CSS430 Systems Programming
No ratings yet
Chapter 01: Introduction CSS430 Systems Programming
27 pages
Learner Guide Troubleshooting HP Networks 1041 No Watermark
No ratings yet
Learner Guide Troubleshooting HP Networks 1041 No Watermark
108 pages
Step-By-Step Trai Ni NG Manual: Engl I SH Versi On
No ratings yet
Step-By-Step Trai Ni NG Manual: Engl I SH Versi On
42 pages
Class 11 CT 4 AK (CH 7 Lists)
No ratings yet
Class 11 CT 4 AK (CH 7 Lists)
3 pages
Social Networking
100% (1)
Social Networking
5 pages

Cache Hit Ratio

Uploaded by

Cache Hit Ratio

Uploaded by

For many application developers their database is a black box.

Data goes in, comes back out and in

Understanding your Cache and its Hit Rate

Understanding Index Usage

Heroku Dashboard as an Example

# SELECT relname, 100 * idx_scan / (seq_scan + idx_scan) percent_of_times_index_used, n_live_tup rows_in_table

Index Cache Hit Rate

You might also like