Prometheus Promql For Humans

This document provides an introduction and cheatsheet for PromQL, the query language for Prometheus. It explains the basics of instant and range vectors, important functions for visualizing range vectors as instant vectors like rate() and increase(), and how to filter and aggregate metrics using labels. It also provides examples for common queries like HTTP request rates, CPU and memory usage, and alert firing counts.

Uploaded by

Shirouit

PromQL For Humans

PromQL is the built-in query language for Prometheus. Here at Timber we've
found Prometheus to be awesome, but PromQL difficult to wrap our heads around.
This is our attempt to change that.

PromQL Cheatsheet

Basics
Instant Vectors

Only Instant Vectors can be graphed.

http_requests_total

This gives us all the HTTP requests, but we've got 2 issues.

1. There are too many data points to decipher what's going on.
2. You'll notice that http_requests_total only goes up, because it's a counter.
Counters are common in Prometheus, but their raw values are not useful to graph.

I'll show you how to approach both.

It's Easy To Filter By Label.

http_requests_total{job="prometheus", code="200"}

You Can Match Label Values Using Regex Matching.

http_requests_total{status_code=~"2.*"}

Regex matches are fully anchored, so "2.*" matches any value that begins with 2.
If you're interested in learning more, see the Prometheus docs on regex matching.
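A few more matcher variations, as a sketch. Each expression is a separate query, and the label values are assumptions for illustration; what exists depends on how your services are instrumented.

```promql
# Exact match on a label value.
http_requests_total{status_code="200"}

# Everything except an exact value.
http_requests_total{status_code!="200"}

# Regex match. Patterns are fully anchored, so this matches
# three-character values that start with 2, such as 200 or 204.
http_requests_total{status_code=~"2.."}

# Negative regex match: any value that is not a 4xx or 5xx code.
http_requests_total{status_code!~"4..|5.."}
```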

Range Vectors

Range Vectors contain data going back in time: every sample for each series
within the given window.

Recall: Only Instant Vectors can be graphed. You'll soon be able to see how to
visualize Range Vectors using functions.

http_requests_total[5m]

You can also use (s, m, h, d, w, y) to represent (seconds, minutes, hours, ...)
respectively.
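
For instance, the same selector with different windows. Each line is a separate query, and the window sizes are arbitrary choices for illustration.

```promql
http_requests_total[30s]   # samples from the last 30 seconds
http_requests_total[1h]    # samples from the last hour
http_requests_total[1d]    # samples from the last day
```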

Important Functions
For Range Vectors

You'll notice that we're able to graph all these functions. Since only Instant Vectors
can be graphed, they take a Range Vector as a parameter and return an Instant
Vector.

Per-Second Increase Of http_requests_total Averaged Over The Last 5 Minutes.

rate(http_requests_total[5m])

Irate

Looks at the 2 most recent samples (up to 5 minutes in the past), rather than
averaging like rate.

irate(http_requests_total[5m])

It's best to use rate when alerting, because averaging the data over a period of
time produces a smooth graph. Spiky graphs can cause alert overload, fatigue,
and bad times for all due to repeatedly triggering thresholds.
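
As a sketch, an alert condition built on rate might look like the following. The threshold of 10 errors per second is a made-up number, and the status_code label is assumed to exist on your metric.

```promql
# True while 5xx responses, averaged over 5 minutes and summed
# across all series, exceed 10 per second.
sum(rate(http_requests_total{status_code=~"5.*"}[5m])) > 10
```

Swapping rate for irate here would make the condition react to a single scrape-to-scrape spike.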

HTTP Requests In The Last Hour.

This is equal to the rate multiplied by the number of seconds in the range.

increase(http_requests_total[1h])
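
Put differently, the two queries below should return the same values. Each line is a separate query, and 3600 is the number of seconds in the 1h range.

```promql
# Total increase over the last hour.
increase(http_requests_total[1h])

# Per-second rate over the same hour, scaled back up to the hour.
rate(http_requests_total[1h]) * 3600
```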

These are a small fraction of the available functions, just the ones we found most
popular. You can find the rest in the Prometheus function documentation.

For Instant Vectors

Sum Across All Series

sum(rate(http_requests_total[5m]))

You'll notice that rate(http_requests_total[5m]) above provides a large
amount of data. You can filter that data using your labels, but you can also look at
your system as a whole using sum (or do both).

You can also use min, max, avg, count, and quantile similarly.

This query tells you the total rate of HTTP requests, but isn't directly useful
in deciphering issues in your system. I'll show you some functions that allow you to
gain insight into your system.
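
The other aggregation operators mentioned above follow the same shape as sum. A short sketch, with each expression being a separate query; the 0.95 quantile is an arbitrary choice for illustration.

```promql
# Highest per-series request rate.
max(rate(http_requests_total[5m]))

# Average request rate across series.
avg(rate(http_requests_total[5m]))

# Number of series currently returned for the metric.
count(http_requests_total)

# 95th percentile of the per-series request rates.
quantile(0.95, rate(http_requests_total[5m]))
```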

Sum By Status Code

sum by (status_code) (rate(http_requests_total[5m]))

Now you can see the difference between each status code.

You can also use without rather than by to sum over every label except those
passed as parameters to without.
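
A minimal sketch of without, assuming the series carry an instance label (Prometheus attaches one to scraped targets):

```promql
# Sum away the instance label; every other label is kept.
sum without (instance) (rate(http_requests_total[5m]))
```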

Offset

You can use offset to change the time for Instant and Range Vectors. This can
be helpful for comparing current usage to past usage when determining the
conditions of an alert.

sum(rate(http_requests_total[5m] offset 5m))

Remember to put offset directly after the selector.
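
For comparing current usage to past usage, a sketch along these lines can help. The one-week offset is an arbitrary choice for illustration.

```promql
# Ratio of the current request rate to the rate one week ago.
# A value of 2 would mean traffic has doubled week over week.
sum(rate(http_requests_total[5m]))
  /
sum(rate(http_requests_total[5m] offset 1w))
```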

Operators
Operators can be used between scalars, vectors, or a mix of the two. Operations
between vectors expect to find matching elements for each side (also known as
one-to-one matching), unless otherwise specified.

There are Arithmetic (+, -, *, /, %, ^), Comparison (==, !=, >, <, >=, <=) and Logical
(and, or, unless) operators.

Vector Matching

One-to-One

Two vector elements match if and only if their label sets are equal.

API 5xxs Are 10% Of HTTP Requests

rate(http_requests_total{status_code=~"5.*"}[5m]) > .1 * rate(http_requests_total[5m])

We're looking to graph whenever more than 10% of an instance's HTTP requests
are errors. Before comparing rates, PromQL first checks that the label sets of
the elements on each side are equal.

You can use on to compare using only certain labels, or ignoring to compare on all
labels except the ones listed.
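
A sketch of both modifiers, assuming the metric carries status_code and instance labels. Each expression is a separate query.

```promql
# Share of traffic that returned exactly 500.
# ignoring(status_code) drops that label from the match, so each 500 series
# pairs with the total that was summed over all status codes.
rate(http_requests_total{status_code="500"}[5m])
  / ignoring(status_code)
sum without (status_code) (rate(http_requests_total[5m]))

# Per-instance 5xx ratio, matching on the instance label only.
sum by (instance) (rate(http_requests_total{status_code=~"5.*"}[5m]))
  / on (instance)
sum by (instance) (rate(http_requests_total[5m]))
```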

Many-to-One

It's possible to use comparison and arithmetic operations where an element on one
side can be matched with many elements on the other side. You must explicitly tell
Prometheus what to do with the extra dimensions.

You can use group_left if the left side has a higher cardinality, else use
group_right.
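
A sketch of many-to-one matching, again assuming a status_code label on the metric:

```promql
# Several 5xx codes on the left (500, 502, ...) share one total on the right,
# so the "many" side is the left and group_left is required.
rate(http_requests_total{status_code=~"5.*"}[5m])
  / ignoring(status_code) group_left
sum without (status_code) (rate(http_requests_total[5m]))
```

Without group_left, Prometheus rejects the query as soon as two left-hand series match the same right-hand series.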

Examples
Disclaimer: We've hidden some of the information in the pictures using the Legend
Format for privacy reasons.

CPU Usage By Instance

100 * (1 - avg by(instance)(irate(node_cpu{mode='idle'}[5m])))

Average CPU usage per instance, as a percentage, based on the most recent samples
in a 5 minute window.
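
node_exporter 0.16 and later renamed this metric to node_cpu_seconds_total. Assuming that version or newer, the equivalent query would look like this:

```promql
# Same calculation with the renamed metric (assumes node_exporter >= 0.16).
100 * (1 - avg by (instance) (irate(node_cpu_seconds_total{mode="idle"}[5m])))
```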

Memory Usage

node_memory_Active / on (instance) node_memory_MemTotal

Fraction of memory being used, by instance (multiply by 100 for a percentage).

Disk Space

node_filesystem_avail{fstype!~"tmpfs|fuse.lxcfs|squashfs"} / node_filesystem_size{fstype!~"tmpfs|fuse.lxcfs|squashfs"}

Fraction of disk space still available, by filesystem. We're looking for the available
space, ignoring filesystems whose fstype is tmpfs, fuse.lxcfs, or squashfs, and
dividing that by their total size.
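
The ratio above is available space over total size. If you want the share that is in use, expressed as a percentage, a sketch would be:

```promql
# Percentage of each filesystem that is in use.
100 * (1 -
  node_filesystem_avail{fstype!~"tmpfs|fuse.lxcfs|squashfs"}
    /
  node_filesystem_size{fstype!~"tmpfs|fuse.lxcfs|squashfs"}
)
```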

HTTP Error Rates As A % Of Traffic

rate(http_requests_total{status_code=~"5.*"}[5m]) / rate(http_requests_total[5m])

Alerts Firing In The Last 24 Hours

sum(sort_desc(sum_over_time(ALERTS{alertstate="firing"}[24h]))) by (alertname)
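
In the query above, sort_desc sits inside sum, so the ordering is not guaranteed to survive the aggregation. A variant that sorts the final result, offered as a sketch rather than as the original authors' query:

```promql
# Aggregate per alert name first, then sort the result in descending order.
sort_desc(sum by (alertname) (sum_over_time(ALERTS{alertstate="firing"}[24h])))
```

As far as we understand, ALERTS holds one sample per rule evaluation while an alert is firing, so the value reflects how long each alert spent firing rather than a count of distinct incidents.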

You can find more useful examples here.

3 Pillars Of Observability
It's important to understand where metrics fit in when it comes to observing your
application. I recommend you take a look at the 3 pillars of observability principle.
Metrics are an important part of your observability stack, but logs and tracing are
equally so.

We're a cloud-based logging company at Timber that seamlessly augments your
logs with context. We've got a great product built, and you can check it out for free!
