
Ultimate Monitoring Using Prometheus: Ensuring Optimal Performance & Reliability

Components:
• Prometheus: An open-source monitoring and alerting toolkit. It works on a pull model called scraping: at regular intervals it contacts each target system's metrics endpoint and fetches the current metric values.
• Node Exporter: A monitoring agent installed on every target machine; it exposes hardware and OS metrics (CPU, memory, disk, network) on an endpoint that Prometheus scrapes.
• Blackbox Exporter: Probes endpoints from the outside over HTTP, HTTPS, TCP, ICMP, or DNS, so we can check, for example, whether a website is reachable and responding.
• Alertmanager: Handles alerts sent by client applications such as the Prometheus server. We use it to route notifications when defined conditions fire, e.g. the website being down for 1-5 consecutive minutes, or a service becoming unavailable.

Prerequisites to start:
Created a security group with the following ports open:
• 22 for SSH
• 80 for HTTP
• 443 for HTTPS
• 25 for SMTP
• 465 for SMTPS
• 587 for SMTP (submission)
• 9090 for Prometheus
• 9093 for Alert manager
• 9115 for Blackbox Exporter
• 9100 for Node Exporter
Project steps:
Step 1: Launched 2 EC2 instances with the Ubuntu AMI (instance type t2.medium, storage 20 GB)
and named them Virtual Machine 1 and Virtual Machine 2.

Prometheus component/exporter tarballs: https://prometheus.io/download/

Step 2: In Virtual Machine 1:

Downloaded Node Exporter and started it


→ sudo apt update
## Download Node Exporter
→ wget https://github.com/prometheus/node_exporter/releases/download/v1.8.1/node_exporter-1.8.1.linux-amd64.tar.gz

## Extract Node Exporter
→ tar xvfz node_exporter-1.8.1.linux-amd64.tar.gz
→ mv node_exporter-1.8.1.linux-amd64 node_exporter

## Start Node Exporter
→ cd node_exporter
→ ./node_exporter &
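Starting the exporter with `&` ties it to the current shell session; if the session ends, so does the exporter. For anything longer-lived, a systemd unit is more robust. A minimal sketch, assuming the `ubuntu` user and the extraction path used above (the unit name and paths are illustrative, not from the original setup):

```ini
# /etc/systemd/system/node_exporter.service (hypothetical path and user)
[Unit]
Description=Prometheus Node Exporter
After=network.target

[Service]
User=ubuntu
ExecStart=/home/ubuntu/node_exporter/node_exporter
Restart=on-failure

[Install]
WantedBy=multi-user.target
```

Enable and start it with `sudo systemctl enable --now node_exporter`.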

Step 3:
In Virtual Machine 2, install Prometheus, Alertmanager, and Blackbox Exporter.
Install Prometheus
→ sudo apt update
→ wget https://github.com/prometheus/prometheus/releases/download/v2.52.0/prometheus-2.52.0.linux-amd64.tar.gz
→ tar xvfz prometheus-2.52.0.linux-amd64.tar.gz
→ mv prometheus-2.52.0.linux-amd64 prometheus
→ cd prometheus
→ ./prometheus --config.file=prometheus.yml &

Alert Manager
→ wget https://github.com/prometheus/alertmanager/releases/download/v0.27.0/alertmanager-0.27.0.linux-amd64.tar.gz
→ tar xvfz alertmanager-0.27.0.linux-amd64.tar.gz
→ mv alertmanager-0.27.0.linux-amd64 alertmanager
→ cd alertmanager
→ ./alertmanager --config.file=alertmanager.yml &

Blackbox Exporter
→ wget https://github.com/prometheus/blackbox_exporter/releases/download/v0.25.0/blackbox_exporter-0.25.0.linux-amd64.tar.gz
→ tar xvfz blackbox_exporter-0.25.0.linux-amd64.tar.gz
→ mv blackbox_exporter-0.25.0.linux-amd64 blackbox_exporter
→ cd blackbox_exporter
→ ./blackbox_exporter &
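The Blackbox Exporter ships with a default blackbox.yml; its `http_2xx` module is the one we will later use to probe the website. For reference, a minimal sketch of such a module (the timeout and IP-protocol values here are illustrative, not taken from the original setup):

```yaml
modules:
  http_2xx:                      # probe succeeds if the target returns HTTP 2xx
    prober: http
    timeout: 5s
    http:
      preferred_ip_protocol: ip4
```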

Once the above steps are complete, we should see all the extracted folders.

Once the VM-1 Node Exporter is up and running, its metrics page is available on port 9100.
Step 4:
Now let's run a simple game application to monitor.

To build and run the Boardgame application we need Java and Maven, so we install them with the commands below:

→ cd Boardgame
→ sudo apt install openjdk-11-jre-headless -y
→ sudo apt install maven -y
→ mvn package // build the project

We can then execute the generated jar file to serve the application in the browser:

→ cd target
→ ls // the built .jar file is listed here
→ java -jar database_service_project-0.0.4.jar

Now we can access the game application at: http://3.135.20.106:8080/


Step 5:
Next, go to VM-2 to configure the Prometheus server by defining alert rules for the
different scenarios; based on these rules we will receive alerts.
→ cd prometheus
→ ./prometheus --config.file=prometheus.yml &

We can access the Prometheus server at: http://3.145.128.69:9090/graph

For now, no alert rules are visible, so let's create a new alert_rules.yaml file to configure
alert rules in the Prometheus server:

vi alert_rules.yaml

groups:
- name: alert_rules   # name of the alert rules group
  rules:
  - alert: InstanceDown
    expr: up == 0   # fires when a scrape target is down
    for: 1m
    labels:
      severity: critical
    annotations:
      summary: "Endpoint {{ $labels.instance }} down"
      description: "{{ $labels.instance }} of job {{ $labels.job }} has been down for more than 1 minute."

  - alert: WebsiteDown
    expr: probe_success == 0   # fires when a Blackbox probe fails
    for: 1m
    labels:
      severity: critical
    annotations:
      summary: "Website down"
      description: "The website at {{ $labels.instance }} is down."

  - alert: HostOutOfMemory
    expr: node_memory_MemAvailable_bytes / node_memory_MemTotal_bytes * 100 < 25   # low available memory
    for: 5m
    labels:
      severity: warning
    annotations:
      summary: "Host out of memory (instance {{ $labels.instance }})"
      description: "Node memory is filling up (< 25% left)\n VALUE = {{ $value }}\n LABELS: {{ $labels }}"

  - alert: HostOutOfDiskSpace
    expr: node_filesystem_avail_bytes{mountpoint="/"} * 100 / node_filesystem_size_bytes{mountpoint="/"} < 50   # low disk space on /
    for: 1s
    labels:
      severity: warning
    annotations:
      summary: "Host out of disk space (instance {{ $labels.instance }})"
      description: "Disk is almost full (< 50% left)\n VALUE = {{ $value }}\n LABELS: {{ $labels }}"

  - alert: HostHighCpuLoad
    expr: 100 - (avg by (instance) (irate(node_cpu_seconds_total{mode="idle"}[5m])) * 100) > 80   # high CPU load
    for: 5m
    labels:
      severity: warning
    annotations:
      summary: "Host high CPU load (instance {{ $labels.instance }})"
      description: "CPU load is > 80%\n VALUE = {{ $value }}\n LABELS: {{ $labels }}"

  - alert: ServiceUnavailable
    expr: up{job="node_exporter"} == 0   # fires when the node_exporter job is unreachable
    for: 2m
    labels:
      severity: critical
    annotations:
      summary: "Service Unavailable (instance {{ $labels.instance }})"
      description: "The service {{ $labels.job }} is not available\n VALUE = {{ $value }}\n LABELS: {{ $labels }}"

  - alert: HighMemoryUsage
    expr: node_memory_Active_bytes / node_memory_MemTotal_bytes * 100 > 90   # high memory usage
    for: 10m
    labels:
      severity: critical
    annotations:
      summary: "High Memory Usage (instance {{ $labels.instance }})"
      description: "Memory usage is > 90%\n VALUE = {{ $value }}\n LABELS: {{ $labels }}"

  - alert: FileSystemFull
    expr: node_filesystem_avail_bytes / node_filesystem_size_bytes * 100 < 10   # file system almost full
    for: 5m
    labels:
      severity: critical
    annotations:
      summary: "File System Almost Full (instance {{ $labels.instance }})"
      description: "File system has < 10% free space\n VALUE = {{ $value }}\n LABELS: {{ $labels }}"

Now we need to reference the above rules file in the Prometheus server's
prometheus.yml file.
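A minimal sketch of the entry to add to prometheus.yml (assuming alert_rules.yaml sits in the same directory as the prometheus binary; adjust the path to your layout):

```yaml
rule_files:
  - "alert_rules.yaml"   # the alert rules file created above
```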

Now, to view these alert rules in the Prometheus web UI, restart the Prometheus server
(sending the process SIGHUP also reloads the configuration without a restart):

→ pgrep prometheus // get the process id
→ kill <pid>
→ ./prometheus &
Step 6:
Now we need to connect both the Alertmanager and the VM-1 Node Exporter to the Prometheus
server by updating prometheus.yml.

After restarting the Prometheus server, we should be able to see the Node Exporter in the
Prometheus Targets section.
Next, we need to configure the Blackbox Exporter to probe the website application, so let's
update the scrape configs in prometheus.yml:

→ vi prometheus.yml
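A hedged sketch of what the updated prometheus.yml might contain; the job names, the `<VM-1-IP>` placeholder, and the probed URL are assumptions based on the hosts used in this walkthrough, not a verbatim copy of the original file:

```yaml
alerting:
  alertmanagers:
    - static_configs:
        - targets: ["localhost:9093"]   # Alertmanager running on VM-2

scrape_configs:
  - job_name: "prometheus"
    static_configs:
      - targets: ["localhost:9090"]

  - job_name: "node_exporter"
    static_configs:
      - targets: ["<VM-1-IP>:9100"]     # Node Exporter on VM-1

  - job_name: "blackbox"
    metrics_path: /probe
    params:
      module: [http_2xx]                # probe via HTTP, expect a 2xx response
    static_configs:
      - targets:
          - http://3.135.20.106:8080/   # the Boardgame application
    relabel_configs:
      - source_labels: [__address__]
        target_label: __param_target    # pass the website URL as ?target=
      - source_labels: [__param_target]
        target_label: instance          # keep the URL as the instance label
      - target_label: __address__
        replacement: localhost:9115     # actually scrape the Blackbox Exporter
```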

Restart the Prometheus server for the changes to take effect.

We also need to start the Blackbox Exporter.

When we start the Alertmanager, we won't see any alerts routed yet, since the Alertmanager
itself hasn't been configured.

So, let's configure it.


Now we need to configure email notifications so that we receive an email when the defined
conditions are met.

To receive email notifications, we need to enable 2-Step Verification on the Gmail account.
Step 7:

Next, go to https://myaccount.google.com/apppasswords, enter a name, and generate an
app password, which will be used in the routing configuration below.

cd alertmanager
vi alertmanager.yml

---
route:
  group_by:
    - alertname
  group_wait: 30s
  group_interval: 5m
  repeat_interval: 1h
  receiver: email-notifications

receivers:
  - name: email-notifications
    email_configs:
      - to: [email protected]
        from: [email protected]
        smarthost: smtp.gmail.com:587
        auth_username: [email protected]
        auth_identity: [email protected]
        auth_password: "<gmail-app-password>"   # the app password generated in Step 7; never commit the real value
        send_resolved: true

inhibit_rules:
  - source_match:
      severity: critical
    target_match:
      severity: warning
    equal:
      - alertname
      - dev
      - instance

Now, restart the Alertmanager and check.

Hurray, the monitoring setup is complete!


Everything seems fine now
Step 8:
Next, we will test the entire setup by shutting down the game application.

The alert status first shows as pending.

After 1 minute the status changes to firing, and shortly afterwards we receive an email
notification.

We can also view the notification in the Alertmanager UI.


Next, we will try terminating the Node Exporter.

Terminating the Node Exporter triggers notifications for both the EC2 instance and the
service.
