SLA Monitoring
SLA Monitoring
SLA Monitoring
TM
The SLA can outline how the SLI is going to be measured, the length of time or number of
measurements that must be outside the range, and the consequences when the agreement is
breached. If there are no consequences, there is no SLA.
When talking about SLAs it is very easy to get into a finger pointing exercise where each party feels
they are right. ”Yes, the performance wasn’t what was expected, the customer is right.” ”No, the SLA
wasn’t breached, the provider is right.” Looking at things from multiple angles sometimes reveals
there is no one right answer.
The first step is to collect objective and accurate measurements. Cloud providers often provide
reports on how they are performing in relation to SLAs, but verifying this with your own monitoring
increases confidence in the level of performance being received. Too much noise in the data and
you won’t be able to effectively determine if the SLA has been breached. However, measuring from
a single location isn’t effective; you need to measure from vantage points that match where your
users are located. The wider number of vantage points, the easier it is to see if issues are regional or
global.
Users are interacting with more than just the home page of an application. They are logging in,
searching for information, and downloading content. If there are SLOs in place for multiple areas of
the application, those need to be monitored.
Transaction monitoring can be more complicated than single page monitoring when determining
which parameters should be used. Different inputs may yield different results based on application
logic or the APIs used to pull information. Using the same inputs may not provide enough insight,
but it isn’t feasible to test every permutation.
Catchpoint offers the ability to quickly create multi-step transactions through a Selenium-based
Chrome script recorder. The script recorder makes it easy to create and upload a transaction to
detect issues with key business processes. Logic can be inserted into scripts to choose different
search terms, select valid travel dates, or always click on the second item in a list by dynamically
cycling through a list of terms, dates, or numbers.
Alerting is a core component of monitoring. Alerts let us know when a problem is occurring and
when an action needs to be taken. The actions taken when a SaaS service is facing problems are
different from when an internally-hosted application experiences similar problems, but they are no
less important. Configuring alerts when errors occur or response time thresholds are passed can set
in motion a series of events that may include:
When tracking the SLAs for third-party tags, the hosts and zones feature allows you to create custom
categories of content known as zones. A zone can be created for your social media tags, advertising
tag, analytics tags, etc. Once the zones are enabled, alerts can be triggered when thresholds are
passed. This enables organizations to quickly identify which third-party component is potentially in
breach of an SLA.
When not actively troubleshooting a performance problem, it can be easy to lose track of whether
SLAs are being met. Receiving regular reports that track performance in relation to the SLOs ensures
that all stakeholders are made aware if and when an SLA has been breached in a given time period.
Color-coded reports from Catchpoint can be emailed on a scheduled basis (hourly to quarterly) or
added as a widget to a dashboard to keep all stakeholders informed of SLA statuses.
While having multiple vendors means more SLAs to manage, the benefits of
less downtime far outweighs the consequences of sustained outages.
SLAs provide value and protection to both the consumer and the vendor. The
framework outlined here can help consumers monitor and track the various
service levels holding vendors accountable and potentially recouping costs.
You’re already monitoring your internal applications infrastructure; adding a
few more monitors won’t hurt anybody. In fact, it will help everyone involved.
Catchpoint is a leading digital experience intelligence company Real browser, multi-transaction, mobile, HTML code, API,
that provides unparalleled insight into your customer-critical streaming, DNS, FTP, TCP, SMTP, ping, traceroute, SSH, NTP, IMAP,
services to help you consistently deliver amazing digital web socket and MQTT.
experiences. Catchpoint is the only performance digital
experience monitoring platform that provides integrated synthetic Deepest and broadest diagnostics
and real user monitoring, comprehensive test types, real-time 100 days of object level data; 3 years of raw aggregate data.
analytics, and a diverse node network to help you continuously
preempt performance issues and optimize service delivery. More To request a free trial, visit
than 400 customers in over 30 countries trust Catchpoint to catchpoint.com/freetrial