AWS Splunk Infrastructure Monitoring 101: The Power to Predict and Prevent
The ability to see what's happening across an organization's infrastructure helps teams to predict and prevent outages.
[Figure: The Complexity of IT Infrastructure. Interconnected components include building systems, dev/apps servers, cloud, network, office, storage, backup/DR, security and desktop/BM.]
More complexity = more room for failure
As we see in the preceding graphic, modern IT infrastructure is an
extraordinarily complex system of interconnected technologies, each of
which has the potential to run into issues or fail outright. And with more
components being added to these stacks as technology evolves, new
opportunities for outages arise. In fact, between 2017 and 2018, instances
of outages or “server service degradation periods” increased from 25% to
31%, and if we look at on-premises data centers, that number rises to 48%.*
[Chart: Outages rose from 25% in 2017 to 31% in 2018 and 34% in 2019. Callout: 60% of data center outages could have been prevented with better management, processes or configuration.]
Seventy-eight percent of organizations say they had an IT service outage in the past three years — a higher percentage than in previous years — and 41% classified it as minimal or negligible. Outages in these categories signal bigger problems and are troubling more for their frequency than for their singular impact. When asked about significant, serious or severe outages — which can cause substantial financial and reputational damage — 31% have been affected.**

About 20% of organizations had a serious or severe outage in the past three years — that is, an outage that was costly, caused reputational damage and, in some cases, had other major implications. Nearly a third of all outages cause financial or reputational damage.

Outages are increasingly costly for businesses. In 2020, a greater percentage of outages cost more than $1 million (now nearly one in six rather than one in ten, as in 2019), and a greater percentage cost between $100,000 and $1 million (40% vs. 28%).

Another angle on that statistic: because of largely preventable errors, almost half of employees and users experienced issues with their apps and services. That kind of disruption can result in thousands of employee hours wasted, customer dissatisfaction and, ultimately, loss of business.

Recent high-profile outages illustrate the risk:

• On August 25, Slack suffered a service outage that affected users in the U.K. and western and southern Europe. Slack users faced trouble with files, messages and connecting to Slack.

• On September 23, Tesla suffered an hour-long global outage of its internal systems that left several Tesla owners unable to connect to their cars through the mobile app or the website. Tesla's energy products, Tesla Solar and Powerwall home battery systems, were inoperative too. The outage was due to an internal break of its application programming interface (API).

The best way to ensure that issues are resolved quickly — or prevented altogether — is to monitor and troubleshoot the underlying infrastructure as well as the mission-critical apps and services that run on it. While observing any one element of the infrastructure stack is a straightforward proposition, observing each piece individually introduces a host of additional problems.
We can think of ITOps as a stack of physical and logical layers, each with its own technologies, systems and services,
and each with a corresponding team or individual responsible for monitoring and maintaining it. This makes gaining
visibility into the infrastructure as a whole fundamentally problematic, despite being essential.
A per-layer monitoring practice leads to siloed teams and incompatible views of data. Each layer has different vital
metrics, different monitoring tools and dashboards and different personnel behind the keyboard. In practice, per-layer
monitoring means people looking at limited information using different languages, leading to difficulties detecting and
investigating outages and issues as well as restoring service.
Observability is key to a successful IT monitoring solution

One way to avoid the problems of per-layer monitoring is building with observability in mind. Observability is the natural evolution of what we used to call monitoring. Observability recognizes that today's infrastructure and applications are living, breathing organisms that evolve at a much faster rate than ever before. Observability encompasses all of the things we used to do in monitoring, like watching for known failure conditions, and extends it to support the challenges of today's applications, like being prepared for all the unknown failure conditions.

IT stack layers:

Servers

A high-quality user experience depends on effective monitoring of the systems that support the product. It allows administrators and ITOps personnel to see resource usage patterns and optimize the servers keeping websites and applications running smoothly.

Server operating systems routinely record data such as connections, file systems mounted and system memory usage. The level of detail is configurable by the system administrator; however, there are sufficient options to provide a complete picture of system activity throughout its lifetime. Having visibility into these pieces of server data and monitoring them proactively can help teams find resolutions more quickly or prevent outages altogether.

Imagine a gaming company whose users depend on reliable, high-speed access to a web app — not terribly hard to picture, is it? Having immediate visibility and insight into server performance would be critical to that company's success. The ability to quickly resolve server-based issues (or predict and avoid them altogether) would have a significant impact on the product's uptime and directly impact customer satisfaction and, ultimately, revenues.

Having a single tool from which to monitor the health of servers — one that correlates event data and log data into a seamless experience — enables ITOps teams to quickly isolate what is driving the failure (like memory usage on a single server) and resolve it. It also facilitates proactivity. The ability to create alerts and automations within the monitoring tool saves teams time and allows them to focus their efforts on other tasks.
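As a concrete illustration of the kind of server data described above, the following minimal sketch polls a few operating-system metrics (CPU, memory, disk and established network connections) and prints a simple alert when a threshold is crossed. It is only a sketch under stated assumptions: it relies on the open-source psutil library, the thresholds are arbitrary, and a real deployment would forward these readings to a monitoring platform rather than print them.

# Minimal sketch (not a Splunk API): poll basic server health metrics with the
# third-party psutil library and flag simple threshold breaches.
import time

import psutil

CPU_ALERT_PCT = 90      # illustrative threshold, not a recommendation
MEMORY_ALERT_PCT = 85   # illustrative threshold, not a recommendation


def collect_metrics():
    """Gather a small slice of the data a server OS exposes about itself."""
    return {
        "cpu_percent": psutil.cpu_percent(interval=1),
        "memory_percent": psutil.virtual_memory().percent,
        "disk_percent": psutil.disk_usage("/").percent,
        "tcp_established": sum(
            1 for c in psutil.net_connections(kind="tcp")
            if c.status == psutil.CONN_ESTABLISHED
        ),
    }


def check_alerts(metrics):
    """Return human-readable alerts for any metric over its threshold."""
    alerts = []
    if metrics["cpu_percent"] > CPU_ALERT_PCT:
        alerts.append(f"CPU usage high: {metrics['cpu_percent']:.0f}%")
    if metrics["memory_percent"] > MEMORY_ALERT_PCT:
        alerts.append(f"Memory usage high: {metrics['memory_percent']:.0f}%")
    return alerts


if __name__ == "__main__":
    while True:
        m = collect_metrics()
        print(m)
        for alert in check_alerts(m):
            print("ALERT:", alert)  # in practice, send to the monitoring tool
        time.sleep(60)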
Network

While each organization's needs and data sources will vary, there are reasons for monitoring network data that are common across companies and institutions:

• Protecting corporate networks from attacks.

• Providing visibility into network traffic.

• Determining the role of the network in the overall availability and performance of critical services.

Monitoring a network means more than having visibility into the state of the hardware that supports that network, like routers, switches, etc. It includes monitoring network event logs, activities across the network infrastructure, traffic bottlenecks or suspicious behavior.

Cloud

Running workloads in a cloud environment is not "set it and forget it." ITOps teams still need to monitor the performance, usage, security and availability of the cloud infrastructure continuously. And with the right solutions, it's possible to manage IT systems and derive actionable insights from all of the data in one system, even if the services are running in hybrid environments.

When an organization migrates its services to a cloud platform (or between cloud platforms), for instance, having end-to-end visibility into every stage of the migration can help teams establish baseline performance, monitor services during the transition and ensure that all services are running optimally after the transition is over.
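To make the idea of pulling cloud infrastructure data into one place concrete, here is a minimal sketch that reads a basic utilization metric for a single EC2 instance from Amazon CloudWatch. It assumes the boto3 SDK, configured AWS credentials and a placeholder instance ID; a real monitoring setup would collect far more metrics and stream them into a consolidated solution rather than print them.

# Minimal sketch: fetch average CPU utilization for one EC2 instance from
# CloudWatch. Assumes boto3 is installed and AWS credentials are configured;
# the instance ID is a placeholder.
from datetime import datetime, timedelta, timezone

import boto3

INSTANCE_ID = "i-0123456789abcdef0"  # placeholder value

cloudwatch = boto3.client("cloudwatch")
now = datetime.now(timezone.utc)

response = cloudwatch.get_metric_statistics(
    Namespace="AWS/EC2",
    MetricName="CPUUtilization",
    Dimensions=[{"Name": "InstanceId", "Value": INSTANCE_ID}],
    StartTime=now - timedelta(hours=1),
    EndTime=now,
    Period=300,                # one datapoint per five minutes
    Statistics=["Average"],
)

for point in sorted(response["Datapoints"], key=lambda p: p["Timestamp"]):
    print(point["Timestamp"], f"{point['Average']:.1f}% CPU")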
Services running on hybrid and cloud infrastructures can be opaque, leading to gaps in ITOps teams' understanding of the system as a whole. Organizations eager to get the benefits of cloud often overspend on cloud services — on deprecated or unused services, unknown redundancies or excessive resources. Ingesting all of the cloud infrastructure data into a single environment, replacing the multitude of individual monitoring tools with a consolidated solution, can provide an understanding of how resources are performing and being used, allowing for optimization of utilities and billing.

[Figure: Public cloud (AWS) alongside hybrid cloud (on-prem plus a private/public cloud mix).]

Infrastructure monitoring should provide out-of-the-box, end-to-end visibility into all stages of cloud migration — before, during and after — and full visibility into public cloud IaaS. The right monitoring solution will simplify the multitude of monitoring tools and allow you to monitor your entire stack in one place. Teams can collaborate more efficiently, with greater visibility into resources. Built-in dashboards and accurate alerts provide shorter mean time to detect (MTTD), helping resolve issues before they impact operations.

Kubernetes and Containers

Since the introduction of the concept in 2013, adoption of containers has skyrocketed across technology organizations. They share some conceptual features with virtual machines, but they differ in a few essential ways. The easiest way to understand a container is to think of it as exactly that — a container — a receptacle that holds something securely and can be used to transport its contents. A software container performs a similar function. It allows developers to package an application's code, configuration files, libraries, system tools and everything else needed to execute that app into a self-contained unit, so that they can move the package and run it anywhere with ease.

Containers enable a number of significant benefits to organizations, developers and users — faster deployment, smaller footprints and consistency across environments. But containers, like virtual machines, have their own system metrics that need to be monitored, and with many containers running side by side, the task of monitoring, optimizing and troubleshooting them becomes much more complicated.

Cloud-native infrastructure such as containers, Kubernetes and serverless is highly dynamic and ephemeral. When the cloud infrastructure only lives for minutes, the monitoring solution needs to detect and enable automatic remediation within seconds.

For all the benefits that containers bring to IT organizations, they bring new considerations that must be addressed, including:

• Significant blind spots: Containers are designed to be disposable. Because of this, they introduce several layers of abstraction between the application and the underlying hardware to ensure portability and scalability. This all contributes to a significant blind spot when it comes to conventional monitoring.

• Increased need to record: The easy portability of so many interdependent components creates an increased need to maintain telemetry data to ensure observability into the performance and reliability of the application, container and orchestration platform.

• The importance of visualizations: The scale introduced by containers and container orchestration requires the ability both to visualize the environment for immediate insight into infrastructure health and to zoom in and view the health and performance of individual containers, nodes and pods. The right monitoring solution should provide this workflow.

A good container monitoring solution enables ITOps to stay on top of a dynamic container-based environment by unifying container data with other infrastructure data to provide better contextualization and root cause analysis. Learn more about container monitoring in The Essential Guide to Container Monitoring.
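As a small illustration of keeping watch over a dynamic container environment, the sketch below uses the official Kubernetes Python client to list pods across namespaces and surface any that are not running or are restarting frequently. It assumes the kubernetes package and a valid kubeconfig, and the restart threshold is arbitrary; a full container monitoring solution would correlate this signal with node, container and application metrics.

# Minimal sketch: flag pods that are not Running or that restart frequently.
# Assumes the `kubernetes` Python client and a reachable cluster/kubeconfig;
# the restart threshold is illustrative only.
from kubernetes import client, config

RESTART_THRESHOLD = 5  # arbitrary example value

config.load_kube_config()  # use config.load_incluster_config() when run inside a pod
v1 = client.CoreV1Api()

for pod in v1.list_pod_for_all_namespaces(watch=False).items:
    phase = pod.status.phase
    restarts = sum(cs.restart_count for cs in (pod.status.container_statuses or []))
    if phase != "Running" or restarts > RESTART_THRESHOLD:
        print(f"{pod.metadata.namespace}/{pod.metadata.name}: "
              f"phase={phase}, restarts={restarts}")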
Having a solution that provides a holistic view of the infrastructure alongside detailed views of
individual components is vital if an organization wants to proactively tackle infrastructure issues
and reduce mean time to detection (MTTD), investigation and restoration. It’s also an essential
piece of future planning; knowing how the infrastructure has performed historically, and how it’s
performing in real time, provides invaluable insights that reduce complexity when integrating new
technologies and building new experiences for users and employees.
A single platform with a unified experience that provides ITOps with access to all the information across domains opens up opportunities for cross-functional investigation and holistic end-to-end infrastructure monitoring. It removes blind spots from the system and, as a result, reduces mean time to resolution (MTTR) because teams can more quickly identify the problem, fix it and move forward.

The biggest benefit of an AI/ML-powered monitoring system is the enormous savings in time and effort on the part of ITOps teams. When repetitive tasks and processes are automated, ITOps teams have the bandwidth to do the kinds of things AI and ML are ill-equipped to do — creative problem solving, upgrading existing technologies and planning for the future.
• Learn more about Acquia's cloud monitoring success.

• Learn more about Namely's microservices monitoring success.

• Learn more about Imprivata's container monitoring success.

• Learn more about CloudShare's virtualization monitoring success.
Learn More
Splunk, Splunk>, Data-to-Everything, D2E and Turn Data Into Doing are trademarks and registered
trademarks of Splunk Inc. in the United States and other countries. All other brand names, product
names or trademarks belong to their respective owners. © 2021 Splunk Inc. All rights reserved.