Deploying Splunk Enterprise On Microsoft Azure
Deploying Splunk Enterprise On Microsoft Azure
Splunk® provides the leading platform for Operational can also run independently from within different cloud
Intelligence. Splunk software searches, monitors, instances. Depending on the deployment infrastructure,
analyzes and visualizes machine-generated big data considerations must also be taken to allocate the
from websites, applications, servers, networks, sensors proper amount of resources per component type.
and mobile devices. More than 11,000 organizations
Forwarders perform data collection, data forwarding
use Splunk software to deepen business and customer
and data load balancing. Low amounts of resources
understanding, mitigate cybersecurity risk, improve
are required to run a forwarder as they typically read
service performance and reduce costs. Splunk
and send data with minimal overhead. A Universal
Enterprise indexes machine data in real time, enabling
Forwarder is a lightweight package of the Splunk
multiple roles across the organization — from system
software that can perform most, if not all, of the
administrators to business analysts — to rapidly gain
forwarder functionality.
insight from the massive amounts of machine data
generated by your environment. Indexers write the data to a storage device and perform
searching on the data. These can be resource intense
Adopting a cloud strategy enables organizations
and require I/O and CPU allotment.
to increase agility, reduce costs, decrease time to
market and empower innovation. Splunk Enterprise is Search heads search for information across indexers
perfect for deploying in a cloud environment, offering and require CPU and memory allotment.
enterprise-grade availability and scalability to support
Budgeting system resources and bandwidth to enable
the collection of hundreds of terabytes of data per
search and index performance depend on the total
day from workloads residing on-premises, in the
volume of data being indexed and the number of
cloud or across hybrid environments. This document
active concurrent searches (scheduled or otherwise)
covers guidelines for deploying Splunk Enterprise on
at any time.
Microsoft Azure, an open and flexible cloud platform
with a growing collection of integrated cloud services, In addition to rapidly writing data to disk, indexers
including analytics, computing, database, mobile, perform much of the work involved in running searches:
networking, storage and web. reading data off disk, decompressing it, extracting
knowledge and reporting. Since indexers incur most
Splunk Deployment Components of the workload, increases in indexing volume should
A typical Splunk deployment includes Splunk be tied to an increase in indexer instances. Deploying
forwarders, indexers and search heads. Splunk additional indexers will distribute the load of increased
Enterprise is a single package that can perform one or data volume, resulting in reduced contention for
many of the roles that each component would normally resources and improved search performance.
deliver, in addition to others. The software can be Common Azure deployments leverage a combination
installed within minutes on your choice of hardware of forwarders and network streams to send data to the
(physical, cloud or virtual) and operating system. The Splunk indexer(s). While forwarders are not required to
package is available for download for most operating gather data from the source, they do provide certain
systems. Depending on the deployment infrastructure, benefits such as flexibility, load balancing and reliability.
considerations must also be taken to allocate the Using a syslog output (from a data source) or a file
proper amount of resources per component type. mount is also a common method of getting data into
While all major Splunk components can be run from the Splunk indexer. Additionally, modular inputs, which
a single installation on a single cloud instance, they are extensions to Splunk Enterprise that define a
TECH BRIEF
custom data input, and HTTP Event Collector, a highly unmanaged. Splunk recommends using managed VHDs
efficient and secure mechanism to send high volumes for Splunk storage. More specifically, you can store
of data directly to Splunk, can be used to collect data Splunk application and configurations in the persistent
from various API sources. OS disk and store Splunk indexes across multiple
persistent data disks.
Other Splunk components include the Deployment
Server (configuration management), License Master Managed Disks are preferred for various reasons:
(license management) and Master Node (data • Managed Disks transparently handle storage
replication management). accounts. With Unmanaged Disks, if IOPS across all
disks in a storage account approach storage account
Performance Considerations limits, you must create additional storage accounts as
Within Microsoft Azure well as rebalance your virtual machine disks across
There are several performance factors to consider the storage accounts to insure they stay within
when deploying Splunk software on Microsoft Azure. the IOPS limit. Managed Disks remove the need to
These considerations are Azure Virtual Machine (VM) provision additional storage accounts, effectively
image and size, and underlying Azure Storage. removing these IOPS limits.
Databases Web Services App Servers Networking Web Services App Servers
Cloud On-Premises
Splunk, Splunk>, Data-to-Everything, D2E and Turn Data Into Doing are trademarks and registered trademarks of Splunk Inc. in the United States and
other countries. All other brand names, product names or trademarks belong to their respective owners. © 2020 Splunk Inc. All rights reserved. 20-13235-Splunk-ES on Microsoft Azure-102-TB