Splunk Search Optimization

Search optimization is a strategy that lets a search run as efficiently as possible. In this section, we will learn how to optimize searches on the Splunk platform.

A poorly constructed search runs longer, retrieves larger quantities of data from the indexes than is required, and consumes more memory and network resources than necessary. Multiply these problems across hundreds or thousands of searches, and the result is a slow or sluggish deployment.

There is a set of fundamental principles we can follow to optimize our searches:

Retrieve only the required data
Move as little data as possible
Parallelize as much work as possible
Set appropriate time windows
We can use the following methods to apply these search optimization principles:

Filter as much as possible in the initial search (see the sketch after this list)
Perform joins and lookups on only the required data
Perform evaluations on the minimum number of events possible
Move commands that bring data to the search head as late as possible in our search criteria.
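
As a sketch of the first method, compare a search that filters late with one that filters in the initial search segment. The index, field, and value names here are illustrative assumptions, not taken from the original:

Less efficient, filtering after retrieval:
index=web | search status=404 | stats count by host

More efficient, filtering in the initial search:
index=web status=404 | stats count by host
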
Indexes and lookups

The Splunk platform uses the information in the index files to determine which events can be retrieved from disk when we run a search. The lower the number of events to be retrieved from disk, the faster the search runs.

How we construct our search can have a huge effect on the number of events retrieved from disk.

When data is indexed, the data is broken into events based on time.

The processed data consists of several files:

The raw data in compressed form (rawdata)
The indexes that point to the raw data (index files, also referred to as tsidx files)
Some metadata files

These files are written to disk and reside in age-organized directory sets called buckets.
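
As a rough illustration, a single warm bucket on disk might contain the following; the path and file names are a sketch only, and exact naming varies by Splunk version:

$SPLUNK_DB/myindex/db/db_1577829600_1577743200_12/
  rawdata/journal.gz (the compressed raw data)
  *.tsidx (the index files that point into the raw data)
  Hosts.data, Sources.data, SourceTypes.data (metadata files)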

Use indexes effectively


One way of limiting the data extracted from disk is to partition data into separate indexes. If we rarely search multiple data types at a time, place the different data types in separate indexes and limit each search to the specific index it needs. For example, store web access data in one index and firewall data in another. This is especially useful for sparse data, which may otherwise be lost in a large amount of irrelevant data.
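
For instance, with the data split into hypothetical web and firewall indexes (the index and field names are assumptions), each search reads only the data it needs:

index=web status=404
index=firewall action=blocked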

An optimized search

Consider a search that retrieves 1 million events from the index, applies a lookup and an eval, and then filters the results on the criteria A=25, L>100, and E>50. We can optimize the entire search by moving some of these components from the later search segments to locations earlier in the search process.

Moving the criterion A=25 before the first pipe filters the events earlier and reduces the number of events extracted from the index to 300,000, a reduction of 700,000 compared to the original search. The lookup is then performed on 300,000 events instead of 1 million events. Moving the criterion L>100 to immediately after the lookup filters the events further, reducing the number of remaining events by another 100,000. The eval is performed on 200,000 events instead of 1 million events.

The criterion E>50 depends on the results of the eval command and cannot be moved. The results are the same as for the original search: 50,000 events are returned, but with much less impact on resources.
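
The original example search is not reproduced in this text, but a hypothetical reconstruction consistent with the figures above might look as follows. The index name, lookup name, and eval expression are assumptions; only the fields A, L, and E and the event counts come from the description:

Original search, which extracts 1,000,000 events from the index:
index=main | lookup mylookup A OUTPUT L | eval E=L*2 | search A=25 L>100 E>50

Optimized search, which extracts 300,000 events; the lookup runs on 300,000 events, the eval on 200,000, and 50,000 events are returned:
index=main A=25 | lookup mylookup A OUTPUT L | search L>100 | eval E=L*2 | search E>50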

Quick tips for optimization


The key to fast searching is to limit the data to the absolute minimum that needs
to be pulled from the disk. In the search, filter the data as early as possible, so
processing takes place on the minimum amount of data needed.

Limit the data from disk


Techniques for restricting the amount of data retrieved from disk include setting a narrow time window, making the search criteria as precise as possible, and retrieving the smallest number of events required.

Narrow the time window


Limiting the time span is one of the most powerful ways to restrict the data that is read from disk. Use the time range picker or specify time modifiers in our search to identify the smallest time window necessary for our search.

If we need to view data from the last hour only, don't use the Last 24 hours
default time range.

If we must use a broad time range, such as Last week or All time, then use other techniques to limit the amount of data retrieved from disk.
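
For example, a search can be pinned to the last hour with the earliest and latest time modifiers; the index name and keyword here are illustrative:

index=web earliest=-1h latest=now error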

Specify the index, source, or source type


To optimize our searches, it is necessary to understand how our data is structured. Take the time to learn which indexes contain our data, what the data sources are, and which source types apply. Knowing this information lets us narrow down our searches.

Run the following search:

*

This search is not optimized, but it provides us with an opportunity to learn about the data we have access to.
In the Selected fields list, click on each field and look at the values for host, source, and sourcetype.
In the Interesting fields list, click on index and look at the names of the indexes that we have access to.
In our searches, specify the index, source, or source type wherever possible. When the Splunk platform indexes data, it automatically adds a number of fields to each event. The index, host, source, and sourcetype fields are added to each event as default fields. A default field is an indexed field that the Splunk platform recognizes at search time. The host, source, and sourcetype fields describe where the event originated.
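
As a sketch, a search that specifies all three of these default fields might look like this; the index, source path, and sourcetype are illustrative assumptions:

index=web sourcetype=access_combined source="/var/log/apache/access.log" error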

Write better searches


This topic examines some of the causes of slow searches and includes guidelines to help us write more efficient searches. Several factors can influence the speed of our searches, including:

The volume of data that we are searching
How our searches are constructed
The number of concurrent searches
To optimize the speed at which our search runs, minimize the processing time
required for each component of the search.

Know your type of search


Search optimization guidelines depend on the type of search we are running and the characteristics of the data we are searching. Searches fall into two categories, based on the objective we wish to accomplish: a search is either intended to retrieve events, or designed to produce a report that summarizes or organizes the data.

Searches that retrieve events


Raw event searches retrieve events from an index without any further processing of the retrieved events. When retrieving events from the index, be as specific as possible about the events we want to see. This can be done with keywords and field-value pairs unique to the events.

If the events in the dataset we want to retrieve occur frequently, the search is
called a dense search. If the events in the dataset that we want to retrieve are
rare, the search is called a sparse search. Sparse searches that run against large
data volumes take longer than dense searches for the same data set.
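
As a hypothetical illustration (the index, sourcetype, and values are assumptions), the first search below is dense because most web access events match it, while the second is sparse because it looks for a rare status code and phrase:

index=web sourcetype=access_combined status=200
index=web sourcetype=access_combined status=503 "payment service timeout"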

Searches that generate reports


Report-generating searches, also called transforming searches, perform additional analysis on events after they are retrieved from an index. This processing can include filtering, transforming, and other operations that apply one or more statistical functions to the result set. Because this processing takes place in memory, the more restrictively and precisely we retrieve the events, the faster the search runs.
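
A sketch of a transforming search that filters tightly before computing statistics (the index, sourcetype, and fields are illustrative):

index=web sourcetype=access_combined status>=500 | stats count by host, status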

Tips for tuning your searches


In most cases, a search is slow to retrieve events from the index because of the complexity of the query. For instance, if our search contains extremely large OR lists, complex subsearches (which expand into OR lists), or phrase searches, processing takes longer. This section explores tips to fine-tune our searches and make them more efficient.

It takes a lot of memory to compute statistics with a BY clause on a set of field values that have high cardinality, that is, many uncommon or unique values. One potential solution is to lower the value of the chunk size setting used with the tstats command. It can also be beneficial to reduce the number of distinct values that the BY clause must process.
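
Assuming the setting referred to above is the chunk_size argument of tstats, a sketch of this tuning might look like the following; the index name and BY fields are illustrative:

| tstats chunk_size=50000 count where index=web by host sourcetype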

Restrict searches to the specific index


If we rarely search over more than one data type at a time, divide the different data types into separate indexes, and then limit our searches to the specific index. For example, store web access data in one index and firewall data in another. This is especially recommended for sparse data, which could otherwise be buried in a large volume of unrelated data.
