0% found this document useful (0 votes)

307 views47 pages

Architecang and Sizing Your Splunk Deployment: Simeon Yep

Uploaded by

Securisq Networks

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

307 views47 pages

Architecang and Sizing Your Splunk Deployment: Simeon Yep

Uploaded by

Securisq Networks

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 47

Copyright

© 2013 Splunk Inc.

ArchitecAng and Sizing

your Splunk Deployment
Simeon Yep
Sales Engineering Manager, Splunk

#splunkconf
Legal NoAces
During the course of this presentaAon, we may make forward-‐looking statements regarding future events or the
expected performance of the company. We cauAon you that such statements reflect our current
expectaAons and esAmates based on factors currently known to us and that actual events or results could differ
materially. For important factors that may cause actual results to differ from those contained in our forward-‐
looking statements, please review our filings with the SEC. The forward-‐looking statements made in this
presentaAon are being made as of the Ame and date of its live presentaAon. If reviewed aTer its live
presentaAon, this presentaAon may not contain current or accurate informaAon. We do not assume any
obligaAon to update any forward-‐looking statements we may make. In addiAon, any informaAon about
our roadmap outlines our general product direcAon and is subject to change at any Ame without noAce. It is for
informaAonal purposes only and shall not, be incorporated into any contract or other commitment. Splunk
undertakes no obligaAon either to develop the features or funcAonality described or to include any such feature or
funcAonality in a future release.

Splunk, Splunk>, Splunk Storm, Listen to Your Data, SPL and The Engine for Machine Data are trademarks and registered trademarks of
Splunk Inc. in the United States and other countries. All other brand names, product names, or trademarks belong to their respecCve
owners.
©2013 Splunk Inc. All rights reserved.

2
IntroducAon
About Me
!  5+ years @ Splunk
!  Experience:
–  SupporAng, administering, and architecAng large scale deployments
–  OEM – technical sales
–  Strategic accounts – technical sales
!  Based in HQ (San Francisco oﬃce)
!  Currently: Business Development, Technical Synergies

4
Agenda
!  Sizing Fundamentals
!  ArchitecAng Fundamentals
!  Deployment Topologies

5
Sizing Fundamentals
Sizing Fundamentals
!  Understand the sizing factors
!  Data volume
!  Search volume
!  Sizing sheet

7
Sizing Factors
!  How much data (raw sizes)?
–  Daily volume
–  Peak volume
–  Retained volume (archive size)
–  Future volume?
!  How much searching?
–  Use cases
–  How many people? How oTen?
!  Jobs
–  SummarizaAon, alerAng, reporAng

8
Data Volumes
!  EsAmate input volume
–  Verify raw log sizes
–  Leverage _internal metrics to get actual input volumes
!  Conﬁrm esAmates with actual data
–  Create a baseline with real or simulated data
–  Find compression rates (range from 30%-‐120%, typically 50%)
–  Determine retenAon needs
!  Document use cases
–  Use case determines search needs
–  Plan for expansion as adopAon grows (search and volume)

9
Data Sizing Exercise
!  Via Filesystem
!  Use the Splunk log ﬁles: metrics.log or license_usage.log
!  OpAonally:
–  License report view in Splunk Enterprise 6
–  S.o.S app in 5.x

10
Search Volumes
!  Gather use case informaAon
–  How much ad-‐hoc searching?
–  How much background searching?
!  Ad-‐hoc searching
–  Evaluate the data being searched
–  Evaluate the Ame duraAon (real-‐Ame vs historic)
–  Real-‐Ame searches are typically less overhead
!  Background searching
–  AlerAng and monitoring
–  General reports
–  Summary indexing

11
Final Sizing Numbers
!  Data capacity
–  Daily and peak
!  User capacity
–  Concurrent and total
!  Search capacity
–  Concurrent and total

*Document the use cases!!

12
Architecture
Architecture
!  Splunk server roles: distributed/clustered deployments
!  Reference server
!  Rules of thumb
!  Hardware factors

14
Splunk Distributed Roles
Search Head (regular and job server)
search head

Indexer
indexer

Forwarder (universal)
forwarder

15
Splunk Distributed Roles

Cluster Master (clustering/replicaAon requirement)

License Master

Deployment Server

16
Recommended ConﬁguraAons
Stand-‐ Indexer Search head Indexer Search Cluster
alone (distributed) (distributed) (clustered) head master
(clustered) (clustered)
Forwarding * * * * *

Searching √ √ * √
Indexing √ √ * √
Deployment * *
server
License
master
* √ * *

Cluster
master √
√ -‐ common * -‐ uncommon

17
What s a Reference Server?
!  Sizing based on commodity x86 servers
!  Dual quad-‐core CPUs at 3.0 GHz (dual six core is common)
!  8 GB of RAM – (16 GB is common)
!  64-‐bit OS
!  4x10k RPM local SAS drives in RAID 1+0 (800+ IOPs)
!  VariaAons cause corresponding changes in performance/
requirements

18
Rules of Thumb
!  These all have excepAons and qualiﬁcaAons
!  1 reference indexer per 100 GB/day
!  1 reference search head per 8 to 12 users
!  1 reference job server per 20 concurrent jobs
!  1 deployment server per 3000 polls/min
!  ReplicaAon later…

19
How Many Indexers?
!  Rule of thumb says: 1 per 100 GB/day
!  Leaves room for:
–  Daily peaks
–  Light searching and reporAng for about 3 concurrent users
!  Need more indexers for:
–  Heavy reporAng
–  More users
–  Slower disks, slower CPUs, fewer CPUs

20
How Many Search Heads?
!  Rule of thumb says: 1 per 8 to 12 concurrent users
!  Limit is concurrent queries
!  30-‐50 web sessions
!  1:1 raAo of search query to CPU core
!  Only add ﬁrst search head if ≥3 indexers
!  Don’t add search heads; add indexers: indexers do most work
!  But you need more if:
–  Running a lot of scheduled jobs on the search head

21
Search Head vs. Job Server
!  Search Head Pooling (SHP): uses NFS to manage user proﬁles/
conﬁguraAons and job queue
!  Search head and job server are equivalent with SHP
!  Use job servers for scheduled searches (summaries, alerts,
and reports)
!  Use search heads for ad-‐hoc searching

22
How Many Deployment Servers?
!  Rule of thumb says: 1 per 3000 polls/minute
!  Just use one deployment server, and adjust the polling period
!  Small deployments can share the same splunkd
!  Low requirement for disk performance (good candidate
for virtualizaAon)
!  Windows OS – 1 per 500 polls/minute
!  Or use something other than deployment server
–  Puppet, SCCM, cfengine, chef…

23
More is Bexer?
!  CPUs
–  Search process uAlizes up to 1 CPU core (1:1)
–  Indexers sAll need to do the heavy liTing (search exists on indexer AND
search head)
–  Limited beneﬁt for indexing (up to 2 CPU cores for indexing)
!  Memory
–  Good for search heads and indexers (16+ GB)
!  Disks
–  Faster is bexer (15k rpm)
–  More disks in RAID 1+0 = faster

24
Performance and Sizing Tips
System change Search Speed Indexing Speed

Faster disks ++ ++
Add an indexer ++ ++
Add a search head +
Report acceleraAon/
summaries ++
25
Performance and Sizing Tips
System change Search Speed Indexing Speed

OpAmize searches +++

OpAmize ﬁeld extracAon +
OpAmize input parsing +
Faster CPU + +
26
Capacity à Architecture
!  Sizing recipe
–  Capacity
–  Rules of thumb determines number of servers
!  Building blocks for architecture

27
Architecture Factors
!  What are my sizing requirements?
!  Where is the data?
!  Where are the users?
!  What is the security policy?
!  What are the retenAon and compliance policies?
!  What is the availability requirement?
!  What about the cloud?

28
Architecture Factors
!  What are my sizing requirements?
–  Data capacity
–  Search capacity
–  User capacity
!  Obtained from the sizing process

29
Architecture Factors
!  Where is the data?
–  Local or remote to the indexing machine
–  If remote – use forwarders when possible
–  Index in local data center (zone) or index centrally
–  Persist network data to disk as a best pracAce
–  Use intermediate forwarders to distribute data
!  Where are the users?
–  User experience aﬀected by search head locaAon
ê  Time zone tuning (5.x +)
ê  Distributed search over LAN vs WAN

30
Architecture Factors
!  What is the Security Policy?
–  Apply user security policies
ê  Auth method
ê  Roles
ê  Filters
–  Apply physical security policies
ê  Index locaAon

31
Architecture Factors
!  RetenAon, compliance, governance
–  Where is the data allowed to be?
–  Where is the data not allowed to go?
–  Where must the data go?
!  Availability
–  Local failover, fault-‐tolerance, clustering
–  Geographic disaster recovery/fault-‐tolerance
–  Index replicaAon!

32
Architecture Factors
!  Same old story
!  Cloud consideraAons
–  AuthenAcaAon restricAons
–  Data transfer costs
–  Security – SSL tunnel
–  Zones

33
Topologies
Architecture Factors à Topology

Topology Examples

Centralized Decentralized Hybrid

35
Centralized Topology
Search Head Pooling

Indexers

Intermediate
Forwarders Forwarder Forwarders

Syslog Devices

36
Decentralized Topology

Search Head Pooling

37
Hybrid Topology

38
Scaling and Expansion
!  Add to your indexer pool for more performance or capacity
–  Mixed pla{orm and hardware is okay
!  Use search head pooling for more UI capacity
–  Requires NFS
!  Create new indexes for new data types
–  Follows best pracAces

39
Index ReplicaAon (aka Clustering)
!  What is it?
–  Indexes are replicated to 1 or more indexers (tunable)
–  Splunk controlled
!  Basics
–  Master node (manages indexing and searching locaAon)
–  Distributed deployment
–  NOT = “index and forward “
!  HA license VS. index replicaAon
–  HAL – Separate fully funcAoning Splunk deployments
–  IR – Data is made available on 1 or more indexers

40
Index ReplicaAon
!  Rule of thumb: 1 per 50 GB/day
–  Assume simple replicaAon (2 in existence)
–  Increase in I/O, CPU, and disk requirement
!  Need more indexers if:
–  Increase in replicaAon factor
–  Performance or capacity needs (search and index)

41
Index ReplicaAon (aka – Clustering)
Cluster Master Search Head Pooling

Forwarders Peer Nodes

42
Index ReplicaAon
!  Data is replicated and made available
!  WAN conﬁguraAon is not recommended
!  Careful consideraAon when inserted into standard topologies
!  Increases
–  Storage requirement
–  I/O requirement (disk, network)
–  Total indexer requirement
!  Disaster recovery and high availability .conf session

43
Final Thoughts
!  Sizing is more than data volume – it’s also search load
!  Centralized architecture is the baseline
!  VariaAons on architecture are driven by
–  Sizing
–  Data locaAon
–  User locaAon
–  RetenAon/access/governance
–  Availability requirements

44
Next Steps
1 Download the .conf2013 Mobile App
If not iPhone, iPad or Android, use the Web App

2 Take the survey & WIN A PASS FOR .CONF2014… Or one of these
bags!
3
View the sessions listed on the next slide
Available on the Mobile App

45
More InformaAon
!  Contact: [email protected]
!  DocumenaAon: hxp://docs.splunk.com
!  Answers: hxp://answers.splunk.com
!  Other presentaAons
–  Best PracAces: Deploying Splunk on Physical, Virtual, and Cloud Environments
–  ArchitecAng Splunk for High Availability and Disaster Recovery

46
THANK YOU

Splunk Course Notes
No ratings yet
Splunk Course Notes
70 pages
Splunk Fundamentals
No ratings yet
Splunk Fundamentals
9 pages
Using Enterprise Security v7.3 Slides Oct2024
No ratings yet
Using Enterprise Security v7.3 Slides Oct2024
264 pages
Splunk's Architecture
No ratings yet
Splunk's Architecture
4 pages
MS Office
No ratings yet
MS Office
49 pages
Loan Origination System: A Minor Project Report On
No ratings yet
Loan Origination System: A Minor Project Report On
38 pages
Electrical Switchboards Form Separation
100% (1)
Electrical Switchboards Form Separation
16 pages
Splunk Getting Started With Itsi
No ratings yet
Splunk Getting Started With Itsi
23 pages
Best Practices and Better Practices For Admins Latest Slides: Collaborate: #Bestpractices Sign Up at HTTP://SPLK - It/slack
No ratings yet
Best Practices and Better Practices For Admins Latest Slides: Collaborate: #Bestpractices Sign Up at HTTP://SPLK - It/slack
105 pages
Splunk Admin Course Contents
100% (1)
Splunk Admin Course Contents
4 pages
Netwitness® Endpoint Installation Guide: For Version 4.4
No ratings yet
Netwitness® Endpoint Installation Guide: For Version 4.4
158 pages
19-Full Wave Rectifier With Filter and Regulator+earth Resistance Measurement-20-12-2022
No ratings yet
19-Full Wave Rectifier With Filter and Regulator+earth Resistance Measurement-20-12-2022
3 pages
Splunk Lab - Intro To Dashboards
No ratings yet
Splunk Lab - Intro To Dashboards
12 pages
NVP1918 Nextchip
No ratings yet
NVP1918 Nextchip
122 pages
ETSI TS 101 211: Digital Video Broadcasting (DVB) Implementation and Usage of Service Information (SI)
No ratings yet
ETSI TS 101 211: Digital Video Broadcasting (DVB) Implementation and Usage of Service Information (SI)
67 pages
Splunk Module 9 Troubleshooting Methods and Tools
100% (1)
Splunk Module 9 Troubleshooting Methods and Tools
38 pages
The Gic Cookbook: A Guide To Establishing A New Global In-House Centre (GIC) in India
No ratings yet
The Gic Cookbook: A Guide To Establishing A New Global In-House Centre (GIC) in India
24 pages
Business and It Strategic Alignment Applying Soea Framework: Nassir Dino, Awel Dico, PHD, Dida Midekso, PHD
No ratings yet
Business and It Strategic Alignment Applying Soea Framework: Nassir Dino, Awel Dico, PHD, Dida Midekso, PHD
8 pages
Ddr4 Sdram Rdimm Core: Product Description
No ratings yet
Ddr4 Sdram Rdimm Core: Product Description
24 pages
A Brief History of Free Space Optical Communications
0% (1)
A Brief History of Free Space Optical Communications
24 pages
SAW As RFID Tags and Sensor Report Final
No ratings yet
SAW As RFID Tags and Sensor Report Final
22 pages
Splunk Development Day 4: - Vikram Yadav (VY)
No ratings yet
Splunk Development Day 4: - Vikram Yadav (VY)
18 pages
John Dulemba 08292011
No ratings yet
John Dulemba 08292011
6 pages
Intel Quality Manual
No ratings yet
Intel Quality Manual
16 pages
Splunk-7 2 1-Indexer
No ratings yet
Splunk-7 2 1-Indexer
446 pages
Rsa Nwe 4.4 User Guide PDF
No ratings yet
Rsa Nwe 4.4 User Guide PDF
294 pages
Rab CCTV
No ratings yet
Rab CCTV
10 pages
Assembly of Excavator
No ratings yet
Assembly of Excavator
15 pages
Scalability and High Volume Performance of Indexer Clustering at Splunk
No ratings yet
Scalability and High Volume Performance of Indexer Clustering at Splunk
44 pages
Untitled - Load Flow Report PDF
No ratings yet
Untitled - Load Flow Report PDF
2 pages
Europass CV 20130527 Odipiyo EN
No ratings yet
Europass CV 20130527 Odipiyo EN
4 pages
835 Companion Guide
No ratings yet
835 Companion Guide
17 pages
Splunk Kiran Resume 1
100% (1)
Splunk Kiran Resume 1
4 pages
NS2-DH01-P0ZEN-140003 - ITP FOR ELECTRICAL EQUIPMENT (MV, LV, PANEL, CUBICLE) - Rev.D
No ratings yet
NS2-DH01-P0ZEN-140003 - ITP FOR ELECTRICAL EQUIPMENT (MV, LV, PANEL, CUBICLE) - Rev.D
10 pages
Cada Manual
No ratings yet
Cada Manual
6 pages
Ableton Midi Controller
No ratings yet
Ableton Midi Controller
3 pages
Architecting Splunk For High Availability and Disaster Recovery
No ratings yet
Architecting Splunk For High Availability and Disaster Recovery
47 pages
Assertion Reasoning MCQ Questions Answers
No ratings yet
Assertion Reasoning MCQ Questions Answers
7 pages
Aliza Khokhar Coal Lab 4
No ratings yet
Aliza Khokhar Coal Lab 4
4 pages
Toshiba Universal Smart X Series 4
No ratings yet
Toshiba Universal Smart X Series 4
14 pages
How To Monitor - Etc - Shadow and - Etc - Passwd File For Changes With Auditd - The Geek Diary
No ratings yet
How To Monitor - Etc - Shadow and - Etc - Passwd File For Changes With Auditd - The Geek Diary
4 pages
Data Onboarding From Scratch
No ratings yet
Data Onboarding From Scratch
51 pages
Optimal Sizing of A Wind, Fuel Cell, Electrolyzer, Battery and Supercapacitor System For Off-Grid Applications
No ratings yet
Optimal Sizing of A Wind, Fuel Cell, Electrolyzer, Battery and Supercapacitor System For Off-Grid Applications
14 pages
Discussion Lab3C PLC
33% (3)
Discussion Lab3C PLC
3 pages
Splunk Development Day 5: - Vikram Yadav (VY)
No ratings yet
Splunk Development Day 5: - Vikram Yadav (VY)
29 pages
Splunk Indexer Clustering Masterclass 101
100% (1)
Splunk Indexer Clustering Masterclass 101
6 pages
Conf2015 DWaddle DefensePointSecurity Deploying SplunkSSLBestPractices
No ratings yet
Conf2015 DWaddle DefensePointSecurity Deploying SplunkSSLBestPractices
50 pages
Splunk-7 2 1-Admin
100% (2)
Splunk-7 2 1-Admin
940 pages
HHI Generator Manual-FINAL
No ratings yet
HHI Generator Manual-FINAL
45 pages
2 - Introduction To The ME Engine PDF
67% (3)
2 - Introduction To The ME Engine PDF
14 pages
Splunk 6.4 Administration - Splunk
0% (1)
Splunk 6.4 Administration - Splunk
5 pages
Splunk 6.4.0 Troubleshooting
No ratings yet
Splunk 6.4.0 Troubleshooting
117 pages
Virtual Elements in CDS Views 1731724717
No ratings yet
Virtual Elements in CDS Views 1731724717
7 pages
Ossec in The Enterprise Final LR
No ratings yet
Ossec in The Enterprise Final LR
129 pages
Log Stash
No ratings yet
Log Stash
41 pages
Splunk Test Blueprint Architect v.1.1
No ratings yet
Splunk Test Blueprint Architect v.1.1
4 pages
STEP - Splunk Training and Enablement Platform
No ratings yet
STEP - Splunk Training and Enablement Platform
14 pages
TVL CSS11 - Q3 - M7
No ratings yet
TVL CSS11 - Q3 - M7
16 pages
Splunk DBX 1.0.9 DeployDBX
No ratings yet
Splunk DBX 1.0.9 DeployDBX
41 pages
02.splunk Install SplunkDBConnect
No ratings yet
02.splunk Install SplunkDBConnect
3 pages
Linux Journal - August 2017
No ratings yet
Linux Journal - August 2017
122 pages
Splunk Development Day 3: - Vikram Yadav (VY)
No ratings yet
Splunk Development Day 3: - Vikram Yadav (VY)
10 pages
Advanced Dashboards & Visualizations - Labs: Dashboard in The Course App
100% (1)
Advanced Dashboards & Visualizations - Labs: Dashboard in The Course App
52 pages
Splunk Dump
No ratings yet
Splunk Dump
33 pages
Search Report 42
No ratings yet
Search Report 42
159 pages
Sample Data Into Splunk Enterprise
No ratings yet
Sample Data Into Splunk Enterprise
58 pages
Splunk Questions and Answers Final Document
No ratings yet
Splunk Questions and Answers Final Document
128 pages
Dynamic Dashboards 9.1 Slides
No ratings yet
Dynamic Dashboards 9.1 Slides
78 pages
Introduction To ITSI
No ratings yet
Introduction To ITSI
18 pages
Lab 8 Splunk Boss of The SOC (15 Pts + 20 Pts Extra)
No ratings yet
Lab 8 Splunk Boss of The SOC (15 Pts + 20 Pts Extra)
16 pages
How To Create Indexer Cluster Using CLI in Splunk Under 10 Mins
No ratings yet
How To Create Indexer Cluster Using CLI in Splunk Under 10 Mins
7 pages
Useful Cli Commands
No ratings yet
Useful Cli Commands
10 pages
IT2184 Tunning
No ratings yet
IT2184 Tunning
36 pages
SPLK-2002Dumps
No ratings yet
SPLK-2002Dumps
12 pages
Splunk Admin42 Ver1.1
100% (1)
Splunk Admin42 Ver1.1
310 pages
© 2019 Caendra Inc. - Hera For IHRP - Effectively Using Splunk (Scenario 2)
No ratings yet
© 2019 Caendra Inc. - Hera For IHRP - Effectively Using Splunk (Scenario 2)
22 pages
Dataonboarding
No ratings yet
Dataonboarding
17 pages
Splunk Validated Architectures: October 2020
100% (1)
Splunk Validated Architectures: October 2020
48 pages
Splunk Fundamentals 1 Lab Exercises: (Sourcetype DB - Audit) (Cs - Mime - Type)
No ratings yet
Splunk Fundamentals 1 Lab Exercises: (Sourcetype DB - Audit) (Cs - Mime - Type)
8 pages
Courses For Itsi Admins
No ratings yet
Courses For Itsi Admins
1 page
Multivalue Fields - Lab Guide: Index Type Sourcetype Interesting Fields
No ratings yet
Multivalue Fields - Lab Guide: Index Type Sourcetype Interesting Fields
17 pages
SHC Cheatsheet
No ratings yet
SHC Cheatsheet
2 pages
Splunk Lab - Creating Maps
No ratings yet
Splunk Lab - Creating Maps
19 pages
Courses For Cloud Customers
No ratings yet
Courses For Cloud Customers
1 page
Software Presentation Easy Worship 2007 v1
100% (3)
Software Presentation Easy Worship 2007 v1
5 pages
Splunk Lab - Data Models
No ratings yet
Splunk Lab - Data Models
14 pages
Splunk Design
No ratings yet
Splunk Design
8 pages
SPLNK ADM Splunk Certified Administrator
No ratings yet
SPLNK ADM Splunk Certified Administrator
1 page
Splunk Punk: Taming Logs, Alerts, and the Chaos of SIEM
From Everand
Splunk Punk: Taming Logs, Alerts, and the Chaos of SIEM
Scott Markham
No ratings yet
Splunk Lab - Search Under The Hood
No ratings yet
Splunk Lab - Search Under The Hood
11 pages
Learning SaltStack - Second Edition
From Everand
Learning SaltStack - Second Edition
Colton Myers
No ratings yet
Ultimate AWS Certified Solutions Architect Associate Exam Guide: Master Designing Resilient, Scalable Architectures with Core and Advanced AWS Services to Crack the SAA-C03 Certification (English Edition)
From Everand
Ultimate AWS Certified Solutions Architect Associate Exam Guide: Master Designing Resilient, Scalable Architectures with Core and Advanced AWS Services to Crack the SAA-C03 Certification (English Edition)
Venkata Sasi Kanumuri
No ratings yet
Mastering Splunk for Cybersecurity: Advanced Threat Detection and Analysis
From Everand
Mastering Splunk for Cybersecurity: Advanced Threat Detection and Analysis
Robert Johnson
No ratings yet
Mastering Active Directory
From Everand
Mastering Active Directory
VICTOR P HENDERSON
No ratings yet

Architecang and Sizing Your Splunk Deployment: Simeon Yep

Uploaded by

Architecang and Sizing Your Splunk Deployment: Simeon Yep

Uploaded by

Copyright

© 2013 Splunk Inc.

ArchitecAng and Sizing

Cluster Master (clustering/replicaAon requirement)

OpAmize searches +++

Centralized Decentralized Hybrid

Search Head Pooling

Forwarders Peer Nodes

You might also like