SlideShare a Scribd company logo
NYSE: DVN
devonenergy.com
Elastic Cloud Enterprise in Azure
A Devon Energy Story
Devon - Internal
2
Paul PC
• Alphabet soup: MBA, GSE, GREM, GCIA, GCIH, GSEC, GPEN, GPYC, CISSP
• Security Architect / Team Lead for Devon Energy, an Independent E&P
headquartered in OKC
• Fluent in Romanian, English, Python
• Loves the cloud almost as much as his Ducati
Prakhar S
• Educational Kaizen : Bachelor of Engineering (CSE), RHCE, SAS® Certified
• Works for Accenture Technology, collaborating with Devon Energy currently
• Close to nine years of experience, aspires to contribute in cutting edge tech
Who Are We?
Devon - Internal
3
Agenda / History
2013: ES 1.7 and Moloch
2015: Production-grade Servers
2017: Elastic Search in Security
Analytics Platform
2019: SIEM replacement
Devon - Internal
4
2017: Analytics Platform v1.0 – Working Experiments
Axioms:
• Collaboration with Data
Analytics team
• Only Cloud components
• Use as much azure 1st party
as possible
• Be able to ingest 500GB/
day
Devon - Internal
5
Approximately
• 100 billion documents
• 100 TB of Indexed data
• 1.5 – 2.0 TB DAILY
• 1300 indices
• 10000 shards
• Ingestion rate approximately 20-25k EPS
Scale
Devon - Internal
6
2019: Analytics Platform v3.0 – Production SIEM++
Enrichment moved to
Logstash
DYI Retention to BLOB
First shard optimizations
efforts
Automation is our best L1 HoD
More data, more user stories
• Network
• AKS
• Accounting
• SAP security
Devon - Internal
7
• Traditional SIEM problems:
• Retention
• Parsing - legacy logs are terrible
• WEF is great, until it isn’t
• Just enough normalization
• Enrichment > Correlation
• Atomic indicators
• Identity information
• Cluster > purpose collectors
• Logstash nodes are cattle, not pets
• Puppet was good, K8s is great
• Throughput vs. node count
Logstash - The Core of Our FrankenSIEM

Devon - Internal
8
Intent : Security should not be managing
it
What’s cool about ECE
• It saves Time : Streamline scaling, securing,
upgrading the stack
• Great way to enhance utility and value was to
grant other teams access to their logs
• Centralized management of logstash pipelines
and cluster user settings is cool
• Inbuilt monitoring gives a great insight
It was very different than traditional ES
• Troubleshooting is a bit different
• Elasticsearch.yml is not to be trifled with
• Initially, had to rely a lot more on support
Why ECE
8
Devon - Internal
9
Indexing throughput
• Use dedicated Master Nodes
• Index refresh interval refresh_interval : 30s (even -1 for initial load)
• Tune Indexing buffer size
OOMs
• Give JVM 50% of available memory
• Prevent JVM resizing : minHeap=maxHeap
Cautiously budget your cache
• Limit and monitor Field Data cache : it never goes away
• Using circuit breaker can save the day
Disk Sizing
• Monitor number of replica for your use case
• Experiment with Sharding : Larger shards means better indexing rate but needs can vary; more shards come with cost
Appraise your precedence, go Quid-pro-quo
• Memory Intensive queries vs near-real-time data vs long-term retention
Major Gotchas
Devon - Internal
10
• It hurt being the first Azure customers for ECE
• Docker ignorance didn’t help
• ECE 1.1.x supported specific version of docker with specific version of kernel
• Updates tanked the cluster at-least twice – multi zone helps
• Problems at our scale:
• Migrating shards between containers takes days
• In-place container upgrades:
advanced configuration will save your day/week
• Moving to dedicated master nodes – election problems
• Unexpected socket hang ups, difficult to diagnose
• Monitoring:
• Independent Monitoring cluster
• Created our own health check scripts
What We Learned – Operational Glitches
Devon - Internal
11
What’s Next For Us
Chugging the Docker and Kubernetes
Kool-Aid:
• Curator
• Enrichment data collection
• Logstash + autoscaling
• Health checks
Dedicated nodes:
• ML Nodes
• Search Only Nodes
• Ingest nodes
Separate security and non-security to
their own spaces
SAML / OpenIDc with MFA
Thank you.
Come call BS at Happy Hour


Contact info:
@p4ulpc – paul.poputa-clean@dvn.c
Prakhar.sengar@dvn.com
Code:
https://github.com/paulpc/elasticon

More Related Content

PDF
Elastic on a Hyper-Converged Infrastructure for Operational Log Analytics
PDF
Capgemini: Observability within the Dutch government
PDF
CSX: Real-time Business Discovery with the Elastic Stack
PDF
Taking Care of Business at Office Depot with Elastic Cloud Enterprise
PDF
Better Search and Business Analytics at Southern Glazer’s Wine & Spirits
PDF
Elastic Cloud Enterprise @ Cisco
PDF
Elastic at Procter & Gamble: A Network Story
PDF
Improving search at Wellcome Collection
Elastic on a Hyper-Converged Infrastructure for Operational Log Analytics
Capgemini: Observability within the Dutch government
CSX: Real-time Business Discovery with the Elastic Stack
Taking Care of Business at Office Depot with Elastic Cloud Enterprise
Better Search and Business Analytics at Southern Glazer’s Wine & Spirits
Elastic Cloud Enterprise @ Cisco
Elastic at Procter & Gamble: A Network Story
Improving search at Wellcome Collection

What's hot (20)

PDF
Centralized logging in a changing environment at the UK’s DVLA
PDF
Security Events Logging at Bell with the Elastic Stack
PDF
Elastic @ John Deere
PDF
Industrial production process visualization with the Elastic Stack in real-ti...
PDF
Transformational Search Performance with EnergyIQ
PDF
Log Monitoring and Anomaly Detection at Scale at ORNL
PDF
Migrating a legacy logging system: Etsy’s journey to Elastic Cloud
PDF
Achieving cyber mission assurance with near real-time impact
PDF
How eStruxture Data Centers is Using ECE to Rapidly Scale Their Business
PDF
How KeyBank Used Elastic to Build an Enterprise Monitoring Solution
PDF
Building a reliable and cost effect logging system at Box
PDF
The Elastic Evolution of CenturyLink’s Network Management System
PDF
Logging, Metrics, and APM: The Operations Trifecta
PDF
Siscale Lightning Talk: Automated Root Cause Analysis with Elastic Stack
PPTX
Getting Started with Runtime Security on Azure Kubernetes Service (AKS)
PPTX
Scaling Your Skillset with Your Data with Jarrett Garcia (Nielsen)
PDF
Empower Your Security Practitioners with Elastic SIEM
PDF
American Ancestors Use Case - Scalability & Support Using the Elasticsearch S...
PDF
Bandwidth: Use Cases for Elastic Cloud on Kubernetes
PDF
Datadog: From a single product to a growing platform by Alexis Lê-Quôc, CTO
Centralized logging in a changing environment at the UK’s DVLA
Security Events Logging at Bell with the Elastic Stack
Elastic @ John Deere
Industrial production process visualization with the Elastic Stack in real-ti...
Transformational Search Performance with EnergyIQ
Log Monitoring and Anomaly Detection at Scale at ORNL
Migrating a legacy logging system: Etsy’s journey to Elastic Cloud
Achieving cyber mission assurance with near real-time impact
How eStruxture Data Centers is Using ECE to Rapidly Scale Their Business
How KeyBank Used Elastic to Build an Enterprise Monitoring Solution
Building a reliable and cost effect logging system at Box
The Elastic Evolution of CenturyLink’s Network Management System
Logging, Metrics, and APM: The Operations Trifecta
Siscale Lightning Talk: Automated Root Cause Analysis with Elastic Stack
Getting Started with Runtime Security on Azure Kubernetes Service (AKS)
Scaling Your Skillset with Your Data with Jarrett Garcia (Nielsen)
Empower Your Security Practitioners with Elastic SIEM
American Ancestors Use Case - Scalability & Support Using the Elasticsearch S...
Bandwidth: Use Cases for Elastic Cloud on Kubernetes
Datadog: From a single product to a growing platform by Alexis Lê-Quôc, CTO
Ad

Similar to Elastic Cloud Enterprise in Azure with Devon (20)

PPTX
Simplify Your Way To Expert Kubernetes Management
PPTX
Kubernetes at NU.nl (Kubernetes meetup 2019-09-05)
PPTX
A Case Study of NoSQL Adoption: What Drove Wordnik Non-Relational?
PPTX
Application Deployment and Management at Scale at 1&1
PDF
Ceph Day Shanghai - Opening
PDF
Application Deployment and Management at Scale with 1&1 by Matt Baldwin
PPTX
Case Study: University Alabama-Birmingham.
PPTX
OpenStack and Ceph case study at the University of Alabama
PDF
A1 engineered systems principles and architecture
PDF
Storage os kubernetes clusters need persistent data
PPTX
Webinar: Sizing Up Object Storage for the Enterprise
PDF
Five Years of EC2 Distilled
PPTX
Database as a Service (DBaaS) on Kubernetes
PDF
DCSF19 Container Security: Theory & Practice at Netflix
PPTX
Deliver Best-in-Class HPC Cloud Solutions Without Losing Your Mind
PDF
Criteo meetup - S.R.E Tech Talk
PPTX
Managing Security At 1M Events a Second using Elasticsearch
PPTX
Operating OpenStack on a Budget
PPTX
Operating OpenStack on a Budget
PPTX
Webcast: DevOps in AWS is different! How can containers help?
Simplify Your Way To Expert Kubernetes Management
Kubernetes at NU.nl (Kubernetes meetup 2019-09-05)
A Case Study of NoSQL Adoption: What Drove Wordnik Non-Relational?
Application Deployment and Management at Scale at 1&1
Ceph Day Shanghai - Opening
Application Deployment and Management at Scale with 1&1 by Matt Baldwin
Case Study: University Alabama-Birmingham.
OpenStack and Ceph case study at the University of Alabama
A1 engineered systems principles and architecture
Storage os kubernetes clusters need persistent data
Webinar: Sizing Up Object Storage for the Enterprise
Five Years of EC2 Distilled
Database as a Service (DBaaS) on Kubernetes
DCSF19 Container Security: Theory & Practice at Netflix
Deliver Best-in-Class HPC Cloud Solutions Without Losing Your Mind
Criteo meetup - S.R.E Tech Talk
Managing Security At 1M Events a Second using Elasticsearch
Operating OpenStack on a Budget
Operating OpenStack on a Budget
Webcast: DevOps in AWS is different! How can containers help?
Ad

More from Elasticsearch (20)

PDF
An introduction to Elasticsearch's advanced relevance ranking toolbox
PDF
From MSP to MSSP using Elastic
PDF
Cómo crear excelentes experiencias de búsqueda en sitios web
PDF
Te damos la bienvenida a una nueva forma de realizar búsquedas
PDF
Tirez pleinement parti d'Elastic grâce à Elastic Cloud
PDF
Comment transformer vos données en informations exploitables
PDF
Plongez au cœur de la recherche dans tous ses états.
PDF
Modernising One Legal Se@rch with Elastic Enterprise Search [Customer Story]
PDF
An introduction to Elasticsearch's advanced relevance ranking toolbox
PDF
Welcome to a new state of find
PDF
Building great website search experiences
PDF
Keynote: Harnessing the power of Elasticsearch for simplified search
PDF
Cómo transformar los datos en análisis con los que tomar decisiones
PDF
Explore relève les défis Big Data avec Elastic Cloud
PDF
Comment transformer vos données en informations exploitables
PDF
Transforming data into actionable insights
PDF
Opening Keynote: Why Elastic?
PDF
Empowering agencies using Elastic as a Service inside Government
PDF
The opportunities and challenges of data for public good
PDF
Enterprise search and unstructured data with CGI and Elastic
An introduction to Elasticsearch's advanced relevance ranking toolbox
From MSP to MSSP using Elastic
Cómo crear excelentes experiencias de búsqueda en sitios web
Te damos la bienvenida a una nueva forma de realizar búsquedas
Tirez pleinement parti d'Elastic grâce à Elastic Cloud
Comment transformer vos données en informations exploitables
Plongez au cœur de la recherche dans tous ses états.
Modernising One Legal Se@rch with Elastic Enterprise Search [Customer Story]
An introduction to Elasticsearch's advanced relevance ranking toolbox
Welcome to a new state of find
Building great website search experiences
Keynote: Harnessing the power of Elasticsearch for simplified search
Cómo transformar los datos en análisis con los que tomar decisiones
Explore relève les défis Big Data avec Elastic Cloud
Comment transformer vos données en informations exploitables
Transforming data into actionable insights
Opening Keynote: Why Elastic?
Empowering agencies using Elastic as a Service inside Government
The opportunities and challenges of data for public good
Enterprise search and unstructured data with CGI and Elastic

Recently uploaded (20)

PPTX
Big Data Technologies - Introduction.pptx
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Transforming Manufacturing operations through Intelligent Integrations
PPTX
Telecom Fraud Prevention Guide | Hyperlink InfoSystem
PDF
Advanced Soft Computing BINUS July 2025.pdf
PPTX
Cloud computing and distributed systems.
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
HCSP-Presales-Campus Network Planning and Design V1.0 Training Material-Witho...
PPT
Teaching material agriculture food technology
PDF
Modernizing your data center with Dell and AMD
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Advanced IT Governance
Big Data Technologies - Introduction.pptx
NewMind AI Weekly Chronicles - August'25 Week I
Transforming Manufacturing operations through Intelligent Integrations
Telecom Fraud Prevention Guide | Hyperlink InfoSystem
Advanced Soft Computing BINUS July 2025.pdf
Cloud computing and distributed systems.
The Rise and Fall of 3GPP – Time for a Sabbatical?
HCSP-Presales-Campus Network Planning and Design V1.0 Training Material-Witho...
Teaching material agriculture food technology
Modernizing your data center with Dell and AMD
NewMind AI Monthly Chronicles - July 2025
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Advanced methodologies resolving dimensionality complications for autism neur...
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
MYSQL Presentation for SQL database connectivity
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Advanced IT Governance

Elastic Cloud Enterprise in Azure with Devon

  • 1. NYSE: DVN devonenergy.com Elastic Cloud Enterprise in Azure A Devon Energy Story
  • 2. Devon - Internal 2 Paul PC • Alphabet soup: MBA, GSE, GREM, GCIA, GCIH, GSEC, GPEN, GPYC, CISSP • Security Architect / Team Lead for Devon Energy, an Independent E&P headquartered in OKC • Fluent in Romanian, English, Python • Loves the cloud almost as much as his Ducati Prakhar S • Educational Kaizen : Bachelor of Engineering (CSE), RHCE, SAS® Certified • Works for Accenture Technology, collaborating with Devon Energy currently • Close to nine years of experience, aspires to contribute in cutting edge tech Who Are We?
  • 3. Devon - Internal 3 Agenda / History 2013: ES 1.7 and Moloch 2015: Production-grade Servers 2017: Elastic Search in Security Analytics Platform 2019: SIEM replacement
  • 4. Devon - Internal 4 2017: Analytics Platform v1.0 – Working Experiments Axioms: • Collaboration with Data Analytics team • Only Cloud components • Use as much azure 1st party as possible • Be able to ingest 500GB/ day
  • 5. Devon - Internal 5 Approximately • 100 billion documents • 100 TB of Indexed data • 1.5 – 2.0 TB DAILY • 1300 indices • 10000 shards • Ingestion rate approximately 20-25k EPS Scale
  • 6. Devon - Internal 6 2019: Analytics Platform v3.0 – Production SIEM++ Enrichment moved to Logstash DYI Retention to BLOB First shard optimizations efforts Automation is our best L1 HoD More data, more user stories • Network • AKS • Accounting • SAP security
  • 7. Devon - Internal 7 • Traditional SIEM problems: • Retention • Parsing - legacy logs are terrible • WEF is great, until it isn’t • Just enough normalization • Enrichment > Correlation • Atomic indicators • Identity information • Cluster > purpose collectors • Logstash nodes are cattle, not pets • Puppet was good, K8s is great • Throughput vs. node count Logstash - The Core of Our FrankenSIEM

  • 8. Devon - Internal 8 Intent : Security should not be managing it What’s cool about ECE • It saves Time : Streamline scaling, securing, upgrading the stack • Great way to enhance utility and value was to grant other teams access to their logs • Centralized management of logstash pipelines and cluster user settings is cool • Inbuilt monitoring gives a great insight It was very different than traditional ES • Troubleshooting is a bit different • Elasticsearch.yml is not to be trifled with • Initially, had to rely a lot more on support Why ECE 8
  • 9. Devon - Internal 9 Indexing throughput • Use dedicated Master Nodes • Index refresh interval refresh_interval : 30s (even -1 for initial load) • Tune Indexing buffer size OOMs • Give JVM 50% of available memory • Prevent JVM resizing : minHeap=maxHeap Cautiously budget your cache • Limit and monitor Field Data cache : it never goes away • Using circuit breaker can save the day Disk Sizing • Monitor number of replica for your use case • Experiment with Sharding : Larger shards means better indexing rate but needs can vary; more shards come with cost Appraise your precedence, go Quid-pro-quo • Memory Intensive queries vs near-real-time data vs long-term retention Major Gotchas
  • 10. Devon - Internal 10 • It hurt being the first Azure customers for ECE • Docker ignorance didn’t help • ECE 1.1.x supported specific version of docker with specific version of kernel • Updates tanked the cluster at-least twice – multi zone helps • Problems at our scale: • Migrating shards between containers takes days • In-place container upgrades: advanced configuration will save your day/week • Moving to dedicated master nodes – election problems • Unexpected socket hang ups, difficult to diagnose • Monitoring: • Independent Monitoring cluster • Created our own health check scripts What We Learned – Operational Glitches
  • 11. Devon - Internal 11 What’s Next For Us Chugging the Docker and Kubernetes Kool-Aid: • Curator • Enrichment data collection • Logstash + autoscaling • Health checks Dedicated nodes: • ML Nodes • Search Only Nodes • Ingest nodes Separate security and non-security to their own spaces SAML / OpenIDc with MFA
  • 12. Thank you. Come call BS at Happy Hour 
 Contact info: @p4ulpc – [email protected] [email protected] Code: https://github.com/paulpc/elasticon