Starburst Introduction - March 2021
• 600% Growth YoY
• Named Open Source Startup to Watch 2020
• ANSI SQL MPP Query Engine
• On-Prem or Cloud
• High Concurrency
• Massive Scale
• 82 NPS Score
• 100+ Enterprise Customers
• Rapid Time to Insights
• Low Cost of Ownership
• Enterprise Grade Security
• 24x7 Expert Support
Our Customers Our Value to Them
• Reduced Decision Risk
• Increased Revenue and Profit
• Higher Customer Retention
• Accelerated Time to Market
Today’s data management approach delays analytics
Business has a question → Data Engineering services the request → Business gets an answer
[Diagram: Database, Cloud Data Warehouse, Cloud Data Lake; Multiple Copies]
Connectivity: Creating a Portable Access Layer
Data Scientists, Finance, Marketers, Data Analysts
Starburst Trino: SQL Engine Architecture
[Architecture diagram]
• Clients: BI Tool, SQL Client, CLI, and API (ODBC/JDBC, CLI) send SQL and receive Results or a Report
• Trino Cluster (Compute): Coordinator Node (Parse, Optimize (CBO), Schedule) and Worker Nodes in an Auto-scaling group
• Metadata API: Glue/Hive Catalog, Data Location
• Connectors
• Data (Storage): S3, ADLS, Blob Storage, GCS
• Key: Data; Intra-Cluster API Call
Separation of compute and storage
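The coordinator/worker split shown on this slide can be illustrated with a small toy sketch. This is not Trino's implementation; it only mimics the idea that a coordinator schedules independent chunks of stored data ("splits") onto workers and merges their partial results. The `SPLITS` data and the function names `worker_scan` and `coordinator_run` are hypothetical and exist only for illustration.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical stand-in for files in object storage (S3, ADLS, GCS):
# each "split" is an independent chunk of rows a worker can scan on its own.
SPLITS = [
    [{"region": "EU", "amount": 10}, {"region": "US", "amount": 5}],
    [{"region": "EU", "amount": 7}],
    [{"region": "US", "amount": 3}, {"region": "US", "amount": 4}],
]


def worker_scan(split):
    """Worker task: partially aggregate one split (sum of amount per region)."""
    partial = {}
    for row in split:
        partial[row["region"]] = partial.get(row["region"], 0) + row["amount"]
    return partial


def coordinator_run(splits, num_workers=2):
    """Coordinator: schedule splits on workers, then merge the partial sums."""
    with ThreadPoolExecutor(max_workers=num_workers) as pool:
        partials = list(pool.map(worker_scan, splits))
    totals = {}
    for partial in partials:
        for region, amount in partial.items():
            totals[region] = totals.get(region, 0) + amount
    return totals


if __name__ == "__main__":
    print(coordinator_run(SPLITS))
```

Because the data lives outside the compute processes, the number of workers can change without touching the stored data, which is the point of separating compute from storage.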
Deploy Starburst everywhere - On-premise or Cloud
[Deployment diagram]
• Deployment via Helm Charts: Service, Pod, Horizontal Pod Autoscaler (HPA)
• Hive Metastore
• Object Store Storage; Hadoop / Hive / Delta
• Starburst-Remote Connector
• Local Storage, Cloud Storage
● Fully scalable approach that allows connection between all your environments
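The Horizontal Pod Autoscaler named on this slide picks a replica count from the ratio of an observed metric to its target. A minimal sketch of that rule follows, based on the formula in the Kubernetes documentation (desired = ceil(current replicas × current metric / target metric), with a default tolerance of 0.1). Which pods are scaled and which metric is used in a Starburst deployment are not stated in the slides, so the function name, parameters, and example numbers here are assumptions for illustration only.

```python
import math


def desired_replicas(current_replicas, current_metric, target_metric,
                     min_replicas=1, max_replicas=10, tolerance=0.1):
    """Sketch of the HPA scaling rule; all parameter values are hypothetical."""
    ratio = current_metric / target_metric
    # Within the tolerance band the autoscaler leaves the replica count alone.
    if abs(ratio - 1.0) <= tolerance:
        return current_replicas
    desired = math.ceil(current_replicas * ratio)
    # Clamp to the configured bounds.
    return max(min_replicas, min(max_replicas, desired))


if __name__ == "__main__":
    # Example: 4 pods at 90% average CPU against a 60% target.
    print(desired_replicas(4, 90, 60))
```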
A Federated Semantic Layer - powered by Trino (formerly PrestoSQL)
Data Engineers
• Achieved GDPR compliance by leaving data where it lives locally
• Reduced infrastructure usage by 30%
• Improved time to insights for engineers/analysts by 800%

• Eliminated ETL for joining Oracle and HDFS while replacing Spark/Impala
• Reduced time to insight for critical risk models by 96%
• De-risked business decisions in real time
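The "joining Oracle and HDFS without ETL" idea can be sketched as a single SQL statement that spans two separate data sources. The example below uses SQLite's `ATTACH DATABASE` purely as a runnable stand-in: two independent in-memory databases play the roles of the Oracle and HDFS sources, and one query joins them in place. In Trino the equivalent query would address tables as catalog.schema.table. All database, table, and column names here are hypothetical.

```python
import sqlite3

# One connection, two independently attached databases standing in for
# two separate source systems (names are illustrative assumptions).
conn = sqlite3.connect(":memory:")
conn.execute("ATTACH DATABASE ':memory:' AS oracle_db")
conn.execute("ATTACH DATABASE ':memory:' AS hdfs_lake")

conn.execute("CREATE TABLE oracle_db.customers (id INTEGER, name TEXT)")
conn.execute("CREATE TABLE hdfs_lake.orders (customer_id INTEGER, amount REAL)")
conn.executemany(
    "INSERT INTO oracle_db.customers VALUES (?, ?)",
    [(1, "Acme"), (2, "Globex")],
)
conn.executemany(
    "INSERT INTO hdfs_lake.orders VALUES (?, ?)",
    [(1, 120.0), (1, 30.0), (2, 75.0)],
)

# A single query joins both sources; no rows are copied between them first.
FEDERATED_QUERY = """
SELECT c.name, SUM(o.amount) AS total
FROM oracle_db.customers AS c
JOIN hdfs_lake.orders AS o ON o.customer_id = c.id
GROUP BY c.name
ORDER BY c.name
"""

rows = conn.execute(FEDERATED_QUERY).fetchall()
print(rows)
```

The data stays in each source and only the query crosses the boundary, which is the property the compliance and ETL bullets above rely on.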