StarRocks Intro
StarRocks Intro
D4 Fact D2
• Added complexity to data pipeline
D3
Real-time analytics do
NOT handle updates well
Update / delete—Clickhouse
• Updates are forced into DB Implemented as “Alter table update” (mutations)
Asynchronous
• Either Merge on Read or
https://clickhouse.Yandex/docs/en/query_language/alter/#alter-mutations
Segment Replacement
• Query performance struggles
when processing updates/deletes
• Many use cases CANNOT
be supported
Challenges Facing Todays Real-time Analytics
High concurrency or
real-time? Pick one!
Extremely difficult
to maintain shard1 shard2 shard3 Resharding shard1 shard2 shard3 shard4 shard5
2x to 6x faster in standard
benchmark testing
Batch replace
Data Capture
Online
Complete update/ StarRocks
data
Kafka Application
delete functions
Sub-second query
latency even when data
is frequently updated
StarRocks—Real-time Operations Made Easy
High concurrency
and high throughput
MySQL Protocol
No dependencies on
external components
FE–Leader FE–Leader FE–Observer
Auto scaling without human Catalog Manager Catalog Manager … Catalog Manager
Other Products
Low concurrency with only 10 – 100 users High concurrency with 10000s of users