Structured Streaming
Structured Streaming
Learning Objectives
u DataStreamReader
u DataStreamWriter
u 2 approaches:
data sink
Derar Alhussein © Udemy | Databricks Certified Data Engineer Associate - Preparation
Treating Infinite Data as a Table
Input Data Stream Unbounded Table
streamDF
1. Fault Tolerance
u Checkpointing + Write-ahead logs
u record the offset range of data being processed during each trigger interval.
2. Exactly-once guarantee
u Idempotent sinks
u Advanced methods
u Windowing
u Watermarking