Blogs on OLake
IBM Db2 LUW to Lakehouse: Sync to Apache Iceberg Using OLake
A practical guide to syncing IBM Db2 for LUW databases to Apache Iceberg using OLake, covering setup, configuration, sync modes, troubleshooting, and DB2-specific considerations like RUNSTATS and REORG.
How to Compact Apache Iceberg Tables: Small Files + Automation with Apache Amoro
A practical guide to fixing small-file bloat in Apache Iceberg, showing when and how to run compaction, the performance gains you can expect, and how Amoro automates it to turn Iceberg tables into self-optimizing lakehouses.
Sync MSSQL to Your Lakehouse with OLake
A practical guide to syncing Microsoft SQL Server (MSSQL) into Apache Iceberg using OLake, covering sync modes, CDC setup, schema changes, data type mapping, and troubleshooting.
Ingesting Files from S3 with OLake: Turn Buckets into Reliable Streams (AWS + MinIO + LocalStack)
A comprehensive guide to ingesting data from Amazon S3 and S3-compatible storage using OLake, covering stream discovery, format support, incremental sync, and best practices for AWS, MinIO, and LocalStack.
Bridging the Gap: Making OLake's MOR Iceberg Tables Compatible with Databrick's Query Engine
Learn how to make OLake's Merge-on-Read (MOR) Iceberg tables compatible with Databricks using an automated MOR to COW write script that transforms MOR tables into Copy-on-Write (COW) format for accurate analytics queries.
OLake — now an Arrow-based Iceberg Ingestion Tool
Discover how OLake's new Arrow-based architecture delivers 1.75x faster ingestion performance.









