Data Warehouse and BigQuery
Data Warehouse and BigQuery
Data
Warehouse and
BigQuery
02
What is a data
warehouse (DWH)?
Is a OLAP (Online analytical processing)
solution
What is BigQuery?
On Google Cloud Platform:
Serverless DWH
Offers scalability and high availability
Maximizes flexibility by separating the
compute engine process from storage
04
Partitioning in
BigQuery
Divides a large table into smaller, manageable
parts based on a column (e.g., date or integer
range).
Clustering in
BigQuery
Sorts table data by specified columns,
improving query performance by reducing
scanned data.
06
How to decide
which technique to
use?
Partitioning: Best for date/time filtering.
Single column.
Clustering: Best for high-cardinality
columns.
Both: Use when queries benefit from both
partitioning and clustering.
07
“OPPORTUNITIES
DON’T HAPPEN.
YOU CREATE
THEM.”
– Chris Grosser