Airflow Chapter1
Airflow Chapter1
Airflow
INTRODUCTION TO AIRFLOW IN PYTHON
Mike Metzger
Data Engineer
What is data engineering?
Data engineering is:
Taking any action involving data and turning it into a reliable, repeatable, and maintainable
process.
Creation
Scheduling
Monitoring
1 h ps://air ow.apache.org/docs/stable/
Luigi
SSIS
Bash scripting
etl_dag = DAG(
dag_id='etl_pipeline',
default_args={"start_date": "2020-01-08"}
)
Using a DAG named example-etl, a task named download- le and a start date of 2020-01-10:
Mike Metzger
Data Engineer
What is a DAG?
DAG, or Directed Acyclic Graph:
1 h ps://en.m.wikipedia.org/wiki/Directed_acyclic_graph
Are wri en in Python (but can use components wri en in other languages).
Mike Metzger
Data Engineer
DAGs view
Web UI is easier