Data-Engineering-Syllabus
DESCRIPTION
This course consists of two modules: analytics engineering and data engineering.
In the data engineering module, students deepen their Python and SQL skills by building APIs in Flask, and learn cloud computing and data orchestration skills.
During the data engineering module, students are paired with companies to perform externships (more below).
TEACHING STYLE
Days consist of 30-minute live lectures followed by interactive readings and labs performed either individually or in pairs.
Quizzes are regularly delivered to assess understanding.
Module 1: Analytics Engineering (Weeks 1 - 12)
Module 2: Data Engineering (Weeks 13 - 24)
DESCRIPTION
This course prepares students for their externship by training them in two subjects: (1)
software engineering fundamentals and (2) data pipelines.
Students learn to retrieve and manipulate data in Python, write clean functions, and model data
with object-oriented programming. Then, with SQL, students learn single-table queries,
relational queries, and advanced SQL techniques.
Students learn to build data pipelines that move data from a transactional database to an analytics
database, along with the data modeling and queries that come with them.
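As a rough illustration of what such a pipeline involves, the sketch below extracts rows from a transactional database, aggregates them, and loads the result into an analytics database. It is a minimal example and not course material: it uses two in-memory SQLite databases as stand-ins, and the table names (orders, customer_revenue), column names, and function names are all hypothetical.

```python
import sqlite3


def extract_orders(source):
    """Read raw order rows from the (stand-in) transactional database."""
    return source.execute(
        "SELECT id, customer, amount_cents FROM orders"
    ).fetchall()


def transform_orders(rows):
    """Aggregate order rows into total revenue in dollars per customer."""
    totals = {}
    for _order_id, customer, amount_cents in rows:
        totals[customer] = totals.get(customer, 0) + amount_cents
    return [(customer, cents / 100) for customer, cents in sorted(totals.items())]


def load_revenue(target, records):
    """Write the aggregated records into the (stand-in) analytics database."""
    target.execute(
        "CREATE TABLE IF NOT EXISTS customer_revenue "
        "(customer TEXT PRIMARY KEY, revenue REAL)"
    )
    target.executemany(
        "INSERT OR REPLACE INTO customer_revenue VALUES (?, ?)", records
    )
    target.commit()


def run_pipeline(source, target):
    """Run extract, transform, and load in order."""
    load_revenue(target, transform_orders(extract_orders(source)))


# Hypothetical sample data: three orders in the transactional database.
source = sqlite3.connect(":memory:")
source.execute(
    "CREATE TABLE orders "
    "(id INTEGER PRIMARY KEY, customer TEXT, amount_cents INTEGER)"
)
source.executemany(
    "INSERT INTO orders (customer, amount_cents) VALUES (?, ?)",
    [("acme", 1250), ("acme", 750), ("globex", 3000)],
)

target = sqlite3.connect(":memory:")
run_pipeline(source, target)
result = target.execute(
    "SELECT customer, revenue FROM customer_revenue ORDER BY customer"
).fetchall()
print(result)
```

In practice the source would be a production database and the target a warehouse, with the extract and load steps typically handled by dedicated tooling rather than hand-written functions.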
[Figure: Marketing data and sales data are loaded with Fivetran, Meltano, or Stitch into Snowflake, Redshift, or BigQuery, transformed with DBT, and analyzed in Mode, PowerBI, or Looker.]
• Module 1: Analytics Engineering •
Weeks 1 - 12
DESCRIPTION
This module teaches design patterns in backend web programming and how to build data engineering
pipelines. Throughout the second module, we will allocate 1 - 2 hours per week for technical
interview prep and six hours per week for students to work on externships.
Backend Engineering
We’ll learn the classic web programming design pattern of model-view-controller, perform ETL in
Python with the adapter pattern, and build out an object-relational mapper.
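To give a sense of the adapter pattern in an ETL setting, here is a minimal sketch. An adapter wraps a raw record from an external source and exposes it in the shape our own tables expect, so the rest of the pipeline never touches the source's format. The class name VenueAdapter, the field names, and the sample record are hypothetical and chosen only for illustration.

```python
class VenueAdapter:
    """Translate one raw external record into the field names our tables use."""

    def __init__(self, raw):
        self.raw = raw

    def to_row(self):
        # The nested "location" key and string-typed rating mimic a typical
        # external API payload (an assumption, not a real API's schema).
        location = self.raw.get("location", {})
        return {
            "name": self.raw["name"],
            "city": location.get("city"),
            "rating": float(self.raw.get("rating", 0)),
        }


raw_record = {
    "name": "Joe's Pizza",
    "location": {"city": "New York"},
    "rating": "4.5",
}
row = VenueAdapter(raw_record).to_row()
print(row)
```

If the external source changes its format, only the adapter needs to change; the code that loads rows into the database stays the same.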
Through cloud computing we'll learn how to deploy our backend APIs with AWS. Then
we'll learn how to automate requests for external data and load data into an analytics
database.
[Figure: Airflow pipeline diagram. Labels: External Data, Client Data, Staging, Warehouse.]