Data Engineering YouTube Roadmap
CS50 2022
3. Programming Language
Take any of the courses below. Your main goal here is to understand how to write basic Python code
and how to work with different datasets!
a. freeCodeCamp - Learn Python - Full Course for Beginners
b. Programming with Mosh - Python Tutorial for Beginners
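As a rough idea of what "basic Python code" working with a dataset can look like, here is a minimal sketch. The inline sample data, the column names, and the `average_by_city` function are made up for illustration and are not taken from either course.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample dataset; in practice you would open a real CSV file.
SAMPLE_CSV = """city,temperature
Pune,31
Delhi,35
Pune,29
Delhi,37
"""


def average_by_city(csv_text):
    """Return the average temperature per city from CSV text."""
    totals = defaultdict(float)
    counts = defaultdict(int)
    # DictReader yields one dict per row, keyed by the header names.
    for row in csv.DictReader(io.StringIO(csv_text)):
        totals[row["city"]] += float(row["temperature"])
        counts[row["city"]] += 1
    return {city: totals[city] / counts[city] for city in totals}


if __name__ == "__main__":
    print(average_by_city(SAMPLE_CSV))
```

Reading rows, converting types, and grouping values like this is the kind of task the courses above should make comfortable.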
You don’t have to remember all the commands; just understand what they do and
how to write them.
a. Kunal Kushwaha - Introduction to Linux and Terminal Commands
b. freeCodeCamp - Top 50 Most Popular Linux Commands
What will you learn?
✅ Python
✅ SQL
✅ Building Data Models
✅ Basics of DBMS
✅ Writing ETL Job
✅ Querying Data Programmatically
✅ PostgreSQL
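To make "Writing ETL Job" and "Querying Data Programmatically" concrete, here is a minimal sketch of the extract, transform, load pattern followed by a SQL query run from Python. It is only an illustration: it uses the standard-library `sqlite3` module as a stand-in for PostgreSQL so that it runs without a database server, and the `orders` table, the sample rows, and all function names are hypothetical.

```python
import sqlite3

# Hypothetical raw records, standing in for data extracted from a file or API.
RAW_ROWS = [
    {"name": " Alice ", "amount": "120.50"},
    {"name": "bob", "amount": "80"},
    {"name": "Alice", "amount": "30"},
]


def extract():
    """Extract: return the raw records (here, just a copy of the sample list)."""
    return list(RAW_ROWS)


def transform(rows):
    """Transform: tidy up names and convert amounts from text to numbers."""
    return [(r["name"].strip().title(), float(r["amount"])) for r in rows]


def load(conn, records):
    """Load: create the target table if needed and insert the cleaned records."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS orders ("
        "customer TEXT NOT NULL, amount REAL NOT NULL)"
    )
    conn.executemany(
        "INSERT INTO orders (customer, amount) VALUES (?, ?)", records
    )
    conn.commit()


def total_per_customer(conn):
    """Query the loaded data programmatically with plain SQL."""
    cursor = conn.execute(
        "SELECT customer, SUM(amount) FROM orders "
        "GROUP BY customer ORDER BY customer"
    )
    return cursor.fetchall()


def run_pipeline():
    """Run extract -> transform -> load against an in-memory database."""
    conn = sqlite3.connect(":memory:")
    load(conn, transform(extract()))
    return conn


if __name__ == "__main__":
    connection = run_pipeline()
    print(total_per_customer(connection))
```

With PostgreSQL the overall shape would stay the same, but the connection would come from a PostgreSQL driver and the parameter placeholder style may differ.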
What will you learn?
✅ Python
✅ SQL
✅ Cloud Computing Basics
✅ AWS Services - Athena, Glue, Redshift, S3, IAM
✅ Creating Data Pipeline
2. Covid Data Analysis Project
What will you learn?
✅ Python
✅ SQL
✅ Building Data Model
✅ AWS Services - Athena, Glue, Redshift, S3, IAM
✅ Creating Data Pipeline
✅ PostgreSQL
3. YouTube Data Analysis (End-To-End Data Engineering Project)
What will you learn?
✅ Python and PySpark
✅ SQL
✅ How to understand the business problem
✅ AWS Services - Athena, Glue, Redshift, S3, IAM, Lambda, Quicksight
✅ Building Data Pipeline and Scheduling it
4. Twitter Data Pipeline using Airflow
What will you learn?
✅ Python
✅ Basics of Airflow
✅ Working with Twitter Data and Package - Tweepy
✅ Python Package - Pandas
✅ Writing ETL job and storing data on S3