Data Engineering Roadmap
Data Engineering Roadmap
a. CS50 2022
b. Book - Grokking Algorithms: An illustrated guide
2. Programming Language
Do any courses, your main goal here is to understand how to write basic Python
Code and how to work with different datasets!
Practice Projects:
● Scrape Data Using BeautifulSoup Library eg. Amazon, Covid, Wikipedia, or any
website you like
● Build A Calculator Using Python
a. Udemy - The Complete SQL Bootcamp for the Manipulation and Analysis of
Data (Recommended)
b. Coursera - SQL for Data Science
c. DataCamp - Intro To SQL DataCamp
4. Basics Of Linux
Why Linux? Because you will be working with many remote machines, doing SSH to
access them, and performing operations so it’s important to learn them.
You don’t have to remember all the commands but just understand what they do and
how to write them
Do Hands-On Project
● Beginner Data Engineering Portfolio Project (Recommended)
a. Fundamentals
i. Coursera - Data Warehousing for Business Intelligence Specialization
(recommended for deep dive)
ii. Udemy - Data Warehouse Fundamentals for Beginners (recommended
for quick learning)
b. Tools
i. Snowflake - Snowflake – The Complete Masterclass
ii. Snowflake Doc - https://www.snowflake.com/certifications/
Do Hands-On Project
1. Build ETL Pipeline Using AWS Cloud
2. Covid Data Analysis Project
3. YouTube Data Analysis (End-To-End Data Engineering Project)
Recommended Books
1. Designing Data-Intensive Applications
2. Fundamentals of Data Engineering
3. The Data Warehouse Toolkit
Follow Me Here:
1. Twitter - https://twitter.com/parmardarshil07
2. Linkedin - https://www.linkedin.com/in/darshil-parmar/
3. YouTube - https://www.youtube.com/c/DarshilParmar