Open navigation menu

Scribd

0% found this document useful (0 votes)

21 views2 pages

20CS11Q3

This document outlines a course on modern data engineering. The course is divided into 5 units that cover topics such as data lakes architectures, data engineering tools on Microsoft Azure, data pipelines, the bronze and silver layers for data collection and curation, delta lake tables, and the gold layer for data aggregation. By the end of the course, students will be able to understand data lakes, explain data engineering pipelines and services, create delta lake tables, develop data curation and aggregation pipelines, and verify aggregated data.

Uploaded by

Copyright

© © All Rights Reserved

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

21 views2 pages

20CS11Q3

This document outlines a course on modern data engineering. The course is divided into 5 units that cover topics such as data lakes architectures, data engineering tools on Microsoft Azure, data pipelines, the bronze and silver layers for data collection and curation, delta lake tables, and the gold layer for data aggregation. By the end of the course, students will be able to understand data lakes, explain data engineering pipelines and services, create delta lake tables, develop data curation and aggregation pipelines, and verify aggregated data.

Uploaded by

Copyright

© © All Rights Reserved

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

MODERN DATA ENGINEERING

(Job Oriented Elective)

Course Code:20CS11Q3 L T P C
3 0 0 3
Course outcomes: At the end of the course students will be able to
CO1: Understand data lakes architectures and data engineering tools and services. (L2)
CO2: Explain architectures and pipelines to create data lakes. (L2)
CO3: experiment with delta lake tables (L3)
CO4: Build the data pipeline for the data curation stage. (L2)
CO5: Develop gold layer for data aggregation to meet customer expectations. (L3)

UNIT-I: (10 Lectures)

Discovering Storage and Compute Data Lakes: Introducing data lakes, Discovering data lake
architectures, Data Warehouse Vs Datalakes.
Data Engineering on Microsoft Azure: Introducing data engineering in Azure, Performing data
engineering in Microsoft Azure-Self-managed data, engineering services (IaaS), Azure-managed
data engineering services (PaaS), Data processing services in Microsoft Azure, Data engineering
as a service (SaaS), Data cataloging and sharing services in Microsoft Azure; Opening a free
account with Microsoft Azure. (Chapter 2,3)

Learning Outcomes: At the end of the module, students will be able to:

1. Explain introduction of data lakes. (L2)

2. Describe data engineering in Azure. (L2)
3. Summarize data engineering services.(L2)

UNIT-II: (10 Lectures)

Understanding Data Pipelines: Exploring data pipelines, Process of creating a data pipeline,
Running a data pipeline, Sample lakehouse project
Data Collection Stage – The Bronze Layer: Architecting the Electroniz data lake,
Understanding the bronze layer, Configuring data sources, Configuring data destinations,
Building the ingestion pipelines

Learning Outcomes: At the end of the module, students will be able to:

1. Understand the bronze layer. (L2)

2. Describe the process of configuring data sources. (L2)
3. Explain how to build ingestion pipelines.(L2)
UNIT-III: (10 Lectures)
Understanding Delta Lake: Understanding how Delta Lake enables the lakehouse,
Understanding Delta Lake, Creating a Delta Lake table, Changing data in an existing Delta Lake
table, Performing time travel, Performing upserts of data, Understanding isolation levels,
Understanding concurrency control, Cleaning up Azure resources

Learning Outcomes: At the end of the module, students will be able to:

1.summarize the process of clean the raw data (L2)

2. create a delta lake table (L3)
3. illustrate isolation levels and concurrency control(L2)

UNIT-IV: (10 Lectures)

Data Curation Stage – The Silver Layer: The need for curating raw data, The process of
curating raw data, Developing a data curation pipeline, Running the pipeline for the silver layer,
Verifying curated data in the silver layer, Cleaning up Azure resources. (Chapter 7)

Learning Outcomes: At the end of the module, students will be able to:

1.explain the need for curating the data. (L2)

2. Outline the process of curating the data. (L2)
3. Develop the data curation pipeline. (L2)

UNIT-V: (10 Lectures)

Data Aggregation Stage – The Gold Layer : The need to aggregate data, The process of
aggregating data, Developing a data aggregation pipeline, Running the aggregation pipeline,
Understanding data consumption, Verifying aggregated data in the gold layer, Meeting customer
expectations. (Chapter 8)

Learning Outcomes: At the end of the module, students will be able to:

1. Explain the need to aggregate data. (L2)

2. Build a data aggregation pipeline. (L3)
3. Interpret verification of aggregated data in the gold layer(L2)

Text Books:
1. Manoj Kukreja,, Data Engineering with Apache Spark, Delta Lake, and Lakehouse, Packt
Publishing, 2021.

References Books:
1. Scott Haines, Modern Data Engineering with Apache Spark: A Hands-On Guide for
Building Mission-Critical Streaming Applications, Apress, 2022.

Web References:
1. https://www.coursera.org/learn/introduction-to-data-engineering
2. https://www.coursera.org/professional-certificates/microsoft-azure-dp-203-data-engineeri
ng
3. https://aws.amazon.com/compare/the-difference-between-a-data-warehouse-data-lake-an
d-data-mart/

You might also like

Fundamentals of Data Engineering
No ratings yet
Fundamentals of Data Engineering
16 pages
Fundamentals of Data Engineering Concepts
No ratings yet
Fundamentals of Data Engineering Concepts
219 pages
Complete Step-By-Step Roadmap To Learn Data Engineering in 2025
No ratings yet
Complete Step-By-Step Roadmap To Learn Data Engineering in 2025
13 pages
Data Engineering Bootcamp
No ratings yet
Data Engineering Bootcamp
14 pages
Item Analysis Mean PL Mps
No ratings yet
Item Analysis Mean PL Mps
10 pages
DWDM R20 Lab Manual 3-1 Cse 2022-2023 Sem 1
No ratings yet
DWDM R20 Lab Manual 3-1 Cse 2022-2023 Sem 1
151 pages
Zazzafar Kishi Complt by Mumy Islam-1
No ratings yet
Zazzafar Kishi Complt by Mumy Islam-1
34 pages
Azure Data Engineering Course Content Day Wise.
No ratings yet
Azure Data Engineering Course Content Day Wise.
6 pages
Buc Academic Programmes Feb Advert 2021
No ratings yet
Buc Academic Programmes Feb Advert 2021
4 pages
Ryan International School Chandigarh Winter Holiday Homework
100% (1)
Ryan International School Chandigarh Winter Holiday Homework
7 pages
Data Engineer Roadmap
No ratings yet
Data Engineer Roadmap
2 pages
Data Engineering Roadmap
No ratings yet
Data Engineering Roadmap
3 pages
INT323 Lec 0 1
No ratings yet
INT323 Lec 0 1
32 pages
Essentials of Data Engineering - Saini, DR - Mukesh - 2024 - Anna's Archive
No ratings yet
Essentials of Data Engineering - Saini, DR - Mukesh - 2024 - Anna's Archive
431 pages
JLPT N5 July 2024
No ratings yet
JLPT N5 July 2024
5 pages
Up Police Si Books 2020 0a48b33a
No ratings yet
Up Police Si Books 2020 0a48b33a
6 pages
Linear Programme Instruction
0% (1)
Linear Programme Instruction
17 pages
OD M2 Building A Data Lake
No ratings yet
OD M2 Building A Data Lake
59 pages
Course - Data Engineering
No ratings yet
Course - Data Engineering
3 pages
Evolution of The Data Engineer
No ratings yet
Evolution of The Data Engineer
1 page
01-c Plant Nursery Skill Development
No ratings yet
01-c Plant Nursery Skill Development
5 pages
Gestalt Therapy 100 Key Points and Techniques 2nd Edition ISBN 1138067725, 9781138067721 All Sections Download
No ratings yet
Gestalt Therapy 100 Key Points and Techniques 2nd Edition ISBN 1138067725, 9781138067721 All Sections Download
14 pages
A3 V5 Tesol PDF
No ratings yet
A3 V5 Tesol PDF
17 pages
Conceptual Paper
No ratings yet
Conceptual Paper
15 pages
Antwak Providence Proposal v3
No ratings yet
Antwak Providence Proposal v3
14 pages
Syllabus For Data Engineering
No ratings yet
Syllabus For Data Engineering
3 pages
Programme Guide Data Engineering V1
No ratings yet
Programme Guide Data Engineering V1
4 pages
Captain's Skills
No ratings yet
Captain's Skills
174 pages
A Data Quality-Driven View of Mlops
No ratings yet
A Data Quality-Driven View of Mlops
12 pages
CV NguyenLamTruong EngNew PDF
No ratings yet
CV NguyenLamTruong EngNew PDF
1 page
Graziella Moraes Silva CV
No ratings yet
Graziella Moraes Silva CV
10 pages
Data Engineering Syllabus
No ratings yet
Data Engineering Syllabus
5 pages
DM Lecture 5
No ratings yet
DM Lecture 5
31 pages
CW Marksheet and Cover Template
No ratings yet
CW Marksheet and Cover Template
3 pages
Deeplearning - Ai Deeplearning - Ai
No ratings yet
Deeplearning - Ai Deeplearning - Ai
91 pages
Storybook Reading Lesson Plan Proudest Blue
No ratings yet
Storybook Reading Lesson Plan Proudest Blue
5 pages
DLP in EDUC 105 - GROUP2
No ratings yet
DLP in EDUC 105 - GROUP2
5 pages
Data Engineering Overview
No ratings yet
Data Engineering Overview
3 pages
Intro To Data Engineering!
No ratings yet
Intro To Data Engineering!
34 pages
2OEeUEnBTY CompleteGuideToBecomeModernDataEngineer
No ratings yet
2OEeUEnBTY CompleteGuideToBecomeModernDataEngineer
43 pages
L1 - Introduction and Data EcoSystem
No ratings yet
L1 - Introduction and Data EcoSystem
42 pages
Complete Roadma 2
No ratings yet
Complete Roadma 2
3 pages
Azure Data Engineering Syllabus
No ratings yet
Azure Data Engineering Syllabus
17 pages
University of Bristol Postgraduate Prospectus 2018 - Web
No ratings yet
University of Bristol Postgraduate Prospectus 2018 - Web
128 pages
My Career Roadmap
No ratings yet
My Career Roadmap
3 pages
Data - Engineer Questions
No ratings yet
Data - Engineer Questions
3 pages
Introduction To Data Engineering
No ratings yet
Introduction To Data Engineering
13 pages
IDBI
No ratings yet
IDBI
3 pages
De Unit - I
No ratings yet
De Unit - I
43 pages
Data Engineering For Model-Based Systems Engineering
No ratings yet
Data Engineering For Model-Based Systems Engineering
24 pages
Data Engineeing 1 Pages 2
No ratings yet
Data Engineeing 1 Pages 2
14 pages
Complete Data Engineering Roadmap With Resources
No ratings yet
Complete Data Engineering Roadmap With Resources
16 pages
2nd Provisional Merit List of LP Hailakandi
No ratings yet
2nd Provisional Merit List of LP Hailakandi
26 pages
Data Engineer Syllabus
No ratings yet
Data Engineer Syllabus
5 pages
12 Definitive Traits of A Middle Child
No ratings yet
12 Definitive Traits of A Middle Child
2 pages
100 Data Engineering QUESTIONS ANSWERS
No ratings yet
100 Data Engineering QUESTIONS ANSWERS
59 pages
Data Engineer Roadmap
No ratings yet
Data Engineer Roadmap
4 pages
UNIT 1 Merged
No ratings yet
UNIT 1 Merged
11 pages
Essentials of Data engineeringByMukeshSaini
No ratings yet
Essentials of Data engineeringByMukeshSaini
30 pages
Lecture 1.1 - Introduction To DE
No ratings yet
Lecture 1.1 - Introduction To DE
27 pages
Data Engineers Instagram Story
No ratings yet
Data Engineers Instagram Story
8 pages
A Internship Report UTTAM
No ratings yet
A Internship Report UTTAM
9 pages
Data Engineering Roadmap
No ratings yet
Data Engineering Roadmap
3 pages
BCSG 0034
No ratings yet
BCSG 0034
2 pages
Evolution of Data Engineer.
No ratings yet
Evolution of Data Engineer.
2 pages
Data-Engineering Compressed
No ratings yet
Data-Engineering Compressed
20 pages
How Many Miles To Babylon Teacher S Pack PDF
No ratings yet
How Many Miles To Babylon Teacher S Pack PDF
30 pages
Data Engineering Course Outline
No ratings yet
Data Engineering Course Outline
3 pages
8611 - Assignment 2 (AG)
100% (1)
8611 - Assignment 2 (AG)
14 pages
De Courseoutline White
No ratings yet
De Courseoutline White
4 pages
Data Engineer Preparation
No ratings yet
Data Engineer Preparation
5 pages
Annexure 1. A. 5 Board Results Achievement Circular CISCE - 24-25 - Grade 10 - 12
No ratings yet
Annexure 1. A. 5 Board Results Achievement Circular CISCE - 24-25 - Grade 10 - 12
5 pages
Sir C.R.Reddy College of Engineering, Eluru Department of Information Technology Course Handout
No ratings yet
Sir C.R.Reddy College of Engineering, Eluru Department of Information Technology Course Handout
12 pages
Data Engineering UNIT-1
No ratings yet
Data Engineering UNIT-1
5 pages
(DATA SCIENCE Syllabus
No ratings yet
(DATA SCIENCE Syllabus
2 pages
Data Engineering Nanodegree Program Syllabus
33% (3)
Data Engineering Nanodegree Program Syllabus
15 pages
Data Engineering Unit-1
No ratings yet
Data Engineering Unit-1
16 pages
Page 2
No ratings yet
Page 2
3 pages
Data Analyst & Data Engineer
No ratings yet
Data Analyst & Data Engineer
4 pages
Anthropology Natural Selection Lab Report Final
No ratings yet
Anthropology Natural Selection Lab Report Final
11 pages
Data Engineering UNIT-1
100% (1)
Data Engineering UNIT-1
14 pages
Iran
No ratings yet
Iran
7 pages
Read-Aloud Strategies Newsletter
No ratings yet
Read-Aloud Strategies Newsletter
5 pages
DELM 212 Educational Leadership
No ratings yet
DELM 212 Educational Leadership
12 pages
Roadmap
No ratings yet
Roadmap
3 pages
100 Dataengineering Interview Questions TRRaveendra 1694654407
No ratings yet
100 Dataengineering Interview Questions TRRaveendra 1694654407
58 pages
Introduction To Data Engineering
No ratings yet
Introduction To Data Engineering
8 pages
Lesson Plan For Position and Movement Mathematics 8 Lesson 1
No ratings yet
Lesson Plan For Position and Movement Mathematics 8 Lesson 1
5 pages
Keyboard Prep Piano Classes For 8-10yo
No ratings yet
Keyboard Prep Piano Classes For 8-10yo
3 pages
IGNOU MCA Data Warehousing and Data Mining Previous Years Unsolved Papers MCS 221
From Everand
IGNOU MCA Data Warehousing and Data Mining Previous Years Unsolved Papers MCS 221
Manish Soni
No ratings yet