0% found this document useful (0 votes)
21 views2 pages

(DATA SCIENCE Syllabus

The document outlines the course structure for a Data Engineering class in a B.Tech program, detailing course objectives and outcomes. It covers key topics such as the Data Engineering Life Cycle, data architecture design, data storage systems, and data ingestion methods. Additionally, it lists required textbooks and reference materials for the course.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
21 views2 pages

(DATA SCIENCE Syllabus

The document outlines the course structure for a Data Engineering class in a B.Tech program, detailing course objectives and outcomes. It covers key topics such as the Data Engineering Life Cycle, data architecture design, data storage systems, and data ingestion methods. Additionally, it lists required textbooks and reference materials for the course.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

CBIT–B.

Tech(R23)–CSE-DS

II B.Tech – II Semester
(23E32401T) DATA ENGINEERING
(CSE-DATA SCIENCE)

Int. Marks Ext. Marks Total Marks L T P C


30 70 100 3 0 0 3
Course Objectives:
• Explain basic concepts of Data Engineering
• Discuss bout Data Engineering Life Cycle
• How to design Good Data Architecture

Course Outcomes: By the end of the course students will be able to:
CO1: Understand Data Engineering Life cycle
CO2: Apply appropriate data modeling techniques for different types of data.
CO3: Evaluate and select appropriate technologies and frameworks for specific data
engineering tasks.
CO4: Analyze the use of OLTP Applications in Data Science.
CO5: Implement data quality checks and governance processes to ensure data
reliability and compliance.
UNIT-I: Introduction to Data Engineering: Definition, Data Engineering Life Cycle,
Evolution of Data Engineer, Data Engineering Versus Data Science, Data Engineering Skills
and Activities,
Data Maturity, Data Maturity Model, Skills of a Data Engineer, Business Responsibilities,
Technical Responsibilities, Data Engineers and Other Technical Roles.

UNIT-II: Data Engineering Life Cycle: Data Life Cycle Versus Data Engineering Life
Cycle, Generation: Source System, Storage, Ingestion, Transformation, Serving Data.
Major undercurrents across the Data Engineering Life Cycle: Security, Data
Management, Data Ops, Data Architecture, Orchestration, Software Engineering.

UNIT-III: Designing Good Data Architecture: Enterprise Architecture, Data Architecture,


Principles of Good Data Architecture, Major Architecture Concepts.
Data Generation in Source Systems: Sources of Data, Files and Unstructured Data, APIs,
Application Databases (OLTP), OLAP, Change Data Capture, Logs, Database Logs, CRUD,
Source System Practical Details.

UNIT-IV: Storage: Raw Ingredients of Data Storage, Data Storage Systems, Data
Engineering Storage Abstractions, Data warehouse, Data Lake, Data Lakehouse.
Ingestion: Data Ingestion, Key Engineering considerations for the Ingestion Phase, Batch
Ingestion Considerations, Message and Stream Ingestion Considerations, Ways to Ingest Data

UNIT-V: Queries, Modeling and Transformation: Queries, Life of a Query, Query


Optimizer, Queries on Streaming Data, Data Modelling, Modeling Streaming Data,
Transformations, Streaming Transformations and Processing.
Serving Data for Analytics, Machine Learning and Reverse ETL: General Considerations
for serving Data, Business Analytics, Operational Analytics, Embedded Analytics, Ways to
serve data for analytics and ML, Reverse ETL.

29
CBIT–B.Tech(R23)–CSE-DS

Textbooks:
1. Joe Reis, Matt Housley, Fundamentals of Data Engineering, O'Reilly Media, Inc.,June
2022,ISBN: 9781098108304

Reference Books:
1. Paul Crickard , Data Engineering with Python,Packt Publishing, October 2020.
2. Ralph Kimball, Margy Ross, The Data Warehouse Toolkit: The Definitive Guide to
Dimensional Modeling, Wiley, 3rd Edition, 2013
3. James Densmore, Data Pipelines Pocket Reference: Moving and Processing Data for
Analytics, O'Reilly Media, 1st Edition,

30

You might also like