0% found this document useful (0 votes)
14 views

Azure Data engineering Syllabus

The Azure Data Engineering course syllabus outlines a comprehensive training program covering SQL, Azure services, Data Factory, Databricks, Python, Spark, and DevOps. It includes over 60 hours of instruction, practical projects, and is suitable for individuals from any background. The course emphasizes hands-on learning and real-world applications in data engineering.

Uploaded by

rahulnikrad10
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
14 views

Azure Data engineering Syllabus

The Azure Data Engineering course syllabus outlines a comprehensive training program covering SQL, Azure services, Data Factory, Databricks, Python, Spark, and DevOps. It includes over 60 hours of instruction, practical projects, and is suitable for individuals from any background. The course emphasizes hands-on learning and real-world applications in data engineering.

Uploaded by

rahulnikrad10
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 17

AZURE DATA

ENGINEERING
Course Syllabus

Page #1
Trainer's
Profile

18 years of professional experience


12 years of training experience in Data Analytics and Data
Engineering
Microsoft certified in Azure Data Engineering, SQL and
POWER BI.
Over 5000+ hours of training delivered.
Trained over 2000+ students across multiple technologies
from people in different industries.
Expertise in batch, one-to-one and corporate trainings.
Creator and lead instructor @cloudanddatauniverse.
Over 25 projects delivered across multiple technologies.

Page #2
Azure Data Engineering Syllabus

Learning Path

CLOUD AND DATA UNIVERSE

Page #4
Azure Data Engineering Syllabus

SQL : 14+ Hours


Introduction Advanced SQL Concepts
Introduction to SQL Creating and Managing
Different vendors in SQL Views
Installation of Microsoft Creating and Managing
SQL Server Stored Procedures
Installing Sample Database Creating and Managing
Functions
Creating and Managing
Index
SQL Fundamentals
CTE
Querying tables
Query Optimization
Inserting data
Creating, Deleting and
Managing tables
Data Warehousing
Sorting and Filtering Data
Introduction
Grouping and Pivoting
OLAP & OLTP
Data
Star & SnowFlake Schema
Creating constraints on
Incremental Load
tables
ETL process
Importing and Exporting
Facts & Dimensions
data from various sourses
SCD types
Joining tables
ACID Properties
String Functions
Date Functions
Subquery
Case statement
Window functions
Data types
Combining tables

CLOUD AND DATA UNIVERSE

Page #5
Azure Data Engineering Syllabus

Azure Cloud : 1 Hour


Introduction
Cloud Computing
Top cloud providers
Benefits of Cloud
Computing
Introduction to Azure
Sign up for Azure Trial
How to upgrade to a pay-
as-you-go
Provision Azure Data lake
gen2
Provision Azure SQL DB
Provision Azure Data
Factory
Provision Azure Databricks
Provision Azure Synapse

CLOUD AND DATA UNIVERSE

Page #6
Azure Data Engineering Syllabus

Azure Data Factory : 12+ Hours


Introduction Transformations
What is ETL? Select
ETL vs ELT Sort
Top tools in ETL Filter
Introduction to Azure Data Derived
Factory Conditional split
Cast
Surrogate
Aggregate
Activities
Pivot
Copy Data
Unpivot
Set Variable
Rank
Get Metadata
Window
For Each
Union
Append Variable
Lookup
Delete
Exists
Lookup
Join
Stored procedure
Flatten
Filter
Parse
IF
Stringify
Switch
Execute Pipeline
Script ADF Features
Validation Introduction to Dataflow
Integration Runtimes
Incremental loads
Triggers
Monitoring Pipelines

CLOUD AND DATA UNIVERSE

Page #7
Azure Data Engineering Syllabus

Python Fundamentals : 6+ Hours


Introduction Libraries
What is Python? Matplotlib
Installation Numpy
Pandas

Fundamentals Advanced
Print Functions-Introduction
Variables Functions-Scope of variable
Concatenation Functions-args&kwargs
Data Types Lambda
Interpolation map
operators filter
If statement reduce
loops Classes
Collections - List
Collections - Tuple
Collections - Set
Collections - Dictionary
List comprehensions
Date Functions
Exception handling
string functions
Regex

CLOUD AND DATA UNIVERSE

Page #8
Azure Data Engineering Syllabus

Big Data Fundamentals : 1 Hour


Introduction
Introduction to Big data
History & evolution of Big
data
Understanding Distributed
computing
Hadoop ecosystem
overview
Azure data engineering
stack
Different file types

CLOUD AND DATA UNIVERSE

Page #9
Azure Data Engineering Syllabus

Spark & Databricks : 14 Hours


Spark DataFrames
Introduction Introduction to DataFrame
Spark components & API Creating data frame from
Spark architecture RDD
Transformations Imposing schema
Actions Read options
Lineage graph Write options
DAG Rename columns
Add columns
Select columns
RDD Drop columns
Remove duplicates
Introduction to RDD
Sort columns
Creating RDD from text file
Filter
Various operations on RDD
Union
Conditional columns
Casting
Databricks Aggregations
Introduction Pivot
Sign up for Databricks Unpivot
Community Edition Window functions
Setting up Cluster Joins
Understanding dbutils Join Algorithms
Accessing & mounting Partitioning
Azure Data lake gen2 Bucketing
Creating & monitoring jobs
Auto loader
Unity catalog

CLOUD AND DATA UNIVERSE

Page #10
Azure Data Engineering Syllabus

Spark & Databricks : 14 Hours


Spark Advanced
Understanding Data
distribution
Spark Execution Plans
Join Algorithms
Spark SQL
Memory management
Garbage collection
JDBC
Data skewness
Adaptive query execution
Resource allocation
Serialization &
Deserialization

CLOUD AND DATA UNIVERSE

Page #11
Azure Data Engineering Syllabus

Delta Lake & LakeHouse : 3 Hours

Introduction to Delta lake


Read & Write Delta format
Creating Delta tables
Understanding DeltaLog
DML operations
VersionHistory
TimeTravel
Restore
Checkpointing
Partioning
SchemaEvolution
CopyInto
Merge
Optimize
Vaccum
Zorder
LiquidClustering
ChangeFeed
ReorgTable
DeletionVectors
SCD Type 1
SCD Type 2

CLOUD AND DATA UNIVERSE

Page #12
Azure Data Engineering Syllabus

Synapse : 1 Hour
Introduction
Getting started with
Synapse
Setting up synapse

Synapse Fundamentals
Pool concept
Dedicated & Serverless
pools
Pricing
Databases
Spark pool
Creating pipelines
Creating notebooks

CLOUD AND DATA UNIVERSE

Page #13
Azure Data Engineering Syllabus

Azure DevOps : 1 hour


Introduction
Introduction to DevOps
Getting started with Azure
DevOps
Setting up Azure
components with Azure
DevOps

Code versioning
Code versioning
Creating Repository
Creating branches
Setting policies
Code check-in
Pull requests
Code merging

Deployment
Deployment
Creating build pipelines
Creating release pipelines

CLOUD AND DATA UNIVERSE

Page #14
Azure Data Engineering Syllabus

End-to-End Project
A master piece covering end-end-to project
with below features

Requirement gathering.
Understanding client
business.
Architecture
Documentation
End-to-end design and
development
Creating pipelines in data
factory and databricks
using pyspark.
Building entire cicd
process in azure devops
Testing
Effort estimation
Databricks workflows

CLOUD AND DATA UNIVERSE

Page #15
Azure Data Engineering Syllabus

FAQ’s

Which tools does this course cover?


This course cover’s SQL, AZURE, AZURE DATA
FACTORY, DATABRICKS, PYTHON, PYSPARK, SYNAPSE
& AZURE DEVOPS.

I am completely from a different background, can I


enroll in this course and how difficult will it be for
me to transition in this azure data engineering
domain?
Yes, anyone can enroll in this course irrespective of
their current profile. This course starts from scratch
and no pre-requisites are needed, we cover everything
from beginning to end. The most important thing is to
be committed to the learning path and be consistent
throughout the course.

CLOUD AND DATA UNIVERSE

Page #16
Azure Data Engineering Syllabus

Feedback & Reviews

CLOUD AND DATA UNIVERSE

Page #21
Azure Data Engineering Syllabus

Feedback & Reviews

CLOUD AND DATA UNIVERSE

Page #22

You might also like