Azure Data Engineer Learning Pathway
Azure Data Engineer Learning Pathway
Additional Study
Microsoft Applied Skills
Design and develop data processing Design and implement data security Monitor and optimize data storage and
• Backup and restore in Azure Synapse Dedicated SQL • Implement encryption data processing
pool • Data ingestion security considerations • Auto Optimize in Azure Databricks
• Implement workload management • Configure authentication • Modify user-defined functions Targeted validation for real-world scenarios. Demonstrate
• Use extended Apache Spark history server to debug and • Designing distributed tables proficiency in specific, scenario- based skill sets so you can make
• Access control lists (ACLs) in Azure
diagnose Apache Spark applications a bigger impact on every project, at your organization, and in
Data Lake Storage Gen2 • Data spillage scenario - Search and
your career
• Enterprise Data Warehouse Architecture • Synapse access control purge
• Stream processing with Azure Databricks • Column-level security • Quickstart: Create an Azure Synapse
workspace using an ARM template
Explore Applied Skills
• Azure Synapse Analytics • Manage authorization through
• Monitoring for performance efficiency column and row level security • Indexing dedicated SQL pool tables
• Work with windowing functions • Manage user permissions • Performance tuning with result set
• Schema drift • Auditing for Azure SQL Database and caching
Azure Synapse Analytics • Optimize Apache Spark jobs 30 days to Learn it Challenge
• Time handling in Stream Analytics
• Checkpoint and replay concepts in Azure Stream • Retention Policy on storage accounts • Troubleshoot library installation errors
Analytics jobs • Understand network security options • Debug data factory pipelines 30 Days to Learn It can help you build skills and start your
• Scale an Azure Stream Analytics job to increase • Dynamic Data Masking preparation for Microsoft Certifications for AI, DevOps, Microsoft
throughput • Secure a dedicated SQL pool 365, low code, IoT, data science, cloud development, and more.
Select your challenge below, work through learning modules, and
exchange ideas with peers through a global community forum.
Design and develop data processing Monitor and optimize data storage and
• Use repartitioning to optimize processing data processing
• Azure Stream Analytics output error policy • Monitor and Alert Data Factory by
using Azure Monitor Explore the challenges
• Stream Analytics output to Cosmos DB
• Stream processing with Stream Analytics • Exercise – Implement workload
• Data Loading best practices management
• Get Started with Synapse Analytics • Monitor your Azure Synapse Analytics
dedicated SQL pool workload using
• Monitor your Synapse Workspace DMVs
• Collect custom logs with Log Analytics
agent
• Use Synapse Studio to monitor your
workspace pipeline runs
• Deploying Apache Airflow in Azure to
build and run data pipelines