Exam DP-203: Data Engineering on Microsoft Azure - Skills Measured
Audience Profile
Candidates for this exam should have subject matter expertise integrating, transforming, and
consolidating data from various structured and unstructured data systems into a structure that is
suitable for building analytics solutions.
Azure Data Engineers help stakeholders understand the data through exploration, and they
build and maintain secure and compliant data processing pipelines by using different tools and
techniques. These professionals use various Azure data services and languages to store and
produce cleansed and enhanced datasets for analysis.
Azure Data Engineers also help ensure that data pipelines and data stores are high-performing,
efficient, organized, and reliable, given a set of business requirements and constraints. They deal
with unanticipated issues swiftly, and they minimize data loss. They also design, implement,
monitor, and optimize data platforms to meet data pipeline needs.
A candidate for this exam must have strong knowledge of data processing languages such as
SQL, Python, or Scala, and they need to understand parallel processing and data architecture
patterns.
Skills Measured
NOTE: The bullets that follow each of the skills measured are intended to illustrate how we
assess that skill. This list is not definitive or exhaustive.
NOTE: Most questions cover features that are General Availability (GA). The exam may contain
questions on Preview features if those features are commonly used.
Implement physical data storage structures (a short PySpark sketch follows this list)
implement compression
implement partitioning
implement sharding
implement different table geometries with Azure Synapse Analytics pools
implement data redundancy
implement distributions
implement data archiving
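Table geometries and distributions are defined in Azure Synapse dedicated SQL pool DDL, but compression and partitioning can be shown with a minimal PySpark sketch. This is illustrative only; the abfss:// paths and the sale_date column are hypothetical placeholders, not part of the exam outline.

from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("storage-structures").getOrCreate()

# Read a raw table from the data lake (hypothetical path).
df = spark.read.parquet("abfss://raw@mydatalake.dfs.core.windows.net/sales")

# Persist it partitioned by date with Snappy-compressed Parquet files:
# "implement partitioning" and "implement compression" in one write.
(df.write
   .mode("overwrite")
   .partitionBy("sale_date")
   .option("compression", "snappy")
   .parquet("abfss://curated@mydatalake.dfs.core.windows.net/sales"))

Partitioning on a date column plus compressed Parquet is a common default for analytical workloads; in a Synapse dedicated SQL pool, the analogous choices are distribution (hash, round-robin, replicated) and index geometry (clustered columnstore, heap).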
Design and develop a batch processing solution (see the merge sketch after this list)
develop batch processing solutions by using Data Factory, Data Lake, Spark, Azure Synapse Pipelines, PolyBase, and Azure Databricks
create data pipelines
design and implement incremental data loads
design and develop slowly changing dimensions
handle security and compliance requirements
scale resources
configure the batch size
design and create tests for data pipelines
integrate Jupyter/IPython notebooks into a data pipeline
handle duplicate data
handle missing data
handle late-arriving data
upsert data
regress to a previous state
design and configure exception handling
configure batch retention
design a batch processing solution
debug Spark jobs by using the Spark UI
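Several bullets above (handle duplicate data, upsert data, incremental loads) come together in a merge-based batch load. A minimal sketch, assuming the curated table is stored in Delta Lake format (as on Azure Databricks); the paths and the customer_id key are hypothetical:

from pyspark.sql import SparkSession
from delta.tables import DeltaTable

spark = SparkSession.builder.appName("batch-upsert").getOrCreate()

# Hypothetical incremental batch staged by an upstream pipeline activity.
updates = (spark.read
                .parquet("abfss://staging@mydatalake.dfs.core.windows.net/customers")
                .dropDuplicates(["customer_id"]))  # handle duplicate data

target = DeltaTable.forPath(
    spark, "abfss://curated@mydatalake.dfs.core.windows.net/customers")

# Upsert: update rows whose key already exists, insert the rest.
(target.alias("t")
       .merge(updates.alias("s"), "t.customer_id = s.customer_id")
       .whenMatchedUpdateAll()
       .whenNotMatchedInsertAll()
       .execute())

Because the merge is keyed, re-running the same batch is idempotent, which is what makes incremental loads and retries after failed loads safe.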
Design and develop a stream processing solution (see the streaming sketch after this list)
develop a stream processing solution by using Stream Analytics, Azure Databricks, and Azure Event Hubs
process data by using Spark structured streaming
monitor for performance and functional regressions
design and create windowed aggregates
handle schema drift
process time series data
process across partitions
process within one partition
configure checkpoints/watermarking during processing
scale resources
design and create tests for data pipelines
optimize pipelines for analytical or transactional purposes
handle interruptions
design and configure exception handling
upsert data
replay archived stream data
design a stream processing solution
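A minimal Spark Structured Streaming sketch covering windowed aggregates, watermarking for late-arriving data, and checkpointing. It uses Spark's built-in rate source so it is self-contained; a real pipeline would read from Azure Event Hubs or Kafka instead, and the output and checkpoint paths are hypothetical:

from pyspark.sql import SparkSession
from pyspark.sql.functions import col, window

spark = SparkSession.builder.appName("stream-agg").getOrCreate()

# Self-contained demo source; swap in an Event Hubs/Kafka reader in practice.
events = spark.readStream.format("rate").option("rowsPerSecond", 100).load()

# Windowed aggregate with a watermark that bounds late-arriving data.
counts = (events
          .withWatermark("timestamp", "10 minutes")
          .groupBy(window(col("timestamp"), "5 minutes"))
          .count())

# Checkpointing lets the query recover and replay after an interruption.
query = (counts.writeStream
               .outputMode("append")
               .format("parquet")
               .option("path", "abfss://curated@mydatalake.dfs.core.windows.net/counts")
               .option("checkpointLocation", "/tmp/checkpoints/stream-agg")
               .start())
query.awaitTermination()

The watermark tells the engine how long to wait for late events before finalizing a window; events arriving later than the watermark allows are dropped rather than reprocessed.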
Manage batches and pipelines (see the SDK sketch after this list)
trigger batches
handle failed batch loads
validate batch loads
manage data pipelines in Data Factory/Synapse Pipelines
schedule data pipelines in Data Factory/Synapse Pipelines
implement version control for pipeline artifacts
manage Spark jobs in a pipeline
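A minimal sketch of triggering and validating a batch load with the azure-mgmt-datafactory SDK, as referenced after the list above. It assumes the pipeline already exists; the subscription, resource group, factory, and pipeline names are hypothetical placeholders:

import time
from azure.identity import DefaultAzureCredential
from azure.mgmt.datafactory import DataFactoryManagementClient

client = DataFactoryManagementClient(DefaultAzureCredential(), "<subscription-id>")

# Trigger a batch: start an on-demand pipeline run.
run = client.pipelines.create_run(
    "my-resource-group", "my-data-factory", "CopySalesPipeline", parameters={})

# Poll until the run leaves its transient states, then validate the outcome.
status = "Queued"
while status in ("Queued", "InProgress"):
    time.sleep(30)
    status = client.pipeline_runs.get(
        "my-resource-group", "my-data-factory", run.run_id).status

print(f"Pipeline run {run.run_id} finished with status: {status}")

A "Failed" status here is the hook for handling failed batch loads, whether that means alerting, retrying, or regressing the target to a previous state.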