DP 203 Questions 2
Your data engineering team has an Azure Stream Analytics job in place. Currently the
job is configured to take in events from an Azure Event Hub. It then outputs data to an
Azure Dedicated SQL pool within Azure Synapse Analytics. The engineers have been
reviewing the metrics. They are seeing a high number of Backlogged input events.
Which of the following can be done to ensure the Backlogged input events are kept in
check?
● Add another output to the Stream Analytics job
● Change the partition key of the incoming stream
● Add another input to the Stream Analytics job
● Increase the number of streaming units assigned to the job
● (Correct)
Explanation
One reason for a high value of the Backlogged Input Events metric is that the job is not
able to keep up with the incoming stream. Adding more streaming units assigns more
resources to the job so that it can keep up with the incoming streams.
For more information on monitoring for a Stream Analytics job, one can visit the
following URL
https://docs.microsoft.com/en-us/azure/stream-analytics/stream-analytics-monitoring
Question 2: Correct
Your data engineering team is planning on setting up a dedicated SQL pool in an Azure
Synapse Analytics workspace. A separate set of users will be responsible for loading
data into the SQL pool. And another set of users will be responsible for querying of data
from the SQL pool. You have to ensure that the loading process has enough resources
assigned to it. Which of the following can be implemented for this requirement?
● Assign more resources via workload classification
● (Correct)
● Make sure to use the COPY statement while loading the data
● Make use of materialized views
Explanation
You need to make use of Workload Classifiers to ensure that more resources are
allocated to the users who will be performing the load process.
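As an illustration only, below is a minimal T-SQL sketch of a workload group and classifier; the group name wgDataLoads, the user LoadingUser and the resource percentages are assumptions, not values from the question.
-- Hypothetical workload group reserving resources for load requests
CREATE WORKLOAD GROUP wgDataLoads
WITH ( MIN_PERCENTAGE_RESOURCE = 50
     , CAP_PERCENTAGE_RESOURCE = 100
     , REQUEST_MIN_RESOURCE_GRANT_PERCENT = 25 );

-- Classifier that routes requests from the loading user into that group
CREATE WORKLOAD CLASSIFIER wcLoadUsers
WITH ( WORKLOAD_GROUP = 'wgDataLoads'
     , MEMBERNAME = 'LoadingUser'
     , IMPORTANCE = HIGH );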
For more information on Workload Classifiers, one can visit the following URL
https://docs.microsoft.com/en-us/azure/synapse-analytics/sql-data-warehouse/sql-data-warehouse-workload-classification
Question 3: Correct
Your team is designing the tables for a data warehouse. The data warehouse is going to
be hosted in a Dedicated SQL pool in Azure Synapse Analytics. The following tables are
going to be hosted initially in the pool
You have to choose the right distribution for each table and ensure that data movement across tables is minimized.
Which of the following distribution types would you choose for the Sales table?
● Hash
● (Correct)
● Round Robin
● Replicated
Explanation
Since this is a large fact table, you should use a hash-distributed table.
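A minimal sketch of a hash-distributed table is shown below; the column names are assumptions for illustration.
-- Large fact table: hash-distribute on a column with many distinct values
CREATE TABLE dbo.Sales
(
    SalesOrderId INT NOT NULL,
    CustomerId   INT NOT NULL,
    ProductId    INT NOT NULL,
    OrderAmount  DECIMAL(18,2) NOT NULL
)
WITH
(
    DISTRIBUTION = HASH(SalesOrderId),
    CLUSTERED COLUMNSTORE INDEX
);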
For more information on table distribution, one can visit the following URL
https://docs.microsoft.com/en-us/azure/synapse-analytics/sql-data-warehouse/sql-data-warehouse-tables-distribute
Question 4: Correct
Your team is designing the tables for a data warehouse. The data warehouse is going to
be hosted in a Dedicated SQL pool in Azure Synapse Analytics. The following tables are
going to be hosted initially in the pool
You have to choose the right distribution for each table and ensure that data movement across tables is minimized.
Which of the following distribution types would you choose for the Customer table?
● Hash
● Round Robin
● Replicated
● (Correct)
Explanation
Since this is a dimension table and you need to ensure data movement is minimized, you
should choose a replicated table so that the data is available on all compute nodes in
the SQL pool.
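A minimal sketch of a replicated dimension table is shown below; the column names are assumptions.
-- Small dimension table: a full copy is maintained on every compute node
CREATE TABLE dbo.Customer
(
    CustomerId   INT NOT NULL,
    CustomerName NVARCHAR(100) NOT NULL
)
WITH
(
    DISTRIBUTION = REPLICATE,
    CLUSTERED COLUMNSTORE INDEX
);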
For more information on replicated table design, one can visit the following URL
https://docs.microsoft.com/en-us/azure/synapse-analytics/sql-data-warehouse/design-guidance-for-replicated-tables
Question 5: Correct
Your team is designing the tables for a data warehouse. The data warehouse is going to
be hosted in a Dedicated SQL pool in Azure Synapse Analytics. The following tables are
going to be hosted initially in the pool
You have to choose the right distribution for each table and ensure that data movement across tables is minimized.
Which of the following distribution types would you choose for the Date table?
● Hash
● Round Robin
● Replicated
● (Correct)
Explanation
Since this is a dimension table and you need to ensure data movement is minimized, you
should choose a replicated table so that the data is available on all compute nodes in
the SQL pool.
For more information on replicated table design, one can visit the following URL
https://docs.microsoft.com/en-us/azure/synapse-analytics/sql-data-warehouse/design-guidance-for-replicated-tables
Question 6: Correct
Your team has several Azure Stream Analytics jobs in place. They need to make use of
several windowing functions based on the needed requirement. Which of the following
windowing functions can be used for the below requirement?
“Ensure that the data stream is segmented into distinct time segments and ensure that
events don’t overlap.”
● Sliding window
● Session window
● Tumbling window
● (Correct)
● Hopping window
Explanation
Here we need to use the Tumbling window, because tumbling windows segment the stream into fixed-size, contiguous time segments in which events do not overlap.
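A hedged sketch of a Stream Analytics query using a tumbling window is shown below; the input, output and column names are assumptions.
-- Count events per device in distinct, non-overlapping 10-second segments
SELECT DeviceId, COUNT(*) AS EventCount
INTO [synapse-output]
FROM [eventhub-input] TIMESTAMP BY EventTime
GROUP BY DeviceId, TumblingWindow(second, 10)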
For more information on Azure Stream Analytics windowing functions, one can visit the
following URL
https://docs.microsoft.com/en-us/azure/stream-analytics/stream-analytics-window-functions
Question 7: Correct
Your team has several Azure Stream Analytics jobs in place. They need to make use of
several windowing functions based on the needed requirement. Which of the following
windowing functions can be used for the below requirement?
“Ensure to output events only for points in time when the content of the window
actually changes”
● Sliding window
● (Correct)
● Session window
● Tumbling window
● Hopping window
Explanation
Here we need to use the Sliding window, because a sliding window produces output only when an event enters or exits the window, that is, only when the content of the window actually changes.
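A hedged sketch of a sliding window query is shown below; the input and column names are assumptions.
-- Output is produced only when an event enters or exits the window
SELECT DeviceId, COUNT(*) AS EventCount
FROM [eventhub-input] TIMESTAMP BY EventTime
GROUP BY DeviceId, SlidingWindow(minute, 5)
HAVING COUNT(*) > 3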
For more information on Azure Stream Analytics windowing functions, one can visit the
following URL
https://docs.microsoft.com/en-us/azure/stream-analytics/stream-analytics-window-functions
Question 8: Correct
Your team has several Azure Stream Analytics jobs in place. They need to make use of
several windowing functions based on the needed requirement. Which of the following
windowing functions can be used for the below requirement?
For more information on Azure Stream Analytics windowing functions, one can visit the
following URL
https://docs.microsoft.com/en-us/azure/stream-analytics/stream-analytics-window-functions
Question 9: Correct
You have to design a Fact table. The table will be used to store Orders-based data. The
size of the table will be around 10 GB on disk. The table needs to be partitioned based
on date values.
For more information on table distribution, one can visit the following URL
https://docs.microsoft.com/en-us/azure/synapse-analytics/sql-data-warehouse/sql-data-warehouse-tables-distribute
Question 10: Correct
You have to design a Fact table. The table will be used to store Orders-based data. The
size of the table will be around 10 GB on disk. The table needs to be partitioned based
on date values.
For more information on table partitioning, one can visit the following URL
https://docs.microsoft.com/en-us/azure/synapse-analytics/sql-data-warehouse/sql-data-warehouse-tables-partition
Question 11: Correct
You have to design a Fact table. The table will be used to store Orders-based data. The
size of the table will be around 10 GB on disk. The table needs to be partitioned based
on date values.
For more information on table partitioning, one can visit the following URL
https://docs.microsoft.com/en-us/azure/synapse-analytics/sql-data-warehouse/sql-data-warehouse-tables-partition
Question 12: Correct
Your team is planning on using External tables in Azure Synapse Analytics. Which of the
following can be queried via the use of External tables?
● Files in Azure File shares
● Documents in Azure Cosmos DB
● Objects in Azure Data Lake Gen2
● (Correct)
● Tables in Azure SQL Databases
Explanation
With external tables you can query data stored in Azure Blob Storage or in an Azure
Data Lake Gen2 Storage account.
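As a sketch only, an external table over data in a Data Lake Gen2 container could look like the example below; the data source, file format, folder path and columns are hypothetical and assumed to have been created beforehand.
CREATE EXTERNAL TABLE dbo.SalesExternal
(
    SaleId INT,
    Amount DECIMAL(18,2)
)
WITH
(
    LOCATION    = '/sales/',           -- folder within the container
    DATA_SOURCE = AdlsGen2DataSource,  -- from CREATE EXTERNAL DATA SOURCE
    FILE_FORMAT = ParquetFileFormat    -- from CREATE EXTERNAL FILE FORMAT
);

SELECT TOP 10 * FROM dbo.SalesExternal;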
For more information on External tables, one can visit the following URL
https://docs.microsoft.com/en-us/azure/synapse-analytics/sql/develop-tables-external-tables?tabs=hadoop
Question 13: Incorrect
Your team is planning on using External tables in Azure Synapse Analytics. The team
will be using a set of Parquet-based files hosted in an Azure Data Lake Gen2 Storage
account.
Which of the following statements is used to reference the Azure Data Lake Gen2
Storage account and the associated credentials used to access the account?
● CREATE EXTERNAL FILE FORMAT
● CREATE EXTERNAL TABLE
● (Incorrect)
● CREATE EXTERNAL DATA SOURCE
● (Correct)
Explanation
With the CREATE EXTERNAL DATA SOURCE statement, you reference the location of the
data, which in this case is the Azure Data Lake Gen2 Storage account. You also specify
the credential that will be used to access the storage account.
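A minimal sketch is shown below, assuming a database scoped credential named AdlsCredential has already been created; the storage account and container values are placeholders, and the exact LOCATION format and supported authentication options depend on whether a serverless or dedicated SQL pool is used.
CREATE EXTERNAL DATA SOURCE AdlsGen2DataSource
WITH
(
    LOCATION   = 'https://<storage-account>.dfs.core.windows.net/<container>',
    CREDENTIAL = AdlsCredential  -- credential used to access the account
);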
For more information on External tables, one can visit the following URL
https://docs.microsoft.com/en-us/azure/synapse-analytics/sql/develop-tables-external-tables?tabs=hadoop
Question 14: Correct
Your team is planning on using External tables in Azure Synapse Analytics. The team
will be using a set of Parquet-based files hosted in an Azure Data Lake Gen2 Storage
account.
Which of the following statements is used to describe the format of the files?
● CREATE EXTERNAL FILE FORMAT
● (Correct)
● CREATE EXTERNAL TABLE
● CREATE EXTERNAL DATA SOURCE
Explanation
The CREATE EXTERNAL FILE FORMAT statement is used to specify the format of the
files.
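A minimal sketch for Parquet files is shown below; the format name is an assumption.
CREATE EXTERNAL FILE FORMAT ParquetFileFormat
WITH
(
    FORMAT_TYPE = PARQUET,
    DATA_COMPRESSION = 'org.apache.hadoop.io.compress.SnappyCodec'
);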
For more information on External tables, one can visit the following URL
https://docs.microsoft.com/en-us/azure/synapse-analytics/sql/develop-tables-external-tables?tabs=hadoop
Question 15: Correct
Your data engineering team currently has the following resources defined in Azure
1) An Azure Event Hub – This is used to stream events from external data sources onto
Azure.
2) An Azure Data Lake Gen2 Storage account – This is used to store the events
streamed via Azure Event Hubs
3) An Azure Data Factory instance – This is used to build various ETL pipelines
4) An Azure Synapse Analytics workspace – This is used to host a dedicated SQL pool.
You have to build a pipeline in Azure Data Factory to copy data at regular time intervals
from the Azure Data Lake Gen2 Storage account onto tables in the dedicated SQL pool.
You have to ensure that only data within a specified time window is copied onto tables
in the dedicated SQL pool.
Which of the following would you choose as the Integration runtime type for the pipeline?
● Azure Integration runtime
● (Correct)
● Azure-SSIS Integration runtime
● Self-hosted Integration runtime
Explanation
Since the source and destination of the data are Azure-based resources, we can make
use of the Azure Integration runtime itself.
For more information on the Azure Integration runtime, one can visit the following URL
https://docs.microsoft.com/en-us/azure/data-factory/create-azure-integration-runtime?tabs=data-factory
Question 16: Incorrect
Your data engineering team currently has the following resources defined in Azure
1) An Azure Event Hub – This is used to stream events from external data sources onto
Azure.
2) An Azure Data Lake Gen2 Storage account – This is used to store the events
streamed via Azure Event Hubs
3) An Azure Data Factory instance – This is used to build various ETL pipelines
4) An Azure Synapse Analytics workspace – This is used to host a dedicated SQL pool.
You have to build a pipeline in Azure Data Factory to copy data at regular time intervals
from the Azure Data Lake Gen2 Storage account onto tables in the dedicated SQL pool.
You have to ensure that only data within a specified time window is copied onto tables
in the dedicated SQL pool.
For more information on the tumbling window trigger, one can visit the following URL
https://docs.microsoft.com/en-us/azure/data-factory/how-to-create-tumbling-window-trigger?tabs=data-factory
Question 17: Correct
Your data engineering team has a table that has the following structure in a dedicated
SQL pool in an Azure Synapse Analytics workspace
Which of the following statements can be used to implement row-level security in the
table?
● CREATE DYNAMIC MASK
● CREATE SECURITY POLICY
● (Correct)
● GRANT
● UPDATE
Explanation
You can implement row-level security with the use of the CREATE SECURITY POLICY
statement.
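A hedged sketch of a filter predicate and security policy is shown below; the table, column and function names are assumptions, since the actual table structure is not reproduced here.
-- Inline table-valued function used as the filter predicate
CREATE FUNCTION dbo.fn_securitypredicate(@SalesRep AS NVARCHAR(128))
    RETURNS TABLE
WITH SCHEMABINDING
AS
    RETURN SELECT 1 AS fn_result
           WHERE @SalesRep = USER_NAME();
GO

-- Security policy that applies the predicate to the Sales table
CREATE SECURITY POLICY SalesFilter
ADD FILTER PREDICATE dbo.fn_securitypredicate(SalesRep)
ON dbo.Sales
WITH (STATE = ON);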
For more information on row-level security, one can visit the following URL
https://docs.microsoft.com/en-us/sql/relational-databases/security/row-level-security?view=sql-server-ver15
Question 18: Correct
Your data engineering team has a table that has the following structure in a dedicated
SQL pool in an Azure Synapse Analytics workspace
Which of the following statements can be used to implement column-level security in the
table?
● CREATE DYNAMIC MASK
● CREATE SECURITY POLICY
● GRANT
● (Correct)
● UPDATE
Explanation
You can implement column-level security with the use of the GRANT statement
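A minimal sketch is shown below; the table, columns and user are assumptions.
-- The user can select only the listed columns; all other columns stay hidden
GRANT SELECT ON dbo.Membership (MemberId, FirstName, LastName) TO TestUser;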
For more information on column-level security, one can visit the following URL
https://docs.microsoft.com/en-us/azure/synapse-analytics/sql-data-warehouse/column-level-security
Question 19: Correct
Your team has an Azure Databricks workspace. They need to create two clusters. Below
are the requirements for the clusters
Which of the following would you choose as the cluster mode for Cluster 1?
● Single Node
● Standard
● High Concurrency
● (Correct)
Explanation
All of these requirements are met with the use of the High Concurrency cluster
For more information on configuring clusters in Azure Databricks, one can visit the
following URL
https://docs.microsoft.com/en-us/azure/databricks/clusters/configure
Question 20: Correct
Your team has an Azure Databricks workspace. They need to create two clusters. Below
are the requirements for the clusters
Which of the following would you choose as the cluster mode for Cluster 2?
● Single Node
● Standard
● (Correct)
● High Concurrency
Explanation
We have to choose the Standard cluster because the High Concurrency cluster does not
support Scala. Also, the Single Node cluster will not be effective for a set of users.
For more information on configuring clusters in Azure Databricks, one can visit the
following URL
https://docs.microsoft.com/en-us/azure/databricks/clusters/configure
Question 21: Correct
Your team is going to make use of Azure Data Lake Gen 2 storage accounts for storage
of data. Data will be uploaded to the Azure Data Lake Gen 2 storage account via a pipeline
in Azure Data Factory. The pipeline will run once every day.
You have to design the storage access for the storage account based on the following
requirements
1) During the first 2 weeks, the data in the storage account will be accessed frequently
2) After 2 weeks, the data will be accessed less frequently. But the data needs to be
accessed immediately whenever required.
3) After 3 months the data will be rarely accessed. Whenever an object is required, an
SLA of one day is in place to make the object available.
For more information on configuring access tiers for Azure Blob storage, one can visit
the following URL
https://docs.microsoft.com/en-us/azure/storage/blobs/access-tiers-overview
Question 22: Correct
Your team is going to make use of Azure Data Lake Gen 2 storage accounts for storage
of data. Data will be uploaded to the Azure Data Lake Gen 2 storage account via a pipeline
in Azure Data Factory. The pipeline will run once every day.
You have to design the storage access for the storage account based on the following
requirements
1) During the first 2 weeks, the data in the storage account will be accessed frequently
2) After 2 weeks, the data will be accessed less frequently. But the data needs to be
accessed immediately whenever required.
3) After 3 months the data will be rarely accessed. Whenever an object is required, an
SLA of one day is in place to make the object available.
Which of the following access tiers would you use for the objects after the first 2 weeks
and before 3 months?
● Archive
● Cool
● (Correct)
● Hot
Explanation
Since the objects are not accessed that frequently, we can choose the Cool access tier.
We will not choose the Archive access tier because the objects need to be accessed
immediately whenever required.
For more information on configuring access tiers for Azure Blob storage, one can visit
the following URL
https://docs.microsoft.com/en-us/azure/storage/blobs/access-tiers-overview
Question 23: Correct
Your team is going to make use of Azure Data Lake Gen 2 storage accounts for storage
of data. Data will be uploaded to the Azure Data Lake Gen 2 storage account via a pipeline
in Azure Data Factory. The pipeline will run once every day.
You have to design the storage access for the storage account based on the following
requirements
1) During the first 2 weeks, the data in the storage account will be accessed frequently
2) After 2 weeks, the data will be accessed less frequently. But the data needs to be
accessed immediately whenever required.
3) After 3 months the data will be rarely accessed. Whenever an object is required, an
SLA of one day is in place to make the object available.
Which of the following access tiers would you use for the objects after 3 months?
● Archive
● (Correct)
● Cool
● Hot
Explanation
We can opt for the Archive access tier since the objects are rarely accessed. Also, the
SLA of one day gives enough time to rehydrate an object whenever it is required.
For more information on configuring access tiers for Azure Blob storage, one can visit
the following URL
https://docs.microsoft.com/en-us/azure/storage/blobs/access-tiers-overview
Question 24: Correct
Your data engineering team is developing a data analytics solution. Part of the solution
is to develop a data warehousing environment. Initially the below table design has been
proposed.
For more information on understanding the star schema, one can visit the following
URL
https://docs.microsoft.com/en-us/power-bi/guidance/star-schema
Question 25: Correct
Your data engineering team is developing a data analytics solution. Part of the solution
is to develop a data warehousing environment. Initially the below table design has been
proposed.
What type of table is the Orders table going to be?
● Dimension
● Fact
● (Correct)
Explanation
As per the Star schema design, the Orders table is going to be a Fact table
For more information on understanding the star schema, one can visit the following
URL
https://docs.microsoft.com/en-us/power-bi/guidance/star-schema
Question 26: Correct
Your data engineering team is developing a data analytics solution. Part of the solution
is to develop a data warehousing environment. Initially the below table design has been
proposed.
What type of table is the Customers table going to be?
● Dimension
● (Correct)
● Fact
Explanation
As per the Star schema design, the Customers table is going to be a Dimension table
For more information on understanding the star schema, one can visit the following
URL
https://docs.microsoft.com/en-us/power-bi/guidance/star-schema
Question 27: Incorrect
Your data engineering team is developing a data analytics solution. Part of the solution
is to develop a data warehousing environment. Initially the below table design has been
proposed.
What type of dimension is the Product Table designed to be?
● Type 0
● Type 1
● (Incorrect)
● Type 2
● (Correct)
Explanation
The Product table is designed to be a Type 2 slowly changing dimension. The table has
the additional columns StartDate, EndDate and IsCurrent, which are used to keep
historical versions of each row, the hallmark of a Type 2 slowly changing dimension.
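A sketch of what such a dimension table could look like is shown below; apart from StartDate, EndDate and IsCurrent, the columns are assumptions.
CREATE TABLE dbo.DimProduct
(
    ProductKey  INT           NOT NULL, -- surrogate key per row version
    ProductId   INT           NOT NULL, -- business key
    ProductName NVARCHAR(100) NOT NULL,
    StartDate   DATE          NOT NULL, -- when this version became valid
    EndDate     DATE          NULL,     -- when this version was superseded
    IsCurrent   BIT           NOT NULL  -- flags the currently active version
);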
For more information on understanding the star schema, one can visit the following
URL
https://docs.microsoft.com/en-us/power-bi/guidance/star-schema
Question 28: Correct
Your team needs to deploy an Azure Data Lake Gen2 storage account. You have to
ensure that the Storage account remains available even if there is a region-level failure.
Costs need to be minimized wherever possible.
Which of the following do you need to enable when deploying an Azure General Purpose
V2 Storage account to ensure that it behaves as a Data Lake Gen2 Storage account?
● Enable storage account key access
● Enable hierarchical namespace
● (Correct)
● Access tier set to the Hot Access tier
● Enable large file shares
Explanation
To have a normal General Purpose V2 storage account behave as an Azure Data Lake
Gen 2 storage account, you need to enable the hierarchical namespace.
Which of the following would you choose as a redundancy option for the storage
account?
● Locally redundant storage
● Zone-redundant storage
● (Incorrect)
● Geo-redundant storage
● (Correct)
● Read Access Geo-redundant storage
Explanation
When you set the data redundancy option to Geo-redundant storage, the data in the
storage account will become available in a secondary location if the primary location
fails.
Using the redundancy option of Read Access Geo-redundant storage would increase the
costs, and we need to minimize costs wherever possible.
For more information on Azure Storage account redundancy, one can visit the following
URL
https://docs.microsoft.com/en-us/azure/storage/common/storage-redundancy
Question 30: Incorrect
Your team currently has the following resources defined on Azure
A Notebook is being developed in Scala in Azure Databricks. The Notebook will take
data from the Azure Data Lake Gen2 storage account as batch updates and save the
data onto a delta table.
For more information on working with batch workloads in delta tables, one can visit the
following URL
https://docs.microsoft.com/en-us/azure/databricks/delta/delta-batch
Question 31: Correct
Your team currently has the following resources defined on Azure
1) An Azure Data Lake Gen2 Storage account
A Notebook is being developed in Scala in Azure Databricks. The Notebook will take
data from the Azure Data Lake Gen2 storage account as batch updates and save the
data onto a delta table.
For more information on working with batch workloads in delta tables, one can visit the
following URL
https://docs.microsoft.com/en-us/azure/databricks/delta/delta-batch
Question 32: Incorrect
Your team currently has the following resources defined on Azure
A Notebook is being developed in Scala in Azure Databricks. The Notebook will take
data from the Azure Data Lake Gen2 storage account as batch updates and save the
data onto a delta table.
For more information on working with batch workloads in delta tables, one can visit the
following URL
https://docs.microsoft.com/en-us/azure/databricks/delta/delta-batch
Question 33: Correct
You have to design a Data Analytics solution for your company. You need to decide on
the services that are going to be used based on the below requirements
For more information on Azure Data Lake Gen2 Storage accounts, one can visit the
following URL
https://docs.microsoft.com/en-us/azure/storage/blobs/data-lake-storage-introduction
Question 34: Correct
You have to design a Data Analytics solution for your company. You need to decide on
the services that are going to be used based on the below requirements
For more information on Azure Databricks, one can visit the following URL
https://docs.microsoft.com/en-us/azure/databricks/scenarios/what-is-azure-databricks
Question 35: Correct
You have to design a Data Analytics solution for your company. You need to decide on
the services that are going to be used based on the below requirements
For more information on Azure Synapse, one can visit the following URL
https://docs.microsoft.com/en-us/azure/synapse-analytics/overview-what-is
Question 36: Correct
You have a table named ProductDetails hosted in a Dedicated SQL Pool in an Azure
Synapse Analytics workspace. You have to segregate the status of each product in the
table. Below is the SQL statement that needs to be completed for this requirement
Which of the following would come in Area 1?
● UPDATE
● SELECT
● CASE
● (Correct)
● ELSE
Explanation
Here we can evaluate the different product status values with the use of the CASE statement.
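A hedged sketch of how the completed statement could look is shown below; the column names and thresholds are assumptions, since the original statement is not reproduced here.
SELECT ProductId,
       CASE
            WHEN UnitsInStock = 0  THEN 'Out of stock'
            WHEN UnitsInStock < 10 THEN 'Low stock'
            ELSE 'In stock'
       END AS ProductStatus
FROM dbo.ProductDetails;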
For more information on the CASE statement, one can visit the following URL
https://docs.microsoft.com/en-us/sql/t-sql/language-elements/case-transact-sql
Question 37: Correct
You have a table named ProductDetails hosted in a Dedicated SQL Pool in an Azure
Synapse Analytics workspace. You have to segregate the status of each product in the
table. Below is the SQL statement that needs to be completed for this requirement
For more information on the CASE statement, one can visit the following URL
https://docs.microsoft.com/en-us/sql/t-sql/language-elements/case-transact-sql
Question 38: Incorrect
You have to develop the SQL statement for an Azure Stream Analytics Job. The Job will
take inputs from two separate Azure Event Hubs and then write the data to a table in a
Dedicated SQL pool in an Azure Synapse Analytics workspace.
For more information on repartitioning data, one can visit the following URL
https://docs.microsoft.com/en-us/azure/stream-analytics/repartition
Question 39: Incorrect
You have to develop the SQL statement for an Azure Stream Analytics Job. The Job will
take inputs from two separate Azure Event Hubs and then write the data to a table in a
Dedicated SQL pool in an Azure Synapse Analytics workspace.
For more information on repartitioning data, one can visit the following URL
https://docs.microsoft.com/en-us/azure/stream-analytics/repartition
Question 40: Correct
You have an Azure Databricks cluster. You want to keep the configuration of the cluster
even after it is terminated. Which of the following can you do for this requirement?
● Create a notebook in the cluster with the cluster configuration
● Pin the cluster
● (Correct)
● Configure the cluster init scripts
Explanation
If you want to maintain the configuration of the cluster, you just need to Pin the cluster.
For more information on managing clusters, one can visit the following URL
https://docs.microsoft.com/en-us/azure/databricks/clusters/clusters-manage
Question 41: Correct
Your team has an Azure Data Lake Gen2 storage account. Continuous time
series-based data is going to be streamed into the Data Lake Gen2 storage account.
Which of the following is the right design pattern to follow when it comes to the folder
structure and file naming convention for the streaming data?
● \YYYY\MM\DD\DataSet\datafile_YYYY_MM_DD.csv
● \DataSet\YYYY\MM\DD\datafile_YYYY_MM_DD.csv
● (Correct)
● \DataSet\datafile_YYYY_MM_DD.csv
Explanation
The recommendation is to have a parent folder that specifies the data source or data
set for the data. The child folders are then organized by year, then month and then day,
and finally you have the file itself.
For more information on the best practices for Azure Data Lake Storage, one can visit
the following URL
https://docs.microsoft.com/en-us/azure/storage/blobs/data-lake-storage-best-practices
Question 42: Correct
Your team currently has an Azure Stream Analytics job in place. The job is used to take
in data being streamed via Azure Event Hubs. Here log-based metrics from an
application are being streamed from Azure Event Hubs onto the Stream Analytics job.
You have to find the difference in time between the First and the Final Event in the
stream over a 2-hour duration. Which of the following would you use for this requirement?
● LIMIT
● LAST
● COLLATE
● DATEDIFF
● (Correct)
Explanation
Here we need to use the DATEDIFF function to find the time difference
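A hedged sketch of the query pattern is shown below; the input and column names are assumptions.
-- Difference between the first and the last event in each 2-hour window
SELECT DATEDIFF(second, MIN(EventTime), MAX(EventTime)) AS TimeBetweenFirstAndLast
FROM [eventhub-input] TIMESTAMP BY EventTime
GROUP BY TumblingWindow(hour, 2)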
https://docs.microsoft.com/en-us/azure/stream-analytics/stream-analytics-stream-analytics-query-patterns
Question 43: Correct
Your team currently has an Azure Stream Analytics job in place. The job is used to take
in data being streamed via Azure Event Hubs. Here log-based metrics from an
application are being streamed from Azure Event Hubs onto the Stream Analytics job.
You have to find the difference in time between the First and the Final Event in the
stream over a 2-hour duration.
https://docs.microsoft.com/en-us/azure/stream-analytics/stream-analytics-stream-analytics-query-patterns
Question 44: Incorrect
Your team currently has an Azure Stream Analytics job in place. The job is used to take
in data being streamed via Azure Event Hubs. Here log-based metrics from an
application are being streamed from Azure Event Hubs onto the Stream Analytics job.
You have to find the difference in time between the First and the Final Event in the
stream over a 2-hour duration.
https://docs.microsoft.com/en-us/azure/stream-analytics/stream-analytics-stream-analytics-query-patterns
Question 45: Incorrect
Your team needs to create an external table in an Azure Synapse Serverless SQL pool.
The table will be used to query parquet-based files in an Azure Data Lake Gen2 storage
account. Currently the storage account container is configured as shown below
You have to ensure the Serverless SQL Pool has the right authorization to query the data
in the storage account. Which of the following would you create for this requirement?
● An encryption key
● (Incorrect)
● An Azure Databricks scoped secret
● A database scoped credential
● (Correct)
Explanation
Here we need to create a database scoped credential that holds the right authorization,
such as a Shared Access Signature. This allows the external table to query the data in
the Azure Data Lake Gen2 storage account.
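A minimal sketch is shown below, assuming Shared Access Signature-based authorization; the credential, data source and placeholder values are hypothetical.
-- A master key is required before a database scoped credential can be created
CREATE MASTER KEY ENCRYPTION BY PASSWORD = '<strong password>';

CREATE DATABASE SCOPED CREDENTIAL SasCredential
WITH IDENTITY = 'SHARED ACCESS SIGNATURE',
SECRET = '<sas-token>';

CREATE EXTERNAL DATA SOURCE AdlsGen2Source
WITH
(
    LOCATION   = 'https://<storage-account>.dfs.core.windows.net/<container>',
    CREDENTIAL = SasCredential
);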
For more information on working with external tables, one can visit the below URL
https://docs.microsoft.com/en-us/azure/synapse-analytics/sql/develop-tables-external-tables