Big Data and Visualization Hands-Steps-1
Big Data and Visualization Hands-Steps-1
Duration: 30 minutes
In this exercise, you will set up your environment for use in the rest of the hands-on
lab. You should follow all the steps provided in the Before the Hands-on Lab section to
prepare your environment before attending the hands-on lab.
Note: To view the Azure portal menu, select the menu icon in the upper left-hand
corner.
3. Set the following configuration on the Azure Databricks Service creation form:
o Subscription: Select the subscription you are using for this hands-on lab.
o Resource Group: Select Create new and enter a unique name, such
as hands-on-lab-bigdata
o Location: Select a region close to you. (If you are using an Azure Pass,
select South Central US.)
2. Select Create.
3. Set the following configuration on the Azure Storage account creation form:
o Subscription: Select the subscription you are using for this hands-on lab.
o Resource group: Select the same resource group you created at the
beginning of this lab.
o Location: Select the same region you used for Azure Databricks.
o Performance: Standard
1. From the side menu in the Azure portal, choose Resource groups, then enter
your resource group name into the filter box, and select it from the list.
2. Next, select your lab Azure Storage account from the list.
3. Select Containers (1) from the menu. Select + Container (2) on the Containers
blade, enter sparkcontainer for the name (3), leaving the public access level set
to Private. Select Create (4) to create the container.
o Subscription: Select the subscription you are using for this hands-on lab.
o Resource Group: Select the same resource group you created at the
beginning of this lab.
o Version: Select V2
Understanding Data Factory Location: The Data Factory location is where the
metadata of the data factory is stored and where the triggering of the pipeline is
initiated from. Meanwhile, a data factory can access data stores and compute
services in other Azure regions to move data between data stores or process
data using compute services. This behavior is realized through the globally
available IR to ensure data compliance, efficiency, and reduced network egress
costs.
The IR Location defines the location of its back-end compute, and essentially the
location where the data movement, activity dispatching, and SSIS package
execution are performed. The IR location can be different from the location of
the data factory it belongs to.
4. Select Create to finish and submit.
You should follow all these steps provided before attending the Hands-on lab.