0% found this document useful (0 votes)

30 views7 pages

Big Data and Visualization Hands-Steps-1

Uploaded by

SunkaraVenkataramana

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

30 views7 pages

Big Data and Visualization Hands-Steps-1

Uploaded by

SunkaraVenkataramana

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Before the hands-on lab

Duration: 30 minutes

In this exercise, you will set up your environment for use in the rest of the hands-on
lab. You should follow all the steps provided in the Before the Hands-on Lab section to
prepare your environment before attending the hands-on lab.

Task 1: Provision Azure Databricks

Azure Databricks is an Apache Spark-based analytics platform optimized for Azure. It
will be used in this lab to build and train a machine learning model used to predict flight
delays.

Note: To view the Azure portal menu, select the menu icon in the upper left-hand
corner.

1. In the Azure Portal (https://portal.azure.com), select + Create a resource within

the portal menu, then type "Azure Databricks" into the search bar. Select Azure
Databricks from the results.
2. Select Create.

3. Set the following configuration on the Azure Databricks Service creation form:

o Subscription: Select the subscription you are using for this hands-on lab.

o Resource Group: Select Create new and enter a unique name, such
as hands-on-lab-bigdata

o Workspace name: Enter a unique name, this is indicated by a green

checkmark.

o Location: Select a region close to you. (If you are using an Azure Pass,
select South Central US.)

o Pricing: Select Premium (+ Role-based access controls)

4. Select Review + Create.

5. Wait for validation to pass, then select Create.

Task 2: Create Azure Storage account
Create a new Azure Storage account that will be used to store historic and scored flight
and weather data sets for the lab.

1. In the Azure Portal (https://portal.azure.com), select + Create a resource, then

type "storage" into the search bar. Select Storage account from the results.

2. Select Create.

3. Set the following configuration on the Azure Storage account creation form:

o Subscription: Select the subscription you are using for this hands-on lab.

o Resource group: Select the same resource group you created at the
beginning of this lab.

o Storage account name: Enter a unique name, this is indicated by a green

checkmark.

o Location: Select the same region you used for Azure Databricks.

o Performance: Standard

o Account kind: BlobStorage

o Replication: Read-access geo-redundant storage (RA-GRS)

o Access tier: Hot

4. Select Review + create.

5. Wait for validation to pass, then select Create.

Task 3: Create storage container

In this task, you will create a storage container in which you will store your flight and
weather data files.

1. From the side menu in the Azure portal, choose Resource groups, then enter
your resource group name into the filter box, and select it from the list.

2. Next, select your lab Azure Storage account from the list.
3. Select Containers (1) from the menu. Select + Container (2) on the Containers
blade, enter sparkcontainer for the name (3), leaving the public access level set
to Private. Select Create (4) to create the container.

Task 4: Provision Azure Data Factory

Create a new Azure Data Factory instance that will be used to orchestrate data
transfers for analysis.

1. In the Azure Portal (https://portal.azure.com), select + Create a resource, then

type "Data Factory" into the search bar. Select Data Factory from the results.
2. Select Create.

3. Set the following configuration on the Data Factory creation form:

o Name: Enter a unique name, this is indicated by a green checkmark.

o Subscription: Select the subscription you are using for this hands-on lab.

o Resource Group: Select the same resource group you created at the
beginning of this lab.

o Version: Select V2

o Location: Select any region close to you.

o Enable GIT: Unchecked

Understanding Data Factory Location: The Data Factory location is where the
metadata of the data factory is stored and where the triggering of the pipeline is
initiated from. Meanwhile, a data factory can access data stores and compute
services in other Azure regions to move data between data stores or process
data using compute services. This behavior is realized through the globally
available IR to ensure data compliance, efficiency, and reduced network egress
costs.

The IR Location defines the location of its back-end compute, and essentially the
location where the data movement, activity dispatching, and SSIS package
execution are performed. The IR location can be different from the location of
the data factory it belongs to.
4. Select Create to finish and submit.

Task 5: Download and install Power BI Desktop

Power BI desktop is required to make a connection to your Azure Databricks
environment when creating the Power BI dashboard.

1. Download and install Power BI Desktop.

You should follow all these steps provided before attending the Hands-on lab.

Azure DataBricks
No ratings yet
Azure DataBricks
37 pages
Azure Data Engineering Project Part 1
No ratings yet
Azure Data Engineering Project Part 1
41 pages
200T01-A: Implementing An Azure Data Solution: Course Outline Module 1: Azure For The Data Engineer
No ratings yet
200T01-A: Implementing An Azure Data Solution: Course Outline Module 1: Azure For The Data Engineer
4 pages
DP-100 Microsoft Exam Practice Questions
No ratings yet
DP-100 Microsoft Exam Practice Questions
56 pages
Azure Project
No ratings yet
Azure Project
13 pages
Azure DATA Fatcory
No ratings yet
Azure DATA Fatcory
2,982 pages
Az, Mic, Gog, Presentations
No ratings yet
Az, Mic, Gog, Presentations
145 pages
DP 200
No ratings yet
DP 200
370 pages
Standard NF Workbook 23G31
No ratings yet
Standard NF Workbook 23G31
128 pages
Session 6 - Azure Case Study - Covid 19
No ratings yet
Session 6 - Azure Case Study - Covid 19
42 pages
M01 - Fundamentals
No ratings yet
M01 - Fundamentals
32 pages
DP 900t00a Enu Powerpoint 04
No ratings yet
DP 900t00a Enu Powerpoint 04
23 pages
Study Guide For Exam DP-203 - Data Engineering On Microsoft Azure - Microsoft Learn
No ratings yet
Study Guide For Exam DP-203 - Data Engineering On Microsoft Azure - Microsoft Learn
4 pages
Big Data and Visualization
No ratings yet
Big Data and Visualization
141 pages
Azure de Project
No ratings yet
Azure de Project
73 pages
Azure Databricks Course Slide Deck
75% (4)
Azure Databricks Course Slide Deck
169 pages
Azure Data Engineering Training Raxicube Technologies
No ratings yet
Azure Data Engineering Training Raxicube Technologies
8 pages
ADE Project Amit
No ratings yet
ADE Project Amit
17 pages
ADE Azure Data Engineer Interview
No ratings yet
ADE Azure Data Engineer Interview
12 pages
Azure Databricks Documentation
No ratings yet
Azure Databricks Documentation
7,197 pages
Lab 2 - Working With Data Storage
No ratings yet
Lab 2 - Working With Data Storage
15 pages
Solution (Updated)
No ratings yet
Solution (Updated)
61 pages
Data Analyst Azure PowerBI Syllabus
No ratings yet
Data Analyst Azure PowerBI Syllabus
35 pages
DP 600t00a Enu Powerpoint 02
No ratings yet
DP 600t00a Enu Powerpoint 02
30 pages
f4b7901ed5e5f9106a3a82eea2e2f003
No ratings yet
f4b7901ed5e5f9106a3a82eea2e2f003
3,614 pages
Automotive Telematics
100% (1)
Automotive Telematics
118 pages
Azure Databricks
No ratings yet
Azure Databricks
21 pages
Data Factory, Data Integration
No ratings yet
Data Factory, Data Integration
2,034 pages
Azure Data Engr POC - S For Interns
No ratings yet
Azure Data Engr POC - S For Interns
9 pages
Databricks Guide
No ratings yet
Databricks Guide
27 pages
Azure Databricks Documentation
No ratings yet
Azure Databricks Documentation
32 pages
DP 3011 ENU PowerPoint - 01 Content
No ratings yet
DP 3011 ENU PowerPoint - 01 Content
42 pages
Azure Data Factory - Pratap - Qbex Technologies - 8886230001
No ratings yet
Azure Data Factory - Pratap - Qbex Technologies - 8886230001
4 pages
Azure Data Factory
No ratings yet
Azure Data Factory
18 pages
2023-IDA Custom Bootcamp Curriculum Day Wise Curriculum v0.1
No ratings yet
2023-IDA Custom Bootcamp Curriculum Day Wise Curriculum v0.1
122 pages
ETL Azure
No ratings yet
ETL Azure
12 pages
Databricks Lab 1
100% (3)
Databricks Lab 1
7 pages
Lab 2 - Setting Up Azure Databricks Workspace & Cluster
No ratings yet
Lab 2 - Setting Up Azure Databricks Workspace & Cluster
3 pages
ADF Hands-On
No ratings yet
ADF Hands-On
98 pages
Data Factory
No ratings yet
Data Factory
1,158 pages
Lab 6 - Performing Real-Time Analytics With Stream Analytics
No ratings yet
Lab 6 - Performing Real-Time Analytics With Stream Analytics
17 pages
Reference Guide - DP-203 Collection - v2
No ratings yet
Reference Guide - DP-203 Collection - v2
3 pages
Lab 3 - Enabling Team Based Data Science With Azure Databricks
No ratings yet
Lab 3 - Enabling Team Based Data Science With Azure Databricks
18 pages
dp-203 Notes1
No ratings yet
dp-203 Notes1
12 pages
Lab 2 - Working With Data Storage
No ratings yet
Lab 2 - Working With Data Storage
15 pages
Orchestrating Big Data Solutions With Azure Data Factory: Setup Guide
No ratings yet
Orchestrating Big Data Solutions With Azure Data Factory: Setup Guide
2 pages
Final Report Restaurant Management System
No ratings yet
Final Report Restaurant Management System
36 pages
Exam DP 100 Data Science Solution On Azure Skills Measured
No ratings yet
Exam DP 100 Data Science Solution On Azure Skills Measured
9 pages
Azure Data Engineer + Databricks Content
No ratings yet
Azure Data Engineer + Databricks Content
7 pages
DP-100 - Designing and Implementing A Data Science
No ratings yet
DP-100 - Designing and Implementing A Data Science
9 pages
Azure DataEngineer Course Outline
No ratings yet
Azure DataEngineer Course Outline
4 pages
Azure Datalake
No ratings yet
Azure Datalake
8 pages
AI 100 Labs
No ratings yet
AI 100 Labs
99 pages
Start To Finish With Azure Data Factory
100% (2)
Start To Finish With Azure Data Factory
30 pages
DP 203T00A ENU AssessmentGuide
No ratings yet
DP 203T00A ENU AssessmentGuide
13 pages
Microsoft Certified: Azure Data Scientist Associate - Skills Measured
No ratings yet
Microsoft Certified: Azure Data Scientist Associate - Skills Measured
4 pages
Lab 7 - Orchestrating Data Movement With Azure Data Factory
No ratings yet
Lab 7 - Orchestrating Data Movement With Azure Data Factory
26 pages
Integrated Bridge Systems (IBS) : T.C. Dokuz Eylül University Maritime Faculty Marine Transportation Engineering
100% (7)
Integrated Bridge Systems (IBS) : T.C. Dokuz Eylül University Maritime Faculty Marine Transportation Engineering
23 pages
FB 1200
No ratings yet
FB 1200
110 pages
T WebServicesAPIv1 6 PDF
0% (1)
T WebServicesAPIv1 6 PDF
373 pages
Setup Guide
No ratings yet
Setup Guide
2 pages
Lab 3 - Enabling Team Based Data Science With Azure Databricks
No ratings yet
Lab 3 - Enabling Team Based Data Science With Azure Databricks
18 pages
Course Review and Exam Tips: AZ-900 Outline Objectives (From Microsoft) Covered in ACG Course Section Lesson(s) /lab(s)
No ratings yet
Course Review and Exam Tips: AZ-900 Outline Objectives (From Microsoft) Covered in ACG Course Section Lesson(s) /lab(s)
11 pages
Exam DP 100 Data Science Solution On Azure Skills Measured
No ratings yet
Exam DP 100 Data Science Solution On Azure Skills Measured
6 pages
Kcs School Fee 3112020
0% (1)
Kcs School Fee 3112020
1 page
Huawei Network Solution Overview v2
No ratings yet
Huawei Network Solution Overview v2
77 pages
CC MCQ Unit-3
No ratings yet
CC MCQ Unit-3
3 pages
CRM Practices of Amazon
No ratings yet
CRM Practices of Amazon
45 pages
RHEL 6 - 6.2 Technical Notes
No ratings yet
RHEL 6 - 6.2 Technical Notes
496 pages
Intel® Desktop Board D975XBX2: Technical Product Specification
No ratings yet
Intel® Desktop Board D975XBX2: Technical Product Specification
106 pages
Voip H.323: Session No.7
No ratings yet
Voip H.323: Session No.7
60 pages
Intro Prompt Design - Ipynb
No ratings yet
Intro Prompt Design - Ipynb
18 pages
MSI MS-1795 User Manual
No ratings yet
MSI MS-1795 User Manual
58 pages
UAN: 111-IBM-IBM, Website: WWW - ibm.Com/Pk, E-Mail: Ibm - Pakistan@
No ratings yet
UAN: 111-IBM-IBM, Website: WWW - ibm.Com/Pk, E-Mail: Ibm - Pakistan@
8 pages
'Computer Project' On Topic Google Apps
No ratings yet
'Computer Project' On Topic Google Apps
9 pages
The Complete Servicenow System Administrator Course
0% (1)
The Complete Servicenow System Administrator Course
14 pages
CSC159 Ch2 Numbering System
No ratings yet
CSC159 Ch2 Numbering System
23 pages
RF-BM-ND04 Hardware Datasheet V1.2
No ratings yet
RF-BM-ND04 Hardware Datasheet V1.2
19 pages
Rexelite Tutorial
No ratings yet
Rexelite Tutorial
5 pages
Data Model Changes Regarding SD Index Tables: Document Version Status Date 1.0 Final October 20, 2015
No ratings yet
Data Model Changes Regarding SD Index Tables: Document Version Status Date 1.0 Final October 20, 2015
19 pages
Chapter 9: Strings and Arrays
No ratings yet
Chapter 9: Strings and Arrays
58 pages
Bank Additional Names
No ratings yet
Bank Additional Names
1 page
Probability DAY01
No ratings yet
Probability DAY01
8 pages
Business Models in Two-Sided Markets - An Assessment of Strategies
No ratings yet
Business Models in Two-Sided Markets - An Assessment of Strategies
13 pages
Update Gateway
No ratings yet
Update Gateway
2 pages
DB Tata Tchitchikoshvili HW
No ratings yet
DB Tata Tchitchikoshvili HW
8 pages
13.feature Usage Card - FLP
No ratings yet
13.feature Usage Card - FLP
3 pages
NCCR Tracker 3G
No ratings yet
NCCR Tracker 3G
12 pages
Confusion Matrix: Logistic Regression
No ratings yet
Confusion Matrix: Logistic Regression
2 pages
Assignment Computer Application in Business: Mr. Shahid Waseem
No ratings yet
Assignment Computer Application in Business: Mr. Shahid Waseem
10 pages
6 Waterfall Charts
No ratings yet
6 Waterfall Charts
1 page
DGCA Module 08 MARCH 2017 HANDWRITTEN SET 1 & 2 PDF
No ratings yet
DGCA Module 08 MARCH 2017 HANDWRITTEN SET 1 & 2 PDF
5 pages
Rapidstream: P2P Streaming On Android: Philipp M. Eittenberger, Matthias Herbst, Udo R. Krieger
No ratings yet
Rapidstream: P2P Streaming On Android: Philipp M. Eittenberger, Matthias Herbst, Udo R. Krieger
6 pages
Bassmix
No ratings yet
Bassmix
6 pages
Likhit Hegu
No ratings yet
Likhit Hegu
3 pages
Big Data Workshop Contents
No ratings yet
Big Data Workshop Contents
2 pages

Big Data and Visualization Hands-Steps-1

Uploaded by

Big Data and Visualization Hands-Steps-1

Uploaded by

Before the hands-on lab

Task 1: Provision Azure Databricks

1. In the Azure Portal (https://portal.azure.com), select + Create a resource within

o Workspace name: Enter a unique name, this is indicated by a green

o Pricing: Select Premium (+ Role-based access controls)

4. Select Review + Create.

5. Wait for validation to pass, then select Create.

1. In the Azure Portal (https://portal.azure.com), select + Create a resource, then

o Storage account name: Enter a unique name, this is indicated by a green

o Account kind: BlobStorage

o Replication: Read-access geo-redundant storage (RA-GRS)

o Access tier: Hot

5. Wait for validation to pass, then select Create.

Task 3: Create storage container

Task 4: Provision Azure Data Factory

1. In the Azure Portal (https://portal.azure.com), select + Create a resource, then

3. Set the following configuration on the Data Factory creation form:

o Name: Enter a unique name, this is indicated by a green checkmark.

o Location: Select any region close to you.

o Enable GIT: Unchecked

Task 5: Download and install Power BI Desktop

1. Download and install Power BI Desktop.

You might also like