ADF - Data Transformation Activities
Activities
- Activities that you can use to transform and process your raw data into predictions and insights at scale.
Stored Procedure Activity
• Allows you to execute a stored procedure in a SQL database as part of a data pipeline.
• Supported Databases: SQL Server, Azure SQL Database, Azure Synapse Analytics, and other databases exposed through linked services.
• Key Features
• Parameter Passing: Pass parameters to the stored procedure from the ADF pipeline.
• Output Handling: The activity does not capture result sets returned by the procedure; use a Lookup activity if you need a procedure's output downstream.
• Integration: Works seamlessly with other ADF activities, such as Copy and Data Flow activities.
• Use Cases
• Data Transformation: Execute complex transformations that are easier to manage in SQL.
• Data Cleaning: Run procedures to clean or aggregate data before further processing.
• Custom Logic: Implement and manage custom business logic directly in the database.
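For illustration, a minimal T-SQL sketch of the kind of procedure such an activity might invoke; the procedure, table, and column names (dbo.usp_aggregate_daily_sales, staging.sales, dbo.daily_sales) are assumptions, not objects from the original material.

-- Hypothetical transformation procedure a Stored Procedure activity could call.
-- @RunDate would be supplied as an activity parameter from the ADF pipeline.
CREATE PROCEDURE dbo.usp_aggregate_daily_sales
    @RunDate DATE
AS
BEGIN
    SET NOCOUNT ON;

    -- Remove any earlier load for the same day so the procedure can be rerun safely.
    DELETE FROM dbo.daily_sales WHERE sales_date = @RunDate;

    -- Aggregate the raw rows into the reporting table.
    INSERT INTO dbo.daily_sales (sales_date, product_id, total_amount)
    SELECT @RunDate, product_id, SUM(amount)
    FROM staging.sales
    WHERE sales_date = @RunDate
    GROUP BY product_id;
END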
Configure a Stored Procedure activity
7. Create and configure a stored procedure to update the customer table
a. Create the table and schema in the SQL database
b. Create a stored procedure to insert values (a hedged T-SQL sketch follows)
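The original slide showed these objects as a screenshot; as a rough reconstruction, one way the table and insert procedure could look. The dbo.customer columns and the procedure name dbo.usp_insert_customer are assumptions.

-- Assumed customer table; the actual column list was not shown in the slide.
CREATE TABLE dbo.customer
(
    customer_id   INT IDENTITY(1,1) PRIMARY KEY,
    customer_name VARCHAR(100),
    email         VARCHAR(100),
    modified_date DATETIME DEFAULT GETDATE()
);
GO

-- Assumed procedure that inserts a value; the Stored Procedure activity would
-- call it and pass @CustomerName and @Email as activity parameters.
CREATE PROCEDURE dbo.usp_insert_customer
    @CustomerName VARCHAR(100),
    @Email        VARCHAR(100)
AS
BEGIN
    INSERT INTO dbo.customer (customer_name, email)
    VALUES (@CustomerName, @Email);
END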
Databricks Notebook Activity
• Key Features
• Seamless Execution: Run Databricks notebooks directly from ADF pipelines.
• Parameterization: Pass parameters from ADF to Databricks notebooks.
• Integration: Utilize notebook results in downstream ADF activities.
• Use Cases
• Data Transformation: Execute complex transformations that are better suited for Databricks notebooks.
• Machine Learning: Run machine learning models and training scripts written in notebooks.
• Data Integration: Integrate Databricks processing with other ADF activities like data copy and data flow.
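As a rough illustration, a Databricks SQL notebook cell of the kind the Notebook activity might execute; the widget and table names (load_date, bronze.orders, silver.orders_clean) are assumptions. A base parameter passed from the ADF activity surfaces in the notebook as a widget value.

-- Sketch of a Databricks SQL notebook cell. The load_date widget picks up the
-- base parameter passed from the ADF Notebook activity; the default is used
-- when the notebook is run interactively.
CREATE WIDGET TEXT load_date DEFAULT "2024-01-01";

-- Write a cleaned version of the raw orders for the requested load date.
CREATE OR REPLACE TABLE silver.orders_clean AS
SELECT
    order_id,
    customer_id,
    CAST(amount AS DECIMAL(10, 2)) AS amount
FROM bronze.orders
WHERE order_date = getArgument("load_date")
  AND amount IS NOT NULL;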
8. Configure Databricks Activity
1. Search for Notebook in the pipeline Activities pane, and drag a Notebook activity to the
pipeline canvas.
2. Select the new Notebook activity on the canvas if it is not already selected.
3. Select the Azure Databricks tab to select or create a new Azure Databricks linked service that
will execute the Notebook activity.
4. Select the Settings tab and specify the notebook path to be executed on Azure Databricks, optional base parameters to be passed to the notebook, and any additional libraries to be installed on the cluster to execute the job.
Azure Function Activity
• Integrate and execute Azure Functions from within an Azure Data Factory (ADF) pipeline.
• Functionality: Allows for serverless compute tasks, enabling custom processing logic as part of your data
workflows.
• Key Features
• Serverless Execution: Leverage Azure Functions to run code without managing infrastructure.
• Custom Logic: Execute custom logic or APIs directly from your ADF pipeline.
• Parameter Passing: Send parameters to Azure Functions and retrieve results
• Use Cases
• Custom Processing: Implement custom data transformations or processing logic.
• API Calls: Trigger APIs or microservices that are encapsulated in Azure Functions.
• Data Enrichment: Enhance or transform data by executing complex logic in serverless functions.
9. Configure Azure Function Activity
1. Expand the Azure Function section of the pipeline Activities pane, and drag an Azure Function activity to the pipeline canvas.
2. Select the new Azure Function activity on the canvas if it is not already selected, and its Settings tab, to edit its details.
3. If you do not already have an Azure Function linked service defined, select New to create a new one. In the new Azure Function linked service pane, choose your existing Azure Function App URL and provide a Function Key.
4. After selecting the Azure Function linked service, provide the function name and other details to complete the configuration.
10. Copy data from a SQL database to Blob storage and log the pipeline status in a log table
a. Create a copy activity to copy data from SQL to CSV as shown before.
b. Create an audit table in the SQL database to log the execution details:

create table audit.copylog
(
    loadid INT IDENTITY(1,1) PRIMARY KEY,
    tablename varchar(50),
    loadstatus varchar(50),
    dataread varchar(50),
    errorid varchar(50),
    errormessage varchar(50)
)

c. Create a stored procedure for logging the execution details:
CREATE PROCEDURE audit.loadstatus
    @TableName varchar(50),
    @loadstatus varchar(50),
    @dataread varchar(50),
    @errorid varchar(50),
    @errormessage varchar(50)
AS
BEGIN
    INSERT INTO audit.copylog (tablename, loadstatus, dataread, errorid, errormessage)
    VALUES (@TableName, @loadstatus, @dataread, @errorid, @errormessage)
END
d. Attach a Stored Procedure activity to the success output of the copy activity to log the success details.
e. Configure the activity to use the audit.loadstatus stored procedure created above.
f. Pass values for the parameters (the two error expressions apply when a similar logging activity is attached to the failure output):
   Dataread = @activity('copy_data_sql_csv').output.dataRead
   Loadstatus = @activity('copy_data_sql_csv').output.executionDetails[0].status
   Tablename = @concat(pipeline().parameters.schemaname, '_', pipeline().parameters.tablename)
   Errorid = @activity('copy_data_sql_csv').output.errors[0].code
   Errormessage = @activity('copy_data_sql_csv').output.errors[0].message
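To sanity-check the audit objects outside the pipeline, a quick manual test; the sample values below are made up for illustration.

-- Manually exercise the logging procedure with sample values.
EXEC audit.loadstatus
    @TableName    = 'dbo_customer',
    @loadstatus   = 'Succeeded',
    @dataread     = '1048576',
    @errorid      = NULL,
    @errormessage = NULL;

-- Inspect the rows written to the audit table.
SELECT loadid, tablename, loadstatus, dataread, errorid, errormessage
FROM audit.copylog
ORDER BY loadid DESC;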
Summary of Data Transformation Activities
• Mapping Data Flows:
• Visual design of data transformations without coding.
• Executes as activities within pipelines on scalable Spark clusters.
• Integrates with scheduling, control, and monitoring features.
• HDInsight Activities:
• Hive: Execute Hive queries on HDInsight clusters.
• Pig: Execute Pig queries on HDInsight clusters.
• MapReduce: Run MapReduce programs on HDInsight clusters.
• Streaming: Execute Hadoop Streaming programs on HDInsight clusters.
• Spark: Run Spark programs on HDInsight clusters.
• ML Studio (Classic):
• Support ends on August 31, 2024.
• Use Batch Execution to run predictions; update models with the Update Resource activity
• Data Wrangling:
• Code-free data preparation using Power Query.
• Supports iterative, cloud-scale data wrangling via Spark execution.
• Note: Power Query is supported only in Azure Data Factory.
• Stored Procedure Activity:
• Invoke stored procedures in various SQL-based data stores.
• Data Lake Analytics U-SQL Activity:
• Run U-SQL scripts on Data Lake Analytics clusters.
• Azure Synapse & Databricks Activities:
• Synapse Notebook: Run Synapse notebooks.
• Databricks Notebook, Jar, Python: Run notebooks, Jars, and Python scripts on Databricks
clusters.
• Custom Activity:
• Create custom transformations with your own logic using Azure Batch or HDInsight.