
Introduction to Azure Data Factory



Azure Data Factory
 In the world of big data, raw, unorganized data is often stored in relational, non-relational, and other storage systems

 The raw data doesn't have the proper context or meaning to provide meaningful insights to analysts, data scientists, or business decision makers

 Big data requires a service that can orchestrate and operationalize processes to refine these enormous stores of raw data into actionable business insights

 Azure Data Factory is a managed cloud service that's built for these complex hybrid extract-transform-load (ETL), extract-load-transform (ELT), and data integration projects



Usage scenarios
 Gaming company
 Petabytes of game logs
 Analyze logs to gain insights into customer preferences, demographics, and usage behavior
 Wants to identify up-sell and cross-sell opportunities, develop compelling new features, drive business growth, and provide a better experience to its customers
 Needs reference data from the on-premises data center: customer information, game information, marketing campaign information
 To extract insights
 Join the data by using a Spark cluster in the cloud (Azure HDInsight)
 Publish the transformed data into a cloud data warehouse such as Azure Synapse
 Wants to automate this workflow, and monitor and manage it on a daily schedule
 Wants to execute it when files land in a blob store container
 Azure Data Factory is the platform that solves such data scenarios
Pipelines and activities



Pipelines and activities
 A pipeline is a logical grouping of activities that together perform a task.
 A Data Factory can have one or more pipelines

 For example, a pipeline could contain a set of activities that


 Ingest and clean log data,
 then kick off a mapping data flow to analyze the log data

 The pipeline allows you to manage the activities as a set instead of each one individually

 You deploy and schedule the pipeline instead of the activities independently



Pipelines and activities
 The activities in a pipeline define actions to perform on your data

 For example, you may use a

 Copy activity to copy data from SQL Server to an Azure Blob Storage.

 Then, use a data flow activity or a Databricks Notebook activity to process and transform data from the blob storage to an Azure Synapse Analytics pool

 On top of which business intelligence reporting solutions are built

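As a rough illustration of the Copy activity step above, here is a minimal sketch using the azure-mgmt-datafactory Python SDK. It is not the author's exact setup: the subscription ID, resource group, factory, dataset, and pipeline names are all placeholders, and it assumes the referenced blob datasets and their linked service already exist (linked services and datasets are covered in the next sections).

```python
# Minimal sketch: a pipeline whose Copy activity reads one blob dataset and
# writes another. All names below are hypothetical placeholders.
from azure.identity import DefaultAzureCredential
from azure.mgmt.datafactory import DataFactoryManagementClient
from azure.mgmt.datafactory.models import (
    PipelineResource, CopyActivity, DatasetReference, BlobSource, BlobSink
)

adf_client = DataFactoryManagementClient(DefaultAzureCredential(), "<subscription-id>")
rg, df = "my-rg", "my-data-factory"  # hypothetical resource group / factory

copy = CopyActivity(
    name="CopyLogsToStaging",
    inputs=[DatasetReference(type="DatasetReference", reference_name="InputLogsDataset")],
    outputs=[DatasetReference(type="DatasetReference", reference_name="StagingDataset")],
    source=BlobSource(),
    sink=BlobSink(),
)

# The pipeline groups its activities, so they deploy and schedule as one unit
pipeline = PipelineResource(activities=[copy])
adf_client.pipelines.create_or_update(rg, df, "IngestLogsPipeline", pipeline)

# Kick off a one-off run and keep the run ID for monitoring
run = adf_client.pipelines.create_run(rg, df, "IngestLogsPipeline", parameters={})
print("Pipeline run ID:", run.run_id)
```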


Pipelines and activities
 Azure Data Factory has three groupings of activities:

1. Data movement activities

2. Data transformation activities

3. Control flow activities

 An activity can take zero or more input datasets and produce one or more output datasets



Pipelines and activities

[Figure: Relationship between pipeline, activity, and dataset (credit: Azure Cloud)]


Pipelines and activities
 Azure documentation link: https://learn.microsoft.com/en-us/azure/data-factory/concepts-pipelines-activities?tabs=data-factory



Linked services & Datasets


Linked services
 Linked services are much like connection strings,
 which define the connection information needed for the service to connect to external resources.

 These resources can be on-premises or in the cloud, and they can include data stores, compute resources, and other Azure services

 For example, an Azure Storage linked service links a storage account to the service

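A minimal sketch of registering such a linked service through the Python SDK; the account name/key and all resource names are placeholders:

```python
# Minimal sketch: a linked service is essentially a named connection string.
from azure.identity import DefaultAzureCredential
from azure.mgmt.datafactory import DataFactoryManagementClient
from azure.mgmt.datafactory.models import (
    LinkedServiceResource, AzureStorageLinkedService, SecureString
)

adf_client = DataFactoryManagementClient(DefaultAzureCredential(), "<subscription-id>")
rg, df = "my-rg", "my-data-factory"  # hypothetical names

conn = SecureString(
    value="DefaultEndpointsProtocol=https;AccountName=<account>;AccountKey=<key>")
ls = LinkedServiceResource(properties=AzureStorageLinkedService(connection_string=conn))
adf_client.linked_services.create_or_update(rg, df, "AzureStorageLinkedService", ls)
```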


Linked Services

[Diagram: one ADF instance with three linked services: connection string 1 for Azure Blob Storage, connection string 2 for Azure SQL Database, connection string 3 for Amazon S3]


Datasets
 A dataset is a named view of data that simply points to or references the data you want to use in your activities as inputs and outputs

 Datasets identify data within different data stores, such as tables, files, folders, and documents

 For example, an Azure Blob dataset specifies the blob container and folder in Blob Storage from which the activity should read the data

 Before you create a dataset, you must create a linked service to link your data store to the service

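A minimal sketch of creating a blob dataset on top of the linked service above; the folder, file, and resource names are placeholders:

```python
# Minimal sketch: a blob dataset pointing at a folder/file through the
# linked service registered earlier.
from azure.identity import DefaultAzureCredential
from azure.mgmt.datafactory import DataFactoryManagementClient
from azure.mgmt.datafactory.models import (
    DatasetResource, AzureBlobDataset, LinkedServiceReference
)

adf_client = DataFactoryManagementClient(DefaultAzureCredential(), "<subscription-id>")
rg, df = "my-rg", "my-data-factory"

ls_ref = LinkedServiceReference(type="LinkedServiceReference",
                                reference_name="AzureStorageLinkedService")
ds = DatasetResource(properties=AzureBlobDataset(
    linked_service_name=ls_ref,
    folder_path="gamelogs/input",  # container/folder the activity reads from
    file_name="logs.txt",
))
adf_client.datasets.create_or_update(rg, df, "InputLogsDataset", ds)
```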


Datasets

[Diagram: datasets reference data exposed through linked services: images, videos, and docs in Azure Blob Storage (Linked Service 1); tables in Azure SQL Database (Linked Service 2); images, videos, and docs in Amazon S3 (Linked Service 3)]


Linked services & Datasets

[Figure: how linked services and datasets relate (credit: Azure Cloud)]


Triggers


Triggers
 A trigger in Azure Data Factory is a mechanism that determines when to start, or invoke, an end-to-end pipeline execution

 Triggers can be scheduled to run at specific times or intervals, or they can be event-based



Types of Triggers
 Schedule trigger:
 Runs a pipeline at a specified time or interval (see the sketch after this list)

 Tumbling window trigger:
 Runs a pipeline on a regular schedule,
 But only processes data that has arrived within a specific time window

 Storage event trigger:
 Runs a pipeline when a specific event occurs in Azure Storage,
 Such as when a new file is uploaded or when a file is deleted

 Custom event trigger:
 Runs a pipeline when a specific event occurs in an external system
 Custom events can be raised by other Azure services, such as Azure Event Grid, or by third-party applications
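As referenced in the list above, here is a minimal sketch of a schedule trigger via the Python SDK; the names and recurrence are placeholders, and a trigger must be explicitly started after creation:

```python
# Minimal sketch: run "IngestLogsPipeline" every 15 minutes.
from datetime import datetime, timedelta
from azure.identity import DefaultAzureCredential
from azure.mgmt.datafactory import DataFactoryManagementClient
from azure.mgmt.datafactory.models import (
    TriggerResource, ScheduleTrigger, ScheduleTriggerRecurrence,
    TriggerPipelineReference, PipelineReference
)

adf_client = DataFactoryManagementClient(DefaultAzureCredential(), "<subscription-id>")
rg, df = "my-rg", "my-data-factory"

trigger = TriggerResource(properties=ScheduleTrigger(
    pipelines=[TriggerPipelineReference(
        pipeline_reference=PipelineReference(type="PipelineReference",
                                             reference_name="IngestLogsPipeline"),
        parameters={})],
    recurrence=ScheduleTriggerRecurrence(
        frequency="Minute", interval=15,
        start_time=datetime.utcnow() + timedelta(minutes=1), time_zone="UTC"),
))
adf_client.triggers.create_or_update(rg, df, "Every15MinTrigger", trigger)

# Triggers are created stopped; begin_start is the method on recent track-2
# SDK versions (older versions expose .start instead)
adf_client.triggers.begin_start(rg, df, "Every15MinTrigger").result()
```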


Schedule vs Tumbling Window Trigger


Schedule vs Tumbling Window Trigger

Characteristic   Schedule trigger                  Tumbling window trigger
Type             Time-based                        Time-based
Execution        Fire-and-forget                   Tracks run status
State            No state                          Retains state
Dependency       Cannot depend on other triggers   Can depend on other tumbling window triggers


Use Cases
 Schedule triggers are typically used to run pipelines on a regular basis, such as once a day or once a week

 Tumbling window triggers are typically used to run pipelines on a periodic interval, while also retaining state

 This makes them ideal for scenarios such as:

1. Processing streaming data in real time

2. Processing batch data in batches of a fixed size

3. Processing data from multiple sources in a coordinated manner



Examples
 Schedule trigger: Running a pipeline to copy data from a database to a data lake once a day

 Tumbling window trigger: Running a pipeline to process streaming data from a Kafka topic in 1-hour batches (sketched below)

 Tumbling window trigger with dependency: Running a pipeline to process data from a database in 1-hour batches, with the pipeline run depending on a successful run of a previous pipeline that processes data from a different database

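A minimal sketch of the 1-hour tumbling window example via the Python SDK. The model and field names follow azure-mgmt-datafactory, but treat the details, including the windowStart/windowEnd parameter wiring, as assumptions to verify against the SDK reference:

```python
# Minimal sketch: stateful, contiguous 1-hour windows.
from datetime import datetime
from azure.identity import DefaultAzureCredential
from azure.mgmt.datafactory import DataFactoryManagementClient
from azure.mgmt.datafactory.models import (
    TriggerResource, TumblingWindowTrigger,
    TriggerPipelineReference, PipelineReference
)

adf_client = DataFactoryManagementClient(DefaultAzureCredential(), "<subscription-id>")
rg, df = "my-rg", "my-data-factory"

trigger = TriggerResource(properties=TumblingWindowTrigger(
    # unlike a schedule trigger, a tumbling window trigger targets ONE pipeline
    pipeline=TriggerPipelineReference(
        pipeline_reference=PipelineReference(type="PipelineReference",
                                             reference_name="HourlyBatchPipeline"),
        # the trigger exposes each window's boundaries to the pipeline run
        parameters={"windowStart": "@trigger().outputs.windowStartTime",
                    "windowEnd": "@trigger().outputs.windowEndTime"}),
    frequency="Hour", interval=1,
    start_time=datetime(2024, 1, 1),  # windows are backfilled from here
    max_concurrency=4,                # how many windows may run in parallel
))
adf_client.triggers.create_or_update(rg, df, "HourlyWindowTrigger", trigger)
```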


Custom Event Trigger


Custom Event Trigger
 A custom event trigger in Azure Data Factory (ADF) is a type of trigger that allows you to start a pipeline when a custom event is published to an Event Grid topic

 This can be useful for a variety of scenarios, such as:

1. Starting a pipeline when a new file is uploaded to a storage account

2. Starting a pipeline when a new row is inserted into a database table

3. Starting a pipeline when a message is received in a queue or service bus

4. Starting a pipeline when a custom event is published from another Azure service, such as Azure Logic Apps or Azure Functions



Custom Event Trigger
 To create a custom event trigger in ADF, you will need to:

1. Create an Event Grid topic

2. Create a pipeline in ADF

3. Add a custom event trigger to the pipeline

4. Configure the trigger to listen for the custom events that should start the pipeline

5. Publish the pipeline

 Once the pipeline is published, it will start whenever a custom event is published to the Event Grid topic that the trigger is listening for (a minimal sketch follows)

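A minimal sketch of these steps via the Python SDK, assuming the Event Grid topic and the pipeline already exist. The CustomEventsTrigger field names follow the azure-mgmt-datafactory models; the topic resource ID, event type, and all names are placeholders:

```python
# Minimal sketch: a custom event trigger bound to an Event Grid topic.
from azure.identity import DefaultAzureCredential
from azure.mgmt.datafactory import DataFactoryManagementClient
from azure.mgmt.datafactory.models import (
    TriggerResource, CustomEventsTrigger,
    TriggerPipelineReference, PipelineReference
)

adf_client = DataFactoryManagementClient(DefaultAzureCredential(), "<subscription-id>")
rg, df = "my-rg", "my-data-factory"

topic_id = ("/subscriptions/<sub-id>/resourceGroups/my-rg/providers/"
            "Microsoft.EventGrid/topics/my-topic")  # scope = the Event Grid topic

trigger = TriggerResource(properties=CustomEventsTrigger(
    scope=topic_id,
    events=["OrderFileReady"],      # event types that fire the trigger
    subject_begins_with="orders/",  # optional subject filter
    pipelines=[TriggerPipelineReference(
        pipeline_reference=PipelineReference(type="PipelineReference",
                                             reference_name="ProcessOrdersPipeline"),
        parameters={})],
))
adf_client.triggers.create_or_update(rg, df, "OrderEventTrigger", trigger)
```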


Event Grid topic
 An Event Grid topic is a central place where you can publish and consume events. It acts as a router and distributor of events to event handlers

 You can publish events to a topic from any source, and you can subscribe to events from any topic by creating an event subscription



Event Grid topic
 Event Grid topics are used to decouple applications and services, and to enable event-driven architectures

 They can be used to implement a variety of scenarios, such as:

1. Starting a pipeline in Azure Data Factory when a new file is uploaded to a storage account

2. Sending a notification to a user when a new email arrives in their inbox

3. Triggering a workflow in Azure Logic Apps when a new row is inserted into a database table

 Event Grid topics are highly scalable and reliable, and they can be used to distribute events to any number of event handlers

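For the publishing side, a minimal sketch using the azure-eventgrid SDK; the endpoint, key, event type, and payload are placeholders:

```python
# Minimal sketch: publish a custom event to an Event Grid topic.
from azure.core.credentials import AzureKeyCredential
from azure.eventgrid import EventGridPublisherClient, EventGridEvent

client = EventGridPublisherClient(
    "https://my-topic.westeurope-1.eventgrid.azure.net/api/events",
    AzureKeyCredential("<topic-access-key>"),
)

# Any source can raise events like this; subscribers (e.g. an ADF custom
# event trigger) filter on event type and subject
client.send(EventGridEvent(
    event_type="OrderFileReady",
    subject="orders/2024/01/orders.csv",
    data={"rows": 1250},
    data_version="1.0",
))
```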


Integration Runtime


Integration Runtime
 The Integration Runtime (IR)
 Compute infrastructure used by Azure Data Factory pipelines

 To do ELT, ETL & data integration

 Provides the following capabilities

 Data Flow

 Data movement

 Activity dispatch

 SSIS package execution



Integration Runtime
 In Data Factory pipelines,
 an activity defines the action to be performed.
 A linked service defines a target data store

 An integration runtime provides the bridge between activities and linked services

 The integration runtime is referenced by the linked service or activity

 It provides the compute environment where the activity either runs directly or is dispatched

 This allows the activity to be performed in the closest possible region to the target data store



Integration Runtime Types
 Data Factory offers three types of Integration Runtime (IR)

 You should choose the type that best serves your data integration capabilities and network environment requirements

1. Azure Integration Runtime

2. Self-hosted Integration Runtime

3. Azure-SSIS Integration Runtime



[Hands-on] Azure Integration Runtime


Why Azure IR
 Why create a separate Azure IR in ADF when a default one already exists?
 To use a different compute type
 To use a different region
 To use a different concurrency level
 To improve performance (see the sketch below)

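A minimal sketch of creating such an Azure IR (pinned region, larger data flow compute) via the Python SDK; the model and field names should be verified against the azure-mgmt-datafactory reference, and all names and sizes are placeholders:

```python
# Minimal sketch: an Azure IR pinned to a region with custom data flow compute.
from azure.identity import DefaultAzureCredential
from azure.mgmt.datafactory import DataFactoryManagementClient
from azure.mgmt.datafactory.models import (
    IntegrationRuntimeResource, ManagedIntegrationRuntime,
    IntegrationRuntimeComputeProperties, IntegrationRuntimeDataFlowProperties
)

adf_client = DataFactoryManagementClient(DefaultAzureCredential(), "<subscription-id>")
rg, df = "my-rg", "my-data-factory"

ir = IntegrationRuntimeResource(properties=ManagedIntegrationRuntime(
    compute_properties=IntegrationRuntimeComputeProperties(
        location="West Europe",  # run the activity close to the data store
        data_flow_properties=IntegrationRuntimeDataFlowProperties(
            compute_type="General", core_count=8, time_to_live=10),  # TTL in minutes
    ),
))
adf_client.integration_runtimes.create_or_update(rg, df, "WestEuropeAzureIR", ir)
```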


Pipeline parameters and variables


Pipeline Parameters
 Pipeline Parameters:

 Defined at the pipeline level

 Cannot be modified during a pipeline run

 Can be used to control the behavior of a pipeline and its activities,

 Such as by passing in the connection details for a dataset

 Or the path of a file to be processed

© ANKIT & VIJAY – AZURE CLOUD 43


Pipeline Variables
 Pipeline variables

 are values that can be set and modified during a pipeline run

 Unlike pipeline parameters, which are defined at the pipeline level & cannot be changed during a pipeline run

 pipeline variables can be set and modified within a pipeline using a Set Variable activity

 Pipeline variables can be used to store and manipulate data during a pipeline run,
 Such as by storing the results of a computation

 Or the current state of a process (see the sketch below)

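A minimal sketch contrasting the two via the Python SDK: the parameter is fixed when the run starts, while a Set Variable activity updates the variable mid-run. Names and the expression are illustrative:

```python
# Minimal sketch: one parameter (fixed per run) and one variable that a
# Set Variable activity changes during the run.
from azure.identity import DefaultAzureCredential
from azure.mgmt.datafactory import DataFactoryManagementClient
from azure.mgmt.datafactory.models import (
    PipelineResource, ParameterSpecification, VariableSpecification,
    SetVariableActivity
)

adf_client = DataFactoryManagementClient(DefaultAzureCredential(), "<subscription-id>")
rg, df = "my-rg", "my-data-factory"

set_status = SetVariableActivity(
    name="MarkProcessing",
    variable_name="status",
    # expressions may read (but never write) pipeline parameters
    value="@concat('processing: ', pipeline().parameters.inputPath)",
)

pipeline = PipelineResource(
    parameters={"inputPath": ParameterSpecification(type="String")},
    variables={"status": VariableSpecification(type="String", default_value="idle")},
    activities=[set_status],
)
adf_client.pipelines.create_or_update(rg, df, "ParamDemoPipeline", pipeline)

# Parameter values are supplied once, at run start, and cannot change mid-run
adf_client.pipelines.create_run(rg, df, "ParamDemoPipeline",
                                parameters={"inputPath": "gamelogs/input/logs.txt"})
```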


System Variables


System Variables
 Built-in variables in ADF

 Can be used within every Data Factory pipeline

 Can be used to capture commonly used pipeline-related information & pass it dynamically anywhere within the pipeline



Usage of System Variables
 Specifying dynamic file paths and folder names:

 This allows you to generate unique file names and paths for each pipeline run

 Setting conditional expressions:

 To check the status of a previous activity before running the next activity

 Passing data between activities:

 To pass data between activities in a pipeline.

 This allows you to reuse data from one activity in another activity

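A small illustration of system variables inside expressions: the expression builds a unique output path for each run. The dataset name and its folder parameter are hypothetical, introduced only for this sketch:

```python
# Small illustration: ADF system variables inside an expression that builds
# a unique per-run output path, wired into a hypothetical dataset parameter.
from azure.mgmt.datafactory.models import DatasetReference

folder_expr = ("@concat('output/', "
               "formatDateTime(pipeline().TriggerTime, 'yyyy/MM/dd'), "
               "'/', pipeline().RunId)")  # RunId is unique for every run

sink_ref = DatasetReference(
    type="DatasetReference",
    reference_name="OutputDataset",      # hypothetical parameterized dataset
    parameters={"folder": folder_expr},  # evaluated at run time
)

# Other commonly used system variables:
#   @pipeline().DataFactory - name of the data factory the run belongs to
#   @pipeline().Pipeline    - name of the pipeline
#   @pipeline().TriggerType - what started the run (e.g. Manual, ScheduleTrigger)
```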


System Variables
 Azure documentation: https://learn.microsoft.com/en-us/azure/data-factory/control-flow-system-variables



Connectors


Connectors
 Connectors

 Components that allow you to connect to & interact with external data sources

 ADF provides a wide range of built-in connectors,

 Connectors for on-premises and cloud data sources, SaaS applications, and other Azure services



Usage of Connectors
 You can use connectors to perform a variety of tasks, such as:

 Ingesting data:
 From a variety of sources, such as on-premises databases, cloud storage, and SaaS applications

 Loading data:
 To load data into a variety of destinations, such as Azure Data Lake Storage, Azure Synapse Analytics, and Azure SQL Database



Connectors
 Azure documentation: https://learn.microsoft.com/en-us/azure/data-factory/connector-overview



Control Flow Activities
 Set Variable Activity
 Append Variable Activity
 Get Metadata Activity
 Execute Pipeline Activity
 Fail Activity
 Wait Activity
 Filter Activity
 Until Activity
 Pipeline return variable
 ForEach Activity
 If Condition Activity
 Switch Activity
 Web & Webhook Activity
 Validation Activity
 Lookup Activity


Data Flow Transformations
 Filter Transformation
 Aggregate Transformation
 Join Transformation
 Fuzzy join
 Conditional split
 Exists transformation
 Union transformation
 Lookup transformation
 Sort transformation
 Creating a new branch
 Select transformation
 Pivot & unpivot
 Surrogate key transformation
 Window transformation
 Flatten transformation
 Assert transformation
 Cast transformation
 Parse transformation
 Rank transformation
 Stringify transformation
