Running Azure Databricks Notebook On Synapse Analytics
Requirements:
https://docs.microsoft.com/en-us/azure/databricks/scenarios/databricks-extract-load-sql-data-warehouse
The next step is to set up the Azure Databricks component from the Azure Synapse Analytics integration layer,
but first we need a linked service for Azure Databricks.
On the left, in the Manage layer, click Linked services and then New.
1- Give your linked service a name
2- Enable AutoResolveIntegrationRuntime
3- Select your Azure subscription
4- Databricks workspace URL (you can find it on the Databricks cluster Advanced Options tab)
5- Authentication type (you can use an access token from your cluster or from Azure Key Vault)
6- Select New job cluster (if you have an instance pool you can use it, or an existing interactive cluster)
7- Cluster version (the version of the cluster you created in your Azure Databricks workspace)
8- Cluster node type
9- Python version
10- Select a fixed number of workers or the autoscaling option
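The fields above map onto the JSON definition that Synapse stores for the linked service. The following is a minimal Python sketch that builds such a definition as a plain dictionary, so you can see which form field ends up where. The function name build_databricks_linked_service and all example values (workspace URL, token placeholder, runtime version, node type) are illustrative assumptions, not values from this walkthrough. The property names follow the AzureDatabricks linked service schema as I understand it, so check them against the JSON that Synapse shows for your own linked service.

```python
import json


def build_databricks_linked_service(name, workspace_url, access_token,
                                    cluster_version, node_type, workers):
    """Build an Azure Databricks linked service definition as a dict.

    `workers` is a string: "2" means a fixed size of two workers,
    "1:4" means autoscaling between one and four workers.
    """
    return {
        "name": name,                                   # step 1
        "properties": {
            "type": "AzureDatabricks",
            "connectVia": {                             # step 2
                "referenceName": "AutoResolveIntegrationRuntime",
                "type": "IntegrationRuntimeReference",
            },
            "typeProperties": {
                "domain": workspace_url,                # step 4
                "accessToken": {                        # step 5
                    "type": "SecureString",
                    "value": access_token,
                },
                "newClusterVersion": cluster_version,   # step 7
                "newClusterNodeType": node_type,        # step 8
                # step 9: assumed way of selecting Python 3 on a new job cluster
                "newClusterSparkEnvVars": {
                    "PYSPARK_PYTHON": "/databricks/python3/bin/python3"
                },
                "newClusterNumOfWorker": workers,       # step 10
            },
        },
    }


# Hypothetical example values; replace them with your own.
linked_service = build_databricks_linked_service(
    name="AzureDatabricks1",
    workspace_url="https://adb-0000000000000000.0.azuredatabricks.net",
    access_token="<your-access-token>",
    cluster_version="7.3.x-scala2.12",
    node_type="Standard_DS3_v2",
    workers="1:4",
)
print(json.dumps(linked_service, indent=2))
```

In practice, keep the token in Azure Key Vault rather than inline; the inline value here only keeps the sketch short.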
If you have additional information, such as parameters to pass to your notebook, you can set it up in the
parameters option.
Finally, the new linked service appears in the Linked services list.
From there we can delete it or configure roles, such as a service principal account.
Now we can go ahead and create a new pipeline from the integration layer.
On the left, in the Integrate layer, you can access the integration components.
As you can see, I already have two pipelines (similar to Microsoft SSIS packages).
From the Activities tab, under Databricks, select the Notebook component and drag and drop it onto the
canvas.
I renamed it Databricks.
Next, after renaming your component, go to the Azure Databricks tab. There, select the linked service
you previously set up in the Azure Synapse Manage layer.
Enable interactive authoring.
Next, on the Settings tab, click the Browse button to find and select your notebook, then click OK.
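The canvas steps above (drop a Notebook activity, pick the linked service, browse to the notebook) correspond to a small JSON activity definition inside the pipeline. The following Python sketch builds that definition as a dictionary. The function name build_notebook_activity, the notebook path, the linked service name and the parameter names are hypothetical examples. The property names (DatabricksNotebook, notebookPath, baseParameters) follow the activity schema as I understand it, so compare them with the code view of your own pipeline.

```python
import json


def build_notebook_activity(activity_name, linked_service_name,
                            notebook_path, parameters=None):
    """Build a Databricks Notebook activity definition as a dict."""
    return {
        "name": activity_name,
        "type": "DatabricksNotebook",
        # Points at the linked service created in the Manage layer.
        "linkedServiceName": {
            "referenceName": linked_service_name,
            "type": "LinkedServiceReference",
        },
        "typeProperties": {
            # Path selected with the Browse button on the Settings tab.
            "notebookPath": notebook_path,
            # Optional key/value pairs handed to the notebook at run time.
            "baseParameters": dict(parameters or {}),
        },
    }


# Hypothetical example values; replace them with your own.
activity = build_notebook_activity(
    activity_name="Databricks",
    linked_service_name="AzureDatabricks1",
    notebook_path="/Shared/my_notebook",
    parameters={"load_date": "2021-01-31"},
)
print(json.dumps(activity, indent=2))
```

Inside the notebook, a parameter passed this way is typically read with dbutils.widgets.get, for example dbutils.widgets.get("load_date") for the hypothetical parameter above.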
At this point you are good to go.
You can monitor the details of your execution from your Databricks workspace by using the link below.
Databricks task finished successfully.