De Mod 1 Get Started With Databricks Data Science and Engineering Workspace
De Mod 1 Get Started With Databricks Data Science and Engineering Workspace
Databricks Data
Science & Engineering
Workspace
Module 01
Web App
Unity Catalog
Workflow Manager
Access Control
Notebooks, Repos,
DBSQL
Worker
Distributes workloads
Notebook
across workers VM instance
Python, SQL,
Single user Always Yes
Scala, R
Python (DBR
Shared Always (Premium plan required) Yes
11.1+), SQL
No isolation Can be hidden by enforcing user isolation in the admin Python, SQL,
No
shared console or configuring account-level settings Scala, R
Attach notebook ✓ ✓ ✓
View Spark UI, cluster
metrics, driver logs ✓ ✓ ✓
Start, restart,
terminate ✓ ✓
Edit ✓
Attach library ✓
Resize ✓
Change permissions ✓
Multi-language Reproducible
Use Python, SQL, Scala, and R, all in one Automatically track version history, and
Notebook use git version control with Repos
Collaborative
Real-time co-presence, co-editing, and Get to production faster
commenting Quickly schedule notebooks as jobs or
create dashboards from their results, all
in the Notebook
Ideal for exploration
Explore, visualize, and summarize data
with built-in charts and data profiles
Enterprise-ready
Enterprise-grade access controls,
Adaptable identity management, and auditability
Install standard libraries and use local
modules
CI CD
Repos Service
Set up Git
Run Databricks job
automation to
based on Repo in
update Repos on Create and edit code Production folder
merge Git automation calls
Databricks Repos API
Steps in Databricks
Commit and push to
feature branch Steps in your Git provider