0% found this document useful (0 votes)
246 views110 pages

Databricks Certified Data Analyst Associate

The document provides an overview of Databricks SQL, a tool for data analysis that simplifies querying and integrates with BI tools. It explains the different types of SQL Warehouses (Classic, Pro, and Serverless) and their functionalities, as well as features of the SQL Editor, including data preview and query filters. Additionally, it discusses the use of query parameters to limit data returned in queries.

Uploaded by

rafabertuol
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
246 views110 pages

Databricks Certified Data Analyst Associate

The document provides an overview of Databricks SQL, a tool for data analysis that simplifies querying and integrates with BI tools. It explains the different types of SQL Warehouses (Classic, Pro, and Serverless) and their functionalities, as well as features of the SQL Editor, including data preview and query filters. Additionally, it discusses the use of query parameters to limit data returned in queries.

Uploaded by

rafabertuol
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 110

Practice Test 1 | Databricks

Certified Data Analyst


Associate
Databricks SQL
● Databricks SQL is a powerful tool for querying and analysing data in Databricks Lakehouse.
● It empowers data analysts and engineers that are familiar with SQL, to extract insights without the
need for complex code, thereby streamlining and speeding up the data analysis process.
● It is useful for collaborating between teams:
○ can share queries,
○ develop basic dashboards, and
○ seamlessly integrates with BI tools such as Power BI, dbt, and Tableau.
SQL Warehouses
● SQL Warehouses (formerly
SQL endpoints) is the
compute resources utilised
by Databricks SQL.

● Essentially this is the


connection that is used to
run your SQL queries, where
you configure the cluster
size/family, scaling and
termination options.
Warehouse Types
● Classic,
● Pro, and
● Serverless.

● The Classic warehouse type offers


limited Databricks SQL functionality
and basic performance features.

● The Pro warehouse type supports


all Databricks SQL functionality and
delivers higher performance
features in comparison to Classic,
including query federation, workflow
integration, and data science and
ML functions.
The Serverless warehouse type delivers
advanced performance features and
supports all of the features available in the
Warehouse Types
● The Serverless warehouse type
delivers advanced performance
features and supports all of the
features available in the Pro type,
along with instant and fully managed
compute.

● Once a warehouse is established,


you can easily access and connect
to the data using BI tools, such as
Power BI, dbt, and Tableau by
downloading the relevant connection
files.

● This means you don’t have to


manually configure your BI tools to
connect to the SQL Warehouse,
though you still can if you prefer.
SQL Editor
● SQL Editor allows you to execute SQL queries and
interact with the databases in your Databricks workspace.

● When a database is selected, you can easily view all the


objects it contains, including tables and fields, along with
their data types.

● One of the most convenient features of the SQL Editor is


the ability to preview the data in your tables.

● By hovering over a table, you can retrieve up to 50


records using the arrow icon, which provides a quick and
efficient way to preview the data.

● In addition to the preview feature, you can quickly insert


table or field names into your query script using the two
arrows icon when hovering over the tables and fields.
Query Filters
● Filters allow you to reduce the Query Filter
Filters are added once the query has been executed
dataset after the query has been using the dropdown option (Plus icon). The popup
executed options allow you to specify the column, type, and
sort order of the filter.

● Once the filter is defined, a widget will appear above


the results panel, where the filter value can be
selected.
Query Parameters
● Parameters enable you to limit the data returned by supplying an input value
into a query before it is executed.

The parameter settings can be altered


using the “Add Parameter” option (gear
icon), this allows for the parameter
Keyword, Title, and Type to be defined.

● Keyword: this corresponds to the


parameter in the query.
● Title: will appear over the widget.
● Type: supports Text (default),
Number, Dropdown List, Query
Based Dropdown List, Date, Date
and Time, Date and Time (with
seconds), and Date Range.

You might also like