Cabanasj486 Snowflake Snowpro Core

The Snowflake SnowPro Core Cheat Sheet provides essential information for the SnowPro Core certification exam, including exam format, passing score, and recommended resources. It covers Snowflake's architecture, use cases, data loading methods, access management, and various features such as time travel and zero-copy cloning. Additionally, it outlines pricing, cloud provider options, and the types of objects and tables within Snowflake.

Uploaded by

saidinarimen98
Copyright: © All Rights Reserved

Snowflake SnowPro Core Cheat Sheet

by cabanasj486 via cheatography.com/192752/cs/40080/

Exam Information

Average salary: $90.501
Duration: 120 minutes
Exam Guide: https://learn.snowflake.com/en/certifications/snowpro-core/
Format: Multiple Select, Multiple Choice, True/False questions
Language: English & Japanese
Number of Questions: 100
Passing Score: 80% or more
Price: 175 USD
Recommended course: https://plazagonzalo.medium.com
Validity: 2 years
Version: COF-C02

What is Snowflake?

Snowflake is a Data Solution provided as Software-as-a-Service (SaaS). It's not available on-premise. It combines a new SQL query engine with an innovative architecture natively designed for the cloud.

Use cases

Snowflake is optimal as Data Warehouse, Data Lake, Data Exchange, Data Apps, Data Science, and Data Engineering.

[Figure: Snowflake Use Cases]

Architecture

Shared-disk architectures: Central data repository for persisted data accessible from all compute nodes in the platform.
Shared-nothing architectures: Each node in the virtual warehouse cluster stores a portion of the entire data set locally.

Snowflake is a hybrid of traditional shared-disk and shared-nothing database architectures.

Architecture Layers

Centralized Storage Layer: Snowflake reorganizes loaded data into its internal optimized, compressed, columnar format.
Compute Layer: Query execution is performed using virtual warehouses in the compute/processing layer.
Cloud Services Layer: Collection of services that coordinate activities across Snowflake, including Authentication, Access Control, etc.
Cloud Agnostic Layer: It is only used the first time, when we choose a provider.

Costs

Storage Costs: Average amount of storage used per month, after compression.
Compute Costs: Billed by the second, with a one-minute minimum.
Cloud Services: Up to 10% of daily compute credits is included for free.
Data Transfer: Charged when users move or copy their data between regions or cloud providers.

The warehouse size, the number of clusters, and the time that each server in each cluster runs determine the number of credits that a data warehouse consumes. The Snowflake edition determines the price of each credit.
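The per-second billing rule with its one-minute minimum can be checked with a little arithmetic. The query below is only a sketch: the run times are made up, and the rate of 4 credits per hour (a Medium warehouse) is an assumption chosen for the example.

```sql
-- Hypothetical run times in seconds; 4 credits per hour is an assumed rate.
SELECT
    run_seconds,
    GREATEST(run_seconds, 60)            AS billed_seconds,  -- one-minute minimum
    GREATEST(run_seconds, 60) / 3600 * 4 AS credits_used
FROM (VALUES (20), (60), (90), (3600)) AS runs (run_seconds);
```

A 20-second run is billed as 60 seconds, while a 90-second run is billed for exactly 90 seconds.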

By cabanasj486 Not published yet. Sponsored by CrosswordCheats.com


Last updated 6th September, 2023. Learn to solve cryptic crosswords!
Page 1 of 6. http://crosswordcheats.com

cheatography.com/cabanasj486/

Capacity Options

On-Demand: Fixed rate for the consumed services.
Pre-paid: Cheaper, but requires a commitment to Snowflake.

Micro-partitions

All data in Snowflake tables is automatically divided into micro-partitions, contiguous units of storage holding between 50 and 500 MB of uncompressed data, organized in a columnar way. They are immutable, meaning they cannot be changed once created.

Pruning process

Technique to scan the smallest possible number of micro-partitions to solve a query. Snowflake retrieves all the necessary data without looking at every micro-partition, which saves a lot of time when returning the result.

Load Data

Bulk Load: Loading batches of data from files already available at any stage into Snowflake tables.
Continuous Load: Load small volumes of data (micro-batches) and incrementally make them available for analysis.

Default Roles

ACCOUNTADMIN: Top-level role.
SECURITYADMIN: Manage users and roles.
SYSADMIN: Create warehouses and databases (and other objects).
USERADMIN: User and role management.
PUBLIC: Automatically granted to every user and role.
CUSTOM: Create your own roles and assign the privileges that you want.

Continuous Load

Snowpipe: Loading data when the files are available in any (internal/external) stage. Load metadata is kept for 14 days.
Snowflake Connector for Kafka: Reads data from Apache Kafka topics and loads the data into a Snowflake table.
Third-Party Data Integration Tools: See the list of supported tools in the Snowflake documentation.

The most important part of this section is Snowpipe. Use it to continuously load small volumes of frequently arriving data (micro-batches). It's serverless, which means that it doesn't use Virtual Warehouses. It can detect new files by automating Snowpipe using cloud messaging, or by calling the Snowpipe REST endpoints.

Bulk Load

COPY INTO: Load data from any stage to an existing table. Load metadata is kept for 64 days.

Some important considerations:
1) You cannot load/unload files directly from/to your local drive; they must go through a stage.
2) Using the Snowflake UI, you can only load files of up to 50 MB.
3) Organizing input data by granular path can improve load performance.
4) FORCE = TRUE copies the files again and ignores the 64 days of load metadata.
5) PURGE = TRUE removes the data files from the stage after loading.
6) If there is any error, you can specify different ON_ERROR options: ABORT_STATEMENT, CONTINUE, SKIP_FILE, SKIP_FILE_num, SKIP_FILE_num%.

Cache Strategies

Metadata Cache: Object information and statistics.
Warehouse Cache: SSD storage attached to a Warehouse. Its contents are lost when the Warehouse is suspended.
Query Result Cache: It stores the results of our queries for 24 hours. If we perform the same query and the data hasn't changed, it will return the same result without using the Warehouse.
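A quick way to see the Query Result Cache at work is to run the same statement twice. This is a minimal sketch that assumes a hypothetical table named orders; USE_CACHED_RESULT is the session parameter that controls result reuse.

```sql
-- orders is a hypothetical table.
SELECT COUNT(*) FROM orders;   -- first run: executed on the warehouse
SELECT COUNT(*) FROM orders;   -- same text, unchanged data: can be answered from the result cache

-- Turn result reuse off for the current session, for example when benchmarking a warehouse.
ALTER SESSION SET USE_CACHED_RESULT = FALSE;
```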

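The Bulk Load considerations (granular paths, ON_ERROR, PURGE, FORCE) might look as follows in practice. This is a sketch only: my_table, my_stage, and the path are hypothetical names, and the file format options are assumptions about the input files.

```sql
-- my_table, my_stage and the path are hypothetical.
COPY INTO my_table
  FROM @my_stage/sales/2023/09/                 -- granular path limits the files considered
  FILE_FORMAT = (TYPE = 'CSV' SKIP_HEADER = 1)  -- assumed layout of the input files
  ON_ERROR = SKIP_FILE_3                        -- skip a file once it has 3 or more bad rows
  PURGE = TRUE;                                 -- remove staged files after a successful load

-- Load the same files again, ignoring the 64 days of load metadata.
COPY INTO my_table
  FROM @my_stage/sales/2023/09/
  FILE_FORMAT = (TYPE = 'CSV' SKIP_HEADER = 1)
  FORCE = TRUE;
```

Note that FORCE = TRUE can duplicate rows, because files that were already loaded are loaded again.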
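For Snowpipe, a pipe is essentially a COPY INTO statement that Snowflake runs for you when new files arrive. The sketch below assumes an external stage named my_ext_stage with cloud event notifications already configured; all object names are hypothetical.

```sql
-- my_pipe, my_table and my_ext_stage are hypothetical names.
CREATE PIPE my_pipe
  AUTO_INGEST = TRUE          -- load when the cloud provider reports a new file
AS
  COPY INTO my_table
  FROM @my_ext_stage
  FILE_FORMAT = (TYPE = 'CSV' SKIP_HEADER = 1);

-- Inspect the current state of the pipe.
SELECT SYSTEM$PIPE_STATUS('my_pipe');
```

With AUTO_INGEST = FALSE, the files would instead be submitted by calling the Snowpipe REST endpoints.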

Access Management Approaches

Discretionary Access Control (DAC): Each object has an owner who can, in turn, grant access to that object.
Role-Based Access Control (RBAC): Access privileges are assigned to roles, which are, in turn, given to users.

Access Management in Snowflake

User: Person or program.
Role: Entity to which we grant privileges.
Securable Object: Entity to which we can grant access.
Privilege: Defined level of access to an object.

Other Concepts

Partner Connect: Technology & Solution partners.
Compliance: HITRUST / HIPAA, ISO/IEC 27001, FedRAMP Moderate, PCI-DSS, etc.
Data Marketplace: Where providers offer their datasets and consumers obtain them. Free, Personalized, and Paid Listings.
Column Level Security: Dynamic Data Masking & External Tokenization.

Data Sharing

Share: Snowflake objects that encapsulate all information required to share a database.
Types of Shares: Outbound & Inbound.
Producer: Snowflake account that creates shares and makes them available to other Snowflake accounts to consume.
Consumer: Accounts that receive the share/data.
Types of Consumers: Full account (existing Snowflake account), and Reader Account (share data with someone without a Snowflake account).

Shared data is instantaneous for consumers as no actual data is copied or transferred between accounts. For this reason, shared data is always up-to-date, and consumers don't pay for storage.

Streams

Definition: Snowflake objects that record data manipulation language (DML) changes made to tables and views, including INSERTS, UPDATES, and DELETES, as well as metadata about each change.
Storage: They don't contain table data; they only store offsets.
Types: Standard, Append Only, and Insert Only.
Columns: METADATA$ACTION, METADATA$ISUPDATE, METADATA$ROW_ID.

An important function is SYSTEM$STREAM_HAS_DATA, which indicates whether a stream contains change data capture (CDC) records.

File Formats

Structured Data: CSV. Fastest way to load data.
Semi-structured Data: JSON, Parquet, XML, Avro, ORC.
FLATTEN: Convert semi-structured data to a relational representation.
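As an illustration of FLATTEN, the query below turns a JSON array into one row per element. It is a sketch under assumptions: raw_events is a hypothetical table with a VARIANT column named payload, and the JSON shape in the comment is invented for the example.

```sql
-- Assumes payload looks like:
-- {"customer": "ana", "items": [{"sku": "A1"}, {"sku": "B2"}]}
SELECT
    e.payload:customer::STRING AS customer,
    i.value:sku::STRING        AS sku        -- one output row per array element
FROM raw_events AS e,
     LATERAL FLATTEN(INPUT => e.payload:items) AS i;
```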

Sequences

Use case: Generate unique numbers across sessions and statements.
nextval: Function to generate a set of distinct values.
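A short sketch of a sequence in use; order_id_seq and orders_demo are hypothetical names.

```sql
CREATE SEQUENCE order_id_seq START = 1 INCREMENT = 1;

-- Use the sequence as a column default.
CREATE TABLE orders_demo (
    id   INTEGER DEFAULT order_id_seq.NEXTVAL,
    item STRING
);
INSERT INTO orders_demo (item) VALUES ('book'), ('pen');

-- Or ask for the next value directly.
SELECT order_id_seq.NEXTVAL;
```

The generated values are unique, but Snowflake does not guarantee that they are free of gaps.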

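The Streams entries can be tried out with a few statements. This is a minimal sketch that assumes a hypothetical table named orders already exists.

```sql
-- orders and orders_stream are hypothetical names.
CREATE STREAM orders_stream ON TABLE orders;

-- Returns TRUE when the stream holds change records that have not been consumed yet.
SELECT SYSTEM$STREAM_HAS_DATA('orders_stream');

-- Change rows expose the table columns plus the metadata columns.
SELECT METADATA$ACTION, METADATA$ISUPDATE, METADATA$ROW_ID
FROM orders_stream;
```

Querying a stream with SELECT does not move its offset; the offset advances when the stream is consumed by a DML statement in a committed transaction.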
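The RBAC idea from Access Management (privileges go to roles, roles go to users) could be sketched like this. All names are hypothetical, and the sketch assumes the current role is allowed to create roles and grant these privileges.

```sql
-- analyst, wh_demo, sales_db and jane are hypothetical names.
CREATE ROLE analyst;

-- Privileges are granted to the role, never directly to the user.
GRANT USAGE ON WAREHOUSE wh_demo TO ROLE analyst;
GRANT USAGE ON DATABASE sales_db TO ROLE analyst;
GRANT USAGE ON SCHEMA sales_db.public TO ROLE analyst;
GRANT SELECT ON ALL TABLES IN SCHEMA sales_db.public TO ROLE analyst;

-- The role is then given to a user, and attached to the role hierarchy.
GRANT ROLE analyst TO USER jane;
GRANT ROLE analyst TO ROLE SYSADMIN;
```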

Editions

Standard
Enterprise
Business Critical
Virtual Private Snowflake

[Figure: Snowflake Editions]

Cloud Providers

Amazon Web Services
Azure
Google Cloud Platform

Connect to Snowflake

Web Interface
SnowSQL (CLI Client)
ODBC
JDBC
SDK for Node, Python, Kafka, Go, and more!

Snowflake Objects

Account: Must be unique.
Warehouse: Virtual machine to execute queries. Compute part.
Database: Logical collection of Schemas.
Schema: Logical collection of Objects. The PUBLIC schema and the INFORMATION_SCHEMA are created when creating a Database.

[Figure: Snowflake Objects]

Main Objects contained within a Schema

Tables: Contain all the data in the DB.
Views: Virtual table defined by a query.
Stages: Location of data files in the Cloud Storage.
File Formats: Describes a set of staged data to access or load into Snowflake tables.
Sequences: Like counters to create unique numbers.
Pipes: Object that enables automatic loading of data from files as soon as they are available in a Stage.
Stored Procedures & User-Defined Functions (UDF): Extend the system to perform operations in different programming languages.

Types of tables

Permanent
Transient
Temporary
External

Types of views

Regular
Materialized
Secure
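The table and view types map directly onto keywords in the CREATE statements. The following is a sketch with hypothetical object names; external tables are left out because they also need an external stage.

```sql
-- All object names are hypothetical.
CREATE TABLE sales (id INTEGER, amount NUMBER(10,2));                    -- permanent (default)
CREATE TRANSIENT TABLE sales_staging (id INTEGER, amount NUMBER(10,2));  -- no Fail-Safe period
CREATE TEMPORARY TABLE sales_scratch (id INTEGER, amount NUMBER(10,2));  -- dropped when the session ends

CREATE VIEW sales_v AS
    SELECT id, amount FROM sales;                                        -- regular
CREATE SECURE VIEW sales_secure_v AS
    SELECT id, amount FROM sales;                                        -- definition hidden from non-owners
CREATE MATERIALIZED VIEW sales_mv AS
    SELECT id, SUM(amount) AS total FROM sales GROUP BY id;              -- results stored and maintained
```

Materialized views require Enterprise edition or higher.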


Stages

Named External Stage
Named Internal Stage
User Internal Stage (@~)
Table Internal Stage (@%)

Storage Integrations enable users to avoid supplying credentials when creating stages or when loading or unloading data. A storage integration is an object that stores a generated identity and access management (IAM) entity for your external cloud storage.

Stage Metadata

METADATA$FILENAME: Name of the staged data file the current row belongs to.
METADATA$FILE_ROW_NUMBER: Row number for each record in the staged data file that contains it.

Extend Snowflake Functionality

Stored Procedures: Extend Snowflake SQL by combining it with JavaScript.
User-Defined Functions (UDFs): Perform operations that are not available through Snowflake's built-in, system-defined functions. They can be written in SQL, JavaScript, Java, and Python. A UDF returns a single value for each input row.
User-Defined Table Functions (UDTFs): They can return multiple rows for each input row (the only difference with UDFs).
External Functions: They call code that is executed outside Snowflake.

Fail-Safe

Use Cases: It ensures historical data is protected in the event of a system failure or other catastrophic event.
Retention Period: NON-CONFIGURABLE 7-day period.
Recovery: You cannot recover this data yourself; you MUST ask Snowflake support.

Note: Fail-Safe requires additional storage, which will be reflected in your monthly storage charges.

Time Travel

Use Cases: Access historical data at any point within a defined period. Useful to restore tables.
Objects that we can restore: Databases, Schemas, and tables.
Retention Period: 1 day by default, with a maximum of 90 days (Enterprise edition).
Ways to restore: By offset, query statement ID, or timestamp.
Example: UNDROP TABLE mytable;

Note: Time Travel requires additional storage, which will be reflected in your monthly storage charges.
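The three ways of addressing historical data with Time Travel (offset, timestamp, query statement ID) could look like this. The table name orders is hypothetical, and the timestamp and query ID are placeholders to be replaced with real values.

```sql
-- orders is a hypothetical table; timestamp and query ID are placeholders.
SELECT * FROM orders AT (OFFSET => -60 * 5);                                   -- state 5 minutes ago
SELECT * FROM orders AT (TIMESTAMP => '2023-09-06 08:00:00'::TIMESTAMP_LTZ);   -- state at a point in time
SELECT * FROM orders BEFORE (STATEMENT => '<query_id>');                       -- state before a given statement

-- Materialize a past state as a new table.
CREATE TABLE orders_restored CLONE orders AT (OFFSET => -60 * 5);

-- Extend the retention period (more than 1 day needs Enterprise edition or higher).
ALTER TABLE orders SET DATA_RETENTION_TIME_IN_DAYS = 30;
```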

Zero-Copy Cloning

Use Cases: Create a snapshot of any table, schema, or Database.
Cost: FREE, it doesn't consume storage. It does NOT duplicate data; it duplicates the metadata of the micro-partitions.
Other considerations: Privileges are not cloned. Data History is not cloned.

Note: When you modify some cloned data, it will consume storage because Snowflake has to recreate the micro-partitions, which will cost money.

Tasks

Definition: Schedulable scripts that are run inside your Snowflake environment.
When they run: Tasks run on a schedule.
Execution: They execute a single SQL statement, including a call to a Stored Procedure.
Duration: Maximum duration of 60 minutes by default.
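A scheduled task, as described under Tasks, might be defined as in the sketch below. The names wh_demo, event_log, and purge_old_rows_task are hypothetical, and the 30-minute timeout is an arbitrary value chosen to show how the 60-minute default can be changed.

```sql
-- wh_demo and event_log are hypothetical objects.
CREATE TASK purge_old_rows_task
  WAREHOUSE = wh_demo
  SCHEDULE = 'USING CRON 0 2 * * * UTC'   -- every day at 02:00 UTC
  USER_TASK_TIMEOUT_MS = 1800000          -- 30 minutes instead of the 60-minute default
AS
  DELETE FROM event_log
  WHERE created_at < DATEADD(day, -30, CURRENT_TIMESTAMP());

-- Tasks are created in a suspended state and must be resumed before they run.
ALTER TASK purge_old_rows_task RESUME;
```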


Tasks (cont)

Tree of tasks: Each task can have a maximum of 100 child tasks. A tree of tasks can have a maximum of 1000 tasks, including the root one.
Task History: Query the history of task usage within a specified date range.
Serverless Tasks: Compute resources are automatically scaled up or down by Snowflake as required for each workload.

Note: Snowflake ensures only one instance of a task with a schedule is executed at a given time. If a task is still running when the next scheduled execution time occurs, that scheduled time is skipped.

Transactions

ACID. Sequence of SQL statements that are committed or rolled back as a unit. Things we need to know for the exam:
1) Snowflake takes 4 hours to abort an idle transaction if we do not abort it with SYSTEM$ABORT_TRANSACTION.
2) Each transaction has independent scope.
3) Snowflake does not support Nested Transactions.

Data Warehouses Properties

Size: Impacts the amount of time required to execute queries.
Multi-Cluster Warehouses: Scale compute resources to manage query concurrency.
Multi-Cluster Warehouses Modes: Maximized & Auto-scale.
Scaling: Scale up/down to increase performance. Scale out/in to improve concurrency for users/queries.
Scaling policies: Standard & Economy.
Auto Suspend & Auto Resume: Enabled by default.

A Data Warehouse is a cluster of computing resources in Snowflake that provides CPU, memory, and temporary storage to perform queries and DML operations. While a warehouse is running, it consumes Snowflake credits. It utilizes per-second billing (with a 60-second minimum each time the warehouse starts).
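The warehouse properties above correspond to parameters of CREATE WAREHOUSE. The sketch below uses a hypothetical warehouse named wh_demo with arbitrary values; multi-cluster settings require Enterprise edition or higher.

```sql
-- wh_demo is a hypothetical warehouse; all values are illustrative.
CREATE WAREHOUSE wh_demo
  WAREHOUSE_SIZE = 'MEDIUM'
  MIN_CLUSTER_COUNT = 1
  MAX_CLUSTER_COUNT = 3          -- min < max: Auto-scale mode; min = max: Maximized mode
  SCALING_POLICY = 'ECONOMY'
  AUTO_SUSPEND = 60              -- seconds of inactivity before suspending
  AUTO_RESUME = TRUE
  INITIALLY_SUSPENDED = TRUE;

ALTER WAREHOUSE wh_demo SET WAREHOUSE_SIZE = 'LARGE';   -- scale up: faster queries
ALTER WAREHOUSE wh_demo SET MAX_CLUSTER_COUNT = 5;      -- scale out: more concurrency
```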

Other commands

PUT: UPLOAD files from a local directory/folder on a client machine into internal stages.
GET: DOWNLOAD files from a Snowflake internal stage into a directory/folder on a client machine.
Example: GET @my_int_stage file:///tmp/data/;

You cannot use either of these commands with the Snowflake Web UI.

Resource Monitors

Credit Quota: Snowflake credits allocated to the monitor for the specified frequency interval.
Monitor Level: Monitor the credit usage for your entire Account or individual Warehouses.
Schedule: When the monitor is going to start monitoring.
Actions: What to do when the threshold is reached. Notify (send notification), Notify & Suspend (suspend warehouse once running queries finish), or Notify & Suspend Immediately (kill running queries).

Resource monitors help you control costs and avoid unexpected credit usage caused by running data warehouses. You can impose limits on the number of credits that warehouses consume. They can only be created by AccountAdmins.
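A resource monitor with the three actions could be set up as in the following sketch. The monitor name, the quota of 100 credits, the thresholds, and the warehouse wh_demo are all assumptions made for the example.

```sql
-- monthly_limit and wh_demo are hypothetical names; quota and thresholds are illustrative.
USE ROLE ACCOUNTADMIN;

CREATE RESOURCE MONITOR monthly_limit
  WITH CREDIT_QUOTA = 100
       FREQUENCY = MONTHLY
       START_TIMESTAMP = IMMEDIATELY
  TRIGGERS ON 75 PERCENT DO NOTIFY
           ON 90 PERCENT DO SUSPEND
           ON 100 PERCENT DO SUSPEND_IMMEDIATE;

-- Warehouse-level monitoring: attach the monitor to one warehouse.
ALTER WAREHOUSE wh_demo SET RESOURCE_MONITOR = monthly_limit;
```

To monitor the whole account instead, the monitor would be assigned with ALTER ACCOUNT SET RESOURCE_MONITOR.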
