0% found this document useful (0 votes)

402 views16 pages

Snowpro™ Advanced: Data Engineer: Exam Study Guide

Uploaded by

Vivek Reddyvari

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

402 views16 pages

Snowpro™ Advanced: Data Engineer: Exam Study Guide

Uploaded by

Vivek Reddyvari

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 16

SNOWPRO™ ADVANCED: DATA ENGINEER

EXAM STUDY GUIDE

DEA-C01
Last Updated: September 27, 2022
SNOWPRO™ STUDY GUIDE OVERVIEW

This study guide highlights concepts that may be covered on Snowflake’s SnowPro™
Advanced: Data Engineer Certification exam.

This document introduces relevant information that may appear on the SnowPro Advanced:
Data Engineer Exam. This document should serve as an introduction to the knowledge and
skills required to guide your preparation. The material contained within this study guide is not
intended to guarantee a passing score on any Snowflake certification exam.

For an overview and more information on the SnowPro™ Core Certification exam or
SnowPro™ Advanced Certification series, please navigate here.

TABLE OF CONTENTS

SNOWPRO™ ADVANCED: DATA ENGINEER CERTIFICATION OVERVIEW 2

SNOWPRO™ ADVANCED: DATA ENGINEER SUBJECT AREA BREAKDOWN 2

SNOWPRO ADVANCED: DATA ENGINEER PREREQUISITE 3

RECOMMENDATIONS AND USING THE GUIDE 4

STEPS TO SUCCESS 4

SNOWPRO ADVANCED: DATA ENGINEER DOMAINS & OBJECTIVES 5

1.0 Domain: Data Movement 5

2.0 Domain: Performance Optimization 6
3.0 Domain: Storage & Data Protection 7
4.0 Domain: Security 7
5.0 Domain: Data Transformation 8

DATA ENGINEER SAMPLE QUESTIONS 10

Page 1
SNOWPRO™ ADVANCED: DATA ENGINEER CERTIFICATION
OVERVIEW

The SnowPro™ Advanced: Data Engineer tests advanced knowledge and skills used to apply
comprehensive data engineering principles using Snowflake.

This certification will test the ability to:

● Source data from Data Lakes, APIs, and on-premises

● Transform, replicate, and share data across cloud platforms
● Design end-to-end near real-time streams
● Design scalable compute solutions for DE workloads
● Evaluate performance metrics

Target Audience:

2 + years of data engineering experience, including practical experience using Snowflake for DE
tasks; Candidates should have a working knowledge of Restful APIs, SQL, semi-structured
datasets, and cloud native concepts. Programming experience is a plus.

SNOWPRO™ ADVANCED: DATA ENGINEER SUBJECT AREA

BREAKDOWN

This exam guide includes test domains, weightings, and objectives. It is not a comprehensive
listing of all the content that will be presented on this examination. The table below lists the main
content domains and their weighting ranges.

Domain Estimated Percentage Range of Exam Questions

1.0 Data Movement 25-30%

2.0 Performance Optimization 20-25%
3.0 Storage and Data Protection 10-15%
4.0 Security 10-15%
5.0 Data Transformation 25-30%

Page 2
SNOWPRO™ ADVANCED: DATA ENGINEER PREREQUISITE

Eligible individuals must hold an active SnowPro Core Certified credential. If you feel you need
more guidance on the fundamentals, please see the SnowPro Core Study guide.

Snowflake recommends that examinees have at least 2 years of hands-on practical Snowflake
implementation experience prior to attempting any of the SnowPro Advanced exams.

For the SnowPro Advanced: Data Engineer Certification exam, we recommend individuals
have at least 2 + years of hands-on Snowflake Practitioner experience in a Data
Engineering role prior to attempting this exam. The exam will assess skills through
scenario-based questions and real-world examples.

Page 3
RECOMMENDATIONS AND USING THE GUIDE

This guide will show the Snowflake topics and subtopics covered on the exam. Following the
topics will be additional resources consisting of videos, documentation, blogs, and/or exercises
to help you understand data engineering with Snowflake.

Estimated length of study guide: 10 – 13 hours

STEPS TO SUCCESS

1. Review Data Engineer Exam Guide

2. For training, attend Snowflake’s Instructor Led Data Engineering Course
3. Review and study applicable white papers and documentation
4. Get hands-on practical experience with relevant business requirements using Snowflake
5. Attend Snowflake Webinars
6. Attend Snowflake Virtual Hands-on Labs for more hands-on practical experience
7. Schedule your exam
8. Take your exam!

Additional Snowflake Asset to check out for Data Engineering:

Cloud Data Engineering for Dummies

Page 4
SNOWPRO ADVANCED: DATA ENGINEER DOMAINS & OBJECTIVES
1.0 Domain: Data Movement

1.1 Given a data set, load data into Snowflake.

● Outline considerations for data loading
● Define data loading features and potential impact

1.2 Ingest data of various formats through the mechanics of Snowflake.

● Required data formats
● Outline Stages

1.3 Troubleshoot data ingestion.

1.4 Design, build and troubleshoot continuous data pipelines.

● Design a data pipeline that forces uniqueness but is not unique.
● Stages
● Tasks
● Streams
● Snowpipe
● Auto ingest as compared to Rest API

1.5 Analyze and differentiate types of data pipelines.

● Understand Snowpark architecture (client vs server)
● Create and deploy UDFs and Stored Procedures using Snowpark
● Design and use the Snowflake SLQ API

1.6 Install, configure, and use connectors to connect to Snowflake.

1.7 Design and build data sharing solutions.

● Implement a data share
● Create a secure view
● Implement row level filtering

1.8 Outline when to use External Tables and define how they work.
● Partitioning external tables
● Materialized views
● Partitioned data unloading

Data Movement Study Resources:

Lab Guides
Accelerating Data Engineering with Snowflake & dbt (lab guide)
Auto-Ingest Twitter Data into Snowflake (lab guide)

Page 5
Automating Data Pipelines to Drive Marketing Analytics with Snowflake & Fivetran (lab
guide)

Reading Assets
Support for Calling External functions via Google Cloud API Gateway Now in Public
Preview (blog)
Snowflake and Spark, Part 2: Pushing Spark Query (Blog)
Fetching Query Results From Snowflake (Blog)
Moving from On-Premises ETL to Cloud-Driven ELT (White paper)

Snowflake Documentation
COPY INTO (Documentation)
Loading Data into Snowflake (Documentation)
DESCRIBE STAGE (Documentation)
Data Loading Tutorials(Documentation)
CREATE FILE FORMAT (Documentation)
Continuous Data Pipelines (Documentation)
VALIDATE_PIPE_LOAD (Documentation)
COPY_HISTORY (Documentation)
Databases, Tables & Views (Documentation)
CREATE STREAM (Documentation)
CREATE TASK (Documentation)
Connectors & Drivers (Documentation)
Sharing Data Securely in Snowflake (Documentation)
CREATE EXTERNAL TABLE (Documentation)

2.0 Domain: Performance Optimization

2.1 Troubleshoot underperforming queries.

● Identify underperforming queries
● Outline telemetry around the operation
● Increase efficiency
● Identify the root cause

2.2 Given a scenario, configure a solution for the best performance.

● Scale out as compared to scale in
● Clustering as compared to increasing warehouse size
● Query complexity
● Micro partitions and the impact of clustering
● Materialized views
● Search optimization

2.3 Outline and use caching features.

2.4 Monitor continuous data pipelines.

Page 6
● Snowpipe
● Stages
● Tasks
● Streams

Performance Optimization Study Resources:

Lab Guides
Resource Optimization: Performance (lab guide)
Resource Optimization: Usage Monitoring (lab guide)
Building a Data Application (lab guide)

Reading Assets
Performance Impact from Local and Remote Disk Spilling (Blog)
Snowflake: Visualizing Warehouse Performance (Blog)
Caching in Snowflake Data Warehouse (Blog)

Snowflake Documentation
Queries (Documentation)
System Functions (Documentation)
Account Usage (Documentation)
QUERY_HISTORY, QUERY_HISTORY_BY_* (Documentation)
Analyzing Queries Using Query Profile (Documentation)
Databases, Tables & Views (Documentation)
Virtual Warehouses (Documentation)
COPY_HISTORY (Documentation)
LOAD_HISTORY View (Documentation)
TASK_HISTORY (Documentation)
COPY_HISTORY View (Documentation)
SHOW STREAMS (Documentation)
PIPE_USAGE_HISTORY View (Documentation)

3.0 Domain: Storage & Data Protection

3.1 Implement data recovery features in Snowflake.

● Time Travel
● Fail-safe

3.2 Outline the impact of Streams on Time Travel.

3.3 Use System Functions to analyze Micro-partitions.

● Clustering depth
● Cluster keys

3.4 Use Time Travel and Cloning to create new development environments.

Page 7
● Backup databases
● Test changes before deployment
● Rollback

Storage & Data Protection Study Resources:

Lab Guides
Getting Started with Time Travel (lab guide)

Snowflake Documentation
Snowflake Time Travel & Fail-safe (Documentation)
Databases, Tables & Views (Documentation)
Parameter Hierarchy and Types (Documentation)
Database Replication and Failover/Failback (Documentation)
Continuous Data Pipelines (Documentation)
SYSTEM$CLUSTERING_INFORMATION (Documentation)
SYSTEM$CLUSTERING_DEPTH (Documentation)

4.0 Domain: Security

4.1 Outline Snowflake security principles.

● Authentication methods (Single Sign On (SSO), Key Authentication,
Username/Password, Multi-factor Authentication (MFA))
● Role Based Access Control (RBAC)
● Column Level Security and how data masking works with RBAC to secure
sensitive data

4.2 Outline the system defined roles and when they should be applied.
● The purpose of each of the System Defined Roles including best practices
usage in each case
● The primary differences between SECURITYADMIN and USERADMIN
roles
● The difference between the purpose and usage of the USERADMIN/
SECURITYADMIN roles and SYSADMIN

4.3 Manage Data Governance.

● Explain the options available to support column level security including
Dynamic Data Masking and External Tokenization
● Explain the options available to support row level security using Snowflake
Row Access Policies
● Use DDL required to manage Dynamic Data Masking and Row Access
Policies
● Use methods and best practices for creating and applying masking policies on
data
● Use methods and best practices for Object Tagging

Page 8
Security Study Resources:

Reading Assets
Snowflake RBAC Security Prefers Role Inheritance to Role Composition (Blog)

Snowflake Documentation
Managing Security in Snowflake (Documentation)
Managing Your User Preferences (Documentation)
Managing Governance in Snowflake (Documentation)
Stored Procedures (Documentation)
GRANT <privileges>…TO ROLE (Documentation)
CREATE MATERIALIZED VIEW (Documentation)

5.0 Domain: Data Transformation

5.1 Define User-Defined Functions (UDFs) and outline how to use them.
● Secure UDFs
● SQL UDFs
● JavaScript UDFs
● Returning table value as compared to scalar value

5.2 Define and create External Functions.

● Secure External Functions

5.3 Design, Build, and Leverage Stored Procedures.

● Transaction management

5.4 Handle and transform semi-structured data.

● Traverse and transform semi-structured data to structured data
● Transform structured to semi-structured data

5.5 Use Snowpark for data transformation.

● Query and filter data using the Snowpark library
● Perform data transformations using Snowpark (ie., aggregations)
● Join Snowpark dataframes

Data Transformation Study Resources:

Reading Assets
Snowflake For Data Engineering – Easily Ingest, Transform and Deliver Data for
Up-To-The Moment Insight (white paper)

Page 9
Bringing Extensibility to Data Pipelines: What’s New with Snowflake External Functions
(blog)
Generating a JSON Dataset Using Relational Data in Snowflake (blog)
Best Practices for Managing Unstructured Data (White paper)

Snowflake Documentation
UDFs (User-Defined Functions) (Documentation)
External Functions (Documentation)
CREATE EXTERNAL FUNCTION (Documentation)
CREATE API INTEGRATION (Documentation)
CREATE EXTERNAL FUNCTION (Documentation)
Transactions (Documentation)
Stored Procedures (Documentation)
TRY_PARSE_JSON (Documentation)
Queries (Documentation)
Semi-Structured Data (Documentation)
Databases, Tables & Views (Documentation)
Snowpark (Documentation)

Ready to register for an exam? Navigate here to get started.

Page 10
SNOWPRO™ ADVANCED: DATA ENGINEER SAMPLE QUESTIONS

1. Running the below clustering information analysis function:

SELECT SYSTEM$CLUSTERING_INFORMATION(‘table1 , ‘(col1, col2)’)

on TABLE1, that is not clustered, will return which of the following?

a. An error: this function works only on clustered tables.

b. Clustering information on all tables: this function clusters all tables by default.

c. Clustering information: the information will be presented as if the table was

clustered by col1,col2.

d. An error: this function does not accept lists of columns as a second parameter.

Page 11
2. A Data Engineer has inherited a database and is monitoring a table with the below query
every 30 days:

SELECT SYSTEM$CLUSTERING_INFORMATION( ‘orders’, ‘(o_orderdate)’);

The Engineer gets the first two results (e.g., Day 0 and Day 30).

-- DAY 0 -------
{
"cluster_by_keys" : "LINEAR(o_orderdate)",
"total_partition_count" : 3218,
"total_constant_partition_count" : 0,
"average_overlaps" : 20.4133,
"average_depth" : 11.4326,
"partition_depth_histogram" : {
"00000" : 0,
"00001" : 0,
"00002" : 0,
"00003" : 0,
"00004" : 0,
"00005" : 0,
"00006" : 0,
"00007" : 0,
"00008" : 0,
"00009" : 0,
"00010" : 993,
"00011" : 841,
"00012" : 748,
"00013" : 413,
"00014" : 121,
"00015" : 74,
"00016" : 16,
"00032" : 12
}
}

-- DAY 30 -------
{
"cluster_by_keys" : "LINEAR(o_orderdate)",
"total_partition_count" : 3240,
"total_constant_partition_count" : 0,
"average_overlaps" : 64.1185,
"average_depth" : 33.4704,
"partition_depth_histogram" : {
"00000" : 0,
"00001" : 0,
"00002" : 0,
"00003" : 0,

Page 12
"00004" : 0,
"00005" : 0,
"00006" : 0,
"00007" : 0,
"00008" : 0,
"00009" : 0,
"00010" : 0,
"00011" : 0,
"00012" : 0,
"00013" : 0,
"00014" : 0,
"00015" : 0,
"00016" : 0,
"00032" : 993,
"00064" : 2247
}
}

How should the Engineer interpret these results?

a. The table is well organized for queries that range over column o_orderdate.
Over time, this organization is degrading.
b. The table was initially well organized for queries that range over column
o_orderdate. Over time this organization has improved further.
c. The table was initially not organized for queries that range over column
o_orderdate. Over time, this organization has changed.
d. The table was initially poorly organized for queries that range over column
o_orderdate. Over time, this organization has improved.

3. A Data Engineer is preparing to load staged data from an external stage using a
task object.

Which of the following practices will provide the MOST efficient load performance?

a. Store the files on the external stage to ensure caching is maintained

b. PUT all files in a single directory
c. Limit file names to under 30 characters
d. Organize files into logical paths that reflect a scheduling pattern

Page 13
4. A Data Engineer is working on a project that requires data to be moved directly from an
internal stage to an external stage.

Which of the following is the QUICKEST way to accomplish this task?

a. COPY INTO @myExtStage from (SELECT $1, $2, ...

@myInternalStage);
b. Copy the data from the internal stage to a table and then unload the data to an
external stage
c. COPY INTO @myExtStage from @myInternalStage;
d. Write a custom script to move the data

5. The S1 schema contains two permanent tables that were created as shown below:

CREATE TABLE table_a (c1 INT)

DATA_RETENTION_TIME_IN_DAYS = 10;

CREATE TABLE table_b (c1 INT);

What will be the impact of running the following command?

ALTER SCHEMA S1 SET DATA_RETENTION_TIME_IN_DAYS = 20;

a. The retention time on table_a does not change; table_b is set to 20 days.
a. An error will be generated; a data retention time on a schema cannot be set.
b. The retention time on both tables will be set to 20 days.
c. The retention time will not change on either table.

Page 14
Keys: 1) B
2) A
3) D
4) A
5) A

The information provided in this study guide is provided for your purposes only and may not be
provided to third parties.

IN ADDITION, THIS STUDY GUIDE IS PROVIDED “AS IS”. NEITHER SNOWFLAKE NOR ITS
SUPPLIERS MAKES ANY OTHER WARRANTIES, EXPRESS OR IMPLIED, STATUTORY OR
OTHERWISE, INCLUDING BUT NOT LIMITED TO WARRANTIES OF MERCHANTABILITY, TITLE,
FITNESS FOR A PARTICULAR PURPOSE OR NONINFRINGEMENT.

Page 15

Installation Diagram Senyang Board
67% (3)
Installation Diagram Senyang Board
14 pages
Programming+in+Snowflake+ +All+Slides
100% (1)
Programming+in+Snowflake+ +All+Slides
342 pages
Snowflake Training
No ratings yet
Snowflake Training
685 pages
Snowpro™ Core: Exam Study Guide
100% (1)
Snowpro™ Core: Exam Study Guide
15 pages
SnowProCore Exam Study Guide 050423
No ratings yet
SnowProCore Exam Study Guide 050423
16 pages
Snowflake Training Slide SANMs
67% (6)
Snowflake Training Slide SANMs
218 pages
Snowflake For: Data Engineering
No ratings yet
Snowflake For: Data Engineering
15 pages
Ultimate SnowPro Core Certification Course Slides by Tom Bailey
No ratings yet
Ultimate SnowPro Core Certification Course Slides by Tom Bailey
333 pages
Snowflake Certification Practice Paper3 V1-Done
No ratings yet
Snowflake Certification Practice Paper3 V1-Done
22 pages
Snowflake Interview 2024 03
100% (1)
Snowflake Interview 2024 03
167 pages
Solutions Partner Technical Onboarding Guide
100% (1)
Solutions Partner Technical Onboarding Guide
27 pages
Snowflake+Interview+Questions+ +Part+II
100% (1)
Snowflake+Interview+Questions+ +Part+II
31 pages
Best Practices For Optimizing Your DBT and Snowflake Deployment
No ratings yet
Best Practices For Optimizing Your DBT and Snowflake Deployment
30 pages
Snowpro™ Core: Exam Study Guide
No ratings yet
Snowpro™ Core: Exam Study Guide
17 pages
Master Snowflake Interview Q A 1729835390
No ratings yet
Master Snowflake Interview Q A 1729835390
7 pages
Snowflake and Its Benefits
No ratings yet
Snowflake and Its Benefits
93 pages
Snowflake Certification Syllabus
No ratings yet
Snowflake Certification Syllabus
4 pages
Snowflake Training Presentation v1
No ratings yet
Snowflake Training Presentation v1
111 pages
Snowflake Scenario Based Interview Questions
100% (2)
Snowflake Scenario Based Interview Questions
20 pages
Snowflake - Certforall.snowpro Core - free.PDF.2023 Oct 02.by - Levi.182q.vce
No ratings yet
Snowflake - Certforall.snowpro Core - free.PDF.2023 Oct 02.by - Levi.182q.vce
26 pages
Snowflake Free Lab Guide
50% (4)
Snowflake Free Lab Guide
58 pages
Snowflake Faq
No ratings yet
Snowflake Faq
185 pages
Snowpro Core Certification Guide
No ratings yet
Snowpro Core Certification Guide
5 pages
Snowpro™ Core: Study Guide
100% (1)
Snowpro™ Core: Study Guide
17 pages
Snowflake 101 - For Data Architects - LinkedIn
No ratings yet
Snowflake 101 - For Data Architects - LinkedIn
17 pages
Silvus Overview 2024 03
No ratings yet
Silvus Overview 2024 03
86 pages
Snowflake SnowPro Advanced - Architect - Practice Exam - Medium
No ratings yet
Snowflake SnowPro Advanced - Architect - Practice Exam - Medium
9 pages
Interview Questions
No ratings yet
Interview Questions
16 pages
ILT-Fundamentals 4-Day - Datasheet
No ratings yet
ILT-Fundamentals 4-Day - Datasheet
4 pages
Commonly Asked Snowflake
No ratings yet
Commonly Asked Snowflake
26 pages
All Course Slides
100% (1)
All Course Slides
192 pages
3 Snowflake+Architecture
No ratings yet
3 Snowflake+Architecture
20 pages
Snowflake SnowPro Advanced - Architect - Practice Exam - Medium
No ratings yet
Snowflake SnowPro Advanced - Architect - Practice Exam - Medium
7 pages
Snowflake
No ratings yet
Snowflake
16 pages
Top 88 Data Modeling Interview Questions and Answers
No ratings yet
Top 88 Data Modeling Interview Questions and Answers
19 pages
Snowflake Data Sharing
100% (1)
Snowflake Data Sharing
35 pages
Snowflake Interview Questions: Click Here
No ratings yet
Snowflake Interview Questions: Click Here
29 pages
Cof C02
0% (1)
Cof C02
7 pages
Snowflake 20 s1 A PDF
No ratings yet
Snowflake 20 s1 A PDF
254 pages
Teradata To Snowflake Migration Guide
100% (2)
Teradata To Snowflake Migration Guide
15 pages
Credit Card Fraud Analysis Project Documentation
No ratings yet
Credit Card Fraud Analysis Project Documentation
101 pages
Snowflake Certification Practice Paper5 v1
No ratings yet
Snowflake Certification Practice Paper5 v1
19 pages
Snowflake Vs Data Bricks
No ratings yet
Snowflake Vs Data Bricks
10 pages
Redshift Vs Snowflake - An In-Depth Comparison PDF
100% (2)
Redshift Vs Snowflake - An In-Depth Comparison PDF
19 pages
7 Snowflake Reference Architectures For Application Builders
No ratings yet
7 Snowflake Reference Architectures For Application Builders
13 pages
Snowflake Questions 2
No ratings yet
Snowflake Questions 2
6 pages
What Is Snowflake
No ratings yet
What Is Snowflake
34 pages
Snowflake UNIT II
No ratings yet
Snowflake UNIT II
44 pages
Documentation: o o o o o
50% (2)
Documentation: o o o o o
5 pages
Snowflake Questions1
No ratings yet
Snowflake Questions1
4 pages
Introducing Snowflake Role Based Access Control
No ratings yet
Introducing Snowflake Role Based Access Control
11 pages
Snowflake's SnowPro Certification Preparation Guide
No ratings yet
Snowflake's SnowPro Certification Preparation Guide
6 pages
Documentation: Community Resources Blog English
No ratings yet
Documentation: Community Resources Blog English
11 pages
Snowflake
No ratings yet
Snowflake
3 pages
SnowFlake Schema
No ratings yet
SnowFlake Schema
8 pages
What Is The Snowflake Data Warehouse
No ratings yet
What Is The Snowflake Data Warehouse
7 pages
Snowflake:: Data Warehouse For Cloud
No ratings yet
Snowflake:: Data Warehouse For Cloud
2 pages
Key Concepts & Architecture: Data Platform As A Cloud Service
No ratings yet
Key Concepts & Architecture: Data Platform As A Cloud Service
4 pages
Edgelink RESTful API Specification - v2.0
No ratings yet
Edgelink RESTful API Specification - v2.0
99 pages
Handout - Sensors - Embedded Programming - ESP32 v1.0
No ratings yet
Handout - Sensors - Embedded Programming - ESP32 v1.0
11 pages
Python While Loop
No ratings yet
Python While Loop
5 pages
Lecture On Principles of Programming Languages
No ratings yet
Lecture On Principles of Programming Languages
34 pages
PCF Users Guide
No ratings yet
PCF Users Guide
104 pages
Melhores Praticas Ifix
No ratings yet
Melhores Praticas Ifix
113 pages
Arrays and Strings C++
No ratings yet
Arrays and Strings C++
27 pages
Unit 4 Deadlock
No ratings yet
Unit 4 Deadlock
46 pages
COAL Chapter No 7 (Complete)
No ratings yet
COAL Chapter No 7 (Complete)
56 pages
Citra Log - Txt.old
No ratings yet
Citra Log - Txt.old
33 pages
Exp3 For Varying Message Sizes Test Integrity of Message Using MD-5 SHA-1
No ratings yet
Exp3 For Varying Message Sizes Test Integrity of Message Using MD-5 SHA-1
4 pages
Week 2
No ratings yet
Week 2
23 pages
Integration Between Unity Connection and CUCM
No ratings yet
Integration Between Unity Connection and CUCM
10 pages
1 IJAEST Volume No 2 Issue No 1 Malware Analysis Using Assembly Level Program 000 012
No ratings yet
1 IJAEST Volume No 2 Issue No 1 Malware Analysis Using Assembly Level Program 000 012
12 pages
Barracuda Web Application Firewall DS US 1-2
No ratings yet
Barracuda Web Application Firewall DS US 1-2
6 pages
Hotel Management Report
No ratings yet
Hotel Management Report
5 pages
Prasada Reddy - Server Admin
No ratings yet
Prasada Reddy - Server Admin
5 pages
SOLIDWORKS 2020 WhatsNew
No ratings yet
SOLIDWORKS 2020 WhatsNew
33 pages
Command Line Juniper
No ratings yet
Command Line Juniper
11 pages
Real Time Vehicle Tracking System Using 9c0ccb27
No ratings yet
Real Time Vehicle Tracking System Using 9c0ccb27
4 pages
C++ Lecture 2
No ratings yet
C++ Lecture 2
26 pages
DNS Incident Response 1686637907
No ratings yet
DNS Incident Response 1686637907
9 pages
FortiMail Cloud User Portal Guide
No ratings yet
FortiMail Cloud User Portal Guide
6 pages
Corelation 22.1
No ratings yet
Corelation 22.1
9 pages
SQL Questions For Journal
No ratings yet
SQL Questions For Journal
9 pages
MangoRed - DV6 - Final WP7702
No ratings yet
MangoRed - DV6 - Final WP7702
10 pages
Final Term Seqs Dccn-w2021
No ratings yet
Final Term Seqs Dccn-w2021
2 pages
Learning Informatica PowerCenter 9.x
From Everand
Learning Informatica PowerCenter 9.x
Rahul Malewar
3/5 (4)
Sybex's Study Guide for Snowflake SnowPro Core Certification: COF-C02 Exam
From Everand
Sybex's Study Guide for Snowflake SnowPro Core Certification: COF-C02 Exam
Hamid Mahmood Qureshi
No ratings yet
Databricks Essentials: A Guide to Unified Data Analytics
From Everand
Databricks Essentials: A Guide to Unified Data Analytics
Robert Johnson
No ratings yet
Practice Questions for Snowflake Snowpro Core Certification Concept Based - Latest Edition 2023
From Everand
Practice Questions for Snowflake Snowpro Core Certification Concept Based - Latest Edition 2023
Exam OG
5/5 (1)

Snowpro™ Advanced: Data Engineer: Exam Study Guide

Uploaded by

Snowpro™ Advanced: Data Engineer: Exam Study Guide

Uploaded by

SNOWPRO™ ADVANCED: DATA ENGINEER

EXAM STUDY GUIDE

SNOWPRO™ ADVANCED: DATA ENGINEER CERTIFICATION OVERVIEW 2

SNOWPRO™ ADVANCED: DATA ENGINEER SUBJECT AREA BREAKDOWN 2

SNOWPRO ADVANCED: DATA ENGINEER PREREQUISITE 3

RECOMMENDATIONS AND USING THE GUIDE 4

SNOWPRO ADVANCED: DATA ENGINEER DOMAINS & OBJECTIVES 5

1.0 Domain: Data Movement 5

DATA ENGINEER SAMPLE QUESTIONS 10

This certification will test the ability to:

● Source data from Data Lakes, APIs, and on-premises

SNOWPRO™ ADVANCED: DATA ENGINEER SUBJECT AREA

Domain Estimated Percentage Range of Exam Questions

1.0 Data Movement 25-30%

Estimated length of study guide: 10 – 13 hours

1. Review Data Engineer Exam Guide

Additional Snowflake Asset to check out for Data Engineering:

Cloud Data Engineering for Dummies

1.1 Given a data set, load data into Snowflake.

1.2 Ingest data of various formats through the mechanics of Snowflake.

1.3 Troubleshoot data ingestion.

1.4 Design, build and troubleshoot continuous data pipelines.

1.5 Analyze and differentiate types of data pipelines.

1.6 Install, configure, and use connectors to connect to Snowflake.

1.7 Design and build data sharing solutions.

Data Movement Study Resources:

2.0 Domain: Performance Optimization

2.1 Troubleshoot underperforming queries.

2.2 Given a scenario, configure a solution for the best performance.

2.3 Outline and use caching features.

2.4 Monitor continuous data pipelines.

Performance Optimization Study Resources:

3.0 Domain: Storage & Data Protection

3.1 Implement data recovery features in Snowflake.

3.2 Outline the impact of Streams on Time Travel.

3.3 Use System Functions to analyze Micro-partitions.

Storage & Data Protection Study Resources:

4.0 Domain: Security

4.1 Outline Snowflake security principles.

4.3 Manage Data Governance.

5.0 Domain: Data Transformation

5.2 Define and create External Functions.

5.3 Design, Build, and Leverage Stored Procedures.

5.4 Handle and transform semi-structured data.

5.5 Use Snowpark for data transformation.

Data Transformation Study Resources:

Ready to register for an exam? Navigate here to get started.

1. Running the below clustering information analysis function:

SELECT SYSTEM$CLUSTERING_INFORMATION(‘table1 , ‘(col1, col2)’)

on TABLE1, that is not clustered, will return which of the following?

a. An error: this function works only on clustered tables.

c. Clustering information: the information will be presented as if the table was

SELECT SYSTEM$CLUSTERING_INFORMATION( ‘orders’, ‘(o_orderdate)’);

How should the Engineer interpret these results?

a. Store the files on the external stage to ensure caching is maintained

Which of the following is the QUICKEST way to accomplish this task?

a. COPY INTO @myExtStage from (SELECT $1, $2, ...

CREATE TABLE table_a (c1 INT)

CREATE TABLE table_b (c1 INT);

What will be the impact of running the following command?

ALTER SCHEMA S1 SET DATA_RETENTION_TIME_IN_DAYS = 20;

You might also like