7 Snowflake Reference Architectures For Application Builders
7 Snowflake Reference Architectures For Application Builders
ARCHITECTURES FOR
APPLICATION BUILDERS
For every data app use case, there is a modern data architecture. Discover yours. EBOOK
3 Why your data platform matters
5 Serverless data stack reference architecture
6 Streaming data stack reference architecture
7 Machine learning and data science reference architecture
8 Application health and security analytics
9 IoT reference architecture
10 Customer 360 reference architecture
11 Embedded analytics reference architecture
12 Future-proof your applications
13 About Snowflake
WHY YOUR DATA
PLATFORM MATTERS
It’s safe to say data application builders will never worry about a
lack of data. Approximately 40 zettabytes (ZB) of new data was
generated in 2019, and IDC predicts that with a steady growth
trajectory, 175 ZB will be generated in 2025. Although these ever-
increasing amounts of data present immeasurable opportunities for
delivering data-driven insights to customers, there are three crucial
questions every startup and established ISV provider should ask:
CAN OUR UNDERLYING ARCHITECTURE SCALE TO MEET THE
NEEDS OF OUR FAST-GROWTH BUSINESS?
CAN OUR PRODUCT INGEST AND ANALYZE LARGE AMOUNTS OF
STRUCTURED AND SEMI-STRUCTURED DATA TOGETHER?
MOST IMPORTANTLY, CAN WE ACCOMPLISH THESE GOALS WHILE
REMAINING OPERATIONALLY EFFICIENT AND COST-EFFECTIVE?
CHAMPION GUIDES
Today, too many organizations are burdened by • SQL for all data This ebook provides detailed reference architectures
infrastructure costs that arise from traditional Snowflake ingests JSON, Avro, Parquet, and other for seven use cases and design patterns, and it
architectures. When companies can achieve data without transformations or requiring pipeline demonstrates the importance of a cloud-built data
scalability only by throwing more resources at the fixes every time the schema changes. With ANSI platform that matches scalability and connectivity
problem, companies face an expensive and never- SQL, Snowflake enables your teams to query semi- expectations, both today and in the future.
ending problem. Traditional architectures are also structured data just as easily as structured data.
riddled with operational overhead in the form of
• No Site Reliability Engineering/DevOps burden
maintenance and tuning, which wastes valuable
As a near-zero management platform, Snowflake
engineering time and slows growth.
automatically handles provisioning, availability,
The questions above highlight the intrinsic need tuning, data protection, and other operations,
for a data stack architecture that has scalability, which enables you to focus on your own
connectivity, and support for all data types built application rather than maintenance.
into its design. That means selecting cloud-built
infrastructure components, the most important of Snowflake also ensures seamless connections to
which is your data platform. third-party platforms and APIs, easily fitting in with
your existing environment.
As the central hub for all-things data, only a cloud
data platform can deliver the performance and
nearly infinite autoscaling needed to launch and scale
applications quickly and cost-effectively. Here’s what
the Snowflake Cloud Data Platform provides:
4
CHAMPION GUIDES
SERVERLESS DATA OBJECTIVE
Build data intensive applications that
ARCHITECTURE
SERVERLESS DATA STACK DESCRIPTION
1 The client-side app, running on mobile or web
API Gateway Service Serverless Compute NoSQL/OLTP DB Serverless ETL devices, invokes the application logic on the
serverless compute via an API gateway service.
The gateway authenticates the API calls and
throttles them, based on SLAs.
Amazon API Gateway AWS Lambda Amazon Aurora AWS Step Functions 2 Serverless compute runs the application logic
Serverless and scales on demand, without the need to
provision or manage servers. The application
Asure Data Factory queries Snowflake data (5) for runtime decisions,
such as delivering product recommendations or
powering a dashboard for analysis.
Azure API Management Azure Functions
Azure Google Cloud 3
An OLTP or NoSQL database provides the
Cosmos DB Composer application with high-capacity transaction
processing. This NoSQL/OLTP database can also
be a serverless service.
5
CHAMPION GUIDES
STREAMING DATA OBJECTIVE
Build data intensive applications
Google Cloud
Storage Amazon S3
6
CHAMPION GUIDES
MACHINE LEARNING AND OBJECTIVE
Train machine learning (ML) models to
7
CHAMPION GUIDES
APPLICATION HEALTH OBJECTIVE
Analyze large volumes of log data to
Azure
Blob
Storage
3 7
8
CHAMPION GUIDES
IOT REFERENCE OBJECTIVE
Build applications that analyze large
IOT DESCRIPTION
1 Smart devices, sensors, and other IoT devices
generate continuous data.
IoT Rules
Engine
Due to frequently unreliable internet
3 Streaming Services 6 2
connectivity, IoT devices communicate using the
MQTT protocol and an IoT message broker. The
5 message broker uses a publish and subscribe
mechanism to interact with other services, which
subscribe to specific topics within the broker to
access device data.
1 2 Amazon
Kinesis 3 A streaming service is used to ingest and buffer
IoT IoT Message real-time device data, thus ensuring reliable
Native JSON ingestion and delivery to a staging table in
Devices Broker Support Snowflake (5).
Cloud
AWS IoT Pub/Sub
Aggregation Using 4 In cases where the application requires it, cloud
Core
Streams & Tasks object storage is used to stage batch data prior
to ingestion. For example, minute-by-minute
Azure data may be stored in cloud object storage,
Event
whereas aggregated data over a longer period
Hub
MQTT Azure IoT IoT Analytics may be stored in Snowflake (5).
Hub
5 Snowflake offers native support for JSON and
Time-series Optimized
Cloud Object other semi-structured data formats for easy
Data Ingestion ingestion of device data. Snowpipe automatically
Storage with Snowpipe
Cloud IoT optimizes time-series queries by ingesting data
Core chronologically. Snowflake’s Streams and Tasks
Google features automate the workflows required to
Cloud Query Data in
Object Storage via ingest and aggregate incoming data.
Storage
External Tables
6 An IoT rules engine hosts the business logic
HiveMQ required by the application and operates on
Amazon data available in Snowflake and in the message
S3 broker. The rules engine sends messages back to
controls devices.
Azure
Blob
4 Storage
9
CHAMPION GUIDES
CUSTOMER 360 OBJECTIVE
Build sales and marketing applications
Amazon Cloud
3 Native
5
Snowflake supports all the analytics workloads
Kinesis within the application. External Tables support
Pub/Sub JSON queries of data in cloud object storage without
ETL Support ingestion. The Streams and Tasks features
automate the ingestion and data enrichment
AWS Step process. Native support for JSON and other
Cloud Object Storage Functions Data Enrichment using semi-structured formats simplifies the ingestion
Streams & Tasks of event data. Secure Data Sharing enables
Product Data monetization of fresh data without copying or
moving the data.
Google Cloud ML models are trained to optimize offers based
Audience Data Dataflow 6
on historical data stored in Snowflake. The
1 application makes real-time predictions via an
Purchase Attribution Data Azure Data Factory Query Data in API and uses Snowflake tables to store input
Object Storage via data and batch prediction results.
External Tables
User Activity Data Data Monetization
via Secure
Data Sharing
10
CHAMPION GUIDES
EMBEDDED ANALYTICS OBJECTIVE
Build analytics-heavy applications that
4 In-App Embedded 5
Embedded business intelligence tools or open-
source charting libraries support analytics from
Business Intelligence within the application.
Workload Isolation
11
CHAMPION GUIDES
FUTURE-PROOF YOUR
APPLICATIONS
Regardless of the type of applications you Rather than spend valuable development time
build or what architectural design pattern you rearchitecting your data stack over and over again
select, you must meet the core data platform to chase ever-evolving scalability needs, a cloud data
platform lets you focus on what you do best:
requirements for scalability and connectivity
building and improving your application to entice
if you want to attract and keep customers to new customers.
grow your business. With Snowflake, you can
And that’s something you can hang your app on.
meet customer expectations with a modern
foundation for your data stack that delivers
a highly performant service, both now and in
the future.
12
ABOUT SNOWFLAKE
Snowflake’s cloud data platform shatters the barriers that have prevented organizations of all sizes from unleashing the true value from their data.
More than 2,000 customers deploy Snowflake to advance their businesses beyond what was once possible by deriving all the insights from all
their data by all their business users. Snowflake equips organizations with a single, integrated platform that offers the only data warehouse built for
the cloud; instant, secure, and governed access to their entire network of data; and a core architecture to enable many types of data workloads,
including a single platform for developing modern data applications. Snowflake: Data without limits. Find out more at snowflake.com
CITATIONS
“The Digitization of the World From Edge to Core.” IDC. bit.ly/2QuFiKk
1