Cloud Data Warehousing For Dummies 3rd Edition
Cloud Data Warehousing For Dummies 3rd Edition
Cloud Data Warehousing For Dummies 3rd Edition
by David Baum
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Cloud Data Warehousing For Dummies®, 3rd Snowflake
Special Edition
Published by
John Wiley & Sons, Inc.
111 River St.
Hoboken, NJ 07030-5774
www.wiley.com
Copyright © 2024 by John Wiley & Sons, Inc., Hoboken, New Jersey
No part of this publication may be reproduced, stored in a retrieval system or transmitted in any
form or by any means, electronic, mechanical, photocopying, recording, scanning or otherwise,
except as permitted under Sections 107 or 108 of the 1976 United States Copyright Act, without
the prior written permission of the Publisher. Requests to the Publisher for permission should be
addressed to the Permissions Department, John Wiley & Sons, Inc., 111 River Street, Hoboken, NJ
07030, (201) 748-6011, fax (201) 748-6008, or online at http://www.wiley.com/go/permissions.
Trademarks: Wiley, For Dummies, the Dummies Man logo, The Dummies Way, Dummies.com,
Making Everything Easier, and related trade dress are trademarks or registered trademarks of John
Wiley & Sons, Inc. and/or its affiliates in the United States and other countries, and may not be
used without written permission. Snowflake and the Snowflake logo are trademarks or registered
trademarks of Snowflake, Inc. All other trademarks are the property of their respective owners.
John Wiley & Sons, Inc., is not associated with any product or vendor mentioned in this book.
For general information on our other products and services, or how to create a custom For Dummies
book for your business or organization, please contact our Business Development Department
in the U.S. at 877-409-4177, contact [email protected], or visit www.wiley.com/go/custompub.
For information about licensing the For Dummies brand for products or services, contact
BrandedRights&[email protected].
ISBN 978-1-394-21162-3 (pbk); ISBN 978-1-394-21163-0 (ebk)
Publisher’s Acknowledgments
Some of the people who helped bring this book to market include the following:
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Table of Contents
INTRODUCTION................................................................................................ 1
About This Book.................................................................................... 1
Icons Used in This Book........................................................................ 2
Beyond the Book................................................................................... 2
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
CHAPTER 5: Bolstering Data Security and Governance............. 27
Exploring the Fundamentals of Database Security......................... 28
Eliminating security silos............................................................... 28
Encrypting data by default............................................................ 28
Verifying vendor participation...................................................... 29
Patching, updates, and network monitoring.............................. 29
Ensuring data protection, retention, and redundancy.............. 30
Securing marketplace data........................................................... 30
Controlling user logins.................................................................. 30
Applying access controls............................................................... 31
Governing How People View, Access,
and Interact with Your Data............................................................... 31
Protecting your data...................................................................... 32
Classifying and identifying data................................................... 32
Demanding attestations and compliance certifications........... 33
Monitoring data quality................................................................. 33
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Introduction
D
ata is infiltrating all types of business processes and
reshaping the way companies operate. Regardless of your
industry or market, the ability to manage data easily,
securely, and efficiently has become vital for success.
Introduction 1
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
In this book, you learn how to create an innovative, cost-
effective, and versatile cloud data platform that powers not
only your data warehouse but also many other data workloads.
Additionally, you learn how to extend an existing data warehouse
to take advantage of the latest cloud technologies.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
IN THIS CHAPTER
»» Understanding data warehouses, data
lakes, and cloud data platforms
Chapter 1
Introducing Cloud Data
Warehousing
A
traditional data warehouse required purchasing, install-
ing, and configuring the necessary hardware, software,
and infrastructure to store and analyze data. Cloud data
warehousing emerged as an efficient, cost-effective way for
organizations to scale analytics without those upfront costs. And,
when a cloud data warehouse lives on a well-architected, modern
cloud data platform, it not only enables organizations to acceler-
ate analytics but also broadens data management capabilities to
include other architectures, like a data lake, and can securely and
efficiently run other workloads. To help you understand data
warehouses, data lakes, and the modern cloud data platform, this
chapter defines each, and briefly shows how the modern cloud
data platform came into being. The chapter wraps up with a quick
look at trends in data processing and how those trends require the
ability to shift and meet new data demands.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Defining the Data Warehouse
Initially, data warehouses were simply relational databases that
stored and queried large volumes of structured data. Today,
cloud-built and hybrid cloud data warehouses can also incor-
porate semi-structured data, such as JavaScript Object Nota-
tion (JSON) weblogs, and unstructured data, such as images and
audio conversations. This has allowed modern data warehouses
to expand beyond mere analytic repositories for internal business
operations and include a burgeoning volume of data from mobile
apps, online games, Internet of Things (IoT) devices, social media
networks, generative AI systems, and many other sources.
Data warehouses and data lakes are both widely used to store big
data but aren’t interchangeable. A data lake is a vast pool of raw
data that is stored in a highly flexible format for future use. A data
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
warehouse is a repository of filtered data that has been prepro-
cessed for a specific purpose. We explore these differences further
in Chapter 2.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
MARRIOT SIMPLIFIES ITS DATA
PLATFORM AND ACHIEVES LOWER
TOTAL COST OF OWNERSHIP
Marriot, a Snowflake customer, comprises 32 global brands across
139 countries, with 8,300 hotels offering 15 million hotel rooms, and
100,000 home and villa properties.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
As analytic applications, data science applications, data engineer-
ing pipelines, and many other types of data applications have
grown in popularity and importance, many of these legacy data
warehouse platforms have bowed under the strain. Restricted by
a linear architecture, they can’t run multiple workloads in paral-
lel, leading to long wait times for computing resources and the
data-driven insights they impart. Many users complain of slow,
inefficient queries, scalability issues, and rising licensing costs as
analytic workloads grow.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Traditional data platforms don’t scale well, and having a fixed
set of compute and storage resources limits concurrency (the
degree to which users can simultaneously access the same data
and computing resources). Today, thanks to the nearly infinite
resources available in the cloud, businesses can easily scale com-
pute resources to handle an escalating volume of activity.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
is separate, they can be expanded and contracted independently,
enabling data warehouses to be more responsive and adaptable.
FIGURE 1-1: The modern data warehouse must support many types of data,
analytic use cases, and applications.
(continued)
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
(continued)
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
IN THIS CHAPTER
»» Supporting many languages
Chapter 2
Standardizing on a
Versatile Data Platform
R
egardless of your industry or market, the capability to har-
ness your data easily and securely in a multitude of ways
has become paramount for success. A modern cloud data
platform empowers you to consolidate your data, providing
unlimited bandwidth for data analysis, data sharing, data engi-
neering, application development, and data science initiatives. As
a result, your business users become more efficient and your IT
team can break free from mundane data administration tasks,
allowing everyone to focus on delivering valuable experiences.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Supporting Many Languages
SQL, Python, Scala, Java, JavaScript — developers interact with
many languages to access data and build data applications,
including non-coding languages, natural languages, and conver-
sational interfaces, such as generative AI tools that use program-
ming languages behind the scenes.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
to produce serialized data in a compact binary format. The
serialized data can be sent to any destination (that is,
application or program) where it can be easily deserialized
because the schema is included in the data.
»» Apache ORC (Optimized Row Columnar), a columnar format
used to speed up Apache Hive queries. ORC was designed for
efficient compression in Hadoop and improved performance
of Hive for reading, writing, and processing data.
»» Apache Parquet, a compressed, efficient columnar data
representation designed for projects in the Hadoop ecosystem.
This file format supports complex nested data structures and
uses Dremel record shredding and assembly algorithms.
»» XML, a markup language that defines a set of rules for
encoding documents. XML was originally based on standard
generalized markup language (SGML), another markup
language developed for standardizing the structure and
elements that comprise a document.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
A complete cloud data platform can store diverse types of data in
their native formats without creating data silos or imposing unique
schemas to access data. You don’t have to develop or maintain sep-
arate storage environments for structured, semi-structured, and
unstructured data. It is easy to load, combine, and analyze all data
through a single interface while maintaining transactional integrity.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
them to more open and versatile formats. Such migrations can
become a massive undertaking, sort of like trying to copy a life-
time’s worth of family movies from an analog VHS format to a
digital format like MP4.
With new types of data, you may encounter new architectural pat-
terns that you didn’t predict. For instance, you may want a data
warehouse to be transformed into a hybrid pattern that merges
the strengths of data warehouses and data lakes. Additionally,
domain-specific data marts could evolve into a more streamlined
and regulated data mesh.
FIGURE 2-1: A versatile data platform powers a full spectrum of use cases,
whether data is stored inside a data warehouse or in external tables.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
data privacy mandates, and ensuring data quality. Rather than
creating silos, a data mesh breaks them down — it distributes
data responsibilities across different teams or domains while
maintaining data discoverability and accessibility.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
IN THIS CHAPTER
»» Defining essential architectural
attributes
Chapter 3
Architecting a Cloud
Data Platform That
Just Works
C
reating an effective cloud data warehouse isn’t just a matter
of repurposing yesterday’s on-premises technologies or
moving existing analytic applications and databases from
your data center to a cloud vendor’s infrastructure. Properly
leveraging the power and scale of the cloud requires a new
mindset, a new set of management principles, and new cloud-
built capabilities.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
A modern cloud data warehouse includes a central persisted
data repository that is accessible from all compute nodes. Like
a shared-nothing architecture, it processes queries using MPP
(massively parallel processing) compute clusters.
»» The storage layer holds your data, tables, and query results.
This scalable repository should handle structured, semi-
structured, and unstructured data and span multiple regions
within a single cloud and across major public clouds.
»» The compute layer processes enormous quantities of data
with maximum speed and efficiency. You can easily specify
the number of dedicated clusters you want to use for each
workload (thus eliminating contention for resources) and
have the option to let the service scale automatically.
»» The services layer coordinates transactions across all
workloads and enables concurrent data loading and
querying activities, enforcing security, propagating metadata,
optimizing queries, and performing other important data
management tasks. When each workload has its own
dedicated compute resources, operations can run simultane-
ously and perform as needed.
»» The cross cloud and global layer globally connects data and
applications across regions and clouds, securely, through a
single, consistent experience, and is described further below.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
examined data usage patterns at 7,800 organizations — all Snow-
flake customers. According to the survey, the number of organi-
zations operating across the three leading public cloud providers
(Amazon Web Services, Microsoft Azure, and Google Cloud) grew
207% during the 12 months ending January 2023.
These companies need data warehouses that can store and man-
age data consistently across many different geographic regions
and clouds. However, when working with multiple cloud provid-
ers, how do you ensure that the same security configurations,
administrative techniques, analytics practices, and data pipelines
apply to all your cloud providers? For example, will you have to
resolve differences in audit trails and event logs or apply unique
tuning and scaling techniques on each cloud? Will your security
experts have to deal with varying sets of rules or work with multi-
ple key management systems to encrypt data? Will data engineers
have to create unique pipelines?
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
FIGURE 3-1: A modern cloud data platform should seamlessly operate across
multiple clouds and apply a consistent set of data management services to
many types of data workloads.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
production application creates its own data silo. For example, mar-
keting data resides in a marketing automation system, sales data
in a customer relationship management (CRM) system, finance
data in an enterprise resource planning (ERP) system, and inven-
tory data in a warehouse management system, among others.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
»» Does the service automatically encrypt all our data at rest
and in motion with industry-standard encryption, or do we
have to set up and apply encryption to the data manually?
Does the encryption system hinder query performance?
»» Does the service scale up and out instantaneously and
elastically and then release extra compute or storage
resources when they’re no longer in use? Or do we have to
handle these tasks manually?
»» Does the cloud provider automatically replicate your data to
ensure business continuity across regions? After cross-
regional replication is established, do we have to set up
change data capture (CDC) procedures to keep multiple
databases in sync, or does the vendor handle that for us?
»» Do we need to partition data, tune SQL queries, and
optimize performance, or does the platform handle this
automatically?
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
IN THIS CHAPTER
»» Ensuring value through consumption-
based pricing
Chapter 4
Achieving Exceptional
Price and Performance
F
ast analytical performance is crucial for data-informed
decision-making. However, the more data you ingest and
process in your data warehouse, the more cloud resources
you consume, which can have a direct impact on costs.
This chapter dives into these aspects and describes how to achieve
cutting-edge performance while simultaneously monitoring data
warehouse costs and optimizing resource use.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Utilizing Consumption-Based Pricing
Make sure that the pricing model for your cloud data warehouse
matches the value you obtain from it. Paying for a set amount of
storage and computing power, commonly known as subscription-
based pricing, can incur significant yearly costs and typically requires
regular management. To ensure that you don’t pay for more capac-
ity than you need, your cloud data platform should offer usage-
based pricing.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Calculating and Controlling Costs
As enterprises migrate IT workloads to the cloud, they’re transi-
tioning from a world of scarcity to a world of abundance marked
by nearly limitless data storage resources and nonstop data pro-
cessing capacity. It’s important to control costs and rein in exces-
sive consumption.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Look for a cloud data warehouse solution that automatically
optimizes performance and eliminates administrative effort to
incorporate new resources. Whether it’s search optimization (SO)
capabilities, more efficient storage compression techniques, or
reduced compilation time for SQL queries, you shouldn’t have to
do anything to gain access to new features or the latest capabilities.
AUTOMATION DRIVES
INNOVATION
Veradigm, a Snowflake customer, is a technology company that deliv-
ers care and financial solutions to healthcare providers. To provide
stakeholders with actionable data and insights, the company ingests
and analyzes large amounts of data on electronic health records,
disease registry data, and claims data.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
IN THIS CHAPTER
»» Securing data through encryption, user
login controls, access controls, and more
Chapter 5
Bolstering Data Security
and Governance
I
n recent years, there has been a spike in the proliferation of
data generated and collected by organizations. With data from
third-party sources becoming more common — such as data
from SaaS apps, popular application clouds, data marketplaces,
data exchanges, and more — data security, data privacy, data
governance, and regulatory compliance have become much more
complicated. Organizations need to understand the source of
common threats and take a hard look at who might be trying to
misuse, breach, or attack their database management systems.
For example, trade secrets may be valuable to industry competi-
tors, while energy grid information is a target for political
saboteurs. Understanding these realities is the starting point for
setting up comprehensive security, governance, and compliance
policies that can be consistently enforced across your entire
data estate.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Exploring the Fundamentals
of Database Security
Securing your data and complying with pertinent regulations is
fundamental to the architecture, implementation, and opera-
tion of a cloud data warehouse service. All aspects of the service
must be centered on protecting your data as part of a multilayered
strategy that considers both current and evolving security threats.
Your security strategy should address external interfaces, access
control, data storage, and physical infrastructure in conjunction
with comprehensive network monitoring, alerts, and verifiable
cybersecurity practices.
The vendor must protect the decryption keys that decode your
data. The best service providers employ AES 256-bit encryption
with a hierarchical key model. This method encrypts the encryp-
tion keys and instigates key rotation that limits the time during
which any single key can be used.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Data encryption and key management must be always on and
entirely transparent. Having the option to supply your own
encryption keys is important so that you can disconnect the cloud
provider from your data if necessary.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
(also known as penetration testing) by an independent security
firm to proactively check for vulnerabilities.
In some cases, data providers create data clean rooms that enforce
designated governance policies. These sanitized data sets can be
confidently shared with partners and other external constituents
without exposing sensitive information.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
management (IAM) procedures. The data warehouse should also
permit you to apply multifactor authentication (MFA) at the
account level. This permits you to require some or all users to
pass through a secondary level of verification such as entering a
one-time security code sent to the user’s mobile phone.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
»» Secure joins establish linkages without revealing personally
identifiable information (PII). It allows discreet connections to
people, devices, cookies, or other identifiers.
»» Secure UDFs let users analyze fine-grained data while
protecting raw data from being viewed or exported by other
parties.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
assists compliance officers in tracing the usage of sensitive data,
including its sources, destinations, and any transformations
along the way.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
relevant. The best data platforms include out-of-the-box sys-
tem metrics for the most common types of data quality issues,
and make it easy to define, measure, and monitor data quality via
integrated, cloud-native facilities (see Figure 5-1).
»» Know your data: Classify data, tag sensitive data, and audit
data usage
»» Protect your data: Secure sensitive and regulated data with
granular access policies
»» Connect your ecosystem: Seamlessly extend your data
governance policies as you share data, internally and
externally, across regions and clouds.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
IN THIS CHAPTER
»» Recognizing and overcoming technology
limitations
Chapter 6
Enabling Data Sharing
D
ata sharing is the act of providing access to data — both
within an enterprise and between enterprises. The
organization that makes its data available, or shares its
data, is a data provider. The organization that wants to use the
shared data is a data consumer. Any organization can be a data
provider, a data consumer, or both.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
of data version control. These complexities, coupled with data-
base inconsistencies, authenticity headaches, and the difficulty of
sharing large volumes of data add up to frustrating, expensive,
and time-consuming data exchange processes.
Look for a cloud data platform that allows you to accomplish the
following:
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
FIGURE 6-1: Identifying the attributes of modern data sharing.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
to help retailers target consumers with ads. Consumer packaged
goods companies can share purchasing data with online advertis-
ers or directly with customers.
FIGURE 6-2: A cloud data platform enables you to securely leverage your data
warehouse to share and collaborate with your data, for every scenario.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
IN THIS CHAPTER
»» Accommodating geospatial analytics
»» Developing AI applications
Chapter 7
Advancing Analytics
B
usiness intelligence (BI) is no longer merely the domain of
executives, professional analysts, and data scientists. An
effective cloud data platform that supports data warehouse
workloads establishes not only a common repository for all types
of data and analytics but also empowers diverse teams to collab-
orate and easily manage data. Popular analytic methods include
the following:
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
The data warehouse workload in your cloud data platform should
support a broad ecosystem of third-party BI solutions, as well as
offer native tools for specific types of analysis. Some of the pri-
mary capabilities are summarized below.
Select a data platform that can store and process any type of
spatial vector object and perform complex geospatial transfor-
mations, such as converting geographic coordinates to street
addresses. The processing engine must be able to handle location
data at scale and seamlessly integrate with leading GIS tools.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
»» Substring and regular expression searches
»» Queries on fields in columns that use certain types
of predicates
Developing AI Applications
You may start out using a cloud data platform for a traditional
warehousing workload. As your volume of data grows, as your
data analysts advance, and as you hire data scientists to join your
team, you can start using the cloud data platform to store and
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
process artificial intelligence (AI)/ML workflows, train predictive
models, and then put those models into production.
ML algorithms learn from data; the more data you provide, the
more capable they become. A cloud data platform gives you one
place to instantly access all relevant data for AI and ML workflows
without complex data pipelines. It enables data science teams to
store and process nearly limitless volumes of data at a progres-
sively lower cost via powerful arrays of computers that can be
scaled up and down at will. It unifies data security and data gov-
ernance activities, fosters collaboration, and provides elastic scal-
ability for data science and related analytic endeavors.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Chapter 8
Four Steps for Getting
Started with Cloud Data
Warehousing
CHAPTER 8 Four Steps for Getting Started with Cloud Data Warehousing 43
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Step 2: Migrate or Start Fresh
Assess how much of your existing environment should migrate
to the new data platform and what should be built from scratch.
Defining strategy and goals, taking account of budget and
resources to migrate, and understanding your data volume can
help you make this decision. To better understand which approach
is best for your organization, talk to the professional services
team of the data platform you’re considering. Your BI solutions,
data visualization tools, data science libraries, and other software
development tools must easily adapt to the new architecture.
These materials are © 2024 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
WILEY END USER LICENSE AGREEMENT
Go to www.wiley.com/go/eula to access Wiley’s ebook EULA.