YugabyteDB TVA Datastax
of YugabyteDB Anywhere vs. DataStax Enterprise
Leverage the PRESS framework to evaluate how
your next database can optimize productivity,
resiliency, efficiency, security, and savings.
Introduction
Organizations have made significant progress and investment in
transforming their applications and infrastructure over the past decade.
However, one part of the technology stack has remained largely
unchanged: the transactional database.
The objective of this paper is to help you compare YugabyteDB Anywhere and DataStax
Enterprise across a number of key areas. We have created a reference guide that considers the
short- and long-term impact of both databases. This enables you to make an informed decision.
The paper is structured around the five critical parameters that make up the YugabyteDB PRESS
framework:
Productivity: Increase your rate of innovation and differentiation by bringing new ideas to
production faster.
Resiliency: Ensure apps work seamlessly without any major impact on performance, even
Efficiency: Enable database operators to offer an internal DBaaS, make high-impact changes
Security: Secure data anywhere, patch security issues in a timely manner, and limit threat
Savings: Reduce both upfront and ongoing costs by optimally allocating or reclaiming
resources as needed.
database-as-a-service) and DataStax Enterprise for each of the criteria above. This takes into
account the ease of each solution meeting the requirements, and the features currently
available. Although we have done thorough research into the third-party offerings, you should
use this paper as a guide and also conduct your own analysis.
Below is a brief summary. A more detailed analysis of our findings follows later in the paper.
PRESS Framework | YugabyteDB Anywhere Ranking | DataStax Enterprise Ranking | YugabyteDB Anywhere Reason | DataStax Enterprise Reason

Architectural
sharding, HA/DR

PRESS Framework: Efficiency
YugabyteDB Anywhere Ranking: 4 out of 5
DataStax Enterprise Ranking: 3 out of 5
YugabyteDB Anywhere Reason: Faster deployments, operational efficiency while scaling, automatic large partition splitting, enhanced compaction management, labor efficient operations, profitability
DataStax Enterprise Reason: Operationally challenging expansion and upgrades, challenges with background compaction, issues with garbage collection, bottlenecks while scaling, large partition issues, time consuming read repairs, eventual consistency, high storage costs

PRESS Framework: Security
YugabyteDB Anywhere Reason: End-to-end built-in
of key certifications, expanded KMS options, periodic maintenance, and upgrades
DataStax Enterprise Reason: Mature set of certifications, adherence to GDPR compliance,
YugabyteDB Anywhere is the ideal fit for organizations that require strong
data consistency and the familiarity of popular APIs (both PostgreSQL and
Cassandra) along with key cloud native features like high availability,
Organizations that are a better fit for YugabyteDB than DataStax Enterprise
A need for the power and flexibility of SQL Queries: These organizations
respond to new and changing business needs without requiring new data
multi-API upper half with the industry’s best PostgreSQL compatibility (the
sacrificed data consistency for scale and resiliency, but now realize they
can simplify the lives of their operations and app development teams as well
needs. While it can scale, DataStax is very hard and slow to scale once in
have to size their growth into the initial footprint, so they must bear the cost of
much higher density per node, 5 - 10+ TBs per node as well as a
They are looking for a solution (like YugabyteDB) that offers the core
existing staff on hand that understands (or is already operating) a Cassandra environment,
modernizing to a distributed SQL database like YugabyteDB may not be a high priority at this
time. However, you should keep these considerations in mind, so you can recognize them early
if your applications continue to scale and data accuracy becomes more important.
We recognize that some applications may not yet need the capabilities that come with
and globally distributed database solution built to handle a high volume of data. It has wide
adoption, a strong community, and a range of available resources and tools, making it easy to
Organizations often have a portfolio of databases used by different business units and
applications, based on specific needs and the skill sets available. For non-transactional
applications or smaller applications where data consistency, as well as hardware and storage
Cassandra-based databases, like DataStax Enterprise, deliver value and are well-suited for
Consistency is a low priority (i.e., the workload is not a priority system of record): DataStax
Enterprise offers eventual consistency with scale and resilience that is sufficient for many
Asynchronous replication for more than three regions: Organizations can consider DataStax
Enterprise when they need specific replication configurations such as hub/spoke models and
N:N bidirectional replication.
delivers ACID level consistency for distributed transactions. Cassandra provides a range of
weaker consistency options, providing tradeoffs on performance and how they achieve
Overall, DataStax Enterprise may be a good choice for businesses that prioritize availability and
only need to handle moderate amounts of data where there are minimal concerns about large
infrastructure sprawl and associated costs. DataStax Enterprise may not be the best fit for
businesses that require strong consistency, multi-API support, or low operational complexity.
Here, we will examine how YugabyteDB Anywhere and DataStax Enterprise (DSE) compare using
the five elements of the PRESS framework (Productivity, Resiliency, Efficiency, Security, and
Savings). By measuring against these five key considerations, you can ensure you choose the
PRODUCTIVITY
An organization's ability to deliver rapid innovation and stay ahead of the competition is often a
direct result of how efficient its developers are. Do developers have the time and tools required
to deliver new services or revenue-impacting enhancements? Or, are they required to spend
valuable time building workarounds into the application to handle various gaps and tradeoffs in
Along with these common issues, database migration challenges can be a major barrier. It can
slow down, or in many cases prevent, cloud-native and application modernization projects from
moving forward. Organizations can significantly boost their rate of innovation and
competitiveness by focusing on new solutions that minimize these challenges and decrease
time to value.
Both YugabyteDB Anywhere and DataStax Enterprise offer an alternative to Apache Cassandra.
DataStax Enterprise simply extends Apache Cassandra while YugabyteDB Anywhere offers a
These are very different approaches, which impact overall productivity differently. Based on our
productivity issues that exist with Cassandra and Cassandra-based solutions like DataStax
Enterprise.
YugabyteDB: DSE:
core PostgreSQL code for the front end, developers can use existing skills and tools.

Support diverse skills and tools: YugabyteDB provides a flexible query layer with two key APIs: Cassandra (YCQL) and PostgreSQL (YSQL). Developers familiar with either PostgreSQL or Cassandra can build on their existing skills and tools to write
learning curve.

YugabyteDB allows for simple relational SQL transactions and queries. As new business needs arise that require a new query, they can be implemented quickly and easily without needing changes to the data model or new copies of the data.

Enjoy strongly consistent data: YugabyteDB delivers strongly consistent, ACID transactions across a distributed, scalable database. As a result, developers do not need to handle common consistency issues like read after write, concurrency

Manage large-scale deployment: YugabyteDB Anywhere helps you move away from manual, complex Day 2 operations, especially for large clusters. YugabyteDB Anywhere intelligently orchestrates your database scaling, upgrades, backups, monitoring, and security operations across your public, private or hybrid infrastructure. This helps optimize your systems for performance

Leverage the power of global secondary indexes and JSONB: Using advanced indexes can enhance database performance by enabling the database server to find rows faster. YugabyteDB supports strongly consistent (ACID) secondary indexes to help you quickly retrieve data using columns that are not part of the primary key. YugabyteDB supports global secondary indexes, partial indexes, and covering indexes in addition to JSON and JSONB for added data type flexibility

Migrate quickly: YugabyteDB Voyager helps you migrate from the most common legacy and single-cloud relational databases to YugabyteDB quickly and easily. This newly launched migration engine can manage the entire lifecycle of database migration, including cluster preparation for data import, schema migration, and data migration.

Interns at Finserv Company Learn YugabyteDB
YugabyteDB's strong
allowed interns new to YugabyteDB to deliver powerful projects to Fiserv. An executive at Fiserv shared a story about working with a set of college interns. Fiserv
solve--something they planned to build into a product--and told them to use YugabyteDB. The interns knew PostgreSQL but had never touched YugabyteDB before. After a couple of days of Google searches and some testing, they were off and running, capitalizing on their knowledge and experience, and applying this to YugabyteDB without any problems.
However, there are several challenges that users report can negatively impact their organizational

Expensive data modeling: When migrating to DataStax Enterprise, customers need to invest more in redesigning their data model as this is
model it properly and you may require additional third-party tools like Solr or Spark integration to
Enterprise also has indexing and querying limitations, such as the non-availability of mixed

Hard to manage at scale: The DataStax Enterprise OpsCenter has challenges when monitoring larger
dashboard for monitoring and cluster lifecycle management, and was bolted on later hence the cluster management challenges. This greatly brings
matches most of our use cases. We have some tables as an index to help on
using a dedicated DC for Spark, but that comes with an extra cost for new
We live in a world where customers require instant access to information and data. Businesses
are expected to deliver 24x7x365 services that are always available to customers, wherever
and whenever they are in the world. The data behind those apps has become the lifeblood of
However, cloud failures and outages are becoming normal. Over the past 18 months, we’ve seen
a major cloud outage occur, on average, every 50 days. As we continue to see regular major
cloud failures, just “moving to the cloud” is not a good enough strategy for high availability.
Organizations need to evaluate the backbone of their data, their database, to ensure they can
meet customers’ demands with minimal performance impact if something goes down.
NoSQL saw major growth 10+ years ago because it delivered a distributed architecture that
provided native resiliency and security—something not easily accomplished with traditional,
monolithic relational databases. With distributed SQL databases, like YugabyteDB, that is no
longer true. Organizations can now get relational capabilities with the resilience and availability
Both YugabyteDB and DataStax Enterprise are known for providing a distributed, highly
available database that can survive a wide variety of failures. If availability is your primary
requirement, then both systems can address that need. However, the newer architecture of
YugabyteDB provides additional advantages that might be important to your organization when
YugabyteDB: DSE:
tolerance, continuous availability, and disaster recovery. These resiliency benefits position
As you examine which database solution works best for your use cases, YugabyteDB’s
architecture provides some key differences compared to DataStax Enterprise. These features
and allows organizations to access data locally with confidence. Users do not need to worry
about losing transactions or reading stale data, which can happen in the eventually consistent
model of NoSQL. YugabyteDB prioritizes data consistency, so even during failures, any data
read will be the most current. This is a key requirement for true System of Record applications.
recovery to the millisecond. Data backups can be taken numerous times daily without impacting performance, allowing recovery from even the most serious

Rolling upgrades with zero downtime: Yugabyte sets high standards for data retention and performance when the cloud provider whose service YugabyteDB
Recovery Point Objective (RPO) of 0 (meaning no data loss will occur during failures) and a Recovery Time Objective (RTO) of ~3 seconds, i.e., to recover and resume operations from the new zone. YugabyteDB also performs rolling

Simplified App Development: Resiliency and consistency are powered by the distributed data layer, meaning developers do not need to worry about resolving availability and consistency issues in the app. As a result, app development is easier but also ensures that data and applications are protected completely

DataStax Enterprise provides a masterless architecture for zero downtime to help meet strict enterprise availability requirements. Organizations can maintain
However, there are some challenges to be aware of that might negatively impact

Stale reads or data loss: The high availability in DataStax Enterprise relies on replication to ensure data availability. However, the replication can introduce data consistency issues if not properly managed. If there is a delay in replication, a read operation on a replica node may return stale data or lose data if a failure happens before new data can be replicated

Major Retailer
4-Day Azure Outage with
A global retail leader that runs its product catalog on YugabyteDB powered through a major winter
multi-region replication, the retailer only faced a
impact on their
rebalanced to the other region in just three seconds. Meanwhile, their legacy databases were offline for days,
From born-in-the-cloud FinTech, Retail, and Telco companies, to large, entrenched leaders,
businesses across all industries face increased competition. To be successful, organizations are
searching for ways to increase profitability by reducing legacy costs and shifting valuable
Flexibility and operational efficiency are critical to this process, with many organizations
also investing in strategic initiatives to finally abandon legacy, “status quo” IT solutions (and
their high costs) in search of modern and often open-source solutions. At the same time, there
YugabyteDB and DataStax Enterprise both have several advantages over monolithic databases
like Oracle and DB2. However, DataStax Enterprise is still limited by many of the core challenges
and limitations of Apache Cassandra. Built by some of the original Facebook architects who built
Cassandra, YugabyteDB introduces a new architecture, one that builds on the lessons of
Cassandra and addresses many of its gaps, especially around overall efficiency.
When comparing the two databases, YugabyteDB stands out as delivering significant
advantages across a few key areas of efficiency, including scaling complexity, hardware costs,
YugabyteDB: 4 out of 5 DSE: 3 out of 5
can process more transactions in less time, handle large volumes of data, reduce costs, and
Fast scalability and node additions: YugabyteDB can scale applications within hours, even in
large clusters. The same operation can sometimes take days for large clusters and
quickly by copying already compressed data files from the leader of the corresponding shard,
versus the complex and time-consuming process of Cassandra. Clusters can also be easily
eliminates many manual tasks common with Cassandra and other legacy databases.
YugabyteDB automatically splits large partitions, saving days of manual effort for a DBA.
YugabyteDB, these operations only involve one round trip between the quorum
members
with different priorities. This guarantees a certain quality of service to the

DataStax Enterprise Challenges

For smaller environments or focused use cases, many of the mature and proven capabilities of DataStax Enterprise are a good fit, especially when the efficiency impact on resource utilization, hardware costs, and operational efficiencies are less important. However, many organizations have seen their applications explode in size and usage over the years, and are now focusing more on overall efficiency and exploring other options.

Repeated duplication of data: One of the common approaches within a Cassandra-based environment is to make a new copy of the data to avoid changes or impact on the first copy. For multi-region environments, DataStax Enterprise replicates the data completely in each region with multiple copies per region. If you have a replication factor of three, then three copies of the data will exist in each region, consuming lots of resources and impacting overall efficiency. If you want to introduce a new query, then usually the table is copied,
with 50% being the standard overhead recommended. In addition, as node density increases on DataStax Enterprise, performance is impacted more during garbage collection and compaction tasks. A major source of slowdown in DataStax Enterprise is background compactions. As organizations try to increase space efficiency, they face more and more performance issues.

DataStax Enterprise and can take several hours
additional operations regularly hurt performance.

Improvement in
process 3 million messages a second and improved app processing time by 10x. The Vehicle Data Factory (VDF) application ingests data from over 20M connected vehicles,
Cassandra, which provided great availability but failed to deliver the scalability and performance needed for a fast-growing application. The team performed extensive testing on YugabyteDB,
production within their desired timeframe and are positioned to support their future wave of vehicles, which continue to generate more data
“There can be inconsistency with read/write performance, especially when maintenance operations occur. We get far more timeouts than expected from the platform, and though the software handles the case, our performance metrics and SLA gets impacted accordingly.”
“We underestimated the operational overhead of setting up and managing a DSE cluster. There are a number of 3rd parties that offer cloud and on-premise managed services for DSE, and in
Companies need to build a trusted data environment with robust data security and privacy while
also governing data policies for ongoing compliance. To achieve this, it is essential that
customers choose databases that have uncompromising security, with core security features
built in from the start, making it easy and seamless to enable. Customers look to harden security,
achieve compliance, and mitigate risks before moving their applications and data to the cloud.
YugabyteDB and DataStax Enterprise both recognize the importance of security. Both have a
strong security focus that spans product development to certifications and ongoing security
Cassandra, while YugabyteDB has taken advantage of building a secure, distributed storage
Despite DataStax Enterprise being in the market for longer, we feel both solutions are focused
on security. There are some differences in what they deliver, so you’ll need to decide if certain
security aspects are more important to you than others. Because of the close ratings for both of
these, below we highlight some of the key features for each and recommend you do further
YugabyteDB: DSE:
Security was a key design principle for YugabyteDB, and the database offers an end-to-end
encryption, RBAC, authentication, authorization, audit logging, SSL/TLS, network security, and
more. These features help ensure that data is protected from unauthorized access, reducing the
risk of data breaches and other security threats. Applications that require high levels of security,
such as healthcare and financial applications, can benefit greatly from YugabyteDB's security
features.
product design approach is consistent with the SD3+C (Secure by Design, Secure by Default,
YugabyteDB Managed.
Native geo-partitioning: Modern policy controls limit user privileges and pin data to specific
KMS choice: YugabyteDB supports an expanded choice of Key Management Services (KMS)
for developers that include Google and Azure in addition to HashiCorp Vault and AWS KMS.
available for YSQL but not currently available for YCQL (Cassandra-compatible
API).
fix Common Vulnerabilities and Exposures (CVE) take no time to deploy. Operating

DataStax Enterprise Security Features

that data is protected from unauthorized access, reducing the risk of data

Certifications: DataStax supports a set of compliance, regulations, and certifications including PCI, SOC 2 Type 2, HIPAA, and GDPR

Granular security: Row and column-level security. DSE adds row-level access
Graph) database.

Compliance
Geo-Distribution
of personalized services
the ever-changing requirements from various countries on data
leveraged the native capabilities to easily meet various requirements. The native
region or country.
While the technical capabilities of a database and the business outcomes they can drive are
often analyzed first when researching a new database, in today’s economic environment, it’s
important to quickly analyze overall costs and prioritize solutions that can lead to meaningful
savings.
To successfully increase profitability, organizations need to find ways to lower costs, such as
reducing expensive legacy database licenses or removing less efficient solutions. Ultimately,
you should prioritize solutions that help you shift finite budgets for hardware, software, and
people to higher-impact, value-added initiatives. This means moving away from legacy
databases that have been in use for 10+ years, as newer solutions, like distributed SQL
databases, provide a number of cost savings—in terms of the license and required hardware
Here, we completed an in-depth analysis of the costs of both YugabyteDB and DataStax
Enterprise using a sample scenario that closely aligns with a real-world customer workload we
recently helped support. As a result of significant hardware savings and lower overall license
costs, YugabyteDB resulted in over 2x savings in licensing and infrastructure costs alone. Below
is a summary of the highlights of the savings that companies can achieve with YugabyteDB. You
YugabyteDB: DSE:
Organizations can achieve cost savings with YugabyteDB by leveraging its lower license costs
(free for the OSS version), high data density, built-in automation, and easy management
features. These features help companies avoid the high costs of proprietary software, expensive
In the detailed cost comparison outlined in Appendix A, the following areas were the primary
License cost savings: Software licensing costs for YugabyteDB Anywhere are far less than
legacy databases, with a list price over 80% less than DataStax Enterprise. As well as lower
pricing, YugabyteDB includes all core database features without additional fees. In fact, all core
database features are available for free in the open-source (OSS) version of YugabyteDB. The
capabilities, along with 24x7 support, to assist with enterprise deployments at scale.
delivering the necessary performance with high throughput and low latency.
DataStax Enterprise can handle around one to two TB per node (and sometimes
less), while YugabyteDB supports five to 10 (or more) TB per node for most use
cases. This means that YugabyteDB's data density greatly reduces your
hardware footprint, lowering both upfront hardware costs and operating costs
over time.
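To make the density argument concrete, here is a minimal sketch of how per-node density translates into node count. The 100 TB dataset size is a hypothetical figure chosen only for illustration, the per-node densities are the upper ends of the ranges quoted above, and the function name is illustrative. The sketch deliberately ignores replication and compaction overhead, so treat the results as a rough lower bound rather than a sizing recommendation.

```python
import math


def nodes_for_storage(total_tb: float, tb_per_node: float) -> int:
    """Return the minimum number of nodes needed to hold total_tb of data."""
    if total_tb <= 0 or tb_per_node <= 0:
        raise ValueError("sizes must be positive")
    return math.ceil(total_tb / tb_per_node)


# Hypothetical dataset size, used only for illustration.
DATASET_TB = 100

# Upper end of "one to two TB per node" quoted for DataStax Enterprise.
dse_nodes = nodes_for_storage(DATASET_TB, 2)
# Upper end of "five to 10 (or more) TB per node" quoted for YugabyteDB.
yb_nodes = nodes_for_storage(DATASET_TB, 10)

print(f"DSE nodes: {dse_nodes}, YugabyteDB nodes: {yb_nodes}")
```

Swapping in your own dataset size and measured per-node density gives a quick first estimate before running a fuller sizing exercise.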
outlined in Appendix A. YugabyteDB helps drastically reduce these costs thanks to much higher data density per server. Cost savings are also realized during hardware refresh cycles. Customers can use inexpensive commodity hardware without the need for specialized vendor hardware, an in-customer data-center, or the public cloud using any form factor (VMs, Containers/K8s, Bare-Metal)
have the opportunity to reduce database sprawl, consolidate their apps on fewer databases, and greatly reduce operational complexity.
As a result, organizations can preserve their top-line revenue, so the bottom line

Partner
Reduces TCO by 4x with YugabyteDB
organizations that saw their cloud costs rise to
DynamoDB database
business growth,
database modernization
YugabyteDB and as a
DataStax Enterprise delivers a number of key enhancements and features to Apache Cassandra; however,
those additional features come at a cost. In addition, some of the issues that result in higher costs for
Cassandra deployments, like low data density, mean DSE users can face higher costs due to the need for larger
Companies can achieve cost savings with DataStax Enterprise by leveraging its scalability, data center-native
architecture, lack of vendor lock-in, high availability, operational efficiency, and advanced analytics features.
However, some challenges with DataStax Enterprise increase costs for companies, including:
High infrastructure costs: The low data density per node results in a cluster size that can be 5x or larger than that required for supporting the same size workload with YugabyteDB.

High hardware refresh costs: Given the larger
costs to refresh and update the cluster are significant. Scaling times are also higher, resulting

High Operator-to-Developer ratio: The operator-to-developer ratio for DataStax Enterprise can be

Cost of eventual consistency: The eventual
is a possibility of inconsistent data in any replica. To address this, DataStax Enterprise uses Read Repair and Anti-entropy maintenance processes that slow down apps.

High Storage costs: Usable storage per node in Cassandra is about 50% because of compaction

High License Costs: The DataStax Enterprise list price is over 5x higher than YugabyteDB
YugabyteDB Anywhere and DataStax Enterprise both deliver additional features and automation
two offerings using the PRESS framework, we’ve focused on how the modern, distributed SQL
productivity, resiliency, efficiency, security, and savings. Some of the well-known challenges
Our Total Cost of Ownership analysis showed that a DataStax Enterprise environment is 2.2
times more expensive than YugabyteDB Anywhere for the same workload under consideration.
Cost savings are a direct result of lower license costs and huge savings in hardware costs, both
upfront and ongoing maintenance, thanks to the much higher data density per node possible
with YugabyteDB.
For organizations considering DataStax Enterprise, we have found that in most cases
YugabyteDB Anywhere can actually offer a better Cassandra experience with strong
consistency and better TCO than achievable with DataStax Enterprise due to the challenges it
Our PRESS analysis provides key areas for you to consider when evaluating your next database
modernization initiative. Beyond the specific advantages over DataStax Enterprise covered here,
it’s also worth researching the broader capabilities and benefits that a distributed SQL database
like YugabyteDB can provide.
We invite you to learn more about YugabyteDB and consider whether a self-managed
DBaaS offering like YugabyteDB Anywhere aligns more closely with your needs. You can get
started by signing up for a free trial or requesting a demo to learn more.
Infrastructure costs
(YBA) is configured to be spread across 3 clusters. Based on the model scenario we are
using, the nodes for YBA need a total of 144 cores and 576 GB of memory. YBA has a low
compaction overhead of 20%, and we’ll assume an optimal data density of 8 TB per node. We
have allocated 1 DB Admin FTE for managing the DB infrastructure and DB operations.
Clusters)*Clusters
YBA Total disk needed = (Raw Data * (1 + Compaction overhead) * RF) * Number of Datacenters
For the architecture under consideration with a 20 TB database, the DataStax Enterprise (DSE)
deployment spans across two clusters in two data centers. The nodes need 640 cores and
2,560 GB of memory. DSE has a high compaction overhead of 100% and a low data density. For
this calculation, we use 3 TB per node. We have allocated 2 DB Admin FTEs for managing the DB
infrastructure and DB operations, as there are significantly more nodes, disk storage,
Clusters)*Clusters
DSE Total disk needed = (Raw Data * (1 + Compaction overhead) * RF) * Number of Datacenters
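The disk formula used for both databases can be sketched in a few lines of Python. The raw data size (20 TB), the compaction overheads (20% and 100%), the per-node densities (8 TB and 3 TB), and the two DSE data centers come from the text. The replication factor of 3 and the single-datacenter value for YBA are assumptions made only so the example runs, and the function names are illustrative. The node counts consider storage only and ignore the core and memory requirements, so they are not a substitute for the full sizing in this appendix.

```python
import math


def total_disk_tb(raw_data_tb, compaction_overhead, rf, datacenters):
    """Total disk = raw data * (1 + compaction overhead) * RF * datacenters."""
    return raw_data_tb * (1 + compaction_overhead) * rf * datacenters


def storage_nodes(total_tb, tb_per_node):
    """Minimum number of nodes needed to hold total_tb at the given density."""
    # Round first so floating-point noise cannot push ceil() up by one node.
    return math.ceil(round(total_tb / tb_per_node, 9))


RAW_DATA_TB = 20             # from the text
ASSUMED_RF = 3               # assumption: RF is not stated in the text
ASSUMED_YBA_DATACENTERS = 1  # assumption: not stated in the text
DSE_DATACENTERS = 2          # from the text

yba_disk = total_disk_tb(RAW_DATA_TB, 0.20, ASSUMED_RF, ASSUMED_YBA_DATACENTERS)
dse_disk = total_disk_tb(RAW_DATA_TB, 1.00, ASSUMED_RF, DSE_DATACENTERS)

print(f"YBA: {yba_disk:.0f} TB disk, {storage_nodes(yba_disk, 8)} storage nodes")
print(f"DSE: {dse_disk:.0f} TB disk, {storage_nodes(dse_disk, 3)} storage nodes")
```

Changing the assumed replication factor or datacenter counts scales both totals linearly, which makes it easy to test how sensitive the comparison is to those inputs.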
The license cost data for YugabyteDB Anywhere and DataStax Enterprise includes software
license costs per core/node for 1 year or 3 years. Non-production license costs are included in
all options, providing a basis for comparing the direct license costs of both databases.
License Costs (DIRECT) YBA - 1 Year YBA - 3 Years DSE - 1 Year DSE - 3 Year
The infrastructure cost data includes a breakdown of direct costs for both YugabyteDB
Anywhere and DataStax Enterprise over 1 year and 3 years. Costs include compute costs,
DBA costs, storage, snapshot space, backup storage costs, AWS data transfer costs, and
KMS costs.
The total infrastructure costs provide a basis for comparing the overall infrastructure costs of
both databases based on the workload and infrastructure requirements under consideration.
Infrastructure Costs (DIRECT)    YBA - 1 Year    YBA - 3 Years    DSE - 1 Year    DSE - 3 Years
Compute - Normalized Average duration
16,000.00
16,000.00
16,000.00
16,000.00
volume
volume
volume
volume
(TB)
monthly    $26,595.36    $26,595.36    $79,786.08    $79,786.08
annual    $319,144.32    $957,432.96    $957,432.96    $2,872,298.88
AWS data transfer (TB/month)    20 T    20 T    20 T    20 T
KMS - no of symmetric requests    2000000    2000000    2000000    2000000