The Guide To Choosing Backup Storage
The Guide To Choosing Backup Storage
In primary storage, you simply buy the usable terabytes or petabytes that you need, including some growth,
however unlike primary storage, backup storage is unique with:
Backup storage is much more complex. While backup applications are usually top of mind, backup storage
can often be overlooked and leads to poor performance, and overpaying for storage in the backup
environment. In addition, the importance of utilizing backup storage as a key part of the recovery process
especially after a ransomware attack. Being prepared for a recovery from a site disaster is also a big part of the
equation.
4 Secure against attacks and ready to recover from an attack such as ransomware
4 Sized correctly so that if you plan on 3 years of growth, you are not re-buying in 9 months
Exceptional support that is knowledgeable, available and knows both backup applications
4 and backup storage
Considerations:
y Not maximized for backup performance (files system, advanced backup protocols, job
concurrency, etc.). This results in longer backup windows, that grow over time.
y Restores will be as fast as disk as long as the data is not deduplicated by the backup
application. If deduplicated by the backup application, then the deduplicated blocks must be
put back together or rehydrated which slows restores down.
y The storage is typically not scale-out, so the backup window grows as data grows, so you are
chasing an ever-expanding backup window.
y Becomes very expensive with longer term retention of over 6 copies. The storage is often
under-sized as the vendors don’t know how to size for backup including backup rotation,
data growth, retention policy, cross replication, etc. Always compare apples to apples for
sizing as storage is often under-sized to look less expensive. (You could be buying more
storage in 6 to 9 months).
y The storage is network facing and highly vulnerable to security attacks.
y The storage is often discontinued, forcing you to buy something else. (Product obsolescence)
y Cost over time is much larger than anticipated, as upfront sizing is typically inadequate.
y Managing silos of primary storage takes IT staff time.
y The support organizations do not understand backup storage, resulting in finger pointing
between the backup vendor and the storage provider.
Considerations:
y Not maximized for backup performance (files system, advanced backup protocols, job
concurrency, etc.). This results in longer backup windows.
y Low spindle count with bottleneck single controllers.
y Restores will be slower than primary storage disk if the data is not deduplicated by the
backup application due to spindle and controller bottlenecks. If deduplicated by the backup
application, then the deduplicated blocks must be put back together or rehydrated which
slows restores down.
y The storage is not scale-out, so backup window grows as data grows so you are chasing an
ever-expanding backup window.
y Because it is low cost, it does help the economics but still becomes more expensive at a
certain retention point. Storage is often under-sized as the vendors don’t know how to size
for backup including backup rotation, data growth, retention policy, cross replication, etc.
Always compare apples to apples for sizing as often under-sized systems look less expensive.
(You could be buying more storage in 6 to 9 months).
y The storage is network facing and highly vulnerable to security attacks.
y The storage is often discontinued, forcing you to buy something else. (Product obsolescence)
y Cost over time is much larger than anticipated as upfront sizing is typically inadequate.
y Managing silos of primary storage takes IT staff time.
y The less you pay, the lower the level of support that is provided. The support organizations do
not understand backup storage, resulting in finger pointing between the backup vendor and
the storage provider.
Considerations:
y Not maximized for backup performance (files system, advanced backup protocols, job
concurrency, etc.) This results in longer backup window. This sounds counter intuitive for
SSD so test before you buy, and you will see that file systems need to be optimized for large
backup files.
y Restores will be faster than HDD disk as long as the data is not deduplicated by the backup
application. If deduplicated by the backup application, then the deduplicated blocks must
be put back together or rehydrated which slows restores down but will still be faster than
HDD disk.
y These systems are typically scale-out, however compute resources must grow at the same
rate the data grows, or the backup window grows as data grows so you are chasing an
ever-expanding backup window.
y Extremely expensive with longer term retention as SSD is typically 5X the price of HDD. SSD is
extremely expensive for backup storage. Often under-sized as the vendors don’t know how to
size for backup including backup rotation, data growth, retention policy, cross replication, etc.
Always compare apples to apples for sizing as often under-sized systems look less expensive.
(You could be buying more storage in 6 to 9 months). SSD may look closer to the price of HDD
but you might not be buying the required capacity.
y The storage is network facing and highly vulnerable to security attacks. Need to buy
additional storage and tools which further drives up cost.
y Cost over time is much larger than anticipated as upfront sizing is typically inadequate.
y The support organizations do not understand backup resulting in finger pointing between
the backup vendor and the storage provider.
Considerations:
y Has all the challenges of low-cost disk servers or SSD described early in this document.
y In addition, ReFS and XFS have a further performance impact, so the performance is less than
the native disk file system.
y Does not scale well as ReFS was designed for smaller IT shops.
y Take a lot of management time, especially for Linux XFS.
y Good for small IT shops with 30TB or less, may not be good for IT shops with 30TB to 50TB,
and is not a viable solution for the upper mid-market, small enterprise, enterprise,
and large enterprise.
Considerations:
y Slow performance as inline deduplication is compute intensive. Even with deduplication
software that runs on the media server the performance is still inadequate. In addition, they
don’t all do job concurrency for parallel backup jobs. Inline deduplication appliances save
storage costs but are the slowest approach for ingest performance.
y Restores are painfully slow, as the data has to be put back together or rehydrated for each
request. VM boots can take an hour versus a few minutes.
y These solutions are not scale-out so backup window grows as data grows so you are chasing
an ever-expanding backup window. This results in expensive and disruptive forklift upgrades.
y The backup storage is network facing and highly vulnerable to security attacks. You have to
add additional storage and software to increase security which virtually doubles the cost.
y The storage is often discontinued, forcing you to buy something else. (product obsolescence)
y Cost over time is much larger than anticipated as upfront sizing is typically inadequate.
Often vendors undersize to lower their entry price.
y The support organizations do not understand backup storage, resulting in finger pointing
between the backup vendor and the storage provider.
Considerations:
y Backup performance is slow because they do inline data deduplication, compression,
and encryption all in the software which slows backups down.
y Restores are slow because they have to rehydrate the deduplicated data blocks for
each request.
y They are scale-out, however; it is very expensive because they use erasure coding to achieve
redundancy which adds a lot more cost.
y Becomes very expensive with longer term retention because the deduplication ratios are
between 3:1 to 5:1 so they use a lot of disk storage. They are often under-sized as the vendors
don’t know how to size for backup including backup rotation, data growth, retention policy,
cross replication, etc. In addition, there is no easy way to determine the monthly cloud
costs for retention storage. Always compare apples to apples for sizing as often under-sized
systems look less expensive. However, you will most likely be buying more storage in
6 to 9 months.
y They tout immutable data objects for ransomware recovery however that assumes there
is no way to get to the immutable data objects. They do not have a non-network-facing tier
(tiered air gap).
y Often, they have you buy the hardware from a separate vendor.
y The support organizations do not understand backup storage, resulting in finger pointing
between the backup vendor and the storage provider.
Considerations:
y Unique front-end Landing Zone. Backups write direct to disk with no inline deduplication to
slow backups down. Performance is fast due to:
- No inline deduplication
- A file system optimized for large backup jobs
- Use of advanced protocols for performance such as the Veeam Data Mover or Veritas NetBackup
OST, etc.
- Use of job concurrency for parallel backup jobs
- Use of integrated backup application functionality for front-end job load balancing
(Veeam SOBR, NetBackup Single Storage Disk Pool, Commvault Spill & Fill, Oracle RMAN Channels,
HYCU Scale-out, etc.)
- Data at rest encryption at the drive level (takes nanoseconds)
- Typically, 2 times faster than SSD and 3 to 5 times faster than inline deduplication appliances.
y Restores will be as fast as disk, as the data is in the Landing Zone in the native backup
application format ready to restore.
y Scale-out where compute is added, with capacity, the system scales linearly keeping the
backup window fixed-length as data grows.
y Data is deduplicated from the Landing Zone into a non-network-facing repository tier where
a copy of the most recent data in the Landing Zone and all retention data is stored in a highly
deduplicated format to greatly lower the storage requirement and resulting cost. ExaGrid
allows Veeam and Commvault to keep their native data deduplication turned “on” and
ExaGrid will further deduplicate that data.
y All the data is stored in a deduplicated format in the non-network-facing repository tier that
threat actors cannot see or access. The system employs a delayed delete policy to ensure data
deleted in the Landing Zone is not immediately deleted in the repository. In addition, the
system can survive encrypted data ingest because all the previous repository data objects
are immutable, which means they are not changed, deleted, or modified, protecting the most
recent backup and all retention points.
y No planned product obsolescence as ExaGrid allows any age or size appliance to be mixed or
matched in the same system. So even if a model is no longer available it can still be used with
current models.
y Sizing for backup is extremely complicated because unlike primary storage there are a dozen
impact items such as rotation, retention, data types, etc. ExaGrid has extensive sizing tools
and prides itself on and is committed to not undersizing.
y ExaGrid’s system is designed for ease of use. Customer’s constantly say they spend the least
time on ExaGrid.
y ExaGrid assigns a level 2 support engineer to account so that the IT person works with the
same senior support engineer all the time. You don’t get the run around and you don’t have
to repeat yourself over and over. ExaGrid’s level 2 support engineers are experts in backup
applications, networking, and ExaGrid’s Tiered Backup Storage.
1. Take the time to truly listen to all the ins and outs of all the solutions.
2. That you talk to at least 3 to 5 reference accounts who are using the solution, ensure that
it is just you and the customer, and the vendor is not on the call. Remember, everyone has
customer success stories, talk live to customers.
3. Test, test, test. There is a lot of marketing hype out there and a lot of salespeople telling
interesting stories. Nothing beats testing side by side and seeing ingest performance, restore
performance, and scalability for yourself. Also, do a mock security attack and see which
system is still standing.
4. Always, double, and triple check the sizing. There are thousands of stories where customers
bought 3 years of data growth up front and in 9 months, they hit the wall and have to buy
additional capacity and now the long-term cost was much higher than expected or imagined.
• = adequate
ExaGrid reserves the right to change specifications or other product information without notice. ExaGrid and the
ExaGrid logo are trademarks of ExaGrid Systems, Inc. All other trademarks are the property of their respective holders.
©2022 ExaGrid Systems, Inc. All rights reserved.