Configuring EMC Data Domain Boost With Veeam Availability Suite v8 2014
Configuring EMC Data Domain Boost With Veeam Availability Suite v8 2014
Data Domain
Boost with Veeam
Availability Suite v8
Rick Vanover
VMware vExpert, MCP, MCITP, MCSA
and Veeam Product Strategy Specialist
Introduction
Support for EMC Data Domain Boost is included for backups leveraging
Veeam Availability Suite v8. This tech note will walk through the
configuration points associated with adding an EMC Data Domain target for
backup jobs. This document is written from an early internal preview and is
subject to change upon the final release of Veeam Availability Suite v8.
Figure 1: EMC Data Domain provides backup and archive storage for many use cases.
Figure 2: A DD Boost Storage Unit is the first step in creating a Veeam repository.
In the wizard, a user is selected for the DD Boost Storage Unit, which will be
used in the Veeam repository creation wizard. In this example, the DD Boost
Storage Unit is called v8-ddboost-sample and the user is sysadmin.
Figure 3: The backup repository wizard will start to leverage EMC Data Domain Boost.
Figure 4: The backup repository wizard will start to leverage EMC Data Domain Boost.
When the actual Data Domain system is selected in this step, a few important
options are presented:
System name
Fibre Channel connectivity
Gateway server
The system name is the DNS name or TCP/IP address of the Data Domain
server that will house the DD Boost Storage Unit connected in the previous
steps. The communication that leverages EMC Data Domain Boost can go over
the LAN or SAN as indicated earlier. If Fibre Channel connectivity is to be used,
select the option to leverage the SAN. This requires that the SAN be correctly
zoned for the Veeam gateway server to access the storage targets.
A critical decision in this wizard is the selection of a gateway server. The
gateway server in all situations should be placed from a network perspective as
close as possible (from a latency perspective) to the actual Data Domain server.
This is because the gateway server will send the traffic to the Data Domain
server and communicate with the DD Boost Storage Unit. Backup (or Backup
Copy job) data flow may originate in other sites or networks, but the gateway
server role for the Data Domain server should be very close to the appliance.
Additionally the credentials set in the configuration phase of the DD Boost
Storage Unit are specified here in the backup repository wizard (Figure 5).
Figure 5: The Data Domain system name and credentials are applied in this step.
If you need the Data Domain server name on a Fibre Channel network, it is in
the Data Management | DD Boost | Fibre Channel section of the administrative
interface. This section is shown below (Figure 6).
Figure 6: The Fibre Channel name is defined in the Data Domain server administrative interface.
Once the Data Domain server, credentials and gateway server are selected, the
DD Boost Storage Unit is selected while creating the backup repository. Note
how the path to the repository will call the EMC Data Domain booth through
the libraries that are present via the integration between Veeam Availability
Suite and the Data Domain server in the figure below (Figure 7).
After completing the remaining steps of the backup repository (which are not
specific to DD Boost), the new repository will be ready for use in a backup job.
Figure 8: Synthetic full backups are created via the Advanced Settings button of a backup job.
With EMC Data Domain Boost on synthetic full backups, there are benefits to
be seen in the time it takes to create these full backups. In the preview edition,
the following performance benefits were seen on a test VM. The backup job
on the left (with DD Boost) took 11 minutes and 30 seconds for a synthetic
full backup. The backup job on the right (without DD Boost) took 1 hour
27 minutes and 4 seconds to synthesize the same VM to a full backup. The
highlighted sections of the user interface are shown below (Figure 9).
Figure 9: The synthetic full backups are faster when Data Domain Boost is enabled for a repository.
Figure 10: The GFS transformations are quicker when EMC Data Domain Boost is leveraged.
The first of which applies to VMs that may have multiple backup jobs and will
land on the same DD Boost Storage Unit. The first time that VM is backed up
with the first of the two (or more) jobs, it will function like a full backup as
expected. The next time that VM is backed up with the second job, it will have
to perform a full backup as it is the first time the Veeam job is called. Because
the first job has the bulk of the blocks of the vSphere or Hyper-V VM on the
DD Boost Storage Unit, it will only need to transfer metadata and any possible
changed blocks. This can be a significant improvement on the active full
backup process when there is a fast source storage resource in place.
Additionally when using EMC Data Domain Boost, there is Advanced Load
Balancing and Link Failover for connectivity to the Data Domain server. When
multiple network interfaces are in use, the Data Domain server will distribute
connections from Veeam components over these links. This provides improved
data transfer performance with Veeam jobs operating in parallel and additional
resiliency by transparently switching failed links to active links if there is an
interruption in connectivity.
Configuration Summary
Configuring EMC Data Domain Boost with Veeam Availability Suite v8 is easy.
This document provides a tour of the configuration steps as well as where
to find performance benefits. Additionally, Veeam and EMC customers can
leverage these additional resources for more material:
Veeam Forums http://forums.veeam.com
Veeam social media @Veeam
EMC PowerLink https://powerlink.emc.com/
10
11
12