DPA Data Domain WP PDF
Abstract
EMC® Data Protection Advisor (DPA) provides a comprehensive set of features to analyze data protection
operations to ensure that your data is protected and recoverable. Analyzing the backup applications, supporting
infrastructure, and target storage, DPA can capture issues so that they can be addressed before a failure. This
white paper outlines how Data Protection Advisor operates in conjunction with EMC Data Domain®
deduplication storage systems.
June 2010
Copyright © 2010 EMC Corporation. All rights reserved.
EMC believes the information in this publication is accurate as of its publication date. The information is
subject to change without notice.
THE INFORMATION IN THIS PUBLICATION IS PROVIDED "AS IS." EMC CORPORATION
MAKES NO REPRESENTATIONS OR WARRANTIES OF ANY KIND WITH RESPECT TO THE
INFORMATION IN THIS PUBLICATION, AND SPECIFICALLY DISCLAIMS IMPLIED
WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.
Use, copying, and distribution of any EMC software described in this publication requires an applicable
software license.
For the most up-to-date listing of EMC product names, see EMC Corporation Trademarks on EMC.com.
All other trademarks used herein are the property of their respective owners.
Part Number h7115.1
Using EMC Data Protection Advisor with EMC Data Domain Deduplication Storage Systems
Applied Technology 2
Executive summary
EMC® Data Protection Advisor (DPA) collects, monitors, analyzes, and reports on information from our
customer’s entire data protection infrastructure, providing a unified data protection management window
across their entire backup and EMC replication investment, accelerating access to information, saving time
and money, allowing faster decisions, and improving data protection.
Through support for heterogeneous backup infrastructures including EMC Backup Solutions and support
for EMC Replication Solutions, DPA reduces the cost and risk to manage a data protection environment,
enabling our customers to get more from their existing investments and increase efficiency.
The DPA graphical user interface (GUI) can present this information in a familiar manner by arranging
assets in common sense groups or views such as business unit or geography, greatly enhancing readability
and giving the user the ability to perform advanced reporting, troubleshooting, performance management,
and capacity planning operations.
EMC Data Domain® support in DPA gathers information about the configuration, status, and performance
of Data Domain components. Data Protection Advisor uses SNMP to gather this data from the Data
Domain management information base (MIB).
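As an illustration of how values polled over SNMP become utilization figures, the following minimal sketch converts two samples of an interface octet counter into a link utilization percentage. It is not DPA's implementation. It assumes a 32-bit counter in the style of the standard IF-MIB ifInOctets object rather than any object from the Data Domain MIB, and the function names and sample values are hypothetical.

```python
COUNTER32_MAX = 2**32  # a 32-bit SNMP counter wraps back to zero at 2^32


def counter_delta(previous: int, current: int, modulus: int = COUNTER32_MAX) -> int:
    """Return the increase between two counter samples, allowing for one wrap."""
    return (current - previous) % modulus


def link_utilization(prev_octets: int, curr_octets: int,
                     interval_s: float, speed_bps: int) -> float:
    """Return the fraction of link capacity used between two polls."""
    if interval_s <= 0 or speed_bps <= 0:
        raise ValueError("interval and speed must be positive")
    bits = counter_delta(prev_octets, curr_octets) * 8  # octets to bits
    return bits / (interval_s * speed_bps)


# Hypothetical samples: two polls taken 60 seconds apart on a 1 Gb/s interface.
utilization = link_utilization(1_000_000, 751_000_000, 60.0, 1_000_000_000)
print(f"{utilization:.1%}")  # prints 10.0%
```

Taking the difference modulo the counter size keeps the result correct when the counter wraps once between polls, which is why the polling interval needs to be short relative to the link speed.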
Introduction
DPA monitors a very wide range of components/assets and provides a comprehensive range of reports for
each. This white paper seeks to describe DPA and its interaction with Data Domain deduplication storage
systems. Because DPA supports a wide range of products and technology, it often uses general
terminology. Therefore, in this document and within the DPA console, Data Domain Global Compression
technology is referred to as "deduplication."
Audience
This white paper is intended for use by backup administrators and operations managers to understand the
benefits of using EMC Data Protection Advisor in conjunction with EMC Data Domain deduplication
storage systems. Because a Data Domain system is the target storage device that backup systems use to store
backup copies of critical data, it is helpful to understand its performance, capacity, and availability. DPA maintains a
historical record of the system's operation, providing consistent monitoring, alerting, and reporting. Data
Domain support in conjunction with data from the backup environment enables a comprehensive view of
the operations and health of the backup systems.
high-level overview of health status and then let you drill down into areas of interest. The operational
simplicity of the dashboards helps to reduce administrative costs.
The multiple layers of a Data Domain user’s environment and the resource utilization within it can be
difficult to visualize.
The backup application, operating system, virtual devices, physical devices, and underlying SAN / SCSI
can have different architectures, and the utilization of these layers and their interrelationships can
be difficult to reconcile.
DPA allows the user to create overviews/control panels that are essentially reports with multiple constituent
report windows embedded within them. These have a very useful application in Data Domain environments
as one could, for instance, graphically show on one screen the utilization of:
The backup application (for instance, IBM Tivoli Storage Manager)
TSM management processes and their relationships with storage pools
TSM storage pools from the TSM perspective
The underlying disk volumes from the OS perspective
Data Domain system utilization
The ability to view Data Domain system activity in the same window as TSM server-side processes is key.
The configuration tree in Figure 1 illustrates the wealth of assets that DPA supports.
Figure 1. DPA supports many assets
Benefit
DPA provides the user with a common interface for the wide range of Data Domain system configurations
in numerous client environments.
Data Domain, VMware, and DPA
The benefits of DPA in complex environments are even more evident in VMware environments. Due to the
relative ease with which VMware environments can be expanded, "virtual machine (VM) sprawl" creates
increased pressure on the backup environment. By increasing the number of systems sharing a set of
physical resources, backups can be constrained by the backup of other VMs on the same host. Due to the
redundant nature of VMware backups, these environments are ideal for deduplication technology, usually
achieving deduplication ratios above those of conventionally hosted systems.
DPA provides excellent insight into VMware and Data Domain environments, allowing the user to identify
bottlenecks, load balance, and view the environment from differing perspectives in one screen.
By viewing Backup Application Schedules along with Resource Utilization reports of multiple VMs and
the ESX host, a user can establish if the backup workload is having an undue effect on the physical host. If
the server is overloaded from the backup workload, backup schedules can be adjusted accordingly to
balance the environment.
If the user is sending disk image backups to a Data Domain system, DPA can be used to ease the process of
scheduling the required VM shutdowns. DPA can also monitor the synchronization effects of the VM
shutdown scripts and backup activity, keeping downtime or performance impact to a minimum.
Sample reports
The following sections outline some typical reports with a description of the data presented and possible
interpretations.
Resource utilization
DPA provides reports that show the utilization of CPU, memory, and network activity. The Resource
Utilization control panel presents this information in a single report.
In the screenshot below, memory utilization is high (>80%) while processor utilization is low. Given the
CPU-centric architecture of Data Domain systems, this imbalance could be a cause for concern and might be
worth investigating.
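The kind of check described above can be sketched in a few lines. This is a hedged illustration only, not a DPA analysis definition: the 80% memory threshold comes from the text, while the 20% CPU threshold, the ResourceSample type, and the sample values are assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass
class ResourceSample:
    """One utilization reading, expressed in percent (hypothetical structure)."""
    cpu_pct: float
    memory_pct: float
    network_pct: float


def imbalance_warnings(sample: ResourceSample,
                       memory_high: float = 80.0,
                       cpu_low: float = 20.0) -> list:
    """Return warnings when memory is high while the CPU is nearly idle."""
    warnings = []
    if sample.memory_pct > memory_high and sample.cpu_pct < cpu_low:
        warnings.append(
            f"memory at {sample.memory_pct:.0f}% while CPU is only at "
            f"{sample.cpu_pct:.0f}%: worth investigating"
        )
    return warnings


# Hypothetical reading resembling the situation described in the text.
for message in imbalance_warnings(ResourceSample(cpu_pct=5, memory_pct=85, network_pct=10)):
    print(message)
```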
Figure 2. Resource Utilization Overview report
Note: Filesystem utilization is not seen in this report but can be viewed from the perspective of the attached host.
Benefit
DPA allows the user to view the vital signs of the Data Domain system in one window, allowing insight
into any imbalance, such as the imbalance between the CPU, network, and memory utilization shown in
Figure 2.
Deduplication ratio
The Deduplication Ratio report plots the dedupe ratio that the Data Domain system is achieving over time.
This shows the storage savings gained from using Data Domain deduplication storage systems versus
traditional backup storage like tape or standard disk.
In a normal environment, one would expect the dedupe ratio to improve over time and eventually plateau.
If the deduplication ratio has declined, there may be reason to investigate any changes in the backup data
types over the time period and revise provisioning and capacity planning estimates.
In addition, the deduplication ratio can be viewed and compared across the enterprise to identify lower-
performing configurations, systems, or locations.
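To make the relationship between the deduplication ratio and storage savings concrete, the sketch below computes the ratio as data written by the backup application divided by data physically stored, derives the space saved, and flags points where the ratio falls noticeably below its earlier peak. The function names, the 5% tolerance, and the sample history are assumptions for illustration and do not describe how DPA builds its report.

```python
def dedupe_ratio(logical_bytes: float, stored_bytes: float) -> float:
    """Ratio of data sent by the backup application to data physically stored."""
    if stored_bytes <= 0:
        raise ValueError("stored_bytes must be positive")
    return logical_bytes / stored_bytes


def space_saved(ratio: float) -> float:
    """Fraction of capacity saved at a given ratio (10:1 saves 90%)."""
    return 1 - 1 / ratio


def declines(ratios: list, tolerance: float = 0.05) -> list:
    """Return indices where the ratio is more than `tolerance` below the prior peak."""
    flagged = []
    peak = float("-inf")
    for index, ratio in enumerate(ratios):
        if ratio < peak * (1 - tolerance):
            flagged.append(index)
        peak = max(peak, ratio)
    return flagged


# Hypothetical weekly ratios: rising, reaching a plateau, then dropping.
history = [4.0, 8.0, 11.0, 12.0, 12.1, 10.0]
print(declines(history))                      # prints [5]
print(f"{space_saved(dedupe_ratio(10, 1)):.0%}")  # prints 90%
```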
Benefit
DPA allows users to track deduplication ratios of a Data Domain system over time.
Network interface utilization
The Network Interface Utilization Summary control panel provides insight into the performance of network
interfaces on the Data Domain system. This control panel shows the maximum and average utilization over
a reporting period, along with the utilization on a per-interface basis.
A well-balanced system/environment will likely have a similar utilization across all components. Figure 4
shows that two of the six available network ports on this Data Domain system are bearing the majority of
the load, which might be worth investigating. This imbalance could have a variety of causes. For example,
backups may be occurring from different systems on different networks at different times. Other potential
causes include hardware failures, an unbalanced workload, an offline backup host, other network activity
impacting backups, and backups configured for only two networks.
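One simple way to quantify such an imbalance is to compare each port's share of the total load with an even split. The following sketch is an assumption-laden illustration, not the logic behind the DPA control panel: the port names, the utilization figures, and the rule of flagging ports that carry more than twice their even share are all hypothetical.

```python
def load_shares(utilization_by_port: dict) -> dict:
    """Return each port's fraction of the total observed load."""
    total = sum(utilization_by_port.values())
    if total == 0:
        return {port: 0.0 for port in utilization_by_port}
    return {port: value / total for port, value in utilization_by_port.items()}


def dominant_ports(utilization_by_port: dict, factor: float = 2.0) -> list:
    """Return ports carrying more than `factor` times an even share of the load."""
    if not utilization_by_port:
        return []
    even_share = 1 / len(utilization_by_port)
    shares = load_shares(utilization_by_port)
    return sorted(port for port, share in shares.items() if share > factor * even_share)


# Hypothetical average utilization for six ports over a reporting period.
ports = {"eth0": 45, "eth1": 40, "eth2": 5, "eth3": 4, "eth4": 3, "eth5": 3}
print(dominant_ports(ports))  # prints ['eth0', 'eth1']
```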
Although we could spend lots of time analyzing an unbalanced system, it may not be a serious problem.
The intent is that DPA provides a starting point to identify possible problems. The "Processor utilization
and status" section next provides more information.
Benefit
DPA allows the user to identify potential load-balancing and functional issues across network interfaces.
RTO and RPO
Data Domain systems allow great gains in recovery times by eliminating the wait times required for backup
media availability, particularly the time required to mount physical tape. DPA can accurately calculate and
report on the Recovery Time Objective (RTO) and Recovery Point Objective (RPO) based on historical
data, estimated mount times, and data transfer rates. By setting the Off-Line Data Overhead field (also
known as tape recall) to zero, we can account for the restore time alone. These accurate predictions are
important in ensuring SLAs can be met and that the benefits of deploying a Data Domain system in the
environment are evident to the end customer.
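The arithmetic behind such an estimate can be sketched as a fixed offline overhead plus the transfer time. This is a minimal illustration and not DPA's actual calculation: the function names, the 500 GB data size, the 100 MB/s transfer rate, the 10-minute tape recall overhead, and the 90-minute objective are all assumptions.

```python
def estimated_restore_seconds(data_bytes: float,
                              transfer_bytes_per_s: float,
                              offline_overhead_s: float = 0.0) -> float:
    """Estimate restore time as offline overhead plus data transfer time."""
    if transfer_bytes_per_s <= 0:
        raise ValueError("transfer rate must be positive")
    return offline_overhead_s + data_bytes / transfer_bytes_per_s


def meets_rto(estimate_s: float, rto_s: float) -> bool:
    """Return True when the estimated restore time fits within the objective."""
    return estimate_s <= rto_s


data = 500e9          # hypothetical 500 GB restore
rate = 100e6          # hypothetical 100 MB/s transfer rate
rto = 90 * 60         # hypothetical 90-minute recovery time objective

disk = estimated_restore_seconds(data, rate)                          # overhead set to zero
tape = estimated_restore_seconds(data, rate, offline_overhead_s=600)  # 10-minute recall

print(disk, meets_rto(disk, rto))  # prints 5000.0 True
print(tape, meets_rto(tape, rto))  # prints 5600.0 False
```

With identical transfer rates, the only difference between the two estimates is the offline overhead, which is the term that setting the field to zero removes.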
Cleaning
Data Domain systems require cleaning or "garbage collection" operations to be run regularly, allowing
space reclamation to help maintain optimum performance. DPA can be used to ensure that the cleaning
processes run within periods of low backup activity.
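Choosing a cleaning window from historical activity can be sketched as a search for the quietest contiguous block of hours. The example below is an illustrative assumption rather than a DPA feature: the hourly activity figures are invented, the function name is hypothetical, and the window is allowed to wrap past midnight.

```python
def quietest_window(hourly_activity: list, window_hours: int) -> tuple:
    """Return (start_hour, total_activity) of the least busy contiguous window.

    The schedule is treated as cyclic, so a window may wrap past the last hour.
    """
    hours = len(hourly_activity)
    if not 0 < window_hours <= hours:
        raise ValueError("window_hours must be between 1 and the number of hours")
    best_start, best_total = 0, None
    for start in range(hours):
        total = sum(hourly_activity[(start + offset) % hours]
                    for offset in range(window_hours))
        if best_total is None or total < best_total:
            best_start, best_total = start, total
    return best_start, best_total


# Hypothetical backup activity per hour of the day (index 0 is midnight).
activity = [90, 95, 90, 80, 70, 50, 20, 10, 5, 5, 5, 5,
            10, 10, 10, 15, 20, 30, 50, 70, 85, 90, 95, 95]
print(quietest_window(activity, 4))  # prints (8, 20)
```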
Status reports
Status reports show the state of the Data Domain system components at a glance, allowing adverse
conditions to be readily identified and rapidly addressed. Analyses can be configured and assigned to alert
the user to degradations in the environment that may have adverse effects.
Disk status
The Disk Status report provides detailed information about the underlying disks, allowing the user to
readily identify disks with high operating temperatures, error counts, and other issues. This helps to
anticipate failures and to show when they degrade availability.
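The filtering that such a report supports can be sketched as a threshold check over per-disk readings. The field names, disk names, and thresholds below are assumptions chosen for the example; they are not values taken from DPA or from Data Domain documentation.

```python
def disks_needing_attention(disks: list,
                            max_temp_c: float = 50.0,
                            max_errors: int = 0) -> list:
    """Return names of disks that exceed the temperature or error thresholds."""
    return [disk["name"] for disk in disks
            if disk["temp_c"] > max_temp_c or disk["errors"] > max_errors]


# Hypothetical readings for three disks.
readings = [
    {"name": "disk1", "temp_c": 38.0, "errors": 0},
    {"name": "disk2", "temp_c": 55.0, "errors": 0},   # running hot
    {"name": "disk3", "temp_c": 40.0, "errors": 3},   # reporting errors
]
print(disks_needing_attention(readings))  # prints ['disk2', 'disk3']
```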
The Disk Status report and other reports are shown in the following pages.
Figure 9. Disk Status report
Fan status
Thermometer status
PSU status
Battery status
Network interface status
Filesystem status
VTL status
The VTL Status report provides a detailed listing of the VTL system status, similar to the
Filesystem Status report shown above. The VTL Status report is supported in DPA version 5.5 SP1 if you
are running Data Domain Operating System version 4.7.
Benefit
With all of the various status reports available, DPA provides useful insight into the status of the
environment and can present this information in an automatically refreshed format that is ideal for
bridges/operations rooms.
Configuration reports
The configuration reports display configuration detail on a wide range of components of the Data Domain
system. These reports might be useful to identify differences in configurations when investigating
differences in function across the enterprise. Any configuration changes can be automatically detected and
alerted to the user (see the "Change management reports" section). Likewise, the configuration report can
be used to easily identify upgrade candidates based on details like firmware revision.
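Both uses described above, comparing configurations and finding upgrade candidates, reduce to simple comparisons once the configuration data is available. The sketch below assumes configurations exported as plain dictionaries and dotted numeric version strings; the system names, setting names, and version numbers are hypothetical and do not reflect a DPA export format.

```python
def config_differences(baseline: dict, other: dict) -> dict:
    """Return {setting: (baseline_value, other_value)} for settings that differ."""
    keys = sorted(set(baseline) | set(other))
    return {key: (baseline.get(key), other.get(key))
            for key in keys if baseline.get(key) != other.get(key)}


def version_tuple(version: str) -> tuple:
    """Convert a dotted version such as '4.10.0' to (4, 10, 0) for comparison."""
    return tuple(int(part) for part in version.split("."))


def upgrade_candidates(versions_by_system: dict, minimum: str) -> list:
    """Return systems whose version is older than the required minimum."""
    required = version_tuple(minimum)
    return sorted(name for name, version in versions_by_system.items()
                  if version_tuple(version) < required)


# Hypothetical configurations of two systems.
site_a = {"os_version": "4.7.0", "snmp_enabled": True, "timezone": "UTC"}
site_b = {"os_version": "4.6.2", "snmp_enabled": True, "timezone": "EST"}
print(config_differences(site_a, site_b))

# Hypothetical fleet: only dd-a is older than the required 4.7.0.
fleet = {"dd-a": "4.6.2", "dd-b": "4.7.0", "dd-c": "4.10.0"}
print(upgrade_candidates(fleet, "4.7.0"))  # prints ['dd-a']
```

Comparing versions as integer tuples rather than strings avoids treating 4.10.0 as older than 4.7.0.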
Battery configuration
VTL configuration
The VTL Configuration report is supported with DPA version 5.5 SP1 and Data Domain Operating System
version 4.7.
Benefit
DPA provides one-stop insight into the configuration settings of the entire Data Domain environment.
Performance reports
DPA provides a wealth of performance reports for Data Domain systems and their relationships with the
broader environment.
Benefit
DPA can use its reporting of FC switches and backup host (storage node) FC HBAs to establish virtual
drive performance.
Disk performance
Fileserver performance
Network interface status
This report shows the status of ports within a Data Domain system. Note that a number of the ports
are not active, which corresponds with the Health Status report earlier. This could mean that these network
cards were turned off or that there was a failure within the system.
Figure 24. Change Overview
Figure 25. Change Details report
Benefit
In today’s dynamic environments it is essential to keep track of changes. DPA allows the user to readily
establish the change history of the Data Domain system assets both in real time and historically.
If virtual or physical tape resources are unavailable to the backup application, the user can be
automatically alerted, enabling them to investigate the Data Domain system in the event of a
failure.
Conclusion
EMC Data Protection Advisor (DPA) collects, monitors, analyzes, and reports on information from your
customer’s entire data protection infrastructure, providing a unified data protection management interface.
Data Domain system support in DPA gathers information about the configuration, status, and performance
of Data Domain system components, via SNMP from the Data Domain MIB. DPA can present this
information in multiple formats, arranging assets in logical groups (such as business unit or geography),
greatly enhancing readability, and giving the user the ability to perform advanced reporting,
troubleshooting, performance management, and capacity planning operations. Through heterogeneous
support, DPA reduces the cost and risk to manage a data protection environment, enabling our customers to
get more from their existing investments.