h13512 Vnxe3200 Best Practices For Performance WP
h13512 Vnxe3200 Best Practices For Performance WP
Performance
Applied Best Practices Guide
Abstract
This applied best practices guide provides recommended best practices for
installing and configuring VNXe™ systems for best performance.
September, 2014
Copyright © 2014 EMC Corporation. All rights reserved. Published in the USA.
EMC believes the information in this publication is accurate of its publication date.
The information is subject to change without notice.
EMC2, EMC, and the EMC logo are registered trademarks or trademarks of EMC
Corporation in the United States and other countries. All other trademarks used
herein are the property of their respective owners.
For the most up-to-date regulatory document for your product line, go to the technical
documentation and advisories section on EMC Online Support.
2 EMC VNXe3200 Best Practices for Performance Applied Best Practices Guide
Contents
EMC VNXe3200 Best Practices for Performance Applied Best Practices Guide
3
Contents
4 EMC VNXe3200 Best Practices for Performance Applied Best Practices Guide
Preface
As part of an effort to improve and enhance the performance and capabilities of its
product line, EMC from time to time releases revisions of its hardware and software.
Therefore, some functions described in this guide may not be supported by all
revisions of the hardware or software currently in use. For the most up-to-date
information on product features, refer to your product release notes.
If a product does not function properly or does not function as described in this
document, please contact your EMC representative.
Purpose
The Applied Best Practices Guide delivers straightforward guidance to the majority of
customers using the storage system in a mixed business environment. The focus is
on system performance and maximizing the ease of use of the automated storage
features, while avoiding mismatches of technology. Some exception cases are
addressed in this guide; however, less commonly encountered edge cases are not
covered by general guidelines and are addressed in use-case-specific white papers.
Guidelines can and will be broken, appropriately, owing to differing circumstances or
requirements. Guidelines must adapt to:
• Different sensitivities toward data integrity
• Different economic sensitivities
EMC VNXe3200 Best Practices for Performance Applied Best Practices Guide 5
System Configuration
Audience
This document is intended for EMC customers, partners, and employees who are
installing and/or configuring VNXe unified systems. Some familiarity with EMC
unified storage systems is assumed.
Related documents
6 EMC VNXe3200 Best Practices for Performance Applied Best Practices Guide
System Configuration
Essential guidelines................................................................................... 8
Storage Processor cache ............................................................................ 8
Physical placement of drives ...................................................................... 8
Hot Sparing ..................................................................................... 9
Availability and connectivity....................................................................... 9
EMC VNXe3200 Best Practices for Performance Applied Best Practices Guide
7
System Configuration
Essential guidelines
This paper introduces specific configuration recommendations that enable good
performance from a VNXe3200 storage system. At the highest level, good
performance design follows a few simple rules. The main principles of designing a
storage system for performance are:
• Flash First – Utilize flash storage for the active dataset to achieve maximum
performance
• Distribute the load over available hardware resources
• Design for 70 percent utilization (activity level) for hardware resources
• When utilizing Hard Disk Drives (HDD), AVOID mixing response-time-sensitive
I/O with large-block I/O or high-load sequential I/O
• Maintain latest released VNXe Operating Environment version
8 EMC VNXe3200 Best Practices for Performance Applied Best Practices Guide
System Configuration
Hot Sparing
Hot sparing is the process of rebuilding a failed drive’s data onto a system-selected
compatible drive. Any unbound non-system drive can be considered for sparing.
When planning Hot Spares consider the following recommendations:
• Plan to reserve at least one of every 30 installed drives of a given type
o Verify count in the GUI or CLI
Storage > Storage Configuration > Spare Disks
uemcli /env/disk –unused show
o Note: Unbound system drives (DPE Disk 0 through DPE Disk 3) cannot be
used as hot spares
• Ensure that unbound drives for each drive type are available
o SAS Flash / FAST Cache SSD (SLC) must spare for SAS Flash / FAST Cache
SSD (SLC)
o SAS Flash VP / FAST VP SSD (eMLC) must spare for SAS Flash VP / FAST VP
SSD (eMLC)
o SAS must spare for SAS (regardless of rotational speed)
o NL-SAS must spare for NL-SAS
• The capacity of an unbound drive should be equal to or larger than the
provisioned drives for which it will spare
• Use multiple FC I/O ports on each SP, and balance host port connections
across FC I/O ports, as host port connections affect the preferred CPU core
assignment
EMC VNXe3200 Best Practices for Performance Applied Best Practices Guide
9
System Configuration
• If not connecting all the available FC I/O ports, use the even numbered ports
on each FC I/O module before using any odd numbered ports
iSCSI Connectivity
iSCSI connectivity is facilitated via the onboard copper NICs. The NICs will connect at
100Mbps, 1Gbps, or 10Gbps.
o Note: The entire network infrastructure must also support Jumbo Frames
NAS Connectivity
NAS protocols (NFS and CIFS) are facilitated via the onboard copper NICs. The NICs
will connect at 100Mbps, 1Gbps, or 10Gbps.
10 EMC VNXe3200 Best Practices for Performance Applied Best Practices Guide
Chapter 2 Storage Configuration
EMC VNXe3200 Best Practices for Performance Applied Best Practices Guide 11
Storage Configuration
General considerations
Drive type
Match the appropriate drive type to the expected workload:
Drive type Workload type
SAS Flash / FAST Cache SSD (SLC) For extreme performance; these provide the best
performance for transactional random workloads, and
the lowest write service times.
Required for Multicore FAST Cache
SAS Flash VP / FAST VP SSD (eMLC) For extreme performance FAST VP tier; these are a higher
capacity flash option.
Not for use with Multicore FAST Cache.
Rules of thumb
Disk drives are a critical element of unified performance. Use the rule of thumb
information to determine the number of drives to use to support the expected
workload.
These guidelines are a conservative starting point for sizing, not the absolute
maximums.
Rules of thumb (RoT) for drive bandwidth (MB/s):
• Bandwidth assumes multiple large-block sequential streams
• Parity does not count towards host write bandwidth sizing
o For example; a 4+1 RAID group using SAS 15K to support a sequential
write workload is sized at 100 MB/s for the entire RAID group (4*25 MB/s)
Bandwidth NL-SAS SAS 10K SAS 15K Flash (All)
RoT per drive, 15 MB/s 25 MB/s 30 MB/s 90 MB/s
Sequential Read
12 EMC VNXe3200 Best Practices for Performance Applied Best Practices Guide
Storage Configuration
Throughput NL-SAS SAS 10K SAS 15K SAS Flash VP (eMLC) SAS Flash (SLC)
Per drive RoT 90 IOPS 150 IOPS 180 IOPS 3500 IOPS 5000 IOPS
RAID level
For best performance from the least number of drives, match the appropriate RAID
level with the expected workload:
RAID 6 for NL-SAS Works best with read-biased workloads such as archiving and backup
to disk
RAID 6 provides additional RAID protection to endure longer rebuild
times of large drives
EMC VNXe3200 Best Practices for Performance Applied Best Practices Guide
13
Storage Configuration
14 EMC VNXe3200 Best Practices for Performance Applied Best Practices Guide
Storage Configuration
• Within a given pool, use all of the same flash technology for the extreme
performance tier
• When expanding pools, use a multiple of the preferred drive count already in
use for the tier being expanded
Pool capacity considerations
EMC recommends leaving free space in the storage pool, to accommodate data
services.
Note: The pool can still be oversubscribed above 100% of actual capacity. The values
here refer to actual physical space that is not used in the pool.
• When using Snapshots, at least 5% free space is needed; EMC recommends
maintaining about 10% to buffer snapped writes
• When using FAST VP, EMC recommends maintaining at least 10% free to
accommodate the quickest rebalancing
• When using FAST VP and Snapshots together in a pool, a total of 10% free will
meet the requirements of both
Note: By default, the VNXe3200 will begin issuing alerts when more than 70% of
available capacity has been subscribed.
File systems share space in the storage pool with their Snapshots, and also with
Block LUNs if the pool is shared.
• DON’T oversubscribe space in a storage pool that contains file systems
• Ensure that the storage pool has sufficient capacity to cover the maximum
size of all file systems, plus the capacity needed for any Snapshots, plus the
maximum size of all Block LUNs
EMC VNXe3200 Best Practices for Performance Applied Best Practices Guide
15
Storage Configuration
Storage tiers
Skew is when a small percentage of the total storage capacity in a storage system is
the target for the majority of the IOPS served by the system. It is the locality of active
data within the total storage capacity. For instance, in a payroll system, the current
month’s data will be highly active, year-to-date data will be moderately active, and
the data for previous years will be mostly inactive.
Storage tiers improve system performance by providing faster drives to handle the
more active data, while keeping the less active data on inexpensive capacity drives.
The number of tiers required in a storage pool is influenced by performance
requirements, capacity requirements, and the knowledge of the skew between active
and inactive capacity. Best performance is achieved when the entire active dataset
can be contained within the capacity of the Extreme Performance (flash) and
Performance (SAS) tiers.
If the active capacity (skew) is known, the capacity per tier should be sized
accordingly. Using the payroll system example again, if 7 years of data are kept in the
storage system, less than 20% of the data is expected to be highly active (1 year out
of 7).
If the active capacity is not known, consider capacity per tier of 5 percent flash, 20
percent SAS, and 75 percent NL-SAS. This works on the assumption that less than 25
percent of the used capacity will be active, and infrequent relocations from the lowest
tier will occur.
Follow these general guidelines:
• When Multicore FAST Cache is available, use a 2-tier pool comprised of SAS
and NL-SAS. Enable Multicore FAST Cache as a cost-effective way of realizing
flash performance without dedicating flash to this pool
o Flash tier can be added later if Multicore FAST Cache is not fully capturing
the active data
• For a 3-tier pool, start with 5 percent flash, 20 percent SAS, and 75 percent
NL-SAS for capacity per tier if skew is not known
o Tiers can be expanded after initial deployment to effect a change in the
capacity distribution if needed
• Use a 2-tier pool comprised of flash and SAS as an effective way of providing
consistently good performance.
o NL-SAS can be added later if capacity growth and aged data require it
• AVOID using a 2-tier pool of flash and NL-SAS if there is uncertainty about the
active data fitting in the flash tier
o The SAS tier provides a buffer for active data not captured in the flash tier;
the SAS tier still provides modest performance, as well as quicker
promotion to flash when relocations occur
• Add a flash tier to a pool with thin LUNs so that metadata is promoted to flash
and overall performance is improved
16 EMC VNXe3200 Best Practices for Performance Applied Best Practices Guide
Storage Configuration
EMC VNXe3200 Best Practices for Performance Applied Best Practices Guide
17
Data Services
FAST VP ..................................................................................... 19
Multicore FAST Cache................................................................................. 20
Snapshots ..................................................................................... 21
Deduplication ..................................................................................... 21
18 EMC VNXe3200 Best Practices for Performance Applied Best Practices Guide
Data Services
FAST VP
General
FAST VP moves data between tiers in a pool based on the performance needs of the
data. Construct the pool such that each tier will provide consistent performance.
• Use consistent drive technology for each tier within a single pool
o Same flash drive technology and drive size for the extreme performance
tier
o Same SAS RPM and drive size for the performance tier
o Same NL-SAS drive size for the capacity tier
Tiering Policy
For most tiered pools, using auto-tier will provide the best performance.
• DON’T use auto-tier for LUNs with low-skew random workloads where the
active dataset will not fit in the highest tier
o This might cause excessive tier relocations that may not benefit the active
data
• AVOID using highest-available for a LUN whose capacity exceeds 90% the
highest tier capacity
o This can affect the overall efficiency of the highest tier to service active
data for LUNs running in auto-tier mode
• AVOID using lowest-available with thin LUNs, as this will force the metadata
into the lowest tier as well
Data Relocation
Relocation is the process of moving pool data slices across tiers, or within the same
tier, to move hot data to higher performing drives, or to balance underlying drive
utilization. Relocation can occur as part of a FAST VP scheduled relocation, as an
automated relocation after a storage pool expansion, or as a result of manually
requested relocation.
• Enable FAST VP on a pool, even if the pool only contains a single tier, to
provide ongoing load balancing across available drives based on slice
temperature and capacity utilization
• Schedule relocations for off-hours, so that relocation activity does not
contend with the primary workload
• Schedule relocations to run before or during backup windows, so that the
relocations are based on the primary workload activity
Pool capacity utilization
FAST VP requires unallocated space within the pool to accommodate data
relocations.
• EMC recommends leaving 10% free space in storage pools when FAST VP is
enabled
o Relocation will attempt to reclaim 10 percent free per tier
EMC VNXe3200 Best Practices for Performance Applied Best Practices Guide
19
Data Services
General considerations
EMC recommends first utilizing available flash drives for Multicore FAST Cache, which
can globally benefit all LUNs in the storage system. Supplement performance as
needed with additional flash drives in storage pool tiers.
Preferred application workloads for Multicore FAST Cache:
• Small-block random I/O applications with high locality
• High frequency of access to the same data
• Systems where current performance is limited by HDD capability, not SP
capability
AVOID enabling Multicore FAST Cache for pools that are not expected to benefit, such
as when:
• The primary workload is sequential
• The primary workload is large-block I/O
Multicore FAST Cache can improve overall system performance if the current
bottleneck is drive-related, but boosting the IOPS will result in greater CPU utilization
on the SPs. Generally, EMC recommends sizing systems so that the maximum
sustained utilization is 70 percent.
On an existing system, check the SP CPU utilization of the system, and then proceed
as follows:
20 EMC VNXe3200 Best Practices for Performance Applied Best Practices Guide
Data Services
Snapshots
Snapshots are used to take point-in-time checkpoints of LUNs and file systems.
LUNs
When using Snapshots with LUNs:
• Start with thin LUNs, to provide the most optimal pool capacity utilization
• Plan for the deletion of snapshots
o Whenever possible, schedule the deletion of Snapshots during non-peak
hours of operation
If snapshots must be deleted during peak periods of array activity,
lessen the impact by reducing the number of concurrent Snapshot
deletes (for example, stagger the delete operations over several
hours, instead of all at once)
o DON'T delete the last snapshot of a Thick LUN, if you intend to create
another snapshot immediately after deleting the last snapshot
Create the new snapshot before deleting the older snapshot
Deleting the last snapshot of a Thick LUN will undo the thin
conversion, which would then be reconverted for the new snapshot
File Systems
When using Snapshots with file systems:
• Start with thin file systems, to provide the most optimal pool capacity
utilization
• Migrate data into a new file system before enabling Snapshots on the file
system
• DON'T delete the last snapshot of a file system, if you intend to create another
snapshot immediately after deleting the last snapshot
o Create the new snapshot before deleting the older snapshot
o Deleting the last snapshot of a file system will undo the thin conversion,
which would then be reconverted for the new snapshot
Deduplication
EMC VNXe3200 Best Practices for Performance Applied Best Practices Guide
21
Data Services
File Systems
If using file system deduplication and compression:
• Enable deduplication and compression on a file system and allow the initial
scan to complete before enabling Snapshots
• Use file extension and path filtering to exclude large files that will not
compress or deduplicate well, such as JPG or MP3 files
22 EMC VNXe3200 Best Practices for Performance Applied Best Practices Guide
Application Specific Considerations
EMC VNXe3200 Best Practices for Performance Applied Best Practices Guide
23
Application Specific Considerations
• Use Jumbo Frames (9000 MTU) with iSCSI, including all host NICs, network
equipment, and array ports
24 EMC VNXe3200 Best Practices for Performance Applied Best Practices Guide
Chapter 5 Conclusion
This best practices guide provides configuration and usage recommendations for
VNXe3200 systems in general usage cases.
EMC VNXe3200 Best Practices for Performance Applied Best Practices Guide 25