Ibm San
Ibm San
Understand SVC-based
migrations
Lisa Martinez
Rosemary McCutchen
Hank Sautter
Bertrand Dufrasne Stephen Solewin
Aubrey Applewhaite Anthony Vandewerdt
David Denny Ron Verbeek
Jawed Iqbal Pete Wendler
Christina Lara Roland Wolf
ibm.com/redbooks
International Technical Support Organization
April 2010
SG24-7759-00
Note: Before using this information and the product it supports, read the information in “Notices” on
page ix.
This edition applies to Version 10, Release 2.1, of the XIV Storage System software.
Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix
Trademarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .x
Preface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xi
The team that wrote this book . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xi
Now you can become a published author, too! . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xiii
Comments welcome. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xiv
Stay connected to IBM Redbooks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xiv
Chapter 1. Snapshots . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.1 Snapshots architecture . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.2 Snapshots . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
1.2.1 Creating a snapshot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
1.2.2 Viewing snapshot details . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
1.2.3 Deletion priority . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
1.2.4 Restore a snapshot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2.5 Overwriting snapshots . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
1.2.6 Unlocking a snapshot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
1.2.7 Locking a snapshot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.2.8 Deleting a snapshot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
1.2.9 Automatic deletion of a snapshot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
1.3 Snapshots consistency group . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
1.3.1 Creating a consistency group . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
1.3.2 Creating a snapshot using consistency groups. . . . . . . . . . . . . . . . . . . . . . . . . . . 27
1.3.3 Managing a consistency group . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
1.3.4 Deleting a consistency group . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
1.4 Snapshot with Remote Mirror . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
1.5 MySQL database backup example . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
Chapter 2. Tivoli Storage FlashCopy Manager and Volume Shadow Copy Services . 41
2.1 Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
2.2 Tivoli Storage FlashCopy Manager . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
2.3 Windows Server 2008 Volume Shadow Copy Service . . . . . . . . . . . . . . . . . . . . . . . . . 44
2.3.1 VSS architecture and components . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45
2.3.2 Microsoft Volume Shadow Copy Service function . . . . . . . . . . . . . . . . . . . . . . . . 47
2.4 XIV VSS provider . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47
2.4.1 XIV VSS Provider installation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
2.4.2 XIV VSS Provider configuration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
2.5 Installing and configuring Tivoli Storage FlashCopy Manager for Microsoft Exchange 52
2.6 Backup scenario for Microsoft Exchange Server . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57
Contents v
7.9.4 Host server cannot access the XIV migration volume. . . . . . . . . . . . . . . . . . . . . 218
7.9.5 Remote volume cannot be read . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 218
7.9.6 LUN is out of range . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 219
7.10 Backing out of a data migration. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 219
7.10.1 Back-out prior to migration being defined on the XIV . . . . . . . . . . . . . . . . . . . . 219
7.10.2 Back-out after a data migration has been defined but not activated . . . . . . . . . 219
7.10.3 Back-out after a data migration has been activated but is not complete. . . . . . 219
7.10.4 Back-out after a data migration has reached the synchronised state . . . . . . . . 220
7.11 Migration checklist. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 220
7.12 Device-specific considerations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 223
7.12.1 EMC CLARiiON. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 224
7.12.2 EMC Symmetrix and DMX . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 224
7.12.3 HDS TagmaStore USP . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 226
7.12.4 HP EVA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 226
7.12.5 IBM DS3000/DS4000/DS5000 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 227
7.12.6 IBM ESS E20/F20/800 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 229
7.12.7 IBM DS6000 and DS8000. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 230
7.13 Sample migration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 231
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 279
Contents vii
viii IBM XIV Storage System: Copy Services and Migration
Notices
This information was developed for products and services offered in the U.S.A.
IBM may not offer the products, services, or features discussed in this document in other countries. Consult
your local IBM representative for information on the products and services currently available in your area. Any
reference to an IBM product, program, or service is not intended to state or imply that only that IBM product,
program, or service may be used. Any functionally equivalent product, program, or service that does not
infringe any IBM intellectual property right may be used instead. However, it is the user's responsibility to
evaluate and verify the operation of any non-IBM product, program, or service.
IBM may have patents or pending patent applications covering subject matter described in this document. The
furnishing of this document does not give you any license to these patents. You can send license inquiries, in
writing, to:
IBM Director of Licensing, IBM Corporation, North Castle Drive, Armonk, NY 10504-1785 U.S.A.
The following paragraph does not apply to the United Kingdom or any other country where such
provisions are inconsistent with local law: INTERNATIONAL BUSINESS MACHINES CORPORATION
PROVIDES THIS PUBLICATION "AS IS" WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESS OR
IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF NON-INFRINGEMENT,
MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Some states do not allow disclaimer of
express or implied warranties in certain transactions, therefore, this statement may not apply to you.
This information could include technical inaccuracies or typographical errors. Changes are periodically made
to the information herein; these changes will be incorporated in new editions of the publication. IBM may make
improvements and/or changes in the product(s) and/or the program(s) described in this publication at any time
without notice.
Any references in this information to non-IBM Web sites are provided for convenience only and do not in any
manner serve as an endorsement of those Web sites. The materials at those Web sites are not part of the
materials for this IBM product and use of those Web sites is at your own risk.
IBM may use or distribute any of the information you supply in any way it believes appropriate without incurring
any obligation to you.
Information concerning non-IBM products was obtained from the suppliers of those products, their published
announcements or other publicly available sources. IBM has not tested those products and cannot confirm the
accuracy of performance, compatibility or any other claims related to non-IBM products. Questions on the
capabilities of non-IBM products should be addressed to the suppliers of those products.
This information contains examples of data and reports used in daily business operations. To illustrate them
as completely as possible, the examples include the names of individuals, companies, brands, and products.
All of these names are fictitious and any similarity to the names and addresses used by an actual business
enterprise is entirely coincidental.
COPYRIGHT LICENSE:
This information contains sample application programs in source language, which illustrate programming
techniques on various operating platforms. You may copy, modify, and distribute these sample programs in
any form without payment to IBM, for the purposes of developing, using, marketing or distributing application
programs conforming to the application programming interface for the operating platform for which the sample
programs are written. These examples have not been thoroughly tested under all conditions. IBM, therefore,
cannot guarantee or imply reliability, serviceability, or function of these programs.
The following terms are trademarks of the International Business Machines Corporation in the United States,
other countries, or both:
AIX® FlashCopy® System Storage™
BladeCenter® IBM® System x®
DB2® Lotus® System z®
Domino® Redbooks® Tivoli®
DS4000® Redpaper™ WebSphere®
DS6000™ Redbooks (logo) ® XIV®
DS8000® S/390®
ITIL is a registered trademark, and a registered community trademark of the Office of Government
Commerce, and is registered in the U.S. Patent and Trademark Office.
Snapshot, and the NetApp logo are trademarks or registered trademarks of NetApp, Inc. in the U.S. and other
countries.
VMware, the VMware "boxes" logo and design are registered trademarks or trademarks of VMware, Inc. in the
United States and/or other jurisdictions.
Microsoft, Windows, and the Windows logo are trademarks of Microsoft Corporation in the United States,
other countries, or both.
UNIX is a registered trademark of The Open Group in the United States and other countries.
Linux is a trademark of Linus Torvalds in the United States, other countries, or both.
Other company, product, or service names may be trademarks or service marks of others.
This IBM® Redbooks® publication provides a practical understanding of the XIV® Storage
System copy and migration functions. The XIV Storage System has a rich set of copy
functions suited for various data protection scenarios, which enables clients to enhance their
business continuance, data migration, and online backup solutions. These functions allow
point-in-time copies, known as snapshots and full volume copies, and also include remote
copy capabilities in either synchronous or asynchronous mode. These functions are included
in the XIV software and all their features are available at no additional charge.
The various copy functions are reviewed under separate chapters that include detailed
information about usage, as well as practical illustrations.
This book also discusses how to integrate the snapshot function with the IBM Tivoli®
FlashCopy® manager, explains the XIV built-in migration capability, and presents migration
alternatives based on the San Volume Controller (SVC).
Note: GUI and XCLI illustrations included in this book were created with an early version of
the 10.2.1 code, as available at the time of writing. There could be minor differences with
the XIV 10.2.1 code that is publicly released.
This book is intended for anyone who needs a detailed and practical understanding of the XIV
copy functions.
Bertrand Dufrasne is an IBM Certified Consulting IT Specialist and Project Leader for
System Storage™ disk products at the International Technical Support Organization, San
Jose Center. He has worked at IBM in various IT areas. He has authored many IBM
Redbooks publications and has also developed and taught technical workshops. Before
joining the ITSO, he worked for IBM Global Services as an Application Architect. He holds a
Master’s degree in Electrical Engineering from the Polytechnic Faculty of Mons.
Aubrey Applewhaite is an IBM Certified Consulting IT Specialist working for the Storage
Services team in the UK. He has worked for IBM since 1996 and has over 20 years of
experience in the IT industry. He has worked in a number of areas, including System x®
servers, operating system administration, and technical support. He currently works in a
customer-facing role providing advice and practical expertise to help IBM customers
implement new storage technology. He specializes in XIV, SVC, DS8000®, and DS5000
hardware. He holds a Bachelor of Science degree in Sociology and Politics from Aston
University and is also a VMware Certified Professional.
David Denny is a Solutions Architect with XIV in the IBM Systems and Technology Group.
David has over 20 years of experience in the IT field, ranging from systems administration to
enterprise storage architect. David is the lead corporate resource for data migrations with XIV.
Prior to joining IBM, David was a Lead Architect of the Enterprise SAN for the DoD Disaster
Jawed Iqbal is an Advisory Software Engineer and a Team Lead for Tivoli Storage Manager
Client, Data Protection, and FlashCopy Manager products at the IBM Almaden Research
Center in San Jose, CA. Jawed joined IBM in 2000 and worked as Test Lead on several Data
Protection products, including Oracle RDBMS Server, WebSphere®, MS SQL, MS Exchange,
and Lotus® Domino® Server. He holds a master’s degree in Computer Science, a BBA in
Computer Information Systems, and a bachelor’s degree in Math, Stats, and Economics.
Jawed also holds an ITIL® certification.
Christina Lara is a Senior Test Engineer currently working on the XIV storage test team in
Tucson, AZ. She just completed a 1-year assignment as a Assistant Technical Staff Member
(ATSM) to the Systems Group Chief Test Engineer. Christina has just begun her ninth year
with IBM, having held different test and leadership positions within the Storage Division over
that last several years. Her responsibilities included system level testing and field support test
on both DS8000 and ESS800 storage products and test project management. Christina
graduated from the University of Arizona in 1991 with a BSBA in MIS and Operations
Management. In 2002, she received her MBA in Technology Management from the University
of Phoenix.
Lisa Martinez is a Senior Software Engineer working in the DS8000 and XIV System Test
Architecture in Tucson, Arizona. She has extensive experience in Enterprise Disk Test. She
holds a Bachelor of Science degree in Electrical Engineering from the University of New
Mexico and a Computer Science degree from New Mexico Highlands University. Her areas of
expertise include the XIV Storage System and IBM System Storage DS8000, including Copy
Services, with Open Systems and System z®.
Hank Sautter is a Consulting IT Specialist with Advanced Technical Support in the U.S. He
has 17 years of experience with S/390® and IBM disk storage hardware and Advanced Copy
Services functions working in Tucson, Arizona. His previous 13 years of experience include
IBM Processor microcode development and S/390 system testing while working in
Poughkeepsie, NY. He has worked at IBM for 30 years. Hank's areas of expertise include
enterprise storage performance and disaster recovery implementation for large systems and
open systems. He writes and presents on these topics. He holds a BS degree in Physics.
Stephen Solewin is an XIV Corporate Solutions Architect based in Tucson, Arizona. He has
13 years of experience working on IBM storage, including Enterprise and Midrange Disk, LTO
drives and libraries, SAN, Storage Virtualization, and software. Steve has been working on
the XIV product line since March of 2008, working with both clients and various IBM teams
worldwide. Steve holds a Bachelor of Science degree in Electrical Engineering from the
University of Arizona, where he graduated with honors.
Anthony Vandewerdt is a Senior IT Specialist who currently works for IBM STG Storage
Systems Sales in Australia. He has 21 years of experience providing pre-sales and post-sales
technical support at IBM. Anthony has extensive hands-on experience with nearly all IBM
storage products, especially DS8000, SVC, XIV, ESS800, and Brocade and Cisco SAN
switches. He has worked in a wide variety of post-sales technical support roles including
country and Asia Pacific storage support. Anthony has also worked as an instructor for STG
Education.
Pete Wendler is a Software Engineer for IBM Systems and Technology Group, Storage
Platform, located in Tucson, Arizona. In his 10 years working for IBM, Peter has worked in
client support for enterprise storage products, solutions testing, and development of the IBM
DR550 archive appliance. He currently holds a position in technical marketing at IBM. Peter
received a Bachelor of Science degree from Arizona State University in 1999.
Roland Wolf is a Certified IT Specialist in Germany. He has worked for IBM for 23 years and
has 15 years of experience with high-end disk storage hardware in S/390 and Open Systems
environments. He is working in Field Technical Sales Support for storage systems. His areas
of expertise include performance analysis and disaster recovery solutions in enterprises
utilizing the unique capabilities and features of the IBM disk storage servers, DS8000 and
XIV. He has contributed to various IBM Redbooks publications including ESS, DS80000
Architecture, and DS8000 Copy Services. He holds a Ph.D. in Theoretical Physics.
Special thanks to Rami Elron for his help with and advice on many of the topics covered in
this book.
John Bynum, Aviad Offer, Jim Segdwick, Brian Sherman, Juan Yanes
Find out more about the residency program, browse the residency index, and apply online at:
ibm.com/redbooks/residencies.html
Preface xiii
Comments welcome
Your comments are important to us!
We want our books to be as helpful as possible. Send us your comments about this book or
other IBM Redbooks publications in one of the following ways:
Use the online Contact us review Redbooks form found at:
ibm.com/redbooks
Send your comments in an e-mail to:
[email protected]
Mail your comments to:
IBM Corporation, International Technical Support Organization
Dept. HYTD Mail Station P099
2455 South Road
Poughkeepsie, NY 12601-5400
Chapter 1. Snapshots
The XIV Storage System has a rich set of copy functions suited for various data protection
scenarios, which enables clients to enhance their business continuance, data migration, and
online backup solutions. This chapter provides an overview of the snapshot function for the
XIV product.
A snapshot is a point-in-time copy of a volume’s data. The XIV snapshot is based on several
innovative technologies to ensure minimal degradation of or impact on system performance.
A volume copy is an exact copy of a system volume and differs in approach to a snapshot in
that a full data copy is performed in the background. Snapshots make use of pointers and do
not necessarily copy all the data to the second instance of a volume.
With these definitions in mind, we explore the architecture and functions of snapshots within
the XIV Storage System.
The XIV system consists of several servers with 12 disk drives each and memory that acts as
cache. All the servers are connected to each other and certain servers act as interface
servers to the SAN and the host servers (Figure 1-1).
Server
Network (FC/Ethernet)
Ethernet
Switch 1 Switch 2
Module 1 Module 15
XIV Architecture
• Split volume data in 1MB • Store both copies in
partitions different modules
• Maintain a copy of each • Spread data of a volume
partition across all disk drives
pseudo randomly
Volume
Da ta M odule 1 Da ta M odule 2
Da ta M odule 3
Chapter 1. Snapshots 3
A logical volume is represented by pointers to partitions that make up the volume. If a
snapshot is taken of a volume, the pointers are just copied to form the snapshot volume, as
shown in Figure 1-3. No space is consumed for the snapshot volume up to now.
Vol
• A snapshot of a
volume is taken.
Pointers point to the
same partitions as the
original volume
When an update is performed on the original data, the update is stored in a new position and
a pointer of the original volume now points to the new partition, whereas the snapshot volume
still points to the old partition. Now we use up more space for the original volume and its
snapshot and it has the size of a partition (1 MB). This method is called redirect-on-write.
Empty
Empty
Snapshot Pointer Volume A Volume Pointer
to Partition to Partition
Empty
Volume A Volume Pointer
to Partition
Snapshot Pointer Snapshot of A
to Partition
The actual metadata overhead for a snapshot is small. When the snapshot is created, the
system does not require new pointers because the volume and snapshot are exactly the
same, which means that the time to create the snapshot is independent of the size or number
of snapshots present in the system. As data is modified, new metadata is created to track the
changes to the data.
Note: The XIV system minimizes the impact to the host for write operations by performing
a redirect-on-write operation. As the host writes data to a volume with a snapshot
relationship, the incoming information is placed into a newly allocated partition. Then the
pointer to the data for the master volume is modified to point at the new partition. The
snapshot volume continues to point at the original data partition.
Because the XIV Storage System tracks the snapshot changes on a partition basis, data is
only copied when a transfer is less than the size of a partition. For example, a host writes
4 KB of data to a volume with a snapshot relationship. The 4 KB is written to a new partition,
but in order for the partition to be complete, the remaining data must be copied from the
original partition to the newly allocated partition.
The alternative to redirect-on-write is the copy on write function. Most other systems do not
move the location of the volume data. Instead, when the disk subsystem receives a change, it
copies the volume’s data to a new location for the point-in-time copy. When the copy is
complete, the disk system commits the newly modified data. Therefore, each individual
modification takes longer to complete, as the entire block must be copied before the change
can be made.
As the storage assigned to the snapshot is completely utilized, the XIV Storage System
implements a deletion mechanism to protect itself from overutilizing the set pool space.
Deletion of snapshots is further explained in 1.2.8, “Deleting a snapshot” on page 19.
Chapter 1. Snapshots 5
If you know in advance that an automatic deletion is possible, a pool can be expanded to
accommodate additional snapshots. This function requires that there is available space on
the system for the storage pool. See Figure 1-5.
Each snapshot has a deletion priority property that is set by the user. There are four priorities,
with 1 being the highest priority and 4 being the lowest priority. The system uses this priority
to determine which snapshot to delete first. The lowest priority becomes the first candidate for
deletion. If there are multiple snapshots with the same deletion priority, the XIV system
deletes the snapshot that was created first. Refer to 1.2.3, “Deletion priority” on page 12 for
an example of working with deletion priorities.
When unlocked, the snapshot takes on the properties of a volume and can be resized or
modified. As soon as the snapshot has been unlocked, the modified property is set. The
modified property cannot be reset after a snapshot is unlocked, even if the snapshot is
relocked without modification.
If the first snapshot is unlocked and the duplicate snapshot already exists, the creation time
for the duplicate snapshot does not change. The duplicate snapshot points to the original
snapshot. If a duplicate snapshot is created from the unlocked snapshot, the creation date is
the time of duplication and the duplicate snapshot points at the original snapshot.
Terminology
• Storage Pool Storage Pool
– Administrative construct
for controlling usage of
data capacity Consistency Group
• Volume
– Data capacity spreads
across all disks in IBM
XIV system Volume Volume
• Snapshot
– Point in time image
– Same storage pool as
source
Snapshot Snapshot
• Consistency group
– Multiple volumes that Snapshot Group
require consistent
snapshot creation
– All in same storage pool
• Snapshot group
– Group of consistent
snapshots
A storage pool is just a logical entity that represents storage capacity. Volumes are created in
a storage pool and snapshots of a volume are within the same storage pool. Because
snapshots require capacity as the source and the snapshot volume differ over time, space for
snapshots must be set aside when defining a storage pool (Figure 1-7). A storage pool can
be resized as needed as long as there is enough free capacity in the XIV Storage System.
Chapter 1. Snapshots 7
An application can utilize many volumes on the XIV Storage System. For example, a
database application can span several volumes for journaling and user data. In this case, the
snapshot for the volumes must occur at the same moment in time so that the journal and data
are synchronized. The consistency group allows the user to perform the snapshot on all the
volumes assigned to the group at the same moment in time, therefore enforcing data
consistency.
The XIV Storage System creates a special snapshot related to the Remote Mirroring
functionality. During the recovery process of lost links, the system creates a snapshot of all
the volumes in the system. This snapshot is used if the synchronization process fails. The
data can be restored to a point of known consistency. A special value of the deletion priority is
used to prevent the snapshot from being automatically deleted. Refer to 1.4, “Snapshot with
Remote Mirror” on page 33, for an example of this snapshot.
1.2 Snapshots
The creation and management of snapshots with the XIV Storage System are simple and
easy to perform. This section guides you through the life cycle of a snapshot, providing
examples of how to interact with the snapshots using the GUI. This section also discusses
duplicate snapshots and the automatic deletion of snapshots.
From this view shown in Figure 1-9, there are three other details:
First is the locked property of the snapshot. By default, a snapshot is locked, which means
the it is write inhibited at the time of creation.
Secondly, the modified property is displayed to the right of the locked property. In this
example, the snapshot has not been modified.
Third, the creation date is displayed. For this example, the snapshot was created on
12 June 2009 at 21:39.
You might want to create a duplicate snapshot, for example, if you want to keep this snapshot
as it is and you want another one that you want to modify,
The duplicate has the same creation date as the first snapshot, and it also has a similar
creation process. From the Volumes and snapshots view, right-click the snapshot to duplicate.
Select Duplicate from the menu to create a new duplicate snapshot. Figure 1-10 provides an
example of duplicating the snapshot, ITSO_Volume.snapshot_00001.
After selecting Duplicate from the menu, the duplicate snapshot is displayed directly under
the original snapshot.
Chapter 1. Snapshots 9
Note: The creation date of the duplicate snapshot in Figure 1-11 is the same creation date
as the original snapshot. Even though it is not shown, the duplicate snapshot points to the
master volume, not the original snapshot.
Example 1-1 provides an example of creating a snapshot and a duplicate snapshot with the
Extended Command Line Interface (XCLI).
In the following examples we use the XIV Session XCLI. You could also use the XCLI
command. In this case, however, specify the configuration file or the IP address of the XIV
that you are talking to as well as the user ID and password. Use the XCLI command to
automate tasks with batch jobs. For simplicity, we used the XIV Session XCLI in our
examples.
Example 1-1 Creating a snapshot and a duplicate with the XCLI Session
snapshot_create vol=ITSO_Volume
snapshot_duplicate snapshot=ITSO_Volume.snapshot_00001
After the snapshot is created, it must be mapped to a host in order to access the data. This
action is performed in the same way as mapping a normal volume.
Important: A snapshot is an exact replica of the original volume. Certain hosts do not
properly handle having two volumes with the same exact metadata describing them. In
these cases, you must map the snapshot to a different host to prevent failures.
Creation of a snapshot is only done in the volume’s storage pool. A snapshot cannot be
created in a storage pool other than the one that owns the volume. If a volume is moved to
another storage pool, the snapshots are moved with the volume to the new storage pool
(provided that there is enough space).
Scroll down to the snapshot of interest and select the snapshot by clicking its name. Details of
the snapshot are displayed in the upper right panel. Looking at the volume ITSO_Volume, it
contains a snapshot 00001 and a duplicate snapshot 00002. The snapshot and the duplicate
snapshot have the same creation date of 2009-06-12 21:39:08, as shown in Figure 1-13. In
addition, the snapshot is locked, has not been modified, and has a deletion priority of 1 (which
is the highest priority, so it will be deleted last).
Chapter 1. Snapshots 11
Along with these properties, the tree view shows a hierarchal structure of the snapshots. This
structure provides details about restoration and overwriting snapshots. Any snapshot can be
overwritten by any parent snapshot, and any child snapshot can restore a parent snapshot or
a volume in the tree structure.
In Figure 1-13 on page 11, the duplicate snapshot is a child of the original snapshot, or in
other words, the original snapshot is the parent of the duplicate snapshot. This structure has
nothing to do with how the XIV Storage System manages the pointers with the snapshots, but
is intended to provide an organizational flow for snapshots.
Example 1-2 is an example of viewing the snapshot data in the XCLI Session. Due to space
limitations, only a small portion of the data is displayed from the output.
If the snapshot space is full, the duplicate snapshot is deleted first even though the original
snapshot is older.
To modify the deletion priority, right-click the snapshot in the Volumes and snapshots view
and select Change Deletion Priority, as shown in Figure 1-14.
Figure 1-16 confirms that the duplicate snapshot has had its deletion priority lowered to 4. As
shown in the upper right panel, the delete priority is reporting a 4 for snapshot
ITSO_Volume.snapshot_00002.
To change the deletion priority for the XCLI Session, specify the snapshot and new deletion
priority, as illustrated in Example 1-3.
Chapter 1. Snapshots 13
The GUI also lets you specify the deletion priority when you create the snapshot. Instead of
selecting Create Snapshot, you select Create Snapshot (Advanced), as shown in
Figure 1-17).
A panel is presented that allows you to specify the deletion priority, but it also allows you to
use your own volume name for the snapshot.
After you perform the restore action, you return to the Volumes and snapshots panel. The
process is instantaneous, and none of the properties (creation date, deletion priority, modified
properties, or locked properties) of the snapshot or the volume have changed.
Specifically, the process modifies the pointers to the master volume so that they are
equivalent to the snapshot pointer. This change only occurs for partitions that have been
modified. On modification, the XIV Storage System stores the data in a new partition and
modifies the master volume’s pointer to the new partition. The snapshot pointer does not
change and remains pointing at the original data. The restoration process restores the pointer
back to the original data and frees the modified partition space.
If a snapshot is taken and later on the original volume increases in size, you can still do a
restore operation. The snapshot still has the original volume size and when you restore it to
the original volume this volume also will have the original size again.
The XCLI Session (or XCLI command) provides more options for restoration than the GUI.
With the XCLI, you can restore a snapshot to a parent snapshot (Example 1-4).
Chapter 1. Snapshots 15
pointers to the original data are lost, and the snapshot appears as new. Storage that was
allocated for the data changes between the volume and its snapshot is released.
From either the Volumes and Snapshots view or the Snapshots Tree view, right-click the
snapshot to overwrite. Select Overwrite from the menu and a dialog box opens. Click OK to
validate the overwriting of the snapshot. Figure 1-20 illustrates overwriting the snapshot
named ITSO_Volume.snapshot_00001.
It is important to note that the overwrite process modifies the snapshot properties and
pointers when involving duplicates. Figure 1-21 shows two changes to the properties. The
snapshot named ITSO_Volume.snapshot_00001 has a new creation date. The duplicate
snapshot still has the original creation date. However, it no longer points to the original
snapshot. Instead, it points to the master volume according to the snapshot tree, which
prevents a restoration of the duplicate to the original snapshot. If the overwrite occurs on the
duplicate snapshot, the duplicate creation date is changed, and the duplicate is now pointing
to the master volume.
Figure 1-21 Snapshot tree after the overwrite process has occurred
The XCLI performs the overwrite operation through the snapshot_create command. There is
an optional parameter in the command to specify which snapshot to overwrite. If the optional
parameter is not used, a new snapshot volume is created.
The results in the Snapshots Tree window show that the locked property is off and the
modified property is on for ITSO_Volume.snapshot_00002. Even if the volume is relocked or
overwritten with the original master volume, the modified property remains on. Also note that
in Figure 1-23 the structure is unchanged. If an error occurs in the modified duplicate
snapshot, the duplicate snapshot can be deleted, and the original snapshot duplicated a
second time to restore the information.
Chapter 1. Snapshots 17
For the second scenario, the original snapshot is unlocked and not the duplicate. Figure 1-24
shows the new property settings for ITSO_Volume.snapshot.00001. At this point, the duplicate
snapshot mirrors the unlocked snapshot, because both snapshots still point to the original
data. While the unlocked snapshot is modified, the duplicate snapshot references the original
data. If the unlocked snapshot is deleted, the duplicate snapshot remains, and its parent
becomes the master volume.
Because the hierarchal snapshot structure was unmodified, the duplicate snapshot can be
overwritten by the original snapshot. The duplicate snapshot can be restored to the master
volume. Based on the results, this process is no different from the first scenario. There is still
a backup and a working copy of the data.
Even though there has not been a change to the snapshot, the system does not remove the
modified property.
The XCLI lock command (vol_lock), which is shown in Example 1-7, is almost a mirror
operation of the unlock command. Only the actual command changes, but the same
operating parameters are used when issuing the command.
Chapter 1. Snapshots 19
Figure 1-28 no longer displays the snapshot ITSO_volume.snapshot.00001. Note that the
volume and the duplicate snapshot are unaffected by the removal of this snapshot. In fact, the
duplicate becomes the child of the master volume. The XIV Storage System provides the
ability to restore the duplicate snapshot to the master volume or to overwrite the duplicate
snapshot from the master volume even after deleting the original snapshot.
The delete snapshot command (snapshot_delete) operates the same as the creation
snapshot. Refer to Example 1-8.
Important: If you delete a volume all snapshots associated with the volume are also
deleted.
With this scenario, a duplicate does not cause the automatic deletion to occur. Because a
duplicate is a mirror copy of the original snapshot, the duplicate does not create the additional
allocations in the storage pool.
Chapter 1. Snapshots 21
To examine the details of the scenario at the point where the second snapshot is taken, a
partition is in the process of being modified. The first snapshot caused a redirect on write, and
a partition was allocated from the snapshot area in the storage pool. Because the second
snapshot occurs at a different time, this action generates a second partition allocation in the
storage pool space. This second allocation does not have available space, and the oldest
snapshot is deleted. Figure 1-30 shows that the master volume XIV_ORIG_VOL and the newest
snapshot XIV_ORIG_VOL.snapshot.00007 are present. The oldest snapshot
XIV_ORIG_VOL.snapshot.00006 was removed.
Chapter 1. Snapshots 23
Restriction: Volumes in a consistency group must be in the same storage pool. A
consistency group cannot have volumes from different pools.
Starting at the Volumes menu, select the volume that is to be added to the consistency group.
To select multiple volumes, hold down the Ctrl key and click each volume. After the volumes
are selected, right-click a selected volume to bring up the Operations menu. From there, click
Create consistency group with these Volumes. Refer to Figure 1-32 for an example of this
operation.
After selecting the Create option from the menu, a dialog window appears. Enter the name of
the consistency group. Because the volumes are added during creation, it is not possible to
change the pool name. Figure 1-33 shows the process of creating a consistency group. After
the name is entered, click Create.
Viewing the volumes displays the owning consistency group. As in Figure 1-34, the two
volumes contained in the xiv_volume_copy pool are now owned by the xiv_db_cg consistency
group. The volumes are displayed in alphabetical order and do not reflect a preference or
internal ordering.
This selection sorts the information by consistency group. The panel allows you to expand the
consistency group and see all the volumes owned by that consistency group. In Figure 1-36,
there are two volumes owned or contained by the xiv_db_cg consistency group. In this
example, a snapshot of the volumes has not been created.
From the consistency group view, you can create a consistency group without adding
volumes. On the menu bar at the top of the window, there is an icon to add a new consistency
group. By clicking the Add consistency group icon shown in Figure 1-37, a creation dialog box
appears, as shown in Figure 1-33 on page 24. Then provide a name and the storage pool for
the consistency group.
Chapter 1. Snapshots 25
When created, the consistency group appears in the Consistency Groups view of the GUI
(Figure 1-38). The new group does not have any volumes associated with it. A new
consistency group named xiv_db_cg is created. The consistency group cannot be expanded
yet, because there are no volumes contained in the consistency group xiv_db_cg.
Using the Volumes view in the GUI, select the volumes to add to the consistency group. You
can select multiple volumes by holding Ctrl down and clicking the desired volumes. After
selecting the desired volumes, right-click the volumes and select Add to Consistency
Group. Figure 1-39 shows two volumes being added to a consistency group:
xiv_vmware_1
xiv_vmware_2
Using the XCLI Session (or XCLI command), the process must be done in two steps. First,
create the consistency group, then the volumes are added. Example 1-9 provides an example
of setting up a consistency group and adding volumes using the XCLI.
Example 1-9 Creating consistency groups and adding volumes with the XCLI
cg_create cg=xiv_new_cg pool=ITSO_Volume_CG
cg_add_vol cg=xiv_new_cg vol=ITSO_Volume_01
cg_add_vol cg=xiv_new_cg vol=ITSO_Volume_02
Chapter 1. Snapshots 27
The new snapshots are created and displayed beneath the volumes in the consistency group
view (Figure 1-42). These snapshots have the same creation date and time. Each snapshot is
locked on creation and has the same defaults as a regular snapshot. The snapshots are
contained in a group structure (called a snapshot group) that allows all the snapshots to be
managed by a single operation.
Adding volumes to a consistency group does not prevent you from creating a single volume
snapshot. If a single volume snapshot is created, it is not displayed in the consistency group
view. The single volume snapshot is also not consistent across multiple volumes. However,
the single volume snapshot does work according to all the rules defined previously in 1.2,
“Snapshots” on page 8.
With the XCLI, when the consistency group is set up, it is simple to create the snapshot. One
command creates all the snapshots within the group at the same moment in time.
Chapter 1. Snapshots 29
To obtain details about a consistency group, you can select Snapshots Group Tree from the
Volumes menu. Figure 1-44 shows where to find the group view.
To display all the consistency groups in the system, issue the XCLI cg_list command.
More details are available by viewing all the consistency groups within the system that have
snapshots. The groups can be unlocked or locked, restored, or overwritten. All the operations
discussed in the snapshot section are available with the snap_group operations.
Chapter 1. Snapshots 31
Example 1-12 illustrates the snap_group_list command.
In order to delete a consistency group with the XCLI, you must first remove all the volumes
one at a time. As in Example 1-13, each volume in the consistency group is removed first.
Then the consistency group is available for deletion. Deletion of the consistency group does
not delete the individual snapshots. They are tied to the volumes and were removed from the
consistency group when you removed the volumes.
Important: This snapshot has a special deletion priority and is not deleted automatically if
the snapshot space becomes fully utilized.
When the synchronization is complete, the snapshot is removed by the system because it is
no longer needed. The following list describes the sequence of events to trigger the creation
of the special snapshot. Note that if a write does not occur while the links are broken, the
system does not create the special snapshot. The events are:
1. Remote Mirror is synchronized.
2. Loss of connectivity to remote system occurs.
3. Writes continue to the primary XIV Storage System.
4. Mirror paths are reestablished (here the snapshot is created) and synchronization starts.
For more details about Remote Mirror refer to Chapter 5, “Synchronous Remote Mirroring” on
page 125.
Important: The special snapshot is created regardless of the amount of pool space on the
target pool. If the snapshot causes the pool to be overutilized, the mirror remains inactive.
The pool must be expanded to accommodate the snapshot, then the mirror can be
reestablished.
Chapter 1. Snapshots 33
1.5 MySQL database backup example
MySQL is an open source database application that is used by many Web programs. For
more information go to:
http://www.mysql.com
The MySQL database stores the data in a set directory and cannot be separated. The backup
data, when captured, can be moved to a separate system. The following scenario shows an
incremental backup of a database and then uses snapshots to restore the database to verify
that the database is valid.
The first step is to back up the database. For simplicity, a script is created to perform the
backup and take the snapshot. Two volumes are assigned to a Linux® host (Figure 1-48). The
first volume contains the database and the second volume holds the incremental backups in
case of a failure.
On the Linux host, the two volumes are mapped onto separate file systems. The first file
system xiv_pfe_1 maps to volume redbook_markus_09, and the second file system xiv_pfe_2
maps to volume redbook_markus_10. These volumes belong to the consistency group MySQL
Group so that when the snapshot is taken, snapshots of both volumes are taken at the same
moment.
# First flush the tables this can be done while running and
# creates an incremental backup of the DB at a set point in time.
/usr/local/mysql/bin/mysql -h localhost -u root -p password < ~/SQL_BACKUP
# Since the mysql daemon was run specifying the binary log name
# of backup the files can be copied to the backup directory on another disk
cp /usr/local/mysql/data/backup* /xiv_pfe_2
When issuing commands to the MySQL database, the password for the root user is stored in
an environment variable (not in the script, as was done in Example 1-16 for simplicity).
Storing the password in an environment variable allows the script to perform the action
without requiring user intervention. For the script to invoke the MySQL database, the SQL
statements are stored in separate files and piped into the MySQL application. Example 1-17
provides the three SQL statements that are issued to perform the backup operation.
SQL_LOCK
FLUSH TABLES WITH READ LOCK
SQL_UNLOCK
UNLOCK TABLES
Chapter 1. Snapshots 35
Before running the backup script, a test database, which is called redbook, is created. The
database has one table, which is called chapter, which contains the chapter name, author,
and pages. The table has two rows of data that define information about the chapters in the
redbook. Figure 1-49 shows the information in the table before the backup is performed.
Now that the database is ready, the backup script is run. Example 1-18 is the output from the
script. Then the snapshots are displayed to show that the system now contains a backup of
the data.
The restore script, shown in Example 1-19, stops the MySQL daemon and unmounts the
Linux file systems. Then the script restores the snapshot and finally remounts and starts
MySQL.
Chapter 1. Snapshots 37
# XCLI command to perform the backup
# ****** NOTE User ID and Password are set in the user profile *****
/root/XIVGUI/xcli -c xiv_pfe snap_group_restore snap_group="$snap_group"
# Mount the FS
mount /dev/dm-2 /xiv_pfe_1
mount /dev/dm-3 /xiv_pfe_2
To help you a bit, I am now going to create the needed MySQL databases
and start the MySQL server for you. If you run into any trouble, please
consult the MySQL manual, that you can find in the Docs directory.
When complete, the data is restored and the redbook database is available, as shown in
Figure 1-51.
Chapter 1. Snapshots 39
40 IBM XIV Storage System: Copy Services and Migration
2
After a brief overview of the Microsoft VSS architecture and an introduction to IBM Tivoli
Storage FlashCopy Manager, we cover the requirements, configuration, and implementation
of the XIV VSS Provider with the Tivoli Storage FlashCopy Manager for backing up Microsoft
Exchange Server data.
Tivoli Storage FlashCopy Manager uses Microsoft Volume Shadow Copy Services in a
Windows® environment. VSS relies on a VSS hardware provider.
We explain in subsequent sections the installation of the XIV VSS Provider and provide
detailed installation and configuration information for the IBM Tivoli Storage FlashCopy
Manager. We have also included usage scenarios.
IBM Tivoli Storage FlashCopy Manager V2.1 is a standalone product designed to perform
near-instant application-aware snapshot backups, with minimal performance impact, for IBM
DB2®, Oracle, SAP, Microsoft SQL Server, and Microsoft Exchange. It improves application
availability and service levels through high-performance, near-instant restore capabilities that
help reduce downtime.
In addition, it can satisfy advanced data protection and data reduction needs through optional
integration with IBM Tivoli Storage Manager V6.
Exchange Integrated
Integratedwith
withIBM
IBM
With
Server Storage
Storage Hardware
Hardware
SVC Optional
XIV TSM
DS8000 Simplified
Simplifieddeployment
deployment
DS 3/4/5* Backup
Integration
*VSS Integration
1
Figure 2-1 Tivoli Storage FlashCopy Manager overview
Tivoli Storage FlashCopy Manager V2.1 has the following key features:
Step-by-step product provision using Microsoft Management Console (MMC) for Microsoft
Exchange Server and Microsoft SQL server
Performs application snapshots without using the TSM Server
Easy to integrate with TSM environment
Diagnostic snapshot tool
Provider complete reporting tool
Manages and schedules application snapshots
Maintains application snapshot history
Configuration verification tool
Incremental and differential support for Exchange VSS Backups
Oracle ASM support
SAP Control File Backup
For more detailed information refer to the IBM Tivoli Storage Web site:
http://www.ibm.com/software/tivoli
Chapter 2. Tivoli Storage FlashCopy Manager and Volume Shadow Copy Services 43
Figure 2-2 shows the Tivoli Storage FlashCopy Manager Management Console.
Without VSS, if you do not have an online backup solution implemented, you either must stop
or quiesce applications during the backup process, or live with the side effects of an online
backup with inconsistent data and open files that could not be backed up.
With VSS, you can produce consistent shadow copies by coordinating tasks with business
applications, file system services, backup applications, fast recovery solutions, and storage
hardware such as the XIV Storage System.
Chapter 2. Tivoli Storage FlashCopy Manager and Volume Shadow Copy Services 45
For exchange data, the Microsoft Exchange Server contains the writer components and
requires no configuration.
For SQL data, Microsoft SQL Server contains the writer components (SqlServerWriter). It
is installed with the SQL Server software and requires the following minor configuration
tasks:
– Set the SqlServerWriter service to automatic. This enables the service to start
automatically when the machine is rebooted.
– Start the SqlServerWriter service.
Provider
This is the application that produces the shadow copy and also manages its availability. It
can be a system provider (such as the one included with the Microsoft Windows operating
system), a software provider, or a hardware provider (such as the one available with the
XIV storage system).
For XIV, you must install and configure the IBM XIV VSS Provider.
VSS uses the following terminology to characterize the nature of volumes participating in a
shadow copy operation:
Persistent
This is a shadow copy that remains after the backup application completes its operations.
This type of shadow copy also survives system reboots.
Non-persistent
This is a temporary shadow copy that remains only as long as the backup application
needs it in order to copy the data to its backup repository.
Transportable
This is a shadow copy volume that is accessible from a secondary host so that the backup
can be off-loaded. Transportable is a feature of hardware snapshot providers. On an XIV
you can mount a snapshot volume to another host.
Source volume
This is the volume that contains the data to be shadow copied. These volumes contain the
application data.
Target or snapshot volume
This is the volume that retains the shadow-copied storage files. It is an exact copy of the
source volume at the time of backup.
Chapter 2. Tivoli Storage FlashCopy Manager and Volume Shadow Copy Services 47
2.4.1 XIV VSS Provider installation
This section illustrates the installation of the XIV VSS Provider. First, make sure that your
Windows system meets the minimum requirements listed below:
Supported operating systems: Windows 2003 and later
Disk space: 3 MB
Host attachment: 1.0.4 or later
.NET Framework: 2.0 with SP1 or later
At the time of writing, the XIV VSS Provider 2.0.9 version was available. We used a Windows
2008 64bit host system for our tests.
The XIV VSS Hardware Provider 2.0.9 version can be downloaded at:
http://www.ibm.com/systems/storage/disk/xiv/index.html
The installation of the XIV VSS Provider is a straightforward normal Windows application
program installation.
To start, locate the XIV VSS Provider installation file, also known as the xProv installation file.
If the XIV VSS Provider 2.0.9 is downloaded from the Internet, the file name is
xProvSetup-x64-2.0.9.exe. Execute the file to start the installation.
Tip: Uninstall any previous versions of the XIV VSS xProv driver if installed. An upgrade is
not allowed with the 2.0.9 release of XIV VSS provider.
The License Agreement window is displayed and to continue the installation you must accept
the license agreement.
The next dialog window is for post-installation operations, as shown in Figure 2-5. Perform a
post-installation configuration during the installation process. The configuration can, however,
be performed at later time. When done, click Next.
A Confirm Installation window is displayed. If required you can go back to make changes or
confirm the installation by clicking Next.
If the post installation check box was selected during the installation (Figure 2-5), the XIV
VSS Provider configuration window shown in Figure 2-7 on page 50 is now displayed. If the
post-installation check box had not been selected during the installation, it must be manually
invoked by selecting Start All Programs XIV and starting the MachinePool Editor, as
shown in Figure 2-6.
Chapter 2. Tivoli Storage FlashCopy Manager and Volume Shadow Copy Services 49
Provide specific information regarding the XIV Storage System IP addresses and user ID and
password with admin privileges. Have that information available.
1. In the dialog shown in Figure 2-7, click Add Machine.
2. The New Machine dialog shown in Figure 2-8 is displayed. Enter the user name and
password of an XIV user with administrator privileges (storageadmin role) and the primary
IP address of the XIV Storage System. Then click Validate.
4. You are now returned to the VSS MachinePool Editor window. The VSS Provider collected
additional information about the XIV storage system, as illustrated in Figure 2-10.
5. Click SaveAll.
At this point XIV VSS Provider configuration is complete and you can close the Machine Pool
Editor window. If you must add other XIV Storage Systems, repeat steps 1 to 5.
Once the XIV VSS provider has been configured as just explained, ensure that the operating
system can recognize it. For that purpose, launch the vssadmin command from the operating
system command line:
C:\>vssadmin list providers
Make sure that IBM XIV VSS HW Provider appears among the list of installed VSS providers
returned by the vssadmin command.
Chapter 2. Tivoli Storage FlashCopy Manager and Volume Shadow Copy Services 51
The Windows server is now ready to perform snapshot operations on the XIV Storage
System. Refer to you application documentation for completing the VSS setup.
The next section demonstrates how the Tivoli Storage FlashCopy Manager application uses
the XIV VSS Provider to perform a consistent point-in-time snapshot of the Exchange 2007
and SQL 2008 data on Windows 2008 64bit.
The Tivoli Storage FlashCopy Manager installation and configuration wizards will guide you
through the installation and configuration steps. After you run the setup and configuration
wizards, your computer is ready to take snapshots.
Tivoli Storage FlashCopy Manager provides the following wizards for installation and
configuration tasks:
Setup wizard
Use this wizard to install Tivoli Storage FlashCopy Manager on your computer.
Local configuration wizard
Use this wizard to configure Tivoli Storage FlashCopy Manager on your computer to
provide locally managed snapshot support. To manually start the configuration wizard,
double-click Local Configuration in the results pane.
Tivoli Storage Manager configuration wizard
Use this wizard to configure Tivoli Storage FlashCopy Manager to manage snapshot
backups using a Tivoli Storage Manager server. This wizard is only available when a Tivoli
Storage Manager license is installed.
Once installed, Tivoli Storage FlashCopy Manager must be configured for VSS snapshot
backups. Use the local configuration wizard for that purpose. These tasks include selecting
the applications to protect, verifying requirements, provisioning, and configuring the
components required to support the selected applications.
Figure 2-11 Tivoli FlashCopy Manager: local configuration wizard for Exchange Server
2. A dialog window is displayed, as shown in Figure 2-12. Select the Exchange Server to
configure and click Next.
Note: The Show System Information button shows the basic information about your
host system.
Tip: Select the check box at the bottom if you do not want the local configuration wizard
to start automatically the next time that the Tivoli Storage FlashCopy Manager
Management Console windows starts.
Chapter 2. Tivoli Storage FlashCopy Manager and Volume Shadow Copy Services 53
3. The Requirements Check dialog window opens, as shown in Figure 2-13. At this stage,
the systems checks that all prerequisites are met.
If any requirement is not met, the configuration wizard does not proceed to the next step.
You may have to upgrade components to fulfill the requirements. The requirements check
can be run again by clicking Re-run once fulfilled. When the check completes
successfully, click Next.
Note: By default, details are hidden. Details can be seen or hidden by clicking Show
Details or Hide Details.
Chapter 2. Tivoli Storage FlashCopy Manager and Volume Shadow Copy Services 55
5. The completion window shown in Figure 2-15 is displayed. To run a VSS diagnostic check,
ensure that the corresponding check box is selected and click Finish.
6. The VSS Diagnostic dialog window is displayed. The goal of this step is to verify that any
volume that you select is indeed capable of performing an XIV snapshot using VSS. Select
the XIV mapped volumes to test, as shown in Figure 2-16, and click Next.
7. The VSS Snapshot Tests window is displayed, showing a status for each of the snapshots.
This dialog also displays the event messages when clicking Show Details, as shown in
Figure 2-17. When done, click Next.
8. A completion window is displayed with the results, as shown in Figure 3-25. When done,
click Finish.
Note: Microsoft SQL Server can be configured the same way as Microsoft Exchange
Server to perform XIV VSS snapshots for Microsoft SQL Server using Tivoli Storage
FlashCopy Manager.
Chapter 2. Tivoli Storage FlashCopy Manager and Volume Shadow Copy Services 57
To perform a VSS snapshot backup of Exchange data, we used the following setup:
Windows 2008 64bit
Exchange 2007 Server
XIV Host Attachment Kit 1.0.4
XIV VSS Provider 2.0.9
Tivoli Storage FlashCopy Manager 2.0
Tivoli Storage FlashCopy Manager was already configured and tested for XIV VSS snapshot,
as shown in 2.5, “Installing and configuring Tivoli Storage FlashCopy Manager for Microsoft
Exchange” on page 52. To review the Tivoli Storage FlashCopy Manager configuration
settings, use the command shown in Example 2-1.
Example 2-1 Tivoli Storage FlashCopy Manager for Mail: query DP configuration
C:\Program Files\Tivoli\TSM\TDPExchange>tdpexcc query tdp
BACKUPDESTination................... LOCAL
BACKUPMETHod........................ VSS
BUFFers ............................ 3
BUFFERSIze ......................... 1024
As explained earlier, Tivoli Storage FlashCopy Manger does not use (or need) a TSM server
to perform a snapshot backup. You can see this when you execute the query tsm command,
as shown in Example 2-2. The output does not show a TSM service but
FLASHCOPYMANAGER instead for the NetWork Host Name of Server field. Tivoli Storage
FlashCopy Manager creates a virtual server instead of using a TSM Server to perform a VSS
snapshot backup.
Example 2-3 shows what options have been configured and used for TSM Client Agent to
perform VSS snapshot backups.
Chapter 2. Tivoli Storage FlashCopy Manager and Volume Shadow Copy Services 59
CLUSTERnode NO
PASSWORDAccess Generate
*======================================================================*
* TCP/IP Communication Options *
*======================================================================*
COMMMethod TCPip
TCPSERVERADDRESS FlashCopymanager
TCPPort 1500
TCPWindowsize 63
TCPBuffSize 32
Before we can perform any backup, we must ensure that VSS is properly configured for
Microsoft Exchange Server and that the DSMagent service is running (Example 2-4).
STG3G_XIVG2_BAS
Circular Logging - Disabled
Replica - None
Recovery - False
2nd MailBox Online
Mail Box1 Online
Our test Microsoft Exchange Storage Group is on drive G:\ and it is called STG3G_XIVG2_BAS. It
contains two mailboxes:
Mail Box1
2nd MailBox
Now we can take a full backup of the storage group by executing the backup command, as
shown in Example 2-5.
Example 2-5 Tivoli Storage FlashCopy Manger: full XIV VSS snapshot backup
C:\Program Files\Tivoli\TSM\TDPExchange>tdpexcc backup STG3G_XIVG2_BAS full
Note that we did not specify a disk drive here. Tivoli Storage FlashCopy Manager finds out
which disk drives to copy with snapshot when doing a backup of a Microsoft Exchange
Storage Group. This is the advantage of an application-aware snapshot backup process.
Chapter 2. Tivoli Storage FlashCopy Manager and Volume Shadow Copy Services 61
To see a list of the available VSS snapshot backups issue a query command, as shown in
Example 2-6.
Example 2-6 Tivoli Storage FlashCopy Manger: query full VSS snapshot backup
C:\Program Files\Tivoli\TSM\TDPExchange>tdpexcc query TSM STG3G_XIVG2_BAS full
Querying FlashCopy Manager server for a list of database backups, please wait...
Backup List
-----------
To show that a restore operation is working, we deleted the 2nd Mailbox mail box, as shown
in Example 2-7.
To perform a restore, all the mailboxes must be unmounted first. A restore will be done at the
volume level, called instant restore (IR), then the recovery operation will run, applying all the
logs, and then mount the mail boxes, as shown in Example 2-8.
Example 2-8 Tivoli Storage FlashCopy Manager: VSS Full Instant Restore and recovery.
C:\Program Files\Tivoli\TSM\TDPExchange>tdpexcc Restore STG3G_XIVG2_BAS Full
/RECOVer=APPL
YALLlogs /MOUNTDAtabases=Yes
Starting snapshot restore process. This process may take several minutes.
C:\Program Files\Tivoli\TSM\TDPExchange>
Note: Instant restore is at the volume level. It does not show the total number of files
examined and completed like a normal backup process does.
To verify that the restore operation worked, open the Exchange Management Console and
check that the storage group and all the mailboxes have been mounted. Furthermore, verify
that the 2nd Mailbox.edb file exists.
See the Tivoli Storage FlashCopy Manager: Installation and User’s Guide for Windows,
SC27-2504, or Tivoli Storage FlashCopy Manager for AIX: Installation and User’s Guide,
SC27-2503, for more and detailed information about Tivoli Storage FlashCopy Manager and
its functions.
The latest information about the Tivoli Storage FlashCopy Manager is available on the Web
at:
http://www.ibm.com/software/tivoli
Chapter 2. Tivoli Storage FlashCopy Manager and Volume Shadow Copy Services 63
64 IBM XIV Storage System: Copy Services and Migration
3
After the XIV Storage System completes the setup of the pointers to the source data, a
background copy of the data is performed. The data is copied from the source volume to a
new area on the disk, and the pointers of the target volume are then updated to use this new
space. The copy operation is done in such a way as to minimize the impact to the system. If
the host performs an update before the background copy is complete, a redirect on write
occurs, which allows the volume to be readable and writable before the volume copy
completes.
If the sizes of the volumes differ, the size of the target volume is modified to match the source
volume when the copy is initiated. The resize operation does not require user intervention.
From the dialog box, select redbook_chris_01 and click OK. The system then asks you to
validate the copy action.
The XIV Storage System instantly performs the update process and displays a completion
message. When the copy process is complete, the volume is available for use.
To create a volume copy with the XCLI, the source and target volumes must be specified in
the command. In addition, the -y parameter must be specified to provide an affirmative
response to the validation questions. See Example 3-1.
VMware allows the resources of a server to be separated into logical virtual systems, each
containing its own OS and resources. When creating the configuration, it is extremely
important to have the hard disk assigned to the virtual machine to be a mapped raw LUN. If
the hard disk is a VMware File System (VMFS), the volume copy fails because there are
duplicate file systems in VMware. In Figure 3-3, the mapped raw LUN is the XIV volume that
was mapped to the VMware server.
A demonstration of the process is simple using VMware. Starting with the VMware resource
window, power off the virtual machines for both the source and the target. The summary
described in Figure 3-4 shows that both XIV Source VM (1), the source, and XIV Source VM
(2), the target, are powered off.
Looking at the XIV Storage System before the copy (Figure 3-5), xiv_vmware_1 is mapped to
the XIV Source VM (1) in VMware and has utilized 1 GB of space. This information shows that
the OS is installed and operational. The second volume, xiv_vmware_2, is the target volume
for the copy and is mapped to XIV Source VM (2) and is 0 in size. At this point, the OS has not
been installed on the virtual machine and thus the OS is not usable.
Because the virtual machines are powered off, simply initiate the copy process as just
described.
Selecting xiv_vmware_1 as the source, copy the volume to the target xiv_vmware_2. The
copy completes immediately and is available for usage.
To verify that the copy is complete, the used area of the volumes must match, as shown in
Figure 3-6.
Figure 3-8 shows the second virtual machine console with the Windows operating system
powered on.
Remote Mirroring can be a synchronous copy solution where write operations are completed
on both copies (local and remote sites) before they are considered to be complete (see
Chapter 5, “Synchronous Remote Mirroring” on page 125). This type of remote mirroring is
normally used for short distances to minimize the effect of I/O delays inherent to the distance
to the remote site.
Remote Mirroring can also be an asynchronous solution were consistent sets of data are
copied to the remote location at specified intervals and host I/O operations are complete after
writing to the primary (see Chapter 6, “Asynchronous Remote Mirroring” on page 149). This
is typically used for long distances between sites.
Note: For asynchronous mirroring over iSCSI links, a reliable, dedicated network must be
available. It requires consistent network bandwidth and a non-shared link.
Unless otherwise noted, this chapter describes the basic concepts, functions, and terms that
are common to both XIV synchronous and asynchronous mirroring.
XIV remote mirroring is application and operating system independent, and does not require
server processor cycle usage.
Host Server
2
4
1. Host Write to Master XIV 3
(data placed in cache of 2
Modules)
Local XIV Remote XIV
2. Master replicates to Slave (Master)
XIV (data placed in cache of (Slave)
2 Modules)
3. Slave acknowledges write
complete to Master
4. Master acknowledges write
complete to application
Application Server
3
2
4
1. Host Write to Master XIV
(data placed in cache of 2
Modules)) Local XIV
(Master) Remote XIV
2. Master acknowledges write (Slave)
complete to application
3. Master replicates to Slave
4. Slave acknowledges write
complete
XIV System B
XIV System A
Storage
Pool
Mirrored CG
Slave
XIV System D
XIV System C
Mirrored Vol
Slave
Storage S tora ge
Pool Po o l
Up to 16 targets can be referenced by a single system. A system can host replication sources
and separate replication targets simultaneously.
Figure 4-3 illustrates possible schemes for how mirroring can be configured.
Important: A single XIV can contain both master volumes and CGs (mirroring to another
XIV) and slave volumes and CGs (mirroring from another XIV). Peers in a master role and
peers in a slave role on the same XIV system must belong to different mirror couplings.
Consistency group
With mirroring (synchronous or asynchronous), the major reason for consistency groups is to
handle a large number of mirror pairs as a group (mirrored volumes are consistent). Instead
of dealing with many volume remote mirror pairs individually, consistency groups simplify the
handling of many pairs considerably.
Important: If your mirrored volumes are in a mirrored consistency group you cannot do
mirroring operations like deactivate or change_role on a single volume basis. If you want to
do this, you must remove the volume from the consistency group (refer to “Removing a
volume from a mirrored consistency group” on page 132 or “Removing a volume from a
mirrored consistency group” on page 159).
Consistency groups also play an important role in the recovery process. If mirroring was
suspended (for example, due to complete link failure), data on different slave volumes at the
remote XIV are consistent. However, when the links are up again and resynchronization is
started, data spread across several slave volumes is not consistent until the master state is
synchronized. To preserve the consistent state of the slave volumes, the XIV system
automatically creates a snapshot of each slave volume and keeps it until the remote mirror
volume pair is synchronized (the snapshot is kept until all pairs are synchronized in order to
enable restoration to the same consistent point in time). If the remote mirror pairs are in a
consistency group, then the snapshot is taken for the whole group of slave volumes and the
snapshots are preserved until all pairs are synchronized. Then the snapshot is deleted
automatically.
Link status
The link status reflects the connection from the master to the slave volume or CG. A link has
a direction (from local site to remote or vice versa). A failed link or a failed secondary system
both result in a link error status. The link state is one of the factors determining the mirror
operational status. Link states are as follows:
OK: link is up and functioning
Error: link is down
Figure 4-5 and Figure 4-6 depict how the link status is reflected in the XIV GUI, respectively.
If there are several links (at least two) in one direction and one link fails, this usually does not
affect mirroring as long as the bandwidth of the remaining link is high enough to keep up with
the data traffic.
The synchronization status reflects the consistency of the data between the master and slave
volumes. Because the purpose of the remote mirroring feature is to ensure that the slave
volumes are an identical copy of the master volumes, this status indicates whether this
objective is currently being achieved.
The XIV keeps track of the partitions that have been modified on the master volumes and
when the link is operational again or the remote mirroring is reactivated. These changed
partitions can be sent to the remote XIV and applied to the slave volumes there.
Remote Mirroring
Using snapshots
Snapshots can be used with Remote Mirroring to provide copies of production data for
business or IT purposes. Moreover, when used with Remote Mirroring, snapshots provide
protection against data corruption.
Remote Mirroring
Point in Time
Copy
IBMS t St TM 8
Figure 4-13 Combining snapshots with Remote Mirroring
Note that recovery using a snapshot warrants deletion and recreation of the mirror.
XIV snapshot (within a single XIV system)
Protection for the event of software data corruption can be provided by a point-in-time
backup solution using the XIV snapshot function within the XIV system that contains the
production volumes. Figure 4-14 shows a single-system point-in-time online backup
configuration.
Target
M
Figure 4-17 One-to-one target configuration
During normal remote mirroring operation, one XIV system (at the DR site) will be active
as a mirroring target. The other XIV system (at the local production site) will be active as a
mirroring target only when it becomes available again after an outage and switch of
production to the DR site. Changes made while production was running at the DR site are
copied back to the original production site, as shown in Figure 4-18.
Target
M
Figure 4-18 Copying changes back to production
Target
Target
Figure 4-20 Fan-out target configuration
Target
Target
Target
S Target
Target M
9 FC SAN 9
8 8
7 7
6 6
Data,
5 , FC SAN Data,
5 ,
Mgt
4
Data, , Mgt
4
Data, ,
Mgt Mgt
In Figure 4-24, the solid lines represent mirroring connections used during normal
operation (the mirroring target system is on the right), and the dotted lines represent
mirroring connections used when production is running at the disaster recovery site and
changes are being copied back to the original production site (mirroring target is on the
left.)
XIV Fibre Channel ports may be easily and dynamically configured as initiator or target
ports.
iSCSI ports
For iSCSI ports, connections are bi-directional.
Use a minimum of two connections (with each of these ports in a different module) using a
total of four ports to provide availability protection. In Figure 4-25 on page 92, the solid
lines represent data flow during normal operation and the dotted lines represent data flow
when production is running at the disaster recovery site and changes are being copied
back to the original production site.
9
8
IP Network 9
8
7 7
Data, , Data, ,
IP Network
DMatgat , , DMatgat , ,
Mgt Mgt
Note: For asynchronous mirroring over iSCSI links, a reliable, dedicated network must
be available. It requires consistent network bandwidth and a non-shared link.
Before discussing actions involved in creating mirroring pairs, we must introduce the basic
XIV concepts used in the discussion.
An XIV volume is a logical volume that is presented to an external server as a logical unit
number (LUN). An XIV volume is allocated from logical and physical capacity within a single
XIV storage pool. The physical capacity on which data for an XIV volume is stored is always
spread across all available disk drives in the XIV system
The XIV system is data aware. It monitors and reports the amount of physical data written to
a logical volume and does not copy any part of the volume that has not been used yet to store
any actual data.
40TB
Storage
Pool
With Remote Mirroring, the concept of consistency group represents a logical container for a
group of volumes, allowing them to be managed as a single unit. Instead of dealing with many
volume remote mirror pairs individually, consistency groups simplify the handling of many
pairs considerably.
An XIV consistency group exists within the boundary of an XIV storage pool in a single XIV
system (in other words, you can have different CGs in different storage pools within an XIV
storage system, but a CG cannot span multiple storage pools). All volumes in a particular
consistency group are in the same XIV storage pool.
In Figure 4-27, an XIV storage pool with 40 TB capacity contains seven logical volumes. One
consistency group has been defined for the XIV storage pool, but no volumes have been
added to or created in the consistency group.
40TB CG
Storage
Pool
Volumes may be easily and dynamically (that is, without stopping mirroring or application
I/Os) added to a consistency group.
40TB CG
Storage
Pool
Volumes may also be easily and dynamically removed from an XIV consistency group. In
Figure 4-29, one of the five volumes has been removed from the consistency group, leaving
four volumes remaining in the consistency group. It is also possible to remove all volumes
from a consistency group.
40TB CG
Storage
Pool
2) Update Record
DB
1) Intend to update DB
3) DB updated Log
2) Update Record
x DB
1) Intend to update DB
3) DB updated Log
Just as the application or database manages dependent write consistency for the production
volumes, the XIV system must manage dependent write consistency for the mirror target
volumes.
XIV also supports creation of application-consistent data in the remote mirroring target
volumes, as discussed 4.5.4, “Creating application-consistent data at both local and the
remote sites” on page 109.
The two peers in the mirror coupling may be either two volumes (volume peers) or two
consistency groups (CG peers), as shown in Figure 4-32.
SITE 1 SITE 2
Production DR Test/Recovery Servers
Volume
Coupling/Mirror
M Defined S
Volume
Volume Peer Coupling/Mirror Volume Peer
Designated M Defined S Designated
Primary Volume Secondary
Coupling/Mirror
M Defined S
CG
Consistency Group Peer Coupling/Mirror Consistency Group Peer
Primary Designation (P) P/M Defined S/S Secondary Designation (S)
Master Role (M) Slave Role (S)
Each of the two peers in the mirroring relationship is given a designation and a role. The
designation indicates the original or normal function of each of the two peers—either primary
or secondary. The peer designation does not change with operational actions or commands.
(If necessary, the peer designation may be changed by explicit user command or action.)
The role of a peer indicates its current (perhaps temporary) operational function (either
master or slave). The operational role of a peer may change as the result of user commands
or actions. Peer roles typically change during DR testing or a true disaster recovery and
production site switch.
When a mirror coupling is created, the first peer specified (for example, the volumes or CG at
site 1, as shown in Figure 4-32) is the source for data to be replicated to the target system, so
it is given the primary designation and the master role.
The second peer specified (or automatically created by the XIV system) when the mirroring
coupling is created is the target of data replication, so it is given the secondary designation
and the slave role.
Initialization may take a significant amount of time if a large amount of data exists on the
master when a mirror coupling is activated. As discussed earlier, the rate for this initial copy of
data can be specified by the user. The speed of this initial copy of data will also be affected by
the connectivity and bandwidth (number of links and link speed) between the XIV primary and
secondary systems.
As an option to remove the impact of distance on initialization, XIV mirroring may be initialized
with the target system installed locally, and the target system may be disconnected after
initialization, shipped to the remote site and reconnected, and mirroring reactivated.
If a remote mirroring configuration is set up when a volume is first created (that is, before any
application data has been written to the volume), initialization will be very quick.
When an XIV consistency group mirror coupling is created, the CG must be empty so there is
no data movement and the initialization process is extremely fast.
The mirror coupling status at the end of initialization differs for XIV synchronous mirroring and
XIV asynchronous mirroring (see “Synchronous mirroring states” on page 80 and “Storage
pools, volumes, and consistency groups” on page 92), but in either case, when initialization is
complete, a consistent set of data exists at the remote site. See Figure 4-33.
SITE 1 SITE 2
Production DR Test/Recovery Servers
Volume
Coupling/Mirror
M Active S
Volume
Volume Peer Coupling/Mirror Volume Peer
Designated M Active S Designated
Primary Volume Secondary
Coupling/Mirror
M Active S
CG
Consistency Group Peer Coupling/Mirror Consistency Group Peer
Primary Designation (P) P/M Active S/S Secondary Designation (S)
Master Role (M) Slave Role (S)
In Figure 4-34, three active volume couplings that have completed initialization have been
moved into the active mirrored consistency group.
SITE 1 SITE 2
Production DR Test/Recovery Servers
P/M S/S
CG
Consistency Group Peer Coupling/Mirror Consistency Group Peer
Primary Designation (P) Active Secondary Designation (S)
Master Role (M) Slave Role (S)
One or more additional mirrored volumes may be added to a mirrored consistency group at a
later time in the same way.
It is also important to realize that in a CG all volumes have the same role. Also, consistency
groups are handled as a single entity and, for example, in asynchronous mirroring, a delay in
replicating a single volume affects the status of the entire CG.
Normal operation, statuses, and reporting differ for XIV synchronous mirroring and XIV
asynchronous mirroring. Refer to Chapter 5, “Synchronous Remote Mirroring” on page 125,
and Chapter 6, “Asynchronous Remote Mirroring” on page 149, for details.
SITE 1 SITE 2
Production Servers DR Test/Recovery Servers
Remote Target
Volume
Volume Peer Coupling/Mirror Volume Peer
Designated Primary M Active S Designated Secondary
Master Role Slave Role
CG
Coupling/Mirror
CG Peer Active CG Peer
Designated Primary Designated Secondary
Master Role M S Slave Role
Figure 4-35 Normal operations: volume mirror coupling and CG mirror coupling
SITE 1 SITE 2
Production Servers DR Test/Recovery Servers
Volume
Volume Peer Coupling/Mirror Volume Peer
Designated Primary M Standby S Designated Secondary
Master Role Master Role
CG
Coupling/Mirror
CG Peer Standby CG Peer
Designated Primary Designated Secondary
Master Role M S Master Role
During standby mode, a consistent set of data is available at the remote site (site 2, in our
example). The currency of the consistent data ages in comparison to the master volumes,
and the gap increases while mirroring is in standby mode.
Note that in asynchronous mirroring, metadata is not used and the comparison between the
most_recent and last_replicated snapshots indicates the data that must be replicated.
Planned deactivation of XIV remote mirroring may be done to suspend remote mirroring
during a planned network outage or DR test, or to reduce bandwidth during a period of peak
load.
SITE 1 SITE 2
Production Servers DR Test/Recovery Servers
Volume
Volume Peer Coupling/Mirror Volume Peer
Designated Primary M Standby M Designated Secondary
Master Role Master Role
CG
Coupling/Mirror
CG Peer Standby CG Peer
Designated Primary Designated Secondary
Master Role M M Master Role
Changing the role of a volume from slave to master allows the volume to be accessed. In
synchronous mirroring, changing the role also starts metadata recording for any changes
made to the volume. This metadata may be used for resynchronization (if the new master
volume remains the master when remote mirroring is reactivated). In asynchronous mirroring,
changing a peer's role automatically reverts the peer to its last_replicated snapshot.
When mirroring is in standby mode, both volumes may have the master role, as shown in the
following section. When changing roles, both peer roles must be changed if possible (the
exception being a site disaster or complete system failure). Changing the role of a slave
volume or CG is typical during a true disaster recovery and production site switch.
In synchronous mirroring, changing a peer role from master to slave allows the slave to
accept mirrored data from the master and cause deletion of metadata that was used to record
any changes while the peer had the master role.
In asynchronous mirroring, changing a peer's role automatically reverts the peer to its
last_replicated snapshot. If at any point in time the command is run on the slave (changing
the slave to a master), the former master must first be changed to the slave role (upon
recovery of the primary site) before changing the secondary role back from master to slave.
Both peers may temporarily have the master role when a failure at site 1 has resulted in a true
disaster recovery production site switch from site 1 to site 2. When site 1 becomes available
again and there is a requirement to switch production back to site 1, the production changes
made to the volumes at site 2 must be resynchronized to the volumes at site 1. In order to do
this, the peers at site 1 must change their role from master to slave, as shown in Figure 4-38.
SITE 1 SITE 2
Production Servers DR Test/Recovery Servers
Volume
Volume Peer Coupling/Mirror Volume Peer
Designated Primary S Standby M Designated Secondary
Slave Role Master Role
CG
Coupling/Mirror
CG Peer Standby CG Peer
Designated Primary Designated Secondary
Slave Role S M Master Role
SITE 1 SITE 2
Production Servers DR Test/Recovery Servers
Remote Target
Volume
Volume Peer Coupling/Mirror Volume Peer
Designated Primary M Active S Designated Secondary
Master Role Slave Role
CG
Coupling/Mirror
CG Peer Active CG Peer
Designated Primary Designated Secondary
Master Role M S Slave Role
The rate for this resynchronization of changes can be specified by the user in MBps using the
XCLI target_config_sync_rates command.
When XIV mirroring is reactivated in the normal direction, changes recorded at the primary
peers are copied to the secondary peers.
SITE 1 SITE 2
Production Servers DR Test/Recovery Servers
Remote Target
Volume
Volume Peer Coupling/Mirror Volume Peer
Designated Primary S Active
M Designated Secondary
Slave Role Master Role
CG
Coupling/Mirror
CG Peer Active CG Peer
Designated Primary Designated Secondary
Slave Role S M Master Role
A typical usage example of this scenario is when returning to the primary site after a true
disaster recovery with production switched to the secondary peers at the remote site.
To add a volume mirror to a mirrored consistency group (for instance, when an application
needs additional capacity):
1. Define XIV volume mirror coupling from the additional master volume at XIV 1 to the slave
volume at XIV 2.
2. Activate XIV remote mirroring from the additional master volume at XIV 1 to the slave
volume at XIV 2.
3. Monitor initialization until it is complete. Volume coupling initialization must be complete
before the coupling can be moved to a mirrored CG.
4. Add the additional master volume at XIV 1 to the master consistency group at XIV 1. (The
additional slave volume at XIV 2 will be automatically added to the slave consistency
group at XIV 2.)
In Figure 4-41, one volume has been added to the mirrored XIV consistency group. The
volumes must be in a volume peer relationship and must have completed initialization
SITE 1 SITE 2
Production DR Test/Recovery Servers
M/P S/S
CG
Coupling/Mirror
Active
Consistency Group Peer Consistency Group Peer
Primary Designation (P) Secondary Designation (S)
Master Role (M) Slave Role (S)
Refer also to 4.4.4, “Defining the XIV mirror coupling and peers: volume” on page 92, and
4.4.6, “Adding volume mirror coupling to consistency group mirror coupling” on page 97, for
additional details.
In Figure 4-42, one volume has been removed from the example mirrored XIV consistency
group with three volumes. After being removed from the mirrored CG, a volume will continue
to be mirrored as part of a volume peer relationship.
Site 1 Site 2
Production DR Test/Recovery Servers
Volume
Coupling/Mirror
P/M Active S/S
Volume
Coupling/Mirror
P/M Active S/S
P/M S/S
CG
Consistency Group Peer Coupling/Mirror Consistency Group Peer
Primary Designation (P) Active Secondary Designation (S)
Master Role (M) Slave Role (S)
Site 1 Site 2
Production Servers DR Test/Recovery Servers
Typical usage of mirror deletion is a one-time data migration using remote mirroring. This
includes deleting the XIV mirror couplings after the migration is complete.
Site 1 Site 2
Production Servers DR Test/Recovery Servers
Target
XIV 1 XIV 2
Volume
Volume Peer Coupling/Mirror Volume Peer
Designated Primary M Active S Designated Secondary
Master Role Slave Role
CG
Coupling/Mirror
CG Peer Active CG Peer
Designated Primary Designated Secondary
Master Role M S Slave Role
4.5.4 Creating application-consistent data at both local and the remote sites
This scenario begins with normal operation of XIV remote mirroring from XIV 1 to XIV 2. This
scenario may be used when the fastest possible application restart is required.
1. No actions are taken to change XIV remote mirroring.
2. Briefly quiesce the application at XIV 1 or place the database into hot backup mode.
3. Ensure that all data has been copied from the master peer at XIV 1 to the slave peer at
XIV 2.
4. Issue Create Mirrored Snapshot at the master peer. This creates an additional snapshot
at the master and slave.
5. Resume normal operation of the application or database at XIV 1.
6. Unlock the snapshot or volume copy.
7. Map the snapshot/volume copy to DR servers at XIV 2.
8. Bring the snapshot or volume copy at XIV 2 online to XIV 2 servers to begin disaster
recovery testing or other functions at XIV 2.
9. When DR testing or other use is complete, unmap the snapshot/volume copy from XIV 2
DR servers.
10.Delete the snapshot/volume copy if desired.
4.5.5 Migration
A migration scenario involves a one-time movement of data from one XIV system to another
(for example, migration to new XIV hardware.) This scenario begins with existing connectivity
between XIV 1 and XIV 2.
1. Define XIV remote mirroring from the master volume at XIV 1 to the slave volume at XIV 2.
2. Activate XIV remote mirroring from the master volume at XIV 1 to the slave volume at XIV
2.
3. Monitor initialization until it is complete.
4. Deactivate XIV remote mirroring from the master volume at XIV 1 to the slave volume at
XIV 2.
5. Delete XIV remote mirroring from the master volume at XIV 1 to the slave volume at XIV 2.
4.6 Planning
The most important planning considerations for XIV Remote Mirroring are those related to
ensuring availability and performance of the mirroring connections between XIV systems, as
well as the performance of the XIV systems. Planning for snapshot capacity usage is also
extremely important.
To optimize availability, XIV remote mirroring connections must be spread across multiple
ports on different adapter cards in different modules, and must be connected to different
networks.
To optimize capacity usage, the number and frequency of snapshots (both those required for
asynchronous replication and any additional user-initiated snapshots) and the workload
change rates must be carefully reviewed. If not enough information is available, a snapshot
area that is 30% of the pool size may be used as a starting point. Storage pool snapshot
usage thresholds must be set to trigger notification (for example, SNMP, e-mail, SMS) when
the snapshot area capacity reaches 50%, and snapshot usage must be monitored continually
to understand long-term snapshot capacity requirements.
Thresholds for RPO and for link disruption may be specified by the user and trigger an event
when the threshold is reached.
Performance statistics from the FC or IP network components are also extremely useful.
4.10 Boundaries
With Version 10.2, the XIV Storage System has the following boundaries or limits:
Maximum remote systems: The maximum number of remote systems that can be
attached to a single primary is 16.
Number of remote mirrors: The combined number of master and slave volumes (including
in mirrored CG) cannot exceed 512.
Distance: Distance is only limited by the response time of the medium used. Use
asynchronous mirroring when the distance causes unacceptable delays to the host I/O in
synchronous mode.
Consistency groups are supported within Remote Mirroring. The maximum number of
consistency groups is 256.
Remote Mirroring can be set up on paths that are either direct or SAN attached via FC or
iSCSI protocols. For most disaster recovery solutions, the secondary system will be located at
a geographically remote site. The sites will be connected using either SAN connectivity with
Fibre Channel Protocol (FCP) or Ethernet with iSCSI. In certain cases, using direct connect
might be the option of choice if the machines are located near each other and could be used
for initialization before the target XIV Storage System is moved to the remote site.
Bandwidth considerations must be taken into account when planning the infrastructure to
support the Remote Mirroring implementation. Knowing when the peak write rate occurs for
systems attached to the storage will help with the planning for the number of paths needed to
support the Remote Mirroring function and any future growth plans.
When the protocol has been selected, it is time to determine which ports on the XIV Storage
System will be used. The port settings are easily displayed using the XCLI Session
environment and the command fc_port_list for Fibre Channel or ipinterface_list for
iSCSI.
There must always be a minimum of two paths configured within Remote Mirroring for FCP
connections, and these paths must be dedicated to Remote Mirroring. These two paths must
be considered a set. Use port 4 and port 2 in the selected interface module for this purpose.
For redundancy, additional sets of paths must be configured in different interface modules.
Fibre Channel paths for Remote Mirroring have slightly more requirements for setup, and we
look at this interface first.
The iSCSI connections are shown in Example 4-2 using the command ipinterface_list.
The output has been truncated to show just the iSCSI connections in which we are interested
here. The command also displays all Ethernet connections and settings. In this example we
have two connections displayed for iSCSI—one connection in module 7 and one connection
in module 8.
>> ipinterface_list
Name Type IP Address Network Mask Default Gateway MTU Module Ports
itso_m8_p1 iSCSI 9.11.237.156 255.255.254.0 9.11.236.1 4500 1:Module:8 1
itso_m7_p1 iSCSI 9.11.237.155 255.255.254.0 9.11.236.1 4500 1:Module:7 1
Click the connecting links between the systems of interest to view the ports.
Right-click a specific port and select Properties, the output of which is shown in Figure 4-46.
This particular port is configured as a target.
Another way to query the port configuration is to select the desired system, click the curved
arrow (at the bottom right of the window) to display the ports on the back of the system, and
Similar information can be displayed for the iSCSI connections using the GUI, as shown in
Figure 4-48. This view can be seen either by right-clicking the Ethernet port (similar to the
Fibre Channel port shown in Figure 4-47) or by selecting the system, then selecting Hosts
and LUNs iSCSI Connectivity. This sequence displays the same two iSCSI definitions
that are shown with the XCLI command.
By default, Fibre Channel ports 2 and 4 (target and initiator, respectively) from every module
are designed to be used for Remote Mirroring. For example, port 4 module 8 (initiator) on the
local machine is connected to port 2 module 8 (target) on the remote machine. When setting
up a new system, it is best to plan for any Remote Mirroring and reserve these ports for that
purpose. However different ports could be used as needed.
fc_port_list
Component ID Status Currently Functioning WWPN Port ID Role
1:FC_Port:4:3 OK yes 5001738000130142 00750029 Initiator
To perform the same function with the GUI, select the primary system, open the patch panel
view, and right-click the port, as shown in Figure 4-49.
Planning for Remote Mirroring is important when determining how many copy pairs will exist.
All volumes defined in the system can be mirrored. A single primary system is limited to a
maximum of 16 secondary systems. Volumes cannot be part of an XIV data migration and a
remote mirror volume at the same time. Data migration information can be found in Chapter 7,
“Data migration” on page 185.
Repeat the same process to define the local (primary) XIV system as a target for the
secondary XIV system.
Next, as shown in Figure 4-53, connections are defined by clicking the line between the two
XIV systems to display the link status detail screen.
Connections can also be defined by clicking a port on the primary system and dragging the
the corresponding port on the target system. This is shown as a blue line in Figure 4-55.
Releasing the mouse button initiates the connection and then the status can be displayed, as
shown in Figure 4-56.
Deleting the connections between two XIV systems is done from the Mirror Connectivity
display. Right-click the connecting links and select Delete, as illustrated in Figure 4-58.
XCLI commands can also be used to delete the connectivity between the primary XIV System
and the secondary XIV system (Figure 4-60).
target_connectivity_delete local_port="1:FC_Port:8:4"
fcaddress=50017380014B0181 target="WSC_1300331"
target_port_delete fcaddress=50017380014B0181 target="WSC_1300331"
target_connectivity_delete local_port="1:FC_Port:8:4"
fcaddress=50017380027F0180 target="WSC_6000639"
target_port_delete fcaddress=50017380027F0180 target="WSC_6000639"
target_connectivity_delete local_port="1:FC_Port:9:4"
fcaddress=50017380014B0191 target="WSC_1300331"
target_port_delete fcaddress=50017380014B0191 target="WSC_1300331"
target_connectivity_delete local_port="1:FC_Port:9:4"
fcaddress=50017380027F0190 target="WSC_6000639"
Note: GUI and XCLI illustrations included in this chapter were created with an early
version of the 10.2.1 code, available at the time of writing. There could be minor
differences with the code that was publicly released.
We assume that the links between the local and remote XIV storage systems have already
been established, as discussed in 4.11.2, “Remote mirror target configuration” on page 118.
When initially configured, one volume is considered the source (master role and resides at
the primary system) and the other is the target (slave role and resides at the secondary
system). This designation is associated with the volume and its XIV system and does not
change. During various operations the role may change (master or slave), but one system is
always the primary and the other is always the secondary.
To create a mirror you can use the XIV GUI or the XCLI.
To create a mirror:
1. Select Create Mirror, as shown in Figure 5-2, and specify the source volume or master for
the mirror pair (Figure 5-3 on page 127).
There are also other way to create a mirror pair. If you are in the Volumes and Snapshots
list panel you can right-click a volume and select Create Mirror there.
Tip: When working with the XCLI session or the XCLI command the windows look the
same and you could address the wrong XIV system with your command. Therefore, it
might be good to always first issue a config_get command to verify that you are talking to
the right XIV system.
To do this:
1. Open an XCLI session on the XIV at the local site (primary XIV) and run the
mirror_create command (Example 5-1).
3. On the secondary XIV, to list the couplings run the mirror_list command, as shown in
Example 5-3. Note that the status of Initializing is used when the coupling is in standby
(inactive) or initializing.
3. Repeat steps 1–2 until all required couplings are activated and are
synchronized/consistent.
2. On the primary XIV run the mirror_list command to see the status of the couplings
(Example 5-5).
3. On the secondary XIV run the mirror_list command to see the status of the couplings
(Example 5-6).
Setting a consistency group to be mirrored is done by first creating a consistency group, then
setting it to be mirrored, and only then populating it with volumes. A consistency group must
be created at the primary XIV and a corresponding consistency group at the secondary XIV.
The names of the consistency groups can be different. When creating a consistency group,
you also must specify the storage pool.
The Create Mirror dialog shown in Figure 5-10 is displayed. Be sure to specify the mirroring
parameters that match the volumes that will be part of that CG.
All volumes that you are going to add to the consistency group must be in that pool on the
primary XIV and in one pool at the secondary XIV. Adding a new volume pair to a mirrored
consistency group requires the volumes to be mirrored exactly as the other volumes within
this consistency group.
Important: All volumes that you want to add to a mirroring consistency group must be
defined in the same pool at the primary site and must be in one pool at the secondary site.
Also, mirrors for volumes must be activated before volumes can be added to a mirrored
consistency group.
It is possible to add a mirrored volume to a non-mirrored consistency group and have this
volume retain its mirroring settings.
Deletion
When a mirror pair (volume or consistency group) is inactive, the mirror relationship can be
deleted. When a mirror relationship has been deleted, the XIV forgets everything about the
relation. If you want to set up the mirror again, the XIV must do an initial copy again from the
source to the target.
Note that when the mirror is deleted, the slave volume becomes a normal volume again, but
the volume is locked, which means that it is write protected. To enable writing to the volume
go to the Volumes list panel. Right-click the volume and select Unlock.
The slave volume must also be formatted before it can be part of a new mirror. Formatting
also requires that all snapshots of that volume be deleted.
Switching roles must be initiated on the master volume or consistency group when remote
mirroring is operational. As the task name implies, it switches the master role to the slave role
and at the same time the slave role to the master role.
Changing roles can be performed at any time (when a pair is active or inactive) for the slave,
and for the master when the coupling is inactive. A change role reverts only the role of that
peer.
Normally, switching the roles requires shutting down the servers at the primary site first,
changing SAN zoning and XIV LUN masking to allow access to the secondary site volumes,
and then restarting the servers with access to the secondary (remote) XIV. However, in
certain clustered environments this takeover could be automated.
Assuming that the primary site is down and the secondary site will become the main
production site, changing roles is performed at the remote (now production) site first. Later,
when the primary site is up again and communication is reestablished you also change the
role at the primary site to a slave to be able to establish remote mirroring from the secondary
site back to the normal production (primary) site.
The new master volume or consistency group (at the secondary site) starts to accept write
commands from local hosts. Because coupling is not active, in the same way as for any
master volume, metadata maintain a record of which write operations must be sent to the
slave volume when communication resumes.
After changing the slave to the master, an administrator must change the original master to
the slave role before communication resumes. If both peers are left with the same role
(master), mirroring cannot be restarted.
Upon re-establishing the connection, the primary volume or consistency group (current slave
volume/CG) updates the secondary volume/CG (new master volume/CG) with this
If the link is resumed and both sides have the same role, the coupling will not become
operational. To solve this problem, the user must use the change role function on one of the
volumes and then activate the coupling.
Resynchronization can be performed in any direction given that one peer has the master role
and the other the slave role. When there is a temporary failure of all links from the primary XIV
to the secondary XIV, you re-establish the mirroring in the original direction after the links are
up again.
Also, if you suspended mirroring for a disaster recovery test at the secondary site, you might
want to reset the changes made to the secondary site during the tests and re-establish
mirroring from the primary to the secondary site.
If there was a disaster and production is now running on the secondary site, re-establish
mirroring first from the secondary site to the primary site and later on switch mirroring to the
original direction from the primary XIV to the secondary XIV.
In any case, the slave peers usually are in a consistent state up to the moment when
resynchronization starts. During the resynchronization process, the peers (volumes or
consistency group) are inconsistent. To preserve consistency, the XIV at the slave side
automatically creates a snapshot of the involved volumes or, in case of a consistency group, a
snapshot of the entire consistency group before transmitting any data to the slave volumes.
If there is a disaster at the primary (master) site, the snapshot taken at the secondary site can
be used to restore the slave volumes to a consistent state, ready for production.
Important: You must delete the mirror relation at the secondary site before you can restore
the last consistent snapshot to the target volumes.
Note: When using the XCLI commands quotation marks (“ “) must be used to enclose
names that include spaces. If they are used for names without spaces the command still
works. The examples in this scenario contain a mixture of commands with and without
quotation marks.
After the couplings have been created and activated, as explained under 5.1, “Synchronous
mirroring configuration” on page 126, the environment will be as illustrated in Figure 5-13.
Local Remote
Site Site
Production
W indows 2008
Active Inactive Standby
W indows 2008
Server Server
Data Flow
FC Link
FC Link
Data Mirroring
FC Link
Primary Secondary
XIV XIV
Local Remote
Site Site
Production Standby
Windows 2008 W indows 2008
Server Server
Data Flow
FC Link
FC Link
Primary Secondary
XIV XIV
2. To view the status of the coupling run the mirror_list command, as shown in
Example 5-8.
Example 5-8 shows that the synchronization status is still consistent for one of the
couplings that is yet to be changed. This is because this reflects the last known state.
When the role is changed, the coupling is automatically deactivated.
3. Repeat steps 1–2 to change roles on other volumes.
Local Remote
Site Site
Production Standby
Windows 2008
Server
Active Windows 2008
Server
Data Flow
Data Flow
FC Link FC Link
Primary Secondary
XIV XIV
Local Remote
Site Site
Production Standby
Windows 2008
Server
Down Active Windows 2008
Server
Data Flow
FC Link
FC Link
Mirroring Inactive
FC Link
Primary Secondary
XIV XIV
Figure 5-19 Change master volumes to slave volumes on the primary XIV
Once you have confirmed, the role is changed to slave, as shown in Figure 5-21.
3. Repeat steps 1–2 for all the volumes that must be changed.
Example 5-9 Change master volumes to slave volumes on the primary XIV
>> mirror_change_role vol=itso_win2008_vol2
Warning: ARE_YOU_SURE_YOU_WANT_TO_CHANGE_THE_PEER_ROLE_TO_SLAVE Y/N: Y
Command executed successfully.
2. To view the status of the coupling run the mirror_list command, as shown in
Example 5-10.
2. On the primary XIV go to the Remote Mirroring menu to check the statuses of the
couplings (Figure 5-24). Note that due to the time lapse between Figure 5-23 and
Figure 5-24 being taken they do show different statuses.
3. Repeat steps 1–2 until all required couplings are reactivated and synchronized.
2. On the secondary XIV run the mirror_list command to see the status of the couplings,
as illustrated in Example 5-12.
3. On the primary XIV run the mirror_list command to see the status of the couplings, as
shown in Example 5-13.
Local Remote
Site Site
Production Standby
Windows 2008 Down Active Windows 2008
Server Server
Data Flow
FC Link
FC Link
Data Mirroring
FC Link
Primary Secondary
XIV XIV
3. You are prompted for confirmation. Select OK. Refer to Figure 5-27 and Figure 5-28 on
page 147.
4. Go to the Remote Mirroring menu on the primary XIV and check the status of the coupling.
It must show the peer volume as a master volume (Figure 5-29).
5. Reassign volumes back to the production server at the primary site and power it on again.
Continue to work as normal. Figure 5-30 on page 148 shows that all the new data is now
back at the primary (local) site.
Example 5-14 Switch from master volume to slave volume on secondary XIV
>> mirror_switch_roles vol=itso_win2008_vol2
Command executed successfully.
3. On the secondary XIV, to list the mirror coupling run the mirror_list command
(Example 5-15).
4. On the primary XIV run the mirror_list command to list the mirror couplings, as shown
in Example 5-16.
Figure 5-30 Production server with mirrored data reassigned at the local site
Local Remote
Site Site
Production Standby
W indows 2008
Server
Active Inactive W indows 2008
Server
Data Flow
FC Link
FC Link
Data Mirroring
FC Link
Primary Secondary
XIV XIV
Asynchronous mirroring enables replication between two XIV volumes or consistency groups
(CG) that does not suffer from the latency inherent to synchronous mirroring, thereby yielding
better system responsiveness and offering greater flexibility for implementing disaster
recovery solutions.
Note: GUI and XCLI illustrations included in this chapter were created with an early
version of the 10.2.1 code, available at the time of writing. There could be minor
differences with the code that was publicly released.
We assume that the links between the local and remote XIV storage systems have already
been established, as discussed in 4.11.2, “Remote mirror target configuration” on page 118.
When initially configured, one volume is considered the source (resides at the primary
system) and the other is the target (resides at the secondary system). This designation is
associated with the volume and its XIV system and does not change. During various
operations the role may change (master or slave) but one system is always the primary and
the other is always the secondary.
Asynchronous mirroring is initiated at defined intervals. This is the sync job schedule. A sync
job entails synchronization of data updates recorded on the master since the last successful
synchronization. The sync job schedule will be defined for both the primary and secondary
system peers in the mirror. This provides a schedule for each peer and will be used when the
peer takes on the role of master. The purpose of the schedule specification on the slave is to
set a default schedule for an automated failover scenario.
A schedule set as NEVER means that no sync jobs will be automatically scheduled. See 6.6,
“Detailed asynchronous mirroring process” on page 173.
The XIV GUI automatically creates schedules based on the RPO selected for the mirror being
created. The interval can be set in the Mirror Properties panel or must be explicitly specified
through the XCLI.
Tip: XIV allows you to set a specific RPO and schedule interval for each mirror coupling.
Slave volumes must be formatted before they are configured as part of a mirror. This means
that the volume must not have any snapshots and must be unlocked.
To create a mirror you can use the XIV GUI or the XCLI. Both methods are illustrated in the
following sections.
Then specify Sync Type as Async, select the slave peer (volume or CG), and specify an RPO
value. Set the Schedule Management field to XIV Internal to create automatic synchronization
using scheduled sync jobs, as shown in Figure 6-2.
When creating a mirror, the slave peer (volume or CG) can also be created automatically on
the target XIV System. To do this, select Create Slave and specify the slave pool name and
the slave volume or CG name, as shown in Figure 6-3.
If schedule type External is selected when creating a mirror, no sync jobs will run for this
mirror and the interval will be set to Never, as illustrated in Figure 6-4.
The Mirroring panel shows the current status of the mirrors. The synchronization of the mirror
must to be initiated manually using the Activate action, as seen in Figure 6-7 on page 155.
Notice, as shown in Figure 6-6, that the selected RPO is displayed for the mirrors created.
Note that mirrors of the sync type do not have an RPO value.
Tip: When working with the XCLI session or the XCLI command the windows look the
same and you could address the wrong XIV system with your command. Therefore, it
might be good to always first issue a config_get command to verify that you are talking to
the right XIV system.
Example 6-1 illustrates the use of XCLI commands to set up a mirror volume.
As seen in Figure 6-8, the Mirror panel now shows the status of the active mirrors as RPO
OK. All the async mirrors have the same mirroring status. Note that Sync Mirrors show the
status as synchronized.
IBM XIV Storage System leverages its consistency group capability to allow for mirroring
numerous volumes at once. The system creates snapshots of the master consistency groups
at user-configured intervals and synchronizes these point-in-time snapshots with the slave.
Setting the consistency group to be mirrored is done by first creating a consistency group,
then setting it to be mirrored, and only then populating it with volumes. A consistency group
must be created at the primary XIV and a corresponding consistency group at the secondary
XIV. The names of the consistency groups can be different. When creating a consistency
group, you also must specify the storage pool.
All volumes that you are going to add to the consistency group must be in that pool on the
primary XIV and in one pool at the secondary XIV. Adding a new volume pair to a mirrored
Important: All volumes that you want to add to a mirroring consistency group must be
defined in the same pool at the primary site and must be in one pool at the secondary site.
It is possible to add a mirrored volume to a non-mirrored consistency group and have this
volume retain its mirroring settings.
To create a mirrored consistency group first create a CG on the primary and secondary XIV
Storage System. Then select the primary CG and specify Create Mirror (Figure 6-9).
The consistency group must not contain any volume when you create the mirror, and be sure
to specify mirroring parameters that match the volumes that will be part of this CG, as shown
in Figure 6-10. The status of the new mirrored CG is now displayed in the Mirroring panel.
Also, mirrors for volumes must be activated before volumes can be added to a mirrored
consistency group. This activation results in the initial copy being completed and sync jobs
being run to create the special last_replicated snapshots (refer to Figure 6-7 on page 155).
As seen in Figure 6-11, the Mirror panel now shows the status of the active mirrors as RPO OK.
All the async mirrors and the mirrored CG have the same mirroring status. Note that sync
mirrors shows the status as synchronized.
To add volumes to the mirrored CG the mirroring parameters must be identical, including the
last_replicated timestamps. The RPO and schedule will be changed to match the values set
for the mirrored consistency group. The volumes must have the same status (RPO OK). It is
possible that during the process the status may change or the last_replicated timestamp may
not be updated yet. If an error occurs verify the status and repeat the operation.
The Mirroring panel now shows the consistency group as a group of volumes, all with the
same status for both the master CG (Figure 6-14) and the slave CG (Figure 6-15 on
page 159).
The Consistency Groups panel shows the last_replicated snapshots, and if the sync job is
currently running there will be a most_recent snapshot, as can be seen in Figure 6-16.
>> sync_job_list
Job Object Local Peer Source Target State Part of CG
Job Type
Volume HS1_1 last-replicated-HS1_1 most-recent-HS1_1 active yes
scheduled
Volume HS1_2 last-replicated-HS1_2 most-recent-HS1_2 active yes
scheduled
Volume HS1_3 last-replicated-HS1_3 most-recent-HS1_3 active yes
scheduled
CG HS_Pool1_CG last-replicated-HS1_1 most-recent-HS1_1 active no
scheduled
The mirroring is terminated by deactivating the mirror and is required for the following actions:
Terminating or deleting the mirroring
Stopping the mirroring process
– For a planned network outage
– To reduce network bandwidth
– For a planned recovery test
The deactivation pauses a running sync job and no new sync jobs will be created as long as
the active state of the mirroring is not restored. However, the deactivation does not cancel the
status check by the master and the slave. The synchronization status of the deactivated
mirror is calculated as though the mirror was active.
Deactivating a mirror results in the synchronization status becoming RPO_Lagging when the
specified RPO time is exceeded. This means that the last-replicated snapshot is older than
the specified RPO.
The activation state changes to inactive, as shown in Figure 6-21. Subsequently, the
replication pauses (and records where it paused). Upon activation, the replication resumes.
Note that an ongoing sync job resumes upon activation. No new sync job will be created until
the next interval.
Note that for consistency group mirroring, deactivation pauses all running sync jobs
pertaining to the consistency group.
-- Activate Mirrored CG
mirror_activate cg="HS_Pool1_CG"
Deletion
When a mirror pair (volume pairs or a consistency group) is inactive, the mirror relationship
can be deleted. When the mirror is deleted, the XIV forgets everything about the mirror. If you
want to set up the mirror again, the XIV must do an initial copy again from the source to the
target volume.
When the mirror is part of a consistency group the mirror first must be removed from the
mirrored CG and the last_replicated CG for the master and the slave must be disbanded. This
snapshot CG is recreated with only the current volumes after the next interval completes. The
last_replicated snapshots for the mirror can now be deleted, allowing a new mirror to be
created.
Note that when the mirror is deleted, the slave volume becomes a normal volume again, but
the volume is locked, which means that it is write protected. To enable writing to the volume
go to the Volumes list panel, select the volume, right-click it, and select Unlock.
The slave volume must also be formatted before it can be part of a new mirror. Formatting
also requires all snapshots of that volume to be deleted.
Change role
In a disaster at the primary site, a role change at the secondary site is the normal recovery
action.
Assuming that the local site is down and that the remote site will become the main production
site, changing roles is performed at the remote (now production) site first. Later, when the
primary site is up again and communication is re-established, you also change the role at the
primary site to a slave to be able to establish mirroring from the secondary site back to the
primary site. This completes a switch role operation.
As shown in Figure 6-23, you are then prompted to confirm the role change (role reverse).
The coupling remains in inactive mode (Figure 6-25). This means that remote mirroring is
deactivated. This ensures an orderly activation when the role of the peer on the other site
is changed.
The new master volume or consistency group starts to accept write commands from local
hosts. Because coupling is not active, in the same way as for any master volume, a log is
maintained of which write operations must be sent to the slave volume when communication
resumes.
After changing the slave to the master, an administrator also must change the original master
to the slave role before communication resumes (Figure 6-26). If both peers are left in the
same role (master), mirroring cannot be restarted.
Upon re-establishing the connection, the primary volume or consistency group (current slave
volume/CG) is updated from the secondary volume/CG (new master volume/CG) with data
that was written to the secondary volume after the last replicated snapshot timestamp.
If the link is resumed and both sides have the same role, the coupling does not become
operational. The user must use the change role function on one of the volumes and then
activate the mirroring.
When recovering from a link failure, the following steps are taken to synchronize the data:
Asynchronous mirroring sync jobs proceed as scheduled. Sync jobs are restarted and a
new most_recent snapshot is taken. See 6.5.4, “Mirroring special snapshots” on
page 172.
The primary system copies the changed data to the secondary volume. Depending on
how much data must be copied, this operation could take a long time, and the status
remains RPO_Lagging.
However, within these broad categories there are a number of situations that may exist. Some
of these and the recovery procedures are considered below:
A disaster that makes the XIV at the primary site unavailable, but leaves the site itself and
the servers there still available
In this scenario the volumes/CG on the XIV at the secondary site can be switched to
master volumes/CG, servers at the primary site can be redirected to the XIV at the
secondary site, and normal operations can start again. When the XIV at the primary site is
recovered the data can be mirrored from the secondary site back to the primary site. A full
initialization of the data is usually not needed.
Only changes that take place at the secondary site are transferred to the primary site. If
desired, a planned site switch can then take place to resume production activities at the
primary site. See 6.2, “Role reversal” on page 166, for details related to this process.
last_replicated
Data to be replicated
Local Remote
site site
5. Sync jobs can now be run to create periodic consistent copies of the master volumes or
consistency groups on the slave system. See 6.6, “Detailed asynchronous mirroring
process” on page 173.
Details of this process can be found in 6.6, “Detailed asynchronous mirroring process” on
page 173.
Snapshot life-cycle
Throughout the sync job life cycle, the most_recent and last_replicated snapshots are created
and deleted to denote the completion of significant mirroring stages.
Table 6-1 indicates which snapshot is created for a given sync job phase.
Sync Job
last_replicated last_replicated
Data to be replicated
Local Remote
site site
last_replicated last_replicated
Local Remote
site site
last_replicated last_replicated
Local Remote
site site
The next sync job can now be run at the next defined interval.
Master
Slave
If RPO is equal to or lower than the If RPO is higher than the difference
difference between the current time between the current time (when the
(when the check is run) and the check is calculated) and the timestamp of
timestamp of the the last_replicated_snapshot, then the
last_replicated_snapshot, then the status will be set to RPO_LAGGING
status will be set to RPO_OK
A new snapshot is created with the same timestamp as the last_replicated snapshot
(Figure 6-38).
-- Duplicate last_replicated
snap_group_duplicate snap_group="last-replicated-HS_Pool1_CG"
After the testing is complete the remote volumes are returned to their previous slave role See
Figure 6-43, Figure 6-44, and Figure 6-45 on page 181.
Any changes made during the testing are removed by restoring the last_replicated snapshot,
and new updates from the local site will be transferred to the remote site when the mirror is
activated again (Figure 6-46 through Figure 6-48).
>> mirror_list -t
local_peer_name,sync_type,current_role,target_name,remote_peer_name,active
Name Mirror Type Role Remote System Remote Peer Active
HS1_1 async_interval Master WSC_6000639 HS1_1 no
HS1_2 async_interval Master WSC_6000639 HS1_2 no
HS1_3 async_interval Master WSC_6000639 HS1_3 no
HS_Pool1_CG async_interval Master WSC_6000639 HS_Pool1_CG no
>> mirror_list -t
local_peer_name,sync_type,current_role,target_name,remote_peer_name,active
Name Mirror Type Role Remote System Remote Peer Active
HS1_1 async_interval Slave WSC_6000639 HS1_1 no
HS1_2 async_interval Slave WSC_6000639 HS1_2 no
HS1_3 async_interval Slave WSC_6000639 HS1_3 no
HS_Pool1_CG async_interval Slave WSC_6000639 HS_Pool1_CG no
>> mirror_list -t
local_peer_name,sync_type,current_role,target_name,remote_peer_name,active
Name Mirror Type Role Remote System Remote Peer Active
HS1_1 async_interval Slave WSC_6000639 HS1_1 yes
HS1_2 async_interval Slave WSC_6000639 HS1_2 yes
HS1_3 async_interval Slave WSC_6000639 HS1_3 yes
HS_Pool1_CG async_interval Slave WSC_6000639 HS_Pool1_CG yes
Tip: Set appropriate pool alert thresholds to be warned ahead of time and be able to take
proactive measures to avoid any serious pool space depletion situations.
At a very high level, the steps to migrate to XIV using the XIV Data Migration function are:
1. Establish connectivity between source device and XIV. The source storage device must
have Fibre Channel connectivity with the XIV.
2. Collect configuration. Detail the configuration of the LUNs to be migrated.
3. Perform data migration:
– Stop/unconfigure all I/O from source-original LUNs.
– Start data migration in XIV.
– Activate new LUNs through XIV.
– Start all I/O on new XIV LUNs.
Note that all images of the XIV GUI shown in this chapter used Version 2.4.2 of the XIV GUI.
If you are still using an earlier version, certain panels may look slightly different.
Module 9
2 4 XIV Port 4 Initiator Mode
1 3
1 5
Module 8
1 4
1 3
1 2
2 4
1 1
1 3 Non-XIV Disk System
1 0
Module 7
C h a n n e l1 D u a lP o rte d H o s tC h a n n e sl
2 1 Dvri e C h a n n e l 2 1 Ch 2 Ch1
OK
G /sb G b s/ DC
9 2 4
2
4
1 2
4
1
OK
ID /D a
i g
OK
2 4
8 AC
7 AC
Host Server 1 3
B OK
6 ID /D a
i g
1 2 1 2
DC 4 4 4 2
OK G b s/ G /sb
OK
Ch 1 Ch 2 1 2 D u a lP o rte d 1 2
5 H o s tC h a n n e sl D vri e C h a n n e l C h a n n e l2
Module 6
4
1
2 4
0
1 3
FC4
16 Module 5
2 4 XIV port 4 Initiator Mode
1 3
SAN Switch
Module 4
2 4
1 3
The IBM XIV Data Migration solution offers a smooth data transfer, because it:
Requires only a single short outage to switch LUN ownership. This enables the immediate
connection of a host server to the XIV Storage System, providing the user with direct
access to all the data before it has been copied to the XIV Storage System.
Synchronizes data between the two storage systems using transparent copying to the XIV
Storage System as a background process with minimal performance impact.
Supports data migration from practically all storage vendors.
Must be set up using Fibre Channel.
Can be used to migrate SAN boot volumes.
The XIV Storage System manages the data migration by simulating host behavior. When
connected to the storage device containing the source data, XIV looks and behaves like a
SCSI initiator, which in common terms means that it acts like a host server. After the
connection is established, the storage device containing the source data believes that it is
During the background copy process, the host server is connected to the XIV Storage
System. The XIV Storage System handles all read and write requests from the host server,
even if the data is not resident on the XIV Storage System. In other words, during the data
migration, the data transfer is transparent to the host and the data is available for immediate
access.
It is important that the connections between the two storage systems remain intact during the
entire migration process. If at any time during the migration process the communication
between the storage systems fails, the process also fails. In addition, if communication fails
after the migration reaches synchronised status, writes from the host will fail if the source
updating option was chosen. The situation is further explained in the 7.2, “Handling I/O
requests” on page 187. The process of migrating data is performed at a volume level, as a
background process.
The data migration facility in XIV firmware revisions 10.1 and later supports the following:
Up to four migration targets can be configured on an XIV (where a target is either one
controller in an active/passive storage device or one active/active storage device). XIV
firmware revision 10.2 increased the number of targets to 16. The target definitions are
used for both Remote Mirroring (RM) and data migration (DM). Both DM and RM functions
can be active at the same time. As already stated, an active/passive storage device with
two controllers uses two target definitions unless only one of the controllers is used for the
migration.
The XIV can communicate with host LUN IDs ranging from 0 to 512 (in decimal). This
does not necessarily mean that the non-XIV disk system can provide LUN IDs in that
range. You may be restricted by the ability of the non-XIV storage controller to use only 16
or 256 LUN IDs depending on hardware vendor and device.
Up to 4000 LUNs can be concurrently migrated.
Important: During the discussion in this chapter, the source system in a data migration
scenario is referred to as a target when setting up paths between the XIV Storage System
and the donor storage (the non-XIV storage). This terminology is also used in Remote
Mirroring, and both functions share the same terminology for setting up paths for
transferring data.
The XIV Storage System handles all host server write requests and the non-XIV disk system
is now transparent to the host. All write requests are handled using one of two user-selectable
methods, chosen when defining the data migration. The two methods are known as source
updating and no source updating.
Source updating
This method for handling write requests ensures that both storage systems (XIV and non-XIV
storage) are updated when a write I/O is issued to the LUN being migrated. By doing this the
source system remains updated during the migration process, and the two storage systems
remain in sync after the background copy process completes. Similar to synchronous Remote
Mirroring, the write commands are only acknowledged by the XIV Storage System to the host
after writing the new data to the local XIV volume, then writing to the source storage device,
and then receiving an acknowledgement from the non-XIV storage device.
An important aspect of selecting this option is that if there is a communication failure between
the target and the source storage systems or any other error that causes a write to fail to the
source system, the XIV Storage System also fails the write operation to the host. By failing
the update, the systems are guaranteed to remain consistent. Change management
requirements determine whether you choose to use this option.
No source updating
This method for handling write requests ensures that only the XIV volume is updated when a
write I/O is issued to the LUN being migrated. This method for handling write requests
decreases the latency of write I/O operations because write requests are only written to the
XIV volume and are not written to the non-XIV storage system. It must be clearly understood
that this limits your ability to back out a migration, unless you have another way of recovering
updates that were written to the volume being migrated after migration began. If the host is
being shutdown for the duration of the migration then this risk is mitigated.
Important: If multiple paths are created between an XIV and an active/active storage
device, the same SCSI LUN IDs must be used for each LUN on each path, or data
corruption may occur.
Note: Certain examples shown in this chapter are from a DS4000 active/passive migration
with each DS4000 controller defined independently as a target to the XIV Storage System.
If you define a DS4000 controller as a target do not define the alternate controller as a
second port on the first target. Doing so causes unexpected issues such as migration
failure, preferred path errors on the DS4000, or very slow migration progress.
It is also possible that the host may be attached via one medium (such as iSCSI), whereas
the migration occurs via the other (such as Fibre Channel). The host-to-XIV connection
method and the data migration connection method are independent of each other.
Depending on the non-XIV storage device vendor and device, it may be easier to zone the
XIV to the ports where the volumes being migrated are already present. In this manner no
reconfiguration of the non-XIV storage device may be required. For example, in EMC
Symmetrix/DMX environments, it is easier to zone the fiber adapters (FAs) to the XIV where
the volumes are already mapped.
If you have already zoned the XIV to the non-XIV storage device, then the WWPNs of the XIV
initiator ports (that end in the number 3) will appear in the WWPN drop-down list. If they are
not there then you must manually add them (though this strongly suggests either the need to
map a LUN0, or that the SAN zoning has not been done correctly).
The XIV must be defined as a Linux or Windows host to the non-XIV storage device. If the
non-XIV device offers several variants of Linux, you can choose SuSE Linux or RedHat Linux
or Linux x86. This defines the correct SCSI protocol flags for communication between the XIV
and non-XIV storage device. The principal criterion is that the host type must start LUN
numbering with LUN ID 0. If the non-XIV storage device is active/passive, check to see
whether the host type selected affects LUN failover between controllers, such as DS4000
(see 7.12.5, “IBM DS3000/DS4000/DS5000” on page 227, for more details).
Tip: The data migration target is represented by an image of a generic rack. If you must
delete or rename the migration device do so by right-clicking the image of that rack.
3. On the dark shaded box that is part of the defined target, right-click and choose Add Port
(Figure 7-5).
a. Enter the WWPN of the first (fabric A) port on the non-XIV storage device zoned to the
XIV. There is no drop-down menu of WWPNs, so you must manually type or paste in
the correct WWPN. Be careful not to make a mistake. It is not necessary to use full
colons to separate every second number. It makes no difference if you enter a WWPN
as 10:00:00:c9:12:34:56:78 or 100000c912345678.
b. Click Add.
Tip: Depending on the storage controller, ensuring that LUN0 is visible on the non-XIV
storage device down the controller path that you are defining helps ensure proper
connectivity between the non-XIV storage device and the XIV. Connections from XIV to
DS4000 or EMC DMX or Hitachi HDS devices require a real disk device to be mapped as
LUN0. However, the IBM ESS 800, for instance, does not need a LUN to be allocated to
the XIV for the connection to become active (turn green in the GUI). The same is true for
EMC CLARiiON.
Note: In clustered environments you could work with only one node until the migration
is complete, so consider shutting down all other nodes in the cluster.
3. Perform a point-in-time copy of the volume on the non-XIV storage device (if that function
is available on the non-XIV storage). This point-in-time copy is a gold copy of the data that
is quiesced prior to starting the data migration process. Do this before changing any host
drivers or installing new host software, particularly if you are going to migrate boot from
SAN volumes.
When mapping volumes to the XIV it is very important to note the LUN IDs allocated by
the non-XIV storage. The methodology to do this varies by vendor and device and is
documented in greater detail in 7.12, “Device-specific considerations” on page 223.
Important: You must unmap the volumes away from the host during this step, even if
you plan to power the host off during the migration. The non-XIV storage only presents
the migration LUNs to the XIV. Do not allow a possibility for the host to detect the LUNs
from both the XIV and the non-XIV storage.
If you want to manually create the volumes on the XIV, consult 7.5, “Manually creating the
migration volume” on page 207. Preferably, instead continue with the next step, discussed
in the following section
Important: If the non-XIV device is active/passive, then the source target system
must represent the controller (or service processor) on the non-XIV device that
currently owns the source LUN being migrated. This means that you must check,
from the non-XIV storage, which controller is presenting the LUN to the XIV.
– Source LUN: Enter the decimal value of the host LUN ID as presented to the XIV from
the non-XIV storage system. Certain storage devices present the LUN ID as hex. The
number in this field must be the decimal equivalent. Ensure that you do not accidentally
use internal identifiers that you may also see on the source storage systems
3. Test the data migration object. Right-click to select the created data migration object and
choose Test Data Migration. If there are any issues with the data migration object the test
fails, reporting the issue found. See Figure 7-11.
Tip: If you are migrating volumes from an MSCS cluster that is still active, then testing a
migration may fail due to the reservations placed on the source LUN by MSCS. You must
bring the cluster down to get the test to succeed.
Note: Once activated, the data migration can be deactivated, but after deactivating the
data migration the host is no longer able to read or write to the migration volume and all
host I/O stops. Do not deactivate the migration with host I/O running. If you want to
abandon the data migration prior to completion consult the back-out process described in
section 7.10, “Backing out of a data migration” on page 219.
Important: The host cannot read the data on the non-XIV volume until the data
migration has been activated. The XIV does not pass through (proxy) I/O for a migration
that is inactive. If you use the XCLI dm_list command to display the migrations, ensure
that the word Yes appears in the Active column for every migration.
Note: In clustered environments, it is usually recommended that only one node of the
cluster be initially brought online after the migration is started, and that all other nodes
be offline until the migration is complete. Once complete, update all other nodes (driver,
host attachment package, and so on), as the primary node was during the initial outage
(see step 5 in “Perform pre-migration tasks for the host being migrated” on page 196).
2. Delete data migration. Once the synchronization has been achieved, the data migration
object can be safely deleted without host interruption.
Right-click to select the data migration volume and choose Delete Data Migration, as
shown in Figure 7-15. This can be done without host/server interruption.
Every command issued in the XIV GUI is logged in a text file with the correct syntax. This is
very helpful for creating scripts. If you are running the XIV GUI under Microsoft Windows, look
for a file titled guicommands_< todays date >.txt, which will be found in the following folder:
C:\Documents and Settings\ < Windows user ID >\Application Data\XIV\GUI10\logs
To make these changes permanent update the relevant profile, making sure that you export
the variables to make them environment variables.
With the data migration defined via the script or batch job above, an equivalent script or batch
job to execute the data migrations then must be run, as shown in Example 7-4.
For an example of the process to determine exact volume size, consider ESS 800 volume
00F-FCA33 depicted in Figure 7-23 on page 217. The size reported by the ESS 800 Web GUI
is 10 GB, which suggests that the volume is 10,000,000,000 bytes in size (because the ESS
800 displays volume sizes using decimal counting). The AIX bootinfo -s hdisk2 command
reports the volume as 9,536 GiB, which is 9,999,220,736 bytes (because there are
1,073,741,824 bytes per GiB). Both of these values are too small. When the volume
properties are viewed on the volume information panel of the ESS 800 Copy Services GUI, it
correctly reports the volume as being 19,531,264 sectors, which is 10,000,007,168 bytes
(because there are 512 bytes per sector). If we created a volume that is 19,531,264 blocks in
size this will be correct. When the XIV automatically created a volume to migrate the contents
of 00F-FCA33 it did create it as 19,531,264 blocks. Of the three information sources that were
considered to manually calculate volume size, only one of them must have been correct.
Using the automatic volume creation eliminates this uncertainty.
If you are confident that you have determined the exact size, then when creating the XIV
volume, choose the Blocks option from the Volume Size drop-down menu and enter the size
of the XIV volume in blocks. If your sizing calculation was correct, this creates an XIV volume
that is the same size as the source (non-XIV storage device) volume. Then you can define a
migration:
1. In the XIV GUI go to the floating menu Remote Migration.
2. Right-click and choose Define Data Migration (Figure 7-9 on page 198).
– Destination Pool: Choose the pool from the drop-down menu where the volume was
created.
– Destination Name: Chose the pre-created volume from the drop-down menu.
– Source Target System: Choose the already defined non-XIV storage device from the
drop-down menu.
Important: If the non-XIV device is active/passive, the source target system must
represent the controller (or service processor) on the non-XIV device that currently
owns the source LUN being migrated. This means that you must check from the
non-XIV storage, which controller is presenting the LUN to the XIV.
If the volume that you created is too small or too large you will receive an error message when
you do a test data migration, as shown in Figure 7-16. If you try and activate the migration you
will get the same error message. You must delete the volume that you manually created on
the XIV and create a new correctly sized one. This is because you cannot resize a volume
that is in a data migration pair, and you cannot delete a data migration pair unless it has
completed the background copy. Delete the volume and then investigate why your size
calculation was wrong. Then create a new volume and a new migration and test it again.
Increasing the max_initialization_rate parameter may decrease the time required to migrate
the data. However, doing so may impact existing production servers on the non-XIV storage
device. By increasing the rate parameters, more outgoing disk resources will be used to serve
migrations and less for existing production I/O. Be aware of how these parameters affect
migrations as well as production. You could always choose to only set this to a higher value
during off-peak production periods.
Important: Just because the initialization rate has been increased does not mean that the
actual speed of the copy increases. The outgoing disk system or the SAN fabric may well
be the limiting factor. In addition, you may cause host system impact by over-committing
too much bandwidth to migration I/O.
If you have a Cisco-based SAN, start Device Manager for the relevant switch and then select
Interface Monitor FC Enabled.
While the migration background copy is being processed, the value displayed in the Used
column of the Volumes and Snapshots panel drops every time that empty blocks are
detected. When the migration is completed, you can check this column to determine how
much real data was actually written into the XIV volume. In Figure 7-19 the used space on the
Windows2003_D volume is 4 GB. However, the Windows file system using this disk shown in
Figure 7-21 on page 214 shows only 1.4 GB of data. This could lead you to conclude wrongly
that the thick-to-thin capabilities of the XIV do not work.
The reason that this has occurred is that when file deletions occur at a file-system level, the
data is not removed. The file system re-uses this effectively free space but does not write
zeros over the old data (as doing so generates a large amount of unnecessary I/O). The end
result is that the XIV effectively copies old and deleted data during the migration. It must be
clearly understood that this makes no difference to the speed of the migration, as these
blocks have to be read into the XIV cache regardless of what they contain.
If you are not planning to use the thin provisioning capability of the XIV, this is not an issue.
Only be concerned if your migration plan specifically requires you to be adopting thin
provisioning.
In a Windows environment you can use a Microsoft tool known as sdelete to write zeros
across deleted files. You can find this tool in the sysinternals section of Microsoft Technet.
Here is the current URL:
http://technet.microsoft.com/en-us/sysinternals/bb897443.aspx
If you instead choose to write zeros to recover space after the migration, you must initially
generate large amounts of empty files, which may initially appear to be counter-productive. It
takes several days for the used space value to decrease after the script or application is run.
This is because recovery of empty space runs as a background task.
What this means is that we can resize that volume to 68 GB (as shown in the XIV GUI) and
make the volume 15 GB larger without effectively consuming any more space on the XIV. In
Figure 7-21 we can see that the migrated Windows2003_D drive is 53 GB in size
(53,678,141,440 bytes).
Because this example is for a Microsoft Windows 2003 basic NTFS disk, we can use the
diskpart utility to extend the volume, as shown in Example 7-9.
We can now confirm that the volume has indeed grown by displaying the volume properties.
In Figure 7-22 we can see that the disk is now 68 GB (68,713,955,328 bytes).
In terms of when to do the re-size, a volume cannot be resized while it is part of a data
migration. This means that the migration process must have completed and the migration for
7.9 Troubleshooting
This section lists common errors that are encountered during data migrations using the XIV
data migration facility.
Refer to “Multi-pathing with data migrations” on page 188 and 7.12, “Device-specific
considerations” on page 223, for additional information.
If migrating from an EMC Symmetrix or DMX there are special considerations. Refer to
7.12.2, “EMC Symmetrix and DMX” on page 224.
7.10.2 Back-out after a data migration has been defined but not activated
If the data migration definition exists but has not been activated, then you can follow the same
steps as described in 7.10.1, “Back-out prior to migration being defined on the XIV” on
page 219. To remove the inactive migration from the migration list you must delete the XIV
volume that was going to receive the migrated data.
7.10.3 Back-out after a data migration has been activated but is not complete
If the data migration shows in the GUI with a status of initialization or the XCLI shows it as
active=yes, then the background copy process has been started. If you deactivate the
migration in this state you will block any I/O passing through the XIV from the host server to
the migration LUN on the XIV and to the LUN on the non-XIV disk system. You must shut
down the host server or its applications first. After doing this you can deactivate the data
migration and then if desired you can delete the XIV data migration volume. Then restore the
original LUN masking and SAN fabric zoning and bring your host back up.
Important: If you chose to not allow source updating and write I/O has occurred after the
migration started, then the contents of the LUN on the non-XIV storage device will not
contain the changes from those writes. Understanding the implications of this is important
in a back-out plan.
Important: If you chose to not allow source updating and write I/O has occurred during the
migration or after it has completed, then the contents of the LUN on the non-XIV storage
device do not contain the changes from those writes. Understanding the implications of this
is important in a back-out plan.
When all data on the non-XIV disk system has been migrated, perform site clean up:
1. Delete all SAN zones related to the non-XIV disk.
2. Delete all LUNs on non-XIV disk and remove it from the site.
2 Site Run fiber cables from SAN switches to XIV for host connections
and migration connections.
3 Non-XIV storage Select host ports on the non-XIV storage to be used for migration
traffic. These ports do not have to be dedicated ports. Run new
cables if necessary.
4 Fabric switches Create switch aliases for each XIV Fibre Channel port and any
new non-XIV ports added to the fabric.
5 Fabric switches Define SAN zones to connect hosts to XIV (but do not activate the
zones). You can do this by cloning the existing zones from host to
non-XIV disk and swapping non-XIV aliases for new XIV aliases.
6 Fabric switches Define and activate SAN zones to connect non-XIV storage to XIV
initiator ports (unless direct connected).
8 Non-XIV storage Define the XIV on the non-XIV storage device, mapping LUN0 to
test the link.
9 XIV Define non-XIV storage to the XIV as a migration target and add
ports. Confirm that links are green and working.
11 XIV Define all the host servers to the XIV (cluster first if using clustered
hosts). Use a host listing from the non-XIV disk to get the WWPNs
for each host.
Once the site setup is complete, the host migrations can begin. Table 7-2 shows the host
migration check list. Repeat this check list for every host. Task numbers that are colored red
must be performed with the host application offline.
1 Host From the host, determine the volumes to be migrated and their relevant LUN
IDs and hardware serial numbers or identifiers.
2 Host If the host is remote from your location, confirm that you can power the host
back on after shutting it down (using tools such as an RSA card or
BladeCenter® manager).
3 Non-XIV Get the LUN IDs of the LUNs to be migrated from non-XIV storage device.
Storage Convert from hex to decimal if necessary.
5 Host Set the application to not start automatically at reboot. This helps when
performing administrative functions on the server (upgrades of drivers, patches,
and so on).
6 Host UNIX servers: Comment out disk mount points on affected disks in the mount
configuration file. This helps with system reboots while configuring for XIV.
8 Fabric Change the active zoneset to exclude the SAN zone that connects the host
server to non-XIV storage and include the SAN zone for the host server to XIV
storage. The new zone should have been created during site setup.
10 Non-XIV Map source volumes to the XIV host definition (created during site setup).
storage
11 XIV Create data migration pairing (XIV volumes created on the fly).
13 XIV Start XIV migration and verify it. If you want, wait for migration to finish.
14 Host Boot the server. (Be sure that the server is not attached to any storage.)
15 Host If co-existence of non-XIV and XIV multi-path software is not allowed, remove
non-XIV multi-pathing software.
17 Host Install the XIV Host Attachment Kit. (Be sure to note prerequisites.)
19 XIV Map XIV volumes to the host server. (Use original LUN IDs.)
21 Host Verify that the LUNs are available and that pathing is correct.
22 Host UNIX servers: Update mount points for new disks in the mount configuration file
if they have changed.
24 Host Set the application to start automatically if this was previously changed.
26 XIV When the volume is synchronized delete the data migration (do not deactivate
the migration).
27 Non-XIV Un-map migration volumes away from XIV if you must free up LUN IDs.
Storage
28 XIV Consider re-sizing the migrated volumes to the next 17 GB boundary if the host
operating system is able to use new space on a re-sized volume.
29 Host If XIV volume was re-sized, use host procedures to utilize the extra space.
30 Host If non-XIV storage device drivers and other supporting software were not
removed earlier, remove them when convenient.
When all the hosts and volumes have been migrated there are two site clean up tasks left, as
shown in Table 7-3.
Given that the XIV supports migration from almost any storage device, it is impossible to list
the methodology to get LUN IDs from each one.
Failover mode 0
LUN numbering
The EMC Symmetrix and DMX, by default, does not present volumes in the range of 0 to 512
decimal. The Symmetrix/DMX presents volumes based on the LUN ID that was given the
volume when the volume was placed on the FA port. If a volume was placed on the FA with a
LUN ID of 90, this is how it is presented to the host by default. The Symmetrix/DMX also
presents the LUN IDs in hex. Thus, LUN ID 201 equates to decimal 513, which is greater than
512 and is outside of the XIV's range. There are two disciplines for migrating data from a
Symmetrix/DMX where the LUN ID is greater than 512 (decimal).
LUN-Offset
With EMC Symmetrix Enginuity code 68 - 71 code, there is an EMC method of presenting
LUN IDs to hosts other than the LUN ID given to the volume when placed on the FA. In the
Symmetrix/DMX world, a volume is given a unique LUN ID when configured on an FA. Each
volume on an FA must have a unique LUN ID. The default method (and a best practice of
presenting volumes to a host) is to use the LUN ID given to the volume when placed on the
FA. In other words, if 'vol1' was placed on an FA with an ID of 7A (hex (0x07a) decimal 122),
this is the LUN ID that is presented to the host. Using the lunoffset option of the symmask
command, a volume can be presented to a host (WWPN initiator) with a different LUN ID than
was assigned the volume when placed on the FA. Because it is done at the initiator level, the
production server can keep the high LUNs (above 128) while being allocated to the XIV using
lower LUN IDs (below 512 decimal).
Multipathing
The EMC Symmetrix and DMX are active/active storage devices.
LUN0
There is a requirement for the HDS TagmaStore Universal Storage Platform (USP) to present
a LUN ID 0 to the XIV in order for the XIV Storage System to communicate with the HDS
device.
LUN numbering
The HDS USP uses hexadecimal LUN numbers.
Multipathing
The HDS USP is an active/active storage device.
7.12.4 HP EVA
The following requirements were determined after migration from a HP EVA 4400 and 8400.
LUN0
There is no requirement to map a LUN to LUN ID 0 for the HP EVA to communicate with the
XIV. This is because by default the HP EVA presents a special LUN known as the Console
LUN as LUN ID 0.
LUN numbering
The HP EVA uses decimal LUN numbers.
Multipathing
The HP EVA 4000/6000/8000 are active/active storage devices. For HP EVA 3000/5000, the
initial firmware release was active/passive, but a firmware upgrade to VCS Version 4.004
made it active/active capable. For more details see the following Web site:
http://h21007.www2.hp.com/portal/site/dspp/menuitem.863c3e4cbcdc3f3515b49c108973a8
01?ciid=aa08d8a0b5f02110d8a0b5f02110275d6e10RCRD
LUN0
There is a requirement for the DS4000 to present a LUN on LUN ID 0 to the XIV to allow the
XIV to communicate with the DS4000. It may be easier to create a new 1 GB LUN on the
DS4000 just to satisfy this requirement. This LUN does not need to have any data on it.
LUN numbering
For all DS4000 models, the LUN ID used in mapping is a decimal value between 0 to 15 or 0
to 255 (depending on model). This means that no hex-to-decimal conversion is necessary.
Figure 7-8 on page 197 shows an example of how to display the LUN IDs.
Instead, two targets should have been defined, as shown in Example 7-11. In this example,
two separate targets have been defined, each target having only one port for the relevant
controller.
In Example 7-13 choose the host type that specifies RDAC (Windows 2000).
LUN0
There is no requirement to map a LUN to LUN ID 0 for the ESS to communicate with the XIV.
LUN numbering
The LUN IDs used by the ESS are in hexadecimal, so they must be converted to decimal
when entered as XIV data migrations. It is not possible to specifically request certain LUN
IDs. In Example 7-14 there are 18 LUNs allocated by an ESS 800 to an XIV host called
NextraZap_ITSO_M5P4. You can clearly see that the LUN IDs are hex. The LUN IDs given in
the right-hand column were added to the output to show the hex-to-decimal conversion
needed for use with XIV. An example of how to view LUN IDs using the ESS 800 Web GUI is
shown in Figure 7-23 on page 217.
Restriction: The ESS can only allocate LUN IDs in the range 0 to 255 (hex 00 to FF). This
means that only 256 LUNs can be migrated at one time.
Multipathing
The ESS 800 is an active/active storage device. You can define multiple paths from the XIV to
the ESS 800 for migration. Ideally, connect to more than one host bay in the ESS 800.
Because each XIV host port is defined as a separate host system, ensure that the LUN ID
used for each volume is the same. There is a check box on the Modify Volume Assignments
panel titled “Use same ID/LUN in source and target” that will assist you. Figure 7-28 on
page 236 shows a good example of two XIV host ports with the same LUN IDs.
LUN0
There is no requirement to map a LUN to LUN ID 0 for a DS6000 or DS8000 to communicate
with the XIV.
LUN numbering
The DS6000 and DS8000 use hexadecimal LUN IDs. These can be displayed using DSCLI
with the showvolgrp -lunmap xxx command, where xxx is the volume group created to assign
volumes to the XIV for data migration. Do not use the Web GUI to display LUN IDs.
We now determine which physical disks must be migrated. In Example 7-17 we use the lspv
commands to determine that hdisk3, hdisk4, and hdisk5 are the relevant disks for this VG.
The lsdev -Cc disk command confirms that they are located on an IBM ESS 2105. We then
use the lscfg command to determine the hardware serial numbers of the disks involved.
Because we now know the source hardware we can create connections between the ESS
800 and the XIV and the XIV and Dolly (our host server). First, in Example 7-18 we identify
the existing zones that connect Dolly to the ESS 800. We have two zones, one for each AIX
HBA. Each zone contains the same two ESS 800 HBA ports.
We now create two new zones. The first zone connects the initiator ports on the XIV to the
ESS 800. The second and third zones connects the target ports on the XIV to Dolly (for use
after the migration). These are shown in Example 7-19. All six ports on the XIV clearly must
have been cabled into the SAN fabric.
We then create the migration connections between the XIV and the ESS 800. An example of
using the XIV GUI to do this was shown in “Define target connectivity (Fibre Channel only).”
on page 203. In Example 7-20 we use the XCLI to define a target, then the ports on that
target, then the connections between XIV and the target (ESS 800). Finally, we check that the
links are active=yes and up=yes. We can use two ports on the ESS 800 because it is an
active/active storage device.
Example 7-20 Connecting ESS 800 to XIV for migration using XCLI
>> target_define protocol=FC target=ESS800 xiv_features=no
Command executed successfully.
>> target_port_add fcaddress=50:05:07:63:00:c9:0c:21 target=ESS800
Command executed successfully.
>> target_port_add fcaddress=50:05:07:63:00:cd:0c:21 target=ESS800
Command executed successfully.
>> target_connectivity_define local_port=1:FC_Port:5:4
fcaddress=50:05:07:63:00:c9:0c:21 target=ESS800
Command executed successfully.
>> target_connectivity_define local_port=1:FC_Port:7:4
fcaddress=50:05:07:63:00:cd:0c:21 target=ESS800
Command executed successfully.
>> target_connectivity_list
Target Name Remote Port FC Port IP Interface Active Up
ESS800 5005076300C90C21 1:FC_Port:5:4 yes yes
ESS800 5005076300CD0C21 1:FC_Port:7:4 yes yes
We now define the XIV as a host to the ESS 800. In Figure 7-27 we have defined the two
initiator ports on the XIV (with WWPNs that end in 53 and 73) as Linux (x86) hosts called
Nextra_Zap_5_4 and NextraZap_7_4.
Once the zoning changes have been done and connectivity and correct definitions confirmed
between XIV to ESS and XIV to AIX host, we take an outage on the volume group and related
file systems that are going to be migrated. In Example 7-22 we unmount the file system, vary
off the volume group, and then export the volume group. Finally, we rmdev the hdisk devices.
If the Dolly host no longer needs access to any LUNs on the ESS 800 we remove the SAN
zoning that connects Dolly to the ESS 800. In Example 7-18 on page 233 this was the zone
called ESS800_dolly_fcs0.
We now create the DMs and run a test on each LUN. The XIV GUI or XCLI could be used. In
Example 7-23 the commands to create, test, and activate one of the three migrations is
shown. We must run each command for hdisk3 and hdisk4 also.
After we create and activate all three migrations, the Migration panel in the XIV GUI looks as
shown in Figure 7-29. Note that the remote LUN IDs are 0, 1, and 2, which must match the
LUN numbers seen in Figure 7-28.
Now we can bring the volume group back online. Because this AIX host was already using
SDDPCM, we can install the XIVPCM (the AIX host attachment kit) at any time prior to the
change. In Example 7-24 we confirm that SDDPCM is in use and that the XIV definition file
set is installed. We then run cfgmgr to detect the new disks. We then confirm that the disks
are visible using the lsdev -Cc disk command.
A final check before bringing the volume group back ensures that the Fibre Channel pathing
from the host to the XIV is set up correctly. We can use the AIX lspath command against
each hdisk, as shown in Example 7-25. Note that in this example the host can connect to
port 2 on each of the XIV modules 4, 5, 6, and 7 (which is confirmed by checking the last two
digits of the WWPN).
Non-XIV devices
===============
Device Size Paths
-----------------------------------
hdisk1 N/A 1/1
hdisk2 N/A 1/1
We can also use the XIV GUI to confirm connectivity by going to the Hosts and Clusters
Host Connectivity panel. An example is shown in Figure 7-31, where the connections match
those seen in Example 7-25 on page 237.
Having confirmed that the disks have been detected and that the paths are good, we can now
bring the volume group back online. In Example 7-27 we import the VG, confirm that the
PVIDs match those seen in Example 7-17 on page 232, and then mount the file system.
Now at the ESS 800 Web GUI we can un-map the three ESS 800 LUNs from the Nextra_Zap
host definitions. This frees up the LUN IDs to be reused for the next volume group migration.
After the migrations are deleted, a final suggested task is to re-size the volumes on the XIV to
the next 17 GB cutoff. In this example we migrate ESS LUNs that are 10 GB in size. However,
the XIV commits 17 GB of disk space because all space is allocated in 17 GB portions. For
this reason it is better to resize the volume on the XIV GUI from 10 GB to 17 GB so that all the
allocated space on the XIV is available to the operating system. This presumes that the
operating system can tolerate a LUN size growing, which in the case of AIX is true.
We must unmount any file systems and vary off the volume group before we start. Then we
go to the volumes section of the XIV GUI, right-click to select the 10 GB volume, and select
the Resize option. The current size appears. In Figure 7-33 the size is shown in 512 byte
blocks because the volume was automatically created by the XIV based on the size of the
source LUN on the ESS 800. If we multiply 19531264 by 512 bytes we get 10,000,007,168
bytes, which is 10 GB.
We change the sizing methodology to GB and the size immediately changes to 17 GB, as
shown in Figure 7-34. If the volume was already larger than 17 GB, then it will change to the
next interval of 17 GB. For example, a 20 GB volume shows as 34 GB.
Now the volume is really 17 GB and no space is being wasted on the XIV. The new size is
shown in Figure 7-35.
Vary on the VG again to update AIX that the volume size has changed. In Example 7-28 we
import the VG, which detects that the source disks have grown in size. We then run the chvg
-g command to grow the volume group, then confirm that the file system can still be used.
We can now resize the file system to take advantage of the extra space. In Example 7-29 the
original size of the file system in 512 byte blocks is shown.
[Entry Fields]
File system name /mnt/redbk
NEW mount point [/mnt/redbk]
SIZE of file system
Unit Size 512bytes
Number of units [41943040]
The file system has now grown. In Example 7-31 we can see the file system has grown from
20 GB to 40 GB.
The combination of SVC and XIV allows a client to benefit from the high-performance grid
architecture of the XIV while retaining the business benefits delivered by the SVC (such as
higher performance via disk aggregation, multivendor and multi-device copy services, and
data migration functions).
The order of the sections in this chapter address each of the requirements of an
implementation plan in the order in which they arise. This chapter does not, however, discuss
physical implementation requirements (such as power requirements), as they are already
addressed in the book IBM XIV Storage System: Architecture, Implementation, and Usage,
SG24-76599, found here:
http://www.redbooks.ibm.com/Redbooks.nsf/RedbookAbstracts/sg247659.html?Open
SVC firmware
The first SVC firmware version that supported XIV was 4.3.0.1. However, the SVC cluster
should be on at least SVC firmware Version 4.3.1.4 or more preferably the most recent level
available from IBM. You can display the SVC firmware version by viewing the cluster
properties in the SVC GUI or by using the svcinfo lscluster command specifying the name
of the cluster. The SVC in Example 8-1 is on SVC code level 4.3.1.5.
Example 8-1 Displaying the SVC cluster code level using SVC CLI
IBM_2145:SVCSTGDEMO:admin> svcinfo lscluster SVCSTGDEMO
code_level 4.3.1.5 (build 9.16.0903130000)
XIV firmware
The XIV should be on at least XIV firmware Version 10.0.0.a. The XIV firmware version is
shown on the All Systems front page of the XIV GUI. The XIV in Figure 8-1 is on version
10.0.1.b (circled on the upper right in red).
Note that an upgrade from XIV 10.0.x.x code levels to 10.1.x.x code levels is not concurrent
(meaning that the XIV is unavailable for I/O during the upgrade).
6 27.26 8 4 4:5 6
11 54.65 20 10 4:5:7:8:9 6
12 61.74 20 10 4:5:7:8:9 6
13 66.16 24 12 4:5:6:7:8:9
14 73.24 24 12 4:5:6:7:8:9
15 79.11 24 12 4:5:6:7:8:9
Another way to view the activation state of the XIV interface modules is shown in Table 8-2.
As additional capacity is added to an XIV, additional XIV host ports become available. Where
a module is shown as inactive, this refers only to the host ports, not the data disks.
Module 9 Not present Inactive Inactive Active Active Active Active Active
host ports
Module 8 Not present Active Active Active Active Active Active Active
host ports
Module 7 Not present Active Active Active Active Active Active Active
host ports
91 93
Module 9
90 92
91
81 93
83
Module 8
90
80 92
82
71 73
Module 7
70 72
61 63
Module 6
60 62
91
51 93
53
Module 5
90
50 92
52
41 43
Module 4
40 42
Port 2 Port 4
Port 1 Port 3
In Figure 8-2, the MP value (module/port, which make up the last two digits of the WWPN) is
shown in each small box. The diagram represents the patch panel found at the rear of the XIV
rack.
To display the XIV WWPNs use the back view on the XIV GUI or the XCLI fc_port_list
command.
In the output example shown in Example 8-3 the four ports in module 4 are listed.
For availability and performance use ports 1 and 3 for SVC and general host traffic. If you
have two fabrics, place port 1 in the first fabric and port 3 in the second fabric.
You can instead choose to use ports 2 and 4, although in principle these are reserved for data
migration and remote mirroring. For that reason port 4 on each module is by default in initiator
mode. If you want to change the mode of port 4 to target mode, you can do so easily from the
XIV GUI or XCLI. However, you may also need an RPQ from IBM. Contact your IBM XIV
representative to discuss this.
Ideally, the number of MDisks presented by the XIV to the SVC should be a multiple of the
number of XIV host ports, from one to four. However, there is good math to support this.
The XIV can handle a queue depth of 1400 per Fibre Channel host port and a queue depth of
256 per mapped volume per host port:target port:volume tuple. However, the SVC sets the
following internal limits:
The maximum queue depth per MDisk is 60.
The maximum queue depth per target host port on an XIV is 1000.
Based on this knowledge, we can determine an ideal number of XIV volumes to map to the
SVC for use as MDisks by using this algorithm:
Q = ((P x C) / N) / M
If a 2-node SVC cluster is being used with 12 ports on IBM XIV System and 48 MDisks, this
yields a queue depth that as follows:
Q = ((12 ports*1000)/2 nodes)/48 MDisks = 125
Because 125 is greater than 60, the SVC uses a queue depth of 60 per MDisk. If a 4-node
SVC cluster is being used with 12 host ports on the IBM XIV System and 48 MDisks, this
yields a queue depth that as follows:
Q = ((12 ports*1000)/4 nodes)/48 MDisks = 62
Because 62 is greater than 60, the SVC uses a queue depth of 60 per MDisk.
If you have a 6-node or 8-node cluster, the formula suggests that you must use much larger
XIV volumes. However, currently available SVC firmware does not support an MDisk larger
than 2 TB, so it is simpler to continue to use the 1632 GB volume size. When using 1632 GB
volumes, there is leftover space. That space could be used for testing or for non-SVC
direct-attach hosts. If you map the remaining space to the SVC as an odd sized volume then
VDisk striping is not balanced, meaning that I/O is not be evenly striped across all XIV host
ports.
Tip: If you only provision part of the usable space of the XIV to be allocated to the SVC,
then the calculations above no longer work. You should instead size your MDisks to ensure
that at least two (and up to four) MDisks are created for each host port on the XIV.
Thus, XIV is using binary sizing when creating volumes, but displaying it in decimal and then
rounding it down.
The recommended volume size for XIV volumes presented to the SVC is 1632 GB (as viewed
on the XIV GUI). There is nothing special about this volume size, it simply divides nicely to
create on average four XIV volumes per XIV host port (for queue depth purposes).
The size of a 1632 GB volume (as viewed on the XIV GUI) can be stated in four ways:
GB 1632 GB (decimal), as shown in the XIV GUI, but rounded down to the
nearest GB (see the number of bytes below).
GiB 1520 GiB (binary counting where 1 GiB = 230 bytes). This is exactly
1520 GiB.
Bytes 1,632,087,572,480 bytes.
Blocks 3,187,671,040 blocks (each block being 512 bytes).
Note that the SVC reports each MDisk presented by XIV as 1520 GiB. Figure 8-3 shows what
the XIV reports.
If you right-click the volume in the XIV GUI and display properties, you will be able to see that
this volume is 3,187,671,040 blocks. If you multiply 3,187,671,040 by 512 (because there are
512 bytes in a SCSI block) you will get 1,632,087,572,480 bytes. If you divide that by
1,073,741,824 (the number of bytes in a binary GiB), then you will get 1520 GiB, which is
exactly what the SVC reports for the same volume (MDisk), as shown in Example 8-4.
IBM_2145:SVCSTGDEMO:admin>svcinfo lsmdisk
id name status mode capacity ctrl_LUN_# controller_name
9 mdisk9 online unmanaged 1520.0GB 0000000000000007 XIV
In Figure 8-4 there are three volumes that will be mapped to the SVC. The first volume is
2199 GB (2 TiB), but the other two are larger than that.
When presented to the SVC, the SVC reports all three as being 2 TiB (2048 GiB), as shown
in Example 8-5.
Because there was no benefit in using larger volume sizes do not follow this example. Always
ensure that volumes presented by the XIV to the SVC are 2199 GB or smaller (when viewed
on the XIV GUI or XCLI).
In terms of the available SVC extent sizes and the effect on maximum SVC cluster size, see
Table 8-4.
16 MB 64 TB
32 MB 128 TB
64 MB 256 TB
128 MB 512 TB
256 MB 1024 TB
512 MB 2048 TB
1024 MB 4096 TB
2048 MB 8192 TB
To determine whether removing a managed disk controller requires quorum disk relocation,
run a script to find the MDisks that are being used as quorum disks, as shown in
Example 8-6. This script can be run safely without modification. Example 8-6 shows two
MDisks on the DS6800 and one MDisk on the DS4700.
Example 8-7 Using the svcinfo lsquorum command on SVC code level 5.1 and later
IBM_2145:mycluster:admin>svcinfo lsquorum
quorum_index status id name controller_id controller_name active
0 online 0 mdisk0 0 DS6800_1 yes
1 online 1 mdisk1 1 DS6800_1 no
2 online 2 mdisk2 2 DS4700 no
To move the quorum disk function, we specify three MDisks that will become quorum disks.
Depending on your MDisk group extent size, each selected MDisk must have between 272
MB and 1024 MB of free space. Execute the svctask setquorum commands before you start
migration. If all available MDisk space has been allocated to VDisks then you will not be able
to use that MDisk as a quorum disk. Table 8-5 shows the amount of space needed on each
MDisk.
Table 8-5 Quorum disk space requirements for each of the three quorum MDisks
Extent size (in MB) Number of extents Amount of space per MDisk
needed by quorum needed by quorum
16 17 272 MB
32 9 288 MB
64 5 320 MB
128 3 384 MB
256 2 512 MB
1024 1 1024 MB
2048 1 2048 MB
In Example 8-8 there are three free MDisks. They are 1520 GiB in size (1632 GB).
In Example 8-9 the MDisk group is created using an extent size of 1024 MB.
All three MDisks are set to be quorum disks, as shown in Example 8-11.
The MDisk group has now lost free space, as shown in Example 8-12.
This means that free capacity fell by 3,221,225,472 bytes, which is 3 GiB or 1 GiB per quorum
MDisk.
Note: In this example all three quorum disks were placed on a single XIV. This may not be
an ideal configuration. The Web tip referred to at the start of this section has more details
about best practice, but in short you should try and use more than one managed disk
controller if possible.
3. Add the SVC host ports to the host definition of the first SVC node, as shown in
Example 8-15.
4. Add the SVC host ports to the host definition of the second SVC node, as shown in
Example 8-16.
5. Repeat steps 3 and 4 for each SVC I/O group. If you only have two nodes then you only
have one I/O group.
6. Create a storage pool. In Example 8-17 the command shown creates a pool with 8160 GB
of space and no snapshot space. The total size of the pool is determined by the volume
size that you choose to use. We do not need snapshot space because we cannot use XIV
snapshots with SVC MDisks.
Important: You must not use XIV thin provisioning pools with SVC. You must only use
regular pools. The command shown in Example 8-17 creates a regular pool (where the
soft size is the same as the hard size. This does not stop you from using thin
provisioned VDisks on the SVC.
Important: Only map volumes to the SVC cluster (not to individual nodes in the
cluster). This ensures that each SVC node sees the same LUNs with the same LUN
IDs. You must not allow a situation where two nodes in the same SVC cluster have
different LUN mappings.
Tip: The XIV GUI normally reserves LUN ID 0 for in-band management. The SVC
cannot take advantage of this, but is not affected either way. In Example 8-19 we
started the mapping with LUN ID 0, but if you used the GUI you will find that by default
you start with LUN ID 1.
9. If necessary, change the system name for XIV so that it matches the controller name used
on the SVC. In Example 8-20 we use the config_get command to determine the machine
type and serial number. Then we use the config_set command to set the system_name.
Whereas the XIV allows a long name with spaces, SVC can only use 15 characters with
no spaces.
3. Create an MDisk group, as shown in Example 8-22, where an MDisk group is created
using an extent size of 1024 MB.
Important: Adding a new managed disk group to the SVC may result in the SVC
reporting that you have exceeded the virtualization license limit. Whereas this does not
affect operation of the SVC, you continue to receive this error message until the
situation is corrected (by either removing the MDisk Group or increasing the
virtualization license). If the non-XIV disk is not being replaced by the XIV then ensure
that an additional license has been purchased. Then increase the virtualization limit
using the svctask chlicense -virtualization xx command (where xx specifies the
new limit in TB).
4. Relocate quorum disks if required as documented in “Using an XIV for SVC quorum disks”
on page 253.
5. Rename the controller from its default name. A managed disk controller is given a name
by the SVC such as controller0 or controller1 (depending on how many controllers have
already been detected). Because the XIV can have a system name defined for it, aim to
closely match the two names. Note, however, that the controller name used by SVC
cannot have spaces and cannot be more than 15 characters long. In Example 8-23
controller number 2 is renamed to match the system name used by the XIV itself (which
was set in Example 8-20 on page 257).
Now we must follow one of the migration strategies, as described in the 8.7, “Data movement
strategy overview” on page 259.
We discuss this method in greater depth in 8.8, “Using SVC migration to move data to XIV” on
page 261.
We discuss this method in greater depth in 8.9, “Using VDisk mirroring to move the data” on
page 263.
In these cases we can migrate the VDisks to image mode and take an outage to do the
relocation and extent re-size. There will be a host outage, although it can kept very short
(potentially in the order of seconds or minutes).
This method is detailed in greater depth in 8.10, “Using SVC migration with image mode” on
page 267.
We then must identify the VDisks that we are migrating. We can filter by MDisk Group ID, as
shown in Example 8-26, where there is only one VDisk that must be migrated.
We then create an Mdisk group called XIV_Target using the new XIV MDisks, with the same
extent size as the source group. In Example 8-28 it is 256.
We confirm the new MDisk group is present. In Example 8-29 we are filtering by using the
new ID of 2.
8.8.3 Migration
Now we are ready to migrate the VDisks. In Example 8-30 we migrate VDisk 5 into MDisk
group 2 and then confirm that the migration is running.
Important: Scripts that use VDisk names or IDs will not be affected by the use of VDisk
migration, as the VDisk names and IDs do not change.
We then must identify the VDisks that we are migrating. In Example 8-33 we filter by ID.
We then create an MDisk group called XIV_Target using the new XIV MDisks (with the same
extent size as the source group, in this example 256), as shown in Example 8-35.
We confirm that the new MDisk group is present. In Example 8-36 we are filtering by using
the new ID of 2.
In Example 8-40 we can see the two copies (and also that they are not yet in sync).
If copying is going too slowly, you could choose set a higher syncrate when you create the
copy.
You can also increase the syncrate from the default value of 50 (which equals 2 MBps) to 100
(which equals 64 MBps). This change affects the VDisk itself and isvalid for any future copies.
Example 8-43 shows the syntax.
Once the estimated completion time passes, we can confirm that the copy process is
complete for VDisk 5. In Example 8-44 the sync is complete.
Important: Scripts that use VDisk names or IDs should not be affected by the use of VDisk
mirroring, as the VDisk names and IDs do not change. However, if you choose to split the
VDisk copies and continue to use copy 0, it will be a totally new VDisk with a new name
and a new ID.
Now to make a matching XIV volume we can either make an XIV volume that is larger than
the source VDisk or one that is exactly the same size. The easy solution is to create a larger
volume. Because the XIV creates volumes in 16 GiB portions (that display in the GUI as
rounded decimal 17 GB chunks), we could create a 17 GB LUN using the XIV and then map
it to the SVC (in this example the SVC host is defined by the XIV as svcstgdemo) and use the
next free LUN ID, which in Example 8-49 is LUN ID 12 (it is different every time).
The drawback of using a larger volume size is that we eventually end up using extra space.
So it is better to create a volume that is exactly the same size. To do this we must know the
size of the VDisk in bytes (by default the SVC shows the VDisk size in GiB, even though it
says GB). In Example 8-50 we first choose to display the size of the VDisk in GB.
Now that we know the size of the source VDisk in bytes, we can divide this by 512 to get the
size in blocks (there are always 512 bytes in a standard SCSI block). 10,737,418,240 bytes
divided by 512 bytes per block is 20,971,520 blocks. This is the size that we use on the XIV to
create our image mode transitional volume.
Example 8-52 shows an XCLI command run on an XIV to create a volume using blocks.
Having created the volume, on the XIV we now map it to the SVC (using the XIV GUI or
XCLI).
Then, on the SVC, we can detect it as an unmanaged MDisk using the svctask detectmdisk
command.
In Example 8-53 we first identify the source VDisk number (by listing VDisks per MDisk
group) and then identify the candidate MDisk (by looking for unmanaged MDisks).
In Example 8-53 we identified a source VDisk(5) sized 10 GiB and a target MDisk(9) sized
16 GiB.
Now we migrate the VDisk into image mode without changing MDisk groups (we stay in
group 1, which is where the source VDisk is currently located). The target MDisk must be
unmanaged to be able to do this. If we migrate to a different MDisk group, the extent size of
the target group must be the same as the source group. The advantage of using the same
group is simplicity, but it does mean that the MDisk group contains MDisks from two different
controllers (which is not the best option for normal operations). Example 8-54 shows the
command to start the migration.
We must confirm that the VDisk is in image mode or data loss will occur in the next step. At
this point we must take an outage.
The commands shown in Example 8-56 apply to a host whose Host ID is 2 and the VDisk ID
is 5.
The MDisk is now unmanaged (even though it contains customer data) and could be mapped
to a different SVC cluster or simply mapped directly to a non-SVC host.
We can now reboot the host (or scan for new disks) and the LUN will return with data intact.
Important: The VDisk ID and VDisk names were both changed in this example. Scripts
that use the VDisk name or ID (such as those used to automatically create flashcopies)
must be changed to reflect the new name and ID.
First we create a new managed disk group using volumes on the XIV intended to be used as
the final destination. In Example 8-59, five volumes, each 1632 GB, were created on the XIV
and mapped to the SVC. These are detected as 1520 GiB (because 1632 GB on the XIV GUI
equals 1520 GiB on the SVC GUI). At a certain point the MDisks must also be renamed from
the default names given by the SVC using the svctask chmdisk -name command.
We create a MDisk group using an extent size of 1024 MB with the five free MDisks. In
Example 8-60 MDisk group 3 is created.
We then migrate the image mode VDisk (in our case VDisk 5) into the new MDisk group (in
our case group 3), as shown in Example 8-61.
In Example 8-63, we monitor the migration and wait for it to complete (no response means
that it is complete).
We can clean up the transitional MDisk (which should now be unmanaged), as shown in
Example 8-64.
Note: If the XIV has the Capacity on Demand (CoD) feature, then no hardware change
or license key is necessary to use available capacity that has not yet been purchased.
The customer simply starts using additional capacity as required until all available
usable space is allocated. The billing process to purchase this capacity occurs
afterwards.
2. From the Pools section of the XIV GUI, right-click the relevant pool and resize it depending
on how the new capacity will be split between any pools. If all the space on the XIV is
dedicated to a single SVC then there must be only one pool.
3. From the Volumes by Pools section of the XIV GUI, add new volumes of 1632 GB until no
more volumes can be created. (There will be space left over, which can be used as scratch
space for testing and for non-SVC hosts.)
4. From the Host section of the XIV GUI, map these new volumes to the relevant SVC
cluster. This completes the XIV portion of the upgrade.
5. From the SVC, detect and then add the new MDisks to the existing managed disk group.
Alternatively, a new managed disk group could be created. Remember that every MDisk
uses a different XIV host port, so a new MDisk group ideally contains several MDisks to
spread the Fibre Channel traffic.
6. If new volumes are added to an existing managed disk group, it may be desirable to
rebalance the existing extents across the new space.
To explain why an extent rebalance may be desirable, the SVC uses one XIV host port as a
preferred port for each MDisk. If a VDisk is striped across eight MDisks, then I/O from that
VDisk will be potentially striped across eight separate I/O ports on the XIV. If the space on
these eight MDisks is fully allocated, then when new capacity is added to the MDisk group,
new VDisks will only be striped across the new MDisks. If additional capacity supplying only
two new MDisks is added, then I/O for VDisks striped across just those two MDisks is only
directed to two host ports on the XIV. This means that the performance characteristics of
these VDisks may be slightly different, despite the fact that all XIV volumes effectively have
the same back end disk performance. The extent rebalance script is located here:
http://www.alphaworks.ibm.com/tech/svctools
The suggested host port to capacity ratios are shown in Table 8-6.
6 27 4
9 43 8
10 50 8
11 54 10
12 61 10
13 66 12
14 73 12
15 79 12
To use additional XIV host ports, run a cable from the SAN switch to the XIV and attach to the
relevant port on the XIV patch panel. Then zone the new XIV host port to the SVC cluster via
the SAN switch. No commands must be run on the XIV.
We can confirm is that the SVC is utilizing all six XIV interface modules. In Example 8-66 XIV
interface modules 4 through 9 are all clearly zoned to the SVC (because the WWPN ending in
71 is from XIV module 7, the module with WWPN ending in 61 is from XIV module 6, and so
on. To decode the WWPNs use the process described in 8.3.2, “Determining XIV WWPNs”
on page 247.
The publications listed in this section are considered particularly suitable for a more detailed
discussion of the topics covered in this book.
Other publications
These publications are also relevant as further information sources:
IBM XIV Storage System Installation and Service Manual, GA32-0590
IBM XIV Storage System XCLI Manual, GC27-2213
IBM XIV Storage System Introduction and Theory of Operations, GC27-2214
IBM XIV Storage System Host System, GC27-2215
IBM XIV Storage System Model 2810 Installation Planning Guide, GC52-1327-01
IBM XIV Storage System Pre-Installation Network Planning Guide for Customer
Configuration, GC52-1328-01
XCLI Reference Guide, GC27-2213-00
Host System Attachment Guide for Windows- Installation Guide
http://publib.boulder.ibm.com/infocenter/ibmxiv/r2/index.jsp
The iSCSI User Guide
http://download.microsoft.com/download/a/e/9/ae91dea1-66d9-417c-ade4-92d824b871
af/uguide.doc
AIX 5L System Management Concepts: Operating System and Devices
http://publib16.boulder.ibm.com/pseries/en_US/aixbman/admnconc/hotplug_mgmt.htm
#mpioconcepts
System Management Guide: Operating System and Devices for AIX 5L
http://publib16.boulder.ibm.com/pseries/en_US/aixbman/baseadmn/manage_mpio.htm
Host System Attachment Guide for Linux, which can be found at the XIV Storage System
Information Center
http://publib.boulder.ibm.com/infocenter/ibmxiv/r2/index.jsp
Sun StorEdge Traffic Manager Software 4.4 Release Notes
http://dlc.sun.com/pdf/819-5604-17/819-5604-17.pdf
Online resources
These Web sites are also relevant as further information sources:
IBM XIV Storage Web site
http://www.ibm.com/systems/storage/disk/xiv/index.html
System Storage Interoperability Center (SSIC)
http://www.ibm.com/systems/support/storage/config/ssic/index.jsp
Storage Networking Industry Association (SNIA) Web site
http://www.snia.org/
IBM Director Software Download Matrix page
http://www.ibm.com/systems/management/director/downloads.html
IBM Director documentation
http://www.ibm.com/systems/management/director/
E
environment variables 205–206 K
ESS 189, 195, 207, 229 KB 5
Ethernet 113 Keep Source Updated 188, 199, 208
Ethernet switch 176
event log 211, 227
Exchange Server 45 L
extent size 252–254 last consistent snapshot 136
new managed disk group 261 timestamp 137
last_consistent 135
last_replicated 100
F last_replicated snapshot 100–101, 157, 167–168, 172
fail-over 188–189, 218 Level 1.0 58–60
failure 72 license 258, 273, 275
fan-in 89 link
fan-out 89 failure 136
fc_port_config 117 link failure 169
fc_port_list 113, 117 link status 79
Fibre Channel 113 link utilization 79
Fibre Channel (FC) 71, 86, 91 Linux x86 192, 230
Fibre Channel ports 91, 116, 176 local site 72, 79, 88, 127–128, 133, 166, 170, 181
FLASHCOPYMANAGER 59 Master volumes 138
old Master volumes 138
lock Snapshot 35
G Logical Unit Number (LUN) 77, 92
GiB 250–252 LUN ID 187, 197–198, 200
Graphical User Interface LUN mapping 197
Remote Mirroring performance statistics 112 LUN numbering 192, 223
Graphical User Interface (GUI) 8, 11, 91, 113 LUN0 192, 195, 221, 223
GUI example 176
GUI step 178
M
MachinePool Editor 51
H managed mode 267, 271–272
Host Attachment Kit 58 map_vol host 268
Host Attachment Procedure 197 Master 72, 77
host server 186–187, 201 Master peer 78, 107–108, 133, 151, 168
I/O requests 187 master peer
host_add_port host 256 actual data 108
Deactivate XIV remote mirroring 108
I remote mirroring 108
IBM Tivoli Storage FlashCopy Manager 41 XIV remote mirroring 108
IBM Tivoli Storage Manager V6 42 Master role 74, 76–77, 96, 126, 133–134, 136, 161, 166
IBM XIV Master volume 5, 10, 15–16, 73, 78, 81, 101, 132,
data migration solution 186 134–136, 167–168
storage 187 periodic consistent copies 171
image mode 245, 252, 260, 267 master volume 5, 9, 14
Index 281
R secondary XIV 72–73, 81, 128, 136, 155, 161
RDAC 227–228 corresponding consistency group 155
reactivation 111 Mirror statuses 130, 145
Recovery Point Objective (RPO) 73, 149 Slave volume 147
Redbooks Web site service 45
Contact us xiv shadow copy 44
RedHat 192, 231 persistent information 45
redirect on write 5, 22, 66 single XIV
redirect-on-write 47 footprint 253
Remote Mirror 33, 71–72, 113–114 rack 253
activate 144 Storage Pool 92
remote mirror pair 198 system 82, 85, 89
Remote Mirroring 8, 23, 71–73, 80, 125–126, 129, 132, Slave 72, 77
159 Slave peer 74, 100, 132–133, 136, 151–152, 161
actions 86 slave peer
activate 155 consistent data 74
delete 165 Slave Pool 127
function 113 Slave role 73, 96, 126, 134–136, 167, 179–181
implementation 113 Slave volume 76–79, 81, 127, 129, 132–133, 150, 152
planning 111 Changing role 100
usage 82 consistent state 77
remote mirroring 116 whole group 77
consistency groups 72, 112 snap_group 31, 37
Fibre channel paths 113 snap_group_duplicate 178
first step 81 Snapshot
single unit 96 automatic deletion 6, 8, 21
synchronous 125 creation 8, 31
XIV systems 86 deletion priority 6, 11–12
remote site 71–72, 78, 127, 133, 137, 166, 170, 177, 181 details 31
secondary peers 103 duplicate 6, 9–10
Slave volumes 138 last_consistent 135
standby server 137 last_replicated 172
requestor 45, 47 lock 35
resize 113, 208, 214–215 most_recent 172
resize operation 66 restore 37–38
Resynchronization 102, 107, 136 size 31
resynchronization 136, 169 snapshot 1, 5
role 72, 76–77, 100 delete 6, 19
change 78 duplicate 6, 9–10
switching 78 last consistent 136
role reversal 166 locked 6, 9
RPO 73, 153, 162 naming convention 9
RPO_Lagging 81, 161 snapshot group 28, 30
RPO_OK 81 snapshot volume 4–5, 7, 16, 46, 56
snapshot_delete 20
Snapshot/volume copy 109
S SNMP traps 112
SAN boot 68, 186 Source LUN 198, 208, 217
SAN connectivit 113 source MDisk group
schedule 150, 162 extent size 259
schedule interval 73 Image Mode MDisks 261
Schedule Management 151 Source Target System 198, 207
schedule_create schedule 154, 163 source updating 187–188, 201, 219
SCSI initiator 191 source updating option 187
SDDPCM 237 SQL Server 45
Secondary 72, 77 SqlServerWriter 46
secondary site 131–135, 156, 166, 169–170 standby 128
mirror relation 137 state 77
remote mirroring 134 consistent 130
Role changeover 139–140 initializing 129
Index 283
remote mirroring peers 110
remote targets 108
Slave peer 108
Slave volume 108
volume copy 110
XIV remote mirroring 108–109
XIV system 107, 110
XIV systems 110–111
XIV 2 104, 107
additional Slave volume 104
consistency group 104
Disaster Recovery testing 109
DR servers 108
other functions 109
production applications 107
production changes 107
production workload 107–108
Slave peer 108
Slave volume 104, 109, 111
Unmap Master peers 108
XIV asynchronous mirroring 74, 83–84, 97–98, 149
XIV Command Line Interface (XCLI) 202
XIV GUI 76, 79, 126, 150, 244, 247–248
Host section 273
Pools section 273
XIV host port 248–249, 251
XIV mirroring 84, 86, 91–92, 98, 100, 102
Advantages 112
XIV remote mirroring 82–83, 85
normal operation 107–109
Planned deactivation 100
user deactivation 111
XIV Storage
System xi, 1, 5, 71, 113, 185–186, 243
XIV Storage System 46–47, 65–67, 72, 93, 112
primary IP address 50
snapshot operations 52
XIV subsystem 5–6
XIV system 2, 73–75, 150, 152, 154
available disk drives 92
available physical disk drives 93
mirroring connections 111
planned outage 82
same number 91
single Storage Pool 96
XIV mirroring target 86
XIV Snapshot function 85
XIV volume 68–69, 78, 92, 104, 149, 250–252
XIV VSS Provider
configuration 49
XIV VSS provider 41, 47
xiv_devlist 238
XIVPCM 237
Z
zoning 190–192, 197, 216
Learn copy and This IBM Redbooks publication provides a practical understanding of
the XIV Storage System copy and migration functions. The XIV Storage INTERNATIONAL
migration functions
System has a rich set of copy functions suited for various data TECHNICAL
and explore practical
protection scenarios, which enables clients to enhance their business SUPPORT
scenarios continuance, data migration, and online backup solutions. These ORGANIZATION
functions allow point-in-time copies, known as snapshots and full
Integrate Snapshots volume copies, and also include remote copy capabilities in either
with Tivoli FlashCopy synchronous or asynchronous mode. These functions are included in
Manager the XIV software, and all their features are available at no additional
charge. BUILDING TECHNICAL
Understand The various copy functions are reviewed under separate chapters that INFORMATION BASED ON
include detailed information about usage as well as practical PRACTICAL EXPERIENCE
SVC-based
illustrations.
migrations IBM Redbooks are developed
This book also discusses how to integrate the snapshot function with by the IBM International
IBM Tivoli FlashCopy manager, explains the XIV built-in migration Technical Support
capability, and presents migration alternatives based on the San Organization. Experts from
Volume Controller (SVC). IBM, Customers and Partners
from around the world create
timely technical information
based on realistic scenarios.
Specific recommendations
are provided to help you
implement IT solutions more
effectively in your
environment.