We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 0
EDUCATION
Page 1 Disk and Tape Backup Mechanisms
Copyright 2006 Storage Networking Industry Association. All Rights Reserved. Michael Fishman, EMC Disk and Tape Backup Mechanisms EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 2 SNIA Legal Notices The material contained in this tutorial is copyrighted by the SNIA Member companies and individuals may use this material in presentations and literature under the following conditions: Any slide or slides used must be reproduced without modification The SNIA must be acknowledged as source of any material used in the body of any document containing material from these presentations This presentation is a project of the SNIA Education Committee EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 3 About SNIA and the DMF About the Storage Networking Industry Association (SNIA) SNIAs primary goal is to ensure that storage networks become complete and trusted solutions across the IT community For additional information about SNIA see www.snia.org SNIAs Dictionary of Storage Networking Terminologyis online at www.snia.org/dictionary About the SNIA Data Management Forum (DMF) The DMF is a sub-group of SNIA acting as the worldwide authority on Data Management, Data Protection and ILM The DMF is a collaborative storage industry resource available to anyone responsible for the accessibility and integrity of their organizations information. DMF Data Protection Initiative (DPI) Information Lifecycle Management Initiative (ILMI) Long term Archive and Compliance Storage Initiative (LTACSI) Defining new approaches and best practices for data protection and recovery Developing, teaching and promoting ILM practices, implementation methods, and benefits Addressing challenges in developing, securing, and retaining long-term digital archives www.snia-dmf.org EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 4 Abstract Disk and Tape Backup Mechanisms Extending the enterprise backup paradigm with disk-based technologies allow users to significantly shrink or eliminate the backup time window. This tutorial focuses on various methodologies that can deliver an efficient and cost effective disk-to-disk-to-tape (D2D2T) solution. This includes approaches to storage pooling inside of modern backup applications, using disk and file systems within these pools, as well as how and when to utilize virtual tape libraries (VTL) within these infrastructures. EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 5 Disk and Tape Backup Mechanisms Fundamental concepts for modern, high- performance, scalable systems for Data Protection using networked storage An Overview of Backup Mechanisms for Storage Networks Conclusions Appendix EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 6 Data Protection Data protection is about data availability There are a wide variety of tools available to us to achieve that goal, including backup, restoration, replication and recovery, but it is critical to keep focused on the actual goal -- availability of the data -- and to balance how we achieve that goal by using the right set of tools for the specific job. Held in the balance are concepts like data importance or business criticality, budget, speed, and cost of downtime. EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 7 DDRR - The Process Detection Corruption or failure noted Diagnosis / Decision What went wrong? What recovery point should be used? What method of recovery -- overall strategy for the recovery? Restoration This phase involves moving data -- from tape to disk or disk to disk from the backup or archive (source) to the primary or production disks. Recovery Final phase:application environment perform standard recovery and startup operations -- log replays for a database or journals replays for a file system. EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 8 Concepts RPO - Recovery Point Objective - The maximum desired time period prior to a failure or disaster during which changes to data may be lost as a consequence of recovery. Data changes preceding the failure or disaster by at least this time period are preserved by recovery. Zero is a valid value and is equivalent to a "zero data loss" requirement RTO - Recovery Time Objective - The maximum desired time period required to bring one or more applications and associated data back to a correct operational state. EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 9 RPO & RTO
Previous Point-in-Time Data PIT Image Recovery Point Objective Recovery Time Objective RPO Drives RTO Application Restart Diagnose/ Decision Restore Recover Modifications Since Last PIT Image EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 10 Methodologies of Backup Cold Systems are offline for the period of time it takes to capture an image of the data in total. As backup window shrinks and data size expands, cold backup becomes untenable. If possible, this is the easiest and cheapest form of protection. Application Consistent Application supports ability to take pieces of overall data set offline for a period of time to protect it - application knows how to recover from a collection of individual consistent pieces. No downtime for backup window. Crash Consistent or Atomic Data can be copied or frozen at the exact same moment across the entire dataset. Application recovery from an atomic backup performs like a high availability failover. No backup window. EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 11 Protection Design Trade-off Whats most important: Backup Window (speed of backup) Recovery: RTO (speed of recovery, cost of downtime) Recovery: RPO (amount of data loss) There are trade-offs everywhere Newer technology minimizes trade-offs But they still exist!!! Need to identify the priority order, and establish SLA targets for each piece (Backup, RTO, RPO) Ease of use, audit-ability, and error recovery need to be considered too Pay me now, or pay me later! EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 12 Backup versus Mirroring Backup Protecting data by making copies or allowing copies to be generated from saved data Examples: snapshots, split mirrors, VTL, tape, CDP When? Multiple Recovery Points needed Recovery from data corruption Archival and indexing Mirroring/Replication Protecting data by moving the data, usually as it changes, to a remote copy. Synchronous or Asynchronous mirroring When? Disaster Recovery Time Objective (DR/RTO centric usually) Data Migration Content Distribution EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 13 Disk and Tape Backup Mechanisms Fundamental concepts for modern, high- performance, scalable systems for Data Protection using networked storage An Overview of Backup Mechanisms for Storage Networks Conclusions Appendix EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 14 B/R Topology Components = Agent Application Server Backup Server Storage Node Library Secondary Storage LAN Catalog Primary Storage File System Runs the backup engine to get the data from the platform to backup Collects the data and meta- data accordingly to the requested level of abstraction Device on which the data are written If plugged on a SAN, it is shareable. Thus the backup application must : authorize multiple attachements of this same physical device handle dynamic bus address change (aka serialization) Central point of administration and management for the backup application Usually concentrates all the meta- data information in a repository (aka catalog) that may be built from a collection of flat files to a real database May offer some redundancy or disaster recovery methods for securing the catalog Mechanical device moving removable media in secondary storage devices Made up of slots, mailbox, picker and locations for secondary storage devices Usually attached only once and driven from a single host The backup software may enable a logical split to create as many different logical smaller Librairies Collects the data from the agent to redirect them on a secondary storage device May offer some caching functionality for stream manipulation like : Multiplexing (aka fan-in) Multiple streams handling (aka fan-out) EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 15 Traditional LAN Backup = = Agent Application Server Backup Server Storage Node Library Secondary Storage LAN Data Meta-Data Tutor Server Catalog Primary Storage File System In this mode, the application server reads and sends the data over the LAN Data and meta-data go to the backup server LAN impacted by NAS, iSCSI, Backup & Mirroring/Replication! Application server impacted by primary storage I/O Various network protocols: NFS, CiFS, iSCSI, proprietary protocol, NDMP EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 16 Traditional Local Backup = Agent Application Server Backup Server Storage Node Library Secondary Storage LAN Tutor Server Catalog Primary Storage File System Data Meta-Data In this mode, the application server reads and writes the data locally Secondary storage statically attached to the application server Meta-data go to the backup server No LAN impact except meta-data (significant for millions of files) Application server impacted by both primary and secondary storage I/O May be done by proprietary protocol or NDMP EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 17 LAN-free Backup SAN In this mode, the application server reads and writes the data locally through the SAN Secondary storage is: Attached to the application server via the SAN Resources attached and shared among multiple application servers Dynamically allocated by the backup application Meta-data go to the backup server No LAN impact except meta-data (significant for millions of files) Application server impacted by both primary and secondary storage I/O = Agent Application Server Backup Server Storage Node Library Secondary Storage LAN Tutor Server Catalog Primary Storage File System Data Meta-Data EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 18 Application Server-free Backup = Agent Application Server Backup Server Storage Node Library Secondary Storage LAN Tutor Server Catalog Primary Storage File System Snapshot Data Meta-Data In this mode, the application server gives a snapshot of the primary storage volume to a tutor server that reads and sends the data over the LAN or SAN Tutor server must understand the volume structure Snapshot volume may be mounted on tutor server or extent list map created Meta-data (and perhaps data) go to the backup server Mirror : Application server impacted when re-silvering the mirror Or Snapshot (shared blocks) : Application server impacted by volume access EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 19 Server-free Backup Agent Application Server Backup Server Storage Node Library Secondary Storage LAN Tutor Server Catalog Primary Storage File System Snapshot Data Mover Data Meta-Data In this mode, the server delegates the primary storage I/O processing to a Data-mover device on the SAN, using SCSI Extended Copy (XCOPY or Third-Party Copy ) command. Stable image (snapshot) backup and extent list meta-data are necessary components of this. Tutor server must understand the volume structure Meta-data go to the backup server but with less granularity No LAN impact there are fewer meta-data Application server impacted when re-silvering the mirror or by snapshot volume access No storage node impact SAN EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 20 Monday Monday Agent Saturday Saturday Backup Server Disk and Tape IP Tuesday Tuesday Thursday Thursday Wednesday Wednesday Full Backup -- Everything copied to backup (cold or hot backup) Restoration straight-forward for cold backup (and, typically, hot backup) Huge resource consumption (server, network, tapes) Incremental Backup -- Only the data that changed since last full/incremental copied Massive reduction in data moved/copied Can lead to nightmares in restoration and recovery Friday Friday Full Backup Incremental Backups Full and Incremental Backup EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 21 Incremental Monday Incremental Monday Tues. Tues. Wed. Wed. Friday Friday Agent Full Backup on Saturday : Production ServerResource Consumption Network Bandwidth Consumption Full Backup on Saturday : Production ServerResource Consumption Network Bandwidth Consumption Backup Server Disk and Tape Incremental Monday Incremental Monday Tues. Tues. Wed. Wed. Thurs. Thurs. Friday Friday IP Full Synthetic Backup on Saturday Full Synthetic Backup on Saturday Thurs. Thurs. Use Information from full and incrementals to reconstitute a full backup No server involvement Minimize network resource consumption Synthetic Full Backup EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 22 Tape Technology Sequential technology Versus random access Can be sequestered Away from prying hands, DR Legal - regulated industry? Media replacement costs Tape life, reusability Reliability Performance and Utilization Streaming and multiplexing Libraries and Robotics Removability EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 23 Application Server Storage Node LAN Primary Storage File System Secondary Storage Traditional approach: Data go over LAN from the Primary Storage to the Secondary Storage The Secondary Storage is composed of a Media Manager and Tape Drives Typically managed by a B/R application Tape Backup EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 24 Application Server Storage Node LAN Primary Storage File System Secondary Storage Tape Backup: Pros and Cons Pros: Lower costs Established process Off-site media storage Cons: Performance Device sharing Reliability of devices and media Risk of Tape loss EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 25 Tape challenges for Enterprise customers Speed and Reliability Need to reduce backup window Need faster, more reliable restores Management Media management of tape-based B/R program can be complex Cost ROI analysis of hard and soft costs (Equipment costs, maintenance, media, down-time) Device sharing Complexity, schedule risk, resource conflicts EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 26 Backup to Disk (B2D) Replaces primary backup target with disks Advantages: Speed (Backup and Restore) Scale Fewer shared devices Fibre Channel Disks versus ATA versus SAS I/O per second random access, MB/s sequential Large #of small I/Os, Small numbers of large I/Os Backup to disk may require updates to backup software or extra modules EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 27 Originated in mainframe environments Multiple applications Many data sets with long data life cycle times Integration into Open Systems Designed to avoid changes to backup software Packaged, formatted, optimized Eliminate media handling, improves BU and Restore Tape Virtualization EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 28 Virtual Tape Advantages Speed and reliability Single stream performance exceeding tape drives Aggregate performance without multiplexing No mechanical failures, No robots Inherent RAID protection Management Plug and play in existing back-up environment Appears as open systems tape cartridges, drives, libraries Compatible with existing backup & restore operations Easily integrates with todays off-site processes Cost Cost effective & scalable Leverage lower cost disks Reduced device sharing Reduces complexity, schedule risk, resource conflicts EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 29 SAN LAN NAS Constant change and heterogeneity in technologies Operating Systems Disk Storage Appliances Network Architectures / Topologies Tape Storage Devices Challenge Protect mission-critical data Timely backup and restore Administration overhead Optimize storage resources Backup Scaling EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 30 Scalability & Flexibility Leverage architecture Snapshot facilities SAN, LAN Heterogeneity Multiple platforms (HW/ OS) Multiple tape drives & libraries Multiple applications NAS and SAN Open Use of standards API, command line Advanced tape management Tape mirroring Off-site storage Advanced library management Sharing, splitting Port handling Security Authentication DMZ Firewall support EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 31 Manageability Accounting Reporting about Protection services Allows internal or external billing Centralized Administration GUI & smart interface Backup strategies Scheduling Media management Easy Installation and deployment Centralized Supervision Real-time monitoring Alarms Event log SNMP compliant, integration with Frameworks EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 32 Performance Smart recovery functions Most recent image Consistent view =true image Minimize downtime : time to diagnose time to restore Lost time + downtime =total loss time Performance Caching on storage node for multiplexing and stream management Synthetic full backup offlload backup engine faster backup = Incrementals faster restore = restore from full session LAN-free full backup from incremental Manageable by user User profile and authentication Delegate: not only an administrator tool EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 33 Many thanks to the following individuals for their contributions to this tutorial: SNIA Data protection Initiative Nancy Clay SNIA Data Management Forum Mike Rowan SNIA Tech Council SW Worth Please send any comments on this tutorial to SNIA: [email protected] - Find a passion - Join a committee - Gain knowledge & influence - Make a difference www.snia-dmf.org Get I nvol ved ! EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 34 Thank you for your feedback Questions and Answers EDUCATION APPENDIX www.snia.org/education/tutorials EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 36 Backup Definition A collection of data stored on (usually removable) non-volatile storage media for purposes of recovery in case the original copy of data is lost or becomes inaccessible. Also called backup copy. To be useful for recovery, a backup must be made by copying the source data image when it is in a consistent state or contains elements and information enabling a consistent state to be recovered. Source: SNIA Dictionary EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 37 Secondary Storage Standard Data Formats Posix 1003.1 Archive/Interchange File Format also well known as tar or cpio Pros : implemented and available on all Unix systems Cons : meta-data limited to posix system implementations and designed for sequential unaltered file level data (no streams, no compression, no multiplexing, etc.) SIDF, System Independent Data Format, focuses on representing system data and file data and meta-data for all types of operating systems and platforms in a media independent common format. Pros : handle all kinds of stream manipulation and meta-data forms with tags Cons : used privately in backup products, i.e. no native command on systems yet http://www.cs.wisc.edu/~jgast/sidf/ EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 38 Secondary Storage Proprietary Data Formats Server vendor based (usually at volume or lower file system level) Pros : maximum support of vendor specific features (volume specificities, unique meta-data, etc.) Cons : no possible interchange with other platforms Backup software vendor based Pros : maximum support of vendor specific features like multiplexing, compression, etc. all kinds of implemented stream manipulation Cons : no possible interchange with other products and must be read via the backup software (may be processed sometimes to be read in standard format or standalone command available) EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 39 Network Data Management Protocol NDMP is a general open network protocol for controlling the exchange of data between two parties Partition the problem between vendors Each vendor implements solutions in compliance with this protocol: Data Server / Tape Server / Client Enable best of breed combinations of multi-vendor products in customer solutions Enhanced interoperability Vendor focus on core competencies Improve user offering reduce time-to-market http://www.ndmp.org EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 40 B/R Application Disk Support First Backup-to-Disk approach: Data go over LAN from the Primary Storage to the Secondary Storage The Secondary Storage is composed of Disk Drives managed by the B/R Application Storage maybe block or file based with raw, filesystem or sequential format Application Server Storage Node LAN Primary Storage File System Secondary Storage EDUCATION Disk and Tape Backup Mechanisms2006 Storage Networking Industry Association. All Rights Reserved. 41 Virtualization Approach Virtualization approach Data sent over LAN from the Primary Storage to the Secondary Storage The Secondary Storage is an appliance filled with Disk Drives It behaves like a regular Media Manager and Tape Drives Application Server Storage Node LAN Primary Storage File System Secondary Storage