The Availability Digest
November 2006

Asynchronous Replication Engines

Active/Active Systems
A fundamental tenet of active/active systems is that there are at least two copies of the application database in the network. These copies, and the processing nodes that use them, should be geographically distributed so that the application can tolerate problems affecting a wide area.

The database copies must be kept in synchronism so that any node can use any database copy and end up with the same result. This means that as soon as a change is made to one of the copies, it must be replicated to the other database copies in the application network. There are many ways to do this, including:

- asynchronous replication
- synchronous replication
- transaction replication
- network transactions

In this article, we will look at the techniques and issues of asynchronous replication. We will talk about the other methods in later articles.

[Figure: An Active/Active System. Node A and Node B, each with an application database, connected by replication.]

The Replication Engine

With asynchronous replication, the source and target systems are loosely coupled. Changes made to the source database are queued for replication to the target database. At some later time (which may be only milliseconds later), a replication engine picks up the changes and sends them to the target system, where they are applied to the target database. The source applications are in effect unaware of this transfer of data changes and are unaffected by replication.

Asynchronous replication is done via an asynchronous replication engine.[1] Although there are many forms of replication engines in the marketplace, by and large they follow the same general architecture in order to replicate data from a source database to a target database.

[1] Asynchronous replication engines are described in great detail in the book entitled Breaking the Availability Barrier: Survivable Systems for Enterprise Computing, by Dr. Bill Highleyman, Paul J. Holenstein, and Dr. Bruce Holenstein.

2006 Sombers Associates, Inc., and W. H. Highleyman www.availabilitydigest.com

A replication engine typically depends upon some form of a real-time log of changes. We will call this log the Change Queue. Information describing each change made to the source database is entered into the Change Queue either by the database manager, by triggers in the database, or by the application itself. The Change Queue is usually disk-resident so that the replication engine can recover from node or engine failures.

An Extractor process follows the tail of the Change Queue and extracts the description of a data change as it is logged. It sends the data change information to the target system so that it can be applied to the target database.

Data changes may be sent directly to the target system by the Extractor, or they may first pass through another disk queue used for recovery purposes. In the latter case, the Extractor will write the change to the disk-resident Recovery Queue. A Transmitter process then reads the Recovery Queue and sends the data to the target system.
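The source-side flow described above (Change Queue, Extractor, Recovery Queue, Transmitter) can be sketched in a few lines of Python. This is only an illustration under simplifying assumptions: the queues are held in memory rather than on disk, and the class names, the `Change` fields, and the `send` callback are hypothetical choices made for this sketch, not the interface of any particular product.

```python
from collections import deque
from dataclasses import dataclass


@dataclass(frozen=True)
class Change:
    seq: int            # position of the change in the Change Queue
    tx_id: int          # transaction that made the change
    op: str             # e.g. "update" or "commit" (names assumed here)
    key: str = ""
    value: object = None


class ChangeQueue:
    """Append-only log of changes (in memory here; disk-resident in practice)."""

    def __init__(self):
        self._log = []

    def append(self, tx_id, op, key="", value=None):
        change = Change(len(self._log), tx_id, op, key, value)
        self._log.append(change)
        return change

    def read_from(self, position):
        """Return every change logged at or after `position`."""
        return self._log[position:]


class Extractor:
    """Follows the tail of the Change Queue and feeds the Recovery Queue."""

    def __init__(self, change_queue, recovery_queue):
        self.change_queue = change_queue
        self.recovery_queue = recovery_queue
        self.position = 0   # next Change Queue entry to read

    def poll(self):
        new_changes = self.change_queue.read_from(self.position)
        self.recovery_queue.extend(new_changes)
        self.position += len(new_changes)
        return len(new_changes)


class Transmitter:
    """Drains the Recovery Queue and hands each change to a send function."""

    def __init__(self, recovery_queue, send):
        self.recovery_queue = recovery_queue
        self.send = send

    def drain(self):
        sent = 0
        while self.recovery_queue:
            self.send(self.recovery_queue.popleft())
            sent += 1
        return sent


# Small demonstration: `received` stands in for the target system.
change_queue = ChangeQueue()
recovery_queue = deque()
received = []

extractor = Extractor(change_queue, recovery_queue)
transmitter = Transmitter(recovery_queue, received.append)

change_queue.append(1, "update", "acct-17", 250)
change_queue.append(1, "commit")
extracted = extractor.poll()
transmitted = transmitter.drain()
```

Because the Extractor remembers its position in the Change Queue, a second call to `poll()` with no new changes moves nothing. In a real engine that position would itself have to be persisted so that extraction can resume after a failure.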

[Figure: Asynchronous Replication Engine. Source system: application, source database, Change Queue, Extractor, Recovery Queue, Transmitter. Target system: Receiver, Apply Queue, Applier, target database.]

At the target system, change information may be received directly by an Applier process, which updates the target database with the changes. Alternatively, the change information may be written to another intermediate disk queue which is used to control the sequence of changes applied to the target database. The Applier can use this Apply Queue to filter out aborted transactions and to apply transactions in their original order to guarantee referential integrity.
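As a rough sketch of how an Applier might use an Apply Queue, the function below buffers the changes of each transaction until its outcome is known, discards aborted transactions, and applies committed ones in the order in which their commits were logged. The tuple layout and the operation names (`"put"`, `"delete"`, `"commit"`, `"abort"`) are assumptions made for this illustration.

```python
from collections import defaultdict


def apply_from_queue(apply_queue, target_db):
    """Apply committed transactions to target_db in commit order.

    apply_queue is an iterable of (tx_id, op, key, value) tuples in the
    order in which they were logged at the source.
    """
    pending = defaultdict(list)   # tx_id -> changes whose outcome is unknown
    applied = []                  # tx_ids in the order they were applied

    for tx_id, op, key, value in apply_queue:
        if op == "commit":
            # The transaction committed: apply its buffered changes now.
            for change_op, change_key, change_value in pending.pop(tx_id, []):
                if change_op == "put":
                    target_db[change_key] = change_value
                else:  # "delete"
                    target_db.pop(change_key, None)
            applied.append(tx_id)
        elif op == "abort":
            # Filter out the aborted transaction entirely.
            pending.pop(tx_id, None)
        else:
            pending[tx_id].append((op, key, value))
    return applied


# Transaction 3 aborts; the parent row (tx 1) commits before its child (tx 2).
queue = [
    (1, "put", "parent:7", {"name": "order 7"}),
    (2, "put", "child:7-1", {"parent": "parent:7"}),
    (3, "put", "parent:8", {"name": "order 8"}),
    (3, "abort", None, None),
    (1, "commit", None, None),
    (2, "commit", None, None),
]
target = {}
order = apply_from_queue(queue, target)
```

After this run the target holds the parent row and its child, applied in commit order, and nothing from the aborted transaction. Changes belonging to transactions that have neither committed nor aborted simply stay buffered.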

Replication Latency
One of the most important characteristics of a data replication engine is its replication latency. Replication latency is the time that it takes for a change to propagate from the source database to the target database. As we shall see later, replication latency creates some of the undesirable characteristics of data replication. These characteristics include data loss following a node failure, data collisions, and reduced application performance if synchronous replication is being used.[2]

Many replication engines have replication latencies measured in fractions of a second. Others require many seconds or more to propagate changes. There are many causes of replication latency, including:

- The number of disk queuing points in the replication path.
- Buffering delays at the communication channel as changes are batched to improve communication efficiency.
- The speed of the communication channel.
- Whether the processes that follow the tail of a disk queue are event-driven or use polling.
- If disk-reading processes are poll-driven, the length of their polling interval.
- The delay required to reserialize commits to guarantee referential integrity.

It is important to choose a replication engine whose replication latency is satisfactory for the application.
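To get a feel for how the causes listed above add up, a back-of-the-envelope estimate can be written as a small function. This is a simplified model and not a measurement: it assumes the delays are independent and simply additive, and that a poll-driven process waits on average half its polling interval before it sees a new change. All of the numbers in the example are invented for illustration. The second function uses the rate-times-delay relation (Little's law) to estimate how many changes are in the replication pipeline at any instant.

```python
def estimate_latency_ms(disk_queue_delays_ms, polling_intervals_ms,
                        batching_delay_ms, channel_delay_ms,
                        reserialization_delay_ms=0.0):
    """Sum the delay contributions to replication latency (all in ms).

    A poller that wakes every T ms finds a change that arrived at a
    random moment after waiting T/2 on average, so each polling
    interval contributes half its length.
    """
    polling_wait = sum(interval / 2 for interval in polling_intervals_ms)
    return (sum(disk_queue_delays_ms) + polling_wait + batching_delay_ms
            + channel_delay_ms + reserialization_delay_ms)


def changes_in_pipeline(update_rate_per_s, latency_ms):
    """Average number of changes in flight: update rate times latency."""
    return update_rate_per_s * latency_ms / 1000.0


# Illustrative figures only: two disk queues, two pollers at 100 ms each.
latency = estimate_latency_ms(
    disk_queue_delays_ms=[5, 5],
    polling_intervals_ms=[100, 100],
    batching_delay_ms=20,
    channel_delay_ms=30,
    reserialization_delay_ms=10,
)
in_flight = changes_in_pipeline(update_rate_per_s=200, latency_ms=latency)
```

With these made-up inputs the polling wait (100 ms in total) dominates the 170 ms estimate, which is why event-driven queue readers are attractive. At 200 updates per second, about 34 changes would be in the pipeline at any moment.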

Bidirectional Replication
The replication engine which we described above is a one-way replication engine. Of course, for active/active systems, replication has to be bidirectional. This is accomplished simply by configuring two replication engines, one replicating in each direction. If there are more than two nodes in an active/active network, a bidirectional pair of replication engines is required between enough of the nodes to create a fully connected network. Note that not all nodes in an active/active network need have database copies, and some nodes might be configured to forward changes to other nodes.

[Figure: Bidirectional Replication. Two replication engines, one in each direction, connect the two application databases.]
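A bidirectional configuration must not send a change back to the node it came from (the ping-ponging issue discussed later in this article). One common way to arrange this, shown here only as a sketch, is to tag every logged change with its node of origin and have each one-way engine skip changes that originated at its target. The `Node` class, the tagging scheme, and the `replicate` function are assumptions for illustration, not the mechanism of any specific product.

```python
class Node:
    def __init__(self, name):
        self.name = name
        self.db = {}
        self.change_queue = []   # (origin, key, value) for every change made here

    def local_update(self, key, value):
        """A change made by an application running on this node."""
        self.db[key] = value
        self.change_queue.append((self.name, key, value))

    def apply_replicated(self, origin, key, value):
        """A change applied by the replication engine; it keeps its origin tag."""
        self.db[key] = value
        self.change_queue.append((origin, key, value))


def replicate(source, target, start=0):
    """One-way engine from source to target.

    Skips any change that originated at the target, so it is never
    returned to the system it came from.
    Returns (number of changes sent, next position to read from).
    """
    sent = 0
    for origin, key, value in source.change_queue[start:]:
        if origin == target.name:
            continue   # would be ping-ponging: do not send it back
        target.apply_replicated(origin, key, value)
        sent += 1
    return sent, len(source.change_queue)


# Two engines, one in each direction, between nodes A and B.
a, b = Node("A"), Node("B")
a.local_update("x", 1)
sent_ab, pos_a = replicate(a, b)   # A's change travels to B
sent_ba, pos_b = replicate(b, a)   # B's queue holds only A's change: nothing sent
```

Without the origin check, the change logged at B by the replication engine would be picked up by the B-to-A engine and applied again at A, and the two engines would echo it back and forth indefinitely.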

Advantages of Asynchronous Replication

Since an asynchronous replication engine feeds off a Change Queue that may be created by the application anyway (such as a change log created by a database manager), the replication process is totally transparent to the applications. It happens under the covers, and except for a small additional processor load required to support the replication engine, it has no impact on the application.

In addition, asynchronous replication is totally noninvasive. No changes to the application are required. Replication engines are off-the-shelf products that simply plug in.

One exception to the above is if there is no inherent Change Queue created by the application. In this case, the application must be modified to create a change log of some sort. This will also increase the processing load somewhat.

[2] A detailed performance analysis for replication engines is provided in the forthcoming book entitled Breaking the Availability Barrier: Achieving Century Uptimes with Active/Active Systems, by Paul J. Holenstein, Dr. Bill Highleyman, and Dr. Bruce Holenstein.

Of course, the most obvious advantage of an active/active system is the ability to switch users rapidly to surviving nodes should a node fail. However, this is a characteristic of the active/active architecture. Each of the data replication methods mentioned above provides this capability.

Asynchronous Replication Issues

There are several considerations that must be taken into account when contemplating asynchronous replication for active/active systems. These include referential integrity, ping-ponging, data loss following a node failure, and data collisions.

[Figure: Asynchronous Replication Issues. Panels: ping-ponging, lost data, data collision.]

Referential Integrity

The data replication engine must guarantee referential integrity. Only then can all database copies be used for application processing. If a data replication engine does not guarantee that transactions are applied to the target database in the same order that they were applied to the source database, the database copy will not be consistent. Child rows may exist without parents. Indices may point to nothing. New data may be overwritten by old data. This is particularly a problem with hardware replication schemes in which a physical data block is replicated without regard to transaction boundaries. It is also a characteristic of some software-based replication engines.

Ping-Ponging

In order to run in a bidirectional configuration, the replication engine must prevent ping-ponging. Ping-ponging occurs when a change that has just been received from the source system is replicated back to that same source system.

Data Loss Following a Node Failure

Should a node containing a database copy fail, it is likely that there will be changes still in the replication pipeline at the time of failure. These changes will never be propagated to the target system and will be lost. There is no way to recover them unless the failed node can be restored quickly.[3] This is one case in which replication latency is important. The longer the replication latency, the more likely it is that data will be lost following a node failure.

[3] HP NonStop servers support a remote mirror of their audit trail. The remote mirror guarantees that no transactions will be lost following a node failure. HP calls this configuration ZLT for Zero Lost Transactions.

Data Collisions

A data collision occurs if two users update the same data item at two different database copies within the replication latency interval. In this case, each new value of the data item will be replicated to the other system and will overwrite the original change made at that system. As a result, the database copies are different and both are wrong. This is another case in which replication latency is important. The shorter the latency time, the less likely it is that there will be a data collision.[4]

There are several ways to attack the problem of data collisions:

Avoidance: Data collisions can be avoided by:

- Partitioning the database so that a particular data item is always updated on a designated database copy. Those changes are then replicated from that copy to the other database copies in the application network.
- Creating a master node to which all updates are directed. The master node then replicates changes to the other nodes in the network.
- Using relative replication to replicate operations on the data rather than replicating the final value of the data item itself (for instance, add 10 or subtract 5).
- Using synchronous replication as described in our next article.

Detection and Resolution: Collisions can be detected by comparing the version of the data item to be updated to the version of the update. If they differ, a collision has occurred.[5] There are several options for resolving a detected collision:

- Establish a node precedence. The node with the highest precedence wins, and its data value is accepted.
- Use data content to resolve a collision. For instance, the update with the most recent timestamp may win.
- Ignore collisions if the database will be self-correcting over time due to other noncolliding updates.
- Ignore collisions and periodically resynchronize the databases to one designated as the database of record.
- If all else fails, collisions must be resolved manually.
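The version comparison and two of the resolution options above can be illustrated with a short sketch. It takes one possible reading of the version check: each replicated update carries the version of the row it was made against, and a collision is flagged when that differs from the current version of the row at the target. The `Row` and `Update` classes, the `NODE_PRECEDENCE` table, and the policy names are hypothetical and exist only for this example.

```python
from dataclasses import dataclass


@dataclass
class Row:
    value: object
    version: int        # incremented on every update
    updated_at: float   # timestamp of the last update


@dataclass
class Update:
    key: str
    new_value: object
    base_version: int   # version of the row the update was made against
    updated_at: float
    origin: str         # node where the update was made


NODE_PRECEDENCE = {"A": 2, "B": 1}   # hypothetical: higher number wins


def apply_update(db, local_node, update, policy="precedence"):
    """Apply a replicated update; return True if a collision was detected."""
    row = db[update.key]
    collision = row.version != update.base_version
    if collision:
        if policy == "precedence":
            incoming_wins = (NODE_PRECEDENCE[update.origin]
                             > NODE_PRECEDENCE[local_node])
        else:   # "timestamp": the most recent update wins
            incoming_wins = update.updated_at > row.updated_at
        if not incoming_wins:
            return True   # keep the local value
    db[update.key] = Row(update.new_value, update.base_version + 1,
                         update.updated_at)
    return collision


def collide(policy):
    """Both nodes update item 'a' from version 1 within the latency window."""
    db_a = {"a": Row(110, 2, 10.0)}   # node A changed 100 -> 110 at t=10
    db_b = {"a": Row(95, 2, 11.0)}    # node B changed 100 -> 95 at t=11
    from_a = Update("a", 110, 1, 10.0, "A")
    from_b = Update("a", 95, 1, 11.0, "B")
    hit_b = apply_update(db_b, "B", from_a, policy)
    hit_a = apply_update(db_a, "A", from_b, policy)
    return db_a["a"].value, db_b["a"].value, hit_a, hit_b


by_precedence = collide("precedence")
by_timestamp = collide("timestamp")
```

In both runs the collision is detected at each node and the two copies end up with the same value: node A's value under node precedence, and node B's later update under the timestamp rule. What matters is that every node applies the same rule, so that the copies converge.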

Products
There are several off-the-shelf replication engines that purport to support active/active architectures.[6] Following is only a partial list of the many available products. Be aware that some vendors may redefine the term active/active to fit their products' capabilities, so be sure to analyze the characteristics of a data replication engine before committing to its use.

- Streams from Oracle (www.oracle.com)
- TimesTen from Oracle (memory-to-memory replication) (www.oracle.com)
- DRNet from Network Technologies (www.network-tech.com)
- Shadowbase from Gravic (www.gravic.com)
- GoldenGate for Active/Active from GoldenGate (www.goldengate.com)
- Sun Cluster Geographic Edition (Availability Suite) from Sun (www.sun.com)
- Metro Mirror (for Parallel Sysplex) from IBM (www.ibm.com)
- Global Mirror (for Parallel Sysplex) from IBM (www.ibm.com)
- RepliStor from EMC Legato (software.emc.com)
- Cluster Replica SQL for MSSQL from XLink (www.xlink.com)
- NSI Double-Take from Double-Take (www.doubletake.com)
- DataXtend RE from Progress (www.progress.com)
- MetiLinx Database Suite (for MySQL) from MetiLinx (www.metilinx.com)
- Colada from MARSYS (www.marsys.com)

[4] The probabilities of data collisions for different circumstances are derived in Chapter 9, Data Conflict Rates, in the book entitled Breaking the Availability Barrier: Survivable Systems for Enterprise Computing, referred to earlier.

[5] The handling of data collisions is extensively covered in the Breaking the Availability Barrier series of books referenced earlier. See especially Chapter 4, Active/Active and Related Technologies, in the forthcoming book, Breaking the Availability Barrier: Achieving Century Uptimes with Active/Active Systems.

[6] Others are also listed in Appendix 4, Implementing a Data Replication Project, in the book entitled Breaking the Availability Barrier: Survivable Systems for Enterprise Computing, referred to earlier.

Summary
Asynchronous replication is by far the most popular replication technique used in today's active/active systems. It is fast, nonintrusive, and operates under the covers. It does have its issues, which must be understood before moving to an asynchronous active/active environment. These issues include the assurance of referential integrity of the replicated data, lost data following a node failure, data collisions, and minimizing replication latency.

Many products that support asynchronous replication are available today. There is a product for virtually any platform, and many products are heterogeneous in that they can replicate between disparate platforms. Active/active is here today, as are the products that support it.
