DBA-325-A - SQLCAT - SQL - Server - HA - and - DR - Design - Patterns - Architectures - and - Best - Practices - Using - SQL - Server - 2012
DBA-325-A - SQLCAT - SQL - Server - HA - and - DR - Design - Patterns - Architectures - and - Best - Practices - Using - SQL - Server - 2012
Disaster Recovery
Design Patterns and Architectures
Sanjay Mishra, Mike Weiner, Justin Erickson
Program Managers
Microsoft Corporation
Source: www.dilbert.com
SQLCAT (Customer Advisory Team)
The SQL Server Customer Advisory Team (SQL CAT) represents the
customer-facing resources from the SQL Server Product Group. SQLCAT is
comprised of product and solution experts that regularly engage in the
largest, most complex, and most unique customer deployments worldwide.
4) WSFC Mounts
Physical Disk Resources
2) Storage
software detects
storage
replication state
3) Storage
software sets
storage to read
write enable
Storage
Storage
Replication
Replication
5) Replication reversed
Architecture Enhancements – Multi-site Cluster
• Multi-site FCI Solution does not require passing the storage validation tests, to be
supported. http://support.microsoft.com/kb/943984
dd “MultiSubnetFailover=True” in
connection string
Witness
Synchronous
Database
Mirroring
Log Shipping
SQL Server Codename “Denali”
Replace Database Mirroring and Log Shipping with Availability Group
Synchronous
Synchronous / Asynchronous
Note: More secondaries (total upto 4) can be added for additional resiliency or read scaleout
Considerations
All SQL servers (including the secondary in the DR site) in the same Windows
domain
• One Windows Server Failover Cluster spreads over the primary and DR sites
All the databases must be in FULL recovery model
The unit of failover (for local HA, as well as DR) is at the AG level, i.e., group of
databases – not the instance
• Consider using Contained Database for containing logins for failover
• For jobs and other objects outside the database, simple customization needed
No delayed apply on the secondary
Removing log shipping means the regular log backup job is removed
• Need to re-establish periodic log backup (essential for truncating the log)
• New tools for monitoring and alerting
• AlwaysOn Dashboard
• System Center Operations Manager
Client Connectivity
Fileshare Witness
Availability Group
Primary Secondary
1 vote 1 vote Secondary
0 vote
Synchronous
Note: The Fileshare Witness always has 1 vote. So, the above WSFC has 3 votes.
DR Scenario
Database
Mirror
SQL-FCI-1 SQL-FCI-2
Mirroring
SQL Server Codename “Denali”
Replace Database Mirroring with Availability Group
SQL-FCI-1 SQL-FCI-2
Secondary
Availability
Primary
Group
Considerations
One Windows Server Failover Cluster spreads over the primary and DR sites,
encompassing the two FCIs
• New ways to look at setup, quorum models, DR operations, etc.
The DR failover unit is at the AG level, i.e., group of databases – not the
instance
• Consider using Contained Database for containing logins for failover
• For jobs and other objects outside the database, simple customization
needed
• New tools for monitoring and alerting
• AlwaysOn Dashboard
• System Center Operations Manager
• Pre-requisite Windows Service packs / QFEs:
• Asymmetric Storage
• Windows Server 2008 with http://support.microsoft.com/kb/976097
• OR, Windows Server 2008 R2 SP1
• Node Votes: http://support.microsoft.com/kb/2494036
• Validate disk test QFE: http://support.microsoft.com/kb/2531907
Asymmetric Storage
Primary Site
Node 1 Node 2
SQL-FCI-1
Primary
Add Nodes:
Validation Tests pass with warnings
SQL-FCI-1
Primary
Add Disks
Install the second FCI (SQL-FCI-2)
SQL-FCI-1 SQL-FCI-2
Secondary
Primary
Create AG between the two FCIs
SQL-FCI-1 SQL-FCI-2
Secondary
Availability
Primary
Group
Client Connectivity
Secondary
SQL-FCI-1 Availability SQL-FCI-2
Primary
Group
Quorum Configuration: Quorum Model
Secondary
Availability
Primary SQL-FCI-2
SQL-FCI-1 Group
Asymmetric Disk as Quorum Resource
• Prior to Asymmetric Disk capability, for a disk to be a
cluster resource (and a quorum resource) it was
required to be visible from all the nodes.
• With Asymmetric Disk capability, a cluster disk can be
visible to a subset of nodes.
• Asymmetric Disk can be used a quorum resource:
• Not through Failover Cluster Manager GUI, or PowerShell
• But through cluster.exe command line
• Asymmetric Disk as quorum resource enables quorum
models:
• Node + Asymmetric Disk Majority
• Asymmetric Disk Only
FCI + AG Configuration with Asymmetric
Disk-Only Quorum Model
Secondary
Availability
Primary SQL-FCI-2
SQL-FCI-1 Group
Asymmetric Disk-Only
Quorum
Quorum Configuration
Secondary
Availability
SQL-FCI-1 SQL-FCI-2
Primary
Group
Recap: Denali HA+DR Solutions
Corresponding
Denali HA+DR Solution Solution Characteristics
Existing Solution
• Shared Storage solution *
Multi-site Failover • Instance Level HA
Multi-site FCI using
1 Cluster Instance (FCI) • Instance Level DR
stretch VLAN
for HA and DR • Doesn’t require database to be in
FULL recovery model