Optimizing RMAN Recovery
[Diagram: RMAN components: DBA, RMAN client, channels, control files, RMAN repository catalog (optional); operations shown: Backup and Backup Validate, Restore and Restore Validate]
Validate Functionality
• Backup Validate
– Read data blocks from primary disk
– RMAN validation
– No write or compression/encryption
• Restore Validate
– Read backup from backup storage
– Decompress and decrypt if needed
– RMAN validation
– No write to primary disk
Quick comparison with Validate
• This chart compares elapsed time only:
Backup Validate vs. Backup, and
Restore Validate vs. Restore
Command NFS Filer 1 NFS Filer 2 +FRA on ASM
Example – Backup Validate
SESSION_EVENT                  AVG(TIME_SEC)
------------------------------ -------------
RMAN backup & recovery I/O           19.4375
SQL*Net message from client           6.9725
events in waitclass Other               6.46
Example – Backup
• Backup – 9:47 Minutes (587 seconds)
RMAN> backup filesperset 1 tablespace pete_ts;
Starting backup at 27-Jan-2016 09:28:43 …
Finished backup at 27-Jan-2016 09:38:30
SESSION_EVENT                  AVG(TIME_SEC)
------------------------------ -------------
RMAN backup & recovery I/O          517.3025
SQL*Net message from client          48.2275
events in waitclass Other             5.4375
Example – Restore Validate
• Restore Validate – 8:00 Minutes (480 Seconds)
RMAN> restore tablespace pete_ts validate;
Starting restore at 31-Jan-2016 13:20:25 …
Finished restore at 31-Jan-2016 13:28:25
SESSION_EVENT                  AVG(TIME_SEC)
------------------------------ -------------
RMAN backup & recovery I/O            422.92
SQL*Net message from client            46.56
Example – Restore
• Restore – 8:31 Minutes (511 Seconds)
RMAN> restore tablespace pete_ts;
Starting restore at 31-Jan-2016 13:29:55 …
Finished restore at 31-Jan-2016 13:38:26
SESSION_EVENT                  AVG(TIME_SEC)
------------------------------ -------------
RMAN backup & recovery I/O          447.5975
SQL*Net message from client          39.2575
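The elapsed times quoted in the example slides can be rechecked from the Starting/Finished timestamps in the RMAN output. The sketch below is only an illustration: the helper names `elapsed_seconds` and `as_min_sec` are made up for this example, and it assumes an English locale so that `%b` parses month names such as "Jan".

```python
from datetime import datetime

# Timestamp layout seen in the RMAN output, e.g. "27-Jan-2016 09:28:43".
# Assumption: English month abbreviations (C/English locale).
RMAN_TS_FORMAT = "%d-%b-%Y %H:%M:%S"


def elapsed_seconds(started: str, finished: str) -> int:
    """Whole seconds between a 'Starting ...' and a 'Finished ...' timestamp."""
    start = datetime.strptime(started, RMAN_TS_FORMAT)
    end = datetime.strptime(finished, RMAN_TS_FORMAT)
    return int((end - start).total_seconds())


def as_min_sec(seconds: int) -> str:
    """Format seconds as M:SS, the way the slides quote elapsed time."""
    minutes, secs = divmod(seconds, 60)
    return f"{minutes}:{secs:02d}"


# Timestamps copied from the three example slides.
runs = {
    "backup": ("27-Jan-2016 09:28:43", "27-Jan-2016 09:38:30"),
    "restore validate": ("31-Jan-2016 13:20:25", "31-Jan-2016 13:28:25"),
    "restore": ("31-Jan-2016 13:29:55", "31-Jan-2016 13:38:26"),
}

for name, (started, finished) in runs.items():
    secs = elapsed_seconds(started, finished)
    print(f"{name}: {as_min_sec(secs)} ({secs} seconds)")
```

The three results agree with the slide titles: 9:47 (587 seconds), 8:00 (480 seconds), and 8:31 (511 seconds).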
Example – Summary
Time (sec)     Backup Validate    Backup    Restore Validate    Restore
CPU                      13.00     24.00                8.00      24.00
IO Input                 19.60      0.32              422.92     447.60
IO Output                  N/A    517.39                 N/A       0.03
Other Waits              13.43     53.67               46.56      39.26
Total                    46.03    595.37              477.48     510.89
Actual                   45.00    587.00              480.00     511.00
• Questions
– Why is there no IO Input time for backup?
– Why is there no IO Output time for restore?
Backup breakdown:
13 Validate CPU
24 – 13 = 11 Other CPU
48 – 20 (read IO) = 28 Misc Waits
==============
589 Total Time
Restore breakdown:
8 = Validate CPU
24 – 8 = 16 Build CPU
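The arithmetic behind the summary can be rechecked with a short script. This is a sketch only: the figures are copied from the summary, the dictionary layout and the name `accounted_seconds` are invented for illustration, and the 589 total is read here as including 20 seconds of read IO and 517 seconds of write IO, which is an inference from the numbers rather than something the slide spells out.

```python
# Seconds per component, copied from the summary; None marks N/A.
summary = {
    "backup validate": {"cpu": 13.00, "io_in": 19.60, "io_out": None,
                        "other": 13.43, "actual": 45.00},
    "backup": {"cpu": 24.00, "io_in": 0.32, "io_out": 517.39,
               "other": 53.67, "actual": 587.00},
    "restore validate": {"cpu": 8.00, "io_in": 422.92, "io_out": None,
                         "other": 46.56, "actual": 480.00},
    "restore": {"cpu": 24.00, "io_in": 447.60, "io_out": 0.03,
                "other": 39.26, "actual": 511.00},
}


def accounted_seconds(row):
    """Sum CPU, IO, and other waits, skipping N/A values and the 'actual' entry."""
    parts = (v for k, v in row.items() if k != "actual" and v is not None)
    return round(sum(parts), 2)


for name, row in summary.items():
    total = accounted_seconds(row)
    diff = total - row["actual"]
    print(f"{name:17s} accounted={total:7.2f} actual={row['actual']:7.2f} diff={diff:+.2f}")

# Backup breakdown from the callout. Assumption: read IO (20, taken from the
# Backup Validate run) and write IO (517, the rounded IO Output) are part of
# the 589 total even though the callout does not list them as separate lines.
backup_breakdown = {
    "validate_cpu": 13,
    "other_cpu": 24 - 13,
    "read_io": 20,
    "write_io": 517,
    "misc_waits": 48 - 20,
}
print("backup breakdown total:", sum(backup_breakdown.values()))
```

Under that reading the breakdown adds up to 589 seconds, close to the 587 seconds actually measured, and each accounted total lands within a few seconds of its actual elapsed time.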
What to look for
• Which phase is taking the longest?
• A read/write bottleneck shows a high % IO Wait
• A copy bottleneck shows a low % IO Wait and high
CPU time on the process (from ps)
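One way to apply these rules of thumb is to compute what share of a run went to IO waits versus CPU. The sketch below is illustrative only: the function names and the 50% thresholds are arbitrary assumptions, not values from this deck, and the second example uses made-up numbers.

```python
def time_shares(cpu_sec, io_wait_sec, other_wait_sec):
    """Return (io_share, cpu_share) as fractions of the accounted time."""
    total = cpu_sec + io_wait_sec + other_wait_sec
    if total <= 0:
        raise ValueError("accounted time must be positive")
    return io_wait_sec / total, cpu_sec / total


def likely_bottleneck(cpu_sec, io_wait_sec, other_wait_sec,
                      io_threshold=0.5, cpu_threshold=0.5):
    """Classify a run; both thresholds are hypothetical cut-offs."""
    io_share, cpu_share = time_shares(cpu_sec, io_wait_sec, other_wait_sec)
    if io_share >= io_threshold:
        return "read/write"   # high % IO wait
    if cpu_share >= cpu_threshold:
        return "copy"         # low % IO wait, high CPU time
    return "inconclusive"


# Backup run from the summary: IO Input 0.32 + IO Output 517.39 seconds.
print(likely_bottleneck(cpu_sec=24.0, io_wait_sec=517.71, other_wait_sec=53.67))

# Made-up numbers for a CPU-heavy run, for contrast.
print(likely_bottleneck(cpu_sec=300.0, io_wait_sec=50.0, other_wait_sec=20.0))
```

With the backup figures from the summary, roughly 87% of the accounted time is IO wait, so the run is classified as read/write bound.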
Compression & Performance
• Compression may increase throughput if the
bottleneck is backup storage IO
• Compression adds overhead in the COPY phase
on both backup and restore
– The key is to find the balance between IO waits
and CPU time
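The balance can be pictured with a toy model: compression shrinks the bytes moved to backup storage (less IO wait) but costs extra CPU in the copy phase. The sketch below is a deliberately simple additive model, not how RMAN schedules work; the function name, the compression ratio, and the extra CPU seconds are hypothetical inputs, and only the baseline of 24 seconds CPU and 517.39 seconds IO comes from the backup example.

```python
def estimated_seconds(cpu_sec, io_sec, ratio=1.0, extra_cpu_sec=0.0):
    """Toy estimate: IO time shrinks by the compression ratio, CPU time grows.

    ratio         -- hypothetical compression ratio (2.0 means half the bytes)
    extra_cpu_sec -- hypothetical CPU seconds spent compressing
    """
    if ratio < 1.0:
        raise ValueError("ratio must be >= 1.0")
    return (cpu_sec + extra_cpu_sec) + io_sec / ratio


# IO-bound baseline taken from the backup example (CPU 24 s, IO output 517.39 s).
slow_plain = estimated_seconds(24.0, 517.39)
slow_compressed = estimated_seconds(24.0, 517.39, ratio=2.0, extra_cpu_sec=100.0)

# Hypothetical fast backup storage where IO takes only 50 s.
fast_plain = estimated_seconds(24.0, 50.0)
fast_compressed = estimated_seconds(24.0, 50.0, ratio=2.0, extra_cpu_sec=100.0)

print(f"slow storage: plain={slow_plain:.1f}s compressed={slow_compressed:.1f}s")
print(f"fast storage: plain={fast_plain:.1f}s compressed={fast_compressed:.1f}s")
```

In this model compression pays off only while the IO time saved exceeds the CPU time added, which is why the same setting can help on slow backup storage and hurt on fast storage.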
Compression Example
• The next slides show the impact of compression
on different backup storage systems
• Basic Compression is included with Oracle
– However, the best things in life are not always free
• Low, Medium and High Compression require
the Advanced Compression Option
Compression Example 1
• +FRA on Exadata