0% found this document useful (0 votes)

56 views28 pages

Crash Recovery: CS 186 Fall 2009 R&G - Chapter 18

1) The recovery manager helps guarantee atomicity and durability by using write-ahead logging (WAL) to record redo and undo information for transactions before data is written to disk. 2) WAL involves writing log records to disk before writing data pages to disk, and forcing all log records for a transaction to disk before committing the transaction. This allows crashed transactions to be rolled back and committed transactions to be redone if needed for recovery. 3) Recovery involves rolling forward from the most recent checkpoint by redoing log records for committed transactions, and rolling back uncommitted transactions by undoing their changes using compensation log records.

Uploaded by

yugalkumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

56 views28 pages

Crash Recovery: CS 186 Fall 2009 R&G - Chapter 18

Uploaded by

yugalkumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 28

Crash Recovery

CS 186 Fall 2009

R&G - Chapter 18

If you are going to be in the logging

business, one of the things that you have
to do is to learn about heavy equipment.
Robert VanNatta,
Logging History of
Columbia County
Review: The ACID properties
• Atomicity: All actions in the Xact happen, or none
happen.
• Consistency: If each Xact is consistent, and the DB

starts consistent, it ends up consistent.

• Isolation: Execution of one Xact is isolated from that
of other Xacts.
• Durability: If a Xact commits, its effects persist.

• Question: which ones does the Recovery Manager

help with?
Atomicity & Durability (and
also used for Consistency-related
rollbacks)
Motivation
• Atomicity:
– Transactions may abort (“Rollback”).
• Durability:
– What if DBMS stops running? (Causes?)

 Desired state after system

restarts: crash!
T1 Commit
– T1 & T3 should be durable. T2 Abort
– T2, T4 & T5 should be T3 Commit
aborted (effects not seen). T4
T5
Assumptions
• Concurrency control is in effect.
– Strict 2PL, in particular.
• Updates are happening “in place”.
– i.e. data is overwritten on (deleted from) the actual page copies
(not private copies).

• Can you think of a simple scheme (requiring no logging) to

guarantee Atomicity & Durability?
– What happens during normal execution (what is the minimum
lock granularity)?
– What happens when a transaction commits?
– What happens when a transaction aborts?
Buffer Management Plays a Key Role
One possible approach – Force/No Steal:
• Force – make sure that every updated page is
written to disk before commit.
– Provides durability without REDO logging.
– But, can cause poor performance.

• No Steal – don’t allow buffer-pool frames with

uncommited updates to overwrite committed data
on disk.
– Useful for ensuring atomicity without UNDO logging.
– But can cause poor performance.
Preferred Policy: Steal/No-Force
• This combination is most complicated but allows for
highest flexibility/performance.
• NO FORCE (complicates enforcing Durability)
– What if system crashes before a modified page written by
a committed transaction makes it to disk?
– Write as little as possible, in a convenient place, at commit
time, to support REDOing modifications.
• STEAL (complicates enforcing Atomicity)
– What if the Xact that performed udpates aborts?
– What if system crashes before Xact is finished?
– Must remember the old value of P (to support UNDOing
the write to page P).
Buffer Management summary

No Steal Steal No Steal Steal

No Force Fastest No Force No UNDO UNDO

REDO REDO

Force No UNDO UNDO

Slowest Force
No REDO No REDO

Performance Logging/Recovery
Implications Implications
Basic Idea: Logging

• Record REDO and UNDO information, for every

update, in a log.
– Sequential writes to log (put it on a separate disk).
– Minimal info (diff) written to log, so multiple updates
fit in a single log page.
• Log: An ordered list of REDO/UNDO actions
– Log record contains:
<XID, pageID, offset, length, old data, new data>
– and additional control info (which we’ll see soon).
Write-Ahead Logging (WAL)
• The Write-Ahead Logging Protocol:
1) Must force the log record for an update before the
corresponding data page gets to disk.
2) Must force all log records for a Xact before commit.
(transaction is not committed until all of its log records
including its “commit” record are on the stable log.)

• #1 (with UNDO info) helps guarantee Atomicity.

• #2 (with REDO info) helps guarantee Durability.
• This allows us to implement Steal/No-Force

• We’ll look at the ARIES algorithms from IBM.

WAL & the Log DB RAM
LSNs pageLSNs flushedLSN

• Each log record has a unique

Log Sequence Number (LSN).
Log records
– LSNs always increasing. flushed to disk
• Each data page contains a pageLSN.
– The LSN of the most recent log record
for an update to that page.
• System keeps track of flushedLSN. flushedLSN
– max LSN flushed to stable log so far.
• WAL (rule 1): For a page “i” to pageLSN i “Log tail”
be written must flush log at in RAM

least to the point where: Pagei

pageLSNi flushedLSN
Log Records
prevLSN is the LSN of the previous
log record of this transaction
(records of an Xact form a linked
LogRecord fields: list backwards in time)
Possible log record types:
LSN
• Update, Commit, Abort
prevLSN
• Checkpoint (for log maintainence)
XID
• Compensation Log Records
type (CLRs)
pageID – for UNDO actions
for length • End (end of commit or abort)
update
offset
records
before-image
only
after-image
Other Log-Related State (in memory)
• Two in-memory tables:
• Transaction Table
One entry per currently active transaction.
• entry removed when Xact commits or aborts
Contains XID (i.e., transactionId), status
(running/committing/aborting) lastLSN (most recent
LSN written by Xact).
• Dirty Page Table
One entry per dirty page currently in buffer pool.
Contains recLSN -- the LSN of the log record that first caused
the page to be dirty.
Normal Execution of an Xact
• Assume:
– Strict 2PL concurrency control
– STEAL, NO-FORCE buffer management, with WAL.
– Disk writes are atomic (i.e., all-or-nothing)
• Transaction is a series of reads & writes, followed by commit
or abort.
– Update TransTable on transaction start/end
– For each update operation:
• create log record with LSN l = ++MaxLSN and
prevLSN = TransTable[XID].lastLSN;
• update TransTable[XID].lastLSN = l
• if modified page NOT in DirtyPageTable,
then add it with recLSN = l
– When buffer manager replaces a dirty page, remove
its entry from the DPT
Transaction Commit
• Write commit record into log.
• Flush all log records up to Xact’s commit
record to log disk.
– WAL Rule #2: Ensure flushedLSN  lastLSN.
• Force log out up to lastLSN if necessary
– Note that log flushes are sequential,
synchronous writes to disk and many log
records per log page.
• so, cheaper than forcing out the updated data and
index pages.
• Commit() returns.
• Write end record to log.
Simple Transaction Abort
• For now, consider an explicit abort of a Xact.
– No crash involved.
• We want to “play back” the log in reverse
order, UNDOing updates.
– Write an Abort log record before starting to
rollback operations.
– Get lastLSN of Xact from Transaction table.
– Can follow chain of log records backward via the
prevLSN field.
– For each update encountered:
• Write a “CLR” (compensation log record) for
each undone operation.
• Undo the operation (using before image from
log record).
Abort, cont.

• To perform UNDO, must have a lock on data!

– No problem (we’re doing Strict 2PL)!
• Before restoring old value of a page, write a CLR:
– You continue logging while you UNDO!!
– CLR has one extra field: undonextLSN
• Points to the next LSN to undo (i.e. the prevLSN of the record
we’re currently undoing).
– CLRs are never Undone (but they might be Redone
when repeating history: guarantees Atomicity!)
• At end of UNDO, write an “end” log record.
Checkpointing
• Conceptually, keep log around for all time. Obviously this has
performance/implemenation problems…
• Periodically, the DBMS creates a checkpoint, in order to minimize
the time taken to recover in the event of a system crash. Write to
log:
– begin_checkpoint record: Indicates when chkpt began.
– end_checkpoint record: Contains current Xact table and dirty page
table. This is a `fuzzy checkpoint’:
• Other Xacts continue to run; so these tables accurate only as of the time of the
begin_checkpoint record.
• No attempt to force dirty pages to disk; effectiveness of checkpoint limited by
oldest unwritten change to a dirty page.
– Store LSN of most recent chkpt record in a safe place (master record).
The Big Picture: What’s Stored Where

LOG RAM
DB
LogRecords
prevLSN Xact Table
XID Data pages lastLSN
type each status
pageID with a
length pageLSN Dirty Page Table
offset recLSN
before-image master record
after-image LSN of flushedLSN
most recent
checkpoint
Crash Recovery: Big Picture
 Start from a checkpoint
Oldest log (found via master record).
rec. of Xact
active at crash
 Three phases. Need to:
1. Analysis - update structures:
Smallest – Trans Table: which Xacts
recLSN in
dirty page were active at time of crash.
table after – Dirty Page Table: which
Analysis
pages might have been dirty
in the buffer pool at time of
Last chkpt
crash.
2. REDO all actions.
CRASH (repeat history)
A R U 3. UNDO effects of failed Xacts.
Recovery: The Analysis Phase
• Re-establish knowledge of state at checkpoint.
– via transaction table and dirty page table stored in the checkpoint
• Scan log forward from checkpoint.
– End record: Remove Xact from Xact table.
– All Other records: Add Xact to Xact table, set lastLSN=LSN, change Xact
status on commit.
– also, for Update records: If page P not in Dirty Page Table, Add P to
DPT, set its recLSN=LSN.
• At end of Analysis…
– transaction table says which xacts were active at time of crash.
– DPT says which dirty pages might not have made it to disk
Phase 2: The REDO Phase
• We repeat History to reconstruct state at crash:
– Reapply all updates (even of aborted Xacts!), redo CLRs.
• Scan forward from log rec containing smallest recLSN in DPT. Q:
why start here?
• For each update log record or CLR with a given LSN, REDO the
action unless:
– Affected page is not in the Dirty Page Table, or
– Affected page is in D.P.T., but has recLSN > LSN, or
– pageLSN (in DB) LSN. (this last case requires I/O)
• To REDO an action:
– Reapply logged action.
– Set pageLSN to LSN. No additional logging, no forcing!
Phase 3: The UNDO Phase

ToUndo={lastLSNs of all Xacts in the Trans Table}

Repeat:
– Choose (and remove) largest LSN among ToUndo.
– If this LSN is a CLR and undonextLSN==NULL
• Write an End record for this Xact.
– If this LSN is a CLR, and undonextLSN != NULL
• Add undonextLSN to ToUndo
– Else this LSN is an update. Undo the update, write a CLR,
add prevLSN to ToUndo.
Until ToUndo is empty.
Example of Recovery – (up to crash)

LSN LOG

RAM 00 begin_checkpoint
05 end_checkpoint
Xact Table 10 update: T1 writes P5
lastLSN 20 update T2 writes P3
status
30 T1 abort
Dirty Page Table
recLSN 40 CLR: Undo T1 LSN 10, UndoNxt=Null
flushedLSN 45 T1 End
50 update: T3 writes P1
ToUndo 60 update: T2 writes P5
CRASH, RESTART
Example (cont.):Analysis & Redo
LSN LOG
00 begin_checkpoint
Xact Table
05 end_checkpoint
Trans lastLSN Stat update: T1 writes P5
10
T1
T2 10
30
40
20
60 ra 20 update T2 writes P3
T2
T3 20
50 r 30 T1 abort
40 CLR: Undo T1 LSN 10, UndoNxt=Null
45 T1 End
Dirty Page Table
50 update: T3 writes P1
PageId recLSN
60 update: T2 writes P5
P5 10 CRASH, RESTART

P3 20
Redo starts at LSN 10;
P1 50 in this case, reads P5, P3,
and P1 from disk, redoes
ops if pageLSN < LSN
Ex (cont.): Undo & Crash During
00 begin_checkpoint,
Restart! 05 end_checkpoint
After Analysis/Redo: 10 update: T1 writes P5;Prvl=null
ToUndo: 50 & 60 20 update T2 writes P3; Prvl = null
ToUndo: 30 T1 abort
50 & 20 40 CLR: Undo T1 LSN 10
ToUndo: 45 T1 End
20 50 update: T3 writes P1; PrvL=null
After Analysis/Redo: 60 update: T2 writes P5; PrvL=20
ToUndo: 70 CRASH, RESTART
ToUndo: 70 CLR: Undo T2 LSN 60; UndoNxtLSN=20
20 80 CLR: Undo T3 LSN 50;UndoNxtLSN=null
85 T3 end
ToUndo:
Finished! CRASH, RESTART
90 CLR: Undo T2 LSN 20;UndoNxtLSN=null
100 T2 end
Additional Crash Issues
• What happens if system crashes during Analysis? During
REDO?

• How to reduce the amount of work in Analysis?

– Take frequent checkpoints.
• How do you limit the amount of work in REDO?
– Frequent checkpoints plus
– Flush data pages to disk asynchronously in the background
(during normal operation and recovery).
• Buffer manager can do this to unpinned, dirty pages.
• How do you limit the amount of work in UNDO?
– Avoid long-running Xacts.
Summary of Logging/Recovery
• Transactions support the ACID properties.
• Recovery Manager guarantees Atomicity &
Durability.
• Use Write Ahead Longing (WAL) to allow
STEAL/NO-FORCE buffer manager without
sacrificing correctness.
• LSNs identify log records; linked into
backwards chains per transaction (via
prevLSN).
• pageLSN allows comparison of data page and
log records.
Summary, Cont.
• Checkpointing: A quick way to limit the
amount of log to scan on recovery.
• Aries recovery works in 3 phases:
– Analysis: Forward from checkpoint. Rebuild
transaction and dirty page tables.
– Redo: Forward from oldest recLSN, repeating
history for all transactions.
– Undo: Backward from end to first LSN of oldest
Xact alive at crash. Rollback all transactions not
completed as of the time of the crash.
• Redo “repeats history”: Simplifies the logic!
• Upon Undo, write CLRs. Nesting structure of
CLRS avoids having to “undo undo operations”.

ARIES Recovery Algorithm
No ratings yet
ARIES Recovery Algorithm
4 pages
Database Recovery Techniques
No ratings yet
Database Recovery Techniques
41 pages
Chapter 4 Database Recovery Techniques
100% (1)
Chapter 4 Database Recovery Techniques
32 pages
DBMS-Module - 5 Updated
No ratings yet
DBMS-Module - 5 Updated
98 pages
Lecture 18
No ratings yet
Lecture 18
49 pages
8 - RecoveryTechniques - Ch19
No ratings yet
8 - RecoveryTechniques - Ch19
83 pages
Aries
No ratings yet
Aries
42 pages
Lecture 21
No ratings yet
Lecture 21
53 pages
Database System Recovery: CSEP 545 Transaction Processing For E-Commerce Philip A. Bernstein
No ratings yet
Database System Recovery: CSEP 545 Transaction Processing For E-Commerce Philip A. Bernstein
45 pages
Chapter 5
No ratings yet
Chapter 5
22 pages
DBMS - Part 2 - Transaction Management
No ratings yet
DBMS - Part 2 - Transaction Management
54 pages
Recovery
No ratings yet
Recovery
35 pages
Crash Recovery: Database Management Systems, 3ed, R. Ramakrishnan and J. Gehrke 1
No ratings yet
Crash Recovery: Database Management Systems, 3ed, R. Ramakrishnan and J. Gehrke 1
26 pages
Chapter 5 Database Recovery Techniques
No ratings yet
Chapter 5 Database Recovery Techniques
30 pages
CST 4305 DBMS L12
No ratings yet
CST 4305 DBMS L12
41 pages
Aries Recovery Algorithm
No ratings yet
Aries Recovery Algorithm
42 pages
Crash Recovery
No ratings yet
Crash Recovery
20 pages
DB CH 3
No ratings yet
DB CH 3
19 pages
Crash Recovery Method: Kathleen Durant CS 3200
No ratings yet
Crash Recovery Method: Kathleen Durant CS 3200
35 pages
Ch4-Crash Recovery
No ratings yet
Ch4-Crash Recovery
38 pages
Adbms CH 1.c
No ratings yet
Adbms CH 1.c
45 pages
Database Recovery
No ratings yet
Database Recovery
38 pages
5 Recovery Techniques Modified
No ratings yet
5 Recovery Techniques Modified
28 pages
Crash Recovery
No ratings yet
Crash Recovery
30 pages
Data Access
No ratings yet
Data Access
18 pages
Concurrency Control and Recovery: Module 6, Lecture 1
No ratings yet
Concurrency Control and Recovery: Module 6, Lecture 1
24 pages
Lecture07 Recovery
No ratings yet
Lecture07 Recovery
27 pages
ch16 Overview Xacts
No ratings yet
ch16 Overview Xacts
18 pages
Chapter 3 - Recovery Techniques
100% (1)
Chapter 3 - Recovery Techniques
22 pages
Lec23 6up
No ratings yet
Lec23 6up
3 pages
17 Recovery
No ratings yet
17 Recovery
14 pages
CSIS 3300 W13 Transactions
No ratings yet
CSIS 3300 W13 Transactions
13 pages
Recovery
No ratings yet
Recovery
26 pages
Dbms Unit 4 Notes.
No ratings yet
Dbms Unit 4 Notes.
21 pages
ADBS Chapter 5
No ratings yet
ADBS Chapter 5
31 pages
Crash Recovery: R&G - Chapter 20
No ratings yet
Crash Recovery: R&G - Chapter 20
28 pages
UNIT 1 Recovery
No ratings yet
UNIT 1 Recovery
22 pages
Chapter 5 - Recovery Techniques
No ratings yet
Chapter 5 - Recovery Techniques
30 pages
ARIES: A Transaction Recovery Method Supporting Fine Granularity Locking and Partial Rollbacks Using Write-Ahead Logging
No ratings yet
ARIES: A Transaction Recovery Method Supporting Fine Granularity Locking and Partial Rollbacks Using Write-Ahead Logging
7 pages
MIT6 830F10 Lec13
No ratings yet
MIT6 830F10 Lec13
4 pages
Chapter 4
No ratings yet
Chapter 4
12 pages
21 Recovery
No ratings yet
21 Recovery
7 pages
CMSC 724: Recovery: Amol Deshpande
No ratings yet
CMSC 724: Recovery: Amol Deshpande
13 pages
Chapter 5 - Recovery Techniques
No ratings yet
Chapter 5 - Recovery Techniques
24 pages
DataBase Recovery Techniques
100% (1)
DataBase Recovery Techniques
37 pages
Database Recovery Techniques
No ratings yet
Database Recovery Techniques
22 pages
Recovery and Atomicity
No ratings yet
Recovery and Atomicity
5 pages
Transaction Managment
No ratings yet
Transaction Managment
14 pages
Crash Recovery: A C I D
No ratings yet
Crash Recovery: A C I D
9 pages
Chap6 Recovery Techniques
No ratings yet
Chap6 Recovery Techniques
35 pages
Database Systems: Recovery Control
No ratings yet
Database Systems: Recovery Control
25 pages
Recovery
No ratings yet
Recovery
4 pages
IV Sem CSE DBMS Module 4 (Transaction Processing)
No ratings yet
IV Sem CSE DBMS Module 4 (Transaction Processing)
14 pages
14 Recovery
No ratings yet
14 Recovery
4 pages
Crash Recovery
No ratings yet
Crash Recovery
5 pages
Cheat Sheet Dbms
No ratings yet
Cheat Sheet Dbms
1 page
Chapter One: 1. Basic Concepts, Methods of Data Collection and Presentation
No ratings yet
Chapter One: 1. Basic Concepts, Methods of Data Collection and Presentation
111 pages
Aspeninfoplus21db2006 5 Usr
100% (3)
Aspeninfoplus21db2006 5 Usr
66 pages
Pre-Quiz - Attempt Review SS
No ratings yet
Pre-Quiz - Attempt Review SS
3 pages
BBA - Business Statistics 201
No ratings yet
BBA - Business Statistics 201
176 pages
Retail Store Operation
100% (1)
Retail Store Operation
43 pages
PDF Af La Sirenita Colorear - Compress
0% (2)
PDF Af La Sirenita Colorear - Compress
10 pages
Group 3 Final Research
No ratings yet
Group 3 Final Research
48 pages
Intro To MongoDB
100% (1)
Intro To MongoDB
13 pages
IGNITE Event Brochure 2025
No ratings yet
IGNITE Event Brochure 2025
72 pages
Dbms Lab Material1
No ratings yet
Dbms Lab Material1
47 pages
LA - Android - Unit I ONE
100% (1)
LA - Android - Unit I ONE
27 pages
Sap Hana:: OLTP: Simple Queries Like INSERT, UPDATE, DELETE Etc
No ratings yet
Sap Hana:: OLTP: Simple Queries Like INSERT, UPDATE, DELETE Etc
6 pages
Exercise 1.
No ratings yet
Exercise 1.
2 pages
CS3492 Syllabus
No ratings yet
CS3492 Syllabus
2 pages
How Do We Ensure This?
No ratings yet
How Do We Ensure This?
174 pages
Lesson 4-Mathematics Curriculum - 1
No ratings yet
Lesson 4-Mathematics Curriculum - 1
10 pages
Orient DB
No ratings yet
Orient DB
23 pages
Collected Papers On The PITA Project
No ratings yet
Collected Papers On The PITA Project
43 pages
User Interface
No ratings yet
User Interface
29 pages
Arpan Karki L2C2
No ratings yet
Arpan Karki L2C2
56 pages
Quick View Schema
No ratings yet
Quick View Schema
8 pages
Lexa Official Academic Resume 1
No ratings yet
Lexa Official Academic Resume 1
1 page
Data Analytics Curriculum
No ratings yet
Data Analytics Curriculum
13 pages
Introduction To Data Analysis and Decision Making
No ratings yet
Introduction To Data Analysis and Decision Making
11 pages
Teaching Plan Statistics For Decision Science
No ratings yet
Teaching Plan Statistics For Decision Science
4 pages
Logistic Regression
No ratings yet
Logistic Regression
45 pages
Python Imp
No ratings yet
Python Imp
29 pages
Synopsis On mARKETING sTRATEGY OF vOLtas
No ratings yet
Synopsis On mARKETING sTRATEGY OF vOLtas
9 pages
Module II - Introduction To Android Activities and Layouts
No ratings yet
Module II - Introduction To Android Activities and Layouts
8 pages
lecture1&2-đã chuyển đổi
No ratings yet
lecture1&2-đã chuyển đổi
46 pages
Book Chapter - Vaisman and Zimányi, 2014 - Database Concepts
No ratings yet
Book Chapter - Vaisman and Zimányi, 2014 - Database Concepts
40 pages
3.1-Database Entities PDF
No ratings yet
3.1-Database Entities PDF
15 pages
Code and Attribute Table
No ratings yet
Code and Attribute Table
5 pages
Inspire DQ&MD Krakow 2010-06-22 Minutes
No ratings yet
Inspire DQ&MD Krakow 2010-06-22 Minutes
47 pages
Networking & TCP IP
No ratings yet
Networking & TCP IP
2 pages
Assignment1 Solutions
No ratings yet
Assignment1 Solutions
15 pages
The Mac Terminal Reference and Scripting Primer
From Everand
The Mac Terminal Reference and Scripting Primer
Jay Docherty
4.5/5 (3)
Linux Services Deployment
From Everand
Linux Services Deployment
Fabian Mestre
No ratings yet
DRBD-Cookbook: How to create your own cluster solution, without SAN or NAS!
From Everand
DRBD-Cookbook: How to create your own cluster solution, without SAN or NAS!
Joerg Christian Seubert
No ratings yet
Easy Linux For Beginners
From Everand
Easy Linux For Beginners
Felix Cannon
2/5 (1)

Crash Recovery: CS 186 Fall 2009 R&G - Chapter 18

Uploaded by

Crash Recovery: CS 186 Fall 2009 R&G - Chapter 18

Uploaded by

Crash Recovery

CS 186 Fall 2009

If you are going to be in the logging

starts consistent, it ends up consistent.

• Question: which ones does the Recovery Manager

 Desired state after system

• Can you think of a simple scheme (requiring no logging) to

• No Steal – don’t allow buffer-pool frames with

No Steal Steal No Steal Steal

No Force Fastest No Force No UNDO UNDO

Force No UNDO UNDO

• Record REDO and UNDO information, for every

• #1 (with UNDO info) helps guarantee Atomicity.

• We’ll look at the ARIES algorithms from IBM.

• Each log record has a unique

least to the point where: Pagei

• To perform UNDO, must have a lock on data!

ToUndo={lastLSNs of all Xacts in the Trans Table}

• How to reduce the amount of work in Analysis?

You might also like