Crash Recovery
9.1 Introduction
A major responsibility of the database administrator is to prepare for the possibility of hardware,
software, network, process, or system failure. If such a failure affects the operation of a database
system, we must usually recover the database and return to normal operation as quickly as
possible. Recovery should protect the database and associated users from unnecessary problems
and avoid or reduce the possibility of having to duplicate work manually. Recovery processes
vary depending on the type of failure that occurred, the structures affected, and the type of
recovery performed.
Failure Classification
There are various types of failure that may occur in a system, each of which needs to be dealt
with in a different manner. In this chapter, we shall consider only the following types of failure:
Transaction failure: A transaction fails when it cannot execute, or when it reaches a point after which it cannot complete successfully and must abort. This is called transaction failure, and it typically affects only a few transactions or processes. Reasons for transaction failure include:
o Logical errors: the transaction cannot complete because of a code error or some internal error condition.
o System errors: the database system itself terminates an active transaction because the DBMS is unable to execute it, or because of some system condition. For example, in case of deadlock or resource unavailability, the system aborts an active transaction.
System crash: Problems external to the system may cause it to stop abruptly and crash, for example an interruption in the power supply or a failure of the underlying hardware or software. Examples include operating system errors.
Disk failure: In the early days of technology evolution, it was common for hard disk drives or other storage drives to fail frequently. Disk failures include the formation of bad sectors, unreachability of the disk, a disk head crash, or any other failure that destroys all or part of disk storage. Copies of the data on other disks, or archival backups on tertiary media such as DVDs or tapes, are used to recover from such failures.
There are two types of technique that help the DBMS recover while maintaining the atomicity of transactions:
Maintaining a log of each transaction, and writing the log records to stable storage before actually modifying the database.
Shadow paging, where the changes are made on copies in volatile memory and the actual database is updated later.
DBMS-Compiled by Yagya Raj Pandeya, NAST, Dhandadhi ©[email protected] Page 1
A. Log-based recovery
The log is a sequence of log records, recording all the update activities in the database. There
are several types of log records. An update log record describes a single database write. It
has these fields:
Transaction identifier, which is the unique identifier of the transaction that performed
the write operation.
Data-item identifier, which is the unique identifier of the data item written. Typically,
it is the location on disk of the data item, consisting of the block identifier of the block
on which the data item resides, and an offset within the block.
Old value, which is the value of the data item prior to the write.
New value, which is the value that the data item will have after the write.
Whenever a transaction performs a write, it is essential that the log record for that write be
created and added to the log, before the database is modified. Once a log record exists, we
can output the modification to the database if that is desirable. Also, we have the ability to
undo a modification that has already been output to the database. We undo it by using the
old-value field in log records. For log records to be useful for recovery from system and disk
failures, the log must reside in stable storage.
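The write-ahead rule above can be illustrated with a minimal Python sketch. The record fields follow the four fields listed in the text; the `SimpleDB` class and its in-memory "log" are invented for this illustration and stand in for a real log on stable storage:

```python
from dataclasses import dataclass
from typing import Any, List

@dataclass
class UpdateLogRecord:
    txn_id: str        # transaction identifier
    item_id: str       # data-item identifier (e.g. block id + offset)
    old_value: Any     # value before the write (used for undo)
    new_value: Any     # value after the write (used for redo)

class SimpleDB:
    def __init__(self):
        self.log: List[UpdateLogRecord] = []  # stands in for stable storage
        self.data = {}

    def write(self, txn_id, item_id, new_value):
        # Write-ahead rule: append the log record BEFORE modifying the data.
        old = self.data.get(item_id)
        self.log.append(UpdateLogRecord(txn_id, item_id, old, new_value))
        self.data[item_id] = new_value

db = SimpleDB()
db.write("T1", "A", 100)
db.write("T1", "A", 150)
print(db.log[-1].old_value, db.log[-1].new_value)  # 100 150
```

Because the record carries both the old and the new value, the same log entry supports undoing the write (restore `old_value`) and redoing it (reapply `new_value`).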
Checkpoints
Keeping and maintaining logs in real time and in real environment may fill out all the
memory space available in the system. At time passes log file may be too big to be handled at
all. Checkpoint is a mechanism where all the previous logs are removed from the system and
stored permanently in storage disk. Checkpoint declares a point before which the DBMS was
in consistent state and all the transactions were committed. There are two major difficulties
with this approach:
1. The search process is time-consuming.
2. Most of the transactions that, according to our algorithm, need to be redone have
already written their updates into the database. Although redoing them will cause no
harm, it will nevertheless cause recovery to take longer.
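As a toy illustration of how a checkpoint shortens recovery, the sketch below (the log layout and record names are invented for this example) shows a checkpoint marker bounding the backward scan, so only records written after it need to be examined:

```python
# A simplified log: each record is a tuple; ("checkpoint",) marks the
# point before which all committed work is already safely on disk.
log = [
    ("T1", "start"), ("T1", "A", 0, 10), ("T1", "commit"),
    ("checkpoint",),                     # everything earlier is on disk
    ("T2", "start"), ("T2", "B", 5, 7),
]

# Recovery scans backwards only as far as the most recent checkpoint.
last_cp = max(i for i, rec in enumerate(log) if rec[0] == "checkpoint")
records_to_examine = log[last_cp + 1:]
print(len(records_to_examine))  # 2 (instead of all 6 records)
```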
Recovery
When a system with concurrent transactions crashes and recovers, it behaves in the following manner:
The recovery system reads the logs backwards from the end to the last checkpoint.
It maintains two lists, an undo-list and a redo-list.
If the recovery system sees a log with <Tn, Start> and <Tn, Commit>, or just <Tn, Commit>, it puts the transaction in the redo-list.
If the recovery system sees a log with <Tn, Start> but no commit or abort record, it puts the transaction in the undo-list.
All transactions in the undo-list are then undone and their log records removed. All transactions in the redo-list are redone and their log records saved.
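The classification step above can be sketched in a few lines of Python. The log records here are simple tuples invented for the example; a commit record with no matching start record (because the start lies before the checkpoint) still lands in the redo-list, as the text requires:

```python
# Log records seen while scanning back to the last checkpoint.
log = [
    ("T1", "start"), ("T1", "commit"),   # started and committed -> redo
    ("T2", "start"),                     # no commit/abort seen  -> undo
    ("T3", "commit"),                    # start was before the checkpoint
]

started   = {rec[0] for rec in log if rec[1] == "start"}
committed = {rec[0] for rec in log if rec[1] == "commit"}
aborted   = {rec[0] for rec in log if rec[1] == "abort"}

redo_list = committed                    # any committed transaction
undo_list = started - committed - aborted
print(sorted(redo_list), sorted(undo_list))  # ['T1', 'T3'] ['T2']
```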
B. Shadow-paging
Shadow paging is a technique for providing atomicity and durability (two of the ACID
properties) in database systems. A page in this context refers to a unit of physical storage
(probably on a hard disk).
Disadvantages:
Commit overhead is high (many pages need to be flushed).
Data gets fragmented (related pages get separated).
After every transaction completes, the database pages containing old versions of modified data must be garbage-collected and returned to the list of unused pages.
It is hard to extend the algorithm to allow transactions to run concurrently.
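A minimal sketch of the shadow-paging idea (the page contents and table layout are illustrative, not a real on-disk format): updates go to fresh page copies referenced by a current page table, and commit consists of atomically installing that table as the new shadow table. If the system crashes before the swap, the old shadow table still points at the unmodified pages:

```python
pages = {0: "old-A", 1: "old-B"}         # physical pages on "disk"
shadow_table = {0: 0, 1: 1}              # page table saved before the txn
current_table = dict(shadow_table)       # working copy for the transaction

def txn_write(logical_page, value):
    new_phys = max(pages) + 1            # copy-on-write to a fresh page
    pages[new_phys] = value
    current_table[logical_page] = new_phys

txn_write(0, "new-A")
# Pre-commit: the shadow table still reaches only the old data.
assert pages[shadow_table[0]] == "old-A"

# Commit: install current_table as the shadow table (one atomic step).
shadow_table = current_table
print(pages[shadow_table[0]], pages[shadow_table[1]])  # new-A old-B
```

Note how the old copy of page 0 survives as garbage after commit, which is exactly the garbage-collection cost listed in the disadvantages above.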
Backups are needed in case a file or a group of files is lost. Reasons for losing files include hardware failure such as a disk breaking, accidentally deleting the wrong file, and the computer being stolen. Backups help in all these situations. In addition, it can be useful to have access to older versions of files, for example a configuration file that worked a week ago.
Recovery: The database is restored to the most recent consistent state just before the time of failure. Usually the system log (also called the trail or journal) keeps information about the changes that were applied to the data items by the various transactions. The main recovery techniques are of two types:
A. Deferred update technique
This technique does not physically update the database on disk until a transaction reaches its commit point. Before the commit point, all transaction updates are recorded in the local transaction workspace (or buffers). During commit, the updates are first recorded persistently in the log and then written to the database. If a transaction fails before reaching its commit point, no UNDO is needed because it will not have changed the database anyway. If there is a crash, it may be necessary to REDO the effects of committed transactions from the log, because their effects may not yet have been recorded in the database. Deferred update is also known as the NO-UNDO/REDO algorithm.
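The NO-UNDO/REDO behavior can be sketched as follows (the class and buffer structure are invented for this example): writes accumulate in a local workspace, the log is forced first at commit, and only then is the database touched, so an aborted transaction needs no undo at all:

```python
database = {"A": 1, "B": 2}
log = []

class DeferredTxn:
    def __init__(self, txn_id):
        self.txn_id = txn_id
        self.buffer = {}                 # local workspace, not the DB

    def write(self, item, value):
        self.buffer[item] = value        # database untouched before commit

    def commit(self):
        for item, value in self.buffer.items():
            log.append((self.txn_id, item, value))  # force the log first
        log.append((self.txn_id, "commit"))
        database.update(self.buffer)     # then apply to the database

t = DeferredTxn("T1")
t.write("A", 99)
assert database["A"] == 1               # no change before commit: no UNDO
t.commit()
print(database["A"])  # 99
```

If a crash occurred between forcing the log and updating the database, recovery would simply REDO the committed writes from the log.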
We can achieve high availability by performing transaction processing at one site, called the
primary site, and having a remote backup site where all the data from the primary site are
replicated. The remote backup site is sometimes also called the secondary site. The remote site
must be kept synchronized with the primary site, as updates are performed at the primary. We
achieve synchronization by sending all log records from primary site to the remote backup site.
The remote backup site must be physically separated from the primary (for example, we can locate it in a different state) so that a disaster at the primary does not damage the remote backup site. The figure shows the architecture of a remote backup system.
When the primary site fails, the remote backup site takes over processing. First, however, it
performs recovery, using its (perhaps outdated) copy of the data from the primary, and the log
records received from the primary. In effect, the remote backup site is performing recovery
actions that would have been performed at the primary site when the latter recovered. Standard
recovery algorithms, with minor modifications, can be used for recovery at the remote backup
site. Once recovery has been performed, the remote backup site starts processing transactions.
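The log-shipping scheme described above can be sketched in miniature (the functions and in-memory "sites" are illustrative only): the primary appends each log record locally and forwards a copy to the backup, which replays the shipped records when it takes over:

```python
primary_log, backup_log = [], []
backup_db = {}

def primary_write(txn, item, value):
    rec = (txn, item, value)
    primary_log.append(rec)
    backup_log.append(rec)               # ship the record to the backup

def backup_takeover():
    # On failover, the backup redoes records from its copy of the log,
    # bringing its (possibly outdated) data up to the primary's state.
    for txn, item, value in backup_log:
        backup_db[item] = value

primary_write("T1", "A", 10)
backup_takeover()
print(backup_db["A"])  # 10
```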