0% found this document useful (0 votes)
7 views37 pages

Unit Two Database Back Up

This document outlines various methods and types of database backups, including logical and physical backups, as well as full, incremental, differential, mirror, and continuous data protection backups. It discusses the advantages and disadvantages of onsite, offsite, and hybrid storage solutions, along with RAID configurations for data redundancy and performance. The course aims to equip learners with essential skills for effective backup and recovery strategies.

Uploaded by

rob69do
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
7 views37 pages

Unit Two Database Back Up

This document outlines various methods and types of database backups, including logical and physical backups, as well as full, incremental, differential, mirror, and continuous data protection backups. It discusses the advantages and disadvantages of onsite, offsite, and hybrid storage solutions, along with RAID configurations for data redundancy and performance. The course aims to equip learners with essential skills for effective backup and recovery strategies.

Uploaded by

rob69do
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 37

Unit Two: Database Backup Introduction to Backup

Methods
Course Outline

Lesson 1. Introduction to Backup

Lesson 2. • Methods for back-


up and recovery

Lesson 3. back up types

Lesson 4. • RAID

Lesson 5• Hybrid storage

2
Methods
for back-up
and recovery
We will cover these skills:

1. Types of Backup

3
Introduction to Backup

 Backup is the process of  Restore is the process of


creating a copy of data to retrieving data from a backup.
protect against accidental  This might mean copying data
or malicious deletion,
from backup media to an
corruption, hardware
existing device or to a new
failure, ransom ware
device.
attacks, and other types
of data loss.  It also could mean copying

data from the cloud to a local
Data backups can be
device or from one cloud to
created locally, offsite, or
another.
both.

4
The most common type of database
backups
 Logical Backup_  Physical Backup
 Definition: Captures data at the logical  Definition: Captures the entire database
level, such as tables, rows, and columns. at the storage level, including data files,
 Format: Typically stored in formats like log files, and configuration settings.
SQL scripts or CSV files.  Format: Stored as binary files or
 Content: Includes data and schema snapshots of the storage used by the
definitions (e.g., table structures, database.
constraints).  Content: Includes everything needed to
 Use Case: Ideal for migrating data restore the database to a specific state,
between different systems, performing including all data and metadata.
upgrades, or selectively restoring parts of  Use Case: Suitable for full system
the database. recovery, disaster recovery, or when a
 Restoration: Data is restored by executing quick restoration is needed.
the logical scripts, which can be slower  Restoration: Typically faster since it
and may require re-creation of indexes involves copying files directly back to
and constraints. their original locations.
5
Full Backup

 Description: A complete copy of all the data in a database or system


at a specific point in time.
 Advantages: Simplifies restoration since all data is in one place.
 Disadvantages: Time-consuming and requires significant storage
space.

6
Incremental Backup

 Description: Backs up only the data that has changed since the last
backup (either full or incremental).
 Advantages: Faster and requires less storage compared to full
backups.
 Disadvantages: Restoration can be complex, as it requires the last full
backup and all subsequent incremental backups.

7
Differential Backup

 Description: Backs up all data that has changed since the last full
backup.
 Advantages: Easier to restore than incremental backups, as it only
requires the last full backup and the latest differential backup.
 Disadvantages: Takes longer and uses more storage than incremental
backups as time progresses.

8
Mirror Backup

 Description: Creates an exact copy of the source data, including all


files and folders.
 Advantages: Provides a real-time update of the backup, ensuring
quick access to the latest data.
 Disadvantages: If data is deleted or corrupted in the source, it is also
deleted or corrupted in the backup.

9
Continuous Data Protection (CDP)

 Description: Continuously captures changes to data in real time,


allowing for point-in-time recovery.
 Advantages: Provides the most up-to-date backup, minimizing data
loss.
 Disadvantages: Requires significant storage and can be resource-
intensive.
 Cloud Backup
 Description: Backs up data to a cloud storage service.
 Advantages: Provides offsite storage, which is useful for disaster
recovery.
 Disadvantages: Dependent on internet access and may incur ongoing
costs. 10
Local Backup

 Description: Data is backed up to local storage devices, such as


external hard drives or network-attached storage (NAS).
 Advantages: Faster access and recovery times.
 Disadvantages: Vulnerable to local disasters (e.g., fire, flood).
 Bare-Metal Backup
 Description: A complete backup of an entire system, including the
operating system, applications, and data.
 Advantages: Allows for complete system restoration on new
hardware.
 Disadvantages: Requires more storage and can take longer to create.

11
Off-line back-ups

 Off-line backup is also called cold backup or static backup.


 It is a database backup during which the database is offline and not
accessible to update or the database operations are entirely stopped,
and then the backup is performed.
 While the backup is in progress, no business operations can be
performed. This requires the data access to be completely shut off
from the front-end and neither online users nor background processes
can access the database for the duration of the backup.

12
On-line file back-ups
 It is also called a hot backup or dynamic backup.
 A hot backup is a backup performed while the database is open and
available for use (read and write activity). It is performed in near
real-time when the systems are up and running, and new data is
continuously generated or captured.

13
Off-site back-up files

 Offsite backup is the replication of the data to a server which is


separated geographically from a production systems site.
 Offsite data backup may also be done via direct access, over Wide
Area Network (WAN).
 An offsite backup is a backup process or facility that stores backup
data or applications external to the organization or core IT
environment.
 It is similar to a standard backup process, but uses a facility or
storage media that is not physically located within the organization’s
core infrastructure.

14
Advantages and disadvantages
Offsite Storage

 Advantages
 Scalability:. The cloud provider or managed service provider is responsible
for offering storage that can be expanded on demand.
 Cost & Value: Offsite and cloud storage is extremely affordable; in most
circumstances, you only pay for what you use. There is no initial outlay for
expensive storage platforms and no additional maintenance or support
contracts.
 Fast Deployment: With offsite cloud storage, the infrastructure is already
in place, and the platform is already available to use.
 There is no waiting for purchasing and installation of hardware and the
client can simply plug into the cloud storage and get started immediately.
15
  Managed Storage Service:
  Connectivity:
  Performance:

16
• Disadvantages

 Some disadvantages of offsite storage include;


  It can be difficult to access the data when it is needed. For example, some
of the offsite data servers will be routinely taken offline in order to perform
preventive maintenance. During that time period, access to the offsite data will
either be severely limited or completely cut off. In most cases, clients who use
offsite storage are given plenty of notice when a site is going down for
maintenance or repair.
  Security and Privacy: One of the major concerns with offsite cloud storage
is the security and integrity of data. Data protection and privacy are extremely
important for business organizations. If you choose to move data to an offsite
storage provider, consideration must be given to compliance as well as the
security measures in place to protect the data. Offsite storage must be protected
from unauthorized access and should always be encrypted.
  Compliance and Data Governance: there are complex compliance and
governance rules which must be adhered to when securing or destroying data.
17
  Lifetime Costs: Offsite storage is often affordable; however,
there may be a risk of vendor lock-in when an organization is reliant
on the provider’s organization. This may result in a lifetime of
monthly charges for data usage, so it’s important to weigh up if the
overall costs and benefits outweigh implementing local storage.
 Speed: Although cloud/offsite performance is generally very good,
some data-intense applications may perform better using local
storage. Even if dedicated express routes are used, latency and
network bottlenecks may impact performance.
 Noisy Neighbours: When choosing offsite storage, it’s important to
understand whether you will be leveraging dedicated offsite storage
or if you will be using shared storage.

18
Onsite Backup

 In onsite storage, data and storage hardware are geographically


located internally to your business or organization.
 You may have a computer room or data center onsite where the
storage arrays are securely located.
 All your internal systems will have direct access to the storage within
the same building or organization, usually over an internal Local Area
Network (LAN).

19
Advantages of onsite storage:

  Immediate access to data


  Less expensive
  Internet access not needed
  Control of your own data security
  Performance improvement

20
Hybrid storage

 Hybrid storage is a mix of offsite and onsite storage approaches.


 A typical hybrid setup could be a local storage array at a head office
which replicates data to an external cloud-based location; such a
setup could be for backups or data integrity reasons.

21
Disk mirroring

 Disk mirroring, also known as RAID 1, is the replication of data to two


or more disks.
 Disk mirroring is a strong option for data that needs high availability
because of its quick recovery time.

22
RAID

 RAID refers to redundancy array of the independent disk. It is a


technology which is used to connect multiple secondary storage
devices for increased performance, data redundancy or both. It gives
the ability to survive one or more drive failure depending upon the
RAID level used.
 It consists of an array of disks in which multiple disks are connected
to achieve different goals.

23
There are 7 levels of RAID schemes.

 These schemas are as RAID 0, RAID 1, ...., RAID 6.


 These levels contain the following characteristics:
 - It contains a set of physical disk drives.
 - The operating system views these separate disks as a single
logical disk.

 - In this technology, data is distributed across the physical


drives of the array.
 - Redundancy disk capacity is used to store parity information.
 - In case of disk failure, the parity information can be helped to
recover the data
24
RAID 0

 RAID level 0 provides data stripping, i.e., a data can place across
multiple disks. It is based on stripping that means if one disk fails then
all data in the array is lost. This level doesn't provide fault tolerance but
increases the system performance.
 Pros of RAID 0:
 - In this level, throughput is increased because multiple data
requests probably not on the same disk.
 - This level full utilizes the disk space and provides high
performance.
 - It requires minimum 2 drives.
 Cons of RAID 0:
 - It doesn't contain any error detection mechanism.
 - The RAID 0 is not a true RAID because it is not fault-tolerance.
 - In this level, failure of either disk results in complete data loss in
respective array.
25
RAID 1

 This level is called mirroring of data as it copies the data from drive 1
to drive 2. It provides 100% redundancy in case of a failure. Only half
space of the drive is used to store the data. The other half of drive is
just a mirror to the already stored data.
 Pros of RAID 1:
 - The main advantage of RAID 1 is fault tolerance. In this level,
if one disk fails, then the other automatically takes over.
 - In this level, the array will function even if any one of the
drives fails.
 Cons of RAID 1:
 - In this level, one extra drive is required per drive for
mirroring, so the expense is higher.
26
RAID 2
 RAID 2 consists of bit-level striping using hamming code parity. In this
level, each data bit in a word is recorded on a separate disk and ECC
code of data words is stored on different set disks. Due to its high
cost and complex structure, this level is not commercially used. The
same performance can be achieved by RAID 3 at a lower cost.
 Pros of RAID 2:
 - This level uses one designated drive to store parity.
 - It uses the hamming code for error detection.
 Cons of RAID 2:
 - It requires an additional drive for error detection.

27
RAID 3

 RAID 3 consists of byte-level striping with dedicated parity. In this level,


the parity information is stored for each disk section and written to a
dedicated parity drive. In case of drive failure, the parity drive is
accessed, and data is reconstructed from the remaining devices. Once
the failed drive is replaced, the missing data can be restored on the new
drive. In this level, data can be transferred in bulk. Thus high-speed data
transmission is possible.
 Pros of RAID 3:
 - In this level, data is regenerated using parity drive.
 - It contains high data transfer rates.
 - In this level, data is accessed in parallel.
 Cons of RAID 3:
 - It required an additional drive for parity.
 - It gives a slow performance for operating on small sized files.
28
RAID 4

 RAID 4 consists of block-level stripping with a parity disk. Instead of


duplicating data, the RAID 4 adopts a parity-based approach. This
level allows recovery of at most 1 disk failure due to the way parity
works. In this level, if more than one disk fails, then there is no way to
recover the data. Level 3 and level 4 both are required at least three
disks to implement RAID.
 In this level, parity can be calculated using an XOR function. If the
data bits are 0,0,0,1 then the parity bits is XOR (0,1,0,0) = 1. If the
parity bits are 0,0,1,1 then the parity bit is XOR (0,0,1,1)= 0. That
means, even number of one results in parity 0 and an odd number of
one results in parity 1.
 This level allows us to recover lost data.

29
2.7.6. RAID 5

 RAID 5 is a slight modification of the RAID 4 system. The only


difference is that in RAID 5, the parity rotates among the drives. It
consists of block-level striping with DISTRIBUTED parity. Same as
RAID 4, this level allows recovery of at most 1 disk failure. If more
than one disk fails, then there is no way for data recovery. This level
was introduced to make the random write performance better.
 Pros of RAID 5:
 - This level is cost effective and provides high performance.
 - In this level, parity is distributed across the disks in an array.
 - It is used to make the random write performance better.
 Cons of RAID 5:
 - In this level, disk failure recovery takes longer time as parity
has to be calculated from all available drives.
 - This level cannot survive in concurrent drive failure.
30
RAID 6
 This level is an extension of RAID 5.
 It contains block-level stripping with 2 parity bits.
 In RAID 6, you can survive 2 concurrent disk failures. Suppose you are
using RAID 5, and RAID 1. When your disks fail, you need to replace the
failed disk because if simultaneously another disk fails then you won't be
able to recover any of the data, so in this case RAID 6 plays its part where
you can survive two concurrent disk failures before you run out of options.
 Pros of RAID 6:
 - This level performs RAID 0 to strip data and RAID 1 to mirror. In
this level, stripping is performed before mirroring.
 - In this level, drives required should be multiple of 2.
 Cons of RAID 6:
 - It is not utilized 100% disk capability as half is used for mirroring.
 - It contains very limited scalability.
31
2.2.2. Determining appropriate
methods
 To determine the type of backup strategy to be used there are different
determinant factors such as overall business cost, performance, data
protection levels, total amount of data retained and availability goals.
 Do the right or appropriate backup for your organization.
 For organizations with small data sets, running a daily full backup
provides a high level of protection without much additional storage
space costs.
 Larger organizations or those with more data or server volume find that
running a weekly full backup, coupled with either daily incremental
backups or differential backups, provides a better option.
 Using differentials provides a higher level of data protection with less
restore time for most scenarios and a small increase in storage capacity.
 For this reason, using a strategy of weekly full backups with daily
differential backups is a good option for many organizations.
32
Summary
A comparison of different types of backup

Backup Full Mirror Incremental Differential


Backup 1 All data All data Selected - -
Backup 2 All data All data Selected Changes from backup 1 Changes from backup 1

Backup 3 All data All data Selected Changes from backup 2 Changes from backup 1

Backup 4 All data All data Selected Changes from backup 3 Changes from backup 1

33
 From these types of backup, it is possible to develop an approach for
comprehensive data protection. An organization often uses one of the
following backup settings:
 • Full daily
 • Full weekly + differential daily
 • Full weekly + incremental daily

34
Self-check 2
Part-I multiple choice
1. Backing up the data to a server which is geographically separated from a production systems.
A. Online B. Offline C. Offsite D. Onsite
2. A mix of offsite and onsite storage approaches for the benefit of completeness.
A. Dynamic B. Hybrid C. Cloud D. Cold
3. One is not the advantages of onsite backup
A. May be destroyed during a catastrophic event C. Control of your own data security
B. Immediate access to data D. Less expensive
4. Which RAID level is called a mirroring of data?
A.RAID 0 B. RAID 1 C. RAID 2 D. RAID 7 E. All level
5. How many disk drives are required to implement disk mirroring?
A. At least two physical drives.
B. At least one disk drive and two logical drives
C. At least two logical drives
D. None
6. Which one of the following statement is incorrect about RAID?
B. It contains a set of physical disk drives.
C. The operating system views these separate disks as a single logical disk.
D. Data is distributed across the physical drives of the array.
E. None
35
 Part-II Give short answer
1. List and explain Types of backup?
2. Write the difference between hot and cold backup?
3. List the Advantages and Dis advantages of Full backup

36
The end of chapter Two !!!!!

37

You might also like