Unit 5 DMS
Backup Strategies
1. Local Backup:
o Backups are stored on the same server or nearby local storage devices.
o Pros: Fast access for restoring data.
o Cons: Vulnerable to local disasters (e.g., fire, theft).
2. Remote Backup:
o Backups are stored on external servers or cloud storage.
o Pros: Provides protection against local failures and disasters.
o Cons: Slower recovery times and a dependence on network connectivity.
3. Cloud Backup:
o Backups are stored in cloud services like AWS, Google Cloud, or Azure.
o Pros: Off-site storage, automated backups, and scalability.
o Cons: Reliant on internet access and cloud service provider reliability.
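To make the remote and cloud options concrete, below is a minimal sketch of a cloud backup step: it uploads an existing local database dump to Amazon S3 using boto3. The bucket name, key prefix, and dump path are illustrative placeholders, and AWS credentials are assumed to be configured in the environment.
```python
# Cloud backup sketch: upload a local database dump to Amazon S3 with boto3.
# The bucket name, dump path, and key prefix are illustrative placeholders.
from datetime import datetime, timezone

import boto3

def backup_to_s3(dump_path: str, bucket: str = "db-backups") -> str:
    """Upload a dump file to S3 under a timestamped key and return the key."""
    key = f"backups/{datetime.now(timezone.utc):%Y-%m-%dT%H-%M-%S}.dump"
    s3 = boto3.client("s3")  # credentials come from the AWS environment
    s3.upload_file(dump_path, bucket, key)
    return key

if __name__ == "__main__":
    print("uploaded as", backup_to_s3("/var/backups/mydb.dump"))
```
Timestamped keys keep every backup individually retrievable, which pairs naturally with a retention policy on the bucket.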
Data Warehouse:
Definition: A Data Warehouse is a centralized repository used for storing and analyzing large volumes of
structured data from multiple sources. It’s specifically designed for business intelligence (BI) tasks like reporting,
querying, and data analysis.
Characteristics:
o Structured Data: Primarily stores structured data (e.g., relational data from operational systems).
o OLAP (Online Analytical Processing): Optimized for complex queries and analytical workloads rather
than transactional processing.
o Data Integration: Integrates data from various sources like transactional databases, external data
sources, and other systems.
o ETL Process: Data is extracted, transformed, and loaded (ETL) into the warehouse for analysis (a
miniature sketch follows this section).
Use Cases:
o Business intelligence (BI) reporting
o Trend analysis and decision-making support
o Historical data analysis
Examples:
o Amazon Redshift, Google BigQuery, Microsoft Azure Synapse Analytics.
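As a miniature illustration of the ETL process mentioned above, the following sketch uses SQLite (standing in for both the operational source and the warehouse) to extract raw rows, transform them into a daily aggregate, and load the result. All table and column names are invented for the example.
```python
# Minimal ETL sketch using only the standard library: extract rows from an
# operational SQLite database, transform them into a daily aggregate, and
# load the result into a warehouse table. Names are illustrative.
import sqlite3

source = sqlite3.connect(":memory:")     # stand-in for an operational database
warehouse = sqlite3.connect(":memory:")  # stand-in for the warehouse

# Extract: seed and read raw transactional rows.
source.execute("CREATE TABLE sales (day TEXT, amount REAL)")
source.executemany("INSERT INTO sales VALUES (?, ?)",
                   [("2024-01-01", 10.0), ("2024-01-01", 5.5), ("2024-01-02", 7.0)])
rows = source.execute("SELECT day, amount FROM sales").fetchall()

# Transform: aggregate per day (the shaping done before loading).
totals = {}
for day, amount in rows:
    totals[day] = totals.get(day, 0.0) + amount

# Load: write the structured, query-ready result into the warehouse.
warehouse.execute("CREATE TABLE daily_sales (day TEXT PRIMARY KEY, total REAL)")
warehouse.executemany("INSERT INTO daily_sales VALUES (?, ?)", totals.items())
warehouse.commit()
print(warehouse.execute("SELECT * FROM daily_sales ORDER BY day").fetchall())
```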
Data Lake:
Definition: A Data Lake is a large, centralized repository that stores vast amounts of raw, unstructured,
semi-structured, and structured data. It allows data to be stored in its native format until it is needed
for analysis.
Characteristics:
o Raw and Unstructured Data: Can store unstructured data like text, images, videos, logs, and social
media data, alongside structured data.
o Scalability: Highly scalable, capable of storing petabytes of data, often used in big data applications.
o Schema-on-Read: Data is stored without a predefined schema; the schema is applied when the data is
read or analyzed, making it more flexible for future analysis (see the sketch at the end of this section).
o Data Variety: Capable of storing diverse data types (e.g., JSON, XML, images, text).
Use Cases:
o Big data analytics
o Machine learning and data mining
o Real-time analytics and data exploration
Examples:
o Amazon S3 (with analytics tools like AWS Lake Formation), Microsoft Azure Data Lake Storage, Google
Cloud Storage.
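The schema-on-read idea can be shown in a few lines of Python: raw, heterogeneous JSON records are ingested as-is, and a schema is imposed only when a particular analysis reads them back. The file and field names below are illustrative.
```python
# Schema-on-read sketch: raw JSON lines are dumped into the "lake" as-is;
# a schema is only imposed at read time, when a specific analysis needs it.
import json
from pathlib import Path

lake_file = Path("events.jsonl")

# Ingest: store heterogeneous raw events without any upfront schema.
raw_events = [
    {"type": "click", "user": "u1", "ts": 1700000000},
    {"type": "purchase", "user": "u2", "amount": 19.99},
    {"note": "free-form text with no common structure"},
]
lake_file.write_text("\n".join(json.dumps(e) for e in raw_events))

# Read: apply a schema now, keeping only records that fit this analysis.
purchases = []
for line in lake_file.read_text().splitlines():
    record = json.loads(line)
    if record.get("type") == "purchase" and "amount" in record:
        purchases.append({"user": record["user"], "amount": float(record["amount"])})

print(purchases)  # [{'user': 'u2', 'amount': 19.99}]
```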
Difference Between Data Warehouse and Data Lake
Feature        | Data Warehouse                                         | Data Lake
Data Type      | Primarily structured data                              | Structured, semi-structured, and unstructured data
Storage Format | Data is processed and structured before storing        | Raw data stored in its native format
Schema         | Schema-on-write (predefined schema)                    | Schema-on-read (schema applied during analysis)
Use Case       | Business intelligence, reporting, historical analysis  | Big data analytics, machine learning, real-time processing
Processing     | Optimized for complex queries and reporting (OLAP)     | Optimized for flexible and scalable data storage and processing
1. Data Mining
Definition: Data mining is the process of discovering patterns, correlations, trends, and useful information from
large datasets using statistical, mathematical, and computational techniques. It transforms raw data into
valuable insights.
Techniques:
o Classification: Assigning items to predefined categories (e.g., spam vs. non-spam emails).
o Clustering: Grouping similar items without predefined labels (e.g., customer segmentation; see the
sketch below).
o Association Rule Mining: Finding interesting relationships between variables (e.g., market basket
analysis).
o Regression: Predicting a continuous value based on data (e.g., forecasting sales).
o Anomaly Detection: Identifying outliers or unusual patterns in data (e.g., fraud detection).
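As a small example of the clustering technique above, the sketch below segments a handful of made-up customers with k-means, assuming scikit-learn is installed; the two features and the cluster count are chosen purely for illustration.
```python
# Clustering sketch for customer segmentation, assuming scikit-learn is
# installed. Features and data are illustrative, not from a real dataset.
import numpy as np
from sklearn.cluster import KMeans

# Each row is one customer: [annual_spend, visits_per_month].
customers = np.array([
    [200.0, 1], [220.0, 2], [250.0, 1],        # low-spend, infrequent
    [5000.0, 12], [5200.0, 15], [4800.0, 10],  # high-spend, frequent
])

model = KMeans(n_clusters=2, n_init=10, random_state=0).fit(customers)
print(model.labels_)           # cluster assignment per customer
print(model.cluster_centers_)  # centroid of each segment
```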
Applications:
o Market basket analysis
o Fraud detection
o Customer segmentation
o Predictive analytics (e.g., stock market forecasting)
2. Big Data
Definition: Big Data refers to extremely large datasets that are too complex or voluminous to be processed by
traditional database management systems (DBMS) or computing tools. Big data typically involves the 3 Vs:
o Volume: Large amounts of data (petabytes, exabytes).
o Velocity: The speed at which data is generated, processed, and analyzed (real-time or near real-time).
o Variety: Different types of data (structured, semi-structured, unstructured).
Characteristics:
o Scale: Big data can include vast amounts of data that come from diverse sources like social media,
sensors, logs, and more.
o Data Processing: Processing big data efficiently requires distributed computing frameworks such as
Hadoop and Spark (see the PySpark sketch after this section).
o Advanced Analytics: Often used for advanced analytics, predictive modeling, and machine learning.
Applications:
o Real-time analytics (e.g., stock market analysis)
o Internet of Things (IoT)
o Social media analysis
o Healthcare, genomics, and scientific research
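Since Hadoop and Spark are named above as typical processing engines, here is a minimal PySpark sketch of a distributed aggregation, assuming a local pyspark installation; the input path and column names are placeholders.
```python
# PySpark sketch: a distributed aggregation of the kind used for big data,
# assuming pyspark is installed. The JSON path and columns are placeholders.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("big-data-sketch").getOrCreate()

# In a real deployment this would point at a distributed store (HDFS, S3).
df = spark.read.json("events/*.json")

# Spark plans this group-by as parallel tasks across the cluster's workers.
(df.groupBy("user")
   .agg(F.count("*").alias("events"), F.sum("amount").alias("total"))
   .orderBy(F.desc("events"))
   .show())

spark.stop()
```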
3. MongoDB
Definition: MongoDB is an open-source, NoSQL database that stores data in a flexible, document-oriented
format using JSON-like documents (BSON - Binary JSON). MongoDB is designed to handle large-scale,
unstructured, or semi-structured data.
Key Features:
o Schema-less Design: Data is stored in JSON-like documents, allowing flexibility in the structure (no
predefined schema).
o Scalability: MongoDB is horizontally scalable, meaning it can distribute data across many servers
(sharding).
o High Availability: Supports replication, where data is duplicated across multiple nodes to ensure
availability.
o Aggregation Framework: Provides a powerful way to perform data analysis and aggregation.
Advantages:
o Flexibility and scalability
o High availability and fault tolerance
o Suitable for unstructured data
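A short pymongo sketch illustrates the schema-less design and the aggregation framework described above; it assumes a MongoDB server on localhost, and the database, collection, and field names are made up.
```python
# pymongo sketch: insert schema-less documents and run an aggregation,
# assuming a MongoDB server on localhost and pymongo installed.
from pymongo import MongoClient

client = MongoClient("mongodb://localhost:27017")
orders = client["shop"]["orders"]

# Documents in one collection need not share a structure (schema-less design).
orders.insert_many([
    {"user": "u1", "items": ["pen"], "total": 2.5},
    {"user": "u2", "items": ["book", "lamp"], "total": 30.0, "coupon": "SAVE10"},
])

# Aggregation framework: total spend per user, highest first.
pipeline = [
    {"$group": {"_id": "$user", "spend": {"$sum": "$total"}}},
    {"$sort": {"spend": -1}},
]
for row in orders.aggregate(pipeline):
    print(row)
```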
4. DynamoDB
Definition: DynamoDB is a fully managed NoSQL database service provided by Amazon Web Services (AWS). It
is designed for high availability, scalability, and low-latency performance. DynamoDB is optimized for
applications that require consistent, single-digit millisecond response times.
Features:
o Key-Value and Document Data Model: It supports both key-value pairs and document-based structures,
allowing for flexible data representation.
o Scalable and Fast: DynamoDB can automatically scale up or down to handle increasing traffic without
manual intervention.
o Fully Managed: Amazon handles infrastructure, backups, replication, and scaling automatically.
o Global Replication: Allows for data replication across multiple regions for low-latency access.
Advantages:
o Automatic scaling and management
o Low-latency reads and writes
o Built-in fault tolerance and replication
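To ground the key-value model described above, here is a minimal boto3 sketch of one write and one keyed read; it assumes configured AWS credentials and a pre-existing table named Orders with a string partition key order_id, both of which are illustrative.
```python
# boto3 sketch for DynamoDB: one write and one key-value read, assuming AWS
# credentials are configured and a table "Orders" exists with partition key
# "order_id" (all names here are illustrative).
import boto3

table = boto3.resource("dynamodb").Table("Orders")

# Put a document-like item; attributes beyond the key are free-form.
table.put_item(Item={"order_id": "o-1001", "user": "u1", "total": 25})

# Single-item read by key: the access pattern DynamoDB serves at
# single-digit-millisecond latency.
response = table.get_item(Key={"order_id": "o-1001"})
print(response.get("Item"))
```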