0% found this document useful (0 votes)
25 views

02 - Distributed Database

This document provides an overview of distributed databases. It begins with a brief history of data processing systems from paper files to centralized databases. It then defines a distributed database as a collection of logically related databases spread across multiple physical locations. The key characteristics and advantages of distributed databases are described, such as improved performance, scalability, and fault tolerance. Examples of applications that benefit from distributed databases include banking, airlines, and multi-location businesses. The document concludes by differentiating between distributed and parallel databases and describing the types of distributed databases.

Uploaded by

Amirali Hossiny
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
25 views

02 - Distributed Database

This document provides an overview of distributed databases. It begins with a brief history of data processing systems from paper files to centralized databases. It then defines a distributed database as a collection of logically related databases spread across multiple physical locations. The key characteristics and advantages of distributed databases are described, such as improved performance, scalability, and fault tolerance. Examples of applications that benefit from distributed databases include banking, airlines, and multi-location businesses. The document concludes by differentiating between distributed and parallel databases and describing the types of distributed databases.

Uploaded by

Amirali Hossiny
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 26

ISLAMIC REPUBLIC OF AFGHANISTAN

MINISTRY OF HIGHER EDUCATION


GHALIB UNIVERSITY
COMPUTER SCEINE FACULTY
Distributed Database(Semester 5)

(Distributed Database)

LECTURE R: HAMED AMIRY

DISTRIBUTED DATABASE LECTURE 02 SPRING 2020 1


Table of content
Little bit of History
File processing Systems
Database Approach
Distributed DB and DBMS
Main Characteristics
Where to apply
Distributed DBMS
DDBMS Advantages
DDBMS Disadvantages
Distributed Database
Reasons for DDBS
Types of distributed databases
Difference b/w Distributed Database and Parallel Databases

DISTRIBUTED DATABASE LECTURE 02 SPRING 2020 2


Little bit of History
The “dark ages”: paper forms in file cabinets
Computerized systems emerge
 Same functionality as old paper-based systems
 Initially for big projects like Social Security
The “golden age”: databases are everywhere
 Most activities tracked electronically
Stored data provides detailed history of activity
The next step: use data for decision-making
Made possible by omnipresence of IT
 Identify inefficiencies in current processes
 Quantify likely impact of decisions

DISTRIBUTED DATABASE LECTURE 02 SPRING 2020 3


Computerized systems(File Processing System)
Traditional File Processing System: the very first form of business
data processing
Each program contains data description that it manipulates
Redundancy of data
Problems in maintenance

DISTRIBUTED DATABASE LECTURE 02 SPRING 2020 4


Traditional File Processing

DISTRIBUTED DATABASE LECTURE 02 SPRING 2020 5


Library Examination Registration

Library Examination Registration


Applications Applications Applications

Library Examination Registration


Data Data Data
Files Files Files

DISTRIBUTED DATABASE LECTURE 02 SPRING 2020 6


File Processing Systems
Library Exam Registration
Reg_Number Reg_Number Reg_Number
Name Name Name
Father Name Address Father Name
Books Issued Class Phone
Fine Semester Address
Grade Class

Duplication of Data
Vulnerable to Inconsistency

DISTRIBUTED DATABASE LECTURE 02 SPRING 2020 7


History continues

Database Approach: (Also called centralized database) Database


is a shared collection of logically related data

DISTRIBUTED DATABASE LECTURE 02 SPRING 2020 8


Database Approach
PROGRAM 1
Data
Description
Database
PROGRAM 2 Data
Manipulation
….
PROGRAM 3

Takes care of all major drawbacks of File


System Environment plus more

DISTRIBUTED DATABASE LECTURE 02 SPRING 2020 9


Distributed DB and DBMS

DISTRIBUTED DATABASE LECTURE 02 SPRING 2020 10


Distributed Database
Distributed Database: A collection of logically interrelated databases
that are spread physically across multiple locations connected by a
data communications link.

DISTRIBUTED DATABASE LECTURE 02 SPRING 2020 11


Main Characteristics
Data at multiple sites
DM at each site
Local requirements
Global perspective

DISTRIBUTED DATABASE LECTURE 02 SPRING 2020 12


Where to apply
Major two reasons that make an application a candidate to be
DDBS application
Large Number of Users
Operation spread large geographical area

DISTRIBUTED DATABASE LECTURE 02 SPRING 2020 13


Banking
Air Ticketing
Business at multiple locations

DISTRIBUTED DATABASE LECTURE 02 SPRING 2020 14


Distributed DBMS:

A software system that permits the management of DDB


and makes the distributed transparent environment to the
users.

DISTRIBUTED DATABASE LECTURE 02 SPRING 2020 15


DDBMS Advantages
Data are located near “greatest demand” site
Faster data access
Faster data processing
Growth facilitation
Improved communications
Reduced operating costs
User-friendly interface
Less danger of a single-point failure
Processor independence
DISTRIBUTED DATABASE LECTURE 02 SPRING 2020 16
DDBMS Disadvantages
Complexity of management and control
Security
Lack of standards
Increased storage requirements
Greater difficulty in managing the data environment
Increased training cost

DISTRIBUTED DATABASE LECTURE 02 SPRING 2020 17


Reasons for DDBS

DISTRIBUTED DATABASE LECTURE 02 SPRING 2020 18


Reasons for DDBS
Local units want control over data.
Consolidate data for integrated decisions
Reduce telecommunication costs.
Reduce the risk of telecommunication failures.

DISTRIBUTED DATABASE LECTURE 02 SPRING 2020 19


Transparency
User View System View

DISTRIBUTED DATABASE LECTURE 02 SPRING 2020 20


Types of distributed databases
Homogeneous Database: In a homogeneous database, all different sites store
database identically. The operating system, database management system and the
data structures used – all are same at all sites. Hence, they’re easy to manage

DISTRIBUTED DATABASE LECTURE 02 SPRING 2020 21


Types of distributed databases cont.…
Heterogeneous Database: In a heterogeneous distributed database, different sites
can use different schema and software that can lead to problems in query
processing and transactions.

Also, a particular site might be completely unaware of the other sites. Different
computers may use a different operating system, different database application. They
may even use different data models for the database. Hence, translations are required
for different sites to communicate.

DISTRIBUTED DATABASE LECTURE 02 SPRING 2020 22


What is Parallel database...??
A parallel database system is to improve performance through parallelization of various operations,
such as loading data, building indexes and evaluating queries.
The distribution is solely done on the bases of performance.
Parallel databases improve processing and input/output speeds by using multiple CPUs and disks in
parallel.
Many operations are performed simultaneously
Data may be stored in a distributed fashion.

DISTRIBUTED DATABASE LECTURE 02 SPRING 2020 23


Difference b/w Distributed Database and Parallel
Database
Characteristics Parallel Database Distributed database
Definition It is a software system It is a software system that
where multiple manages multiple logically
processors or machines are interrelated databases
used to distributed over a computer
execute and run queries in network.
parallel.
Geographical The nodes are located at The nodes are usually
Location geographically located at geographically
same location. different locations.
Execution SPEED Quicker Slower
Overhead Less More

DISTRIBUTED DATABASE LECTURE 02 SPRING 2020 24


Difference b/w Distributed Database and Parallel
Database
Characteristics Parallel Database Distributed database
Node types Compulsorily Homogeneous Need not be homogeneous
Performance Lower reliability & Higher reliability &
availability. availability.
Scope of Difficult to expand Easier to expand
Expansion
Backup Backup at one site only Backup at multiple sites
Consistency Maintaining consistency is Maintaining consistency is
easier difficult.

DISTRIBUTED DATABASE LECTURE 02 SPRING 2020 25


Any Question??
Please write your questions and suggestions on the group!

DISTRIBUTED DATABASE LECTURE 02 SPRING 2020 26

You might also like