ISLAMIC REPUBLIC OF AFGHANISTAN
MINISTRY OF HIGHER EDUCATION
GHALIB UNIVERSITY
COMPUTER SCIENCE FACULTY
Distributed Database (Semester 5)
LECTURER: HAMED AMIRY
DISTRIBUTED DATABASE LECTURE 02 SPRING 2020 1
Table of contents
Little bit of History
File processing Systems
Database Approach
Distributed DB and DBMS
Main Characteristics
Where to apply
Distributed DBMS
DDBMS Advantages
DDBMS Disadvantages
Distributed Database
Reasons for DDBS
Types of distributed databases
Difference between Distributed Databases and Parallel Databases
Little bit of History
The “dark ages”: paper forms in file cabinets
Computerized systems emerge
Same functionality as old paper-based systems
Initially for big projects like Social Security
The “golden age”: databases are everywhere
Most activities tracked electronically
Stored data provides detailed history of activity
The next step: use data for decision-making
Made possible by omnipresence of IT
Identify inefficiencies in current processes
Quantify likely impact of decisions
Computerized systems (File Processing System)
Traditional file processing system: the very first form of business data processing
Each program contains its own description of the data it manipulates
Redundancy of data
Problems in maintenance
Traditional File Processing
[Figure: the Library, Examination, and Registration applications each work on their own separate data files.]
File Processing Systems
Library          Exam           Registration
Reg_Number       Reg_Number     Reg_Number
Name             Name           Name
Father Name      Address        Father Name
Books Issued     Class          Phone
Fine             Semester       Address
                 Grade          Class
Duplication of Data
Vulnerable to Inconsistency
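The duplication problem can be illustrated with a small sketch. It assumes each application's file is modeled as a Python dictionary keyed by Reg_Number; the student record, the values, and the helper `find_conflicts` are made up for illustration and are not part of the lecture.

```python
# Hypothetical per-application "files", each keyed by Reg_Number.
# All names and values below are illustrative assumptions.
library_file = {
    "R-101": {"Name": "Ahmad", "Father Name": "Karim", "Books Issued": 2, "Fine": 0},
}
exam_file = {
    "R-101": {"Name": "Ahmad", "Address": "Herat", "Class": "CS-5",
              "Semester": 5, "Grade": "A"},
}
registration_file = {
    "R-101": {"Name": "Ahmad", "Father Name": "Karim", "Phone": "0700-000000",
              "Address": "Herat", "Class": "CS-5"},
}


def find_conflicts(files, reg_number):
    """Return the fields whose values disagree across the application files."""
    seen = {}  # field name -> set of distinct values found for it
    for records in files.values():
        for field, value in records.get(reg_number, {}).items():
            seen.setdefault(field, set()).add(value)
    return sorted(field for field, values in seen.items() if len(values) > 1)


# The student moves, but only the Registration application is updated.
registration_file["R-101"]["Address"] = "Kabul"

files = {"Library": library_file, "Exam": exam_file, "Registration": registration_file}
conflicts = find_conflicts(files, "R-101")
print(conflicts)
```

Because the address is stored twice, one update leaves the Exam copy stale, which is exactly the inconsistency the slide warns about.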
History continues
Database Approach (also called the centralized database approach): a database is a shared collection of logically related data
Database Approach
[Figure: PROGRAM 1, PROGRAM 2, PROGRAM 3, ... share a single Database (labels: Data Description, Data Manipulation).]
Takes care of all the major drawbacks of the file system environment, plus more
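As a rough sketch of the database approach, the example below uses Python's standard `sqlite3` module with an in-memory database. The table layout, the sample row, and the two "program" functions are assumptions chosen for illustration; the point is only that the data description lives in the database and both programs see the same single copy.

```python
import sqlite3

# One shared database; the schema (data description) is stored in it,
# not repeated inside each program.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE student (reg_number TEXT PRIMARY KEY, name TEXT, address TEXT)"
)
conn.execute("INSERT INTO student VALUES (?, ?, ?)", ("R-101", "Ahmad", "Herat"))


def registration_update_address(db, reg_number, new_address):
    """Hypothetical 'Registration' program: changes the single stored address."""
    db.execute(
        "UPDATE student SET address = ? WHERE reg_number = ?",
        (new_address, reg_number),
    )


def exam_lookup_address(db, reg_number):
    """Hypothetical 'Exam' program: reads the same shared row."""
    row = db.execute(
        "SELECT address FROM student WHERE reg_number = ?", (reg_number,)
    ).fetchone()
    return row[0] if row else None


registration_update_address(conn, "R-101", "Kabul")
exam_address = exam_lookup_address(conn, "R-101")
print(exam_address)
```

Since there is only one copy of the address, the update made by one program is what the other program reads.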
Distributed DB and DBMS
Distributed Database
Distributed Database: A collection of logically interrelated databases
that are spread physically across multiple locations connected by a
data communications link.
Main Characteristics
Data at multiple sites
DM at each site
Local requirements
Global perspective
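These characteristics can be sketched in a few lines. The `Site` class, the site names, and the account balances are hypothetical; the sketch only shows that each site answers local questions from its own data while a global question combines the answers of all sites.

```python
class Site:
    """One location holding its own share of the data (illustrative model)."""

    def __init__(self, name, accounts):
        self.name = name
        self.accounts = dict(accounts)  # account id -> balance, stored locally

    def local_total(self):
        # Local requirement: answered using only this site's data.
        return sum(self.accounts.values())


def global_total(sites):
    # Global perspective: combine the local answers of every site.
    return sum(site.local_total() for site in sites)


kabul = Site("Kabul", {"K-1": 500, "K-2": 250})
herat = Site("Herat", {"H-1": 1000})

kabul_total = kabul.local_total()
overall_total = global_total([kabul, herat])
print(kabul_total, overall_total)
```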
Where to apply
Two major reasons make an application a candidate for a DDBS:
Large number of users
Operations spread over a large geographical area
Banking
Air Ticketing
Business at multiple locations
Distributed DBMS:
A software system that permits the management of a DDB
and makes the distribution transparent to the users.
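One way to picture this transparency is a thin layer that remembers where each item is stored, so the user asks by key and never names a site. The class `DistributedCatalog` and its directory are a toy sketch under that assumption, not how any particular DDBMS is built.

```python
class DistributedCatalog:
    """Toy facade that hides which site stores each item (illustrative only)."""

    def __init__(self):
        self._sites = {}     # site name -> {key: value}
        self._location = {}  # key -> site name (a simple global directory)

    def put(self, site, key, value):
        # An administrator decides where the data is placed.
        self._sites.setdefault(site, {})[key] = value
        self._location[key] = site

    def get(self, key):
        # The user supplies only the key; the site is looked up internally.
        site = self._location[key]  # raises KeyError for an unknown key
        return self._sites[site][key]


catalog = DistributedCatalog()
catalog.put("Kabul", "K-1", 500)
catalog.put("Herat", "H-1", 1000)

balance = catalog.get("H-1")  # no site name needed by the caller
print(balance)
```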
DDBMS Advantages
Data are located near “greatest demand” site
Faster data access
Faster data processing
Growth facilitation
Improved communications
Reduced operating costs
User-friendly interface
Less danger of a single-point failure
Processor independence
DDBMS Disadvantages
Complexity of management and control
Security
Lack of standards
Increased storage requirements
Greater difficulty in managing the data environment
Increased training cost
Reasons for DDBS
Local units want control over data.
Consolidate data for integrated decisions.
Reduce telecommunication costs.
Reduce the risk of telecommunication failures.
Transparency
[Figure: User View and System View]
Types of distributed databases
Homogeneous database: in a homogeneous distributed database, all sites store the
database identically. The operating system, the database management system, and the
data structures are the same at every site, so such systems are easy to manage.
Types of distributed databases (cont.)
Heterogeneous database: in a heterogeneous distributed database, different sites
can use different schemas and software, which can lead to problems in query
processing and transactions.
A particular site might also be completely unaware of the other sites. Different
computers may use different operating systems and different database applications, and
may even use different data models for the database. Hence, translations are required
for different sites to communicate.
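The need for translation can be sketched as follows. The two record layouts, the field names, and the translator functions are invented for illustration: one site is assumed to store a student as a dictionary, the other as a tuple, and each is mapped to one common form before the results are combined.

```python
# Site A (hypothetical): records are dictionaries with its own field names.
site_a_row = {"reg_number": "R-101", "full_name": "Ahmad Karimi"}

# Site B (hypothetical): records are tuples of (id, first name, last name).
site_b_row = ("R-202", "Sara", "Noori")


def from_site_a(row):
    """Translate a Site A record into the common form."""
    return {"id": row["reg_number"], "name": row["full_name"]}


def from_site_b(row):
    """Translate a Site B record into the common form."""
    return {"id": row[0], "name": f"{row[1]} {row[2]}"}


# After translation, records from both sites can be handled uniformly.
unified = [from_site_a(site_a_row), from_site_b(site_b_row)]
print(unified)
```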
What is a parallel database?
A parallel database system aims to improve performance by parallelizing operations
such as loading data, building indexes, and evaluating queries.
The distribution is done solely on the basis of performance.
Parallel databases improve processing and input/output speeds by using multiple CPUs and disks in
parallel.
Many operations are performed simultaneously.
Data may be stored in a distributed fashion.
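The idea of evaluating one query over several partitions at once can be sketched with Python's standard `concurrent.futures` module. The partitions, the threshold, and the function names are assumptions for illustration. Threads are used here only to show the structure (split the data, work on the parts concurrently, combine the partial results); a real parallel database spreads this work over multiple CPUs and disks.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical table split into three partitions (e.g. one per disk).
partitions = [[3, 18, 7], [22, 5], [14, 9, 30]]


def count_matching(partition, threshold):
    """Evaluate the query on one partition: count values above the threshold."""
    return sum(1 for value in partition if value > threshold)


def parallel_count(parts, threshold, workers=3):
    """Run the same query on every partition concurrently and merge the counts."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partial_counts = pool.map(count_matching, parts, [threshold] * len(parts))
        return sum(partial_counts)


result = parallel_count(partitions, 10)
print(result)
```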
Difference between Distributed Databases and Parallel Databases
Characteristic          Parallel database                           Distributed database
Definition              A software system in which multiple         A software system that manages multiple
                        processors or machines execute queries      logically interrelated databases
                        in parallel.                                distributed over a computer network.
Geographical location   Nodes are at the same geographical          Nodes are usually at geographically
                        location.                                   different locations.
Execution speed         Quicker                                     Slower
Overhead                Less                                        More
Node types              Must be homogeneous                         Need not be homogeneous
Performance             Lower reliability and availability          Higher reliability and availability
Scope of expansion      Difficult to expand                         Easier to expand
Backup                  Backup at one site only                     Backup at multiple sites
Consistency             Maintaining consistency is easier           Maintaining consistency is difficult
Any questions?
Please write your questions and suggestions on the group!