0% found this document useful (0 votes)
8 views30 pages

Lecture 7 and 8 - Administration of DDBMS

The document discusses the concept of Distributed Database Management Systems (DDBMS), which manage multiple interconnected databases spread across various locations, making them appear as a single database to users. It outlines the features, benefits, and challenges of distributed databases, including their ability to enhance reliability, availability, and performance while also addressing issues like latency and complexity. Additionally, it covers types of distributed databases such as replication and sharding, and emphasizes the importance of DDBMS in data warehousing and maintaining data integrity.

Uploaded by

metrombuvi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
8 views30 pages

Lecture 7 and 8 - Administration of DDBMS

The document discusses the concept of Distributed Database Management Systems (DDBMS), which manage multiple interconnected databases spread across various locations, making them appear as a single database to users. It outlines the features, benefits, and challenges of distributed databases, including their ability to enhance reliability, availability, and performance while also addressing issues like latency and complexity. Additionally, it covers types of distributed databases such as replication and sharding, and emphasizes the importance of DDBMS in data warehousing and maintaining data integrity.

Uploaded by

metrombuvi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 30

Distributed DBMS - Concepts

For proper functioning of any organization, there’s a


need for a well-maintained database. In the recent
past, databases used to be centralized in nature.
However, with the increase in globalization,
organizations tend to be diversified across the globe.
They may choose to distribute data over local
servers instead of a central database. Thus, arrived
the concept of Distributed Databases
Distributed DBMS - Distributed Databases
This chapter introduces the concept of DDBMS. In a
distributed database, there are a number of databases that
may be geographically distributed all over the world. A
distributed DBMS manages the distributed database in a
manner so that it appears as one single database to users. In
the later part of the chapter, we go on to study the factors
that lead to distributed databases, its advantages and
disadvantages.
A distributed database is a collection of multiple
interconnected databases, which are spread physically
across various locations that communicate via a computer
network.
Features
Databases in the collection are logically interrelated with each
other. Often they represent a single logical database.
Data is physically stored across multiple sites. Data in each site
can be managed by a DBMS independent of the other sites.
The processors in the sites are connected via a network. They
do not have any multiprocessor configuration.

A distributed database is not a loosely connected file system.


A distributed database incorporates transaction processing, but
it is not synonymous with a transaction processing system
Distributed Database Management System

A distributed database management system (DDBMS)

is a centralized software system that manages a

distributed database in a manner as if it were all

stored in a single location


Types of Distributed Databases
Distributed databases can be broadly classified into homogeneous and heterogeneous
distributed database environments, each with further sub-divisions, as shown in the
following illustration
Features of Distributed Database Management System
1. It is used to create, retrieve, update and delete distributed
databases.
2. It synchronizes the database periodically and provides access
mechanisms by the virtue of which the distribution becomes
transparent to the users.
3. It ensures that the data modified at any site is universally
updated.
4. It is used in application areas where large volumes of data are
processed and accessed by numerous users simultaneously.
5. It is designed for heterogeneous database platforms.
6. It maintains confidentiality and data integrity of the databases.
Administration of DDBMS
A distributed system is a system with multiple components located
on different nodes (machines) that communicate and coordinate
actions in order to appear as a single system to the end-user
The node that is a part of a distributed system may be
computers, physical servers, containers, or any other node that
can connect to the network, have local memory, and
communicate by passing messages.
Although distributed systems can sometimes be obscure, they
usually have three primary characteristics: all components run
concurrently, there is no global clock, and all components fail
independently of each other
What are distributed Computing systems
Benefits of distributed systems
Horizontal Scalability

Scale-out (horizontal) scaling is distributing the workload over many resources. Loosely
coupled systems allow you to distribute the system over different resources and provide
better availability.
Reliability
Most distributed systems are fault-tolerant as they can be made up of hundreds of
nodes that work together. The system generally doesn’t experience any disruptions if a
single machine fails (avoid single-point failure).
Performance
Distributed systems are extremely efficient because workloads can be broken up and
sent to multiple machines.
Cost-effective
The initial cost is higher than a traditional system, but because of their scalability, they quickly
become more cost-effective
Parallelism
Distributed systems can be designed for parallelism, where multiple processors divide up a
complex problem into pieces

Challenges of distributed systems


However, distributed systems are not without challenges. Complex architectural design, construction, and
debugging processes that are required to create an effective distributed system can be overwhelming.
Failure Handling
Failure handling can be difficult with distributed systems because some components fail while others
continue to function. This can often serve as an advantage to prevent large-scale failures, but it also
leads to more complexity when it comes to troubleshooting and debugging.
Latency
The more widely your system is distributed, the more latency you can experience with communications.
Concurrency
A common issue occurs when several clients attempt to access a shared resource simultaneously. You
must ensure that all resources are safe in a concurrent environment.
Higher initial infrastructure costs
The initial deployment cost of a distributed system can be higher than a single system. This pricing
includes basic network setup issues, such as transmission, high load, and loss of information
We use distributed databases to:
•Increased Reliability and availability
•Better Response
•Modular Development
•Less Data Movement over Network
•Smaller Databases are Easier to Manage
•Smaller Databases are Faster
•Database Sharding can Reduce Costs
Types of distributed databases
Replication
Database replication can be used to provide fault tolerance that has a
database back end. You just need to make sure that your database platform
supports it. You also need to make sure that the application stores data in a
way that is conducive to replication
Sharding
Sharding is the process of breaking up large tables into smaller chunks
called shards that are spread across multiple servers. The idea is to
distribute data that can’t fit on a single node onto a cluster of database
nodes.
Microservices
Microservices are an architectural approach to building applications. As an
architectural framework, microservices are distributed and loosely coupled
services that communicate with each other through the network
Explain fully distributed database
management system

A distributed database management system


(DDBMS) is a set of multiple, logically
interrelated databases distributed over a
network. They provide a mechanism that makes
the distribution of data transparent to users
DDBMS is a centralized application that manages a distributed
database.
This database system synchronizes data periodically and
ensures that any change in data made by users is universally
updated in the database.

DDBMS is widely used in data warehousing, where huge


volumes of data are processed and accessed by numerous
users or database clients at the same time. This database
system is used to manage data in networks, maintain
confidentiality and handle data integrity.

A distributed database management system is designed for


heterogeneous database platforms that focus on heterogeneous
database management systems
What is distributed database administration?

A distributed database is a database that consists of two


or more files located in different sites either on the
same network or on entirely different networks.
Portions of the database are stored in multiple physical
locations and processing is distributed among multiple
database nodes
What are the components of distributed database
management system?
The different components of DDBMS are as follows: Computer
workstations or remote devices (sites or nodes) that form the
network system. The distributed database system must be
independent of the computer system hardware. Network hardware
and software components that reside in each workstation or
device
What is the purpose of a distributed database
management system DDBms?
DDBMS is widely used in data warehousing, where huge volumes
of data are processed and accessed by numerous users or
database clients at the same time.

This database system is used to manage data in networks,


maintain confidentiality and handle data integrity
What are the three main characteristics of a distributed
system?
Although distributed systems can sometimes be obscure, they
usually have three primary characteristics: all components run
concurrently, there is no global clock, and all components fail
independently of each other

What are the major objectives of distributed database?


Definition. A distributed DBMS is a software system that manages a
distributed database, which consists of data that are partitioned and
replicated among interconnected server sites. The primary objective
of a distributed DBMS is to hide data distribution so that it
appears as one logical database system to the clients .
What are the three 3 advantages of distributed
database systems?
1)In a distributed database, data can be stored in
different systems like personal computers, servers,
mainframes, etc.
2) A user doesn't know where the data is located
physically. Database presents the data to the user as if
it were located locally.
3)Database can be accessed over different networks.
Advantages of distributed database:
1) In a distributed database, data can be stored in different systems like personal computers, servers,
mainframes, etc.
2) A user doesn’t know where the data is located physically. Database presents the data to the user as if it
were located locally.
3) Database can be accessed over different networks.
4) Data can be joined and updated from different tables which are located on different machines
5) Even if a system fails the integrity of the distributed database is maintained.
6) A distributed database is secure.

Disadvantages of distributed database:


1) Since the data is accessed from a remote system, performance is reduced.
2) Static SQL cannot be used.
3) Network traffic is increased in a distributed database.
4) Database optimization is difficult in a distributed database.
5) Different data formats are used in different systems.
6) Different DBMS products are used in different systems which increases in complexity of the
system.
7) Managing system catalog is a difficult task.
8) While recovering a failed system, the DBMS has to make sure that the recovered system is
consistent with other systems.
9) Managing distributed deadlock is a difficult task.
An architecture that can be considered distributed
A Thorough Introduction to Distributed Systems
Distributed computing is defined as a system
consisting of software components spread over
different computers but running as a single entity.

A distributed system can be an arrangement of


different configurations, such as mainframes,
computers, workstations, and minicomputers. This
article gives in-depth insights into the working of
distributed systems, the types of system
architectures, and essential components with real-
world examples.
Distributed Components in the
Call Center Management
System
Components of a distributed multimedia system

You might also like