0% found this document useful (0 votes)
9 views

Database Management Systems and Distributed Systems Lesson 3

Uploaded by

go0215
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
9 views

Database Management Systems and Distributed Systems Lesson 3

Uploaded by

go0215
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 34

Database management systems

and distributed systems


Definition of Database management
System(DBMS)
A concept which develops a special data processing system, or part of a
data processing system, which aids in the storage, manipulation,
reporting, management, and control of data.
OR
A DBMS is a concept which lead to set of software programs that
controls the organization, storage, management, and retrieval of data
in and from database.
File processing system
File processing is the process of
• creating,
• storing and
• accessing content of files.
File processing
• File processing can be used in opening saved files for read only purpose.
• It can be used to save a new file or displace the existing one.
Through file processing you can make new files and save the existing files.
• You can also modify files through this process.
• Compressing files, folders, and programs decreases their size and reduces
the amount of space they use on your volumes or removable storage
devices.
• Volume compression decreases the amount of space that is used by all of
the files and folders that are stored on that volume. Because of a potential
loss of performance, you may not want to compress some files.
Features/Advantages of DBMS

Data independence 6. Security management


Removes redundancy 7. Access control
Data integrity 8. Improved consistency
Back-up and Recovery 9. Reliability
Sharing the data 10. Multi-user
Database Approach

• A database is a collection of interrelated data's stored in a database server;


these data's will be stored in the form of tables. The primary aim of database
is to provide a way to store and retrieve database information in
fast and efficient manner.
• There are number of characteristics that differs from traditional file
management system. In file system approach, each user defines and
implements the needed files for a specific application to run.
For example in sales department of an enterprise, One user will be
maintaining the details of how many sales personnel are there in the sales
department and their grades, these details will be stored and maintained in a
separate file.
• Another user will be maintaining the salesperson salary details working in
the concern, the detailed salary report will be stored and maintained in a
separate file.
Although both of the users are interested in the data's of the sales persons
they will be having their details in a separate files and they need different
programs to manipulate their files. This will lead to wastage of space and
redundancy or replication of data's, which may lead to confusion, sharing of
data among various users is not possible, data inconsistency may occur.
These files will not be having any inter-relationship among the data's stored
in these files.
Therefore in traditional file processing every user will be defining their own
constraints and implement the files needed for the applications.
Distributed database system
• A distributed database system is a type of database
management system that stores data across multiple
computers or sites that are connected by a network.
Database, distributed database, relational
database
• A database is a set of structured information stored on a computer
that can provide a variety of views of the data.
• A distributed database is a collection of databases physically stored
across sites in a computer network.
• A relational database is a collection of multiple tables, records and
fields.
Distributed Database System

• A distributed database is basically a database that is not


limited to one system, it is spread over different sites, i.e. on
multiple computers or over a network of computers.
• A distributed database system is located on various sites that
don’t share physical components.
• This may be required when a particular database needs to be
accessed by various users globally.
• It needs to be managed such that for the users it looks like one
single database.
Types
Two types
• Homogeneous
• Heterogeneous
Homogeneous Database

In a homogeneous database, all different sites store database
identically.
• The operating system, database management system, and the
data structures used – all are the same at all sites.
Hence, they’re easy to manage.
Heterogeneous Database
In a heterogeneous distributed database, different sites can use
different schema and software that can lead to problems in
query processing and transactions.
Also, a particular site might be completely unaware of the other
sites. Different computers may use a different operating system,
different database application.
They may even use different data models for the database.
Hence, translations are required for different sites to
communicate.
Distributed Data Storage
• There are 2 ways in which data can be stored on different sites.
These are:
• Replication
• Fragmentation
Replication

• In this approach, the entire relationship is stored redundantly at


2 or more sites.
• If the entire database is available at all sites, it is a fully
redundant database.
• Hence, in replication, systems maintain copies of data.
Fragmentation
• In this approach, the relations are fragmented (i.e., they’re
divided into smaller parts) and each of the fragments is stored
in different sites where they’re required.
• It must be made sure that the fragments are such that they can
be used to reconstruct the original relation (i.e, there isn’t any
loss of data).
Fragmentation is advantageous as it doesn’t create copies of
data, consistency is not a problem.
Fragmentation
Fragmentation of relations can be done in two ways
• Horizontal fragmentation – Splitting by rows
The relation is fragmented into groups of tuples so that each
tuple is assigned to at least one fragment.
• Vertical fragmentation – Splitting by columns –
The schema of the relation is divided into smaller schemas.
Each fragment must contain a common candidate key so as to
ensure a lossless join.
combined
• In certain cases, an approach that is hybrid of fragmentation
and replication is used.
Applications of Distributed Database
• It is used in Corporate Management Information System.
• It is used in multimedia applications.
• Used in Military’s control system, Hotel chains etc.
• It is also used in manufacturing control system.
There are several different architectures for
distributed database systems
• Client-server architecture
• Peer to peer
• Federated architecture
Client-server architecture
• In this architecture, clients connect to a central server, which
manages the distributed database system.
• The server is responsible for coordinating transactions,
managing data storage, and providing access control.
Peer to peer
• In this architecture, each site in the distributed database
system is connected to all other sites.
• Each site is responsible for managing its own data and
coordinating transactions with other sites.
Federated architecture

• In this architecture, each site in the distributed database


system maintains its own independent database, but the
databases are integrated through a middleware layer that
provides a common interface for accessing and querying the
data.
Advantages of Distributed Database
System
• There is fast data processing as several sites participate in
request processing.
• Reliability and availability of this system is high.
• It possess reduced operating cost.
• It is easier to expand the system by adding more sites.
• It has improved sharing ability and local autonomy.
Disadvantages of Distributed Database
System

1)The system becomes complex to manage and control.
2) The security issues must be carefully managed.
3) The system require deadlock handling during the transaction
processing otherwise
the entire system may be in inconsistent state.
4) There is need of some standardization for processing of
distributed database
system.
What is information technology?

• Information technology (IT) is the use of any computers,


storage, networking and other physical devices, infrastructures
and processes to create, process, store, secure and exchange
all forms of electronic data.
• Typically, IT is used in the context of business operations, as
opposed to technology used for personal or entertainment
purposes.
• The commercial use of IT encompasses both computer
technology and telecommunications.
IT COMPUTER HARDWARE
• Hardware
• There are many different types of computer hardware.
Computer servers run business applications. Servers interact
with client devices in the client-server model. They also
communicate with other servers across computer networks,
which typically link to the internet.
• Storage is another type of hardware.
• It's any technology that holds information as data.
• Storage may be local on a specific server or shared among many servers,
and it may be installed on premises or accessed via a cloud service.

• Information that is stored can take many forms, including file, multimedia,
telephony, and web and sensors data.
• Storage hardware includes volatile random-access memory (RAM) as well
as non-volatile tape, hard disk drives and solid-state drives.
• Telecom equipment, comprising network interface cards (NICs), cabling,
wireless communications and switching devices, connect the hardware
elements together and to external networks.
Data Processing Technology
Data Processing Technology
• It is process involving the merger of data-processing technologies and
computer services with telecommunications systems to allow the
widespread networking and circulation of information.
Computer
• A computer is a device that transforms data into meaningful
information.
• It processes the input according to the set of instructions
provided to it by the user and gives the desired output.
• Computers are of various types and they can be categorized in
two ways on the basis of size and on the basis of data handling
capabilities.
• There are two bases on which we can define the types of
computers. We will discuss the type of computers on the basis
of size and data handling capabilities
Types of computers
• Super Computer
• Mainframe computer
• Mini Computer
• Workstation Computer
• Personal Computer (PC)
• Server Computer
• Analog Computer
• Digital Computer
• Hybrid Computer
• Tablets and Smartphone
assignment
● Education • Vegetation and forests
● Statistics (population census • Health
● Transport (road, rail, air) • Weather /climate/ cyclones,
● Social media • disasters
● Farming • The drainage system (rivers, dams
● Mining and minerals lakes, etc)
● Crimes (Security, prisons, theft) • Banking
• Religion
• Retail shops

You might also like