0% found this document useful (0 votes)
8 views

DBMS - Lesson 1 - Data Management

Uploaded by

keithrashid8
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
8 views

DBMS - Lesson 1 - Data Management

Uploaded by

keithrashid8
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 15

Database Management Systems (DBMS) - Lesson 1 – Data Management

Introduction to database management


An organization must have accurate and reliable data (information) for effective decision
making. Data (information) is the back bone and must critical resources of an organization that
enables managers and organizations to gain a competitive edge.
In this age of information explosion, where people are bombarded with data, getting the right
information, in the right amount, the right time is not an easy task. Therefore, only those
organizations that successfully manage their data will survive.
Basic definition and concepts
In an organization data is the most basic resource. To run the organization efficiently the proper
organization and management of data is essential. The following are some of the terms used in
data management.
Data
Data is a collection of raw facts figures which can be processed to produce information. Data
includes text, numbers, images, audio and video. It is represented with the help of characters like
alphabets (A-Z, a-z), digit (0- 9) or special characters (+,/,*,<,>,= etc.)
Data aids in producing information which is based on facts, e.g. if we have data about marks
obtained by all students we can be able to tell the highest mark, the average mark and the lowest
mark hence making an informed decision of which student were non performers.
Data is the raw material used as input to the data processing process and information is processed
data obtained as an output of data which is in turn communicated to the target recipients who use
it to make informed decisions as shown in the following diagram

Input Output User


Processing Decision
data information

Information
Information is processed data. In other words, information is processed, organized or
summarized data.
Characteristics of good/ quality information
Nowadays there is a lot of data but we lack quality information. Quality information must have
the following characteristics:

1 Complied by Mr. P.K Munene


Database Management Systems (DBMS) - Lesson 1 – Data Management

 Accuracy: Accuracy means that information is free from errors and it clearly and
correctly reflects the meaning of data on which it is based, i.e. its degree of correctness is
high.
 Timeliness: Timelines means that the recipients receive the information when they need
it and within the required time frame.
 Relevance: Relevance means the usefulness of the information to the corresponding
persons or intended recipients.
Meta data
Meta data is data about data. Meta data describes objects in the database and makes it easier for
those objects to be accessed or manipulated. The Meta data describes the following about data
 The database structure.
 Size of data type.
 Constrains.
 Applications
 Authorization etc.
All these items are used as an integral tool for information resource management.
Type of Meta data
There are three main types of Meta data namely:
 Descriptive Meta data: Descriptive Meta data is a type Meta data that describes a
resource (data) for the purpose of being discovered, identified or selected. It includes, a
traditional library catalogue form of Meta data, title, abstract, author and keywords etc.
 Structural Meta data: Structural Meta data is a type Meta data that describes the
relationship between the various parts of a resource (data), e.g. how many pages of a
book are ordered to form a chapter.
 Administration Meta data: Administration Meta data is a type Meta data that
provides information to help manage a resource (data), such as when and how it was
created, file type , technical information, preservation, right and use.

2 Complied by Mr. P.K Munene


Database Management Systems (DBMS) - Lesson 1 – Data Management

Data management
Data management is the practice of managing data as a valuable resource to unlock its
potential. It is the practice of collecting, storing, organizing, protecting, verifying and processing
essential data to ensure its accessibility, reliability and timeliness to its users, who use it to make
informed decisions.
Importance of data management
The following are some of the benefits of good data management
 Increased productivity: With good data management organizations will be more
organized and productive. Employees will have an easier time finding, understanding and
relying information.
 Smooth operation: Good data management makes it easy for organization to respond
quickly to the world around them. This means that organization can respond efficiently to
market changes and react appropriately to competitors.
 Reduce security risk: Proper data management helps ensure that organization’s
information stays secure and never ends up in the wrong hands. A strong data
management system will help protect its information from theft and attacks.
 Reduce data loss: With a good data management plan in place organization greatly
reduce the risk of losing vital information. It also ensures important information is backed
up and retrieval in case something happens to the original copies.
 Accurate decisions: Proper data management helps ensure all employees and workers
view and analyses the same information. This helps ensure that the organization will be
making the most accurate decisions based on the most accurate information.
 Cost effective: Good data management can help organization avoid unnecessary extra
costs such as unneeded duplication when data is easily accessible.
Elements/components of data management
The data management process is made up of the following components or concepts:
 Architecture: This is data management component involves defining how data is
connected, integrated, transformed, stored and used.
 Modelling: This is data management component involves defining key business
concepts, e.g. customers, products etc. Data is designed through different models and the
relationship between them are shown.

3 Complied by Mr. P.K Munene


Database Management Systems (DBMS) - Lesson 1 – Data Management

 Integration: This is data management component involves consolidating data into a


single place and view. It combines different sources and also analyses those data for
processing of information.
 Governance: This is data management component ensures that data is consistent and
properly used throughout an organization.
 Security: This data management component prevents unauthorized access to
information.
 Analytics / Mining: This data management component involves drawing insight from
data. It is used for transforming raw data into information.
 Database administration: This data management component involves managing
databases and ensuring data availability.
 Quality management: This data management component ensures data meets business
requirements.
Data management approaches
There are mainly two approaches to data management namely:
 File base approach.
 Database approach.
File base approach.
The file base approach was an early attempt to computerize the manual filling system. A file
system is a method for starting and organizing computer file. And data they contain to make it
easy to find and access them. A file is a collection of related data stored in memory. In this
approach each file is independent of another file and a single file is known as a flat file.
Data in different files are integrated by writing individual program for each application. The data
and the application programs that uses the data are arranged that any change to the data requires
modifying all the programs that use the data.
This is because each file is hand coded with specific information like data type, data size etc.
Sometimes it is not possible to identify all the programs using particular data and programs are
identified on a trial and error basis.
The following diagrams show different applications with each having their own copy of the files
they need in order to carry out the activities for which they are responsible in a file based data
management system, separate files are created and stored for each application program.

4 Complied by Mr. P.K Munene


Database Management Systems (DBMS) - Lesson 1 – Data Management

Marketing Manufacturing Inventory Payroll

Application Application Application Application


program program program program

File File File File File File File


File File File File File
1 2 3 1 2 3 2 2 3
1 3 1
In a file base approach, each functional area in an organization creates, processes and
disseminates its own files. The functional area such as inventory and payroll generates separate
files and do not communicate with each other. Separate files are created and stored for each
application program as shown in the following diagram

. Application
Data File program Users

progran
Marketing Reports
Marketing Program
g

Manufacturing
Manufacturing Control Reports
Program

Inventory
Inventory Control Reports
Program

Payroll Reports
Payroll Program

5 Complied by Mr. P.K Munene


Database Management Systems (DBMS) - Lesson 1 – Data Management

Characteristics of file based approach


The following are the main characteristics of a file processing system (file based approach)
 It is a group of files storing data of an organization.
 Each file is independent from one another.
 Each file is known as a flat file.
 Each file contains and processes information for one specific function, such as
accounting or inventory.
 Files are designed by using programs written in programming languages such as
COBOL, C, and C++.
 The physical implementation and access procedures are written into database application,
therefore, physical changes resulted in intensive rework on the part of the programmer.
 As the system became more complex, file processing system offers little flexibility,
presented many limitations and were difficult to maintain.
Draw backs / Limitations/ disadvantages of file based approach
The following are some of the disadvantages of using the file based approach to manage data.
 Data redundancy: Since each application has its own data file the same data may have
to be recorded and stored in many files e.g. personal file and pay roll file., both contain
data on employee name, designation etc. the results is unnecessary duplication of data
items or redundant data items.
 Data inconsistency: data redundancy leads to data inconsistency especially when data is
to be updated. Data inconsistency occurs due to the same data items that appear in more
than one file, e.g.an employee is promoted from clerk to manager, the same is
immediately updated in the human resource but no in the payroll file. This results in two
different designations of the employee of the employee at the same time. Note: Over a
period of time, such discrepancies’ degrade the quality of information in the data file
hence affects the accuracy of reports.
 Lack of data integration: Since independent data file exists, users face difficulty in
getting information on any query that requires accessing the data in many files. In such a
case complicated programs have to be developed to retrieve data from every file or the
users have to manually collect the required information.

6 Complied by Mr. P.K Munene


Database Management Systems (DBMS) - Lesson 1 – Data Management

 Program dependence: The reports produced by a file processing system are program
dependent, which means if any change in the format or structure of data and records in
the file is to be made, the programs have to be modified correspondingly. Also, a new
program will have to be developed to produce a new report.
 Data dependence: The application/programs in a file processing system are data
dependent i.e. the file organization its physical location and retrieval from the storage
media are dictated by the requirements of the particular application, the file may be
organized o employee records sorted on their last name, which implies that accessibility
of any employee’s record has to be through the last name.
 Limited data sharing: There are limited data sharing possibilities with the traditional file
system. Each application has its own private files and user little choice to share
the data outside their own applications. Complex programs require to be written to obtain
data from several incompatible files.
 Poor data control: In file processing system there is no centralized control of data
element level, hence data field may have multiple names defined by the different
department of an organization and depending on the file it is in. This situation leads to
different meaning of a data field in different context or same meaning for different fields.
This causes poor data control.
 Security problem: It is very difficult to enforce security checks and access rights in a file
processing system, since application programs are added in an unplanned (ad hoc)
manner.
 Data manipulation capability: is in adequate: The data manipulation capability is very
limited in the file based approach since they do not provide strong relationships between
data in different files.
 Needs excessive programming: An excessive programming effort is needed to develop
a new application program and data in a file system. Each new application requires that
the developers start from scratch by designing new file formats, descriptions and then
write the file access logic for each new life.
 Data isolation: Data is scattered in different files and files are different formats hence
making writing a new application program to retrieve data difficult.

7 Complied by Mr. P.K Munene


Database Management Systems (DBMS) - Lesson 1 – Data Management

 Data atomicity: Ensuring that transactions fully happen or not happen at all is difficult,
since the information needed to roll back a transaction may not be readily available in a
file processing system.
The shared file approach
One approach of solving the problems, experienced with each application having its own set of
files is to share files between different applications. This will eliminate the problem of
duplication and inconsistent data between different applications and is as shown in the following
diagram

Marketing Manufacturing Inventory Payroll

Application Application Application Application


program program program program

File 2 File 3
File 1

The introduction of the shared file approach solves the problem of duplication and inconsistent
data access different versions of the same file had by different departments but other problems
may emerge which include the following:
 File incompatibility: A file structure that suits one application may not necessarily suit
another application, e.g. when each department has its own version of a file for
processing, each department could have to ensure that the structure of the file suits the ir
specific application which in turn would not be suitable for another.
 Difficult to control access: Some application may require access to more data than
others, hence some files will still need to contain additional information to support the
application.

8 Complied by Mr. P.K Munene


Database Management Systems (DBMS) - Lesson 1 – Data Management

 Physical data dependence: If the structure of the data need to the changed in the same
way, these alterations will need to be reflected in all application program that use the data
file.
 Difficult to implement concurrency: While a data file is being processed by one
application, the file will not be available for other applications since if more than one
application is allowed to alter data in a file at one time, serious problems can arise in
ensuring that updates made by each application do not crush with one another. File base
approach avoids these problems by not allowing more than one application to access a
file at the same time.
Note: All the limitations of the file based approach can be attribute to two factors:
 The definition of data embedded in the application programs rather than being stored
separately and independently.
 There is no control over the access and manipulation of data beyond that imposed by
application programs.
In order to remove all limitations of the file based approach a new approach is required that must
be more effective. This approach is known as the database approach.
The database approach
In the database approach, data is defined and stored centrally in a database as shown in the
following diagram

Marketing
Application Reports
Program
Data base
management Manufacturing
Marketing data Control Reports
System Application
Manufacturing data Program
Inventory data
Inventory Control Reports
Pay roll data Application
Program

Payroll Reports
Application
Program

9 Complied by Mr. P.K Munene


Database Management Systems (DBMS) - Lesson 1 – Data Management

The database technology eliminates many of the problems experienced in the file based approach
by organizing many application and different groups at the same time. In this approach a pool of
related data is shared by multiple application programs. Rather than having separate files each
application uses a collection of data that is joined or related in the database.
Characteristics of database based approach
The following are the main characteristics of a database approach
 Self-describing nature of a database system: A database contains not only the data, but
also a complete definition of the data structure, data types and data constraints. The
additional information is known as metadata which is stored in a file known as the data
dictionary.
 Data abstraction: In a traditional file processing system, the structure of the data files is
hard coded in the application programs; thus any changes in structure would need the
related application programs to be modified accordingly whereas in a database system,
the application programs are independent of the data stored in the database. The
application programs are only concerned with “what data” is stored in the database and
not concerned with “how the data is stored”. As long as the contents of data remain
unchanged, the database structure can be changed, without affecting the existing
application programs. This features is called data abstraction.
 Support for multiple views of the data: Depending on different needs and different
levels of authorizations, different users will be provided with different perspectives of the
same data known as views. A view refers to a subset of the stored data or set of virtual
data, i.e. data derived from the stored data.
 Multiuser access and concurrency control: A multiuser Database Management System
(DBMS) allows multiple users to access the same database concurrently. This is achieved
by including concurrency control software in the Database Management System
(DBMS), to ensure that database remains consistent, despite access by multiple users
concurrently.
 Effective system protection through grants of access right: Access rights are granted
to the users, to the extent required for their roles in the organization. These rights are
stored in the data dictionary itself. When a query is to be processed, the Database

10 Complied by Mr. P.K Munene


Database Management Systems (DBMS) - Lesson 1 – Data Management

Management System (DBMS) will first ensure that the user submitting the query has
sufficient rights for the processing of that query; only then is the query processed.
 Support for efficient recovery: When a system is restarted after a failure, log based
recovery recovers the database efficiently.
Advantages / Benefits of the database approach
The main advantages of the database approach are defined as follows:
 Data redundancy is minimized: Database system keeps data at one place in the
database. The data is integrated into a single, logical structure. Different applications
refer to the data from the centrally controlled location. The storage of the data,
centrally, minimizes data redundancy.
 Data inconsistency is reduced: Minimizing data redundancy using database system
reduces data inconsistency too. Updating of data value becomes simple and there is
no disagreement in the stored value, e.g. student’s mobile numbers are stored at a
single location and get updated centrally.
 Data is shared: Data sharing means sharing the same data among more than one
user, each user has access to the same data, though they may it for different purposes.
A database is designed to support shared data. Authorized users are permitted to use
the data from the database. Users are provided with views of the data to facilitate its
use, e.g. the student’s mobile numbers stored in the database which is shared by
student profile system and library system.
 Data independence: It is the separation of data description (Meta data) from the
application programs that use the data. In the database approach, data descriptions are
stored in a central location called the data dictionary. This property allows an
organization’s data to change and evolve (within limits) without changing the
application programs that process the data.
 Data integrity is maintained: Stored data is changed, frequently for variety of
reasons such as adding new data items types and changing the data formats. The
integrity and consistency of the database are protected using constraints on values that
data items can have. Data constraints definitions are maintained in the data dictionary
 Data security is improved: The database is a valuable resource that needs
protection. The database is kept secure by limiting access to the database by

11 Complied by Mr. P.K Munene


Database Management Systems (DBMS) - Lesson 1 – Data Management

authorized personnel. Authorized users are generally restricted to the particular data
they can access and whether they can update it or not. Access is often controlled by
passwords.
 Backup and recovery support: Backup and recovery are supported by the software
that logs changes to the database. This support helps in recovering the current state of
the database in case of system failure.
 Standards are enforced: Since data is stored centrally it is easy to enforce standards
on the database. Standards could include the naming conversions and standard for
updating accessing and processing data. Tools are available for developing and
enforcing standards.
 Application development time is reduced: The database approach greatly reduce
the cost and time for developing new business applications. Programmers can focus
on specific functions required for the new applications, without having to worry about
design or low level implementation details as related data have already been designed
and implemented. Tools for generation of forms and reports are also available.
In addition to the advantages highlighted above there are above there are several implications of
using the database approach like provision of multiple user interface , representation of complex
relationships, concurrent data access etc.
Disadvantages / Draws backs/ Limited of the database approach
There are some disadvantages of data systems over the file based system which include:
 Database systems are more vulnerable then file based system because of the centralized
nature of a large integrated database.
 If a failure occurs, the recovery process is complex and sometimes may result in lost
transactions.
Comparison of the file base approach with the database approach.
The following table shows a comparison between the file base approach and the database
approach

12 Complied by Mr. P.K Munene


Database Management Systems (DBMS) - Lesson 1 – Data Management

File based approach (System) Database approach (System)


File oriented systems are small Database oriented system are large
File oriented systems are relatively cheap Database oriented systems are relatively expensive
File oriented systems have a simple structure Database oriented systems have a complex structure.
File oriented systems need very little preliminary Database oriented systems need vast preliminary
design. design
File oriented systems are not secure. Database oriented systems are more secure.
File oriented systems are often single user oriented Database oriented systems are usually multi user
oriented.
File oriented systems have isolated data Database oriented systems have shared data.
File oriented systems have simple, primitive , Database oriented systems have complex and
backup/ recovery mechanism sophisticated back up/recovery mechanisms.
Programs and data in the file system are Programs and data in the database are independent
interdependent
In a file system, the data may be duplicated in In the database, the data is not duplicated and appears
different files that cause data redundancy. only once.
Inconsistencies may arise when the same data is The data appear only once so it is always
stored in different files. consistency.
Data cannot be retrieved easily as it is stored Data can be retrieved easily as it is stored in tables
separately in various files that are linked together.
The data is distributed across many different files The data is stored in one central location and can be
and cannot be easily shared. easily shared with others.
It is difficult to apply data integrity checks on the The database approach provides many constraints for
files. data security.
It provides poor security as the data is widely It provides many procedures to maintain security.
spread.
It provides many procedures to maintain security. It provides many facilities to maintain the programs
easily.
It is a less complex system. It is a very complex system.
The cost is very less than DBMS. The cost is much more than the file system

13 Complied by Mr. P.K Munene


Database Management Systems (DBMS) - Lesson 1 – Data Management

Difference between file base approaches with the database approach


There are the following differences between DBMS and File systems:
Basis File based approach (System) Database approach (System
Meaning The file system is a collection of data. In DBMS is a collection of data. In
this system, the user has to write the DBMS, the user is not required to
procedures for managing the database. write the procedures.
Sharing Data is distributed in many files and it Due to the centralized approach,
may be of different formats so it is not data sharing is easy.
easy to share data
Data abstraction The file system provides the detail of the DBMS gives an abstract view of
data representation and storage of data data that hides the details.
Security and protection It is not easy to protect a file under the DBMS provides a good protection
file system. mechanism
Recovery Mechanism The file system doesn't have a crash DBMS provides a crash recovery
mechanism, i.e., if the system crashes mechanism, i.e., DBMS protects the
while entering some data, then the user from system failure.
content of the file will be lost.
Manipulation technics The file system cannot efficiently store DBMS contains a wide variety of
and retrieve the data sophisticated techniques to store and
retrieve the data.
Concurrency Problems In the file system, concurrent access has DBMS takes care of Concurrent
many problems like redirecting the file access of data using some form of
while deleting some information or locking.
updating some information
Where to use Files system approach used in large Database approach used in large
systems which interrelate many files systems which interrelate many files
Cost The file system approach is cheaper to The database system is expensive to
design. design.
Data redundancy and Files and application programs are Due to the centralization of the
inconsistency created by different programmers so that database, the problems of data
there exists a lot of duplication of data redundancy and inconsistency are
which may lead to inconsistency controlled.
Structure The file system approach has a simple The database structure is complex to
structure. design.
Data independence In the file system approach, there exists In this system, Data independence
no data independence. exists and it can be of two types
namely:
 Logical data independence
 Physical data independence

14 Complied by Mr. P.K Munene


Database Management Systems (DBMS) - Lesson 1 – Data Management

Basis File based approach (System) Database approach (System)


Integrity Constraints Integrity Constraints are difficult to Integrity Constraints are easy to
implement in file system. apply.
Data models In the file system approach, there is no In the database approach three types
concept of data models exists of data models exist:
 Hierarchal data models.
 Network data models.
 Relational data models.
Flexibility The flexibility of the system is less as Changes are often a necessity to the
compared to the DBMS approach. content of the data stored in any
system, and these changes are more
easily with a database approach.
Examples Cobol, C++ etc. Oracle, SQL Server, Sybase etc.
Note:
 The database approach is an improvement on the shared file solution as the user of a
database and a Database Base Management System (DBMS) is provided with facilities
for querying, data security, and data integrity and is allowed simultaneous access to data.
 The main difference between a file-based approach and a database approach is: A
file system is used to store information about files and folders on a hard disk while a
DBMS is used to store information related to database.
 Why is the database approach better than the file-based approach? The benefits of
using a database over a traditional file processing system are that administrators have
more control over who can access the data and that data is only stored once, reducing
redundancy.
References
 Dr. S.B Gupta, A. Mittal (2009) Introduction to database management systems. Laxmi
publications PVT limited.
 C.Rajiv (2016) Database Management System (DBMS). A practical Approach 5th
Edition. S.Chand Publishing.
 R.P Mahapata, Govind Verma (2013) Database Management System. Khama Publishing
House.
 T.Connolly, C.Begg (2005) Database Management System. A practical approach to
design, implementation and management. Pearson Education Limited
 https://www.studytonight.com/dbms/database-model.php
 https://www.javatpoint.com/dbms-architecture

15 Complied by Mr. P.K Munene

You might also like