Topic 1. Limitations of File Processing Systems and DBMS
Topic 1. Limitations of File Processing Systems and DBMS
I N F O R M A T I O N M A N A G E M E N T
INTRODUCTION
In daily life, we come across various needs to store
data. It can be maintaining daily household bills, bank
account details, salary details, payment details,
student information, student reports, books in the
library, etc. How it will be recorded in one place so
that we can get it back when required? It should be
recorded in such a way that
INTRODUCTION
1. Should be able to get the data at any point in time
latter
2. Should be able to add details to it whenever required
3. Should be able to modify stored information, as
needed
4. Should also be able to delete them
INTRODUCTION
In the traditional approach, before to computer, all pieces of
information were stored in papers. When we need information, we used
to search through the papers. If we know a particular date or category
of information we are searching for, we go to that particular session in
the papers. When we want to update or delete some data, we search
for it and modify them or strike off them. If the data is limited, then all
these tasks are easy. Imagine library information or information about
a student in school, or a banking system! How do we search for single
required data in papers? It is a never-ending task! Yes, Computers
solved our problems.
File Processing System
When computers came, all these jobs become easy. But initial
days, these records were stored in the form of files. The way we
stored in files is similar to papers, in the form of flat files – to be
simpler, in notepad. Yes, the pieces of information where all in
the notepads with each field of information separated by
space, tab comma, semicolon, or any other symbol.
File Processing System
All the files were grouped based on their categories; the file used
to have only related information and each file is named properly.
As we can see in the above sample file has Student information.
Student files for each class were bundled inside different folders to
identify it quickly.
File Processing System
All the files were grouped based on their categories; the file
used to have only related information and each file is named
properly. As we can see in the above sample file has Student
information. Student files for each class were bundled inside
different folders to identify it quickly.
File Processing System
Now how will we open a file, read or update it? There are
different programs like C, C++, COBOL, etc which help to do
this task. Using these programming languages, we can search for
files, open them, search for the data inside them, and go to a
specific line in the file, add/update/delete specific information.
Disadvantages of file processing
The file processing system is good when there is only a
limited number of files and data in are very less. As the
data and files in the system grow, handling them
becomes difficult. There are following disadvantages of
file system.
DISADVANTAGES OF
FILE SYSTEM
Data Mapping and Access Data Isolation Concurrent Access
Although all the related information is grouped and stored in different files, there
is no mapping between any two files. i.e.; any two dependent files are not linked.
Even though Student files and Student_Report files are related, they are two
different files and they are not linked by any means. Hence if we need to display
student details along with his report, we cannot directly pick from those two files.
We have to write a lengthy program to search the Student file first, get all details,
then go Student_Report file and search for his report. When there is a very huge
amount of data, it is always a time-consuming task to search for particular
information from the file system. It is always an inefficient method to search for
the data.
DISADVANTAGES OF FILE SYSTEM
Data Redundancy
There are no methods to validate the insertion of duplicate data in the file
system. Any user can enter any data. The file system does not validate for the
kind of data being entered nor does it validate for the previous existence of the
same data in the same file. Duplicate data in the system is not appreciated as it is
a waste of space and always leads to confusion and mishandling of data. When
there are duplicate data in the file, and if we need to update or delete the record,
we might end up in updating/deleting one of the records, leaving the other
record in the file. Again the file system does not validate this process.
DISADVANTAGES OF FILE SYSTEM
Data Dependence
In the files, data are stored in a specific format, say tab, comma, or semicolon. If
the format of any of the files is changed, then the program for processing this file
needs to be changed. But there would be many programs dependent on this file.
We need to know in advance all the programs which are using this file and
change in the entire place. Missing to change in any one place will fail the whole
application. Similarly, changes in the storage structure, or accessing the data,
affect all the places where this file is being used. We have to change its entire
programs. That is the smallest change in the file affects all the programs and
needs changes in all of them.
DISADVANTAGES OF FILE SYSTEM
Data inconsistency
Imagine Student and Student_Report files have student’s address in it, and there
was a change request for one particular student’s address. The program
searched only the Student file for the address and it updated it correctly. There is
another program that prints the student’s report and mails it to the address
mentioned in the Student_Report file. What happens to the report of a student
whose address is being changed? There is a mismatch in the actual address and
his report is sent to his old address. This mismatch in different copies of the
same data is called data inconsistency. This has occurred here because there is
no proper listing of files which has the same copies of data.
DISADVANTAGES OF FILE SYSTEM
Data Isolation
Each file can be password protected. But what if you have to give access to only a
few records in the file? For example, the user has to be given access to view only
their bank account information in the file. This is very difficult in the file system.
DISADVANTAGES OF FILE SYSTEM
Integrity
If we need to check for certain insertion criteria while entering the data into the
file it is not possible directly. We can do it by writing programs. Say, if we have to
restrict the students above age 18, then it is by means of the program alone.
There is no direct checking facility in the file system. Hence these kinds of
integrity checks are not easy in file systems.
DISADVANTAGES OF FILE SYSTEM
Atomicity
If there is any failure to insert, update, or delete in the file system, there is no
mechanism to switch back to the previous state. Imagine marks for one
particular subject need to be entered into the Report file and then total needs to
be calculated. But after entering the new marks, the file is closed without saving.
That means whole of the required transaction is not performed. Only the totaling
of marks has been done, but the addition of marks not being done. The total
mark calculated is wrong in this case. Atomicity refers to the completion of the
whole transaction or not completing it at all. Partial completion of any
transaction leads to incorrect data in the system.
DISADVANTAGES OF FILE SYSTEM
Concurrent Access
Accessing the same data from the same file is called concurrent access. In the file
system, concurrent access leads to incorrect data. For example, a student wants
to borrow a book from the library. He searches for the book in the library file and
sees that only one copy is available. At the same time, another student also
wants to borrow the same book and checks that one copy available. The first
student opt for borrow and gets the book. But it is still not updated to zero-copy
in the file and the second student also opt for borrow! But there are no books
available. This is the problem of concurrent access to the file system.
Properties of data
I N F O R M A T I O N M A N A G E M E N T
PROPERTIES OF A DATA
DATABASE APPROACH
I N F O R M A T I O N M A N A G E M E N T
DATABASE APPROACH
An information system that uses a Database Management System (DBMS) to
manage its information has a particular structure, comprising three components:
Data, DBMS, and Application software. This structure as described below is
referred to as the database approach to information system development.
DATABASE APPROACH
The Database is a shared collection of logically related data, designed to meet the
information needs of an organization. A database is a computer based record
keeping system whose over all purpose is to record and maintains information.
The database is a single, large repository of data, which can be used
simultaneously by many departments and users. Instead of disconnected files
with redundant data, all data items are integrated with a minimum amount of
duplication.
DATABASE APPROACH
The database is no longer owned by one department but is a shared corporate
resource. The database holds not only the organization’s operational data but
also a description of this data. For this reason, a database is also defined as a
self-describing collection of integrated records. The description of the data is
known as the Data Dictionary or Meta Data (the ‘data about data’). It is the self-
describing nature of a database that provides program-data independence.
DATABASE APPROACH
A database implies the separation of physical storage from the use of the data by
an application program to achieve program/data independence. Using a
database system, the user or programmer, or application specialist need not
know the details of how the data are stored and such details are “transparent to
the user”. Changes (or updating) can be made to data without affecting other
components of the system. These changes include, for example, a change in data
format or file structure or relocation from one device to another.
In the DBMS approach, an application program written in some programming
language like Java, Visual Basic.Net, and Developer 2000, etc. uses database
connectivity to access the database stored in the disk with the help of the
operating system’s file management system.
DATABASE APPROACH
The central component of the database approach is the DBMS. This software is
also referred to as the “database engine” or the “back end.” With regard to the
data it manages, it has several responsibilities including the following:
DATABASE MODELS 2. RELATIONSHIP - how pieces of data connect or relate to each other
3. INTERACTIONS - how you can retrieve, add, update, or remove.
Entity-Relationship
Object Oriented
Semantic
Functional
CATEGORIES OF DATABASE MODELS
OBJECT-BASED DATA MODELS
Hierarchical Model
Network Model
Relational Model
The relational model has gained favor over the other two in recent
years. The network and hierarchical models are still used in a large
number of older databases.
ROLES OF A DATABASE ADMINISTRATOR
I N F O R M A T I O N M A N A G E M E N T
ROLES OF A DATABASE ADMINISRATOR
Data is the lifeblood of any organization, and the management of data in
IT systems remains a critical exercise, particularly at a time when data
privacy regulation is a hot topic.
The role of the Database Administrator (DBA) has likely evolved over
time, given the evolution of data sources, types, and storage options.
ROLES OF A DATABASE ADMINISRATOR
What is a DBA?
The DBA must become conversant with data protection regulation such
as GDPR, and how to implement the relevant security controls to ensure
user/customer privacy rights are respected in business operations.
GDPR - General Data Protection Regulation
-EOF