Week 14 Persistent Data Storage
Week 14 Persistent Data Storage
File organization refers to the way data is stored in a file. File organization is very
important because it determines the method of access, efficiency, flexibility and storage
devices to be used.
There are four methods of organizing files on a storage media namely:
1) serial
2) Sequential
3) indexed- sequential
4) random
Factors influencing file design (factors to consider when selecting a file organization
method)
There are several methods of file organization and each one is suited for a particular
task or purpose. Here are the factors to consider before choosing a file organization
method;
i. Frequency of update/ Volatility: This refers to the frequency of adding or deleting
records from a file. A file that needs to be updated every now and then needs an
organization method that will allow easy retrieval of information and ease of
updating, example of such a file is the transaction file. Highly volatile files will
require random organization while low volatile files will require serial or
sequential files.
ii. File activity: This refers to the frequency of using a file. High activity files will
require serial or sequential while low activity files will require random
organization. Different files have different activities, example a sort file is used to
sort data in sequential order and therefore sequential method would be
appropriate for such a file.
iii. Cost: It is essential that a cost benefit analysis be conducted because different file
will require different cost.
iv. Storage media: Different files design use different storage media. E.g. serial and
sequential files use magnetic tapes while index-sequential and random use
magnetic disc.
v. File access method: Definitely different files have different methods of being
accessed, example a reference file is accessed using random method for easy
retrieval of data.
vi. Nature of the system: Files that are used in a particular system will depend on the
nature of the system i.e. the suitable organization method for that particular
system.
vii. Area of application: different file designs are applicable in different areas. E.g.
serial and sequential files are applicable in batch processing while index-sequential
is applicable in both batch and online processing and random files in real time
processing.
viii. Master file medium: The master file is the main file for keeping permanent
updates of records from transaction files and other sources, the medium by which
it is updated will determine the organization method to be used.
ix. Response time: This refers to the speed of access and when a fast response is
required random and index-sequential files are ideal.
x. Expected file size and anticipated growth pattern: If a file is large and the
anticipated growth rate is high, then random organization is preferred, otherwise
serial and sequential files are ideal.