2022 - CMP 262 - File Organisation - Slides

Uploaded by

ayomidetolani07

CMP 262: INTRODUCTION TO FILE PROCESSING

PART II

Federal University Dutsin-Ma 2020/2021 Academic Session

BASIC CONCEPTS

• Field – The basic element of data, which contains a single value
• Record – A collection of related fields that can be treated as a unit
• File – A collection of related records
• File organisation – How records are arranged and mapped onto physical storage
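
As a rough illustration of these terms, the sketch below models a field as a single value, a record as a group of related fields, and a file as a list of records. The `StudentRecord` type, its field names and the sample values are hypothetical and are not part of the course material.

```python
from dataclasses import dataclass


@dataclass
class StudentRecord:
    # Each attribute is one field holding a single value (hypothetical names).
    matric_no: int
    name: str
    level: int


# A file is a collection of related records.
student_file = [
    StudentRecord(1001, "Ada", 200),
    StudentRecord(1002, "Bello", 200),
]

first = student_file[0]
print(first.matric_no, first.name, first.level)
```

How these records are laid out on a storage medium is what the file organisation methods that follow describe.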

FILE ORGANISATION METHODS

• Pile (Serial) – Records stored in the order in which they arrive
• Sequential File Organisation – Records stored in key sequence
• Indexed Sequential File Organisation – Adds an index to the sequential file method
• Direct (Hashed) File – Uses hashing on the key value

SERIAL FILE ORGANISATION

• Records are stored in the order in which they arrive, i.e. chronologically. New records are therefore appended to the end of the file.
• This organisation method is mostly used on magnetic tapes.
• Has a high hit rate, i.e. a large number of records are accessed at a time.
• Files can only be accessed serially, from head to tail.
• Records may have different fields in different orders. Each field is therefore self-describing, including a field name and value.
• Primarily used for transaction files, e.g. billing systems, points of sale.

Existing file (beginning → end):  R1  R3  ………….  R7  R8
New record:                       R2
Updated file (beginning → end):   R1  R3  ………….  R7  R8  R2
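
The append-only behaviour and the self-describing fields can be sketched as follows. This is a minimal illustration, not a prescribed format: the `name=value;name=value` line layout and the helper names `append_record` and `read_serially` are assumptions, and an in-memory `io.StringIO` stands in for the storage medium. Values containing `;` or `=` would need escaping, which is not handled here.

```python
import io


def append_record(f, record):
    """Append one record at the end of the file as name=value pairs."""
    f.seek(0, io.SEEK_END)
    line = ";".join(f"{name}={value}" for name, value in record.items())
    f.write(line + "\n")


def read_serially(f):
    """Read every record from head to tail, in arrival order."""
    f.seek(0)
    records = []
    for line in f:
        pairs = (item.split("=", 1) for item in line.rstrip("\n").split(";"))
        records.append({name: value for name, value in pairs})
    return records


# An in-memory text file stands in for the storage medium.
serial_file = io.StringIO()
append_record(serial_file, {"id": "R1", "amount": "250"})
append_record(serial_file, {"item": "pen", "id": "R3"})   # different fields, different order
append_record(serial_file, {"id": "R2", "amount": "90"})  # late arrival goes at the end

arrival_order = [r["id"] for r in read_serially(serial_file)]
print(arrival_order)
```

Because each field carries its own name, the second record can hold different fields in a different order and still be read back correctly.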

ADVANTAGES & DISADVANTAGES OF SERIAL FILE ORGANISATION

Advantages
• It is a simple method
• It is cheap
• Makes optimum use of storage media

Disadvantages
• Access to records is cumbersome
• A lot of time is spent retrieving records

SEQUENTIAL FILE ORGANISATION

• This is the most common method of file organisation, with a fixed format for all records.
• Records are of the same length, consisting of fixed-length fields in a particular order.
• Only the values need to be stored, since the field names and lengths are attributes of the file structure.
• Usually, the first field is referred to as the key field, since it uniquely identifies the record.
• Records are stored in key sequence: alphabetical for text keys and numerical for numeric keys.
• New records are initially added to the end of the file, and the file is then sorted into key sequence.
• Best used for master files and batch processing applications, e.g. payroll systems.

Existing file (beginning → end):  R1  R3  ………….  R7  R8
New record:                       R2
Sorted file (beginning → end):    R1  R2  R3  ………….  R7  R8
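
The "append, then sort" step shown in the diagram can be sketched as below. The record layout `(key, name)` and the function name `add_record` are assumptions made for illustration; a real sequential file would be sorted on the storage medium rather than in a Python list.

```python
def add_record(records, new_record):
    """Append the new record, then re-sort the file on the key field."""
    records.append(new_record)
    records.sort(key=lambda rec: rec[0])  # field 0 is the key field
    return records


# Fixed-format records: (key, name), with the key field first.
sequential_file = [(1, "Ada"), (3, "Chidi"), (7, "Musa"), (8, "Zainab")]
add_record(sequential_file, (2, "Bello"))

keys = [rec[0] for rec in sequential_file]
print(keys)
```

Re-sorting after every insertion is simple but costly for large files, which is one reason this organisation suits batch processing, where many additions are sorted in together.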

ADVANTAGES & DISADVANTAGES OF SEQUENTIAL FILE ORGANISATION

Advantages
• The sorted nature makes it easy to access records
• It is easy to maintain and understand

Disadvantages
• Does not support modern technologies that require fast access to stored records
• It is not always easy to enforce a fixed length for records

INDEXED SEQUENTIAL FILE ORGANISATION

• Similar to sequential file organisation, in that records are ordered by a key.
• For each primary key, an index value is generated and mapped to the record.
• The index is the address of the record in the file.

Two types of indexes:
• Exhaustive index – contains one entry for every record in the main file. The index itself is organised as a sequential file for ease of searching.
• Partial index – contains entries only for records in which the field of interest exists.

Data Records (record, index)     Data Block in memory
R1  0XFG122                      0XAD132
R2  0XBF124                      0XJD552
R3  0XAD132                      0XBF124
R4  0XAZ137                      0XFG122
…                                …
R9  0XJD552                      0XAZ137
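
An exhaustive index of this kind can be sketched as a mapping from each key to the address of its record. In the example below the "address" is a byte offset into an in-memory file, and the names `index`, `data_file` and `fetch`, together with the comma-separated record layout, are assumptions made for illustration.

```python
import io

data_file = io.BytesIO()
index = {}  # exhaustive index: one entry (key -> byte offset) per record

# Records are written in key sequence, as in a sequential file.
for key, name in [(1, "Ada"), (2, "Bello"), (3, "Chidi")]:
    index[key] = data_file.tell()  # address of the record in the file
    data_file.write(f"{key},{name}\n".encode("utf-8"))


def fetch(key):
    """Look the key up in the index, then jump straight to the record."""
    data_file.seek(index[key])
    return data_file.readline().decode("utf-8").rstrip("\n")


found = fetch(3)
print(found)
```

The index costs extra space (one entry per record) but removes the need to scan earlier records, which matches the trade-off listed under the advantages and disadvantages.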

ADVANTAGES & DISADVANTAGES OF INDEXED SEQUENTIAL FILE ORGANISATION

Advantages
• Gives many different options for access
• Indexes provide a very fast method of access
• Records cannot be duplicated

Disadvantages
• Could be expensive
• Increased storage overhead, as the index requires disk space

HASH (DIRECT) FILE ORGANISATION

• Records are stored randomly in any available position in a file.
• There is a pre-defined relationship between the key field of a record and its location within the file.
• A hash function is applied to the key field of a record to determine the position of the disk block where the record will be stored.
• Best used for applications where rapid file access is a priority, e.g. reservation and ticketing systems, e-commerce.

Figure (two panels): records mapped by a hash function to data blocks in memory.

Panel 1
Data Records    Data Block in memory
R1              0XAD132
R4              0XJD552
R6              0XBF124
R5              0XFG122
…               …
R3              0XAZ137

Panel 2
Data Records:           R1, R4, R6, R5, …, R3
Data Blocks in memory:  0XAD132, 0XJD552, 0XHK324, 0XBF124, 0XFG122, …
New Record:             R8  0XAZ137
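
The mapping from key field to disk block can be sketched with a simple hash function. The choice of `key % NUM_BLOCKS` as the hash, the value of `NUM_BLOCKS`, and the handling of collisions by keeping several records in one block are all assumptions for illustration; the slides do not specify any of them.

```python
NUM_BLOCKS = 7  # assumed number of disk blocks


def block_for(key):
    """Hash function: maps a numeric key to a block number."""
    return key % NUM_BLOCKS


blocks = [[] for _ in range(NUM_BLOCKS)]


def store(key, value):
    """Place the record in the block chosen by the hash function."""
    blocks[block_for(key)].append((key, value))


def find(key):
    """Examine only the one block the key hashes to."""
    for stored_key, value in blocks[block_for(key)]:
        if stored_key == key:
            return value
    return None


store(15, "R1")
store(22, "R4")  # 22 % 7 == 1, the same block as 15: a collision
store(40, "R6")

print(find(22), find(99))
```

Keys 15 and 22 land in the same block here. If a scheme simply overwrote the block on such a collision, the earlier record would be lost, so the choice of hash field and function matters.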

ADVANTAGES & DISADVANTAGES OF HASH (DIRECT) FILE ORGANISATION

Advantages
• Does not require records to be sorted
• Fast access to desired records
• Multiple records can be accessed at the same time, as each record is independent of the others.

Disadvantages
• It is expensive
• Search can only be performed on the field used for the hash function
• If hash fields are not selected properly, it can lead to data loss.

FILE ACCESS

• While some systems provide only one method of file access, other systems support many access methods. Choosing the right one for an application is very important.
• Methods of file access include:
  ◦ Sequential access
  ◦ Direct access
  ◦ Indexed-sequential access

SEQUENTIAL ACCESS

• It is the simplest access method.
• Records are searched one after the other from the start of the file until the desired record is found.
• In a serial file, the search for a record continues until the record is found or, if it is not found, until the end of the file.
• In a sequential file, the search for a record continues until the record is found or until the key value of the record being checked is greater than the key of the record being searched for.
• Best used when all records in a file are to be processed.
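
The two stopping rules described above can be compared side by side. This is a sketch under the assumption that each record is a `(key, value)` tuple held in a list; the function names `search_serial` and `search_sequential` are illustrative.

```python
def search_serial(records, key):
    """Serial file: scan until found or until the end of the file."""
    for position, rec in enumerate(records):
        if rec[0] == key:
            return position
    return None


def search_sequential(records, key):
    """Sequential (key-ordered) file: stop early once keys pass the target."""
    for position, rec in enumerate(records):
        if rec[0] == key:
            return position
        if rec[0] > key:
            return None  # the record cannot appear later in a sorted file
    return None


serial = [(1, "a"), (3, "b"), (7, "c"), (8, "d"), (2, "e")]
ordered = sorted(serial)

print(search_serial(serial, 2))       # found at the last position
print(search_sequential(ordered, 5))  # gives up on reaching key 7
```

For a missing key, the serial search always reads the whole file, while the sequential search can stop as soon as it meets a larger key.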

DIRECT ACCESS

• Records can be found without other records being physically read.
• Can be used with both sequential and direct files.
• Best used where individual records are to be processed one at a time.
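
With fixed-length records, direct access reduces to computing an offset and seeking to it. The sketch below assumes a record layout of a 4-byte integer key followed by a 16-byte name (`"<i16s"`), uses an in-memory `io.BytesIO` in place of a disk file, and the name `read_record` is hypothetical.

```python
import io
import struct

RECORD = struct.Struct("<i16s")  # assumed layout: 4-byte key + 16-byte name

data_file = io.BytesIO()
for key, name in [(1, b"Ada"), (2, b"Bello"), (3, b"Chidi")]:
    data_file.write(RECORD.pack(key, name))  # short names are padded with zero bytes


def read_record(n):
    """Jump straight to record n (0-based) without reading earlier records."""
    data_file.seek(n * RECORD.size)
    key, name = RECORD.unpack(data_file.read(RECORD.size))
    return key, name.rstrip(b"\x00").decode("ascii")


third = read_record(2)
print(third)
```

The offset calculation `n * RECORD.size` only works because every record has the same length, which is why direct access pairs naturally with the fixed-format records of a sequential file.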

FACTORS INFLUENCING CHOICE OF FILE ORGANISATION METHOD

• Frequency of update – A file that needs to be updated frequently needs an organisation method that allows fast and easy retrieval.
• Cost – A cost-benefit analysis should be conducted, as different methods have different costs.
• Storage media – Different organisation methods use different storage media.
• Area of application – Some organisation methods may not be suitable for certain types of applications.
• Expected file size and anticipated growth pattern – If a file is large and expected to grow quickly, random organisation may be preferable.

PHYSICAL VS LOGICAL FILES

• Physical files contain the actual data on a storage medium. They also contain a description of how data is to be presented to or received from a program.
• Logical files contain descriptions of records that are found in one or more physical files. A logical file is just a view or representation of physical files and does not contain data itself.

LOGICAL FILE VS PHYSICAL FILE

Logical File | Physical File
It does not contain data and therefore does not occupy memory space. | It contains actual data and therefore occupies a portion of memory.
It can contain up to 32 record formats. | It contains one record format.
It cannot exist without a physical file. | It can exist without a logical file.
It can be deleted without deleting its associated physical file. | It cannot be deleted until its associated logical file, if one exists, is deleted.
It can represent one or more physical files. | It represents actual data saved on a system.
It contains descriptions of records in the physical files it represents. | It describes how data is to be displayed to or retrieved from a program.

PHYSICAL STORAGE

• There are different types of storage devices which can be used to store files. These include:
  ◦ Primary storage devices, e.g. RAM (SRAM, DRAM, SDRAM), ROM (PROM, EPROM)
  ◦ Magnetic storage devices, e.g. floppy disk, hard disk
  ◦ Flash memory devices, e.g. pen drive, SSD, SD card, Multimedia card
  ◦ Optical storage devices, e.g. CD (CD-R, CD-RW), DVD (DVD-R, DVD-RW)
  ◦ Cloud storage, e.g. Amazon Web Services, Google Drive, OneDrive

DATA STORAGE UNITS ON THE COMPUTER

TERM | DESCRIPTION
Bit | The smallest unit of data; either 1 or 0
Nibble | 4 bits
Byte (B) | 8 bits
Kilobyte (KB) | 2^10 bytes = 1,024 bytes
Megabyte (MB) | 2^20 bytes = 1,024 kilobytes
Gigabyte (GB) | 2^30 bytes = 1,024 megabytes
Terabyte (TB) | 2^40 bytes = 1,024 gigabytes
Petabyte (PB) | 2^50 bytes = 1,024 terabytes
Exabyte (EB) | 2^60 bytes = 1,024 petabytes
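
The powers of two in the table can be checked with a few lines of arithmetic. The helper names `bytes_in` and `humanise` are illustrative, and the sketch follows the table's convention of 1,024 (not 1,000) as the step between units.

```python
UNITS = ["B", "KB", "MB", "GB", "TB", "PB", "EB"]


def bytes_in(unit):
    """Number of bytes in one unit, using powers of 1,024 as in the table."""
    return 1024 ** UNITS.index(unit)


def humanise(n_bytes):
    """Express a byte count in the largest unit that keeps the value >= 1."""
    value = float(n_bytes)
    for unit in UNITS:
        if value < 1024 or unit == UNITS[-1]:
            return f"{value:g} {unit}"
        value /= 1024


print(bytes_in("KB"), bytes_in("MB"))
print(humanise(1536))
```

Each step up the table multiplies by 2^10, so a gigabyte is 2^10 × 2^10 × 2^10 = 2^30 bytes.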
