File Management

Uploaded by

akshayapamul7


File
“A file is a named collection of related information that is recorded on secondary
storage such as magnetic disks, magnetic tapes and optical disks.”

“In general, a file is a sequence of bits, bytes, lines or records whose meaning is
defined by the file's creator and user.”

File Attributes

A file has certain other attributes, which vary from one operating system to
another, but typically consist of these:

Name:

• The file name is the only information kept in human-readable form.

Identifier:

• This unique tag, usually a number, identifies the file within the file system.

Type:

• This information is needed for those systems that support different file types.

Location:

• This information is a pointer to a device and to the location of the file on that
device.

Size:

• The current size of the file (in bytes, words, or blocks), and possibly the
maximum allowed size, are included in this attribute.

Protection:

• Access-control information determines who can read, write, execute, and so on.

Time, date, and user identification:

• This information may be kept for creation, last modification, and last use.
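
Most of these attributes can be inspected from a program. The following is a minimal Python sketch, assuming a system where os.stat reports these fields; the helper name describe and the scratch file notes.txt are made up for illustration.

```python
import os
import stat
import tempfile
import time

def describe(path):
    """Return a dict of common attributes for the file at `path`."""
    info = os.stat(path)
    return {
        "name": os.path.basename(path),
        "identifier": info.st_ino,                  # unique tag (inode number)
        "type": "directory" if stat.S_ISDIR(info.st_mode) else "regular file",
        "size_bytes": info.st_size,
        "protection": stat.filemode(info.st_mode),  # e.g. -rw-r--r--
        "owner_uid": info.st_uid,
        "last_modified": time.ctime(info.st_mtime),
        "last_accessed": time.ctime(info.st_atime),
    }

# Demonstration on a scratch file (hypothetical name).
with tempfile.TemporaryDirectory() as scratch:
    demo_path = os.path.join(scratch, "notes.txt")
    with open(demo_path, "w") as f:
        f.write("hello")
    attributes = describe(demo_path)
    for key, value in attributes.items():
        print(f"{key}: {value}")
```

The location attribute (device and block addresses) is kept inside the file system and is not normally exposed to user programs, so the sketch leaves it out.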

Operations on file

Six basic file operations: The OS can provide system calls to create, write, read,
reposition, delete, and truncate files.

Creating a file:
Two steps are necessary to create a file.
• Space in the file system must be found for the file.
• An entry for the new file must be made in the directory.

Writing a file:
• To write a file, we make a system call specifying both the name of the file
and the information to be written to the file.
• The system must keep a write pointer to the location in the file where the
next write is to take place.
• The write pointer must be updated whenever a write occurs.

Reading a file:
• To read from a file, we use a system call that specifies the name of the file
and where (in memory) the next block of the file should be put.
• The system needs to keep a read pointer to the location in the file where the
next read is to take place.
• The read pointer is updated once the read has taken place.

Repositioning within a file:
• The directory is searched for the appropriate entry, and the
current-file-position pointer is repositioned to a given value.
• Repositioning within a file need not involve any actual I/O. This file
operation is also known as a file seek.

Deleting a file:
• To delete a file, we search the directory for the named file.
• Having found the associated directory entry, we release all file space, so
that it can be reused by other files, and erase the directory entry.

Truncating a file:
• The user may want to erase the contents of a file but keep its attributes.
• Rather than forcing the user to delete the file and then recreate it, this
function allows all attributes to remain unchanged (except for file length)
but lets the file be reset to length zero and its file space released.
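
The six operations map closely onto the low-level calls in Python's os module. The sketch below is only an illustration: the file name demo.dat is made up, and a single file position stands in for the separate read and write pointers described above, which is how most current systems behave.

```python
import os
import tempfile

scratch = tempfile.mkdtemp()
path = os.path.join(scratch, "demo.dat")   # hypothetical file name

# Create: the OS finds space and adds a directory entry.
fd = os.open(path, os.O_CREAT | os.O_RDWR, 0o600)

# Write: the file position advances by the number of bytes written.
os.write(fd, b"ABCDEFGHIJ")
position_after_write = os.lseek(fd, 0, os.SEEK_CUR)

# Reposition (seek): no data is transferred, only the pointer moves.
os.lseek(fd, 3, os.SEEK_SET)

# Read: returns the next bytes and advances the pointer.
chunk = os.read(fd, 4)
position_after_read = os.lseek(fd, 0, os.SEEK_CUR)

# Truncate: the length becomes zero, the other attributes are kept.
os.ftruncate(fd, 0)
size_after_truncate = os.fstat(fd).st_size
os.close(fd)

# Delete: the directory entry is erased and the space is released.
os.remove(path)
exists_after_delete = os.path.exists(path)
os.rmdir(scratch)

print(position_after_write, chunk, position_after_read,
      size_after_truncate, exists_after_delete)
```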

File Type
File type refers to the ability of the operating system to distinguish different
types of files, such as text files, source files and binary files. Many operating
systems support many types of files. Operating systems like MS-DOS and UNIX have
the following types of files:

Ordinary files
• These are the files that contain user information.
• These may contain text, databases or executable programs.
• The user can apply various operations to such files, such as adding to them,
modifying them, or removing the entire file.

Directory files
• These files contain a list of file names and other information related to
these files.

Special files
• These files are also known as device files.
• These files represent physical devices such as disks, terminals, printers,
networks, tape drives etc.

There are two types of special files:
• Character special files - data is handled character by character, as in the
case of terminals or printers.
• Block special files - data is handled in blocks, as in the case of disks
and tapes.
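
On UNIX-like systems the file type is recorded in the mode field returned by stat, so a program can tell these kinds of file apart. A small Python sketch follows; the function name classify and the scratch file are illustrative assumptions.

```python
import os
import stat
import tempfile

def classify(path):
    """Name the kind of file at `path` using the mode bits from os.stat."""
    mode = os.stat(path).st_mode
    if stat.S_ISREG(mode):
        return "ordinary file"
    if stat.S_ISDIR(mode):
        return "directory file"
    if stat.S_ISCHR(mode):
        return "character special file"
    if stat.S_ISBLK(mode):
        return "block special file"
    return "other"

with tempfile.TemporaryDirectory() as scratch:
    file_path = os.path.join(scratch, "data.txt")
    with open(file_path, "w") as f:
        f.write("user information")
    kind_of_file = classify(file_path)
    kind_of_dir = classify(scratch)

print(kind_of_file, "/", kind_of_dir)
```

On a typical Linux system, calling classify on a terminal device under /dev should report a character special file, and a disk device a block special file, though the device names differ from system to system.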

File Structure
A file structure is a required format that the operating system can understand.

• A file has a certain defined structure according to its type.

  • A text file is a sequence of characters organized into lines.
  • A source file is a sequence of procedures and functions.
  • An object file is a sequence of bytes organized into blocks that are
  understandable by the machine.

• When an operating system defines different file structures, it must also
contain the code to support them. UNIX and MS-DOS support a minimal number of
file structures.

File Protection

➢ File Naming

A file can be protected by its name: a user who does not know the name of a
file cannot easily access it, so only the owner and the users the owner has
told can reach the file.

➢ Access control
o Access to a file is controlled per user and per operation.
o The usual operations are read, write and execute.
o Only the owner decides which permissions other users receive.

➢ Passwords

Passwords are another means of protecting files. A file is hard to access
without knowing its password.
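
The read, write and execute permissions are often stored as one bit each for the owner, the owner's group and all other users. The sketch below assumes this UNIX-style scheme, which the notes do not spell out; the table PERMISSION_BITS and the function is_allowed are made up for illustration.

```python
import stat

# Permission bit for each (class of user, operation) pair.
PERMISSION_BITS = {
    ("owner", "read"): stat.S_IRUSR,
    ("owner", "write"): stat.S_IWUSR,
    ("owner", "execute"): stat.S_IXUSR,
    ("group", "read"): stat.S_IRGRP,
    ("group", "write"): stat.S_IWGRP,
    ("group", "execute"): stat.S_IXGRP,
    ("other", "read"): stat.S_IROTH,
    ("other", "write"): stat.S_IWOTH,
    ("other", "execute"): stat.S_IXOTH,
}

def is_allowed(mode, who, operation):
    """Return True if the permission bit for (who, operation) is set."""
    return bool(mode & PERMISSION_BITS[(who, operation)])

# Owner may read and write, group may read, others get nothing.
mode = 0o640
listing = stat.filemode(stat.S_IFREG | mode)

print(listing)
print(is_allowed(mode, "owner", "write"))
print(is_allowed(mode, "group", "write"))
```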

File Access Mechanisms

File access mechanism refers to the manner in which the records of a file may be
accessed. There are several ways to access files:

➢ Sequential access
➢ Direct/Random access

➢ Sequential access

• Sequential access is based on a tape model of a file.

• Information in the file is processed in order, one record after the other.

• This is the most common mode of file access.

• A read operation reads the next portion of the file and automatically
advances the file pointer.

• Similarly, a write appends to the end of the file and the file pointer advances
to the end of the newly written material (the new end of file).

• Such a file can be reset to the beginning, and, on some systems, a program
may be able to skip forward or backward n records, for some integer n.

• This scheme is known as sequential access to a file.

[Figure: a sequential-access file, marked with its beginning, current position
and end; rewind returns to the beginning, read/write moves forward from the
current position.]
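
The tape model can be sketched as a small Python class. This is only an in-memory illustration; the class name SequentialFile and its method names are assumptions, not part of any real file API.

```python
class SequentialFile:
    """Tape-style file: records are read in order or appended at the end."""

    def __init__(self):
        self.records = []
        self.position = 0              # index of the next record to read

    def rewind(self):
        """Reset to the beginning of the file."""
        self.position = 0

    def read_next(self):
        """Return the next record and advance, or None at end of file."""
        if self.position >= len(self.records):
            return None
        record = self.records[self.position]
        self.position += 1
        return record

    def write_next(self, record):
        """Append at the end; the pointer moves to the new end of file."""
        self.records.append(record)
        self.position = len(self.records)

    def skip(self, n):
        """Move n records forward (or backward if n is negative)."""
        self.position = max(0, min(len(self.records), self.position + n))


tape = SequentialFile()
for record in ("a", "b", "c"):
    tape.write_next(record)

tape.rewind()
first = tape.read_next()      # "a"
tape.skip(1)                  # jump over "b"
third = tape.read_next()      # "c"
past_end = tape.read_next()   # None: end of file reached
print(first, third, past_end)
```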

➢ Direct/Random access

• Direct access is based on a disk model of a file.

• For direct access, the file is viewed as a numbered sequence of blocks or
records.
• A direct-access file allows arbitrary blocks to be read or written. Thus, after
block 18 has been read, block 57 could be next, and then block 3.
• There are no restrictions on the order of reading and writing for a
direct-access file.
• Direct-access files are of great use for immediate access to large amounts
of information.
• The file operations must be modified to include the block number as a
parameter.
• Thus, we have "read n", where n is the block number, rather than "read
next", and "write n", rather than "write next".
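
"Read n" and "write n" can be built from a seek followed by an ordinary read or write. A minimal sketch, assuming fixed-size blocks; the block size of 16 bytes, the 64-block file and the in-memory buffer standing in for a disk file are all arbitrary choices for the demonstration.

```python
import io

BLOCK_SIZE = 16   # bytes per block (arbitrary for this demo)

def write_block(f, n, data):
    """'write n': store `data` in block number n, padded with zero bytes."""
    if len(data) > BLOCK_SIZE:
        raise ValueError("data does not fit in one block")
    f.seek(n * BLOCK_SIZE)
    f.write(data.ljust(BLOCK_SIZE, b"\x00"))

def read_block(f, n):
    """'read n': return the contents of block number n."""
    f.seek(n * BLOCK_SIZE)
    return f.read(BLOCK_SIZE)

# An in-memory file of 64 empty blocks.
disk_file = io.BytesIO(bytes(BLOCK_SIZE * 64))

write_block(disk_file, 3, b"three")
write_block(disk_file, 18, b"eighteen")
write_block(disk_file, 57, b"fifty-seven")

# Blocks can be read in any order: 18, then 57, then 3.
order_read = [read_block(disk_file, n).rstrip(b"\x00") for n in (18, 57, 3)]
print(order_read)
```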

Space Allocation
The operating system allocates disk space to files. Operating systems use the
following three main methods to allocate disk space to files.

1. Contiguous Allocation
2. Linked Allocation
3. Indexed Allocation

➢ Contiguous Allocation
• In contiguous allocation, files are assigned to contiguous areas of secondary
storage.
• The location of a file is defined by the disk address of its first block and
its length.
• The file owner has to specify the size of the file in advance.
• It is widely used on CD-ROMs and DVDs, where final file sizes are known in
advance and won't change.
• This is shown in the following figure.

➢ Advantages:
1) All records of a file are normally physically adjacent to each other. This
increases the accessing speed of records.
2) Contiguous allocation supports both sequential and direct access.
3) Ease of implementation.

➢ Disadvantages:
1) To find space for a new file of N blocks, we have to search for a hole of
at least N contiguous free blocks.
2) External fragmentation.
3) Compaction is required.
4) The file must be reallocated if it grows in size.
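
The search for a hole and the resulting external fragmentation can be sketched in Python. The first-fit strategy, the 12-block disk and the file names below are assumptions chosen for illustration; the notes do not prescribe a particular search strategy.

```python
def first_fit(free, length):
    """Return the start of the first run of `length` free blocks, or None."""
    run_start, run_length = None, 0
    for block, is_free in enumerate(free):
        if is_free:
            if run_length == 0:
                run_start = block
            run_length += 1
            if run_length == length:
                return run_start
        else:
            run_length = 0
    return None

def allocate(free, directory, name, length):
    """Allocate `length` contiguous blocks; record (start, length)."""
    start = first_fit(free, length)
    if start is None:
        return False
    for block in range(start, start + length):
        free[block] = False
    directory[name] = (start, length)
    return True

def release(free, directory, name):
    """Delete a file and mark its blocks free again."""
    start, length = directory.pop(name)
    for block in range(start, start + length):
        free[block] = True

def block_of(directory, name, i):
    """Direct access: logical block i of a file lives at start + i."""
    start, length = directory[name]
    if not 0 <= i < length:
        raise IndexError("block number outside the file")
    return start + i

free = [True] * 12          # a tiny 12-block disk
directory = {}
allocate(free, directory, "a", 3)     # blocks 0-2
allocate(free, directory, "b", 4)     # blocks 3-6
allocate(free, directory, "c", 3)     # blocks 7-9
release(free, directory, "a")         # leaves a 3-block hole at the front

free_total = sum(free)                           # 5 blocks free in total
fits = allocate(free, directory, "d", 4)         # but no hole of 4 blocks
print(free_total, fits, block_of(directory, "b", 2))
```

Five blocks are free, yet a four-block file cannot be placed because the free space is split into holes of three and two blocks. This is the external fragmentation that compaction is meant to repair.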
➢ Linked Allocation

• Linked allocation is a disk-based version of the linked list.

• With linked allocation, each file is a linked list of disk blocks. These disk
blocks may be scattered anywhere on the disk.
• A few bytes of each disk block contain the address of the next block.
• The directory contains a pointer to the first (and last) blocks of the file.
• A file can continue to grow as long as there are free blocks.
• In the figure there is a file of 5 disk blocks which starts at block 9 and
ends at disk block 25.

➢ Advantages:
1) Simplicity.
2) No external disk fragmentation.
3) No disk compaction required.
4) There is no need to declare the size of a file when it is created.

➢ Disadvantages:
1) Slow direct access to any disk block.
2) Space is required for the pointers.
3) Reliability - since disk blocks are linked by pointers, a single damaged
pointer can make thousands of disk blocks inaccessible.
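
The cost of direct access under linked allocation shows up when the chain is followed in code. In the sketch below the first block (9) and last block (25) come from the example above, while the intermediate blocks 16, 1 and 10 are assumed for illustration, as are the names next_block, chain and nth_block.

```python
END = None

# Pointer stored in each disk block: block number -> next block of the file.
# 9 and 25 are the first and last blocks from the text; 16, 1, 10 are assumed.
next_block = {9: 16, 16: 1, 1: 10, 10: 25, 25: END}

directory = {"example": {"first": 9, "last": 25}}

def chain(first):
    """Yield the disk blocks of a file in logical order."""
    block = first
    while block is not END:
        yield block
        block = next_block[block]

def nth_block(first, n):
    """Return (disk block, pointers followed) for logical block n."""
    block, hops = first, 0
    for _ in range(n):
        block = next_block[block]
        if block is END:
            raise IndexError("block number outside the file")
        hops += 1
    return block, hops

blocks = list(chain(directory["example"]["first"]))
last, hops = nth_block(directory["example"]["first"], 4)
print(blocks, last, hops)
```

Reaching the last of the five blocks means following four pointers, each of which would be a separate disk read on a real disk. This is why direct access is slow with this method.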

➢ Indexed Allocation
• The principle of indexed allocation is illustrated in the figure.
• In this scheme each file is provided with its own index block, which is an
array of disk block pointers (addresses).
• The Nth entry in the index block points to the Nth disk block of the file. The
directory contains the address of the index block.
• To read the Nth disk block, the pointer in the Nth index block entry is used to
find the desired block, which is then read.

➢ Advantages:
1) The absence of external fragmentation.
2) Indexing of free space can be accomplished by means of a bit map.

➢ Disadvantages:
1) Extra disk accesses are needed to retrieve the address of the target block
on disk.
2) Indexed allocation requires a lot of space for keeping pointers.
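
The lookup through the index block can be sketched as follows. The disk is modelled as a Python dictionary, and the block numbers, the file name report and the record contents are all arbitrary values chosen for the example.

```python
disk = {}   # block number -> contents (a stand-in for the real disk)

def create_file(directory, name, index_block, data_blocks, contents):
    """Store the data blocks and an index block that points to them."""
    disk[index_block] = list(data_blocks)        # array of block addresses
    for block, payload in zip(data_blocks, contents):
        disk[block] = payload
    directory[name] = index_block                # directory -> index block

def read_nth(directory, name, n):
    """Read logical block n using the nth entry of the index block."""
    index = disk[directory[name]]    # first access: fetch the index block
    return disk[index[n]]            # second access: fetch the data block

directory = {}
create_file(directory, "report", index_block=19,
            data_blocks=[9, 16, 1, 10, 25],
            contents=["r0", "r1", "r2", "r3", "r4"])

fourth = read_nth(directory, "report", 3)
print(fourth)
```

Any block is reached in two accesses regardless of its position in the file, which gives efficient direct access at the price of one extra block per file for the index.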

General Directory Structure

An operating system can organize its directories in several ways. The common
directory structures are as follows:

1) Single Level Directory
2) Two Level Directory
3) Tree Structured Directory

1) Single Level Directory

• A single-level directory is simple to implement, but each file must have a
unique name.
• All files are stored in the same directory. A single-level directory has
significant limitations, however, when the number of files increases or when
there is more than one user.
• Since all files are in the same directory, they must have unique names. If
two users both call their data file "test", the unique-name rule is violated.
• Even with a single user, as the number of files increases, it becomes difficult
to remember the names of all the files in order to create only files with
unique names.

Limitations of Single Level Directory

a) Since all files are in the same directory, they must have unique names.
b) If two users both call their data file "test", the unique-name rule is violated.
c) File names are limited in length.
d) Even a single user may find it difficult to remember the names of all files as
the number of files increases.
e) Keeping track of so many files is a difficult task.

2) Two Level Directory

• The standard solution to the limitations of a single-level directory is to
create a separate directory for each user.
• In the two-level directory structure, each user has their own user file
directory (UFD). The UFDs have similar structures, but each lists only the
files of a single user.
• When a user job starts or a user logs in, the system's master file directory
(MFD) is searched. The MFD is indexed by user name or account number,
and each entry points to the UFD for that user.

Advantages:
i) A file is identified by a path name (user name plus file name).
ii) Different users can have files with the same name.
iii) Searching is efficient, since only one user's UFD is searched.

Disadvantages:
No grouping capability: a user cannot organize files into subdirectories.
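
The MFD and UFDs can be modelled as nested dictionaries. The sketch below is a toy model; the user names, the function names and the location numbers are hypothetical.

```python
mfd = {}   # master file directory: user name -> that user's UFD

def login(user):
    """Search the MFD for the user's UFD, creating an empty one if needed."""
    return mfd.setdefault(user, {})

def create(user, file_name, location):
    """Add a file to the user's own UFD; names must be unique per user."""
    ufd = login(user)
    if file_name in ufd:
        raise FileExistsError(f"{user} already has a file named {file_name}")
    ufd[file_name] = location

def lookup(user, file_name):
    """Resolve the path name (user, file name) to a location."""
    return mfd[user][file_name]

# Two users can both call their data file "test".
create("alice", "test", 100)
create("bob", "test", 200)

alice_test = lookup("alice", "test")
bob_test = lookup("bob", "test")

# Within one UFD the unique-name rule still applies.
try:
    create("alice", "test", 300)
    duplicate_rejected = False
except FileExistsError:
    duplicate_rejected = True

print(alice_test, bob_test, duplicate_rejected)
```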

Tree Structured Directory
• In a tree-structured directory system, any directory entry can be either a
file or a subdirectory.

• The tree-structured directory system overcomes the drawbacks of the two-level
directory system. Similar kinds of files can now be grouped in one directory.

• Each user has their own directory and cannot enter another user's directory.
A user has permission to read the root's data but cannot write or modify it.
Only the administrator of the system has complete access to the root
directory.

• Searching is more efficient in this directory structure. The concept of a
current working directory is used. A file can be accessed by two types of
path, either relative or absolute.

• An absolute path is the path of the file with respect to the root directory of
the system, while a relative path is the path with respect to the current
working directory. In tree-structured directory systems, the user is given
the privilege to create files as well as directories.
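
The difference between the two kinds of path can be shown with Python's posixpath module. This is a sketch using UNIX-style paths; the directory names and the helper resolve are made up for the example.

```python
import posixpath

def resolve(path, cwd):
    """Turn a relative or absolute path into a normalized absolute path."""
    if not posixpath.isabs(path):
        # A relative path is interpreted from the current working directory.
        path = posixpath.join(cwd, path)
    return posixpath.normpath(path)

cwd = "/home/alice"   # hypothetical current working directory

from_relative = resolve("notes/os.txt", cwd)
from_absolute = resolve("/etc/hosts", cwd)
from_parent = resolve("../bob/test", cwd)

print(from_relative)
print(from_absolute)
print(from_parent)
```

The absolute path gives the same result whatever the working directory is, while the relative paths change meaning whenever cwd changes.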

Disk Organization and Disk Structure

In many computers, most of the secondary storage is in the form of magnetic
disks. Hence, knowing the structure of a magnetic disk is necessary to understand
how the data on the disk is accessed by the computer.

Physical structure of a magnetic disk

A magnetic disk contains several platters. Each platter is divided into circular
tracks. Tracks near the centre are shorter than tracks farther from the centre.
Each track is further divided into sectors, as shown in the figure.
Tracks at the same distance from the centre on all platters form a cylinder. A
read-write head is used to read data from, and write data to, a sector of the
magnetic disk.
The speed of the disk is measured in two parts:

• Transfer rate: This is the rate at which data moves from the disk to the
computer.
• Random access time: This is the sum of the seek time and the rotational
latency.

Seek time is the time taken by the arm to move the head to the required track.
Rotational latency is the time taken for the required sector to rotate under
the head once the head is on that track.
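
These quantities combine into a simple estimate of how long one request takes. In the sketch below the average rotational latency is taken as half a revolution, a common approximation; the 7200 RPM speed, 9 ms seek time and 100 MB/s transfer rate are assumed example figures, not values from these notes.

```python
def average_rotational_latency_ms(rpm):
    """On average the platter must turn half a revolution."""
    ms_per_revolution = 60_000 / rpm
    return ms_per_revolution / 2

def random_access_time_ms(seek_ms, rpm):
    """Random access time = seek time + rotational latency."""
    return seek_ms + average_rotational_latency_ms(rpm)

def transfer_time_ms(num_bytes, rate_mb_per_s):
    """Time to move num_bytes at the given transfer rate."""
    return num_bytes / (rate_mb_per_s * 1_000_000) * 1000

# Assumed example drive: 7200 RPM, 9 ms average seek, 100 MB/s transfer.
latency = average_rotational_latency_ms(7200)
access = random_access_time_ms(9.0, 7200)
transfer = transfer_time_ms(4096, 100)

print(f"rotational latency: {latency:.2f} ms")
print(f"random access time: {access:.2f} ms")
print(f"transfer of 4 KB:   {transfer:.3f} ms")
```

For small requests the random access time dominates the transfer time by a wide margin, which is why the placement of a file's blocks on disk matters for performance.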

Logical structure

➢ The Master Boot Record (MBR)

The MBR is at the beginning of the hard drive. When your computer starts
using your hard drive, this is where it looks first.

The MBR itself has a specific organization. The size of the MBR is 512
bytes (446 + 4 × 16 + 2).

The boot loader is the first 446 bytes of the MBR. This section contains
the executable boot code.

The partition table has 4 slots of 16 bytes each, each containing the
description of a partition (primary or extended) on the disk.

The magic number is the last two bytes, used to determine whether the hard
disk has a boot loader. If it does, these bytes hold the hexadecimal values
55 and AA, in that order.
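
The 512-byte layout can be taken apart with Python's struct module. The sketch below works on a synthetic sector built in memory rather than a real disk; the field layout inside each 16-byte entry (status byte, type byte, starting sector, sector count) is the classic MBR entry format, which these notes do not describe, and the sample values are made up.

```python
import struct

MBR_SIZE = 512
BOOT_CODE_SIZE = 446
ENTRY_SIZE = 16
SIGNATURE = b"\x55\xaa"
# Entry layout (not from the notes): status, 3 bytes CHS start, type,
# 3 bytes CHS end, first sector (LBA), sector count; little-endian.
ENTRY_FORMAT = "<B3xB3xII"

def parse_mbr(sector):
    """Split a 512-byte sector into boot code, 4 partition slots, signature."""
    if len(sector) != MBR_SIZE:
        raise ValueError("an MBR is exactly 512 bytes")
    partitions = []
    for slot in range(4):
        start = BOOT_CODE_SIZE + slot * ENTRY_SIZE
        status, ptype, first_lba, count = struct.unpack(
            ENTRY_FORMAT, sector[start:start + ENTRY_SIZE])
        partitions.append({"active": status == 0x80, "type": ptype,
                           "first_sector": first_lba, "sectors": count})
    return {
        "boot_code": sector[:BOOT_CODE_SIZE],
        "partitions": partitions,
        "has_boot_signature": sector[510:512] == SIGNATURE,
    }

# Build a synthetic MBR with one made-up partition in slot 0.
sample = bytearray(MBR_SIZE)
sample[446:462] = struct.pack(ENTRY_FORMAT, 0x80, 0x83, 2048, 204800)
sample[510:512] = SIGNATURE

mbr = parse_mbr(bytes(sample))
print(mbr["has_boot_signature"], mbr["partitions"][0])
```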

RAID structure of disk

RAID (redundant array of independent disks; originally redundant array of
inexpensive disks) is a way of storing the same data in different places on
multiple hard disks to protect data in the case of a drive failure. However, not all
RAID levels provide redundancy.

How RAID works
RAID works by placing data on multiple disks and allowing input/output (I/O)
operations to overlap in a balanced way, improving performance. Because an array
of several disks has a lower mean time between failures (MTBF) than a single
disk, data is stored redundantly to increase fault tolerance.
RAID arrays appear to the operating system (OS) as a single logical hard disk.
RAID employs the techniques of disk mirroring or disk striping. Mirroring copies
identical data onto more than one drive. Striping partitions each drive's storage
space into units ranging from a sector (512 bytes) up to several megabytes. The
stripes of all the disks are interleaved and addressed in order.
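
The interleaving of stripes can be expressed as a simple mapping from a logical stripe unit to a disk and a position on that disk. A minimal sketch, assuming plain round-robin striping over four disks; the function name locate is made up.

```python
def locate(logical_unit, num_disks):
    """Map a logical stripe unit to (disk index, unit index on that disk)."""
    return logical_unit % num_disks, logical_unit // num_disks

NUM_DISKS = 4   # assumed array size for the example

placement = [locate(unit, NUM_DISKS) for unit in range(6)]
print(placement)

# Consecutive units land on different disks, so a large request can be
# served by all disks working in parallel.
disks_used_by_first_four = {disk for disk, _ in placement[:4]}
print(sorted(disks_used_by_first_four))
```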

Standard RAID levels

RAID 0: This configuration has striping, but no redundancy of data. It offers the
best performance, but no fault tolerance.

RAID 1: Also known as disk mirroring, this configuration consists of at least two
drives that duplicate the storage of data. There is no striping. Read performance is
improved since either disk can be read at the same time. Write performance is the
same as for single disk storage.

RAID 2: This configuration uses striping across disks, with some disks storing
error checking and correcting (ECC) information. It has no advantage over RAID 3
and is no longer used.

RAID 3: This technique uses striping and dedicates one drive to storing parity
information. The embedded ECC information is used to detect errors. Data
recovery is accomplished by calculating the exclusive OR (XOR) of the
information recorded on the other drives. Since an I/O operation addresses all
the drives at the same time, RAID 3 cannot overlap I/O. For this reason, RAID 3 is
best for single-user systems with long record applications.
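
The XOR recovery can be demonstrated in a few lines of Python. This is a sketch with three data drives and one parity drive; the block contents are arbitrary and the function name parity is an assumption.

```python
from functools import reduce

def parity(blocks):
    """XOR equal-length blocks together, byte by byte."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))

# Three data drives holding one block each (arbitrary contents).
data = [b"disk", b"zero", b"fail"]
parity_block = parity(data)

# Suppose drive 1 is lost: XOR the surviving drives with the parity block.
recovered = parity([data[0], data[2], parity_block])
print(recovered)
```

The same calculation both creates the parity block and rebuilds a missing block, because XOR-ing a value in twice cancels it out.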

RAID 4: This level uses large stripes, which means you can read records from any
single drive. This allows you to use overlapped I/O for read operations. Since all
write operations have to update the parity drive, no overlapping of write I/O is
possible. RAID 4 offers no advantage over RAID 5.

RAID 5: This level is based on block-level striping with parity. The parity
information is striped across each drive, allowing the array to function even if one
drive were to fail. The array's architecture allows read and write operations to span
multiple drives. This results in performance that is usually better than that of a
single drive, but not as high as that of a RAID 0 array. RAID 5 requires at least
three disks, but it is often recommended to use at least five disks for performance
reasons.

RAID 5 arrays are generally considered to be a poor choice for use on write-
intensive systems because of the performance impact associated with writing parity
information. When a disk does fail, it can take a long time to rebuild a RAID 5
array. Performance is usually degraded during the rebuild time, and the array is
vulnerable to an additional disk failure until the rebuild is complete.

RAID 6: This technique is similar to RAID 5, but includes a second parity scheme
that is distributed across the drives in the array. The use of additional parity allows
the array to continue to function even if two disks fail simultaneously. However,
this extra protection comes at a cost. RAID 6 arrays have a higher cost per
gigabyte (GB) and often have slower write performance than RAID 5 arrays.
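
The trade-off between capacity and protection can be summarized with a small calculation. The sketch assumes equal-sized disks and covers only levels 0, 1, 5 and 6; the minimum disk counts for RAID 0 and RAID 6 are the usual figures rather than values stated above, and the 2 TB disk size is an example.

```python
# level -> (minimum disks, usable disks for n, disk failures tolerated for n)
RAID_LEVELS = {
    0: (2, lambda n: n,     lambda n: 0),       # striping only
    1: (2, lambda n: 1,     lambda n: n - 1),   # every disk is a full copy
    5: (3, lambda n: n - 1, lambda n: 1),       # one disk's worth of parity
    6: (4, lambda n: n - 2, lambda n: 2),       # two disks' worth of parity
}

def summarize(level, num_disks, disk_tb):
    """Return (usable capacity in TB, number of disk failures tolerated)."""
    minimum, usable, tolerated = RAID_LEVELS[level]
    if num_disks < minimum:
        raise ValueError(f"RAID {level} needs at least {minimum} disks")
    return usable(num_disks) * disk_tb, tolerated(num_disks)

# Four 2 TB disks under each level that allows four disks.
for level in (0, 5, 6):
    capacity, failures = summarize(level, 4, 2.0)
    print(f"RAID {level}: {capacity} TB usable, survives {failures} failure(s)")
```

With the same four disks, RAID 6 gives up one more disk of capacity than RAID 5 in exchange for surviving a second failure, which is the higher cost per gigabyte mentioned above.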
