5.FileSystems

The document discusses various file systems, focusing on FAT and EXT2, detailing their structures, operations, and methods for maintaining consistency. It also covers advanced concepts like journaling, copy-on-write, and the Btrfs file system, which utilizes a B-tree structure for efficient data management. Key topics include inode management, bitmap allocation, and strategies for handling file system inconsistencies after crashes.

Uploaded by

soumyapadhy3

Unit-III: File Systems

Subhrendu Chattopadhyay, IDRBT

Disclaimer: A few of the images are taken from the Internet and textbooks.

FAT: File Allocation Table

FAT
• Simple file system popularized by MS-DOS
• First introduced in 1977
• Most devices today use the FAT32 spec from 1996
• FAT12, FAT16, VFAT, FAT32, etc.
• Still quite popular today
• Default format for USB sticks and memory cards
• Used for EFI boot partitions
• Name comes from the index table used to track directories and files
FAT Layout
Superblock
● Stores basic info about the file system
● FAT version, location of boot files
● Total number of blocks
● Index of the root directory in the FAT

FAT
● File allocation table (FAT)
● Marks which blocks are free or in-use
● Linked-list structure to manage large files

Data Block
● Store file and directory data
● Each block is a fixed size (4KB – 64KB)
● Files may span multiple blocks

https://elixir.bootlin.com/linux/v6.11-rc4/source/mm
FAT
• Entry size depends on FAT-12, FAT-16, FAT-32
• Directories are special files
• Contains a list of entries inside the directory file
• Possible values for FAT entries (simplified, shown for 16-bit entries):
• 0 – entry is empty
• 1 – reserved by the OS
• 1 < N < 0xFFFF – next block in a chain
• 0xFFFF – end of a chain
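A minimal sketch of how a file's blocks can be found by following the linked list stored in the FAT, using the simplified entry values listed on this slide. The function name `follow_chain` and the 8-entry table `example_fat` are assumptions made for illustration; a real FAT driver reads the table from disk and handles more marker values.

```python
# Simplified 16-bit markers taken from the slide above.
FREE = 0x0000
RESERVED = 0x0001
END_OF_CHAIN = 0xFFFF


def follow_chain(fat, start):
    """Return the block numbers of a file whose first block is `start`."""
    blocks = []
    seen = set()
    current = start
    while True:
        if current in seen:
            raise ValueError(f"cycle detected at block {current}")
        seen.add(current)
        blocks.append(current)
        nxt = fat[current]
        if nxt == END_OF_CHAIN:
            return blocks
        if nxt in (FREE, RESERVED):
            raise ValueError(f"chain from block {current} hits a free/reserved entry")
        current = nxt


# Hypothetical table: one file occupies blocks 2 -> 5 -> 4.
example_fat = [FREE, RESERVED, 5, FREE, END_OF_CHAIN, 4, FREE, FREE]
print(follow_chain(example_fat, 2))  # [2, 5, 4]
```

Reading a large file this way costs one table lookup per block, which is why the FAT is usually cached in memory.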
EXT2: Second Extended File System

Ext2
• Inode: Every file or directory is represented by an inode, a special data structure used to index files and directories.
• Block Groups: The file system is divided into block groups, which contain a fixed number of inodes, data blocks, and a superblock. This layout improves performance and allows for better disk usage.
Ext2
● Directories are files; each contains the list of entries in the directory
● Each inode can directly point to 12 blocks
● Can also indirectly point to blocks at 1, 2, and 3 levels of depth

Root directory (/) entries, root inode = 0:
| Name | Inode |
|---|---|
| . | 0 |
| bin | 1 |
| home | 2 |
| initrd.img | 3 |

[Figure: on-disk layout with inode bitmap, data bitmap, inodes, and data blocks]
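A rough sketch of how directory entries turn a path into an inode number, using the root directory entries shown on this slide. The `directories` dictionary, the `resolve` helper, and the contents of `/home` are assumptions for illustration only; on disk, each directory's entries live in the data blocks of that directory's inode.

```python
ROOT_INODE = 0  # as on the slide

# Hypothetical in-memory stand-in: inode number -> {name: inode number}.
directories = {
    0: {".": 0, "bin": 1, "home": 2, "initrd.img": 3},  # root, from the slide
    2: {".": 2, "user": 7},  # assumed contents of /home
}


def resolve(path):
    """Walk the path one component at a time, starting at the root inode."""
    inode = ROOT_INODE
    for name in path.strip("/").split("/"):
        if name == "":
            continue  # handles "/" and doubled slashes
        entries = directories.get(inode)
        if entries is None:
            raise NotADirectoryError(path)
        if name not in entries:
            raise FileNotFoundError(path)
        inode = entries[name]
    return inode


print(resolve("/home"))       # 2
print(resolve("/home/user"))  # 7
```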
Ext2: Inode Entries
/*
 * Structure of an inode on the disk
 */
struct ext2_inode { … }

| Field | Size | Description |
|---|---|---|
| Mode | 2B | File type and permissions (indicates file type, owner, group, and other permissions). |
| UID | 2B | User ID of the file's owner. |
| Size | 4B | File size in bytes. For directories, it is the total size of the directory contents. |
| Atime | 4B | Last access time (in seconds since Unix epoch). |
| Ctime | 4B | Creation time or last inode status change time. |
| Mtime | 4B | Last modification time. |
| Dtime | 4B | Deletion time. Stores when the file was deleted (if applicable). |

https://elixir.bootlin.com/linux/v6.11-rc4/source/fs/ext2/ext2.h#L290
Ext2: Inode Entries
| Field | Size | Description |
|---|---|---|
| GID | 2B | Group ID of the file. |
| Links count | 2B | Number of hard links to the file. |
| Flags | 4B | File attributes (e.g., secure deletion, immutability, compression). |
| Blocks | 4B | Total number of 512-byte blocks used by the file (including indirect blocks). |
| Pointers to data blocks | 60B | 12 direct, 1 single indirect, 1 double indirect, 1 triple indirect block pointers. |
| Generation number | 4B | File versioning number, used in network file systems (e.g., NFS). |
| File ACL | 4B | Access control list information for additional file permissions. |
| Directory ACL | 4B | ACL information specific to directories. |
| Fragment address | 4B | Not commonly used; holds file fragment information if the file is fragmented. |
| OS-specific | 12B | OS-specific data fields for additional functionality. |
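To make the 60-byte pointer field concrete, the sketch below works out which kind of pointer covers a given logical block of a file, and how many blocks the pointer structure can address in total. It assumes 4-byte block pointers (15 pointers in 60 bytes) and a 4 KB block size; the helper names are illustrative. Other fields of the inode may impose lower limits than the figure computed here.

```python
DIRECT_POINTERS = 12
POINTER_SIZE = 4  # bytes; 15 pointers * 4 B = the 60 B field in the table


def pointer_level(block_index, block_size=4096):
    """Name the pointer type that reaches logical block `block_index`."""
    if block_index < 0:
        raise ValueError("block index must be non-negative")
    per_block = block_size // POINTER_SIZE  # pointers held by one indirect block
    if block_index < DIRECT_POINTERS:
        return "direct"
    block_index -= DIRECT_POINTERS
    if block_index < per_block:
        return "single indirect"
    block_index -= per_block
    if block_index < per_block ** 2:
        return "double indirect"
    block_index -= per_block ** 2
    if block_index < per_block ** 3:
        return "triple indirect"
    raise ValueError("beyond what the inode pointers can address")


def max_addressable_blocks(block_size=4096):
    per_block = block_size // POINTER_SIZE
    return DIRECT_POINTERS + per_block + per_block ** 2 + per_block ** 3


print(pointer_level(11), "|", pointer_level(12), "|", pointer_level(1036))
# direct | single indirect | double indirect
```

With 4 KB blocks each indirect block holds 1024 pointers, so small files are served entirely by the 12 direct pointers and only large files pay for the extra levels of lookup.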

Ext2: Bitmaps: Block Bitmap
• The block bitmap tracks the allocation status of data blocks within a block group.
• The size of the block bitmap depends on the number of blocks in the block group. Each bit in the bitmap represents a single block in the block group.
• 1: The block is allocated (in use).
• 0: The block is free (available for allocation).
• Usage:
• When a file is created or expanded, the filesystem scans the block bitmap to find free blocks (bits set to 0). Once blocks are allocated, the corresponding bits in the block bitmap are set to 1.
• During file deletion, the blocks are deallocated, and the corresponding bits in the bitmap are cleared (set to 0), marking those blocks as free for future use.

| Block | B1 | B2 | B3 | B4 | B5 | B6 | B7 | B8 | B9 | B10 |
|---|---|---|---|---|---|---|---|---|---|---|
| Block Bitmap | 1 | 0 | 1 | 1 | 0 | 0 | 1 | 0 | 1 | 0 |
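The scan-and-set behaviour described on this slide can be sketched as follows, starting from the ten-entry bitmap shown here. This is a simplification: the bitmap is a Python list with one integer per block rather than packed bits, the first-fit policy is an assumption, and `allocate` and `release` are illustrative names.

```python
def allocate(bitmap, count):
    """First-fit: claim `count` free entries and return their 0-based indices."""
    free = [i for i, bit in enumerate(bitmap) if bit == 0][:count]
    if len(free) < count:
        raise OSError("not enough free blocks")
    for i in free:
        bitmap[i] = 1  # mark as in use
    return free


def release(bitmap, indices):
    """Clear the bits of deallocated blocks so they can be reused."""
    for i in indices:
        bitmap[i] = 0


block_bitmap = [1, 0, 1, 1, 0, 0, 1, 0, 1, 0]  # B1..B10 from the slide
got = allocate(block_bitmap, 3)
print(got)           # [1, 4, 5], i.e. B2, B5, B6
print(block_bitmap)  # [1, 1, 1, 1, 1, 1, 1, 0, 1, 0]
```

The same two operations apply unchanged to the inode bitmap, with one bit per inode instead of one bit per block.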
Ext2: Bitmaps: inode Bitmap
• The inode bitmap tracks the allocation status of inodes within a block group.
• The size of the inode bitmap depends on the number of inodes in the block group. Each bit in the inode bitmap represents a single inode.
• 1: The inode is allocated (in use, assigned to a file or directory).
• 0: The inode is free (available for use).
• Usage:
• When a new file or directory is created, the inode bitmap is scanned to find a free inode (bit set to 0). The inode is then allocated, and the corresponding bit in the bitmap is set to 1.
• When a file or directory is deleted, the inode is deallocated, and its corresponding bit is cleared to 0, marking the inode as free.

| Inode | I1 | I2 | I3 | I4 | I5 | I6 | I7 | I8 | I9 | I10 |
|---|---|---|---|---|---|---|---|---|---|---|
| Inode Bitmap | 1 | 1 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 |
Ext2: Write Operation Example
• Let’s go through an example where you write a 10 KB file to an EXT2 filesystem, assuming the block size is 4 KB.
• Open the File:
• The kernel opens the file, looks up its inode, and initializes the necessary metadata.
• Write 10 KB of Data:
• The kernel splits the 10 KB of data into blocks. Since the block size is 4 KB, the file will require 3 blocks: two full 4 KB blocks and one partial 2 KB block.
• Block Allocation:
• The kernel checks the block bitmap and allocates 3 blocks for the file. The block pointers in the inode are updated to point to these blocks.
• Data Write:
• The kernel writes the first 4 KB of data to the first block, the next 4 KB to the second block, and the remaining 2 KB to the third block.
• Update Inode:
• The inode is updated to reflect the new file size (10 KB) and the block pointers are updated. The modification time (i_mtime) is also updated.
• Delayed Write:
• The data may initially be written to the page cache and only later written to the disk during a sync operation or a periodic flush.
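The block arithmetic in this example can be checked with a few lines. The helper names are illustrative, and the payload is 10 KB of zero bytes standing in for user data.

```python
BLOCK_SIZE = 4 * 1024  # 4 KB, as assumed in the example


def blocks_needed(size, block_size=BLOCK_SIZE):
    """Ceiling division: a partial block still occupies a whole block."""
    return -(-size // block_size)


def split_into_blocks(data, block_size=BLOCK_SIZE):
    """Cut the data into block-sized chunks; the last one may be short."""
    return [data[i:i + block_size] for i in range(0, len(data), block_size)]


payload = bytes(10 * 1024)
chunks = split_into_blocks(payload)
print(blocks_needed(len(payload)), [len(c) for c in chunks])
# 3 [4096, 4096, 2048]
```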
Ext2: Inconsistency
• Many operations result in multiple, independent writes to the file system
• Example: append a block to an existing file
• Update the free data bitmap
• Update the inode
• Write the user data
• What happens if the computer crashes in the middle of this process?
Ext2: Inconsistency
• The disk guarantees that sector writes are atomic
• No way to make multi-sector writes atomic
• How to ensure consistency after a crash?
• Don’t bother to ensure consistency
• Accept that the file system may be inconsistent after a crash
• Run a program that fixes the file system during bootup
• File system checker (fsck)
• Use a transaction log to make multi-writes atomic
• Log stores a history of all writes to the disk
• After a crash the log can be “replayed” to finish updates
• Journaling file system
Approach-1: Check file-system
• Key idea: fix inconsistent file systems during bootup
• Unix utility called fsck (chkdsk on Windows)
• Scans the entire file system multiple times, identifying and correcting inconsistencies
• Why during bootup?
• No other file system activity can be going on
• After fsck runs, bootup/mounting can continue
fsck
• Superblock: validate the superblock, replace it with a backup if it is corrupted
• Free blocks and inodes: rebuild the bitmaps by scanning all inodes
• Reachability: make sure all inodes are reachable from the root of the file system
• inodes: delete all corrupted inodes, and rebuild their link counts by walking the directory tree
• directories: verify the integrity of all directories
• … and many other minor consistency checks
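As an illustration of the "rebuild the bitmaps by scanning all inodes" step, the sketch below recomputes a block bitmap from the block pointers of the in-use inodes and compares it with a stale on-disk copy. The data layout (a dictionary of inode number to block list) and the function name are assumptions; a real fsck also has to walk indirect blocks and resolve doubly claimed blocks rather than just reporting them.

```python
def rebuild_block_bitmap(inodes, total_blocks):
    """Recompute the block bitmap from the pointers of all in-use inodes."""
    bitmap = [0] * total_blocks
    for inode_no, pointers in inodes.items():
        for block in pointers:
            if bitmap[block]:
                raise ValueError(f"block {block} claimed twice (inode {inode_no})")
            bitmap[block] = 1
    return bitmap


# Hypothetical state after a crash: inode 2 already points at block 6,
# but the bitmap update for block 6 never reached the disk.
inodes = {1: [0, 2], 2: [3, 6]}
stale_bitmap = [1, 0, 1, 1, 0, 0, 0, 0]

rebuilt = rebuild_block_bitmap(inodes, total_blocks=8)
mismatches = [i for i, (a, b) in enumerate(zip(stale_bitmap, rebuilt)) if a != b]
print(rebuilt, mismatches)  # [1, 0, 1, 1, 0, 0, 1, 0] [6]
```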

• Advantages of fsck
• Doesn’t require the file system to do any work to ensure consistency
• Makes the file system implementation simpler
• Disadvantages of fsck
• Very complicated to implement the fsck program
• Many possible inconsistencies that must be identified
• Many difficult corner cases to consider and handle
• fsck is super slow
• Scans the entire file system multiple times
• Imagine how long it would take to fsck a 40 TB RAID array
Approach-2: Journaling
• Key idea: make writes transactional by using a write-ahead log
• Commonly referred to as a journal
• Ext3 and NTFS use journaling
• After the log is written, the writes execute normally
• In essence, the log records transactions
Approach-2: Journaling
• What happens after a crash…
• If the writes to the log are interrupted?
• The transaction is incomplete
• The user’s data is lost, but the file system is consistent
• If the writes to the log succeed, but the normal writes are interrupted?
• The file system may be inconsistent, but…
• The log has exactly the right information to fix the problem
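A toy model of the two crash cases above, assuming a journal that records whole transactions before any in-place write happens. The class `JournaledDisk`, its methods, and the `crash_after` parameter are hypothetical names for illustration; real journals also need commit records, checksums, and ordering guarantees from the disk.

```python
class JournaledDisk:
    def __init__(self):
        self.blocks = {}   # block name -> contents (the "disk")
        self.journal = []  # committed transactions not yet applied in place

    def commit(self, writes):
        """Step 1: record the whole transaction in the journal."""
        self.journal.append(dict(writes))

    def checkpoint(self, crash_after=None):
        """Step 2: apply journaled writes in place; optionally crash part-way."""
        applied = 0
        for txn in self.journal:
            for block, data in txn.items():
                if crash_after is not None and applied >= crash_after:
                    return False  # simulated crash: the journal is kept
                self.blocks[block] = data
                applied += 1
        self.journal.clear()
        return True

    def recover(self):
        """After a crash, replay every committed transaction (idempotent)."""
        return self.checkpoint()


disk = JournaledDisk()
disk.commit({"bitmap": "bit 7 set", "inode": "size=8K", "data": "hello"})
disk.checkpoint(crash_after=1)  # only the bitmap write reaches the disk
disk.recover()                  # replay finishes the inode and data writes
print(disk.blocks)
```

If the crash happens before `commit` completes, the transaction is simply absent from the journal, which matches the first case on the slide: the update is lost but nothing on disk is half-written.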
Approach-2: Journaling
• Advantages of journaling
• Robust, fast file system recovery
• No need to scan the entire journal or file system
• Relatively straightforward to implement
• Disadvantages of journaling
• Write traffic to the disk is doubled
• Especially the file data, which is probably large
B+ Tree: Use of B+ Trees in File Systems

Inodes and their problem
• Recall: inodes use indirection to acquire additional blocks of pointers
• Problem: inodes are not efficient for large files
• Example: for a 100MB file, you need 25600 block pointers (assuming 4KB blocks)
• This is unavoidable if the file is 100% fragmented
• However, what if large groups of blocks are contiguous?
• Extents are better suited for contiguous files
• B-trees are widely used for file system representation (WAFL, ZFS, BTRFS)
• Logarithmic-time key search, insert, and remove
• Represent sparse files well
• The file system is represented as a large tree made up of fixed-size pages
• Shadowing:
• Technique to support atomic updates over persistent data structures
• Used to implement snapshots, crash recovery, write-batching, and RAID
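Returning to the pointer-count example on this slide, the sketch below shows why extents help when blocks are contiguous: a run of consecutive block numbers collapses into a single (start, length) pair. The function name `to_extents` and the starting block number 1000 are assumptions for illustration.

```python
def to_extents(block_numbers):
    """Collapse an ordered block list into (start_block, length) extents."""
    extents = []
    for block in block_numbers:
        if extents and block == extents[-1][0] + extents[-1][1]:
            start, length = extents[-1]
            extents[-1] = (start, length + 1)  # extend the current run
        else:
            extents.append((block, 1))         # start a new run
    return extents


BLOCK_SIZE = 4 * 1024
pointers_needed = (100 * 1024 * 1024) // BLOCK_SIZE  # 25600 for a 100MB file

contiguous = to_extents(range(1000, 1000 + pointers_needed))
fragmented = to_extents([5, 6, 7, 20, 21, 9])
print(pointers_needed, contiguous, fragmented)
# 25600 [(1000, 25600)] [(5, 3), (20, 2), (9, 1)]
```

A fully fragmented file gains nothing from this representation, since every block becomes its own extent of length 1.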
Shadowing
• To update an on-disk page (the page is on disk, not yet in memory)
• Read the entire page into memory
• Modify the page
• Write it to an alternate location
• When a page is shadowed, its location on the disk changes
• Update (and shadow) the immediate ancestor of the page with the new address
• The change propagates up to the file system root
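The propagation to the root can be sketched with an in-memory tree in which an update copies every node on the path from the root to the modified page and shares everything else. `Node` and `shadow_update` are illustrative names, and Python object identity stands in for on-disk page addresses.

```python
class Node:
    def __init__(self, children=None, value=None):
        self.children = dict(children or {})  # name -> child Node
        self.value = value


def shadow_update(root, path, value):
    """Return a new root; nodes along `path` are copied, all others shared."""
    if not path:
        return Node(root.children, value)   # shadowed copy of the target page
    copy = Node(root.children, root.value)  # shadow this ancestor
    key = path[0]
    copy.children[key] = shadow_update(root.children[key], path[1:], value)
    return copy


old_root = Node({"a": Node({"x": Node(value=1)}), "b": Node(value=2)})
new_root = shadow_update(old_root, ["a", "x"], 99)

print(old_root.children["a"].children["x"].value)          # 1 (old tree intact)
print(new_root.children["a"].children["x"].value)          # 99
print(new_root.children["b"] is old_root.children["b"])    # True (shared)
```

Because the old root still describes a complete, unmodified tree, switching from the old root to the new one is the single step that makes the update visible.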
Copy-on-Write (COW)
• The core idea behind copy-on-write is to delay the copying of data until it is absolutely necessary, saving resources like memory and disk space. Instead of creating a full copy of data when it’s shared or duplicated, the system shares the same data with multiple processes or users and only creates a copy when someone tries to modify the shared data.
• Some filesystems, like Btrfs and ZFS, use copy-on-write to efficiently manage file modifications and snapshots. Ext4 does not do copy-on-write natively; snapshots of an ext4 volume usually come from a layer beneath it, such as LVM.
• Snapshots: When creating a snapshot, the filesystem doesn’t immediately duplicate the entire file or directory. Instead, it marks the snapshot as sharing the original data. If the original data is modified, the filesystem only copies the blocks being modified, leaving the unmodified blocks shared between the original file and the snapshot.
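A small sketch of snapshot sharing under stated assumptions: files are lists of block ids, a snapshot copies only the ids, and a reference count decides whether a write must copy the block first. `CowStore` and its methods are hypothetical names. Overwriting an unshared block in place is a simplification of the general idea; a COW file system such as Btrfs writes new versions elsewhere in every case.

```python
class CowStore:
    def __init__(self):
        self.blocks = {}    # block id -> bytes
        self.refcount = {}  # block id -> number of files/snapshots using it
        self.next_id = 0

    def _new_block(self, data):
        block_id = self.next_id
        self.next_id += 1
        self.blocks[block_id] = data
        self.refcount[block_id] = 1
        return block_id

    def create(self, chunks):
        return [self._new_block(c) for c in chunks]

    def snapshot(self, file_blocks):
        for block_id in file_blocks:
            self.refcount[block_id] += 1
        return list(file_blocks)  # copies block ids, not data

    def write(self, file_blocks, index, data):
        block_id = file_blocks[index]
        if self.refcount[block_id] > 1:  # shared: copy on write
            self.refcount[block_id] -= 1
            file_blocks[index] = self._new_block(data)
        else:                            # not shared: no copy needed
            self.blocks[block_id] = data


store = CowStore()
original = store.create([b"A", b"B"])
snap = store.snapshot(original)    # no data copied yet
store.write(original, 0, b"Z")     # only block 0 is duplicated
print(len(store.blocks), store.blocks[snap[0]], store.blocks[original[0]])
# 3 b'A' b'Z'
```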
btrfs

History
• A file system based on the COW principle, initially designed at Oracle Corporation for use in Linux
• Development began in 2007; it has been marked as stable since November 2013
• Principal Btrfs author: Chris Mason
• Stated goal: “to let Linux scale for the storage that will be available. Scaling is not just about addressing the storage but also means being able to administer and to manage it with a clean interface.”
History & Basics
• Page, block: A 4KB contiguous region on disk and in memory. This is the standard Linux page size.
• Extent: A contiguous on-disk area. It is page aligned, and its length is a multiple of the page size.
• Copy-on-write (COW): Creating a new version of an extent or a page at a different location
• The data is loaded from disk to memory, modified, and then written elsewhere
• The original location is never updated in place, since a power failure mid-write could leave a partial update
BTRFS B-tree
• A generic structure with three types of data structures: keys, items, and block headers
• Block header: A fixed-size data structure that holds fields like checksums, flags, filesystem IDs, generation number, etc.
• Key: Describes an object address.
• Item: A key with additional offset and size fields.
• Internal tree nodes hold only [key, block-pointer] pairs
• Leaf nodes hold arrays of [item, data] pairs.
• The offset field describes data held in an extent.
BTRFS B-tree
• A leaf stores
• an array of items at the beginning
• a reverse-sorted data array at the end
• These arrays grow towards each other.
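A loose model of the leaf layout described here: item headers accumulate from the front of a fixed-size block while their data is packed backwards from the end, and the leaf is full when the two regions would meet. The `Leaf` class, the 25-byte header size, and the 100-byte capacity in the usage lines are assumptions for illustration; this sketch places data in arrival order and omits the block header, so it is not the Btrfs on-disk format.

```python
ITEM_HEADER_SIZE = 25  # assumed bytes per item header, illustrative only


class Leaf:
    def __init__(self, capacity=4096):
        self.buf = bytearray(capacity)
        self.items = []             # (key, data_offset, data_size), front region
        self.data_start = capacity  # data region grows backwards from the end

    def free_space(self):
        """Bytes left between the end of the items and the start of the data."""
        return self.data_start - len(self.items) * ITEM_HEADER_SIZE

    def insert(self, key, data):
        if ITEM_HEADER_SIZE + len(data) > self.free_space():
            raise ValueError("leaf is full; a real B-tree would split it")
        self.data_start -= len(data)
        self.buf[self.data_start:self.data_start + len(data)] = data
        self.items.append((key, self.data_start, len(data)))
        self.items.sort()  # keep items ordered by key

    def read(self, key):
        for k, offset, size in self.items:
            if k == key:
                return bytes(self.buf[offset:offset + size])
        raise KeyError(key)


leaf = Leaf(capacity=100)
leaf.insert(2, b"world")
leaf.insert(1, b"hello")
print(leaf.read(1), leaf.data_start, leaf.free_space())  # b'hello' 90 40
```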
BTRFS: Small files
• Small files that occupy less than one leaf block are packed into the B-tree inside the extent item
• Key offset is the byte offset of the data in the file
• The size field indicates how much data is stored
BTRFS: Large files
• Large files are stored in extents: contiguous on-disk areas that hold user data without additional headers or formatting
• An extent maintains a [disk block, disk num blocks] pair to record the area of disk corresponding to the file.
BTRFS: Directories
• A directory holds an array of dir_item elements
• Maps a file name (string) to a 64bit object_id
• Directory lookup index - [dir_item_key, filename 64bit hash]
BTRFS
• BTRFS does its own device management

[Figure: BTRFS compared with traditional file systems with DMs]
