CPSC 313: Computer Hardware and Operating Systems

Unit 4: File Systems
Representing files on disk

CPSC 313 1
Admin

Final examination
• Reserve your time on PrairieTest if you haven't already done so.

Quiz 3 retakes
• From yesterday until Saturday

Tutorial 8 is this week

Lab 8 has been released and is due Sunday November 17th.

Code for today is in the course code repository:
4.3-file-indexing
More admin

Next week:
• No tutorials.
• No lectures or office hours Monday, Tuesday and Wednesday.

Where we are

Unit Map:
• ...
• P19: File descriptors
• 4.2. File descriptor management
• P20: File system implementation overview
• 4.3. How we represent files
• P21: Why fixed-size block file systems?
• 4.4. Building a file index

Today

Today’s Learning Outcomes
• Explain how files might be represented on disk
• Use file system terminology:
  • Superblock
  • Inode
  • Inode number or inumber
  • Internal fragmentation
  • External fragmentation
• Compare and contrast different file representations and identify the tradeoffs among them.

[Figure: file system layers, top to bottom]
• POSIX API: hierarchical name space, byte streams; open, close, read, write
• Map: name to file metadata | Allocate and manage file descriptors
• Find location of file metadata | In-memory file object: VNODE
• Map: file offsets to disk blocks
• Persistent storage: numbered disk blocks, checksums and ECC, bad block handling

File System Design Goals and Constraints

Long-lived and robust
• Many files created, deleted, extended, and truncated over time
• Performance should not degrade with time (at least not too much)

General purpose
• Different file sizes; files can be sparse (two slides ahead)
• Different access patterns: sequential and arbitrary (“random”)

File System Design Goals and Constraints

Performance is dictated by the storage media hardware (rotating disk or SSD)
• A file is a collection of disk blocks that store its data and meta-data
• Seeking to a block can be much slower than transferring all of its data
• Where blocks are on the disk can really matter

Sparse Files

Purpose
• Some files represent data placed at particular offsets from a starting point…
• …potentially with large gaps in between…
• …rather than continuous streams of data.

Implementation Consideration
• E.g., write one byte at offset 0 and another at offset 2^30 - 1.
  • File’s size is 2^30 bytes
  • But storing that data requires only 2 disk blocks for the file’s data
• Allocating 2^30 bytes’ worth of blocks would waste valuable disk space
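The sparse-file scenario above can be tried from user space. The following is a minimal sketch, not course code: the helper names `write_sparse` and `demo` are made up for illustration, and whether the gap is actually left unallocated depends on the underlying file system.

```python
import os
import tempfile


def write_sparse(path, size):
    """Write one byte at offset 0 and one at offset size - 1.

    Returns (logical size, allocated bytes), where allocated bytes is None
    on platforms that do not report st_blocks.
    """
    with open(path, "wb") as f:
        f.write(b"A")       # first byte, at offset 0
        f.seek(size - 1)    # skip over the gap without writing to it
        f.write(b"Z")       # last byte, at offset size - 1
    st = os.stat(path)
    # On POSIX systems st_blocks counts 512-byte units actually allocated.
    blocks = getattr(st, "st_blocks", None)
    allocated = blocks * 512 if blocks is not None else None
    return st.st_size, allocated


def demo(size=2**30):
    """Create the sparse file in a temporary directory and report its sizes."""
    with tempfile.TemporaryDirectory() as d:
        return write_sparse(os.path.join(d, "sparse.bin"), size)
```

On a file system that supports sparse files, the allocated byte count reported by `demo()` should be far smaller than the logical size of 2^30 bytes; on one that does not, the two will be close.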
Layers of Abstraction

Application
• A stream of bytes

File
• A sequence of logical blocks
• Byte offset o falls in logical block L: LBN = floor(offset / file_system_block_size)

Disk
• A sequence of physical blocks (a collection of disk sectors or pages)
• A file is a collection of physical blocks
• PBN p is computed from LBN L using the inode block map
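The translation from a byte offset to a physical block can be sketched in a few lines. This is an illustration under assumptions: the block map is modelled as a plain Python list indexed by LBN, and the function names and block numbers are invented for the example.

```python
def lbn_of(offset, block_size):
    """Logical block number: floor(offset / file_system_block_size)."""
    return offset // block_size


def pbn_of(offset, block_size, block_map):
    """Physical block number holding the byte at `offset`.

    block_map is a hypothetical stand-in for the inode block map:
    block_map[lbn] is the PBN of logical block lbn.
    """
    return block_map[lbn_of(offset, block_size)]


def offset_in_block(offset, block_size):
    """Position of the byte within its block."""
    return offset % block_size
```

For example, with 4096-byte blocks and a block map of `[17, 42, 8]`, byte offset 5000 lies in logical block 1, which is stored in physical block 42, at position 904 within that block.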
Basic Data Structures and Concepts

Superblock
• Meta-data for the entire file system.
• Stored at specific disk locations (e.g., block # 1).
• To mount a file system, the OS must be able to read its superblock, so the superblock may be replicated in multiple places on disk.

Basic Data Structures and Concepts (cont)

Inode
• On-disk meta-data that describes a file
• Stores (the root of) the mapping from LBN to PBN (disk block #) and some other meta-data (file size, file permissions, etc.)
• Does not have a symbolic, human-readable name

Inumber
• Internal “name” of an inode
• File system can map an inumber to the disk block holding that file’s inode
• Run ls -i to see files’ inumbers

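Inumbers are also visible programmatically. The sketch below uses Python's `os.stat`, whose `st_ino` field reports the inode number, and creates a hard link to show that two human-readable names can refer to one inode. The helper names are made up for illustration, and the demo assumes the temporary directory's file system supports hard links.

```python
import os
import tempfile


def inumber(path):
    """Return the inode number that `path` currently refers to."""
    return os.stat(path).st_ino


def demo_hard_link():
    """Give one file two names and return the inumber seen through each."""
    with tempfile.TemporaryDirectory() as d:
        original = os.path.join(d, "a.txt")
        alias = os.path.join(d, "b.txt")
        with open(original, "w") as f:
            f.write("hello")
        os.link(original, alias)   # second directory entry, same inode
        return inumber(original), inumber(alias)
```

Both values returned by `demo_hard_link()` should be equal, which is consistent with the point that the name lives outside the inode.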
Let’s store something in a new file

Steps
• create: allocate a new inode, thus assigning the file an inumber
• write: allocate disk blocks and write data into those blocks

Questions to ask about how we represent files so we can find the disk block corresponding to each block of data in the file:
• What would the map from LBN to PBN look like?
• Does this handle small and large files well?
• How about sparse files?
• How well does it handle sequential and random access?

Strategy 1: Single-Extent-Based Allocation

Extent
• Definition: variable-sized contiguous collection of disk blocks
• Use: one extent per file; stores all of a file’s data

LBN to PBN map
• Simple and small; just store the following two things:
  • Block number of the extent’s first block
  • Total number of blocks in the extent
    (or just the size of the file, since we can divide by the block size to get the number of blocks)

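With a single extent, the whole LBN-to-PBN map reduces to one addition. A minimal sketch follows; the `Extent` class and function names are hypothetical and only meant to illustrate the two stored values.

```python
from dataclasses import dataclass


@dataclass
class Extent:
    start: int    # block number of the extent's first block
    length: int   # total number of blocks in the extent


def extent_pbn(extent, lbn):
    """Map a logical block number to a physical block number."""
    if not 0 <= lbn < extent.length:
        raise IndexError("logical block lies outside the extent")
    return extent.start + lbn


def blocks_needed(file_size, block_size):
    """Number of blocks for file_size bytes (ceiling division)."""
    return -(-file_size // block_size)
```

For an extent starting at block 100 with 8 blocks, logical block 3 is physical block 103. A 10,000-byte file with 4096-byte blocks needs 3 blocks, which is the division mentioned above rounded up.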
Evaluating Single-Extent-Based Allocation

Pros
• Inode is small for all file sizes
• Sequential access is optimized, matching hardware characteristics
• Random access needs ≤ 2 reads to get disk data (inode + data block)

Cons
• Handles sparse files poorly
• Does not match the POSIX file API
  • When you create a file in Unix you don’t tell the OS how big it will be
• Doesn’t handle file extension or truncation well
• CAUSES EXTERNAL FRAGMENTATION

External Fragmentation (with Single Extents)

Extents are variable-sized and are created and deleted arbitrarily
• Over time, large, contiguous free spaces become scarce
• They get fragmented into many smaller spaces

[Figure: a disk with allocated blocks interleaved with free space, compared against the size of the extent we need]

Even though there’s plenty of room for our new file overall, no contiguous free (grey) space is big enough!

NOTES:
(1) Recall from 213 that this is the same problem malloc implementations face.
(2) Single extents are used for read-only file systems (DVDs, Blu-rays).
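External fragmentation can be made concrete with a toy free-space bitmap. The sketch below is an illustration only; the bitmap layout and function names are assumptions, not a real allocator.

```python
def free_runs(bitmap):
    """Lengths of maximal runs of free blocks.

    bitmap[i] is True if block i is allocated, False if it is free.
    """
    runs, current = [], 0
    for allocated in bitmap:
        if allocated:
            if current:
                runs.append(current)
            current = 0
        else:
            current += 1
    if current:
        runs.append(current)
    return runs


def can_place_extent(bitmap, length):
    """True if some contiguous free run can hold an extent of `length` blocks."""
    return any(run >= length for run in free_runs(bitmap))
```

With the bitmap `[True, False, False, True, False, True, False, False, True, False]` there are 6 free blocks in total, but the free runs have lengths 2, 1, 2, and 1, so a 4-block extent cannot be placed even though the space exists overall.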
Strategy 2: Block-Based Allocation

Blocks
• Fixed-size units of allocation, thus eliminating external fragmentation
• A file might require many blocks

LBN to PBN map
• Must store the block number of every block of the file
• So, inodes for large files need to be large
  • We’ll talk about how to handle this issue next class

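A toy model of block-based allocation shows why any free block will do and why holes cost nothing. This is a sketch under assumptions: the `BlockFile` class is hypothetical, the free list is a plain Python list, and the block map is a dictionary so that unwritten logical blocks simply have no entry (a real inode block map is laid out differently).

```python
class BlockFile:
    """Toy file whose data blocks are allocated one fixed-size block at a time."""

    def __init__(self, free_blocks, block_size=4096):
        self.free_blocks = list(free_blocks)  # PBNs available for allocation
        self.block_size = block_size
        self.block_map = {}                   # LBN -> PBN; missing LBN = hole
        self.size = 0                         # logical file size in bytes

    def write_byte(self, offset):
        """Record a one-byte write at `offset`; return the PBN that holds it."""
        lbn = offset // self.block_size
        if lbn not in self.block_map:
            # Any free block works, so free space never needs to be contiguous.
            self.block_map[lbn] = self.free_blocks.pop(0)
        self.size = max(self.size, offset + 1)
        return self.block_map[lbn]
```

Writing at offset 0 and at offset 2^30 - 1 on a `BlockFile([9, 3, 27])` leaves the file with a logical size of 2^30 bytes but only two allocated blocks, matching the sparse-file example from earlier in the lecture.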
Evaluating Block-Based Allocation

Pros
• No external fragmentation (if I can fit one more block, I can grow a file by one more block).
• Matches Unix API; files start out empty.
• Easy to extend and truncate.
• Handles sparse files well.

Cons
• Not optimized for sequential access
  • Each block may be in a different location on disk
• Might not be optimized for random access for large files either.

Picking a Block Size

Pros of big blocks
• Better performance with sequential accesses (or high spatial locality, generally)
  • Must be at least big enough to amortize fixed access latency (e.g., seek time)
• Smaller inode block maps

Cons of big blocks
• More INTERNAL FRAGMENTATION
  • Last block of file might not be full
• Particularly problematic for small files …
  • … and lots of files are small

[Figure: a 64 KB block holding the last 100 bytes of a file; the rest of the block is wasted space]
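The cost of internal fragmentation is simple arithmetic: the space allocated in whole blocks minus the bytes actually used. The function below is an illustrative sketch with a made-up name.

```python
def internal_fragmentation(file_size, block_size):
    """Bytes wasted in the last block of a file of file_size bytes."""
    if file_size == 0:
        return 0                                # empty file: no blocks, no waste
    blocks = -(-file_size // block_size)        # ceiling division
    return blocks * block_size - file_size
```

A file whose last block holds only 100 bytes wastes 65,436 bytes of a 64 KB block but only 3,996 bytes of a 4 KB block, which is why large blocks hurt most when files are small.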
File sizes – Norm’s laptop in Fall 2022
• 10,972,857 files

Size                            Count   Percent
0 bytes                       269,251       2.5
> 0 bytes && <= 4K bytes    6,412,059      58.4
> 4K bytes && <= 8K bytes   1,324,083      12.1
> 8K bytes && <= 16K bytes  1,034,937       9.4
> 16K bytes && <= 1M bytes  1,600,065      14.6
> 1M bytes && <= 1G bytes     332,166       3.0
> 1G bytes                        296       0.0

File sizes – Norm’s laptop in Fall 2023
• 12,921,244 files

Size                            Count  Percent       Blocks  Blocks Percent
0 bytes                       275,553      2.1            0             0.0
> 0 bytes && <= 4K bytes    8,601,494     66.6    8,601,494             0.6
> 4K bytes && <= 8K bytes   1,043,689      8.1    2,087,378             0.2
> 8K bytes && <= 16K bytes    895,783      6.9    3,009,689             0.2
> 16K bytes && <= 1M bytes  1,698,584     13.1   53,202,543             4.0
> 1M bytes && <= 1G bytes     405,437      3.1  807,101,780            60.6
> 1G bytes                        704      0.0  457,898,242            34.4

In-class Exercise

Practice using knowledge of disks to compute performance of
different on-disk allocation strategies.

While some of the calculations are finicky, the important
takeaway is to observe what a dramatic effect layout has on
performance.

Similarly, observing this effect should give you intuition for why
solid-state drives (SSDs, aka flash) are so much faster than
spinning disks.
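To get a feel for the size of the effect, here is a rough back-of-the-envelope model. All numbers are assumptions chosen for illustration (12 ms to position the head, covering seek plus rotational delay, and a 100 MB/s transfer rate); they are not the parameters of the in-class exercise or of any particular drive.

```python
def read_time_seconds(n_blocks, block_size, contiguous,
                      position_s=0.012, transfer_bytes_per_s=100e6):
    """Estimated time to read n_blocks blocks from a spinning disk.

    contiguous=True  -> position the head once, then stream every block.
    contiguous=False -> pay the positioning cost again for each block.
    position_s and transfer_bytes_per_s are assumed, illustrative values.
    """
    transfer = n_blocks * block_size / transfer_bytes_per_s
    positionings = 1 if contiguous else n_blocks
    return positionings * position_s + transfer
```

Under these assumptions, reading 1000 contiguous 4 KB blocks takes about 0.053 seconds, while reading the same blocks scattered across the disk takes about 12 seconds: the transfer time is identical, and nearly all of the difference is positioning.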

Wrapping Up
There are advantages and disadvantages to different layouts, and our goal as file system developers* is to pick representations that offer a good set of tradeoffs.

* Note that we used to refer to you all as computer architects; in this unit you are becoming file system designers.

File Representation: Single Extent
[Figure: a file shown as an inode and its data]
• Pros:
  • really simple
  • good for both sequential and random access
  • very efficient (in terms of memory allocation)
  • relatively little internal fragmentation

File Representation: Single Extent
• Cons:
  • inflexible: what happens if a file changes size?
  • have to pre-allocate space at create time
  • lots of external fragmentation
  • wastes space for sparse files
  • Strict single-extent allocation is unrealistic!

File Representation: Fixed-size blocks
[Figure: an inode and its blocks]
• Pros:
  • eliminates external fragmentation
  • internal fragmentation can be reduced by choosing smaller block sizes in the design
  • easy to grow (and shrink) files
  • handles sparse files well

File Representation: Fixed-size blocks
• Cons:
  • requires a lot of metadata for big files
    • at least one disk address per block
  • sequential access could be slow
    • if blocks are scattered over disk
    • each block access could require an expensive seek

