0% found this document useful (0 votes)

5 views16 pages

11 Distributed File Systems

The document discusses distributed file systems, focusing on their architecture, operations, and challenges such as consistency, replication, and fault tolerance. It highlights the NFS architecture and compares versions, as well as introduces concepts like client-side caching and the Google File System's chunk-based approach. Additionally, it addresses high availability in peer-to-peer systems and the trade-offs between replication and erasure coding for fault tolerance.

Uploaded by

Yatru Harsha Hiski

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views16 pages

11 Distributed File Systems

Uploaded by

Yatru Harsha Hiski

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 16

Distributed Systems

Principles and Paradigms

Chapter 11
(version October 15, 2007)

Maarten van Steen

Vrije Universiteit Amsterdam, Faculty of Science
Dept. Mathematics and Computer Science
Room R4.20. Tel: (020) 598 7784
E-mail:[email protected], URL: www.cs.vu.nl/∼steen/
01 Introduction
02 Architectures
03 Processes
04 Communication
05 Naming
06 Synchronization
07 Consistency and Replication
08 Fault Tolerance
09 Security
10 Distributed Object-Based Systems
11 Distributed File Systems
12 Distributed Web-Based Systems
13 Distributed Coordination-Based Systems
00 – 1 /
Distributed File Systems

General goal: Try to make a file system transparently

available to remote clients.

Client Server

Requests from
client to access File stays
remote file on server

Remote access model

1. File moved to client

Client Server

Old file

New file

2. Accesses are
3. When client is done,
done on client
file is returned to
server

Upload/download model

11 – 1 Distributed File Systems/11.1 Architecture

Example: NFS Architecture
NFS is implemented using the Virtual File System
abstraction, which is now used for lots of different op-
erating systems:
Client Server

System call layer System call layer

Virtual file system Virtual file system

(VFS) layer (VFS) layer

Local file Local file

system interface NFS client NFS server system interface

RPC client RPC server

stub stub

Network

Essence: VFS provides standard file system inter-

face, and allows to hide difference between accessing
local or remote file system.

Question: Is NFS actually a file system?

11 – 2 Distributed File Systems/11.1 Architecture
NFS File Operations
Oper. v3 v4 Description
Create Yes No Create a regular file
Create No Yes Create a nonregular file
Link Yes Yes Create a hard link to a file
Symlink Yes No Create a symbolic link to a file
Mkdir Yes No Create a subdirectory
Mknod Yes No Create a special file
Rename Yes Yes Change the name of a file
Remove Yes Yes Remove a file from a file system
Rmdir Yes No Remove an empty subdirectory
Open No Yes Open a file
Close No Yes Close a file
Lookup Yes Yes Look up a file by means of a name
Readdir Yes Yes Read the entries in a directory
Readlink Yes Yes Read the path name in a symbolic link
Getattr Yes Yes Get the attribute values for a file
Setattr Yes Yes Set one or more file-attribute values
Read Yes Yes Read the data contained in a file
Write Yes Yes Write data to a file

Question: Anything unusual between v3 and v4?

11 – 3 Distributed File Systems/11.1 Architecture

Cluster-Based File Systems

Observation: When dealing with very large data col-

lections, following a simple client-server approach is
not going to work.

Solution 1: For speeding up file accesses, apply

striping techniques by which files can be fetched in
parallel:
File block of file a File block of file e

a b c d e
a b c d e
a b c d e

Whole-file distribution

a b a b a b
c e c d c d
d e e

File-striped system

11 – 4 Distributed File Systems/11.1 Architecture

Example: Google File System

Solution 2: Divide files in large 64 MB chunks, and

distribute/replicate chunks across many servers.

file name, chunk index

GFS client Master
contact address

Instructions Chunk-server state

Chunk ID, range

Chunk server Chunk server Chunk server
Chunk data
Linux file Linux file Linux file
system system system

A couple of important details:

• The master maintains only a (file name, chunk

server) table in main memory ⇒ minimal I/O
• Files are replicated using a primary-backup scheme;
the master is kept out of the loop

11 – 5 Distributed File Systems/11.1 Architecture

RPCs in File Systems

Observation: Many (traditional) distributed file sys-

tems deploy remote procedure calls to access files.
When wide-area networks need to be crossed, alter-
natives need to be exploited:

Client Server Client Server

LOOKUP
OPEN
LOOKUP READ

Lookup name Lookup name

Open file
READ
Read file data
Read file data
Time Time

(a) (b)

11 – 6 Distributed File Systems/11.3 Communication

Example: RPCs in Coda

Observation: When dealing with replicated files, se-

quentially sending information is not the way to go:

Client Client

Invalidate Reply Invalidate Reply

Server Server

Invalidate Reply Invalidate Reply

Client Client
Time Time
(a) (b)

Note: In Coda, clients can cache files, but will be in-

formed when an update has been performed.

11 – 7 Distributed File Systems/11.3 Communication

File Sharing Semantics (1/2)
Problem: When dealing with distributed file systems,
we need to take into account the ordering of concur-
rent read/write operations, and expected semantics (=
consistency).

Client machine #1

a b
Process
A
a b c

2. Write "c" 1. Read "ab"

File server
Original file
Single machine a b

a b
Process
A 3. Read gets "ab"
a b c
Client machine #2

Process
a b
B
Process
B
1. Write "c" 2. Read gets "abc"

(a) (b)

11 – 8 Distributed File Systems/11.5 Synchronization

File Sharing Semantics (2/2)

UNIX semantics: a read operation returns the effect

of the last write operation ⇒ can only be imple-
mented for remote access models in which there
is only a single copy of the file

Transaction semantics: the file system supports trans-

actions on a single file ⇒ issue is how to allow
concurrent access to a physically distributed file

Session semantics: the effects of read and write

operations are seen only by the client that has
opened (a local copy) of the file ⇒ what happens
when a file is closed (only one client may actually
win)

11 – 9 Distributed File Systems/11.5 Synchronization

Example: File Sharing in Coda

Essence: Coda assumes transactional semantics, but

without the full-fledged capabilities of real transactions.

Session S A
Client

Open(RD) File f Invalidate

Close
Server

Close
Open(WR) File f

Client

Time
Session S B

Note: Transactional issues reappear in the form of

“this ordering could have taken place.”

11 – 10 Distributed File Systems/11.5 Synchronization

Consistency and Replication

Observation: In modern distributed file systems, client-

side caching is the preferred technique for attaining
performance; server-side replication is done for fault
tolerance.

Observation: Clients are allowed to keep (large parts

of) a file, and will be notified when control is with-
drawn ⇒ servers are now generally stateful

1. Client asks for file

Client Server
2. Server delegates file
Old file

Local copy 3. Server recalls delegation

Updated file
4. Client sends returns file

11 – 11 Distributed File Systems/11.6 Consistency and Replication

Example:
Client-side Caching in Coda

Session S A Session SA
Client A
Open(RD) Close Close
Open(RD)
Invalidate
Server File f (callback break) File f

File f OK (no file transfer)

Open(WR)
Open(WR) Close Close
Client B
Time
Session S B Session S B

Note: By making use of transactional semantics, it

becomes possible to further improve performance.

11 – 12 Distributed File Systems/11.6 Consistency and Replication

Fault Tolerance

Observation: FT is handled by simply replicating file

servers, generally using a standard primary-backup
protocol:

Client Client
Primary server
for item x Backup server
W1 W5 R1 R2

W4 W4

W3 W3 Data store

W2 W3
W4

W1. Write request R1. Read request

W2. Forward request to primary R2. Response to read
W3. Tell backups to update
W4. Acknowledge update
W5. Acknowledge write completed

11 – 13 Distributed File Systems/11.7 Fault Tolerance

High Availability in P2P Systems

Problem: There are many fully decentralized file-sharing

systems, but because churn is high (i.e., nodes come
and go all the time), we may face an availability prob-
lem.

Solution: Replicate files all over the place (replica-

tion factor: rrep).

Alternative: Apply erasure coding:

• Partition a file F into m fragments, and recode into

a collection F ∗ of n > m fragments

• Property: any m fragments from F∗ are sufficient

to reconstruct F.

• Replication factor: rec = n/m

11 – 14 Distributed File Systems/11.7 Fault Tolerance

Replication vs. Erasure Coding

With an average node availability a, and required file

unavailability ǫ, we have for erasure coding:
rec ·m
rec · m i
1−ǫ= ∑ a (1 − a)rec ·m−i
i=m
i

and for file replication:

1 − ǫ = 1 − (1 − a)rrep

2.2
rreq
rec 2.0

1.8

1.6

1.4

0.2 0.4 0.6 0.8 1

Node availability

11 – 15 Distributed File Systems/11.7 Fault Tolerance

Lecture 7 updated_31dba56704691e95c69b11e04f5095dc
No ratings yet
Lecture 7 updated_31dba56704691e95c69b11e04f5095dc
28 pages
10 Distributed File Systems
No ratings yet
10 Distributed File Systems
27 pages
Distributed File System
No ratings yet
Distributed File System
68 pages
Distributed File Systems
No ratings yet
Distributed File Systems
43 pages
Dist_Sys_Unit_4_Notes
No ratings yet
Dist_Sys_Unit_4_Notes
45 pages
10 Distributedfs
No ratings yet
10 Distributedfs
35 pages
chap6
No ratings yet
chap6
54 pages
Network File System (NFS)
No ratings yet
Network File System (NFS)
31 pages
Distributed File System
100% (1)
Distributed File System
17 pages
DFS, PPT
No ratings yet
DFS, PPT
18 pages
Networked File System: CS 537 - Introduction To Operating Systems
No ratings yet
Networked File System: CS 537 - Introduction To Operating Systems
23 pages
06 dfs2
No ratings yet
06 dfs2
50 pages
Gytha John Harikrishnan Hridya S7Cse: Presented by
No ratings yet
Gytha John Harikrishnan Hridya S7Cse: Presented by
17 pages
Distributed System Based File System
No ratings yet
Distributed System Based File System
15 pages
He-Phan-Bo - Thoai-Nam - Distributedsystem - 16 - Fileservice - (Cuuduongthancong - Com)
No ratings yet
He-Phan-Bo - Thoai-Nam - Distributedsystem - 16 - Fileservice - (Cuuduongthancong - Com)
28 pages
Computer Networks-Lab: Hareem Aslam Hareem - Aslam@pucit - Edu.pk
No ratings yet
Computer Networks-Lab: Hareem Aslam Hareem - Aslam@pucit - Edu.pk
22 pages
04 en Network File Systems
No ratings yet
04 en Network File Systems
57 pages
Distributed File Systems
No ratings yet
Distributed File Systems
35 pages
Distributed Systems U4
No ratings yet
Distributed Systems U4
8 pages
Distributed File Systems
No ratings yet
Distributed File Systems
35 pages
Lecture24 DFS PartI 25nov 2014
No ratings yet
Lecture24 DFS PartI 25nov 2014
46 pages
Distributed File Systems
No ratings yet
Distributed File Systems
23 pages
Distributed File Systems: Arvind Krishnamurthy Spring 2001
No ratings yet
Distributed File Systems: Arvind Krishnamurthy Spring 2001
3 pages
Lecture 5 - DFS & NFS
No ratings yet
Lecture 5 - DFS & NFS
45 pages
3Distributed File System
No ratings yet
3Distributed File System
42 pages
Design Issues: Naming and Name Resolution
No ratings yet
Design Issues: Naming and Name Resolution
4 pages
L6 DFS
No ratings yet
L6 DFS
27 pages
Distributed File Systems
No ratings yet
Distributed File Systems
28 pages
Lecture 25: Distributed File Systems: Indranil Gupta (Indy)
No ratings yet
Lecture 25: Distributed File Systems: Indranil Gupta (Indy)
27 pages
5 Distributed File System
100% (1)
5 Distributed File System
59 pages
Distributed File Systems
No ratings yet
Distributed File Systems
6 pages
Distributed File System
No ratings yet
Distributed File System
7 pages
Week5 Dfs
No ratings yet
Week5 Dfs
13 pages
18-Distributed File Systems Study On Operating Systems
No ratings yet
18-Distributed File Systems Study On Operating Systems
24 pages
Distributed File Systems
No ratings yet
Distributed File Systems
38 pages
DFSNov 1
No ratings yet
DFSNov 1
36 pages
2distributed File System Dfs
No ratings yet
2distributed File System Dfs
21 pages
Andrew - Cmu.edu: Let's Start With A Familiar Example: Andrew 10,000s of People Terabytes of Disk
No ratings yet
Andrew - Cmu.edu: Let's Start With A Familiar Example: Andrew 10,000s of People Terabytes of Disk
7 pages
Reliable Distributed Systems
No ratings yet
Reliable Distributed Systems
44 pages
Lec 11 - Distributed Files - Distributed File System
No ratings yet
Lec 11 - Distributed Files - Distributed File System
33 pages
5.distributed File System
No ratings yet
5.distributed File System
86 pages
Presentation ON Distributed File System: Institute of Engineering and Technology Bundelkhand University
No ratings yet
Presentation ON Distributed File System: Institute of Engineering and Technology Bundelkhand University
51 pages
Distributed File Systems
No ratings yet
Distributed File Systems
42 pages
CSCI319 Distributed Systems
No ratings yet
CSCI319 Distributed Systems
26 pages
Issues in Distributed File Systems
No ratings yet
Issues in Distributed File Systems
10 pages
Distributed File Systems (DFS) : A Resource Management Component of A Distributed Operating System
No ratings yet
Distributed File Systems (DFS) : A Resource Management Component of A Distributed Operating System
16 pages
L8 DFS
No ratings yet
L8 DFS
35 pages
Caching: File Systems: Outline
No ratings yet
Caching: File Systems: Outline
25 pages
Distributed File Systems
No ratings yet
Distributed File Systems
31 pages
Distributed File System
No ratings yet
Distributed File System
43 pages
Distributed File Systems
No ratings yet
Distributed File Systems
107 pages
SIMPLEX TRUEALERT ES APPLIANCE FIRMWARE UPDATE INSTRUCTIONS
No ratings yet
SIMPLEX TRUEALERT ES APPLIANCE FIRMWARE UPDATE INSTRUCTIONS
13 pages
Distributed File Systems Concepts and e 61384
No ratings yet
Distributed File Systems Concepts and e 61384
54 pages
Distributed File Systems & Name Services: UNIT-4
No ratings yet
Distributed File Systems & Name Services: UNIT-4
70 pages
Linux Command Line for New Users: A Practical Guide with Examples
From Everand
Linux Command Line for New Users: A Practical Guide with Examples
William E. Clark
No ratings yet
Requirements For Distributed File Systems
No ratings yet
Requirements For Distributed File Systems
4 pages
Chapter 11: Distributed File Systems
No ratings yet
Chapter 11: Distributed File Systems
6 pages
Unit-3 Part1
No ratings yet
Unit-3 Part1
57 pages
Distributed File System - File Service Architecture
No ratings yet
Distributed File System - File Service Architecture
51 pages
KFlash 3.1 Customer Release Update INSTRUCTIONS 365-095-22108_x3
No ratings yet
KFlash 3.1 Customer Release Update INSTRUCTIONS 365-095-22108_x3
25 pages
Linux Proficiency Handbook: A Comprehensive Guide to Mastering System Administration
From Everand
Linux Proficiency Handbook: A Comprehensive Guide to Mastering System Administration
Adam Jones
No ratings yet
SE-ch3
No ratings yet
SE-ch3
43 pages
Distributed File Systems
No ratings yet
Distributed File Systems
50 pages
Ch12_1 Selected Pentium Instructions
No ratings yet
Ch12_1 Selected Pentium Instructions
84 pages
Ch15_1 MIPS Assembly Language
No ratings yet
Ch15_1 MIPS Assembly Language
33 pages
MPDR - ISUZU Edition Installer Operation Manual (E)
No ratings yet
MPDR - ISUZU Edition Installer Operation Manual (E)
13 pages
8085 Interfacing
No ratings yet
8085 Interfacing
9 pages
12 Distributed Web-Based Systems
No ratings yet
12 Distributed Web-Based Systems
22 pages
Mastering Linux: From Basics to Expert Proficiency
From Everand
Mastering Linux: From Basics to Expert Proficiency
William Smith
No ratings yet
SE - ch7
No ratings yet
SE - ch7
28 pages
Iiot Question Bank Main
No ratings yet
Iiot Question Bank Main
3 pages
Ch13_1 High-Level Language Interface
No ratings yet
Ch13_1 High-Level Language Interface
15 pages
SE-ch4
No ratings yet
SE-ch4
17 pages
SE - ch5
No ratings yet
SE - ch5
16 pages
SE - ch6
No ratings yet
SE - ch6
13 pages
vmce_v12_1
No ratings yet
vmce_v12_1
10 pages
Lec 23 CAOCache Memory
No ratings yet
Lec 23 CAOCache Memory
11 pages
Memory Banking in 8086 Microprocessor
No ratings yet
Memory Banking in 8086 Microprocessor
2 pages
2ECEg 4191 Chapter 2 - Application Layer
No ratings yet
2ECEg 4191 Chapter 2 - Application Layer
63 pages
Desktop C4jjelu
No ratings yet
Desktop C4jjelu
28 pages
Class 5 Computer Studies Types of Software
No ratings yet
Class 5 Computer Studies Types of Software
3 pages
VectorScribe Quick Guide
100% (1)
VectorScribe Quick Guide
10 pages
AVR Programming: Interrupts and Timers: Sven Gestegård Robertz Department of Computer Science Lund University Sweden
No ratings yet
AVR Programming: Interrupts and Timers: Sven Gestegård Robertz Department of Computer Science Lund University Sweden
18 pages
Examen 700 Teams
50% (2)
Examen 700 Teams
55 pages
Graph Algorithms
No ratings yet
Graph Algorithms
51 pages
Moore Machine and Mealy Machine
100% (2)
Moore Machine and Mealy Machine
25 pages
Simatic Price List: MLFB Description Catalog L-Price (Euro) L-Price (RMB) (Incl VAT) Discount Group
No ratings yet
Simatic Price List: MLFB Description Catalog L-Price (Euro) L-Price (RMB) (Incl VAT) Discount Group
128 pages
SGW1-IA3-MMP - Modbus Multiplexer Exemys
No ratings yet
SGW1-IA3-MMP - Modbus Multiplexer Exemys
23 pages
Case Study On Windows
No ratings yet
Case Study On Windows
8 pages
BCM7312 Micro Directv L11
100% (1)
BCM7312 Micro Directv L11
3 pages
MB Manual Ga-f2a88xm-Ds2 e
No ratings yet
MB Manual Ga-f2a88xm-Ds2 e
36 pages
4 02 0250 20010 Vci en A4
No ratings yet
4 02 0250 20010 Vci en A4
20 pages
Securing Domain Controllers To Improve Active Directory Security - Active Directory Security
No ratings yet
Securing Domain Controllers To Improve Active Directory Security - Active Directory Security
42 pages
AHV Admin Guide v6 0
No ratings yet
AHV Admin Guide v6 0
148 pages
Sitescope Installation
No ratings yet
Sitescope Installation
23 pages
Lesson 2
100% (1)
Lesson 2
41 pages
Amilo Pi 2550
No ratings yet
Amilo Pi 2550
3 pages
MYOB Installation Guide
No ratings yet
MYOB Installation Guide
2 pages
Chapter 13: Wired Lans - Ethernet: Week-04
No ratings yet
Chapter 13: Wired Lans - Ethernet: Week-04
3 pages
Augmented Reality
No ratings yet
Augmented Reality
8 pages
KEDIT Reference Manual
No ratings yet
KEDIT Reference Manual
446 pages
System Software Question Bank 2012 With Part-B Answers
75% (16)
System Software Question Bank 2012 With Part-B Answers
49 pages
JN0 643 Q&A Troytec
No ratings yet
JN0 643 Q&A Troytec
157 pages

11 Distributed File Systems

Uploaded by

11 Distributed File Systems

Uploaded by

Distributed Systems

Principles and Paradigms

Maarten van Steen

General goal: Try to make a file system transparently

Remote access model

1. File moved to client

11 – 1 Distributed File Systems/11.1 Architecture

System call layer System call layer

Virtual file system Virtual file system

Local file Local file

RPC client RPC server

Essence: VFS provides standard file system inter-

Question: Is NFS actually a file system?

Question: Anything unusual between v3 and v4?

11 – 3 Distributed File Systems/11.1 Architecture

Observation: When dealing with very large data col-

Solution 1: For speeding up file accesses, apply

11 – 4 Distributed File Systems/11.1 Architecture

Solution 2: Divide files in large 64 MB chunks, and

file name, chunk index

Instructions Chunk-server state

Chunk ID, range

A couple of important details:

• The master maintains only a (file name, chunk

11 – 5 Distributed File Systems/11.1 Architecture

Observation: Many (traditional) distributed file sys-

Client Server Client Server

Lookup name Lookup name

11 – 6 Distributed File Systems/11.3 Communication

Observation: When dealing with replicated files, se-

Invalidate Reply Invalidate Reply

Invalidate Reply Invalidate Reply

Note: In Coda, clients can cache files, but will be in-

11 – 7 Distributed File Systems/11.3 Communication

2. Write "c" 1. Read "ab"

11 – 8 Distributed File Systems/11.5 Synchronization

UNIX semantics: a read operation returns the effect

Transaction semantics: the file system supports trans-

Session semantics: the effects of read and write

11 – 9 Distributed File Systems/11.5 Synchronization

Essence: Coda assumes transactional semantics, but

Open(RD) File f Invalidate

Note: Transactional issues reappear in the form of

11 – 10 Distributed File Systems/11.5 Synchronization

Observation: In modern distributed file systems, client-

Observation: Clients are allowed to keep (large parts

1. Client asks for file

Local copy 3. Server recalls delegation

11 – 11 Distributed File Systems/11.6 Consistency and Replication

File f OK (no file transfer)

Note: By making use of transactional semantics, it

11 – 12 Distributed File Systems/11.6 Consistency and Replication

Observation: FT is handled by simply replicating file

W1. Write request R1. Read request

11 – 13 Distributed File Systems/11.7 Fault Tolerance

Problem: There are many fully decentralized file-sharing

Solution: Replicate files all over the place (replica-

Alternative: Apply erasure coding:

• Partition a file F into m fragments, and recode into

• Property: any m fragments from F∗ are sufficient

• Replication factor: rec = n/m

11 – 14 Distributed File Systems/11.7 Fault Tolerance

With an average node availability a, and required file

and for file replication:

0.2 0.4 0.6 0.8 1

11 – 15 Distributed File Systems/11.7 Fault Tolerance

You might also like