Distributed Systems
Principles and Paradigms
Maarten van Steen
VU Amsterdam, Dept. Computer Science
Room R4.20, [email protected]

Chapter 11: Distributed File Systems
Version: December 4, 2011
Contents
Chapter
01: Introduction
02: Architectures
03: Processes
04: Communication
05: Naming
06: Synchronization
07: Consistency & Replication
08: Fault Tolerance
09: Security
10: Distributed Object-Based Systems
11: Distributed File Systems
12: Distributed Web-Based Systems
13: Distributed Coordination-Based Systems
Distributed File Systems 11.1 Architecture
Distributed File Systems
General goal
Try to make a file system transparently available to remote clients.
[Figure: Two models for accessing remote files. In the remote access model, the file stays on the server and every client request operates on the remote file. In the upload/download model, (1) the file is moved to the client, (2) accesses are done on the client, and (3) when the client is done, the new file is returned to the server.]
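To make the contrast concrete, a minimal client-side sketch of a write under each model; the server object and its remote_write/fetch_file/store_file methods are hypothetical stand-ins, not a real protocol:

```python
# Hypothetical server interface, sketching the two access models.

def remote_access_write(server, path, offset, data):
    # Remote access model: the file stays on the server;
    # every operation becomes a request to the server.
    server.remote_write(path, offset, data)

def upload_download_write(server, path, offset, data):
    # Upload/download model: move the whole file to the client,
    # operate locally, ship the new version back when done.
    local = server.fetch_file(path)                              # 1. file moved to client
    local = local[:offset] + data + local[offset + len(data):]   # 2. access done locally
    server.store_file(path, local)                               # 3. file returned to server
```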
Example: NFS Architecture
NFS
NFS is implemented on top of the Virtual File System (VFS) abstraction, which is by now used in many different operating systems.
[Figure: Basic NFS architecture. On both client and server, a system call layer sits on top of a virtual file system (VFS) layer. On the client, the VFS dispatches either to the local file system interface or to the NFS client, which talks through an RPC client stub across the network to the server's RPC server stub, NFS server, and local file system interface.]
Example: NFS Architecture
Essence
VFS provides a standard file system interface, and allows hiding the difference between accessing a local or a remote file system.
Question
Is NFS actually a file system?
NFS File Operations
Oper. v3 v4 Description
Create Yes No Create a regular file
Create No Yes Create a nonregular file
Link Yes Yes Create a hard link to a file
Symlink Yes No Create a symbolic link to a file
Mkdir Yes No Create a subdirectory
Mknod Yes No Create a special file
Rename Yes Yes Change the name of a file
Remove Yes Yes Remove a file from a file system
Rmdir Yes No Remove an empty subdirectory
Open No Yes Open a file
Close No Yes Close a file
Lookup Yes Yes Look up a file by means of a name
Readdir Yes Yes Read the entries in a directory
Readlink Yes Yes Read the path name in a symbolic link
Getattr Yes Yes Get the attribute values for a file
Setattr Yes Yes Set one or more file-attribute values
Read Yes Yes Read the data contained in a file
Write Yes Yes Write data to a file
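As a hedged illustration of how these operations compose in v3 (note that Open and Close do not exist there), a sketch of pathname resolution followed by a read; the nfs object and its methods are hypothetical stand-ins for the RPC procedures, not a real NFS client:

```python
# Hypothetical stand-ins for NFS v3 RPC procedures; not a real library.

def read_by_path(nfs, root_fh, path, offset, count):
    # Names are resolved one component at a time with Lookup,
    # each call returning the file handle of the next component.
    fh = root_fh
    for component in path.strip("/").split("/"):
        fh = nfs.lookup(fh, component)
    # Getattr fetches the attributes, here used to clamp the read size.
    attrs = nfs.getattr(fh)
    count = min(count, attrs.size - offset)
    # Read operates directly on the handle; v3 has no Open/Close.
    return nfs.read(fh, offset, count)
```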
Cluster-Based File Systems
Observation
With very large data collections, following a simple client-server approach is not going to speed up file accesses ⇒ apply striping techniques by which files can be fetched in parallel (a sketch of a parallel striped read follows the figure).
[Figure: Distributing whole files (a through e) across three servers versus a file-striped system, in which each file's blocks are spread over the servers so that a single file can be fetched from all servers in parallel.]
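A minimal sketch of the parallel fetch that striping enables; the server objects, their read_block method, and the round-robin placement of block i on server i mod N are all assumptions:

```python
# Sketch of reading a striped file in parallel; not a real protocol.
from concurrent.futures import ThreadPoolExecutor

def striped_read(servers, filename, num_blocks):
    # Assumed placement: block i of the file lives on server i mod N,
    # so all servers can be read from concurrently.
    def fetch(i):
        return servers[i % len(servers)].read_block(filename, i)
    with ThreadPoolExecutor(max_workers=len(servers)) as pool:
        blocks = list(pool.map(fetch, range(num_blocks)))
    return b"".join(blocks)
```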
Example: Google File System
[Figure: Organization of a Google cluster of servers. A GFS client sends (file name, chunk index) to the master and gets back a contact address; chunk data itself (chunk ID, byte range) is exchanged directly with the chunk servers, each of which stores its chunks in a local Linux file system. The master only sends instructions to, and collects state from, the chunk servers.]
The Google solution
Divide files into large 64 MB chunks, and distribute/replicate chunks across many servers:
The master maintains only a (file name, chunk server) table in main memory ⇒ minimal I/O
Files are replicated using a primary-backup scheme; the master is kept out of the loop
A sketch of the resulting client read path follows below.
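This is a hedged sketch based only on the description above; master.locate and the chunk server's read method are hypothetical, not Google's actual API:

```python
# GFS-style read path, sketched; all interfaces are assumptions.

CHUNK_SIZE = 64 * 1024 * 1024  # 64 MB chunks

def gfs_read(master, filename, offset, length):
    data = b""
    while length > 0:
        chunk_index = offset // CHUNK_SIZE
        # 1. Ask the master only for the chunk's location: a cheap,
        #    in-memory table lookup on the master's side.
        chunk_server, chunk_id = master.locate(filename, chunk_index)
        # 2. Fetch the byte range directly from the chunk server;
        #    the master stays out of the data path.
        start = offset % CHUNK_SIZE
        n = min(length, CHUNK_SIZE - start)
        data += chunk_server.read(chunk_id, start, n)
        offset += n
        length -= n
    return data
```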
P2P-based File Systems
[Figure: Organization of the Ivy distributed file system. The file system layer (Ivy) runs on top of a block-oriented storage layer (DHash), which in turn runs on a DHT layer (Chord); one node acts as the root of the file system.]
Basic idea
Store data blocks in the underlying P2P system:
Every data block with content D is stored on a node with hash h(D) ⇒ allows for an integrity check.
Public-key blocks are signed with the associated private key and looked up with the public key.
A local log of file operations keeps track of ⟨blockID, h(D)⟩ pairs (a sketch follows below).
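A minimal sketch of content-hash blocks over a DHash-like put/get interface; the dht object and the choice of SHA-1 are assumptions:

```python
# Content-hash blocks, sketched; the dht interface is hypothetical.
import hashlib

def put_block(dht, data: bytes) -> bytes:
    # The block is stored under the hash of its own content ...
    key = hashlib.sha1(data).digest()
    dht.put(key, data)
    return key  # recorded as a <blockID, h(D)> pair in the local log

def get_block(dht, key: bytes) -> bytes:
    data = dht.get(key)
    # ... so any node can verify integrity by rehashing what it received.
    if hashlib.sha1(data).digest() != key:
        raise ValueError("block failed integrity check")
    return data
```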
Distributed File Systems 11.5 Synchronization
File sharing semantics
Problem
When dealing with distributed file systems, we need to take into account the ordering of concurrent read/write operations and the expected semantics (i.e., consistency).

[Figure: (a) On a single machine, when process B reads after process A has appended "c" to the original file "ab", it gets "abc". (b) In a distributed file system, client #1 reads "ab" from the file server and appends "c" to its local copy; a subsequent read by client #2 still returns the server's original "ab".]
File sharing semantics
Semantics
UNIX semantics: a read operation returns the effect of the last write operation ⇒ can be implemented only in remote access models in which there is a single copy of the file.
Transaction semantics: the file system supports transactions on a single file ⇒ the issue is how to allow concurrent access to a physically distributed file.
Session semantics: the effects of read and write operations are seen only by the client that has opened (a local copy of) the file ⇒ what happens when the file is closed (only one client's copy may actually win)? A sketch follows below.
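A minimal sketch of session semantics, assuming a server object with fetch/store methods; note how the last close silently wins:

```python
# Session semantics, sketched: all changes go to a local copy and
# become visible only on close, when the whole file is written back.

class Session:
    def __init__(self, server, path):
        self.server, self.path = server, path
        self.data = bytearray(server.fetch(path))  # open: download a copy

    def write(self, offset, chunk):
        # Updates are visible only inside this session until close.
        self.data[offset:offset + len(chunk)] = chunk

    def close(self):
        # Published atomically on close; with two concurrent sessions
        # on the same file, the one that closes last silently wins.
        self.server.store(self.path, bytes(self.data))
```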
Example: File sharing in Coda
Essence
Coda assumes transactional semantics, but without the full-fledged capabilities of real transactions. Note: transactional issues reappear in the form of the question whether "this ordering could have taken place."
[Figure: The transactional behavior in sharing files in Coda. One client opens file f for reading (session S_A) and receives a copy from the server. A second client then opens f for writing (session S_B) and closes. The server sends an invalidate to the first client, but session S_A runs to completion on its copy, as if the two sessions had been serialized.]
Distributed File Systems 11.6 Consistency and Replication
Consistency and replication
Observation
In modern distributed file systems, client-side caching is the preferred
technique for attaining performance; server-side replication is done for fault
tolerance.
Observation
Clients are allowed to keep (large parts of) a file, and are notified when control is withdrawn ⇒ servers are now generally stateful.
[Figure: File delegation. (1) The client asks for a file; (2) the server delegates the file, and the client works on a local copy; (3) when another client needs the file, the server recalls the delegation; (4) the client returns the (updated) file.]
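A hedged sketch of the stateful bookkeeping this requires on the server; the client object's recall() callback is a hypothetical stand-in for the actual recall mechanism:

```python
# Delegation bookkeeping, sketched; all interfaces are assumptions.

class DelegatingServer:
    def __init__(self, files):
        self.files = files   # path -> current contents
        self.holder = {}     # path -> client holding the delegation

    def open(self, client, path):
        other = self.holder.get(path)
        if other is not None and other is not client:
            # 3. Recall the delegation from the current holder, who
            # 4. returns its (possibly updated) copy of the file.
            self.files[path] = other.recall(path)
            del self.holder[path]
        # 1./2. Delegate the file to the requesting client.
        self.holder[path] = client
        return self.files[path]
```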
Example: Client-side caching in Coda
[Figure: Client-side caching in Coda. Client A opens file f for reading (session S_A) and receives f together with a callback promise. Client B then opens f for writing (session S_B) and closes; the server breaks A's callback with an invalidate. When A opens f again, the file is transferred anew; when B opens f for writing again, the server replies "OK (no file transfer)", since B's cached copy is still valid.]
Note
By making use of transactional semantics, it becomes possible to
further improve performance.
Example: Server-side replication in Coda
[Figure: Server-side replication in Coda. A file f is stored at servers S1, S2, and S3. A broken network partitions the servers: client A can reach only S1 and S2, client B only S3.]
Main issue
Ensure that concurrent updates are detected:
Each client has an Accessible Volume Storage Group (AVSG): the subset of the actual VSG that the client can reach.
Version vector: CVV_i(f)[j] = k means that server S_i knows that server S_j has seen version k of f.
Example: A updates f ⇒ S1 = S2 = [+1, +1, +0]; B updates f ⇒ S3 = [+0, +0, +1]. Neither resulting vector dominates the other, so the concurrent updates are detected as a conflict when the partition heals (see the sketch below).
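A minimal sketch of version-vector comparison and conflict detection; the list encoding and the initial vector [1, 1, 1] are assumptions consistent with the example:

```python
# CVV comparison, sketched; representation is an assumption.

def dominates(v, w):
    # v dominates w if it is at least as recent in every entry.
    return all(vi >= wi for vi, wi in zip(v, w))

def compare(v, w):
    if dominates(v, w):
        return "first is newer or equal"
    if dominates(w, v):
        return "second is newer"
    return "conflict: concurrent updates detected"

# A updates f at S1/S2 while B updates f at S3 during the partition:
cvv_s1 = [2, 2, 1]  # [1, 1, 1] after increments [+1, +1, +0]
cvv_s3 = [1, 1, 2]  # [1, 1, 1] after increments [+0, +0, +1]
print(compare(cvv_s1, cvv_s3))  # -> conflict: concurrent updates detected
```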
Distributed File Systems 11.7 Fault Tolerance
High availability in P2P systems
Problem
There are many fully decentralized file-sharing systems, but because churn is high (i.e., nodes come and go all the time), we may face an availability problem ⇒ replicate files all over the place (replication factor r_rep).
Alternative
Apply erasure coding:
Partition a file F into m fragments, and recode them into a collection of n > m fragments.
Property: any m fragments from the collection are sufficient to reconstruct F.
Replication factor: r_ec = n/m. A minimal sketch of the idea follows below.
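A minimal sketch using the simplest possible erasure code, a single XOR parity fragment (n = m + 1, so r_ec = (m + 1)/m, tolerating one lost fragment); real systems use stronger codes such as Reed-Solomon:

```python
# Simplest erasure code: m data fragments plus one XOR parity fragment.
from functools import reduce

def xor_bytes(a, b):
    return bytes(x ^ y for x, y in zip(a, b))

def encode(data_fragments):
    # n = m + 1 fragments: the data fragments plus their XOR parity.
    return data_fragments + [reduce(xor_bytes, data_fragments)]

def recover(fragments):
    # fragments holds n entries, at most one of them None (lost);
    # XORing the m survivors reconstructs the missing one.
    survivors = [f for f in fragments if f is not None]
    missing = reduce(xor_bytes, survivors)
    return [missing if f is None else f for f in fragments]

# Example: m = 3 fragments, lose one, recover it from any m survivors.
frags = encode([b"dist", b"ribu", b"ted!"])
frags[1] = None
assert recover(frags)[1] == b"ribu"
```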
Replication vs. erasure coding
Comparison
With an average node availability a and a required file unavailability ε, we have for erasure coding:

$1 - \varepsilon = \sum_{i=m}^{r_{ec} m} \binom{r_{ec} m}{i} a^i (1-a)^{r_{ec} m - i}$

and for file replication:

$1 - \varepsilon = 1 - (1-a)^{r_{rep}}$

[Figure: comparison of the required replication factors r_rep and r_ec as a function of node availability a (0.2 to 1; vertical axis roughly 1.4 to 2.2); erasure coding consistently requires the smaller factor.]
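A numeric check of the two formulas above (the parameter values are only an illustration): with the same storage overhead of 3, erasure coding already beats plain replication at a = 0.5.

```python
# Evaluating the two availability formulas; math.comb is standard Python.
from math import comb

def avail_erasure(a, m, r_ec):
    # Probability that at least m of the n = r_ec * m fragments sit on
    # available nodes, i.e. that the file can be reconstructed.
    n = round(r_ec * m)
    return sum(comb(n, i) * a**i * (1 - a)**(n - i) for i in range(m, n + 1))

def avail_replication(a, r_rep):
    # Probability that at least one of the r_rep replicas is available.
    return 1 - (1 - a)**r_rep

a = 0.5  # each node is up half the time
print(avail_replication(a, r_rep=3))   # ~0.875
print(avail_erasure(a, m=5, r_ec=3))   # ~0.941, same storage overhead
```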