0% found this document useful (0 votes)

70 views18 pages

Distributed File Systems

This document discusses distributed file systems and some of the key design considerations. It describes common file access models like upload/download and remote access. It also covers issues like file server design, file sharing semantics, file usage patterns, and caching approaches. Overall, the document analyzes different techniques for providing transparent access to remote files while balancing consistency, performance, and network efficiency.

Uploaded by

sycopath

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

70 views18 pages

Distributed File Systems

Uploaded by

sycopath

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 18

Distributed Systems

Distributed File Systems

Paul Krzyzanowski
[email protected]

Except as otherwise noted, the content of this presentation is licensed under the Creative Commons
Attribution 2.5 License.
Page 1
Accessing files
FTP, telnet:
–  Explicit access
–  User-directed connection to access remote
resources

We want more transparency

–  Allow user to access remote resources just as local
ones

Focus on file system for now

NAS: Network Attached Storage

Page 2
File service types
Upload/Download model
–  Read file: copy file from server to client
–  Write file: copy file from client to server

Advantage
–  Simple

Problems
–  Wasteful: what if client needs small piece?
–  Problematic: what if client doesn’t have enough space?
–  Consistency: what if others need to modify the same file?

Page 3
File service types
Remote access model
File service provides functional interface:
–  create, delete, read bytes, write bytes, etc…

Advantages:
–  Client gets only what’s needed
–  Server can manage coherent view of file system

Problem:
–  Possible server and network congestion
•  Servers are accessed for duration of file access
•  Same data may be requested repeatedly

Page 4
File server
File Directory Service
–  Maps textual names for file to internal locations
that can be used by file service

File service
–  Provides file access interface to clients

Client module (driver)

–  Client side interface for file and directory service
–  if done right, helps provide access transparency
e.g. under vnode layer

Page 5
Semantics of
file sharing

Page 6
Sequential semantics
Read returns result of last write
Easily achieved if
–  Only one server
–  Clients do not cache data
BUT
–  Performance problems if no cache
•  Obsolete data
–  We can write-through
•  Must notify clients holding copies
•  Requires extra state, generates extra traffic

Page 7
Session semantics
Relax the rules
•  Changes to an open file are initially visible
only to the process (or machine) that
modified it.
•  Last process to modify the file wins.

Page 8
Other solutions
Make files immutable
–  Aids in replication
–  Does not help with detecting modification

Or...
Use atomic transactions
–  Each file access is an atomic transaction
–  If multiple transactions start concurrently
•  Resulting modification is serial

Page 9
File usage patterns
•  We can’t have the best of all worlds
•  Where to compromise?
–  Semantics vs. efficiency
–  Efficiency = client performance, network traffic,
server load
•  Understand how files are used
•  1981 study by Satyanarayanan

Page 10
File usage
Most files are <10 Kbytes
–  2005: average size of 385,341 files on my Mac =197 KB
–  2007: average size of 440,519 files on my Mac =451 KB
–  (files accessed within 30 days: 15, 792 files
80% of files are <47KB)
–  Feasible to transfer entire files (simpler)
–  Still have to support long files
Most files have short lifetimes
–  Perhaps keep them local
Few files are shared
–  Overstated problem
–  Session semantics will cause no problem most of
the time

Page 11
System design issues

Page 12
How do you access them?
•  Access remote files as local files
•  Remote FS name space should be
syntactically consistent with local name
space
1.  redefine the way all files are named and provide a
syntax for specifying remote files
•  e.g. //server/dir/file
•  Can cause legacy applications to fail
2.  use a file system mounting mechanism
•  Overlay portions of another FS name space over local
name space
•  This makes the remote name space look like it’s part of
the local name space

Page 13
Stateful or stateless design?
Stateful
–  Server maintains client-specific state
•  Shorter requests
•  Better performance in processing requests
•  Cache coherence is possible
–  Server can know who’s accessing what
•  File locking is possible

Page 14
Stateful or stateless design?
Stateless
–  Server maintains no information on client accesses
•  Each request must identify file and offsets
•  Server can crash and recover
–  No state to lose
•  Client can crash and recover
•  No open/close needed
–  They only establish state
•  No server space used for state
–  Don’t worry about supporting many clients
•  Problems if file is deleted on server
•  File locking not possible

Page 15
Caching
Hide latency to improve performance for
repeated accesses

Four places
–  Server’s disk
–  Server’s buffer cache
WARNING:
–  Client’s buffer cache cache consistency
–  Client’s disk problems

Page 16
Approaches to caching
•  Write-through
–  What if another client reads its own (out-of-date) cached
copy?
–  All accesses will require checking with server
–  Or … server maintains state and sends invalidations

•  Delayed writes (write-behind)

–  Data can be buffered locally (watch out for consistency –
others won’t see updates!)
–  Remote files updated periodically
–  One bulk wire is more efficient than lots of little writes
–  Problem: semantics become ambiguous

Page 17
Approaches to caching
•  Read-ahead (prefetch)
–  Request chunks of data before it is needed.
–  Minimize wait when it actually is needed.

•  Write on close
–  Admit that we have session semantics.

•  Centralized control
–  Keep track of who has what open and cached on
each node.
–  Stateful file system with signaling traffic.

Page 18

3distributed File System
No ratings yet
3distributed File System
42 pages
04 en Network File Systems
No ratings yet
04 en Network File Systems
57 pages
Andrew - Cmu.edu: Let's Start With A Familiar Example: Andrew 10,000s of People Terabytes of Disk
No ratings yet
Andrew - Cmu.edu: Let's Start With A Familiar Example: Andrew 10,000s of People Terabytes of Disk
7 pages
Distributed File System Implementation
100% (1)
Distributed File System Implementation
30 pages
Requirements For Distributed File Systems
No ratings yet
Requirements For Distributed File Systems
4 pages
Distributed-File Systems Background
No ratings yet
Distributed-File Systems Background
9 pages
WINSEM2012-13 CP0029 06-Mar-2013 RM01 DFT 2
No ratings yet
WINSEM2012-13 CP0029 06-Mar-2013 RM01 DFT 2
46 pages
Other File Systems: LFS, NFS, and Afs
No ratings yet
Other File Systems: LFS, NFS, and Afs
37 pages
DFS Design and Implementation
No ratings yet
DFS Design and Implementation
40 pages
Unit-3 Part1
No ratings yet
Unit-3 Part1
57 pages
DFS Design and Implementation: Brent R. Hafner
No ratings yet
DFS Design and Implementation: Brent R. Hafner
40 pages
Reliable Distributed Systems
No ratings yet
Reliable Distributed Systems
44 pages
Presentation ON Distributed File System: Institute of Engineering and Technology Bundelkhand University
No ratings yet
Presentation ON Distributed File System: Institute of Engineering and Technology Bundelkhand University
51 pages
Distributed File Systems
No ratings yet
Distributed File Systems
6 pages
Distributed File Systems
No ratings yet
Distributed File Systems
28 pages
Ds 2016 17 Lec17
No ratings yet
Ds 2016 17 Lec17
32 pages
5.distributed File Systems
No ratings yet
5.distributed File Systems
47 pages
Distributed File Systems
No ratings yet
Distributed File Systems
31 pages
Distributed File System
No ratings yet
Distributed File System
43 pages
03-1 File Systems
No ratings yet
03-1 File Systems
9 pages
Distributed File Systems
No ratings yet
Distributed File Systems
42 pages
Distributed File Systems
No ratings yet
Distributed File Systems
107 pages
CS2510 00 Distributed Storage Overview
No ratings yet
CS2510 00 Distributed Storage Overview
53 pages
AFS Presentation
No ratings yet
AFS Presentation
36 pages
Lecture 25: Distributed File Systems: Indranil Gupta (Indy)
No ratings yet
Lecture 25: Distributed File Systems: Indranil Gupta (Indy)
27 pages
L8 DFS
No ratings yet
L8 DFS
35 pages
Distributed File Systems (DFS) : A Resource Management Component of A Distributed Operating System
No ratings yet
Distributed File Systems (DFS) : A Resource Management Component of A Distributed Operating System
16 pages
Distributed File Systems & Name Services: UNIT-4
No ratings yet
Distributed File Systems & Name Services: UNIT-4
70 pages
Distributed File Systems
No ratings yet
Distributed File Systems
38 pages
He-Phan-Bo - Thoai-Nam - Distributedsystem - 16 - Fileservice - (Cuuduongthancong - Com)
No ratings yet
He-Phan-Bo - Thoai-Nam - Distributedsystem - 16 - Fileservice - (Cuuduongthancong - Com)
28 pages
Distributed File Systems
No ratings yet
Distributed File Systems
50 pages
DISTRIBUTEDFILESYS
No ratings yet
DISTRIBUTEDFILESYS
16 pages
File Models and File Accessing Models: Prepared By: Mehta Ishani 1300407010030
No ratings yet
File Models and File Accessing Models: Prepared By: Mehta Ishani 1300407010030
18 pages
Module 2
No ratings yet
Module 2
27 pages
06 dfs2
No ratings yet
06 dfs2
50 pages
L6 DFS
No ratings yet
L6 DFS
27 pages
DFSNov 1
No ratings yet
DFSNov 1
36 pages
CSCI319 Distributed Systems
No ratings yet
CSCI319 Distributed Systems
26 pages
Gytha John Harikrishnan Hridya S7Cse: Presented by
No ratings yet
Gytha John Harikrishnan Hridya S7Cse: Presented by
17 pages
Distributed File Systems
No ratings yet
Distributed File Systems
6 pages
Atria Institute of Technology: File System Mounting and File Sharing
No ratings yet
Atria Institute of Technology: File System Mounting and File Sharing
24 pages
5.distributed File System
No ratings yet
5.distributed File System
86 pages
P2P File Sharing
No ratings yet
P2P File Sharing
43 pages
Discrete Computing
No ratings yet
Discrete Computing
25 pages
Distributed File System
No ratings yet
Distributed File System
49 pages
Lecture 5 - DFS & NFS
No ratings yet
Lecture 5 - DFS & NFS
45 pages
18-Distributed File Systems Study On Operating Systems
No ratings yet
18-Distributed File Systems Study On Operating Systems
24 pages
Distributed Systems U4
No ratings yet
Distributed Systems U4
8 pages
Lec25 Distfiles
No ratings yet
Lec25 Distfiles
25 pages
Final Suggestions Dos - 605B
No ratings yet
Final Suggestions Dos - 605B
17 pages
Distributed File Systems
No ratings yet
Distributed File Systems
35 pages
Caching in Distributed File System: Ke Wang CS614 - Advanced System Apr 24, 2001
No ratings yet
Caching in Distributed File System: Ke Wang CS614 - Advanced System Apr 24, 2001
56 pages
Chapter 8
No ratings yet
Chapter 8
30 pages
DFS
No ratings yet
DFS
37 pages
Lecture 08
No ratings yet
Lecture 08
25 pages
CC U3
No ratings yet
CC U3
40 pages
Design and Implementation of The Sun Network Filesystem: R. Sandberg, D. Goldberg S. Kleinman, D. Walsh, R. Lyon
No ratings yet
Design and Implementation of The Sun Network Filesystem: R. Sandberg, D. Goldberg S. Kleinman, D. Walsh, R. Lyon
34 pages
PHPR OZ12 B
No ratings yet
PHPR OZ12 B
31 pages
What Is RAC
No ratings yet
What Is RAC
9 pages
Attix 5 Backup
No ratings yet
Attix 5 Backup
30 pages
430TX
No ratings yet
430TX
83 pages
Proxy Kampret
No ratings yet
Proxy Kampret
4 pages
ASP Questions
No ratings yet
ASP Questions
16 pages
IFI MAIGO PARISH RMS 2 Updated
No ratings yet
IFI MAIGO PARISH RMS 2 Updated
32 pages
Testbank ch01
No ratings yet
Testbank ch01
14 pages
Ibm Tivoli Maximo DG PDF
No ratings yet
Ibm Tivoli Maximo DG PDF
16 pages
Glossary
No ratings yet
Glossary
50 pages
HUAWEI 4G Router 3 Prime Démarrage Rapide - (B818-263,01, FR)
No ratings yet
HUAWEI 4G Router 3 Prime Démarrage Rapide - (B818-263,01, FR)
52 pages
PDF Chat Report
No ratings yet
PDF Chat Report
148 pages
Module 3:the Memory System: Courtesy: Text Book: Carl Hamacher 5 Edition
No ratings yet
Module 3:the Memory System: Courtesy: Text Book: Carl Hamacher 5 Edition
73 pages
Cs3351 DC Ques Bank
No ratings yet
Cs3351 DC Ques Bank
10 pages
Ebook Scaling RAG Systems From POC To Production - 2025
No ratings yet
Ebook Scaling RAG Systems From POC To Production - 2025
28 pages
Advanced Java Unit 4 Digital Notes
No ratings yet
Advanced Java Unit 4 Digital Notes
47 pages
System Design
No ratings yet
System Design
4 pages
Abraham Silberschatz-Operating System Concepts (9th, 2012 - 12) - 460-463, 9.8
No ratings yet
Abraham Silberschatz-Operating System Concepts (9th, 2012 - 12) - 460-463, 9.8
4 pages
Modern Web Application Architecture Overview
No ratings yet
Modern Web Application Architecture Overview
9 pages
Module 9 - Caching Information For Scalability
No ratings yet
Module 9 - Caching Information For Scalability
75 pages
486 Unp 301
No ratings yet
486 Unp 301
50 pages
System Global Area (SGA) Part 1: What We Will Learn in This Lecture?
No ratings yet
System Global Area (SGA) Part 1: What We Will Learn in This Lecture?
6 pages
Oracle Advanced Security Internals Demonstrating Network Encryption
100% (1)
Oracle Advanced Security Internals Demonstrating Network Encryption
10 pages
Cheatsheet Midterm1
No ratings yet
Cheatsheet Midterm1
2 pages
Redis Guide How To Use
No ratings yet
Redis Guide How To Use
8 pages
Concrete Architecture of The Linux Kernel
No ratings yet
Concrete Architecture of The Linux Kernel
34 pages
Unit-3 - Software Architecture Analysis
No ratings yet
Unit-3 - Software Architecture Analysis
137 pages
Project 1 (100 PTS) Due 10/01/2022: CSCI 6461 Computer Architecture II Fall 2022
No ratings yet
Project 1 (100 PTS) Due 10/01/2022: CSCI 6461 Computer Architecture II Fall 2022
3 pages
Designing Pastebin - Grokking The System Design Interview
No ratings yet
Designing Pastebin - Grokking The System Design Interview
9 pages
Anatella Quick Guide
No ratings yet
Anatella Quick Guide
159 pages
4 Internal Representation of Files
No ratings yet
4 Internal Representation of Files
12 pages

Distributed File Systems

Uploaded by

Distributed File Systems

Uploaded by

Distributed Systems

Distributed File Systems

We want more transparency

Focus on file system for now

Client module (driver)

• Delayed writes (write-behind)

You might also like

•  Delayed writes (write-behind)