
15-440 Distributed Systems

Distributed File Systems 2

Lecture 06, Sept 20th 2016

1
Logistical Updates

• P0
• Due date: Midnight EST 9/22 (Thursday)
• NOTE: We will not accept P0 after Midnight EST 9/24
• Each late day constitutes a 10% penalty (max -20%)
• Attend office hours in case you are having trouble
• Solutions for P0 discussed next Monday
• Recitation sections on Monday 9/26
• Learn about good solutions to P0
• May help you learn how to structure Go code (for P1)
• P1 Released!
• For deadlines, the class page is the most up to date

2
Review of Last Lecture

• Distributed file systems functionality


• Implementation mechanisms example
• Client side: VFS interception in kernel
• Communications: RPC
• Server side: service daemons
• Design choices
• Topic 1: client-side caching
• NFS and AFS

3
Today's Lecture

• DFS design comparisons continued


• Topic 2: file access consistency
• NFS, AFS
• Topic 3: name space construction
• Mount (NFS) vs. global name space (AFS)
• Topic 4: Security in distributed file systems
• Kerberos

• Other types of DFS


• Coda – disconnected operation
• LBFS – weakly connected operation

4
Topic 2: File Access Consistency

• In a UNIX local file system, concurrent file reads and
writes have “sequential” consistency semantics
• Each file read/write from user-level app is an atomic
operation
• The kernel locks the file vnode
• Each file write is immediately visible to all file readers
• Neither NFS nor AFS provides such concurrency
control
• NFS: “sometime within 30 seconds”
• AFS: session semantics for consistency

5
Session Semantics in AFS v2

• What it means:
• A file write is visible to processes on the same box
immediately, but not visible to processes on other
machines until the file is closed
• When a file is closed, changes are visible to new
opens, but are not visible to “old” opens
• All other file operations are visible everywhere
immediately
• Implementation
• Dirty data are buffered at the client machine until file
close, then flushed back to server, which leads the
server to send “break callback” to other clients
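
A minimal Go sketch of the server side of this scheme (the names Fetch/Store and the in-memory maps are illustrative, not the real AFS RPC interface): when a client flushes a dirty file back at close time, the server breaks the callbacks held by every other client.

```go
package main

import "fmt"

// Sketch of close-to-open consistency on the server side: Fetch registers
// a callback promise for the caller; Store (the flush at close time)
// installs the new contents and "breaks" the callbacks held by every other
// client, so their cached copies are known to be stale.
type server struct {
	data      map[string][]byte
	callbacks map[string]map[string]bool // file -> clients holding a callback
}

func (s *server) Fetch(file, client string) []byte {
	if s.callbacks[file] == nil {
		s.callbacks[file] = map[string]bool{}
	}
	s.callbacks[file][client] = true
	return s.data[file]
}

func (s *server) Store(file, client string, contents []byte) {
	s.data[file] = contents
	for other := range s.callbacks[file] {
		if other != client {
			fmt.Println("break callback:", file, "->", other) // stand-in for the callback RPC
			delete(s.callbacks[file], other)
		}
	}
}

func main() {
	s := &server{
		data:      map[string][]byte{"/afs/demo/notes": []byte("v1")},
		callbacks: map[string]map[string]bool{},
	}
	s.Fetch("/afs/demo/notes", "clientA")
	s.Fetch("/afs/demo/notes", "clientB")
	s.Store("/afs/demo/notes", "clientA", []byte("v2")) // clientB loses its callback
}
```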

6
AFS Write Policy

• Writeback cache
• Opposite of NFS “every write is sacred”
• Store chunk back to server
• When cache overflows
• On last user close()
• ...or don't (if client machine crashes)
• Is writeback crazy?
• Write conflicts “assumed rare”
• Who wants to see a half-written file?

7
Results for AFS

• Lower server load than NFS


• More files cached on clients
• Callbacks: server not busy if files are read-only (common
case)
• But maybe slower: Access from local disk is much
slower than from another machine’s memory over
LAN
• For both:
• Central server is a bottleneck: all reads and writes hit it at
least once;
• it is a single point of failure;
• it is costly to make servers fast, beefy, and reliable.
Topic 3: Name-Space
Construction and Organization
• NFS: per-client linkage
• Server: export /root/fs1/
• Client: mount server:/root/fs1 /fs1
• AFS: global name space
• Name space is organized into Volumes
• Global directory /afs;
• /afs/cs.wisc.edu/vol1/…; /afs/cs.stanford.edu/vol1/…
• Each file is identified as fid = <vol_id, vnode #, unique
identifier>
• All AFS servers keep a copy of “volume location database”,
which is a table of vol_id → server_ip mappings
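
A small Go sketch of this naming scheme; the field names, server addresses, and lookup function are illustrative stand-ins for the real AFS structures and volume location database protocol.

```go
package main

import "fmt"

// fid follows the slide's definition: <vol_id, vnode #, uniquifier>.
type fid struct {
	VolID  uint32
	Vnode  uint32
	Unique uint32
}

// Every AFS server keeps a copy of the volume location database,
// a table of vol_id -> server mappings.
var volumeLocationDB = map[uint32]string{
	1: "afs1.cs.wisc.edu",
	2: "afs2.cs.wisc.edu",
}

func serverFor(f fid) (string, bool) {
	addr, ok := volumeLocationDB[f.VolID]
	return addr, ok
}

func main() {
	f := fid{VolID: 2, Vnode: 17, Unique: 4}
	if addr, ok := serverFor(f); ok {
		// Moving a volume only means updating volumeLocationDB,
		// which is why clients never need to "remount" anything.
		fmt.Printf("fetch %+v from %s\n", f, addr)
	}
}
```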

9
Implications on Location
Transparency
• NFS: no transparency
• If a directory is moved from one server to another, client
must remount

• AFS: transparency
• If a volume is moved from one server to another, only
the volume location database on the servers needs to
be updated

10
Naming in NFS (1)

• Figure 11-11. Mounting (part of) a remote file system in NFS.

No naming transparency, since both clients have the files
(e.g. mbox) stored in different hierarchical namespaces.
11
Naming in NFS (2)

Problem: Requires iterative name lookups and mounting filesystems


Solution? NFSv4 deals with this using automounting and other primitives
12
Implications on Location
Transparency
• NFS: no transparency
• If a directory is moved from one server to another, client
must remount

• AFS: transparency
• If a volume is moved from one server to another, only
the volume location database on the servers needs to
be updated

13
Topic 4: User Authentication and
Access Control
• User X logs onto workstation A, wants to access files
on server B
• How does A tell B who X is?
• Should B believe A?
• Choices made in NFS V2
• All servers and all client workstations share the same <uid,
gid> name space → A sends X’s <uid, gid> to B
• Problem: root access on any client workstation can lead
to creation of users of arbitrary <uid, gid>
• Server believes client workstation unconditionally
• Problem: if any client workstation is broken into, the
protection of data on the server is lost;
• <uid, gid> sent in clear-text over wire → request packets
can be faked easily

14
User Authentication (cont’d)

• How do we fix the problems in NFS v2


• Hack 1: root remapping → strange behavior
• Hack 2: UID remapping → no user mobility
• Real Solution: use a centralized
Authentication/Authorization/Access-control (AAA)
system

15
A Better AAA System: Kerberos

• Basic idea: shared secrets


• User proves to KDC who he is; KDC generates shared
secret between client and file server

[Diagram: the client tells the KDC (ticket server) “Need to access fs”;
the KDC generates S and returns [S]Kclient (S encrypted with the client’s
key) together with a ticket [S]Kfs for the file server.]

S: specific to {client, fs} pair; “short-term session key”; expiration time (e.g. 8 hours)
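
A minimal Go sketch of this exchange. The types and the issue function are illustrative, and sealing the two copies of S under the client’s and file server’s long-term keys is only indicated in comments rather than implemented.

```go
package main

import (
	"crypto/rand"
	"fmt"
	"time"
)

// ticket carries the session key S and its lifetime. In real Kerberos the
// copy for the client is encrypted under the client's long-term key and the
// copy for the file server under the file server's key.
type ticket struct {
	SessionKey []byte
	Client     string
	Service    string
	Expires    time.Time
}

type kdc struct {
	longTermKeys map[string][]byte // one shared secret per registered principal
}

// issue mints a fresh session key S for the {client, service} pair and
// returns one copy for each party.
func (k *kdc) issue(client, service string) (ticket, ticket, error) {
	if _, ok := k.longTermKeys[client]; !ok {
		return ticket{}, ticket{}, fmt.Errorf("unknown principal %q", client)
	}
	s := make([]byte, 16)
	if _, err := rand.Read(s); err != nil {
		return ticket{}, ticket{}, err
	}
	t := ticket{SessionKey: s, Client: client, Service: service,
		Expires: time.Now().Add(8 * time.Hour)} // short-term: e.g. 8 hours
	// The first copy would be sealed with k.longTermKeys[client], the second
	// with k.longTermKeys[service], so only those parties can recover S.
	return t, t, nil
}

func main() {
	k := &kdc{longTermKeys: map[string][]byte{"el": {1}, "fs": {2}}}
	c, srv, err := k.issue("el", "fs")
	if err != nil {
		panic(err)
	}
	fmt.Println("same S on both sides:", string(c.SessionKey) == string(srv.SessionKey))
	fmt.Println("expires:", srv.Expires)
}
```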
16
Today's Lecture

• DFS design comparisons continued


• Topic 2: file access consistency
• NFS, AFS
• Topic 3: name space construction
• Mount (NFS) vs. global name space (AFS)
• Topic 4: AAA in distributed file systems
• Kerberos

• Other types of DFS


• Coda – disconnected operation
• LBFS – weakly connected operation

17
Background

• We are back in the 1990s.


• Network is slow and not stable
• Terminal → “powerful” client
• 33MHz CPU, 16MB RAM, 100MB hard drive
• Mobile Users appeared
• 1st IBM Thinkpad in 1992
• We can do work at client without network

18
CODA

• Successor of the very successful Andrew File System (AFS)
• AFS
• First DFS aimed at a campus-sized user community
• Key ideas include
• open-to-close consistency
• callbacks

19
Hardware Model

• CODA and AFS assume that client workstations are personal
computers controlled by their user/owner
• Fully autonomous
• Cannot be trusted
• CODA allows owners of laptops to operate them
in disconnected mode
• Opposite of ubiquitous connectivity

20
Accessibility

• Must handle two types of failures


• Server failures:
• Data servers are replicated
• Communication failures and voluntary
disconnections
• Coda uses optimistic replication and file
hoarding

21
Design Rationale

• Scalability
• Callback cache coherence (inherited from AFS)
• Whole file caching
• Fat clients. (security, integrity)
• Avoid system-wide rapid change
• Portable workstations
• User’s assistance in cache management

22
Design Rationale –Replica
Control
• Pessimistic
• Disable all partitioned writes
- Require a client to acquire control (lock) of a cached
object prior to disconnection
• Optimistic
• Assume no one else is touching the file
- conflict detection
+ fact: low write-sharing in Unix
+ high availability: access anything in range

23
What about Consistency?

• Pessimistic replication control protocols guarantee the
consistency of replicated data in the presence of any non-Byzantine failures
• Typically require a quorum of replicas to allow access
to the replicated data
• Would not support disconnected mode
• We shall cover Byzantine Faults and Failures later.

24
Pessimistic Replica Control

• Would require client to acquire exclusive (RW) or shared (R)
control of cached objects before accessing them in disconnected mode:
• Acceptable solution for voluntary disconnections
• Does not work for involuntary disconnections

• What if the laptop remains disconnected for a long time?

25
Leases

• We could grant exclusive/shared control of the cached
objects for a limited amount of time
• Works very well in connected mode
• Reduces server workload
• Server can keep leases in volatile storage as long as
their duration is shorter than boot time
• Would only work for very short disconnection
periods
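
A small Go sketch of such time-limited control; the lease structure and conflict rule are illustrative, and coexisting shared leases are omitted for brevity.

```go
package main

import (
	"fmt"
	"time"
)

// lease records who controls an object and until when. Because every lease
// expires on its own, the server can keep this table purely in memory: after
// a crash and reboot, all outstanding leases have already run out.
type lease struct {
	holder    string
	exclusive bool
	expires   time.Time
}

type leaseServer struct {
	leases map[string]lease
}

// grant hands out control for duration d unless a conflicting, unexpired
// lease is held by someone else.
func (s *leaseServer) grant(obj, client string, exclusive bool, d time.Duration) bool {
	cur, held := s.leases[obj]
	if held && time.Now().Before(cur.expires) && cur.holder != client &&
		(cur.exclusive || exclusive) {
		return false
	}
	s.leases[obj] = lease{holder: client, exclusive: exclusive,
		expires: time.Now().Add(d)}
	return true
}

func main() {
	s := &leaseServer{leases: map[string]lease{}}
	fmt.Println(s.grant("/coda/doc.tex", "laptopA", true, 30*time.Second)) // true
	fmt.Println(s.grant("/coda/doc.tex", "laptopB", true, 30*time.Second)) // false until expiry
}
```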

26
Optimistic Replica Control (I)

• Optimistic replica control allows access in disconnected mode
• Tolerates temporary inconsistencies
• Promises to detect them later
• Provides much higher data availability

27
Optimistic Replica Control (II)

• Defines an accessible universe: set of files that the user can access
• Accessible universe varies over time
• At any time, user
• Will read from the latest file(s) in his accessible
universe
• Will update all files in his accessible universe

28
Coda States

[State-transition diagram: Hoarding ↔ Emulating ↔ Reintegrating]

1. Hoarding:
Normal operation mode
2. Emulating:
Disconnected operation mode
3. Reintegrating:
Propagates changes and detects inconsistencies
29
Hoarding

• Hoard useful data for disconnection


• Balance the needs of connected and
disconnected operation.
• Cache size is restricted
• Unpredictable disconnections
• Uses user-specified preferences + usage patterns
to decide on files to keep in hoard

30
Prioritized algorithm

• User-defined hoard priority p: how important is a file to you?
• Recent Usage q
• Object priority = f(p,q)
• Kick out the one with lowest priority
+ Fully tunable
Everything can be customized
- Not tunable (?)
- No idea how to customize
- Hoard walking algorithm function of Cache Size
- As disk grows, cache grows
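
A minimal Go sketch of this prioritized eviction. The combination function f(p,q) and its weights below are illustrative; the actual hoard-walk computation in Coda's Venus differs.

```go
package main

import (
	"fmt"
	"sort"
	"time"
)

// hoardEntry combines a user-assigned hoard priority p with a
// recency-of-use score q derived from the last access time.
type hoardEntry struct {
	path     string
	p        float64   // user-defined hoard priority
	lastUsed time.Time // used to derive recency q
}

func (e hoardEntry) priority(now time.Time) float64 {
	age := now.Sub(e.lastUsed).Hours()
	q := 1.0 / (1.0 + age) // recent use => q close to 1
	return 0.7*e.p + 0.3*q // illustrative weights, not Coda's f(p,q)
}

// evict drops the lowest-priority entries until the cache fits maxEntries.
func evict(cache []hoardEntry, maxEntries int) []hoardEntry {
	now := time.Now()
	sort.Slice(cache, func(i, j int) bool {
		return cache[i].priority(now) > cache[j].priority(now)
	})
	if len(cache) > maxEntries {
		cache = cache[:maxEntries]
	}
	return cache
}

func main() {
	cache := []hoardEntry{
		{"/coda/usr/el/thesis.tex", 1.0, time.Now().Add(-2 * time.Hour)},
		{"/coda/usr/el/old.log", 0.1, time.Now().Add(-200 * time.Hour)},
		{"/coda/usr/el/mail", 0.8, time.Now()},
	}
	for _, e := range evict(cache, 2) {
		fmt.Println("keep:", e.path)
	}
}
```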

31
Emulation

• In emulation mode:
• Attempts to access files that are not in the client cache
appear as failures to the application
• All changes are written in a persistent log,
the client modification log (CML)
• Coda removes from log all obsolete entries like those
pertaining to files that have been deleted
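
A minimal Go sketch of such a client modification log. The operation names and the single cancellation rule (a remove makes earlier records for the same file obsolete) are simplified stand-ins for Coda's real CML optimizations.

```go
package main

import "fmt"

type cmlRecord struct {
	op   string // "store", "create", "remove" (illustrative set)
	path string
}

// cml is the persistent log kept while disconnected; persistence is omitted
// here, only the append-and-cancel behavior is shown.
type cml struct {
	records []cmlRecord
}

func (l *cml) append(op, path string) {
	if op == "remove" {
		// Cancel log entries that are obsolete now that the file is gone.
		kept := l.records[:0]
		for _, r := range l.records {
			if r.path != path {
				kept = append(kept, r)
			}
		}
		l.records = kept
	}
	l.records = append(l.records, cmlRecord{op, path})
}

func main() {
	var log cml
	log.append("store", "/coda/usr/el/notes.txt")
	log.append("store", "/coda/usr/el/draft.txt")
	log.append("remove", "/coda/usr/el/notes.txt") // earlier store becomes obsolete
	fmt.Println(log.records)                       // [{store draft.txt} {remove notes.txt}]
}
```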

33
Reintegration
• When workstation gets reconnected, Coda initiates a
reintegration process
• Performed one volume at a time
• Venus ships replay log to all volumes
• Each volume performs a log replay algorithm

• Only care about write/write conflicts


• Conflict resolution succeeds?
• Yes. Free logs, keep going…
• No. Save logs to a tar file and ask the user for help

• In practice:
• No Conflict at all! Why?
• Over 99% modification by the same person
• Two users modify the same obj within a day: <0.75%
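
A minimal Go sketch of write/write conflict detection at replay time. The per-file version number standing in for Coda's store identifiers, and the record layout, are assumptions made for illustration.

```go
package main

import "fmt"

type serverFile struct {
	version int
	data    string
}

type replayRecord struct {
	path        string
	baseVersion int // server version the client saw before disconnecting
	newData     string
}

// replay applies records against the server copy of one volume; a record
// whose base version no longer matches the server's version indicates that
// someone else wrote the file, i.e. a write/write conflict.
func replay(server map[string]*serverFile, log []replayRecord) []string {
	var conflicts []string
	for _, r := range log {
		f := server[r.path]
		if f == nil {
			server[r.path] = &serverFile{version: 1, data: r.newData}
			continue
		}
		if f.version != r.baseVersion {
			conflicts = append(conflicts, r.path) // save for manual resolution
			continue
		}
		f.version++
		f.data = r.newData
	}
	return conflicts
}

func main() {
	server := map[string]*serverFile{
		"/coda/proj/plan.txt": {version: 3, data: "v3"},
	}
	log := []replayRecord{{"/coda/proj/plan.txt", 2, "my edits"}} // stale base
	fmt.Println("conflicts:", replay(server, log))
}
```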

35
Remember this slide?

• We are back in the 1990s.


• Network is slow and not stable
• Terminal → “powerful” client
• 33MHz CPU, 16MB RAM, 100MB hard drive
• Mobile Users appear
• 1st IBM Thinkpad in 1992

36
What about now?

• We are in the 2000s now.


• Network is fast and reliable in LAN
• “powerful” client → very powerful client
• 2.4GHz CPU, 4GB RAM, 500GB hard drive
• Mobile users everywhere
• Do we still need support for disconnection?
• WAN and wireless are not very reliable, and are slow

37
Today's Lecture

• DFS design comparisons continued


• Topic 2: file access consistency
• NFS, AFS
• Topic 3: name space construction
• Mount (NFS) vs. global name space (AFS)
• Topic 4: AAA in distributed file systems
• Kerberos

• Other types of DFS


• Coda – disconnected operation
• LBFS – weakly connected operation

38
Low Bandwidth File System
Key Ideas
• A network file system for slow or wide-area networks
• Avoids sending data that can be found in the server’s
file system or the client’s cache
• Also uses conventional compression and caching
• Requires 90% less bandwidth than traditional
network file systems

39
Working on slow networks

• Make local copies


• Must worry about update conflicts
• Use remote login
• Only for text-based applications
• Use LBFS instead
• Better than remote login
• Must deal with issues like auto-saves blocking the
editor for the duration of transfer

40
LBFS design

• The LBFS server divides the files it stores into chunks and
indexes the chunks by hash value
• Client similarly indexes its file cache
• Exploits similarities between files
• LBFS never transfers chunks that the recipient already
has

41
Indexing

• Uses the SHA-1 algorithm for hashing


• It is collision resistant
• Central challenge in indexing file chunks is
keeping the index at a reasonable size while
dealing with shifting offsets
• Indexing the hashes of fixed size data blocks
• Indexing the hashes of all overlapping blocks at all
offsets

42
LBFS chunking solution

• Considers only non-overlapping chunks


• Sets chunk boundaries based on file contents
rather than on position within a file
• Examines every overlapping 48-byte region of the file
to select the boundary regions, called breakpoints,
using Rabin fingerprints
• When the low-order 13 bits of a region’s fingerprint equal a
chosen value, the region constitutes a breakpoint
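
A small Go sketch of this content-defined chunking. The fingerprint function below is a simple polynomial hash standing in for a true Rabin fingerprint, and the magic value is arbitrary; the 48-byte window, 13-bit match, and 2K/64K limits follow the slides.

```go
package main

import (
	"crypto/sha1"
	"fmt"
)

const (
	windowSize = 48        // bytes examined per fingerprint, as in LBFS
	mask       = 1<<13 - 1 // low-order 13 bits => expected chunk ~8KB
	magic      = 0x13A8    // the "chosen value" a fingerprint must match
	minChunk   = 2 * 1024  // LBFS minimum chunk size
	maxChunk   = 64 * 1024 // LBFS maximum chunk size
)

// fingerprint is a stand-in for a Rabin fingerprint: a simple polynomial
// hash of the last windowSize bytes. Real Rabin fingerprints can be
// updated incrementally as the window slides.
func fingerprint(window []byte) uint64 {
	var f uint64
	for _, b := range window {
		f = f*257 + uint64(b)
	}
	return f
}

// chunk declares a boundary wherever the low 13 bits of the window
// fingerprint equal magic, subject to the minimum and maximum sizes.
// Because boundaries depend on content, an insertion early in a file
// only changes the chunks around the edit.
func chunk(data []byte) [][]byte {
	var chunks [][]byte
	start := 0
	for i := windowSize - 1; i < len(data); i++ {
		length := i - start + 1
		if length < minChunk {
			continue
		}
		if fingerprint(data[i+1-windowSize:i+1])&mask == magic || length >= maxChunk {
			chunks = append(chunks, data[start:i+1])
			start = i + 1
		}
	}
	if start < len(data) {
		chunks = append(chunks, data[start:])
	}
	return chunks
}

func main() {
	data := make([]byte, 200*1024)
	x := uint32(1)
	for i := range data { // fill with pseudo-random bytes
		x = x*1664525 + 1013904223
		data[i] = byte(x >> 24)
	}
	for _, c := range chunk(data) {
		sum := sha1.Sum(c)
		fmt.Printf("chunk len=%6d sha1=%x\n", len(c), sum[:8])
	}
}
```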

43
Effects of edits on file chunks

• Chunks of file before/after edits


• Grey shading shows edits
• Stripes show regions with magic values that create chunk boundaries

44
More Indexing Issues

• Pathological cases
• Very small chunks
• Sending hashes of chunks would consume as much
bandwidth as just sending the file
• Very large chunks
• Cannot be sent in a single RPC
• LBFS imposes minimum (2K) and maximum (64K)
chunk sizes

45
The Chunk Database

• Indexes each chunk by the first 64 bits of its SHA-1 hash
• To avoid synchronization problems, LBFS always
recomputes the SHA-1 hash of any data chunk
before using it
• Simplifies crash recovery
• Recomputed SHA-1 values are also used to
detect hash collisions in the database
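
A minimal Go sketch of such a chunk database: chunks are keyed by the first 64 bits of their SHA-1 hash, and the full hash is recomputed on lookup so that key collisions or stale entries are treated as misses. Names are illustrative.

```go
package main

import (
	"bytes"
	"crypto/sha1"
	"encoding/binary"
	"fmt"
)

// chunkDB indexes cached chunks by the first 64 bits of their SHA-1 hash.
// Keys can collide, so stored data is always re-hashed before it is trusted.
type chunkDB struct {
	byKey map[uint64][]byte
}

func key(data []byte) uint64 {
	h := sha1.Sum(data)
	return binary.BigEndian.Uint64(h[:8])
}

func (db *chunkDB) put(data []byte) {
	db.byKey[key(data)] = data
}

// lookup returns a chunk whose full SHA-1 matches want, or nil.
func (db *chunkDB) lookup(want [20]byte) []byte {
	data, ok := db.byKey[binary.BigEndian.Uint64(want[:8])]
	if !ok {
		return nil
	}
	got := sha1.Sum(data)
	if !bytes.Equal(got[:], want[:]) {
		return nil // 64-bit key collision or stale entry: treat as a miss
	}
	return data
}

func main() {
	db := &chunkDB{byKey: make(map[uint64][]byte)}
	chunk := []byte("some file chunk")
	db.put(chunk)
	want := sha1.Sum(chunk)
	fmt.Println(string(db.lookup(want)))
}
```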

46
DFS in real life

• Dropbox, Google Drive, OneDrive, BOX


• 100s of Millions of users, syncing petabytes (?)
• Basic function: Storing, sharing, synchronizing data
between multiple devices, anytime, over any network
• General architecture (esp Dropbox)

Picture Credit: Yong Cui, QuickSync: Improving Synchronization Efficiency for Mobile Cloud Storage Services 47
Features and Comparisons

• Chunking: splitting a large file into multiple data units


• Bundling: multiple small chunks as a single chunk
• Deduplication: avoiding sending existing content in the cloud
• Delta-encoding: transmit only the modified portion of a file

• Question: Dropbox’s consistency model for conflicts?


• Question: Why don’t we always do data deduplication?
48
Key Lessons

• Distributed filesystems almost always involve a tradeoff:
consistency, performance, scalability.
• We’ll see a related tradeoff, also involving
consistency, in a while: the CAP tradeoff.
Consistency, Availability, Partition-resilience.
More Key Lessons

• Client-side caching is a fundamental technique to improve
scalability and performance
• But raises important questions of cache consistency
• Timeouts and callbacks are common methods for
providing (some forms of) consistency.
• AFS picked close-to-open consistency as a good
balance of usability (the model seems intuitive to
users), performance, etc.
• AFS authors argued that apps with highly concurrent,
shared access, like databases, needed a different model
Key lessons for Coda

• Puts scalability and availability before data consistency
• Unlike NFS
• Assumes that inconsistent updates are very infrequent
=> detect conflicts when reintegrating
• Introduced disconnected operation mode by allowing
cached data (weakly consistent), backed by a file
hoarding database
• Limitations?
• Detects only W/W conflicts, no R/W (lazy consistency)
• No client-client sharing possible

51
Key Lessons for LBFS

• Under normal circumstances, LBFS consumes 90% less
bandwidth than traditional file systems.
• Makes transparent remote file access a viable and
less frustrating alternative to running interactive
programs on remote machines.
• Key Ideas: content-based chunk definition, Rabin
fingerprints to deal with insertions/deletions,
hashes to determine content changes, ...

52
