Caching in Distributed File System: Ke Wang CS614 - Advanced System Apr 24, 2001

This document discusses caching in distributed file systems. It covers: - Key requirements for distributed systems including scalability, access to distributed files, information protection, ease of administration, and vendor support. - Background on distributed file systems (DFS) which allow shared files and storage across locations. DFS combine smaller remote storage spaces. - Approaches to caching including location (disk vs memory), placement (client vs server), structure (block vs file), policies, and consistency issues. - Examples of caching in specific distributed file systems including NFS, AFS, and Sprite. AFS uses whole-file caching at clients and write-on-close to improve scalability.

Caching in Distributed File System

Ke Wang
CS614 – Advanced System
Apr 24, 2001
Key requirements of a distributed system
- Scalability from small to large networks
- Fast and transparent access to a geographically distributed file system (DFS)
- Information protection
- Ease of administration
- Wide support from a variety of vendors
Background
- DFS: a distributed implementation of a file system, where multiple users share files and storage resources
- The overall storage space managed by a DFS is composed of different, remotely located, smaller storage spaces
- There is usually a correspondence between constituent storage spaces and sets of files
DFS Structure
- Service: a software entity providing a particular type of function to clients
- Server: service software running on a single machine
- Client: a process that can invoke a service using a set of operations that forms its client interface
Why caching?
- A cache retains the most recently accessed disk blocks.
- Repeated accesses to a cached block can be handled without involving the disk.
- Advantages
  - Reduced delays
  - Reduced contention for the disk arm
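
The idea of retaining recently accessed blocks can be sketched as a small least-recently-used (LRU) cache. This is only an illustration: the `BlockCache` class, its `read_from_disk` callback, and the LRU eviction rule are assumptions, not something the slides specify.

```python
from collections import OrderedDict


class BlockCache:
    """Tiny LRU cache of disk blocks (illustrative sketch only)."""

    def __init__(self, capacity, read_from_disk):
        self.capacity = capacity
        self.read_from_disk = read_from_disk  # hypothetical slow-path callback
        self.blocks = OrderedDict()           # block number -> data, oldest first
        self.disk_reads = 0

    def read(self, block_no):
        if block_no in self.blocks:
            # Cache hit: no disk access, just mark the block as recently used.
            self.blocks.move_to_end(block_no)
            return self.blocks[block_no]
        # Cache miss: go to the disk and remember the result.
        self.disk_reads += 1
        data = self.read_from_disk(block_no)
        self.blocks[block_no] = data
        if len(self.blocks) > self.capacity:
            self.blocks.popitem(last=False)   # evict the least recently used block
        return data
```

Repeated reads of the same block increment `disk_reads` only once, as long as the block has not been evicted in between.
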
Caching in DFS
- Advantages
  - Reduce network traffic
  - Reduce server contention
- Problems
  - Cache consistency
Stuff to consider
- Cache location (disk vs. memory)
- Cache placement (client vs. server)
- Cache structure (block vs. file)
- Stateful vs. stateless server
- Cache update policies
- Consistency
- Client-driven vs. server-driven protocols
Practical Distributed Systems
- NFS: Sun’s Network File System
- AFS: Andrew File System (CMU)
- Sprite FS: File System for the Sprite OS (UC Berkeley)
Sun’s Network File System (NFS)
- Originally released in 1985
- Built on top of UDP, an unreliable datagram protocol (now changed to TCP)
- Client-server model
Andrew File System (AFS)
- Developed at CMU since 1983
- Client-server model
- Key software: Vice and Venus
- Goal: high scalability (5,000-10,000 nodes)
Andrew File System (AFS)
- Vice is a multi-threaded server process, with each thread handling a single client request
- Venus is the client process that runs on each workstation and forms the interface with Vice
- Both are user-level processes
Prototype of AFS
- One server process per client
- Clients cache files
- The timestamp is verified on every open
  -> a lot of interaction with the server
  -> heavy network traffic
Improving AFS
- To improve on the prototype:
  - Reduce cache validity checks
  - Reduce the number of server processes
  - Reduce network traffic
  -> Higher scalability!
Sprite File System
- Designed for networked workstations with large physical memories (can be diskless)
- Expects memories of 100-500 Mbytes
- Goal: high performance
Caches in Sprite FS
- When a process makes a file access, the request is presented first to the cache (file traffic). If it is not satisfied there, the request is passed either to a local disk, if the file is stored locally (disk traffic), or to the server where the file is stored (server traffic). Servers also maintain caches to reduce disk traffic.
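
The lookup path described above can be sketched as follows. The class names `SpriteClient` and `SpriteServer`, the dictionary-based storage, and the traffic counters are hypothetical; the sketch ignores writes, eviction, and consistency.

```python
class SpriteServer:
    """Server with its own block cache in front of its disk (sketch)."""

    def __init__(self, files):
        self.files = files      # name -> list of blocks, standing in for the disk
        self.cache = {}
        self.disk_reads = 0

    def read(self, name, block):
        key = (name, block)
        if key not in self.cache:
            self.disk_reads += 1                  # server cache miss: disk traffic
            self.cache[key] = self.files[name][block]
        return self.cache[key]


class SpriteClient:
    """Client that tries its cache first, then local disk or the server."""

    def __init__(self, local_files, server):
        self.cache = {}
        self.local_files = local_files            # files stored on the local disk
        self.server = server
        self.traffic = {"file": 0, "disk": 0, "server": 0}

    def read(self, name, block):
        self.traffic["file"] += 1                 # every access is file traffic
        key = (name, block)
        if key in self.cache:
            return self.cache[key]
        if name in self.local_files:
            self.traffic["disk"] += 1             # miss on a locally stored file
            data = self.local_files[name][block]
        else:
            self.traffic["server"] += 1           # miss on a remote file
            data = self.server.read(name, block)
        self.cache[key] = data
        return data
```

Reading the same remote block twice generates two units of file traffic but only one unit of server traffic.
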
Caching in Sprite FS
- Two unusual aspects
  - Guarantees a completely consistent view
    - Concurrent write sharing
    - Sequential write sharing
  - Cache size varies dynamically
Cache Location: Disk vs. Main Memory
- Advantages of disk caches
  - More reliable
  - Cached data are still there during recovery and don’t need to be fetched again
Cache Location: Disk vs. Main Memory (cont)
- Advantages of main-memory caches
  - Permit workstations to be diskless
  - Faster access
  - Server caches (used to speed up disk I/O) are always in main memory; using main-memory caches on the clients permits a single caching mechanism for servers and users
Cache Placement: Client vs. Server
- Client caches reduce network traffic
  - Read-only operations on unchanged files do not need to go over the network
- Server caches reduce server load
  - The cache is amortized across all clients (but needs to be bigger to be effective)
- In practice, BOTH are needed!
Cache structure
- Block basis
  - Simple
  - Sprite FS, NFS
- File basis
  - Reduces interaction with servers
  - AFS
  - Cannot access files larger than the cache
Compare
- NFS: client memory (disk), block basis
- AFS: client disk, file basis
- Sprite FS: client memory, server memory, block basis
Stateful vs. Stateless Server
- Stateful: servers hold information about the client
- Stateless: servers maintain no state information about clients
Stateful Servers
- Mechanism
  - The client opens a file
  - The server fetches information about the file from its disk, stores it in memory, and gives the client a unique connection id for the open file
  - The id is used for subsequent accesses until the session ends
Stateful Servers (cont)
- Advantages
  - Fewer disk accesses
  - Read-ahead is possible
  - RPCs are small, containing only an id
  - A file may be cached entirely on the client and invalidated by the server if there is a conflicting write
Stateful Servers (cont)
- Disadvantages
  - The server loses all its volatile state in a crash
    - It must restore the state through a dialog with clients, or abort the operations that were underway when the crash occurred
  - The server needs to be aware of client failures
Stateless Server
- Each request must be self-contained
- Each request identifies the file and the position in the file
- No need to establish and terminate a connection with open and close operations
Stateless Server (cont)
- Advantages
  - A file server crash does not affect clients
  - Simple
- Disadvantages
  - Impossible to enforce consistency
  - Each RPC must carry all the state it needs, so requests are longer
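
The contrast between the two request styles can be sketched as follows. The names `StatefulServer`, `StatelessServer`, and the in-memory `FILES` table are hypothetical, and the `crash` method only simulates the loss of volatile state.

```python
FILES = {"notes.txt": b"hello distributed world"}  # stands in for the server's disk


class StatefulServer:
    """Remembers each open file and its current position per connection id."""

    def __init__(self):
        self.sessions = {}   # connection id -> [file name, position]; volatile
        self.next_id = 0

    def open(self, name):
        self.next_id += 1
        self.sessions[self.next_id] = [name, 0]
        return self.next_id

    def read(self, conn_id, count):
        # The request is small: only the id and a byte count.
        name, pos = self.sessions[conn_id]
        data = FILES[name][pos:pos + count]
        self.sessions[conn_id][1] = pos + len(data)
        return data

    def crash(self):
        self.sessions.clear()  # all volatile state is lost


class StatelessServer:
    """Keeps nothing between requests; each request is self-contained."""

    def read(self, name, offset, count):
        # The request is longer: it names the file and the position every time.
        return FILES[name][offset:offset + count]
```

After `crash()`, a read with an old connection id fails in the stateful sketch, while the stateless server answers the same `read(name, offset, count)` request before and after a restart.
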
Stateful vs. Stateless
- AFS and Sprite FS are stateful
  - Sprite FS servers keep track of which clients have which files open
  - AFS servers keep track of the contents of clients’ caches
- NFS is stateless
Cache Update Policy
- Write-through
- Delayed-write
- Write-on-close (a variation of delayed-write)
Cache Update Policy (cont)
- Write-through: all writes are propagated to stable storage immediately
  - Reliable, but poor performance
Cache Update Policy (cont)
- Delayed-write: modifications are written to the cache and written through to the server later
- Write-on-close: modifications are written back to the server when the file is closed
  - Reduces intermediate read and write traffic while the file is open
Cache Update Policy (cont)
- Advantages of delayed-write and write-on-close
  - Many files have lifetimes of less than 30 s
  - Redundant writes are absorbed
  - Many small writes can be batched into larger writes
- Disadvantage
  - Poor reliability; unwritten data may be lost when a client crashes
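
The three policies differ mainly in when a dirty cached copy is flushed to the server. The sketch below is one way to picture this; the `Server` and `CachedFile` classes are hypothetical, whole-file contents stand in for individual blocks, and the delayed-write timer is left to the caller.

```python
class Server:
    """Stands in for the file server; counts how many writes reach it."""

    def __init__(self):
        self.data = {}
        self.writes = 0

    def write(self, name, content):
        self.data[name] = content
        self.writes += 1


class CachedFile:
    """Client-side cached file with a configurable update policy (sketch)."""

    def __init__(self, server, name, policy):
        if policy not in ("write-through", "delayed-write", "write-on-close"):
            raise ValueError(policy)
        self.server, self.name, self.policy = server, name, policy
        self.content = None
        self.dirty = False

    def write(self, content):
        self.content = content
        self.dirty = True
        if self.policy == "write-through":
            self.flush()          # propagate every write immediately

    def flush(self):
        # A delayed-write client would call this from a periodic timer
        # (assumption: the timer itself is outside this sketch).
        if self.dirty:
            self.server.write(self.name, self.content)
            self.dirty = False

    def close(self):
        self.flush()              # write-on-close: the only flush for that policy
```

With three writes followed by a close, the write-through file sends three writes to the server, while the write-on-close file sends one; the two intermediate versions are absorbed in the cache.
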
Caching in AFS
- Key to Andrew’s scalability
- Clients cache entire files on disk
- Write-on-close
  - Server load and network traffic are reduced
  - The server is contacted only on open and close
  - The cache is retained across reboots
  - Requires a sufficiently large local disk
Cache update policy
- NFS and Sprite use delayed-write
  - Writes are delayed by 30 seconds
- AFS uses write-on-close
  - Reduces traffic to the server dramatically
  -> Good scalability for AFS
Consistency
- Is the locally cached copy of the data consistent with the master copy?
- Is there a danger of “stale” data?
- Is concurrent write sharing permitted?
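Sprite: Complete Consistency
- Concurrent write sharing
  - A file is open on multiple clients
  - At least one client is writing
  - The server detects this situation
  - It requires dirty data to be written back to the server
  - It invalidates the caches of the clients that have the file open
Sprite: Complete Consistency
- Sequential write sharing
  - A file is modified, closed, and then opened by others
  - Problem: out-of-date blocks in a client’s cache
  - Solution: compare the version number with the server
  - Problem: the current data may be in another client’s cache
  - Solution: the server keeps track of the last writer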
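
The version-number check for sequential write sharing can be sketched as follows. This is a simplification under stated assumptions: the `VersionServer` and `VersionClient` names are hypothetical, the version is bumped when a writer stores the file, whole files stand in for blocks, and last-writer tracking is omitted because writers here flush on close.

```python
class VersionServer:
    """Keeps one version number per file (sketch)."""

    def __init__(self):
        self.files = {}  # name -> (version, data)

    def current_version(self, name):
        return self.files.get(name, (0, b""))[0]

    def fetch(self, name):
        return self.files[name]

    def store(self, name, data):
        version = self.current_version(name) + 1
        self.files[name] = (version, data)
        return version


class VersionClient:
    """Validates its cached copy against the server's version on open."""

    def __init__(self, server):
        self.server = server
        self.cache = {}  # name -> (version, data)

    def open_read(self, name):
        current = self.server.current_version(name)
        cached = self.cache.get(name)
        if cached is None or cached[0] != current:
            # Cached blocks are missing or out of date: refetch them.
            self.cache[name] = self.server.fetch(name)
        return self.cache[name][1]

    def write_close(self, name, data):
        version = self.server.store(name, data)
        self.cache[name] = (version, data)
```

A reader that cached version 1 sees a version mismatch on its next open after another client has written version 2, and refetches instead of using the stale copy.
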
AFS: session semantics
- Session semantics in AFS
  - Writes to an open file are invisible to other clients
  - Once the file is closed, the changes are visible to new opens anywhere
  - Other file operations are visible immediately
  - Only sequential consistency is guaranteed
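
Session semantics can be illustrated with a minimal sketch in which each open takes a private whole-file copy and close stores it back. The `FileServer` and `Session` classes are hypothetical stand-ins, not the actual Vice and Venus interfaces.

```python
class FileServer:
    """Holds the master copy of each file (sketch)."""

    def __init__(self, files=None):
        self.files = dict(files or {})

    def fetch(self, name):
        return self.files.get(name, "")

    def store(self, name, data):
        self.files[name] = data


class Session:
    """One open-to-close session on a whole-file copy."""

    def __init__(self, server, name):
        self.server, self.name = server, name
        self.copy = server.fetch(name)   # private copy taken at open

    def read(self):
        return self.copy

    def write(self, data):
        self.copy = data                 # visible only inside this session

    def close(self):
        self.server.store(self.name, self.copy)  # changes become visible to new opens
```

A session that was already open keeps seeing its own copy even after another session closes; only sessions opened afterwards see the new contents.
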
Consistency
- Sprite guarantees complete consistency
- AFS uses session semantics
- NFS does not guarantee consistency
  - NFS is stateless. All operations involve contacting the server; if the server is unreachable, reads and writes cannot work
Client-driven vs. Server-driven
- Client-driven approach
  - The client initiates a validity check
  - The server checks whether the local data are consistent with the master copy
- Server-driven approach
  - The server records which files each client caches
  - When the server detects an inconsistency, it must react
AFS: server-driven
- Callback (key to scalability)
  - A cached copy is valid as long as the client holds a callback on it
  - The server notifies callback holders before allowing a modification
  - After a reboot, all cached copies are suspect
  - Reduces cache validation requests to the server
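
The callback mechanism can be sketched as follows. This is a simplified illustration, not the AFS protocol itself: the class and method names are hypothetical, callbacks are broken when a writer stores the file, and a rebooted client simply drops its callbacks and refetches.

```python
class CallbackServer:
    """Records which clients hold a callback on each file (sketch)."""

    def __init__(self, files=None):
        self.files = dict(files or {})
        self.callbacks = {}   # name -> set of clients holding a callback
        self.fetches = 0

    def fetch(self, client, name):
        self.fetches += 1
        self.callbacks.setdefault(name, set()).add(client)
        return self.files.get(name, "")

    def store(self, writer, name, data):
        self.files[name] = data
        # Break the callbacks of every other client caching this file.
        for client in self.callbacks.pop(name, set()):
            if client is not writer:
                client.break_callback(name)
        self.callbacks[name] = {writer}


class CallbackClient:
    def __init__(self, server):
        self.server = server
        self.cache = {}          # name -> data
        self.callbacks = set()   # names whose cached copy is known valid

    def open(self, name):
        if name in self.cache and name in self.callbacks:
            return self.cache[name]          # valid callback: no server contact
        self.cache[name] = self.server.fetch(self, name)
        self.callbacks.add(name)
        return self.cache[name]

    def close_after_write(self, name, data):
        self.cache[name] = data
        self.callbacks.add(name)
        self.server.store(self, name, data)

    def break_callback(self, name):
        self.callbacks.discard(name)

    def reboot(self):
        self.callbacks.clear()   # all cached copies become suspect
```

Repeated opens of an unchanged file reach the server only once, which is how callbacks reduce validation requests.
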
Client-driven vs. Server-driven
- AFS is server-driven (callback)
  - Contributes to AFS’s scalability
  - Whole-file caching and session semantics also help
- NFS and Sprite are client-driven
  - Increased load on the network and server
AFS: Effect on scalability
Sprite: Dynamic cache size
- Make the client cache as large as possible
- Virtual memory and the file system negotiate
  - They compare the ages of their oldest pages
- Two problems
  - Double caching
  - Multiblock pages
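
The negotiation between virtual memory and the file system can be pictured as choosing the oldest page across both pools whenever a page is needed. The function below is a minimal sketch of that comparison; `pick_victim` and its dictionary arguments are hypothetical, and any bias between the two modules is ignored.

```python
def pick_victim(fs_pages, vm_pages):
    """Return (owner, page id) of the page to reclaim.

    Each argument maps a page id to its last access time; the module
    holding the page with the oldest access time gives it up.
    """
    candidates = []
    if fs_pages:
        page = min(fs_pages, key=fs_pages.get)
        candidates.append((fs_pages[page], "fs", page))
    if vm_pages:
        page = min(vm_pages, key=vm_pages.get)
        candidates.append((vm_pages[page], "vm", page))
    if not candidates:
        raise ValueError("no pages to reclaim")
    _, owner, page = min(candidates)   # smallest access time is the oldest page
    return owner, page
```

If the file cache holds the oldest page, the cache shrinks by one page; otherwise virtual memory gives up a page and the cache can grow.
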
Why not callback in Sprite?
- The estimated improvement is small
- Reason
  - Andrew is a user-level process
  - Sprite is a kernel-level implementation
Comparison
Performance – running time
- Uses the Andrew benchmark
- The Sprite system is fastest
  - Kernel-to-kernel RPC
  - Delayed write
  - Kernel implementation (AFS is user-level)
Performance – CPU utilization
- Uses the Andrew benchmark
- The Andrew system showed the greatest scalability
  - File-based cache
  - Server-driven
  - Use of callbacks
Nomadic Caching
- New issues
  - What if a client becomes disconnected?
  - What if it is only weakly connected (by modem)?
- Either case violates a key property: transparency!
Nomadic Caching
- Cache misses may impede progress
- Local updates are invisible remotely
- Update conflicts
- Updates are vulnerable to loss and damage
-> Coda file system

