16: Distributed file systems
Issues

• What is the basic abstraction?
  – remote file system?
    • open, close, read, write, …
  – remote disk?
    • read block, write block
• Naming
• Caching
  – caching exists for performance reasons
  – where are file blocks cached?
    • on the file server?
    • on the client machine?
    • both?
• Replication
  – replication can exist for performance and/or availability
  – can there be multiple copies of a file in the network?
  – if multiple copies, how are updates handled?
  – what if there’s a network partition and clients work on separate copies?
• Performance
  – what is the cost of a remote operation?
  – what is the cost of file sharing?
  – how does the system scale as the number of clients grows?
  – what are the performance limitations: network, CPU, disks, protocols, data copying?
  – (no, we didn’t really learn about these, but they’re obvious ☺)

Example: SUN Network File System (NFS)

• The Sun Network File System (NFS) has become a common standard for distributed UNIX file access
• NFS runs over LANs (even over WANs – slowly)
• Basic idea
  – allow a remote directory to be “mounted” (spliced) onto a local directory (see the sketch below)
  – gives access to that remote directory and all its descendants as if they were part of the local hierarchy
• Pretty much exactly like a “local mount” or “link” on UNIX
  – except for implementation and performance …
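As a rough illustration of the splice, here is a minimal Python sketch of client-side path translation through a mount table. It uses the /u4/levy → /students/foo mount from the example below; the helper name resolve() is made up, and real NFS resolves paths inside the kernel’s VFS layer, component by component, rather than by string rewriting.

# Toy mount table: local mount point -> (remote host, remote directory)
MOUNTS = {
    "/students/foo": ("Node1", "/u4/levy"),
}

def resolve(local_path):
    """Return (host, remote_path) if local_path falls under a mount point,
    or (None, local_path) if the local file system should handle it."""
    for mount_point, (host, remote_dir) in MOUNTS.items():
        if local_path == mount_point or local_path.startswith(mount_point + "/"):
            return host, remote_dir + local_path[len(mount_point):]
    return None, local_path

print(resolve("/students/foo/myfile"))   # ('Node1', '/u4/levy/myfile')
print(resolve("/etc/passwd"))            # (None, '/etc/passwd')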
• For instance:
  – I mount /u4/levy on Node1 onto /students/foo on Node2
  – users on Node2 can then access this directory as /students/foo
  – if I had a file /u4/levy/myfile, users on Node2 see it as /students/foo/myfile
• Just as, on a local system, I might link /cse/www/education/courses/451/08au/ as /u4/levy/451 to allow easy access to my web data from my home directory

NFS implementation

• NFS defines a set of RPC operations for remote file access (a client-side stub sketch follows this list):
  – searching a directory
  – reading directory entries
  – manipulating links and directories
  – reading/writing files
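A hedged sketch of what client-side stubs for these operations could look like. The procedure names follow the real NFS protocol (LOOKUP, READDIR, LINK, MKDIR, READ, WRITE), but the transport is only a placeholder and every signature here is illustrative, not the actual Sun RPC/XDR interface.

class NfsClientStub:
    """Client-side stubs for NFS-style RPCs. A real stub would marshal the
    arguments with XDR and send them to the server over Sun RPC; here the
    transport is just a placeholder."""

    def __init__(self, server):
        self.server = server

    def _call(self, procedure, **args):
        # Placeholder for marshalling + network send + reply unmarshalling.
        raise NotImplementedError("would send %s%r to %s" % (procedure, args, self.server))

    # searching a directory: (directory handle, name) -> file handle
    def lookup(self, dir_handle, name):
        return self._call("LOOKUP", dir_handle=dir_handle, name=name)

    # reading directory entries
    def readdir(self, dir_handle, cookie=0):
        return self._call("READDIR", dir_handle=dir_handle, cookie=cookie)

    # manipulating links and directories
    def link(self, file_handle, dir_handle, name):
        return self._call("LINK", file_handle=file_handle, dir_handle=dir_handle, name=name)

    def mkdir(self, dir_handle, name):
        return self._call("MKDIR", dir_handle=dir_handle, name=name)

    # reading/writing files: every call carries a file handle and an explicit
    # offset, so the server keeps no per-client open-file state
    def read(self, file_handle, offset, count):
        return self._call("READ", file_handle=file_handle, offset=offset, count=count)

    def write(self, file_handle, offset, data):
        return self._call("WRITE", file_handle=file_handle, offset=offset, data=data)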
• NFS defines new layers in the Unix file system
  – the virtual file system (VFS) provides a standard interface, using v-nodes as file handles; a v-node describes either a local or remote file
  – [layer diagram: System Call Interface → Virtual File System → UFS (local files) or NFS (remote files); the NFS layer sends RPCs to other (server) nodes and handles RPC requests from remote clients, and server responses; the buffer cache / i-node table sits below]

NFS caching / sharing

• On an open, the client asks the server whether its cached blocks are up to date
• Once a file is open, different clients can write it and get inconsistent data
• Modified data is flushed back to the server every 30 seconds
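A minimal sketch of those caching rules, assuming a client that keys its cache on file handle and uses the server’s last-modification time as the validity check on open. The 30-second constant comes from the slide; everything else (class and method names, the server interface) is an assumption for illustration and ignores the many details of real NFS attribute caching.

import time

FLUSH_INTERVAL = 30.0            # modified data goes back to the server every 30 s

class CachedFile:
    def __init__(self, server_mtime):
        self.blocks = {}         # block number -> bytes
        self.dirty = set()       # block numbers modified locally
        self.server_mtime = server_mtime

class NfsClientCache:
    """Client-side block cache: validate on open, write back periodically."""
    def __init__(self, server):
        self.server = server     # assumed interface: get_mtime, read_block, write_block
        self.files = {}          # file handle -> CachedFile
        self.last_flush = time.monotonic()

    def open(self, handle):
        # On open, ask the server whether our cached blocks are up to date;
        # if the file changed on the server, drop the stale blocks.
        mtime = self.server.get_mtime(handle)
        cached = self.files.get(handle)
        if cached is None or cached.server_mtime != mtime:
            self.files[handle] = CachedFile(server_mtime=mtime)
        return handle

    def read(self, handle, block_no):
        f = self.files[handle]
        if block_no not in f.blocks:
            f.blocks[block_no] = self.server.read_block(handle, block_no)
        return f.blocks[block_no]

    def write(self, handle, block_no, data):
        # Writes land in the local cache first; until the next flush, other
        # clients reading the same file can see stale data.
        f = self.files[handle]
        f.blocks[block_no] = data
        f.dirty.add(block_no)
        self._maybe_flush()

    def _maybe_flush(self):
        if time.monotonic() - self.last_flush < FLUSH_INTERVAL:
            return
        for handle, f in self.files.items():
            for block_no in sorted(f.dirty):
                self.server.write_block(handle, block_no, f.blocks[block_no])
            f.dirty.clear()
        self.last_flush = time.monotonic()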
Example: Berkeley Sprite File System

• Unix file system developed for diskless workstations with large memories at UCB (differs from NFS, AFS)
• Considers memory as a huge cache of disk blocks
  – memory is shared between the file system and VM
• Files are permanently stored on servers
  – servers have a large memory that acts as a cache as well
• Several workstations can cache blocks for read-only files

Example: Google File System (GFS)

[Figure contrasting GFS with conventional distributed file systems such as NFS]
Files in GFS

• Files are huge by traditional standards
• Most files are mutated by appending new data rather than overwriting existing data
• Once written, the files are only read, and often only sequentially
• Appending becomes the focus of performance optimization and atomicity guarantees (see the record-append sketch below)

GFS Setup

[Figure: clients and misc. servers communicate with the GFS master (which has replicas); chunks C0, C1, C2, C3, C5, … are replicated across the chunk servers]
Architecture

• A GFS cluster consists of a single master and multiple chunk servers, and is accessed by multiple clients
• Each of these is typically a commodity Linux machine running a user-level server process
• Files are divided into fixed-size chunks, identified by an immutable and globally unique 64-bit chunk handle
• For reliability, each chunk is replicated on multiple chunk servers
• The master maintains all file system metadata
• The master periodically communicates with each chunk server in HeartBeat messages to give it instructions and collect its state
• Neither the client nor the chunk server caches file data, eliminating cache coherence issues
• Clients do cache metadata, however
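A rough sketch of the master-side state implied by these bullets. The field names, the default replication factor of three, and the shape of the HeartBeat exchange are assumptions for illustration, not the real GFS data structures.

import secrets
import time
from dataclasses import dataclass, field

def new_chunk_handle():
    """Chunk handles are immutable, globally unique 64-bit identifiers."""
    return secrets.randbits(64)

@dataclass
class ChunkInfo:
    handle: int
    replicas: set = field(default_factory=set)   # chunk servers holding a copy

class Master:
    """Single master keeping all file system metadata in memory."""
    def __init__(self, replication_factor=3):
        self.replication_factor = replication_factor
        self.files = {}            # path -> list of ChunkInfo, one per chunk index
        self.last_heartbeat = {}   # chunk server address -> time of last HeartBeat

    def create(self, path):
        self.files[path] = []

    def add_chunk(self, path, candidate_servers):
        """Allocate a new chunk and record which servers should replicate it."""
        info = ChunkInfo(handle=new_chunk_handle(),
                         replicas=set(candidate_servers[:self.replication_factor]))
        self.files[path].append(info)
        return info

    def lookup(self, path, chunk_index):
        """What a client asks for: (chunk handle, replica locations)."""
        info = self.files[path][chunk_index]
        return info.handle, sorted(info.replicas)

    def heartbeat(self, server, held_handles):
        """Periodic HeartBeat: record that the server is alive and which chunk
        handles it actually holds; instructions back to the server (e.g.
        delete an orphaned chunk) would ride on the reply."""
        self.last_heartbeat[server] = time.time()
        for chunks in self.files.values():
            for info in chunks:
                if info.handle in held_handles:
                    info.replicas.add(server)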
Read Process

• A single master vastly simplifies the design
• Clients never read and write file data through the master; instead, a client asks the master which chunk servers it should contact
• Using the fixed chunk size, the client translates the file name and byte offset specified by the application into a chunk index within the file
• It sends the master a request containing the file name and chunk index; the master replies with the corresponding chunk handle and the locations of the replicas; the client caches this information using the file name and chunk index as the key
• The client then sends a request to one of the replicas, most likely the closest one; the request specifies the chunk handle and a byte range within that chunk (see the code sketch after this list)

Specifications

• Chunk size = 64 MB
• Chunks are stored as plain Unix files on the chunk servers
• Clients hold a persistent TCP connection to the chunk server over an extended period of time (reduces network overhead)
• Clients cache all the chunk location information to facilitate small random reads
• The master keeps the metadata in memory
• Disadvantage – small files become hotspots
• Solution – higher replication for such files
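The read path just described maps almost directly onto code. Below is a hedged sketch assuming a master object with the lookup() interface sketched earlier and chunk servers that answer (chunk handle, offset, length) requests; the replica-selection policy and all names are illustrative.

CHUNK_SIZE = 64 * 1024 * 1024    # 64 MB

class GfsClient:
    def __init__(self, master, chunkservers):
        self.master = master               # answers lookup(file_name, chunk_index)
        self.chunkservers = chunkservers   # address -> object with read(handle, offset, length)
        self.location_cache = {}           # (file_name, chunk_index) -> (handle, replicas)

    def read(self, file_name, offset, length):
        """Read `length` bytes starting at `offset`, one chunk at a time."""
        data = b""
        while length > 0:
            # 1. The fixed chunk size turns (file name, byte offset) into a chunk index.
            chunk_index = offset // CHUNK_SIZE
            within = offset % CHUNK_SIZE
            n = min(length, CHUNK_SIZE - within)

            # 2. Ask the master (or our cache) for the chunk handle and replicas,
            #    keyed by (file name, chunk index).
            key = (file_name, chunk_index)
            if key not in self.location_cache:
                self.location_cache[key] = self.master.lookup(file_name, chunk_index)
            handle, replicas = self.location_cache[key]

            # 3. Send (chunk handle, byte range) to one replica -- ideally the closest.
            data += self.chunkservers[self._pick_replica(replicas)].read(handle, within, n)

            offset += n
            length -= n
        return data

    def _pick_replica(self, replicas):
        # Placeholder policy; a real client prefers a nearby replica.
        return replicas[0]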