Distributed File Systems in Unix
What is AFS?
AFS is a distributed file system, pioneered at Carnegie Mellon University and
supported and developed as a product by Transarc Corporation (now IBM Pittsburgh
Labs). It offers a client-server architecture for federated file sharing and replicated
read-only content distribution, providing location independence, scalability, security, and
transparent migration capabilities. AFS is available for a broad range of heterogeneous
systems, including UNIX and Linux.
AFS Design Principles
Lessons learned from AFS, worth considering for file systems and other large
distributed systems:
• Workstations have cycles to burn. Make clients do work whenever possible.
• Cache whenever possible (a client-caching sketch follows this list).
• Exploit file usage properties and understand them; one-third of Unix files are temporary.
• Minimize system-wide knowledge and change. Do not hardwire locations.
• Trust the fewest possible entities. Do not trust workstations.
• Batch operations into groups where possible.
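These principles suggest a client that caches whole files and lets the server call back when a cached copy goes stale. The sketch below is illustrative only; FileServer, CachingClient, fetch, and register_callback are hypothetical names, not AFS's real interfaces.

# Illustrative sketch only; names and interfaces are hypothetical, not AFS's.

class FileServer:
    """Trivial in-memory stand-in for a remote file server."""
    def __init__(self, files):
        self.files = dict(files)
        self.callbacks = {}                    # path -> invalidation callbacks

    def fetch(self, path):
        return self.files[path]

    def register_callback(self, path, fn):
        self.callbacks.setdefault(path, []).append(fn)

    def store(self, path, data):
        self.files[path] = data
        for fn in self.callbacks.pop(path, []):
            fn(path)                           # "break" callbacks on update

class CachingClient:
    """Whole-file client cache: do work locally, contact the server rarely."""
    def __init__(self, server):
        self.server = server
        self.cache = {}                        # path -> whole-file contents
        self.valid = {}                        # path -> cached copy still valid?

    def open(self, path):
        if self.valid.get(path):
            return self.cache[path]            # served entirely from the client
        data = self.server.fetch(path)         # miss or stale: fetch the whole file
        self.cache[path] = data
        self.valid[path] = True
        self.server.register_callback(path, self.invalidate)
        return data

    def invalidate(self, path):
        self.valid[path] = False               # callback break: cached copy is stale

A second client writing through store would break the callback, so the next open on the first client refetches instead of trusting its cache.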
Elephant: The File System that Never Forgets
Motivation: disks and storage are cheap, while information is valuable.
The straightforward idea is to store all (significant) versions of a file without the
need for user intervention.
"All user operations are reversible."
A simple but powerful goal for the system.
A new version of a file is created each time it is written, with similarities to a
log-structured file system.
File versions are referenced by time, and versioning extends to directories.
Per-file and per-file-group policies govern reclaiming file storage.
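The version-per-write idea can be sketched as below; VersionedFile and its methods are hypothetical names, used only to illustrate referencing versions by time.

# Illustrative sketch, not Elephant's actual implementation.
import bisect
import time

class VersionedFile:
    def __init__(self):
        self.versions = []                     # (timestamp, contents), time-ordered

    def write(self, data, now=None):
        # Every write appends a new immutable version (log-structured flavor).
        self.versions.append((now if now is not None else time.time(), data))

    def read(self, at_time=None):
        # No argument: the current version.  Otherwise: the newest version written
        # at or before at_time, which is what makes user operations reversible.
        if not self.versions:
            raise FileNotFoundError("no versions")
        if at_time is None:
            return self.versions[-1][1]
        times = [t for t, _ in self.versions]
        i = bisect.bisect_right(times, at_time)
        if i == 0:
            raise FileNotFoundError("file did not exist at that time")
        return self.versions[i - 1][1]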
What Files to Keep?
The basic idea is to keep landmark (distinguished) file versions and discard the others.
• Keep One: the current situation. Good for unimportant or easily recreated files.
• Keep All: the complete history is maintained.
• Keep Landmarks: how are landmarks determined?
– User-defined landmarks (similar to the check-in idea in RCS) are allowed.
– A heuristic tags other versions as landmarks.
Not all files should be treated the same; for example, object files and source files
have different characteristics.
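Below is a sketch of the three policies over a version history; the policy names follow the notes, while the function and the landmark flag are illustrative.

# Illustrative sketch of per-file retention policies; not Elephant's real code.

def versions_to_keep(versions, policy):
    """versions is a time-ordered list of (timestamp, is_landmark) pairs."""
    if policy == "keep_one":
        return versions[-1:]                   # current version only
    if policy == "keep_all":
        return list(versions)                  # complete history
    if policy == "keep_landmarks":
        kept = [v for v in versions if v[1]]   # user-defined or heuristic landmarks
        if versions and (not kept or kept[-1] != versions[-1]):
            kept.append(versions[-1])          # always keep the current version
        return kept
    raise ValueError("unknown policy: " + policy)

history = [(1, False), (2, True), (3, False), (4, True), (5, False)]
print(versions_to_keep(history, "keep_landmarks"))   # [(2, True), (4, True), (5, False)]

An object file might use keep_one while the corresponding source file uses keep_landmarks, reflecting their different characteristics.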
GFS: Architecture
A single master and multiple chunkservers, as shown in Fig. 1. Each is a commodity
Linux server.
Files are stored as fixed-size 64 MB chunks, kept as Linux files. Each chunk has a
64-bit chunk handle.
By default there are three replicas of each chunk.
The GFS master maintains the file system metadata.
Clients do not cache data, since it is typically not reused, but they do cache metadata.
Large chunk sizes help minimize client interaction with the master (a potential
bottleneck), allow a client to maintain a persistent TCP connection with a chunkserver,
and reduce the amount of metadata at the master.
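The read path implied above can be sketched as follows: translate a byte offset into a chunk index, ask the master for the 64-bit handle and replica locations, then read from a chunkserver. The Master, Chunkserver, and read names are illustrative, not the real GFS API.

# Illustrative sketch only; class and method names are hypothetical.

CHUNK_SIZE = 64 * 1024 * 1024                  # fixed-size 64 MB chunks

class Master:
    """Holds only metadata: (path, chunk index) -> (64-bit handle, replica list)."""
    def __init__(self, chunk_table):
        self.chunk_table = chunk_table

    def lookup(self, path, chunk_index):
        return self.chunk_table[(path, chunk_index)]

class Chunkserver:
    """Stores chunks (ordinary Linux files in GFS) keyed by chunk handle."""
    def __init__(self, chunks):
        self.chunks = chunks                   # handle -> bytes

    def read_chunk(self, handle):
        return self.chunks[handle]

def read(master, chunkservers, path, offset, length):
    # Assumes the read does not span a chunk boundary.
    chunk_index = offset // CHUNK_SIZE
    handle, replicas = master.lookup(path, chunk_index)   # metadata; client may cache it
    data = chunkservers[replicas[0]].read_chunk(handle)   # data never flows through the master
    start = offset % CHUNK_SIZE
    return data[start:start + length]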
Shark
Motivation: deploying a widely distributed application typically means copying
replicated execution environments to many machines.
Replication has many drawbacks: bandwidth is needed, hard state is required at each
replica, and the replicated run-time environment is not the same as the development
environment.
Shark is designed to support widely distributed applications; it can export a file system.
Scalability comes through a location-aware cooperative cache: a peer-to-peer (p2p)
file system for read sharing. At its heart is a centralized file system like NFS.
Design
Key ideas:
• Once a client retrieves a file, it becomes a replica proxy that serves the file to
other clients.
• Files are stored and retrieved as chunks, and a client can retrieve chunks from
multiple locations.
• A token is assigned to the whole file and to each chunk.
• A Rabin fingerprint algorithm is used to preserve data commonality across chunks;
the idea is that different versions of a file have many chunks in common (a chunking
sketch follows this list).
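The chunking idea can be sketched with content-defined boundaries. A simple polynomial rolling hash stands in for a true Rabin fingerprint here, and the window, mask, and token choices are illustrative; the point is that chunk boundaries depend on content, so file versions that share bytes share chunks (and chunk tokens).

# Content-defined chunking sketch; a rolling hash stands in for a real Rabin
# fingerprint, and all constants are illustrative.
import hashlib

WINDOW = 48                    # bytes in the sliding window
BASE, MOD = 263, 1 << 32       # rolling-hash parameters
MASK = (1 << 13) - 1           # boundary when low 13 bits are 0: ~8 KB average chunks

def chunk(data):
    """Split bytes into chunks whose boundaries are chosen by content."""
    chunks, start, h = [], 0, 0
    pow_w = pow(BASE, WINDOW, MOD)             # weight of the byte leaving the window
    for i, b in enumerate(data):
        h = (h * BASE + b) % MOD
        if i >= WINDOW:
            h = (h - data[i - WINDOW] * pow_w) % MOD
            if (h & MASK) == 0:                # window hash hits the chosen pattern
                chunks.append(data[start:i + 1])
                start = i + 1
    if start < len(data):
        chunks.append(data[start:])
    return chunks

def tokens(data):
    """One illustrative token per chunk (a whole-file token could be derived similarly)."""
    return [hashlib.sha1(c).hexdigest() for c in chunk(data)]

Two versions of a file that differ only in one region produce mostly identical tokens, so a client fetches only the chunks it lacks, from whichever proxies hold them.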
File Consistency
Uses leases and whole-file caching, à la AFS.
The default lease is 5 minutes, with callbacks.
The entire file must be refetched if it is modified, but the client may not have to
retrieve all of its chunks and can fetch them from client proxies.
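A sketch of this lease behaviour, assuming a hypothetical server object with get_tokens and get_chunk calls (not Shark's real interface): a cached file is served while its lease is fresh; when the lease expires, the client revalidates and refetches only the chunks whose tokens changed, possibly from peer proxies.

# Illustrative sketch; the server interface (get_tokens, get_chunk) is hypothetical.
import time

LEASE_SECONDS = 5 * 60                         # default lease of five minutes

class CachedFile:
    def __init__(self, tokens, chunks, fetched_at):
        self.tokens = tokens                   # ordered chunk tokens for the file
        self.chunks = chunks                   # token -> bytes (from server or peer proxies)
        self.fetched_at = fetched_at

    def lease_valid(self, now=None):
        now = now if now is not None else time.time()
        return (now - self.fetched_at) < LEASE_SECONDS

def read_whole_file(cached, server, path):
    if cached.lease_valid():
        # Lease still fresh (and no callback has arrived): serve from the cache.
        return b"".join(cached.chunks[t] for t in cached.tokens)
    # Lease expired: revalidate.  The file may have changed, but chunks that kept
    # their tokens do not need to be transferred again.
    new_tokens = server.get_tokens(path)
    for t in new_tokens:
        if t not in cached.chunks:
            cached.chunks[t] = server.get_chunk(t)   # or fetch from a client proxy
    cached.tokens, cached.fetched_at = new_tokens, time.time()
    return b"".join(cached.chunks[t] for t in cached.tokens)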