0% found this document useful (0 votes)
6 views4 pages

Distributed Systems 15 Mark Q6 To Q10

The document discusses the workings and features of Distributed File Systems (DFS), including client-server architecture, caching, and replication methods. It also covers file models, access models, and the advantages and disadvantages of file caching in DFS. Additionally, it compares NFS and AFS, and explains HDFS and MapReduce, highlighting their applications in big data and machine learning.

Uploaded by

Ojaswi Gahoi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
6 views4 pages

Distributed Systems 15 Mark Q6 To Q10

The document discusses the workings and features of Distributed File Systems (DFS), including client-server architecture, caching, and replication methods. It also covers file models, access models, and the advantages and disadvantages of file caching in DFS. Additionally, it compares NFS and AFS, and explains HDFS and MapReduce, highlighting their applications in big data and machine learning.

Uploaded by

Ojaswi Gahoi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

Distributed Systems - 15 Mark Answers

6. Working of Distributed File System

A Distributed File System (DFS) allows users to access and manage files stored on multiple networked

computers as if they were on a local system. It enables data sharing, fault tolerance, and scalability.

Working:

- Client-server architecture

- File access transparency

- Caching

- Naming service

- Replication

Features:

- Transparency (access, location, replication)

- Fault tolerance

- Scalability

- Security

- Concurrency

Applications:

- Enterprise file sharing

- Cloud storage (e.g., Dropbox)

- Big data (e.g., Hadoop)

- OS-level integration

7. File Models, Access Models, and Replication

File Models:

- Unstructured: Sequence of bytes (Unix)

- Structured: Records (databases)

- Hierarchical: Directory tree

Access Models:
Distributed Systems - 15 Mark Answers

- Remote service

- Upload/download

- Caching

- Immutable

Replication:

- Primary-backup

- Multi-primary

- Read-only

- Eager vs Lazy replication

8. File Caching in DFS

Caching stores frequently accessed files at the client-side for faster access and less network usage.

Working:

- Client-side or server-side cache

- File/block-level

- Write-through or write-back

Consistency:

- Server-initiated

- Client validation

- Lease-based

Advantages:

- Faster access

- Lower server load

- Offline access

- Bandwidth savings

Disadvantages:
Distributed Systems - 15 Mark Answers

- Stale data

- Complex consistency

- Storage overhead

- Write conflicts

9. NFS vs AFS

NFS:

- Developed by Sun

- Stateless protocol

- Client-server model

- Limited consistency

AFS:

- Developed by CMU

- Whole file caching

- Callback consistency

- High scalability

Comparison:

- NFS: Block caching, less consistent

- AFS: Whole-file caching, more consistent

10. HDFS and MapReduce

HDFS:

- Master-slave (NameNode, DataNodes)

- Files split into blocks

- Replication for fault tolerance

MapReduce:

- Map: Input split into key-value pairs

- Reduce: Aggregates and outputs results


Distributed Systems - 15 Mark Answers

Comparison:

- HDFS is for storage

- MapReduce is for processing

Applications: Big data, ML preprocessing, web indexing

You might also like