Replication Control Distributed Computing With Diagrams
Replication Control Distributed Computing With Diagrams
1. Introduction
Distributed Computing involves multiple computers working together over a network to achieve a
common goal. Replication means creating copies of data, processes, or services to enhance
reliability, availability, and performance. Replication Control refers to managing these replicas to
ensure correctness and consistency.
3. Types of Replication
5. Consistency Models
- Strong Consistency: All replicas reflect the same data at all times.
- Eventual Consistency: Replicas converge to the same data over time.
- Causal Consistency: Preserves the order of causally related operations.
Architecture Overview:
- NameNode: Manages metadata and namespace.
- DataNodes: Store actual data blocks.
Replication Mechanism:
- Default replication factor is 3.
- Blocks are replicated across different DataNodes and racks.
Architecture Overview:
- Master Server: Manages metadata and chunk locations.
- ChunkServers: Store data chunks.
Replication Mechanism:
- Each chunk is replicated across multiple ChunkServers.
- One replica is designated as primary.
9. Conclusion
Replication control is vital in distributed computing to ensure data reliability, availability, and
consistency. By implementing effective replication strategies, systems can achieve high
performance and resilience.