Unit 4: Distributed Systems

Distributed File Systems (DFS)

Distributed File Systems allow files to be accessed and managed across multiple locations,
balancing load and providing fault tolerance. Here’s an in-depth look at the key concepts.
File Models in DFS
1. Client-Server Model:
In this model, a central server or group of servers stores files, and clients connect to these
servers to access files.
Example: Network File System (NFS), a widely used DFS where clients mount file systems
remotely and access them as if they were local. NFS servers maintain file storage, while
clients access these resources using protocols such as RPC (Remote Procedure Call).
2. Cluster-Based Systems:
These systems distribute file storage and processing across clusters of servers or nodes,
allowing data to be spread across multiple machines to balance load and provide redundancy.
Example: Google File System (GFS) splits files into chunks, which are stored across
different machines, improving fault tolerance and performance by allowing concurrent
processing on different parts of a file.
3. Symmetric Model:
• In a symmetric DFS, every node can act as both a client and a server, decentralizing
file access and management.
• Example: Hadoop Distributed File System (HDFS) distributes data and tasks
across multiple nodes in a cluster, providing resilience and high throughput for big
data workloads (note, though, that HDFS still relies on a central NameNode for
metadata, so it is not fully symmetric).

4. NFS (Network File System):
• NFS enables users to access files on remote systems as if they were local, using a
client-server model.
• NFS relies on remote procedure calls (RPC) to request services on remote file
systems, allowing for centralized file management across distributed systems (the
RPC idea is sketched below).
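The RPC idea behind NFS can be illustrated with a small sketch. This is not the real NFS/ONC-RPC protocol; it uses Python's standard xmlrpc modules instead, and the export directory, port, and host name are hypothetical.

# server.py - sketch of RPC-style remote file access (illustrative only;
# real NFS speaks ONC RPC, not XML-RPC). Path and port are hypothetical.
from xmlrpc.server import SimpleXMLRPCServer

EXPORT_DIR = "/srv/export"  # directory this "server" shares

def read_file(relative_path):
    # Return the contents of a file under the exported directory.
    with open(f"{EXPORT_DIR}/{relative_path}") as f:
        return f.read()

server = SimpleXMLRPCServer(("0.0.0.0", 8000))
server.register_function(read_file, "read_file")
server.serve_forever()

On the client side, the remote read looks like an ordinary local call:

# client.py - the remote call is wrapped to look like a local function call.
import xmlrpc.client

fs = xmlrpc.client.ServerProxy("http://fileserver:8000")  # hypothetical host
print(fs.read_file("notes/unit4.txt"))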
Naming and Automounting
• Naming: Naming in DFS involves unique identification of files across nodes. It
can be hierarchical (similar to a file path or URL structure) or flat. Hierarchical
naming is usually preferred, allowing clear organization of files across locations.
• Automounting: DFS implementations often use automounting to map remote
directories onto the local file system on demand, when files are first accessed,
reducing the overhead of mounting directories manually (a small sketch follows).
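A minimal sketch of automount-style name resolution, assuming a hypothetical mount table that maps local path prefixes to remote exports; the remote directory is "mounted" (here, merely recorded) the first time a path under it is accessed.

# Sketch of lazy, automount-style name resolution. The mount table and
# server names are hypothetical.
MOUNT_MAP = {                      # local prefix -> remote location
    "/remote/home": "fileserver1:/export/home",
    "/remote/data": "fileserver2:/export/data",
}
mounted = set()                    # prefixes already mounted

def resolve(path):
    for prefix, target in MOUNT_MAP.items():
        if path.startswith(prefix):
            if prefix not in mounted:      # mount lazily on first access
                print(f"automounting {target} at {prefix}")
                mounted.add(prefix)
            return target + path[len(prefix):]
    raise FileNotFoundError(path)

print(resolve("/remote/home/sachin/notes.txt"))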
File Sharing and Replication
• File Sharing: Allows multiple users to access files concurrently. Synchronization
mechanisms, such as file locking, versioning, and conflict resolution strategies,
are used to manage concurrent access.
• Replication: Files are copied across multiple nodes or data centers, improving
availability and fault tolerance. For example, HDFS replicates data blocks across
multiple nodes, so data remains accessible even if one node fails (a minimal
write-all sketch follows).
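A minimal write-all replication sketch. Replicas are modelled as in-memory dictionaries standing in for storage nodes; a real DFS such as HDFS replicates fixed-size blocks over the network and tracks replica placement in metadata.

# Sketch of simple replication: each write goes to every replica, and a
# read succeeds as long as at least one replica still holds the file.
replicas = [{}, {}, {}]            # three replica "nodes"

def write(name, data):
    for node in replicas:          # write-all strategy
        node[name] = data

def read(name):
    for node in replicas:          # first reachable replica wins
        if name in node:
            return node[name]
    raise FileNotFoundError(name)

write("report.txt", b"quarterly numbers")
replicas[0].clear()                # simulate one node failing
print(read("report.txt"))          # still served by a surviving replica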
Peer-to-Peer (P2P) Systems
• P2P systems remove the central server, and each node functions as both a client
and a server, allowing direct sharing of files.
Example: BitTorrent uses P2P file sharing, allowing peers to exchange file chunks
directly, reducing the load on any single server and enabling scalability (the
chunk-exchange idea is sketched below).
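A sketch of the chunk-exchange idea, not the actual BitTorrent protocol: the file is split into fixed-size chunks, hypothetical peers each hold a subset, and the downloader reassembles the chunks in order.

# Sketch of P2P chunk exchange (illustrative only, not BitTorrent itself).
CHUNK_SIZE = 4
original = b"distributed fs!!"
chunks = [original[i:i + CHUNK_SIZE] for i in range(0, len(original), CHUNK_SIZE)]

# Hypothetical peers, each holding only some chunk indices.
peers = {
    "peer_a": {0: chunks[0], 2: chunks[2]},
    "peer_b": {1: chunks[1], 3: chunks[3]},
}

def download(num_chunks):
    assembled = {}
    for holdings in peers.values():        # fetch from whichever peer has it
        assembled.update(holdings)
    return b"".join(assembled[i] for i in range(num_chunks))

assert download(len(chunks)) == original   # file rebuilt from many peers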
Byzantine Failures
• Byzantine Failures refer to situations where nodes may fail arbitrarily, even
maliciously.
• Byzantine Fault Tolerance (BFT) techniques, such as PBFT (Practical Byzantine
Fault Tolerance), allow a system to continue operating correctly even when some
nodes exhibit faulty or malicious behaviour (the client-side voting idea is sketched below).
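A sketch of only the client-side voting step used in BFT reads: a value is accepted once at least f + 1 replicas report it, since at most f replicas can lie. Full PBFT additionally requires 3f + 1 replicas and a multi-phase agreement protocol, which is not shown here.

# Sketch of client-side voting over replica replies.
from collections import Counter

def accept_reply(replies, f):
    # Return the value backed by at least f + 1 replicas, else None.
    value, count = Counter(replies).most_common(1)[0]
    return value if count >= f + 1 else None

# Four replicas (3f + 1 with f = 1); one replica answers maliciously.
replies = ["balance=100", "balance=100", "balance=100", "balance=999"]
print(accept_reply(replies, f=1))    # -> "balance=100"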
Security and Authentication

• Security in DFS involves protecting data and ensuring only authorized access. This often
involves access control lists, encryption, and secure protocols (e.g., Kerberos
authentication and SSL encryption).
• Authentication, such as using tokens or certificates, ensures that only verified users or
nodes can access the system (an access-control sketch follows).
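A minimal access-control-list (ACL) check sketch, with hypothetical users, paths, and permissions; a real DFS would pair such checks with authentication (e.g., Kerberos tickets) and encrypted transport.

# Sketch of an ACL check before a file operation.
ACL = {
    "/shared/report.txt": {"alice": {"read", "write"}, "bob": {"read"}},
}

def authorize(user, path, action):
    # Allow the action only if the ACL grants it to this user for this path.
    return action in ACL.get(path, {}).get(user, set())

print(authorize("bob", "/shared/report.txt", "read"))    # True
print(authorize("bob", "/shared/report.txt", "write"))   # False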
Distributed Databases
• Distributed databases manage and store data across multiple nodes, ensuring
scalability, performance, and fault tolerance.
Partitioning Types
1. Vertical Partitioning:
Splits tables by columns, allowing related attributes to be stored together.
Example: A customer database could store basic information (name, contact) on
one server, while sensitive information (credit card data) is stored separately,
enabling more secure access.
2. Horizontal Partitioning:
• Distributes tables by rows across nodes, usually based on keys (e.g., customer
region).
• Example: A user database could be split so that users from North America are
stored on one server and users from Europe on another, optimizing access based
on region (see the routing sketch after this list).
3. Hybrid Partitioning:
Combines vertical and horizontal partitioning to optimize data access and query
performance.
Example: Customer data is horizontally partitioned by region, with each region also
vertically partitioned by data types (e.g., basic info, transaction history).
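A minimal sketch of routing rows to partitions by a partition key (here, region). Node names and the region mapping are hypothetical; production systems typically use hash or range partitioning managed by the database itself.

# Sketch of horizontal (row) partitioning by a region key.
REGION_TO_NODE = {"NA": "db-na", "EU": "db-eu"}
nodes = {"db-na": [], "db-eu": []}

def insert_user(user):
    node = REGION_TO_NODE[user["region"]]   # partition key = region
    nodes[node].append(user)

insert_user({"id": 1, "name": "Aiden", "region": "NA"})
insert_user({"id": 2, "name": "Lena", "region": "EU"})
print(nodes["db-eu"])   # only European users live on db-eu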
CRUD Operations

• CRUD (Create, Read, Update, Delete) operations are fundamental to database
management. In distributed databases, handling CRUD requires efficient
mechanisms to ensure synchronization, consistency, and performance across
nodes.
Query Optimization
• Optimizing queries in distributed databases is crucial to reduce execution time and
resource usage. Techniques include:
• Data Localization: Filtering data at local nodes before transferring it.
• Join Optimization: Choosing efficient strategies for combining tables that
reside on different nodes.
• Caching: Storing frequently accessed query results locally to reduce
repetitive processing (a small cache sketch follows this list).
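A minimal query-result cache sketch; run_distributed_query is a hypothetical stand-in for the expensive cross-node execution.

# Sketch of caching query results to avoid repeated distributed execution.
query_cache = {}

def run_distributed_query(sql):
    print(f"executing across nodes: {sql}")      # expensive path
    return [("row", 1), ("row", 2)]              # dummy result

def query(sql):
    if sql not in query_cache:                   # cache miss -> run remotely
        query_cache[sql] = run_distributed_query(sql)
    return query_cache[sql]

query("SELECT * FROM orders WHERE region = 'EU'")   # executes across nodes
query("SELECT * FROM orders WHERE region = 'EU'")   # served from the cache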
Master-Slave and Peer-to-Peer Architectures

1. Master-Slave Architecture:
A central master node handles write operations, while slaves replicate data for reads.
Example: MySQL master-slave replication directs all writes to the master node and
propagates the changes to slave nodes, which serve read requests to improve read
performance (a routing sketch follows this list).
2. Peer-to-Peer Architecture:
Each node acts as an equal, capable of handling reads and writes, with data shared
across peers.
Example: Cassandra uses a peer-to-peer approach where any node can accept reads
and writes, distributing the load evenly and enhancing availability.
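A minimal sketch of read/write routing in a master-slave setup, with hypothetical node names: writes go to the master, reads rotate across replicas. Real MySQL replication additionally ships changes from the master's binary log to the slaves asynchronously, which is not modelled here.

# Sketch of master-slave request routing.
import itertools

MASTER = "db-master"
REPLICAS = itertools.cycle(["db-replica-1", "db-replica-2"])

def route(statement):
    # Send writes to the master; spread reads across the replicas.
    is_write = statement.lstrip().upper().startswith(("INSERT", "UPDATE", "DELETE"))
    return MASTER if is_write else next(REPLICAS)

print(route("INSERT INTO users VALUES (1, 'Sachin')"))   # -> db-master
print(route("SELECT * FROM users"))                      # -> db-replica-1
print(route("SELECT * FROM users"))                      # -> db-replica-2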
CAP Theorem
• The CAP theorem states that a distributed system can provide at most two of the
following three guarantees simultaneously:
1. Consistency: Every read sees the most recent write, so all nodes appear to hold the
same data at the same time.
2. Availability: Every request receives a response, even when some nodes fail.
3. Partition Tolerance: The system remains functional even if network partitions
cause a loss of connectivity between some nodes.
• Distributed databases therefore make trade-offs between these properties. For
instance:
• AP Systems (e.g., DynamoDB) prioritize availability and partition tolerance but
may provide only eventual consistency.
• CP Systems (e.g., HBase) emphasize consistency and partition tolerance but may
experience reduced availability during network partitions.
Distributed Web Systems

• Distributed web systems ensure seamless client-server interactions across a
distributed architecture, optimizing performance, security, and scalability.
Web Clients and Servers
1. Web Clients: Web clients, typically browsers, make requests to servers using
HTTP to fetch or interact with resources (e.g., HTML pages, JSON data).
2. Web Servers: Servers handle these requests, providing or managing data,
application logic, and other resources.
HTTP Connections and Methods
1. HTTP Connections:
• HTTP/1.1 introduced persistent connections, allowing multiple requests to be sent
over a single connection to reduce latency (illustrated in the sketch after this list).
2. HTTP Methods:
• GET: Retrieves data from the server.
• POST: Submits data to the server for processing.
• PUT: Creates or replaces a resource at a given URI (commonly used for updates).
• DELETE: Removes data from the server.
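A small sketch of the four methods issued over one persistent connection using the third-party requests library; httpbin.org is a public echo service used here only as a convenient test endpoint.

# Sketch of GET/POST/PUT/DELETE over a single reused connection.
import requests

with requests.Session() as session:          # Session reuses the TCP connection
    base = "https://httpbin.org"
    print(session.get(f"{base}/get", params={"id": 1}).status_code)
    print(session.post(f"{base}/post", json={"name": "Sachin"}).status_code)
    print(session.put(f"{base}/put", json={"name": "Sachin K."}).status_code)
    print(session.delete(f"{base}/delete").status_code)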
Messaging and SOAP
• Messaging: Distributed web systems rely on messaging conventions to manage
communication between services, with REST (an architectural style built on plain
HTTP) and SOAP being the two most common approaches.
• SOAP: SOAP is an XML-based messaging protocol for exchanging structured
information across networks, often used in distributed web services that require
strict standards and security (a request sketch follows).
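A sketch of what a SOAP call looks like on the wire: an XML envelope posted over HTTP. The service URL, the operation namespace, and the GetPrice operation are hypothetical; real services describe these details in a WSDL document.

# Sketch of posting a SOAP 1.1 envelope with the requests library.
import requests

envelope = """<?xml version="1.0"?>
<soap:Envelope xmlns:soap="http://schemas.xmlsoap.org/soap/envelope/">
  <soap:Body>
    <GetPrice xmlns="http://example.com/stock">
      <Symbol>INFY</Symbol>
    </GetPrice>
  </soap:Body>
</soap:Envelope>"""

response = requests.post(
    "http://example.com/stock-service",          # hypothetical endpoint
    data=envelope,
    headers={"Content-Type": "text/xml; charset=utf-8"},
)
print(response.status_code)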
Naming and Proxy Caching

• Naming: Distributed systems use unique names, typically URLs or URIs, to
identify resources.
• Proxy Caching: Proxy servers cache frequently accessed resources, reducing
latency for users and load on origin servers. The caching happens close to the user,
as in a Content Delivery Network (CDN); a small TTL-cache sketch follows.
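A minimal proxy-cache sketch with a fixed time-to-live (TTL); fetch_from_origin is a hypothetical stand-in for the request to the origin server.

# Sketch of a proxy cache: responses are served from memory until they expire.
import time

TTL_SECONDS = 60
cache = {}                       # url -> (expiry_time, body)

def fetch_from_origin(url):
    print(f"cache miss, contacting origin for {url}")
    return f"<html>content of {url}</html>"

def get(url):
    expiry, body = cache.get(url, (0, None))
    if time.time() >= expiry:                       # missing or expired
        body = fetch_from_origin(url)
        cache[url] = (time.time() + TTL_SECONDS, body)
    return body

get("http://example.com/index.html")   # goes to the origin
get("http://example.com/index.html")   # served from the proxy cache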

Replication
• Replication in distributed web systems enhances availability and reliability by
mirroring resources across multiple servers.
Example: Content Delivery Networks (CDNs) replicate web content globally,
allowing users to access content from a server near their location.
Security in Distributed Web Systems

• Security mechanisms include HTTPS (for secure HTTP connections),
authentication (e.g., OAuth, JWT), and encryption protocols to protect data and
user interactions.
• Authentication: Methods like OAuth, JWT tokens, and Single Sign-On (SSO)
validate user identities, while encryption safeguards sensitive data (a token-signing
sketch follows).
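A sketch of the HMAC-signing idea that JWTs are built on, using only the Python standard library; the secret key and payload are hypothetical, and a real deployment would use a proper JWT library over HTTPS.

# Sketch of issuing and verifying an HMAC-signed token.
import base64, hashlib, hmac, json

SECRET = b"server-side-secret"   # hypothetical secret, kept on the server

def issue_token(payload):
    # Encode the payload and sign it so clients cannot tamper with it.
    body = base64.urlsafe_b64encode(json.dumps(payload).encode()).decode()
    signature = hmac.new(SECRET, body.encode(), hashlib.sha256).hexdigest()
    return f"{body}.{signature}"

def verify_token(token):
    # Recompute the signature; reject the token if it does not match.
    body, signature = token.rsplit(".", 1)
    expected = hmac.new(SECRET, body.encode(), hashlib.sha256).hexdigest()
    if not hmac.compare_digest(signature, expected):
        return None
    return json.loads(base64.urlsafe_b64decode(body))

token = issue_token({"user": "sachin", "role": "student"})
print(verify_token(token))                        # valid token -> payload
tampered = token[:-1] + ("0" if token[-1] != "0" else "1")
print(verify_token(tampered))                     # altered signature -> None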
