The document discusses replication control in distributed and cloud computing, focusing on how to manage operations across multiple servers. It highlights the importance of replication for fault tolerance, load balancing, and availability, while also addressing challenges like replication transparency and consistency. Additionally, it covers transaction management in distributed systems, including the one-phase and two-phase commit protocols, and introduces the Paxos algorithm for achieving consensus in atomic commits.

CSE-813(Distributed and Cloud Computing)

Dr. Atiqur Rahman
Ph.D. (CQUPT, China), M.S. Engg. (CU), B.Sc. (CU)
Associate Professor
Department of Computer Science and Engineering
University of Chittagong

Lecture 5: Replication Control


Server-side Focus

• Concurrency Control = how to coordinate multiple concurrent clients executing
operations (or transactions) with a server

Next:
• Replication Control = how to handle operations (or transactions) when objects
are stored at multiple servers, with or without replication
Replication: What and Why

• Replication = An object has identical copies, each maintained by a separate server
– Copies are called “replicas”
• Why replication?
– Fault-tolerance: With k replicas of each object, can tolerate failure of any (k−1) servers in
the system
– Load balancing: Spread read/write operations out over the k replicas => load lowered by a
factor of k compared to a single replica
– Replication => Higher Availability
Availability

• If each server is down a fraction f of the time
– f = server’s failure probability
• With no replication, availability of object
= Probability that the single copy is up
= 1 − f
• With k replicas, availability of object
= Probability that at least one replica is up
= 1 − Probability that all k replicas are down
= 1 − f^k
Nines Availability

• With no replication, availability of object = 1 − f
• With k replicas, availability of object = 1 − f^k
Availability Table
f = failure probability | No replication | k=3 replicas | k=5 replicas
0.1                     | 90%            | 99.9%        | 99.999%
0.05                    | 95%            | 99.9875%     | ~7 Nines
0.01                    | 99%            | 99.9999%     | 10 Nines
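The formula above can be checked numerically. The short Python sketch below (the helper name `availability` is my own, not from the lecture) reproduces the table’s entries:

```python
# Availability of a replicated object, per the slides:
# a single copy is up with probability 1 - f; with k replicas,
# at least one replica is up with probability 1 - f**k.

def availability(f: float, k: int = 1) -> float:
    """Probability that at least one of k replicas is up."""
    return 1.0 - f ** k

# Reproduce the table: rows are f, columns are k = 1, 3, 5 replicas.
for f in (0.1, 0.05, 0.01):
    row = [f"{availability(f, k):.7%}" for k in (1, 3, 5)]
    print(f"f={f}: " + ", ".join(row))
```

Note how each extra replica multiplies the unavailability by another factor of f, which is why the number of “nines” grows so quickly with k.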
What’s the Catch?

• Challenge is to maintain two properties
1. Replication Transparency
– A client ought not to be aware of multiple copies of objects existing on the server side
2. Replication Consistency
– All clients see a single consistent copy of the data, in spite of replication
– For transactions, guarantee ACID (atomicity, consistency, isolation, and durability)
Replication Transparency
[Diagram: Clients send requests to front ends, which provide replication
transparency by forwarding the requests to Replicas 1–3 of an object O;
replies flow in the opposite direction.]
Replication Consistency

• Two ways to forward updates from front-ends (FEs) to the replica group
– Passive Replication: uses a primary replica (master)
– Active Replication: treats all replicas identically
• Both approaches use the concept of “Replicated State Machines”
– Each replica runs the same state machine code
– Multiple copies of the same state machine, begun in the Start state and receiving the
same inputs in the same order, will arrive at the same state, having generated the same
outputs. [Schneider 1990]
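The replicated-state-machine property can be illustrated with a toy deterministic machine (the counter machine and its operation names below are illustrative, not from the lecture):

```python
# Sketch of the replicated-state-machine idea [Schneider 1990]:
# deterministic replicas that start in the same state and apply the
# same inputs in the same order end in the same state.

class CounterStateMachine:
    """A trivial deterministic state machine: an integer counter."""
    def __init__(self):
        self.state = 0          # the Start state

    def apply(self, op: str):
        if op == "inc":
            self.state += 1
        elif op == "double":
            self.state *= 2

# Two replicas receiving the same inputs in the same order...
inputs = ["inc", "inc", "double", "inc"]
r1, r2 = CounterStateMachine(), CounterStateMachine()
for op in inputs:
    r1.apply(op)
    r2.apply(op)

# ...arrive at the same state: 0 -> 1 -> 2 -> 4 -> 5.
assert r1.state == r2.state == 5
```

The determinism requirement is what makes ordering matter: swap `double` and the final `inc` and the replicas would still agree with each other, but on a different value, which is why both passive and active replication work to deliver updates in a single agreed order.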
Passive Replication

• Master => total ordering of all updates
• On master failure, run an election

[Diagram: Clients send requests through front ends to the master (the elected
leader, here Replica 2), which orders updates and forwards them to the other
replicas; replies flow in the opposite direction.]
Active Replication

[Diagram: Clients send requests through front ends, which provide replication
transparency; each request is multicast inside the replica group (Replicas
1–3); replies flow in the opposite direction.]
Active Replication Using Concepts You’ve Learnt earlier

• Can use any flavor of multicast ordering, depending on the application
– FIFO ordering
– Causal ordering
– Total ordering
– Hybrid ordering
• Total or hybrid (*-total) ordering + the replicated state machine approach
– => all replicas reflect the same sequence of updates to the object
Active Replication Using Concepts You’ve Learnt earlier (2)

• What about failures?
– Use virtual synchrony (i.e., view synchrony)
– Virtual synchrony is an interprocess message-passing (sometimes called ordered, reliable
multicast) technology. Virtual synchrony systems allow programs running in a network to
organize themselves into process groups, and to send messages to groups (as opposed to
sending them to specific processes).
• Virtual synchrony with total ordering for multicasts =>
– All replicas see all failures/joins/leaves and all multicasts in the same order
– Could also use causal (or even FIFO) ordering if the application can tolerate it
Transactions and Replication
• One-copy serializability
– A concurrent execution of transactions in a replicated database is one-copy serializable if it is
equivalent to a serial execution of these transactions over a single logical copy of the database.
– Equivalently: the effect of transactions performed by clients on replicated objects should be the
same as if they had been performed one at a time on a single set of objects (i.e., one replica per object).

• In a non-replicated system, transactions appear to be performed one at a time in some order
– Correctness means serial equivalence of transactions
• When objects are replicated, a transaction system needs, for correctness,
– Serial equivalence + one-copy serializability


Next

• Committing transactions with distributed servers


Transactions with Distributed Servers

Transaction T:
  write(A, 1);
  write(B, 2);
  ...
  write(Y, 25);
  write(Z, 26);
  commit

[Diagram: Objects A and B reside on Server 1; objects Y and Z reside on
Server 13; T touches objects spread across Servers 1–13.]
Transactions with Distributed Servers

• Transaction T may touch objects that reside on different servers


• When T tries to commit
– Need to ensure all these servers commit their updates from T => T will commit
– Or none of these servers commit => T will abort
• What problem is this?
– Consensus! (The goal of a distributed consensus algorithm is to allow a set of computers to agree on a
single value that one of the nodes proposed, as opposed to making up a random value. The challenge in a
distributed system is that messages can be lost or machines can fail.)
– It’s also called the “Atomic Commit problem”
– Atomic Commit: a “Prepare” message is sent to each participating worker by the coordinator, which must
then wait for a response from every worker.
– The problem with atomic commits is that they require coordination between multiple systems. Because
computer networks are unreliable, no algorithm can guarantee coordination with all systems, as proven by
the Two Generals Problem.
One-phase Commit

Transaction T:
  write(A, 1);
  write(B, 2);
  ...
  write(Y, 25);
  write(Z, 26);
  commit

• A special server called the “Coordinator” initiates the atomic commit
• It tells the other servers to either commit or abort

[Diagram: The Coordinator sends the commit/abort decision to Servers 1–13,
which hold objects A, B, ..., Y, Z.]
One-phase Commit: Issues

• A server holding an object has no say in whether the transaction commits or aborts
– If its object is corrupted, it simply cannot commit (while other servers may have already committed)
• A server may crash before receiving the commit message, with some updates still in memory
Two-phase Commit

[Diagram: The Coordinator server exchanges messages with Servers 1–13.]

Phase 1 (voting):
• Coordinator sends “Prepare” to all participating servers
• Each server saves its updates to disk, then responds with “Yes” or “No”

Phase 2 (decision):
• If any “No” vote arrives, or the coordinator times out before receiving all
(13) votes => coordinator sends “Abort”
• If all (13) “Yes” votes are received within the timeout => coordinator sends
“Commit”
• A server that voted “Yes” must wait! It can neither commit nor abort before
receiving the coordinator’s next message
• On “Commit”, each server applies the updates from disk to its store and
replies “OK”
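The commit/abort decision rule of two-phase commit can be sketched in-process (no real networking, disk logging, or timeouts; all class and function names below are illustrative, not from the lecture):

```python
# Minimal single-process sketch of the two-phase commit decision logic.
# A real implementation would exchange network messages, persist votes
# and decisions to disk, and handle timeouts.

class Participant:
    def __init__(self, name: str, healthy: bool = True):
        self.name = name
        self.healthy = healthy        # e.g., False if its object is corrupted
        self.saved_to_disk = False
        self.committed = False

    def prepare(self) -> str:
        """Phase 1: save tentative updates to disk, then vote Yes/No."""
        if not self.healthy:
            return "No"
        self.saved_to_disk = True
        return "Yes"

    def finish(self, decision: str):
        """Phase 2: apply the saved updates on Commit, discard on Abort."""
        if decision == "Commit" and self.saved_to_disk:
            self.committed = True

def two_phase_commit(participants) -> str:
    votes = [p.prepare() for p in participants]               # Phase 1
    decision = "Commit" if all(v == "Yes" for v in votes) else "Abort"
    for p in participants:                                    # Phase 2
        p.finish(decision)
    return decision

# All 13 servers healthy => everyone commits.
group = [Participant(f"server-{i}") for i in range(1, 14)]
assert two_phase_commit(group) == "Commit"
assert all(p.committed for p in group)

# A single "No" vote => everyone aborts.
mixed = [Participant("s1"), Participant("s2", healthy=False)]
assert two_phase_commit(mixed) == "Abort"
assert not any(p.committed for p in mixed)
```

The sketch shows why the protocol is all-or-nothing: the decision is computed once from all the votes, so no participant can commit while another aborts.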
Failures in Two-phase Commit

• If a server voted Yes, it cannot commit unilaterally before receiving the Commit message
• If a server voted No, it can abort right away (why? A single No vote already guarantees the
coordinator will decide Abort)
• To deal with server crashes
– Each server saves tentative updates into permanent storage, right before replying Yes/No in first
phase. Retrievable after crash recovery.
• To deal with coordinator crashes
– Coordinator logs all decisions and received/sent messages on disk
– After recovery or new election => new coordinator takes over
Failures in Two-phase Commit (2)

• To deal with Prepare message loss
– The server may decide to abort unilaterally after a timeout for the first phase (the server will
vote No, so the coordinator will also eventually abort)
• To deal with Yes/No message loss, the coordinator aborts the transaction after a
timeout (pessimistic!). It must then announce Abort to all servers.
• To deal with Commit or Abort message loss
– The server can poll the coordinator (repeatedly)
Using Paxos in Distributed Servers

Atomic Commit
• Can instead use Paxos to decide whether to commit a transaction or not
• But need to ensure that if any server votes No, everyone aborts

Ordering updates
• Paxos can also be used by a replica group (for an object) to order all updates,
iteratively:
– A server proposes a message for the next sequence number
– The group reaches consensus (or not)
Summary

• Multiple servers in the cloud
– Replication for fault-tolerance
– Load balancing across objects
• Replication flavors using concepts we learnt earlier
– Active replication
– Passive replication
• Transactions and distributed servers
– Two-phase commit
