Replication

Uploaded by

raghavi.s

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views16 pages

Replication

Uploaded by

raghavi.s

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 16

Replication

Presented by Shivang Kumar

Introduction
Replication means keeping a copy of the same data on
multiple machines that are connected via a network. There
are several reasons why you might want to replicate data:
1.To keep data geographically close to your users (and
thus reduce latency)
2.To allow the system to continue working even if some of
its parts have failed (and thus increase availability)
3.To scale out the number of machines that can serve read
queries (and thus increase read throughput)
Leader-Based
Replication
• Replication Basics:
⚬ Database replicas store copies of the data.
⚬ Synchronization is crucial for consistency.
• Leader-Based Replication:
⚬ One replica is the leader; others are followers.
⚬ Clients write to the leader, which updates its storage.
⚬ Leader sends changes to followers for synchronisation.
• Operations:
⚬ Reads can be from any replica.
⚬ Writes are only accepted by the leader.
• Examples:
⚬ Common in relational databases
(PostgreSQL, MySQL, Oracle).
⚬ Used in nonrelational databases
(MongoDB, RethinkDB), message
brokers (Kafka), and more.

Leader-based (master–slave)
replication
Synchronous Vs.
Asynchronous
• Configurability: Replication can occur synchronously or
asynchronously.
• Communication Flow: Figure illustrates synchronous and
asynchronous communication between leader and followers.
• Synchronous Replication: The leader waits for follower
confirmation, ensuring consistency.
• Asynchronous Replication: The leader sends data but doesn't wait
for the follower’s response, introducing a potential delay.
• Replication Lag: Asynchronous followers might experience delays,
leading to inconsistencies.
In the example, the replication to follower
1 is synchronous: the leader waits until
follower 1 has confirmed that it received
the write before reporting success to the
user, and before making the write visible
to other clients. The replication to follower
2 is asynchronous: the leader sends the
message, but doesn’t wait for a response
from the follower.
Leader-based replication with
one synchronous and one
asynchronous follower.
Setting Up New
Followers
• Snapshot Creation: Consistent snapshot of leader's database taken
without locking the entire database.
• Copy and Connect: Snapshot copied to new follower; follower
connects to leader for data changes.
• Catch-Up Process: Follower processes backlog, catching up to
leader.
• Automated or Manual: Setting up followers varies, from automated
processes to manual workflows.
Handling Node
Outages
• Follower Recovery: Follower easily recovers from crashes or
network interruptions using its log.
• Leader Failure (Failover): This tricky process involves promoting a
follower, reconfiguring clients, and transitioning other followers.
• Failover Detection: Timeout-based detection; leader assumed dead
if no response for a specified period.
• Failover Challenges: Potential issues include conflicting writes, split
brain scenarios, and determining the right timeout.
• Manual vs. Automatic Failover: Operations teams may prefer
manual failover for better control.
Implementation of
Replication Logs
• Replication Methods: Overview of statement-based, write-ahead
log shipping, logical (row-based) log replication, and trigger-based
replication.
• Compatibility Challenges: Write-ahead log shipping closely tied to
storage engine, limiting software version flexibility.
• Logical Log Advantages: Decoupled from storage engine, allowing
backward compatibility and different software versions.
• Trigger-Based Replication: Application-layer approach for flexibility
but with higher overhead and potential limitations.
Problems with
Replication Lag
• Read-Scaling Architecture: Using asynchronous replication for
read-heavy workloads with many followers.
• Eventual Consistency: Replication lag can lead to temporary
inconsistencies between leader and followers.
• Challenges with Replication Lag: Three highlighted problems:
"Reading Your Own Writes," "Cross-Device Consistency," and
"Problems with Replication Lag."
• Solutions for Read-After-Write Consistency: Various techniques,
including leader reads, time-based decisions, and timestamp
tracking.
Reading Your Own
Writes
• User Data Submission: Users can submit data like comments or
records in applications.
• Asynchronous Replication Challenge: Viewing data shortly after
writing may lead to perceived data loss.
• Eventual Consistency: Term coined by Douglas Terry, popularized
by Werner Vogels; a common goal for NoSQL projects.
• Read-After-Write Consistency: Ensures users always see updates
they submitted upon page reload.
• Implementation Techniques: Various methods, e.g., reading from
leader when user might have modified data.
In this situation, we need read-after-write
consistency, also known as read-your-
writes consistency. This guarantees that
users will always see any updates they
submit if they reload the page. It makes
no promises about other users: their
updates may only be visible later.
However, it reassures the user that their
input has been saved correctly. A user makes a write, followed
by a read from a stale replica. To
prevent this anomaly, we need
read-after-write consistency
Monotonic Reads
• Avoiding Time Reversal: Users might observe time moving
backward when reading from different replicas.
• Monotonic Reads Guarantee: Ensures users do not read older data
after previously reading newer data.
• Replica Selection: Consistent replica selection for each user,
possibly based on a hash of the user ID.
For example, Figure shows user 2345
making the same query twice, first to a
follower with little lag, then to a follower
with greater lag.
The first query returns a comment that
was recently added by user 1234, but the
second query doesn’t return anything
because the lagging follower has not yet
A user first reads from a fresh picked up that write.

replica, then from a stale replica.

Time appears to go backward. To
prevent this anomaly, we need
monotonic reads.
Solutions for
Replication Lag
• Considering Lag Impact: Understanding the impact of replication
lag on application behaviour.
• Stronger Guarantees: Designing systems for stronger guarantees,
e.g., read-after-write, when necessary.
• Application Complexity: Challenges in dealing with replication
issues in application code.
• Transaction Importance: Transactions as a way for databases to
provide stronger guarantees.
• Single-Node Transactions: Abandonment of single-node
transactions in distributed databases, with a call for a nuanced
view.
Thank You

Ch02 - Big Data Storage Concepts
No ratings yet
Ch02 - Big Data Storage Concepts
23 pages
Presentation On Input and Output Devices-2
No ratings yet
Presentation On Input and Output Devices-2
19 pages
Information Assurance and Security
No ratings yet
Information Assurance and Security
4 pages
1 - BYD ECB Electric Training
100% (1)
1 - BYD ECB Electric Training
55 pages
CSC213 Object Oriented Programming-Lab Manual-Sol
No ratings yet
CSC213 Object Oriented Programming-Lab Manual-Sol
36 pages
Ps - 4618service Training - Self Study Programme 470 - The Touareg 2011 - Electrics Electronics - Design and Function
No ratings yet
Ps - 4618service Training - Self Study Programme 470 - The Touareg 2011 - Electrics Electronics - Design and Function
56 pages
Boomi Q&A Flashcards - Quizlet
No ratings yet
Boomi Q&A Flashcards - Quizlet
11 pages
Uploadsh 046 005807 00 Passport 8 12 Service Manual (FDA) 2 0
No ratings yet
Uploadsh 046 005807 00 Passport 8 12 Service Manual (FDA) 2 0
106 pages
AZ-104 - UD - 103 Removed - PDF
100% (1)
AZ-104 - UD - 103 Removed - PDF
67 pages
740C Azure Service Manual
No ratings yet
740C Azure Service Manual
38 pages
SMOE Layout Man PDF
No ratings yet
SMOE Layout Man PDF
4 pages
VLSI Assignment 2
No ratings yet
VLSI Assignment 2
34 pages
Unit-6 Transactions & Replications Syllabus: Introduction, System Model and Group Communication, Concurrency Control in Distributed
No ratings yet
Unit-6 Transactions & Replications Syllabus: Introduction, System Model and Group Communication, Concurrency Control in Distributed
20 pages
MM Brochure - IQ7000 - MERCK - Ina
No ratings yet
MM Brochure - IQ7000 - MERCK - Ina
12 pages
100-412-215 LPX Power Supply Manual Rev. 04
No ratings yet
100-412-215 LPX Power Supply Manual Rev. 04
84 pages
KS Uftp6a 305M BL
No ratings yet
KS Uftp6a 305M BL
2 pages
2021-Impact of BIM Based Quantity Take Off For Accuracy of Cost Estimation
No ratings yet
2021-Impact of BIM Based Quantity Take Off For Accuracy of Cost Estimation
15 pages
Omnichannel Retailing
No ratings yet
Omnichannel Retailing
9 pages
Galera Cluster
100% (1)
Galera Cluster
106 pages
Consistency and Replication1
No ratings yet
Consistency and Replication1
30 pages
Introduction To Distributed Computing
No ratings yet
Introduction To Distributed Computing
57 pages
Full Stack Java Developer
No ratings yet
Full Stack Java Developer
8 pages
Nosql Module 2
100% (1)
Nosql Module 2
87 pages
Advanced Distributed Systems Replication: What Is Replication? Reasons For Replication
No ratings yet
Advanced Distributed Systems Replication: What Is Replication? Reasons For Replication
20 pages
Network Topology
No ratings yet
Network Topology
5 pages
Nosql Systems: Sharding, Replication and Consistency: Riccardo Torlone Università Roma Tre
No ratings yet
Nosql Systems: Sharding, Replication and Consistency: Riccardo Torlone Università Roma Tre
28 pages
Consistency in Distributed Systems
No ratings yet
Consistency in Distributed Systems
21 pages
Pid Temperature Controller Manual
No ratings yet
Pid Temperature Controller Manual
12 pages
BCS 413 - Lecture5 - Replication - Consistency
No ratings yet
BCS 413 - Lecture5 - Replication - Consistency
25 pages
Relational Database Management System (RDBMS)
No ratings yet
Relational Database Management System (RDBMS)
4 pages
Consistency
No ratings yet
Consistency
42 pages
AMST GCT42 D20 01 GE CT Systems 42 Site Guide
No ratings yet
AMST GCT42 D20 01 GE CT Systems 42 Site Guide
19 pages
Big Data Management and Nosql Databases: Doc. Rndr. Irena Holubova, PH.D
No ratings yet
Big Data Management and Nosql Databases: Doc. Rndr. Irena Holubova, PH.D
27 pages
Distributed Systems: Chapter 07: Consistency & Replication
No ratings yet
Distributed Systems: Chapter 07: Consistency & Replication
48 pages
Data Replication Techniques
No ratings yet
Data Replication Techniques
3 pages
Gtoli
No ratings yet
Gtoli
2 pages
Distributed System Notes
No ratings yet
Distributed System Notes
24 pages
7 Consistency
No ratings yet
7 Consistency
41 pages
Voice Assistant: Formal Assessment
No ratings yet
Voice Assistant: Formal Assessment
2 pages
6 Replication Nhom3
No ratings yet
6 Replication Nhom3
44 pages
The Case For Determinism in Database Systems: Alexander Thomson Thomson@cs - Yale.edu Daniel J. Abadi Dna@cs - Yale.edu
No ratings yet
The Case For Determinism in Database Systems: Alexander Thomson Thomson@cs - Yale.edu Daniel J. Abadi Dna@cs - Yale.edu
11 pages
Kledbetter Module1resume 0115
No ratings yet
Kledbetter Module1resume 0115
5 pages
07 Replication
No ratings yet
07 Replication
14 pages
PT Final
No ratings yet
PT Final
12 pages
Chapter 7 - Consistency and Replication
No ratings yet
Chapter 7 - Consistency and Replication
28 pages
Chapter 1 Introduction
No ratings yet
Chapter 1 Introduction
48 pages
Consensus
No ratings yet
Consensus
77 pages
RAN Sharing - New Paradigm For LTE
100% (1)
RAN Sharing - New Paradigm For LTE
10 pages
Consistency and Replication55
No ratings yet
Consistency and Replication55
17 pages
2 X 150 MW Coastal Thermal Power Project Critical Activity Schedule
No ratings yet
2 X 150 MW Coastal Thermal Power Project Critical Activity Schedule
3 pages
Availability Digest: Asynchronous Replication Engines
No ratings yet
Availability Digest: Asynchronous Replication Engines
6 pages
Ramdump Modem 2024-05-17 16-44-54 Props
No ratings yet
Ramdump Modem 2024-05-17 16-44-54 Props
21 pages
ONGC
100% (1)
ONGC
5 pages
Chapter 7 Consistency and Replication
No ratings yet
Chapter 7 Consistency and Replication
43 pages
Consistency
No ratings yet
Consistency
23 pages
Replication and Consistency in Distributed Systems (Cont'd)
No ratings yet
Replication and Consistency in Distributed Systems (Cont'd)
17 pages
DS CH6 - Consistency and Replication
No ratings yet
DS CH6 - Consistency and Replication
18 pages
Energy Saving For Better Living
No ratings yet
Energy Saving For Better Living
16 pages
Chapter 7kec
No ratings yet
Chapter 7kec
8 pages
Deepak and Deepa - Consistency - and - Replication
No ratings yet
Deepak and Deepa - Consistency - and - Replication
38 pages
L08Exercise MovieShopUMWorksheet
No ratings yet
L08Exercise MovieShopUMWorksheet
4 pages
The Log - What Every Software Engineer Should Know About Real-Time Data's Unifying Abstraction - LinkedIn Engineering
No ratings yet
The Log - What Every Software Engineer Should Know About Real-Time Data's Unifying Abstraction - LinkedIn Engineering
38 pages
Chapter 7-Consistency and Replication
No ratings yet
Chapter 7-Consistency and Replication
73 pages
DRKP Module 2 1
No ratings yet
DRKP Module 2 1
77 pages
Understanding Multi-Master Replication 2
No ratings yet
Understanding Multi-Master Replication 2
31 pages
IAU ST Lecture5
No ratings yet
IAU ST Lecture5
50 pages
Slides
No ratings yet
Slides
31 pages
Consistency and Replication
No ratings yet
Consistency and Replication
8 pages
Ds Chapter 6
No ratings yet
Ds Chapter 6
23 pages
DBMS
No ratings yet
DBMS
16 pages
Big Data Analytics Lecture 2
No ratings yet
Big Data Analytics Lecture 2
42 pages
SGDB
No ratings yet
SGDB
14 pages
Determinism in Database Systems
No ratings yet
Determinism in Database Systems
11 pages
REPLICATION
No ratings yet
REPLICATION
20 pages
Chap 5
No ratings yet
Chap 5
75 pages
Nosql 1
No ratings yet
Nosql 1
40 pages
Module 2 Nosql
No ratings yet
Module 2 Nosql
10 pages
Intro To DS Chapter 5
No ratings yet
Intro To DS Chapter 5
76 pages
Lecture 27
No ratings yet
Lecture 27
19 pages
Lecture 8
No ratings yet
Lecture 8
14 pages
Distributed Shared Memory
No ratings yet
Distributed Shared Memory
18 pages
ch07 Consistency Replication
No ratings yet
ch07 Consistency Replication
30 pages
Chapter 6-Consistency and Replication-Updated
No ratings yet
Chapter 6-Consistency and Replication-Updated
30 pages
CH-07 Replication
No ratings yet
CH-07 Replication
35 pages
Lec 3 - Basic Concepts
No ratings yet
Lec 3 - Basic Concepts
32 pages
Consistency Replication
No ratings yet
Consistency Replication
49 pages
The Log - What Every Software Engineer Should Know About Real-Time Data's Unifying Abstraction - LinkedIn Engineering
No ratings yet
The Log - What Every Software Engineer Should Know About Real-Time Data's Unifying Abstraction - LinkedIn Engineering
31 pages
Module 2
No ratings yet
Module 2
40 pages
DS Unit5
No ratings yet
DS Unit5
13 pages

Replication

Uploaded by

Replication

Uploaded by

Replication

Presented by Shivang Kumar

replica, then from a stale replica.

You might also like