Explain The Update Consistency - Update (Write-Write Conflict), Read (Read-Write Conflict) With An Example and A Neat Diagram

Uploaded by

Vijaylaxmi Patil

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views6 pages

Explain The Update Consistency - Update (Write-Write Conflict), Read (Read-Write Conflict) With An Example and A Neat Diagram

Uploaded by

Vijaylaxmi Patil

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

1.

Explain the update consistency – update (write-write conflict), read (read-write

conflict) with an example and a neat diagram.
Update (Write-Write Conflict): This occurs when two users attempt to update the same data item
simultaneously. For example, if Martin and Pramod both try to update a phone number on a company
website at the same time, they may use different formats, leading to a write-write conflict. The system
must resolve this conflict, often by allowing one update to succeed while the other fails or is queued
for later resolution

Example: Martin and Pramod both update a contact number on a company website. Martin changes
it to "123-456-7890," while Pramod updates it to "987-654-3210." If the server processes Martin's
update first, the final stored value may be "987-654-3210," causing Martin's update to be lost.

Read (Read-Write Conflict): This happens when one user reads data while another user is updating it.
For instance, if Martin reads the old phone number while Pramod is updating it, he may not see the
latest information. This can lead to inconsistencies in what users perceive as the current data

Example: If Martin reads the phone number while Pramod is updating it, he may see the old number
"123-456-7890." This outdated information can lead to incorrect decisions based on stale data.

2.Define Quorums and explain read and write quorum with examples
A quorum is a subset of nodes in a distributed system that must agree on a read or write
operation to ensure strong consistency.
The concept is crucial when dealing with replicated data across multiple nodes, as it helps
avoid inconsistencies that can arise from concurrent operations.
Write Quorum
A write quorum is the minimum number of nodes that must acknowledge a write
operation for it to be considered successful.
For example, if data is replicated across three nodes (N = 3), a write quorum (W) of 2
means that at least two nodes must confirm the write. This can be expressed as W > N/2,
ensuring that a majority of nodes have the latest data.
If two nodes acknowledge the write while one does not, the system can still maintain
consistency, as the majority has agreed on the new value .
Read Quorum
A read quorum is the minimum number of nodes that must be contacted to ensure that
the most recent write is read.
Continuing with the previous example, if the write quorum is W = 2, then a read quorum
(R) of 2 is also required to guarantee that the latest data is retrieved. This can be expressed
as R + W > N.
If a read operation contacts only one node while the write quorum was not met, it may
read stale data. However, if it contacts two nodes, it can ensure that it retrieves the most
up-to-date information .
Example Scenario
Consider a system with three nodes (A, B, and C) where:
A write operation is performed, and nodes A and B acknowledge the write (W = 2).
For a subsequent read operation, if nodes A and C are contacted (R = 2), the read
will return the latest data since the write quorum was met.

3.Define Version Stamps. List and explain the approaches through which version stamps
can be constructed for single source models.
Version stamps are mechanisms used to track changes in data records, ensuring that updates
are based on the most current information. They help prevent conflicts in multi-user
environments by indicating the version of a record at any given time. When a record is
updated, its version stamp changes, allowing systems to verify whether the data being
modified is up-to-date.
Approaches to Construct Version Stamps for Single Source Models
1. Counter-Based Version Stamps:
• Each time a record is updated, a counter is incremented.
• This approach is straightforward and allows easy comparison of versions; a
higher counter indicates a more recent update.
• However, it requires a single authoritative source to manage the counter to
avoid duplication [1].
2. GUID (Globally Unique Identifier):
• A GUID is a large random number that is unique across different systems.
• It can be generated by any node, eliminating the risk of duplication.
• The downside is that GUIDs are large and cannot be directly compared for
recency, making it difficult to determine which version is newer [1].
3. Content Hashing:
• This method involves creating a hash of the contents of the resource.
• A sufficiently large hash key size can ensure global uniqueness and can be
generated by anyone.
• While deterministic (the same content will always produce the same hash), it
cannot be directly compared for recency [1].
4. Timestamp-Based Version Stamps:
• This approach uses the timestamp of the last update to indicate the version.
• Timestamps are relatively short and can be directly compared to determine
which version is more recent.
• However, it requires synchronized clocks across multiple machines to avoid
issues with data corruption due to clock discrepancies [1].
5. Composite Version Stamps:
• A combination of the above methods can be used to create a composite version
stamp.
• For example, using both a counter and a content hash can help in identifying
conflicts while allowing for recentness comparison.
• This method is particularly useful in systems that require high availability and
consistency, such as peer-to-peer replication systems

4.Explain map-reduce with example

Map-Reduce is a programming model designed for processing large datasets by distributing the work
across multiple machines in a cluster. It consists of two main functions: the Map function and
the Reduce function. Here’s a breakdown of how it works, along with an example.
How Map-Reduce Works
• Map Function:
• The map function reads records from a dataset and emits key-value pairs. For instance,
if we have a list of orders, the map function might extract details like product name
and quantity, emitting pairs such as (product_name, quantity) for each order .
• Shuffle and Sort:
• After the map phase, the framework groups all emitted key-value pairs by key. This
means all values associated with the same key are collected together, preparing them
for the reduce phase .
• Reduce Function:
• The reduce function takes these grouped key-value pairs and combines them to
produce a final result. For example, if the key is a product name, the reduce function
could sum the quantities to find the total number of orders for that product .
5.Explain the partitioning and combining stages with examples

Partitioning is the process of dividing the output of the map function into different
segments or partitions. Each partition contains key-value pairs that will be sent to a specific
reducer. The goal is to ensure that all data for the same key is grouped together in one
partition so it can be processed by a single reducer .
Example:
Consider a scenario where we have the following key-value pairs emitted by the map
function:
(Product A, 2)
(Product B, 1)
(Product A, 3)
(Product C, 4)
If we have two reducers, the partitioning might look like this:
Reducer 1: (Product A, 2), (Product A, 3)
Reducer 2: (Product B, 1), (Product C, 4)
Here, all entries for Product A are sent to Reducer 1, while Products B and C go to Reducer
2. This allows each reducer to work on its own set of keys in parallel, improving processing
speed .
Combining Stage
Definition:
The combining stage is an optional step that occurs before the data is sent to the reducers.
A combiner function can be used to combine all values for the same key within each
partition. This helps reduce the amount of data that needs to be transferred across the
network, making the process more efficient .
Example:
Using the same key-value pairs from the previous example, if we apply a combiner function
that sums the quantities for each product, the output before sending to the reducers
might look like this:
(Product A, 5) // Combined from (Product A, 2) and (Product A, 3)
(Product B, 1)
(Product C, 4)
This means that instead of sending multiple entries for Product A to the reducer, we only
send a single entry with the total quantity. This reduces the amount of data transferred
and speeds up the overall process .

6.Explain two stages of map-reduce with a neat diagram

7.What are Key-value stores? List out some popular key value database.

Key-value stores are a type of NoSQL database that uses a simple data model to store data as a
collection of key-value pairs. Each key is unique and acts as an identifier for the associated value, which
can be a simple data type or a more complex data structure. This model is akin to a hash table, where
the key is the index, and the value is the data being stored
Popular Key-Value Databases
• Redis:An in-memory data structure store, often used as a database, cache, and message
broker. It supports various data structures such as strings, hashes, lists, sets, and more.
• Amazon DynamoDB:A fully managed NoSQL database service that provides fast and
predictable performance with seamless scalability. It is designed for high availability and
durability.
• Riak:A distributed NoSQL database that offers high availability, fault tolerance, and scalability.
It is designed to handle large amounts of data across many servers.
• Cassandra:While primarily a wide-column store, it can also function as a key-value store. It is
known for its high availability and scalability, making it suitable for handling large datasets
across multiple nodes.
• Berkeley DB:A high-performance embedded database that provides a key-value store
interface. It is often used in applications requiring fast data access.
• LevelDB:A fast key-value storage library written at Google that provides an ordered mapping
from string keys to string values. It is designed for high performance and efficiency.

Final - Module-4 Cloud Computing - May 8, 2023
No ratings yet
Final - Module-4 Cloud Computing - May 8, 2023
88 pages
VH GR-7 Mathematics T1 Sample-QP
100% (2)
VH GR-7 Mathematics T1 Sample-QP
6 pages
NoSQL - Unit2 - PPT
No ratings yet
NoSQL - Unit2 - PPT
24 pages
Understanding Inputs and Outputs of Mapreduce
No ratings yet
Understanding Inputs and Outputs of Mapreduce
13 pages
NGD Mini Notes
No ratings yet
NGD Mini Notes
7 pages
Ch02 - Big Data Storage Concepts
No ratings yet
Ch02 - Big Data Storage Concepts
23 pages
Nosql Module 2
100% (1)
Nosql Module 2
87 pages
Module 3
No ratings yet
Module 3
79 pages
4 - Key-Value Stores
No ratings yet
4 - Key-Value Stores
47 pages
DRKP Module 3
No ratings yet
DRKP Module 3
44 pages
Ch02a Mapreduce
No ratings yet
Ch02a Mapreduce
53 pages
3 Key Value
No ratings yet
3 Key Value
32 pages
Hadoop: A Seminar Report On
No ratings yet
Hadoop: A Seminar Report On
28 pages
DS CH6 - Consistency and Replication
No ratings yet
DS CH6 - Consistency and Replication
18 pages
L19 Mod6 ReplicationPartitioning P2
No ratings yet
L19 Mod6 ReplicationPartitioning P2
27 pages
Nosql Data Management
No ratings yet
Nosql Data Management
13 pages
Nosql Qbsol Ia-02
No ratings yet
Nosql Qbsol Ia-02
18 pages
Da Unit 5 Data Analytics
No ratings yet
Da Unit 5 Data Analytics
43 pages
No SQL
No ratings yet
No SQL
12 pages
3 Module NOSQL Preparation
No ratings yet
3 Module NOSQL Preparation
12 pages
Slides
No ratings yet
Slides
31 pages
Cloud 4 Unit
No ratings yet
Cloud 4 Unit
26 pages
Audit Objectives Procedures Evidences and Documentation
100% (4)
Audit Objectives Procedures Evidences and Documentation
35 pages
Nosql 1
No ratings yet
Nosql 1
40 pages
Lecture 27
No ratings yet
Lecture 27
19 pages
Mod 2 Continue Edited
No ratings yet
Mod 2 Continue Edited
5 pages
Dynamo: Amazon's Highly Available Key-Value Store
No ratings yet
Dynamo: Amazon's Highly Available Key-Value Store
21 pages
Uppen FP Series FP 2400Q Service Manual
No ratings yet
Uppen FP Series FP 2400Q Service Manual
47 pages
Unit 5 NOSQL
No ratings yet
Unit 5 NOSQL
102 pages
Lesson 2 A Review of Hadoop
No ratings yet
Lesson 2 A Review of Hadoop
6 pages
Gcru 2 Nosql
No ratings yet
Gcru 2 Nosql
52 pages
Notes NoSQL Module 2 Leason 5
No ratings yet
Notes NoSQL Module 2 Leason 5
6 pages
Consistency
No ratings yet
Consistency
42 pages
Module - 2 - Introduction To Hadoop
No ratings yet
Module - 2 - Introduction To Hadoop
24 pages
Module-2 NOSQL
No ratings yet
Module-2 NOSQL
5 pages
Ir MR 1
No ratings yet
Ir MR 1
34 pages
Nosql Systems: Sharding, Replication and Consistency: Riccardo Torlone Università Roma Tre
No ratings yet
Nosql Systems: Sharding, Replication and Consistency: Riccardo Torlone Università Roma Tre
28 pages
Seminar On Schedule U: Presented by
No ratings yet
Seminar On Schedule U: Presented by
21 pages
S MapReduce Types Formats Features 06
No ratings yet
S MapReduce Types Formats Features 06
26 pages
Notes NoSQL Module 2 Leason 6
No ratings yet
Notes NoSQL Module 2 Leason 6
3 pages
B. Hadoop Ecosystem - III (MapReduce)
No ratings yet
B. Hadoop Ecosystem - III (MapReduce)
55 pages
TM2 ch02 Mapreduce
No ratings yet
TM2 ch02 Mapreduce
51 pages
Sem 7 - COMP - BDA
No ratings yet
Sem 7 - COMP - BDA
16 pages
Interview Questions - Introduction To Hadoop and MapReduce Programming
No ratings yet
Interview Questions - Introduction To Hadoop and MapReduce Programming
4 pages
CC Unit-7
No ratings yet
CC Unit-7
16 pages
Unitw 12 W 2
No ratings yet
Unitw 12 W 2
18 pages
BDA Answers
No ratings yet
BDA Answers
6 pages
Map Reduce
No ratings yet
Map Reduce
42 pages
777 1651400043 BD Module 4
No ratings yet
777 1651400043 BD Module 4
21 pages
Big Data Analysis PDF 2
No ratings yet
Big Data Analysis PDF 2
18 pages
Big Data Computing
No ratings yet
Big Data Computing
36 pages
Module 2
No ratings yet
Module 2
40 pages
JLG-860SJ - en
No ratings yet
JLG-860SJ - en
142 pages
NOSQL
No ratings yet
NOSQL
23 pages
NOSQL Databases
No ratings yet
NOSQL Databases
19 pages
Introduction To Hadoop
No ratings yet
Introduction To Hadoop
37 pages
Big Data Notes (All Lectures)
No ratings yet
Big Data Notes (All Lectures)
44 pages
Module 3 Joint Arrangements
No ratings yet
Module 3 Joint Arrangements
19 pages
Consistency and Rep Contd
No ratings yet
Consistency and Rep Contd
28 pages
Map Reduce: Simplified Processing On Large Clusters
No ratings yet
Map Reduce: Simplified Processing On Large Clusters
29 pages
BIS613D Module 5 Textbook
No ratings yet
BIS613D Module 5 Textbook
9 pages
Hadoop: A Report Writing On
No ratings yet
Hadoop: A Report Writing On
13 pages
Data Engineering Unit 3
No ratings yet
Data Engineering Unit 3
4 pages
Piper Progressive Inspection 100 Hour Cycle: Cheyenne
No ratings yet
Piper Progressive Inspection 100 Hour Cycle: Cheyenne
66 pages
Dynamo: Amazon'S Highly Available Key-Value Store: Csci 8101: Advanced Operating Systems Presented By: Chaithra KN
No ratings yet
Dynamo: Amazon'S Highly Available Key-Value Store: Csci 8101: Advanced Operating Systems Presented By: Chaithra KN
23 pages
Mapreduce: Simpli - Ed Data Processing On Large Clusters
No ratings yet
Mapreduce: Simpli - Ed Data Processing On Large Clusters
4 pages
Arcs and Inscribed Angle
No ratings yet
Arcs and Inscribed Angle
29 pages
Financial Modelling PDF
No ratings yet
Financial Modelling PDF
2 pages
Sbi General Set PPT 2012
No ratings yet
Sbi General Set PPT 2012
20 pages
Worksheet (AS)
No ratings yet
Worksheet (AS)
4 pages
Operation & Service Manual For Cable Tensiometer: Series
No ratings yet
Operation & Service Manual For Cable Tensiometer: Series
28 pages
8BVI0055HWDS.000-1 en
No ratings yet
8BVI0055HWDS.000-1 en
10 pages
Pandas Viva Questions
No ratings yet
Pandas Viva Questions
23 pages
Chemical Burn
No ratings yet
Chemical Burn
32 pages
792224FT-371008 - 207-00011
No ratings yet
792224FT-371008 - 207-00011
13 pages
GSTR1 Excel Workbook Template V1.4
No ratings yet
GSTR1 Excel Workbook Template V1.4
84 pages
Alternative Binder Systems For Lower Carbon Concrete Code of Practice
No ratings yet
Alternative Binder Systems For Lower Carbon Concrete Code of Practice
8 pages
Impro New 2.7 Preview
No ratings yet
Impro New 2.7 Preview
24 pages
UPD 5.5.0.12834 ReleaseNotes
No ratings yet
UPD 5.5.0.12834 ReleaseNotes
9 pages
Ug II New Sem 2024 Time Table
No ratings yet
Ug II New Sem 2024 Time Table
4 pages
9th Major-4 English NCERT Paper Zdyxcq
No ratings yet
9th Major-4 English NCERT Paper Zdyxcq
7 pages
Bab III
No ratings yet
Bab III
22 pages
Designation
No ratings yet
Designation
12 pages
60 Seconds: It Only Takes Up 60 Seconds For A Person To Fall in Love
No ratings yet
60 Seconds: It Only Takes Up 60 Seconds For A Person To Fall in Love
40 pages
Chapter-5 Push and Pull Model
No ratings yet
Chapter-5 Push and Pull Model
12 pages
Updated CV Hrithik Mhatre
No ratings yet
Updated CV Hrithik Mhatre
2 pages
Loading XL Sheet
No ratings yet
Loading XL Sheet
9 pages
Analysis2 Final Exam 2022 PDF
No ratings yet
Analysis2 Final Exam 2022 PDF
3 pages
Emergency Cart Checklist
No ratings yet
Emergency Cart Checklist
1 page
Administering Microsoft Azure SQL Solutions DP 300
From Everand
Administering Microsoft Azure SQL Solutions DP 300
Manish Soni
No ratings yet

Explain The Update Consistency - Update (Write-Write Conflict), Read (Read-Write Conflict) With An Example and A Neat Diagram

Uploaded by

Explain The Update Consistency - Update (Write-Write Conflict), Read (Read-Write Conflict) With An Example and A Neat Diagram

Uploaded by

1.

Explain the update consistency – update (write-write conflict), read (read-write

4.Explain map-reduce with example

6.Explain two stages of map-reduce with a neat diagram

You might also like