Mod 2 Continue Edited

The document discusses version stamps which are used in NoSQL databases to help ensure consistency when updates occur without transactions. It describes how different types of version stamps like counters, timestamps, and GUIDs work and compares their advantages. It also covers how version stamps are handled in distributed systems with multiple nodes rather than a single server.

Uploaded by

sirisatishakadur

NOSQL Data Base

Chapter 3.
Version Stamps

Many critics of NoSQL databases focus on the lack of support for transactions. Transactions are a
useful tool that helps programmers support consistency. One reason why many NoSQL proponents
worry less about a lack of transactions is that aggregate-oriented NoSQL databases do support
atomic updates within an aggregate, and aggregates are designed so that their data forms a natural
unit of update. That said, it’s true that transactional needs are something to take into account when
you decide what database to use.
As part of this, it’s important to remember that transactions have limitations. Even within a
transactional system we still have to deal with updates that require human intervention and usually
cannot be run within transactions because they would involve holding a transaction open for too
long. We can cope with these using version stamps—which turn out to be handy in other situations
as well, particularly as we move away from the single-server distribution model.

3.1 Business and System Transactions

 The need to support update consistency without transactions is actually a common feature of
systems even when they are built on top of transactional databases.
 When users think about transactions, they usually mean business transactions. A business
transaction may be something like browsing a product catalog, choosing a bottle of Talisker at a
good price, filling in credit card information, and confirming the order.
 Yet all of this usually won’t occur within the system transaction provided by the database
because this would mean locking the database elements while the user is trying to find their credit
card and gets called off to lunch by their colleagues.
 Usually applications only begin a system transaction at the end of the interaction with the user, so
that the locks are only held for a short period of time.
 The problem, however, is that calculations and decisions may have been made based on data
that’s changed. The price list may have updated the price of the Talisker, or someone may have
updated the customer’s address, changing the shipping charges.
 The broad techniques for handling this are offline concurrency [Fowler PoEAA], useful in NoSQL
situations too.
 A particularly useful approach is the Optimistic Offline Lock [Fowler PoEAA], a form of
conditional update where a client operation rereads any information that the business transaction
relies on and checks that it hasn’t changed since it was originally read and displayed to the user.
 A good way of doing this is to ensure that records in the database contain some form of version
stamp: a field that changes every time the underlying data in the record changes. When you read
the data you keep a note of the version stamp, so that when you write data you can check to see if
the version has changed.
 You may have come across this technique with updating resources with HTTP [HTTP]. One way
of doing this is to use etags.
 Whenever you get a resource, the server responds with an etag in the header. This etag is an
opaque string that indicates the version of the resource.
 If you then update that resource, you can use a conditional update by supplying the etag that you
got from your last GET. If the resource has changed on the server, the etags won’t match and the
server will refuse the update, returning a 412 (Precondition Failed) response.
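The etag exchange described above can be sketched with a toy in-memory resource. This is only an illustration: the `Resource` class and its `get`/`put` methods are hypothetical names, not a real HTTP server or library API, and deriving the etag from a content hash is an assumption (servers are free to build etags however they like).

```python
import hashlib


class Resource:
    """A toy in-memory resource guarded by an etag (hypothetical, for illustration)."""

    def __init__(self, body: str):
        self.body = body

    def etag(self) -> str:
        # Assumption: the etag is a hash of the body; to clients it is just an opaque string.
        return hashlib.sha256(self.body.encode("utf-8")).hexdigest()

    def get(self):
        """Return (status, etag, body), roughly what a GET response carries."""
        return 200, self.etag(), self.body

    def put(self, new_body: str, if_match: str) -> int:
        """Apply the update only if the caller's etag still matches the current one."""
        if if_match != self.etag():
            return 412  # Precondition Failed: the resource changed since the caller's GET
        self.body = new_body
        return 204  # updated, no content returned


price = Resource("Talisker: 40")
_, tag, _ = price.get()
first = price.put("Talisker: 45", if_match=tag)   # accepted, tag was current
second = price.put("Talisker: 42", if_match=tag)  # refused, tag is now stale
```

The second `put` is refused because the first one changed the body, and with it the etag, after the client's last GET.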
 Some databases provide a similar mechanism of conditional update that allows you to ensure
updates won’t be based on stale data.
 You can do this check yourself, although you then have to ensure no other thread can run against
the resource between your read and your update. (Sometimes this is called a compare-and-set
(CAS) operation, whose name comes from the CAS operations done in processors. The
difference is that a processor CAS compares a value before setting it, while a database
conditional update compares a version stamp of the value.)
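A minimal sketch of such a conditional update, assuming a single-process store where a lock stands in for whatever mechanism a real database uses to keep other threads out between the check and the write. `VersionedStore` and its method names are hypothetical.

```python
import threading


class VersionedStore:
    """Toy key-value store whose records carry a counter version stamp."""

    def __init__(self):
        self._lock = threading.Lock()
        self._data = {}  # key -> (version, value)

    def read(self, key):
        """Return (version, value); the caller keeps the version for the later write."""
        with self._lock:
            return self._data.get(key, (0, None))

    def conditional_update(self, key, expected_version, new_value) -> bool:
        """Write only if the stored version still equals expected_version."""
        with self._lock:  # the comparison and the write happen as one step
            current_version, _ = self._data.get(key, (0, None))
            if current_version != expected_version:
                return False  # the record changed since it was read
            self._data[key] = (current_version + 1, new_value)
            return True
```

A caller that gets `False` back would typically reread the record, redo its decision on the fresh data, and try again.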
 There are various ways you can construct your version stamps. You can use a counter, always
incrementing it when you update the resource.
 Counters are useful since they make it easy to tell if one version is more recent than another. On
the other hand, they require the server to generate the counter value, and also need a single master
to ensure the counters aren’t duplicated.
 Another approach is to create a GUID, a large random number that is unique for all practical purposes.
These use some combination of dates, hardware information, and whatever other sources of
randomness they can pick up.
 The nice thing about GUIDs is that they can be generated by anyone and you’ll never get a
duplicate; a disadvantage is that they are large and can’t be compared directly for recentness.
 A third approach is to make a hash of the contents of the resource. With a big enough hash key
size, a content hash can be globally unique like a GUID and can also be generated by anyone; the
advantage is that they are deterministic: any node will generate the same content hash for the same
resource data.
 However, like GUIDs they can’t be directly compared for recentness, and they can be lengthy.
 A fourth approach is to use the timestamp of the last update. Like counters, they are reasonably
short and can be directly compared for recentness, yet have the advantage of not needing a single
master.
 Multiple machines can generate timestamps—but to work properly, their clocks have to be kept in
sync. One node with a bad clock can cause all sorts of data corruptions.
 There’s also a danger that if the timestamp is too coarse-grained you can get duplicates: it’s no good
using timestamps of millisecond precision if you get many updates per millisecond.
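The four kinds of stamp can be put side by side using only the Python standard library. The function names are illustrative, and the choice of SHA-256 and nanosecond timestamps is an assumption rather than something the text prescribes.

```python
import hashlib
import itertools
import time
import uuid

# A counter needs a single owner (here, this process) to stay duplicate-free.
_counter = itertools.count(1)


def counter_stamp() -> int:
    """Short and comparable for recentness, but generated in one place only."""
    return next(_counter)


def guid_stamp() -> str:
    """Can be generated anywhere, but is long and says nothing about recentness."""
    return str(uuid.uuid4())


def content_hash_stamp(data: bytes) -> str:
    """Deterministic: any node computes the same stamp for the same data."""
    return hashlib.sha256(data).hexdigest()


def timestamp_stamp() -> int:
    """Comparable without a single master, but only as good as the node's clock."""
    return time.time_ns()
```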
 You can blend the advantages of these different version stamp schemes by using more than one
of them to create a composite stamp.
 For example, CouchDB uses a combination of counter and content hash. Most of the time this
allows version stamps to be compared for recentness, even when you use peer-to-peer
replication.
 Should two peers update at the same time, the combination of the same count and different content
hashes makes it easy to spot the conflict.
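A composite stamp of this kind might look like the sketch below. It is loosely modelled on the counter-plus-hash idea and is not CouchDB's actual revision algorithm; `CompositeStamp`, `next_stamp`, and `compare_composite` are hypothetical names.

```python
import hashlib
from typing import NamedTuple


class CompositeStamp(NamedTuple):
    count: int   # incremented on every update
    digest: str  # hash of the content written by that update


def next_stamp(previous_count: int, content: bytes) -> CompositeStamp:
    """Build the stamp for an update made on top of revision previous_count."""
    return CompositeStamp(previous_count + 1, hashlib.sha256(content).hexdigest())


def compare_composite(a: CompositeStamp, b: CompositeStamp) -> str:
    """Use the counter for recentness and the hash to spot same-count conflicts."""
    if a.count != b.count:
        return "a is newer" if a.count > b.count else "b is newer"
    return "same" if a.digest == b.digest else "conflict"
```

Treating the higher count as newer only holds "most of the time": if two peers diverged and one then made several more updates, the counts differ even though the histories conflict.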
 As well as helping to avoid update conflicts, version stamps are also useful for providing session
consistency.

3.2 Version Stamps on Multiple Nodes

 The basic version stamp works well when you have a single authoritative source for data, such as
a single server or master-slave replication.
 In that case the version stamp is controlled by the master. Any slaves follow the master’s stamps.
But this system has to be enhanced in a peer-to-peer distribution model because there’s no longer
a single place to set the version stamps.
 If you’re asking two nodes for some data, you run into the chance that they may give you
different answers. If this happens, your reaction may vary depending on the cause of that
difference.
 It may be that an update has only reached one node but not the other, in which case you can
accept the latest (assuming you can tell which one that is).
 Alternatively, you may have run into an inconsistent update, in which case you need to decide
how to deal with that.
 In this situation, a simple GUID or etag won’t suffice, since these don’t tell you enough about the
relationships.
 The simplest form of version stamp is a counter. Each time a node updates the data, it increments
the counter and puts the value of the counter into the version stamp.
 If you have blue and green slave replicas of a single master, and the blue node answers with a
version stamp of 4 and the green node with 6, you know that the green’s answer is more recent.
 In multiple-master cases, we need something fancier. One approach, used by distributed version
control systems, is to ensure that all nodes contain a history of version stamps.
 That way you can see if the blue node’s answer is an ancestor of the green’s answer. This would
either require the clients to hold onto version stamp histories, or the server nodes to keep version
stamp histories and include them when asked for data.
 This also detects an inconsistency, which we would see if we get two version stamps and neither
of them has the other in their histories.
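The ancestor check can be sketched as follows, assuming each answer arrives with its history as a list of stamps, oldest first, with the current stamp last. The function name and the stamp values are made up for illustration.

```python
def relate(history_a, history_b) -> str:
    """Compare two answers by looking for each current stamp in the other's history."""
    head_a, head_b = history_a[-1], history_b[-1]
    if head_a == head_b:
        return "same"
    if head_a in history_b:
        return "a is an ancestor of b"  # b has seen a's version and moved on
    if head_b in history_a:
        return "b is an ancestor of a"
    return "inconsistent"  # neither answer has the other in its history


blue = ["s1", "s2"]
green = ["s1", "s2", "s3"]
black = ["s1", "s2", "s4"]
```

Here `relate(blue, green)` reports that blue is an ancestor of green, while `relate(green, black)` reports an inconsistency because both diverged from `s2`.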
 Although version control systems keep these kinds of histories, they aren’t found in NoSQL
databases.
 A simple but problematic approach is to use timestamps. The main problem here is that it’s usually
difficult to ensure that all the nodes have a consistent notion of time, particularly if updates can
happen rapidly. Should a node’s clock get out of sync, it can cause all sorts of trouble.
 In addition, you can’t detect write-write conflicts with timestamps, so it would only work well for
the single-master case—and then a counter is usually better.
 The most common approach used by peer-to-peer NoSQL systems is a special form of version
stamp which we call a vector stamp.
 In essence, a vector stamp is a set of counters, one for each node. A vector stamp for three nodes
(blue, green, black) would look something like [blue: 43, green: 54, black: 12].
 Each time a node has an internal update, it updates its own counter, so an update in the green
node would change the vector to [blue: 43, green: 55, black: 12].
 Whenever two nodes communicate, they synchronize their vector stamps. There are several
variations of exactly how this synchronization is done.
 We’re coining the term “vector stamp” as a general term in this book; you’ll also come across
vector clocks and version vectors—these are specific forms of vector stamps that differ in how
they synchronize.
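As a rough sketch, a vector stamp can be held as a mapping from node name to counter. The text leaves the synchronization rule open, so the element-wise maximum used here is an assumption: one common variation, not the only one. Function names are illustrative.

```python
def local_update(stamp: dict, node: str) -> dict:
    """Return a copy of the stamp with the updating node's own counter incremented."""
    new = dict(stamp)
    new[node] = new.get(node, 0) + 1
    return new


def synchronize(a: dict, b: dict) -> dict:
    """Merge two stamps by taking the larger counter for every node (assumed rule)."""
    return {n: max(a.get(n, 0), b.get(n, 0)) for n in a.keys() | b.keys()}


stamp = {"blue": 43, "green": 54, "black": 12}
after_green_update = local_update(stamp, "green")
# after_green_update == {"blue": 43, "green": 55, "black": 12}
```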
 By using this scheme you can tell if one version stamp is newer than another because the newer
stamp will have all its counters greater than or equal to those in the older stamp. So [blue: 1, green:
2, black: 5] is newer than [blue: 1, green: 1, black: 5] since one of its counters is greater. If both
stamps have a counter greater than the other, e.g. [blue: 1, green: 2, black: 5] and [blue: 2, green:
1, black: 5], then you have a write-write conflict.
 There may be missing values in the vector, in which case we treat the missing value as 0.
So [blue: 6, black: 2] would be treated as [blue: 6, green: 0, black: 2].
 This allows you to easily add new nodes without invalidating the existing vector stamps.
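The comparison rule, including the treatment of missing counters as 0, can be written as a short function. Representing a stamp as a dictionary and the name `compare_stamps` are choices made for this sketch.

```python
def compare_stamps(a: dict, b: dict) -> str:
    """Compare two vector stamps, treating a missing node counter as 0."""
    nodes = a.keys() | b.keys()
    a_ahead = any(a.get(n, 0) > b.get(n, 0) for n in nodes)
    b_ahead = any(b.get(n, 0) > a.get(n, 0) for n in nodes)
    if a_ahead and b_ahead:
        return "conflict"  # each stamp has a counter the other lacks: write-write conflict
    if a_ahead:
        return "a is newer"
    if b_ahead:
        return "b is newer"
    return "equal"


newer = compare_stamps({"blue": 1, "green": 2, "black": 5},
                       {"blue": 1, "green": 1, "black": 5})
clash = compare_stamps({"blue": 1, "green": 2, "black": 5},
                       {"blue": 2, "green": 1, "black": 5})
```

With the examples from the text, `newer` is `"a is newer"` and `clash` is `"conflict"`.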
 Vector stamps are a valuable tool that spots inconsistencies, but doesn’t resolve them. Any
conflict resolution will depend on the domain you are working in.
 This is part of the consistency/latency tradeoff. You either have to live with the fact that network
partitions may make your system unavailable, or you have to detect and deal with
inconsistencies.

Key Points
• Version stamps help you detect concurrency conflicts. When you read data, then update it, you can
check the version stamp to ensure nobody updated the data between your read and write.
• Version stamps can be implemented using counters, GUIDs, content hashes, timestamps, or a
combination of these.
• With distributed systems, a vector of version stamps allows you to detect when different nodes
have conflicting updates.
