0% found this document useful (0 votes)

11 views49 pages

Lecture 3 - Principles of NoSQL Databases

The document outlines the principles of NoSQL databases, focusing on data models, distribution, and consistency. It contrasts relational database management systems (RDBMS) with NoSQL, highlighting the flexibility and scalability of NoSQL databases, as well as their aggregate-oriented data models. Additionally, it discusses various distribution models, including sharding and replication strategies, and the implications of the CAP theorem on consistency and availability in distributed systems.

Uploaded by

Yasmine Elqorashy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views49 pages

Lecture 3 - Principles of NoSQL Databases

Uploaded by

Yasmine Elqorashy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 49

Principles of NoSQL Databases

Data Model, Distribution & Consistency

Lecture 3 of NoSQL Databases (PA195)

David Novak, FI, Masaryk University, Brno
http://disa.fi.muni.cz/david-novak/teaching/nosql-databases-2018/
Agenda
● Fundamentals of RDBMs and NoSQL Databases
● Data Model of Aggregates
● Models of Data Distribution
○ scalability, sharding
○ replication: master-slave, peer-to-peer
○ combination
● Consistency
○ write-write vs. read-write conflict
○ strategies and techniques
○ relaxing consistency
Agenda
● Fundamentals of RDBMs and NoSQL Databases
● Data Model of Aggregates
● Models of Data Distribution
○ scalability, sharding
○ replication: master-slave, peer-to-peer
○ combination
● Consistency
○ write-write vs. read-write conflict
○ strategies and techniques
○ relaxing consistency
Fundamentals of RDBMS
Relational Database Management Systems (RDMBS)
1. Data structures are broken into the smallest units
○ normalization of database schema (3NF, BCNF)
● because the data structure is known in advance
● and users/applications query the data in different ways

○ database schema is rigid

2. Queries merge the data from different tables

3. Write operations are simple, search can be slower
4. Strong guarantees for transactional processing
From RDBMS to NoSQL
Efficient implementations of table joins and of
transactional processing require centralized system.
NoSQL Databases:
● Database schema tailored for specific application
○ keep together data pieces that are often accessed together
● Write operations might be slower but read is fast
● Weaker consistency guarantees
=> efficiency and horizontal scalability
Data Model
● The model by which the database organizes data
● Each NoSQL DB type has a different data model
○ Key-value, document, column-family, graph
○ The first three are oriented on aggregates

● Let us have a look at the classic relational model

Example (1): UML Model

source: Holubová, Kosek, Minařík, Novák. Big Data a NoSQL databáze. 2015.
Example (2): Relational Model

source: Holubová, Kosek, Minařík, Novák. Big Data a NoSQL databáze. 2015.
Agenda
● Fundamentals of RDBMs and NoSQL Databases
● Data Model of Aggregates
● Models of Data Distribution
○ scalability, sharding
○ replication: master-slave, peer-to-peer
○ combination
● Consistency
○ write-write vs. read-write conflict
○ strategies and techniques
○ relaxing consistency
Aggregates
An aggregate
● A data unit with a complex structure
○ Not simply a tuple (a table row) like in RDBMS
● A collection of related objects treated as a unit
○ unit for data manipulation and management of consistency

● Relational model is aggregate-ignorant

○ It is not a bad thing, it is a feature
○ Allows to easily look at the data in different ways
○ Best choice when there is no primary structure for data
manipulation
Example (3): Aggregates

source: Holubová, Kosek, Minařík, Novák. Big Data a NoSQL databáze. 2015.
Example (4): Aggregates
// collection "Customer" // collection "Order"
{ {
"customerID": 1, "orderNumber": 11,
"name": "Jan Novák", "date": "2015-04-01",
"address": { "customerID": 1,
"city": "Praha", "orderItems": [
"street": "Krásná 5", {
"ZIP": "111 00" "productID": 111,
} "name": "Vysavač ETA E1490",
} "quantity": 1,
// collection "Invoice" "price": 1300
{ },
"invoiceID": 2015003, {
"orderNumber": 11, "productID": 112,
"bankAccount": "64640439/0100", "name": "Sáček k ETA E1490",
"paymentDate": "2015-04-16", "quantity": 10,
"address": { "price": 300
"city": "Brno", }
"street": "Slunečná 7", ]
"ZIP": "602 00" }
}
NoSQL Databases: Aggregate-oriented
Many NoSQL stores are aggregate-oriented:
○ There is no general strategy to set aggregate boundaries
○ Aggregates give the database information about which bits
of data will be manipulated together
■ What should be stored on the same node

○ Minimize the number of nodes accessed during a search

○ Impact on concurrency control:
■ NoSQL databases typically support atomic manipulation of a single
aggregate at a time
Agenda
● Fundamentals of RDBMs and NoSQL Databases
● Data Model of Aggregates
● Models of Data Distribution
○ scalability, sharding
○ replication: master-slave, peer-to-peer
○ combination
● Consistency
○ write-write vs. read-write conflict
○ strategies and techniques
○ relaxing consistency
Scalability of Database Systems
● Scalability = handling growing amounts of data
and queries without losing performance

Two general approaches:

● vertical scalability
● horizontal scalability
Vertical Scalability (Scaling up)
● Involve larger and more powerful machines
○ large disk storage using disk arrays
○ massively parallel architectures
○ large main memories

● Traditional choice
○ in favour of strong consistency
○ very simple to realize (no handling of data distribution)

● Works in many cases but…

Vertical Scalability: Drawbacks
● Higher costs
○ Large machines cost more than equivalent commodity HW
● Data growth limit
○ Large machine works well until the data grows to fill it
○ Even the largest of machines has a limit
● Proactive provisioning
○ In the beginning, no idea of the final scale of the application
○ An upfront budget is needed when scaling vertically
● Vendor lock-in
○ Large machines are produced by a few vendors
○ Customer is dependent on a single vendor (proprietary HW)
Horizontal Scalability (Scaling out)
System is distributed across multiple machines/nodes
● Commodity machines, cost effective
● Provides higher scalability than vertical approach
○ Data is partitioned over many disks
○ Application can use main memory of all machines
○ Distribution computational model

● Introduces new problems:

○ synchronization, consistency, partial failures handling, etc.
Horizontal Scalability: Fallacies
● Typical false assumptions of distributed computing:
○ The network is reliable
○ Latency is zero
○ Bandwidth is infinite
○ The network is secure
○ The network is homogeneous
○ Topology of the network does not change
○ There is one network administrator

source: https://blogs.oracle.com/jag/resource/Fallacies.html
Distribution Models: Overview
● Horizontal scalability = scaling out
● Two generic ways of data distribution:
○ Replication – the same data is copied over multiple nodes
■ Master-slave or peer-to-peer
○ Sharding – different data chunks are put on different nodes
(data partitioning)

● We can use either or combine them

○ Distribution models = specific ways to do sharding,
replication or combination of both
Distribution Model: Single Server
● Running the database on a single machine is
always the prefered scenario
○ it spares us a lot of problems

● It can make sense to use a NoSQL database on a

single server
○ Other advantages remain: Flexible data model, simplicity
○ Graph databases: If the graph is “almost” complete, it is
difficult to distribute it
Sharding (Data Partitioning)

● Placing different parts

of the data (card suits)
onto different servers

● Applicability: Different
clients access different
parts of the dataset

source: Sadalage & Fowler: NoSQL Distilled, 2012

Distribution Models: Sharding (2)
We should try to ensure that
1. Data accessed together is kept together
○ So that user gets all data from a single server
○ Aggregates data model helps to achieve this
2. Arrange the data on the nodes:
○ Keep the load balanced (can change in time)
○ Consider the physical location (of the data centers)
● Many NoSQL databases offer auto-sharding
● A node failure makes shard’s data unavailable
○ Sharding is often combined with replication
Master-slave Replication
● We replicate data across
multiple nodes

● One node is designated as

primary (master), others as
secondary (slaves)

● Master is responsible for

processing all updates to
the data

● Reads from any node source: Sadalage & Fowler: NoSQL Distilled, 2012
Master-slave Replication (2)
● For scaling a read-intensive application
○ More read requests → more slave nodes
○ The master fails → the slaves can still handle read requests
○ A slave can become a new master quickly (it is a replica)

● Limited by ability of the master to process updates

● Masters are selected manually or automatically

○ User-defined vs. cluster-elected
Peer-to-peer Replication

● No master, all the

replicas are equal

● Every node can handle

a write and then
spreads the update
to the others

source: Sadalage & Fowler: NoSQL Distilled, 2012

Sharding & Replication (1)
● Sharding and master-slave replication:
○ Each data shard is replicated (via a single master)
○ A node can be a master for some data and a slave for other

source: Sadalage & Fowler: NoSQL Distilled, 2012

Sharding & Replication (2)
● Sharding and peer-to-peer replication:
○ A common strategy for column-family databases
○ A typical default is replication factor of 3
■ each shard is present on three nodes

=> we have to solve

consistency issues

(let’s first talk more about

what consistency means)
source: Sadalage & Fowler: NoSQL Distilled, 2012
Agenda
● Fundamentals of RDBMs and NoSQL Databases
● Data Model of Aggregates
● Models of Data Distribution
○ scalability, sharding
○ replication: master-slave, peer-to-peer
○ combination
● Consistency
○ write-write vs. read-write conflict
○ strategies and techniques
○ relaxing consistency
Consistency in Databases
● “Consistency is the lack of contradiction in the DB”
● Centralized RDBMS ensure strong consistency
Write (Update) Consistency
● Problem: two users want Write(K, A)
Write(K, B)
to update the same record
(write-write conflict) DB

○ Issues: lost update, second update is based on stale data

Read Consistency
1. Write(K, A)

● Problem: one user reads 2. Read(K)

3. Read(K’)
in the middle of other 4. Write(K’, B)

user’s writes DB

(read-write conflict, inconsistent read)

○ this leads to logical inconsistency
Consistency in Distributed NoSQL
● Distributed NoSQL databases typically relax
consistency (and/or durability)
○ CAP theorem
○ tradeoff between consistency and availability
○ Strong consistency → eventual consistency
○ BASE (basically available, soft state, eventual consistency)
CAP Theorem

CAP = Consistency, Availability, Partition Tolerance

Consistency
● After an update, all readers in a distributed system
(assuming replication) see the same data

● Example:
○ A single database instance is always consistent
○ If the replication factor > 1, the system must handle the
writes and/or reads in a special way
CAP Theorem (2)
Availability
● If a node (server) is working, it can read and write data
○ Every request must result in a response

Partition Tolerance
● System continues to operate, even if two sets of servers
get isolated
○ A connection failure should not shut the system down

It would be great to have all these three CAP properties!

CAP Theorem: Formulation
● CAP Theorem: A “shared-data” system cannot
have all three CAP properties
○ Or: only two of the three CAP properties are possible
■ This is the common version of the theorem

● First formulated in 2000: prof. Eric Brewer

○ PODC Conference Keynote speech
■ www.cs.berkeley.edu/~brewer/cs262b-2004/PODC-keynote.pdf

● Proven in 2002: Seth Gilbert & Nancy Lynch

○ SIGACT News 33(2) http://dl.acm.org/citation.cfm?id=564601
CAP Theorem: Real Application
● A single-server system is
always CA
○ As well as all ACID systems

● A distributed system practically

has to be tolerant of network Partitions (P)
○ because it is difficult to detect all network failures

● So, tradeoff between Consistency and Availability

○ in fact, it is not a binary decision
Now Let’s Solve Read and Write
Consistency Given The CAP Theorem.
Write (Update) Strong Consistency in
NoSQL
Example: two users, two Write(key, A)

Write(key, B)
Nodes have replicated data, two write attempts
node 1 node 2
● Strong consistency: agreement
○ Before the write is committed,
both nodes have to agree on the order of the writes

● If the nodes are partitioned, Write(key, A) Write(key, B)

(waiting 4ever) (waiting 4ever)
we are losing Availability
○ (but reads are still available)
node 1 node 2
Write (Update) Consistency in NoSQL When Data is
Replicated : Master/Slave

Write(key, A)

● Adding some availability: Write(key, B)

○ Master-slave replication master slave

Write(key, B)

● In case of partitioning,
master can commit write Write(key, A)
(OK)
○ Losing some Consistency: Write(key, B)
(waiting 4ever)
Data on slave will be stale
for read master slave
Write (Update) Consistency in NoSQL When Data is
Replicated Peer to Peer
Write(key, A)
● Choosing Availability: Write(key, B)
○ Peer-to-peer replication
○ Eventual consistency peer 1 peer 2

● In case of Partitioning
○ All requests are answered (full Write(key, A)
Availability) Write(key, B)
○ We risk losing consistency
guarantees completely
peer 1 peer 2

● But we can do something in

the middle: Quorums
Read Consistency in NoSQL
● Consistency among replicas read(K)
read(K)
○ Ensuring that the same data item
has the same value when reading node 1 node 2

from different replicas

● Read-your-writes (session consistency)
○ Is violated if one user writes and reads on different replicas
○ Solution: sticky session (session affinity)
● After some time, the write propagates everywhere
○ Eventual consistency, in the meanwhile: stale data
○ Various levels of consistency (e.g. Quorums)
Quorums
● Peer-to-peer replication with replication factor N
○ Number of replicas of each data object
● Write quorum: W
○ When writing, at least W replicas have to agree
○ Having W > N/2 results in write consistency
■ In case of two simultaneous writes, only one can get the majority and
thus , a conflict can happen and can be detected and then must be
resolved ( Later in the course)
Example: Write(key, A) Write(key, B)
● Replication factor N = 3
● Write quorum: W = 2 peer 1 peer 2
(W > N/2)
peer 3
Quorums (2)
● Read quorum: R
○ Number of peers contacted for a single read
■ Assuming that each value has a time stamp (time of write) to tell the
older value from the newer
○ For a strong read consistency: R + W > N
■ reader surely does not read stale data

Example:
● Read quorum: R = 2 Write(key, A) Write(key, B)

(R + W > N)
peer 1 peer 2
● 2 nodes contacted for read
Read(key)
=> the newest data returned peer 3
Relaxing Durability
Durability:
● When Write is committed, the change is permanent
● In some cases, strict durability is not essential and it
can be traded for scalability (write performance)
○ e.g., storing session data, collection sensor data

A simple way to relax durability:

● Store data in memory and flush to disk regularly
○ if the system shuts down, we loose updates in memory
Relaxing Durability II
● Replication durability (of a write operation)
○ The writing node can either
1. acknowledge (answer) the write operation immediately
● not wait until spread to other replicas
● if the writing node crashes before spreading, durability fails
● write-behind (write-back)
2. or it can first spread the update to other replicas
● operation is answered only after acknowledgement from the others
● write-through
○ both variants are possible for P2P repl., master-slave
replication, quora...
Summary of the Lesson
● Aggregate-oriented data modelling
● Sharding vs. replication
○ Master-slave vs. peer-to-peer replication
■ Combination of sharding & replication
● Database consistency:
○ Write/Read consistency (write-write & write-read conflict)
■ Replication consistency (also, read-your-own-writes)
● Relaxing consistency:
○ CAP (Consistency, Availability, Tolerance to Partitions),
■ Eventual consistency
○ Quoras (write/read quorum)
■ can ensure strong replication consistency; wide range of settings
Conclusions
● There is a wide range of options influencing
○ Scalability
■ of data storage, of read operations, of update (write) requests
○ Availability
■ How the system behaves in case of HW (e.g. network) failure
○ Consistency
■ Consistency has many facets and it depends how important they are
○ Durability
■ Can I rely on confirmed updates (and is it so important)?
○ Fault-tolerance
■ Do I have copies of data to recover after a complete HW fail?
● It’s good to know the options and choose wisely
References
● I. Holubová, J. Kosek, K. Minařík, D. Novák. Big Data a
NoSQL databáze. Praha: Grada Publishing, 2015. 288 p.
● Sadalage, P. J., & Fowler, M. (2012). NoSQL Distilled: A
Brief Guide to the Emerging World of Polyglot
Persistence. Addison-Wesley Professional, 192 p.
● RNDr. Irena Holubova, Ph.D. MMF UK course NDBI040:
Big Data Management and NoSQL Databases
● Eric Brewer: Towards Robust Distributed Systems.
www.cs.berkeley.edu/~brewer/cs262b-2004/PODC-keynote.pdf

Answer EGE 11 - Living in The IT Era Midterm
No ratings yet
Answer EGE 11 - Living in The IT Era Midterm
9 pages
Big Data - No SQL Databases and Related Concepts
100% (1)
Big Data - No SQL Databases and Related Concepts
101 pages
NoSQL Databases
No ratings yet
NoSQL Databases
20 pages
Big Data Management and Nosql Databases: Doc. Rndr. Irena Holubova, PH.D
No ratings yet
Big Data Management and Nosql Databases: Doc. Rndr. Irena Holubova, PH.D
27 pages
Big Data Management Basic Principles
No ratings yet
Big Data Management Basic Principles
55 pages
NoSQL - Unit2
No ratings yet
NoSQL - Unit2
8 pages
Module 2
No ratings yet
Module 2
40 pages
NoSql 2024 Assign2
No ratings yet
NoSql 2024 Assign2
189 pages
2 NoSQL Databases Principles
No ratings yet
2 NoSQL Databases Principles
58 pages
Unit 6
No ratings yet
Unit 6
143 pages
NoSQL M1
No ratings yet
NoSQL M1
48 pages
III Sharding Strategies
No ratings yet
III Sharding Strategies
30 pages
Nosql What Does It Mean
No ratings yet
Nosql What Does It Mean
15 pages
Module 2 Nosql
No ratings yet
Module 2 Nosql
31 pages
Module 1
No ratings yet
Module 1
69 pages
NoSQL Module 2
No ratings yet
NoSQL Module 2
76 pages
Introduction To NoSQL
No ratings yet
Introduction To NoSQL
29 pages
0zI2XrFJX5tR CjuECI f5HwGdQkpL8DAkTmwDPyFm3H0eCERMEvG9fH
No ratings yet
0zI2XrFJX5tR CjuECI f5HwGdQkpL8DAkTmwDPyFm3H0eCERMEvG9fH
13 pages
Nosql
No ratings yet
Nosql
20 pages
BIG Data 2
No ratings yet
BIG Data 2
18 pages
Introduction To NoSQL
No ratings yet
Introduction To NoSQL
43 pages
Distribution Model
100% (1)
Distribution Model
24 pages
Lecture 6 - NoSQL
No ratings yet
Lecture 6 - NoSQL
28 pages
Nosql What Does It Mean
No ratings yet
Nosql What Does It Mean
8 pages
Unit 4: Big Data Tehnology Landscape Two Inportant Technologies
No ratings yet
Unit 4: Big Data Tehnology Landscape Two Inportant Technologies
42 pages
Unit 1
No ratings yet
Unit 1
23 pages
Cs 620 / Dasc 600 Introduction To Data Science & Analytics: Lecture 6-Nosql
No ratings yet
Cs 620 / Dasc 600 Introduction To Data Science & Analytics: Lecture 6-Nosql
31 pages
Unit Ii - Nosql Databases
No ratings yet
Unit Ii - Nosql Databases
112 pages
Nosql Databases
No ratings yet
Nosql Databases
379 pages
NOSQL Lecture 1 Notes
No ratings yet
NOSQL Lecture 1 Notes
31 pages
Unit No 1
No ratings yet
Unit No 1
34 pages
CS3492-DBMS Unit-5
No ratings yet
CS3492-DBMS Unit-5
9 pages
Lecture 1 - NoSQL
No ratings yet
Lecture 1 - NoSQL
31 pages
Bda Ia2 Bda
No ratings yet
Bda Ia2 Bda
7 pages
NoSQL DBs
No ratings yet
NoSQL DBs
46 pages
Nosql Tricks
No ratings yet
Nosql Tricks
34 pages
Mathina BDA
No ratings yet
Mathina BDA
11 pages
Lecture 1
No ratings yet
Lecture 1
31 pages
No SQL
No ratings yet
No SQL
12 pages
Unit-3 BDA
No ratings yet
Unit-3 BDA
21 pages
Module 2
No ratings yet
Module 2
36 pages
Module 2
No ratings yet
Module 2
100 pages
NoSQL D
No ratings yet
NoSQL D
26 pages
NoSQL
No ratings yet
NoSQL
18 pages
Nosql Module 1
No ratings yet
Nosql Module 1
23 pages
Unit 2
No ratings yet
Unit 2
41 pages
Module 2
No ratings yet
Module 2
104 pages
Unit 2 (Big Data Analytics)
No ratings yet
Unit 2 (Big Data Analytics)
11 pages
Unit 5 NOSQL
No ratings yet
Unit 5 NOSQL
102 pages
No SQL Ia-01 - Micro
No ratings yet
No SQL Ia-01 - Micro
6 pages
Dbms Presentation
No ratings yet
Dbms Presentation
22 pages
BDT Assignment
No ratings yet
BDT Assignment
4 pages
Introduction To NoSQL
No ratings yet
Introduction To NoSQL
13 pages
No SQL
No ratings yet
No SQL
109 pages
Nosql Databases: P.Krishna Reddy Iiit Hyderabad
No ratings yet
Nosql Databases: P.Krishna Reddy Iiit Hyderabad
30 pages
CC - Lecture 6-Data
No ratings yet
CC - Lecture 6-Data
44 pages
NoSQL Databases UNIT-2
No ratings yet
NoSQL Databases UNIT-2
29 pages
Introduction To: Nosql
No ratings yet
Introduction To: Nosql
27 pages
Introduction To Big Data and NoSQL
No ratings yet
Introduction To Big Data and NoSQL
52 pages
Massively Parallel Cloud Data Storage Systems: S. Sudarshan IIT Bombay
No ratings yet
Massively Parallel Cloud Data Storage Systems: S. Sudarshan IIT Bombay
17 pages
DBA's Guide to NoSQL
From Everand
DBA's Guide to NoSQL
The Enlightened DBA
5/5 (1)
FSWD-SEM-5 Removed (1) Removed
No ratings yet
FSWD-SEM-5 Removed (1) Removed
38 pages
Final Report of 360 Degree Rotating Car
No ratings yet
Final Report of 360 Degree Rotating Car
42 pages
EASE 5 Installation Instructions - November 24, 2023 - v2.1
No ratings yet
EASE 5 Installation Instructions - November 24, 2023 - v2.1
5 pages
4 Workover and Potential Hazards
100% (1)
4 Workover and Potential Hazards
24 pages
NEGATIVE PHASE SEQUENCE RELAY - Marine Inbox
No ratings yet
NEGATIVE PHASE SEQUENCE RELAY - Marine Inbox
3 pages
2..1-Half Adder and Full Adder
No ratings yet
2..1-Half Adder and Full Adder
12 pages
Course Outlines - Computer Architecture - Spring 2025
No ratings yet
Course Outlines - Computer Architecture - Spring 2025
5 pages
Unit 1-Overview of Language Processors and Translators
No ratings yet
Unit 1-Overview of Language Processors and Translators
71 pages
Office Administration SBA
25% (4)
Office Administration SBA
13 pages
Hfe Harman Kardon Avr65 en
No ratings yet
Hfe Harman Kardon Avr65 en
51 pages
ALV Using OOP
No ratings yet
ALV Using OOP
8 pages
Hasil Malwarebytes
No ratings yet
Hasil Malwarebytes
5 pages
Ficha de Evaluación Diagnóstica Del Área de Inglés
No ratings yet
Ficha de Evaluación Diagnóstica Del Área de Inglés
16 pages
Ibm Internet of Things
100% (1)
Ibm Internet of Things
147 pages
8200.47 Transponder Landing System
No ratings yet
8200.47 Transponder Landing System
28 pages
Panasonic Malaysia Transformation Industry 4.0
0% (1)
Panasonic Malaysia Transformation Industry 4.0
23 pages
BC Cti Whitepaper
No ratings yet
BC Cti Whitepaper
13 pages
Mathematical Driving Model of Three Phase Induction Motors in Stationary Coordinate Frame
No ratings yet
Mathematical Driving Model of Three Phase Induction Motors in Stationary Coordinate Frame
11 pages
Value Stream Mapping Case Study
No ratings yet
Value Stream Mapping Case Study
5 pages
E-Learning in The Classroom?
No ratings yet
E-Learning in The Classroom?
7 pages
WP8010 Communicators Program
No ratings yet
WP8010 Communicators Program
3 pages
Descarga Manual Del Electrobistury Mb160
No ratings yet
Descarga Manual Del Electrobistury Mb160
2 pages
Thiết kế hệ thống nhúng.
No ratings yet
Thiết kế hệ thống nhúng.
7 pages
VM1 Connects To A Virtual Network Named VNET2 by Using A Network Interface Named NIC1
No ratings yet
VM1 Connects To A Virtual Network Named VNET2 by Using A Network Interface Named NIC1
7 pages
C28 X 1 Day Workshop
No ratings yet
C28 X 1 Day Workshop
84 pages
Is ChatGPT Making Us Dumb
No ratings yet
Is ChatGPT Making Us Dumb
4 pages
Syed Furqan Rafique CV - PHD
No ratings yet
Syed Furqan Rafique CV - PHD
2 pages
Mca 23 24
No ratings yet
Mca 23 24
52 pages
ATIs Q45C4 4 Electrode Conductivity Monitor Support Drawings
No ratings yet
ATIs Q45C4 4 Electrode Conductivity Monitor Support Drawings
11 pages

Lecture 3 - Principles of NoSQL Databases

Uploaded by

Lecture 3 - Principles of NoSQL Databases

Uploaded by

Principles of NoSQL Databases

Data Model, Distribution & Consistency

Lecture 3 of NoSQL Databases (PA195)

○ database schema is rigid

2. Queries merge the data from different tables

● Let us have a look at the classic relational model

● Relational model is aggregate-ignorant

○ Minimize the number of nodes accessed during a search

Two general approaches:

● Works in many cases but…

● Introduces new problems:

● We can use either or combine them

● It can make sense to use a NoSQL database on a

● Placing different parts

source: Sadalage & Fowler: NoSQL Distilled, 2012

● One node is designated as

● Master is responsible for

● Limited by ability of the master to process updates

● Masters are selected manually or automatically

● No master, all the

● Every node can handle

source: Sadalage & Fowler: NoSQL Distilled, 2012

source: Sadalage & Fowler: NoSQL Distilled, 2012

=> we have to solve

(let’s first talk more about

○ Issues: lost update, second update is based on stale data

● Problem: one user reads 2. Read(K)

(read-write conflict, inconsistent read)

CAP = Consistency, Availability, Partition Tolerance

It would be great to have all these three CAP properties!

● First formulated in 2000: prof. Eric Brewer

● Proven in 2002: Seth Gilbert & Nancy Lynch

● A distributed system practically

● So, tradeoff between Consistency and Availability

● If the nodes are partitioned, Write(key, A) Write(key, B)

● Adding some availability: Write(key, B)

○ Master-slave replication master slave

● But we can do something in

from different replicas

A simple way to relax durability:

You might also like