0% found this document useful (0 votes)

55 views7 pages

Newsql Databases: Corso Di Sistemi E Architetture Per Big Data

The document discusses NewSQL databases, which aim to provide the scalability of NoSQL systems while maintaining the ACID guarantees of traditional databases. It describes Google's Spanner and VoltDB as examples of NewSQL databases, and notes key aspects of how they implement replication and concurrency control to achieve these goals. The document also provides background on the motivations for building NewSQL databases and an overview of features like Spanner's use of TrueTime for synchronization.

Uploaded by

Nhat Nguyen

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

55 views7 pages

Newsql Databases: Corso Di Sistemi E Architetture Per Big Data

Uploaded by

Nhat Nguyen

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Università degli Studi di Roma Tor Vergata

Dipartimento di Ingegneria Civile e Ingegneria Informatica

NewSQL Databases

Corso di Sistemi e Architetture per Big Data

A.A. 2016/17

Valeria Cardellini

The reference Big Data stack

High-level Interfaces
Support / Integration

Data Processing

Data Storage

Resource Management

Valeria Cardellini - SABD 2016/17 1

Relational database services

•  RDBMS pros:
–  ACID transactions
–  Relational schemas (and schema changes without
downtime)
–  SQL queries
–  Strong consistency

•  RDBMS cons:
–  Lack of horizontal scalability, to hundreds or
thousands of servers

Valeria Cardellini - SABD 2016/17 2

NewSQL databases

•  How to build a relational database service that is both

strongly consistent and horizontally scalable?

•  NewSQL: a class of modern RDBMSs that seek to

provide the same scalable performance of NoSQL
systems for OLTP read-write workloads while
maintaining ACID guarantees of traditional DB
systems
–  Support SQL

Valeria Cardellini - SABD 2016/17 3

NewSQL examples

•  Google’s Spanner
–  Also available as cloud service in Google Cloud Platform:
Cloud Spanner https://cloud.google.com/spanner/
•  Google’s F1
–  Built on top of Spanner
•  VoltDB
-  And H-Store, its research prototype predecessor
http://hstore.cs.brown.edu
-  Developed by M. Stonebraker (2015 ACM Turing award)
•  Clustrix
–  Closed source
•  NuoDB
–  Closed source, no support for stored procedures

Valeria Cardellini - SABD 2016/17 4

Replication in NewSQL
•  Multi-master or masterless schemes
–  Any node can receive update statements

•  VoltDB and Clustrix

–  A transaction/session manager receives the updates, which
are forwarded to all replicas and executed in parallel
•  Google Spanner
–  Uses Paxos state machine replication to guarantee that a
sequence of commands will be executed in the same order
in all the replica nodes

Valeria Cardellini - SABD 2016/17 5

Spanner

•  Motivations:
–  “We provide a database instead of a key-value
store to make it easier for programmers to write
their applications”
–  “We consistently received complaints from users
that Bitable can be difficult to use for some kinds
of applications”

Valeria Cardellini - SABD 2016/17 6

What is Spanner

•  Wide-area distributed multiversion database

-  General-purpose transactions (ACID)
-  SQL query language
-  Schematized tables
-  Semi-relational data model

•  Running in production
-  Storage for Google’s ad data
-  Replaced a sharded MySQL database

Valeria Cardellini - SABD 2016/17 7

Spanner overview

•  Feature: lock-free distributed read transactions

•  Property: external consistency of distributed
transactions
–  First system at global scale
•  Implementation: integration of concurrency control,
replication, and 2PC
–  Correctness and performance
•  Enabling technology: TrueTime
–  Interval-based global time
–  Based on hardware-assisted time synchronization using
GPS clocks and atomic clocks
–  Accuracy: ~ 1 ms!

Valeria Cardellini - SABD 2016/17 8

Concurrency control in Spanner

•  Hybrid approach
–  Read-write transactions are implemented through read-write
locks, but read-only transactions are lock-free

•  Why is it possible?
–  Spanner stores multiple versions of data, and a read
transaction is basically a read at a “safe” timestamp

Valeria Cardellini - SABD 2016/17 9

VoltDB
•  In-memory database
•  Starting point
–  Open source RDBMS ran on memory-based file system
•  Over 80% of time spent on page buffer management, index
management, and concurrency management
•  Only 12% of time spent doing the real work
–  Lead to H-Store
•  Features
–  Horizontal scale-out on commodity hardware with linear
scalability
–  Full and strong ACID compliance
–  High concurrency
–  Reliable disk persistence
–  High availability

Valeria Cardellini - SABD 2016/17 10

VoltDB

•  Tables are partitioned over multiple servers,

and clients can call any server
–  Transparent distribution but the user can choose
the sharding attribute
•  Selected tables can be replicated over
servers, e.g. for fast access to read-mostly
data
•  Shards are replicated, so that data can be
recovered in the event of a node crash
•  Database snapshots are also supported,
continuous or scheduled
Valeria Cardellini - SABD 2016/17 11
VoltDB and concurrency control

•  Alternative design based on two assumptions

–  Assumption 1: total available memory is large
enough to store the entire data store
–  Assumption 2: all user transactions are short-lived
and can be very efficiently executed over in-
memory data
•  Then, all transactions are executed
sequentially in a single-threaded, lock-free
environment

Valeria Cardellini - SABD 2016/17 12

References

•  Golinger at al., “Data management in cloud

environments: NoSQL and NewSQL data stores”, J.
Cloud Comp., 2013. http://bit.ly/2oRKA5R
•  Corbett et al., “Spanner: Google’s Globally
Distributed Database”, OSDI 2012.
http://bit.ly/2nyJBrb
•  Stonebraker and Weisberg, “The VoltDB Main
Memory DBMS”, 2013. http://bit.ly/2okw837

Valeria Cardellini - SABD 2016/17 13

1 - The Databases Revolutions
No ratings yet
1 - The Databases Revolutions
46 pages
Molitfelnic 2019 Compressed
92% (13)
Molitfelnic 2019 Compressed
927 pages
الإلحاد يهزم نفسه
100% (1)
الإلحاد يهزم نفسه
170 pages
Welcome To VoltDB Training
100% (1)
Welcome To VoltDB Training
102 pages
BIG Data 2
No ratings yet
BIG Data 2
18 pages
Unit 6
No ratings yet
Unit 6
143 pages
01 BigDataDesign
No ratings yet
01 BigDataDesign
38 pages
Benchmarking NewSQL Database VoltDB
No ratings yet
Benchmarking NewSQL Database VoltDB
49 pages
Big Data Storage and Processing
No ratings yet
Big Data Storage and Processing
49 pages
Aim: Program:: Implement The Data Link Layer Framing Methods Such As Character Count
100% (1)
Aim: Program:: Implement The Data Link Layer Framing Methods Such As Character Count
21 pages
777 1651399819 BD Module 5
No ratings yet
777 1651399819 BD Module 5
75 pages
(Davoudian Et Al., 2018) A Survey On NoSQL Stores
No ratings yet
(Davoudian Et Al., 2018) A Survey On NoSQL Stores
43 pages
BDT Unit 4
No ratings yet
BDT Unit 4
93 pages
Oracle Database No SQL-1
No ratings yet
Oracle Database No SQL-1
28 pages
NuoDB-20 White Paper
No ratings yet
NuoDB-20 White Paper
27 pages
Rdbms Important
No ratings yet
Rdbms Important
76 pages
The Role of Data Architecture in Nosql: What Advances Occurred in DBMSS?
No ratings yet
The Role of Data Architecture in Nosql: What Advances Occurred in DBMSS?
22 pages
Unit 5 - BD - Storing Data
No ratings yet
Unit 5 - BD - Storing Data
48 pages
DBMS Chapter 5
No ratings yet
DBMS Chapter 5
52 pages
A Developer Guide To Jakarta EE NoSQL Development With Mongodb and Morphia
No ratings yet
A Developer Guide To Jakarta EE NoSQL Development With Mongodb and Morphia
26 pages
Nosql Tricks
No ratings yet
Nosql Tricks
34 pages
Module 1 Nosql Notes
No ratings yet
Module 1 Nosql Notes
56 pages
CC - Lecture 6-Data
No ratings yet
CC - Lecture 6-Data
44 pages
Introduction To Nosql: Topics To Be Covered
No ratings yet
Introduction To Nosql: Topics To Be Covered
15 pages
WP SQL To Nosql Architectur Differences Considerations Migration 1+ (6) - 1641371845027
No ratings yet
WP SQL To Nosql Architectur Differences Considerations Migration 1+ (6) - 1641371845027
13 pages
Hybrid Database System For Big Data Storage and Management
No ratings yet
Hybrid Database System For Big Data Storage and Management
13 pages
Para Distr Nosql Notes
No ratings yet
Para Distr Nosql Notes
13 pages
HV White Paper Voltdb Technical Overview
No ratings yet
HV White Paper Voltdb Technical Overview
6 pages
Lecture NoSQL
No ratings yet
Lecture NoSQL
30 pages
Chapter 5c
No ratings yet
Chapter 5c
18 pages
NOSQL Lecture 1 Notes
No ratings yet
NOSQL Lecture 1 Notes
31 pages
Orient DB
No ratings yet
Orient DB
23 pages
Unit VI - 1
No ratings yet
Unit VI - 1
31 pages
First QSN
No ratings yet
First QSN
2 pages
CS501 Solved MCQs Final Term
No ratings yet
CS501 Solved MCQs Final Term
45 pages
20 - 04 - 2024 Cheatsheet
No ratings yet
20 - 04 - 2024 Cheatsheet
3 pages
Learning Guide 2.1 - CloudDatabase - NOSQL PDF
No ratings yet
Learning Guide 2.1 - CloudDatabase - NOSQL PDF
44 pages
2 - NoSQL
No ratings yet
2 - NoSQL
32 pages
Lecture 1
No ratings yet
Lecture 1
31 pages
Lecture 1 - NoSQL
No ratings yet
Lecture 1 - NoSQL
31 pages
Nosql Databases: P.Krishna Reddy Iiit Hyderabad
No ratings yet
Nosql Databases: P.Krishna Reddy Iiit Hyderabad
30 pages
DBS-C01-S02-B-03-Relational Databases
No ratings yet
DBS-C01-S02-B-03-Relational Databases
3 pages
Introduction To NoSQL
No ratings yet
Introduction To NoSQL
29 pages
noSQL V newSQL
No ratings yet
noSQL V newSQL
33 pages
A Survey of Post-Relational Data Management and NOSQL Movement
No ratings yet
A Survey of Post-Relational Data Management and NOSQL Movement
22 pages
Overview: High Performance Scalable Data Stores
No ratings yet
Overview: High Performance Scalable Data Stores
19 pages
No SQL Ia-01 - Micro
No ratings yet
No SQL Ia-01 - Micro
6 pages
Massively Parallel Cloud Data Storage Systems: S. Sudarshan IIT Bombay
No ratings yet
Massively Parallel Cloud Data Storage Systems: S. Sudarshan IIT Bombay
17 pages
Basis For Distributed Database Technology
No ratings yet
Basis For Distributed Database Technology
35 pages
Introduction To NoSQL
No ratings yet
Introduction To NoSQL
13 pages
Sayan Ghosh 26900123054 Cse Dbms 6th Sem
No ratings yet
Sayan Ghosh 26900123054 Cse Dbms 6th Sem
11 pages
Bda - 4 Unit
No ratings yet
Bda - 4 Unit
10 pages
Nosql Databases
No ratings yet
Nosql Databases
2 pages
2014 Ieee Computer Nosql
No ratings yet
2014 Ieee Computer Nosql
4 pages
Module 2 Notes
No ratings yet
Module 2 Notes
19 pages
04 Surveys Cattell PDF
No ratings yet
04 Surveys Cattell PDF
16 pages
Lec 6 - Big Data Storage Technologies II - NoSQL
No ratings yet
Lec 6 - Big Data Storage Technologies II - NoSQL
20 pages
NoSQL Intro
No ratings yet
NoSQL Intro
26 pages
VoltDB Decapitates Six SQL Urban Myths and Delivers Internet Scale OLTP in The Process
No ratings yet
VoltDB Decapitates Six SQL Urban Myths and Delivers Internet Scale OLTP in The Process
18 pages
Lecture 8 Chapter 5 Part 4 Big Data Storage Concepts
No ratings yet
Lecture 8 Chapter 5 Part 4 Big Data Storage Concepts
9 pages
CS8492 DBMS Unit 5
No ratings yet
CS8492 DBMS Unit 5
20 pages
Nosql Database
No ratings yet
Nosql Database
8 pages
Day1 Aruba Network Essentials Presentation Printed
100% (1)
Day1 Aruba Network Essentials Presentation Printed
58 pages
Hacking Websites Using SQLMAP - HackingLoops Tutorials - Learn Ethical Hacking Online - HackingLoops
100% (1)
Hacking Websites Using SQLMAP - HackingLoops Tutorials - Learn Ethical Hacking Online - HackingLoops
5 pages
NeetCode 150 - A List by Amoghmc - LeetCode
No ratings yet
NeetCode 150 - A List by Amoghmc - LeetCode
1 page
Simply Modbus
100% (1)
Simply Modbus
5 pages
M.C.a. (Sem - II) Paper - I - Data Structures
No ratings yet
M.C.a. (Sem - II) Paper - I - Data Structures
132 pages
The Art of War Evolved A Deep Dive Into Military Theory, From Ancient Wisdom To Modern Asymmetric Warfare (20,000 Words)
No ratings yet
The Art of War Evolved A Deep Dive Into Military Theory, From Ancient Wisdom To Modern Asymmetric Warfare (20,000 Words)
5 pages
THI2264 Student Guide Book2 v10 0x PDF
No ratings yet
THI2264 Student Guide Book2 v10 0x PDF
354 pages
The Definitive Guide To Email Marketing From Beginner To Pro
No ratings yet
The Definitive Guide To Email Marketing From Beginner To Pro
5 pages
Book Shop Management System
75% (4)
Book Shop Management System
22 pages
93c46 Datasheet
No ratings yet
93c46 Datasheet
12 pages
Redis Enterprise
No ratings yet
Redis Enterprise
35 pages
Hardware
No ratings yet
Hardware
5 pages
The Art and Business of Bakery
No ratings yet
The Art and Business of Bakery
3 pages
1
No ratings yet
1
2 pages
Step 1: Familiarize Yourself With Your Current Setup: 8 Steps Total
No ratings yet
Step 1: Familiarize Yourself With Your Current Setup: 8 Steps Total
9 pages
Marriage
No ratings yet
Marriage
2 pages
The Comprehensive Treatise On Signal Processing From Fundamentals To Cutting-Edge Applications
No ratings yet
The Comprehensive Treatise On Signal Processing From Fundamentals To Cutting-Edge Applications
6 pages
The Importance of Customer Service in Business Success
No ratings yet
The Importance of Customer Service in Business Success
2 pages
The Comprehensive Guide To Cryptocurrency
No ratings yet
The Comprehensive Guide To Cryptocurrency
4 pages
Introduction To Machine Learning Algorithms
No ratings yet
Introduction To Machine Learning Algorithms
3 pages
StreamServe Persuasion SP5 Document Broker Plus
No ratings yet
StreamServe Persuasion SP5 Document Broker Plus
30 pages
Analysis of Applied Natural Language Processing With Python - Implementing Machine Learning and Deep Learning Algorithms For Natural Language Processing (PDFDrive)
No ratings yet
Analysis of Applied Natural Language Processing With Python - Implementing Machine Learning and Deep Learning Algorithms For Natural Language Processing (PDFDrive)
2 pages
The History of Computers
No ratings yet
The History of Computers
5 pages
The Importance of A Positive Working Environment
No ratings yet
The Importance of A Positive Working Environment
3 pages
Ip Practical Index
No ratings yet
Ip Practical Index
6 pages
Artificial Intelligence
No ratings yet
Artificial Intelligence
2 pages
Bitcoin
No ratings yet
Bitcoin
3 pages
2
No ratings yet
2
2 pages
Software Expanded
No ratings yet
Software Expanded
2 pages
Sales Expanded
No ratings yet
Sales Expanded
2 pages
Branding Expanded
No ratings yet
Branding Expanded
2 pages
CRM Expanded
No ratings yet
CRM Expanded
2 pages
Sorting Algorithm
No ratings yet
Sorting Algorithm
2 pages
Unit 5. Hardware: What Are We Going To Learn in This Unit?
No ratings yet
Unit 5. Hardware: What Are We Going To Learn in This Unit?
23 pages
Cs Textbook Extended
No ratings yet
Cs Textbook Extended
7 pages
Lab 10 Access List
No ratings yet
Lab 10 Access List
4 pages
Analysis of Deep Learning - Adaptive Computation and Machine Learning (PDFDrive)
No ratings yet
Analysis of Deep Learning - Adaptive Computation and Machine Learning (PDFDrive)
2 pages
01.ORM Fundamentals Exercise MiniORM
No ratings yet
01.ORM Fundamentals Exercise MiniORM
21 pages
The Labyrinth of Underperformance Navigating The Complexities of Lacking Performance Measurement at Work
No ratings yet
The Labyrinth of Underperformance Navigating The Complexities of Lacking Performance Measurement at Work
5 pages
Us Elections Book
No ratings yet
Us Elections Book
5 pages
015.01 Ambler Agile Techniques For Object Databases September 2005
No ratings yet
015.01 Ambler Agile Techniques For Object Databases September 2005
8 pages
Cs Textbook Extended
No ratings yet
Cs Textbook Extended
4 pages
Java Collection Frame Work
No ratings yet
Java Collection Frame Work
10 pages
Unit 4 Material
No ratings yet
Unit 4 Material
14 pages
Srdy 00
No ratings yet
Srdy 00
61 pages
Analysis of Introduction To Machine Learning, Second Edition (Adaptive Computation and Machine Learning)
No ratings yet
Analysis of Introduction To Machine Learning, Second Edition (Adaptive Computation and Machine Learning)
3 pages
Intrusion Detection Systems Using Decision Tree Classifier: Dr. K.K.Shukla
No ratings yet
Intrusion Detection Systems Using Decision Tree Classifier: Dr. K.K.Shukla
23 pages
Using External Volumes Larger Than 4 TB - Cleaned
No ratings yet
Using External Volumes Larger Than 4 TB - Cleaned
11 pages
AI Expanded
No ratings yet
AI Expanded
2 pages
Salesforce Vs Freshwork
No ratings yet
Salesforce Vs Freshwork
2 pages
Baby
No ratings yet
Baby
1 page
Pavement Textures v1 Catalog Web
No ratings yet
Pavement Textures v1 Catalog Web
42 pages
Test Questions
No ratings yet
Test Questions
3 pages
Users Manual Fshview Version 7.1
No ratings yet
Users Manual Fshview Version 7.1
50 pages
Sorting: Note 6: Sorting Algorithms in Data Structure For Application
No ratings yet
Sorting: Note 6: Sorting Algorithms in Data Structure For Application
5 pages
SAPF190 How To Correctly Use The FI General Ledger Comparative Analysis Report SAPF190
No ratings yet
SAPF190 How To Correctly Use The FI General Ledger Comparative Analysis Report SAPF190
3 pages

Newsql Databases: Corso Di Sistemi E Architetture Per Big Data

Uploaded by

Newsql Databases: Corso Di Sistemi E Architetture Per Big Data

Uploaded by

Università degli Studi di Roma Tor Vergata

Dipartimento di Ingegneria Civile e Ingegneria Informatica

Corso di Sistemi e Architetture per Big Data

The reference Big Data stack

Valeria Cardellini - SABD 2016/17 1

Valeria Cardellini - SABD 2016/17 2

• How to build a relational database service that is both

• NewSQL: a class of modern RDBMSs that seek to

Valeria Cardellini - SABD 2016/17 3

Valeria Cardellini - SABD 2016/17 4

• VoltDB and Clustrix

Valeria Cardellini - SABD 2016/17 5

Valeria Cardellini - SABD 2016/17 6

• Wide-area distributed multiversion database

Valeria Cardellini - SABD 2016/17 7

• Feature: lock-free distributed read transactions

Valeria Cardellini - SABD 2016/17 8

Concurrency control in Spanner

Valeria Cardellini - SABD 2016/17 9

Valeria Cardellini - SABD 2016/17 10

• Tables are partitioned over multiple servers,

• Alternative design based on two assumptions

Valeria Cardellini - SABD 2016/17 12

• Golinger at al., “Data management in cloud

Valeria Cardellini - SABD 2016/17 13

You might also like

•  How to build a relational database service that is both

•  NewSQL: a class of modern RDBMSs that seek to

•  VoltDB and Clustrix

•  Wide-area distributed multiversion database

•  Feature: lock-free distributed read transactions

•  Tables are partitioned over multiple servers,

•  Alternative design based on two assumptions

•  Golinger at al., “Data management in cloud