26. Distributed DBMS and NoSQL

Distributed databases store data across multiple independent sites for increased availability and scalability. There are three main architectures for distributed databases: client-server, collaborating servers, and middleware. Data can be fragmented or replicated across sites. Distributed queries require coordination between sites, such as using semi-joins to reduce data transfer for join queries. Updates to distributed data must consider synchronous or asynchronous replication to balance consistency and performance. Distributed transactions require distributed concurrency control to coordinate locks across multiple database sites.


Distributed Databases and NOSQL
Introduction to Databases
CompSci 316 Spring 2017
Announcements (Mon., Apr. 24)
• Homework #4 due today (Monday, April 24, 11:55 pm)
• Project
• final report draft due today -- Monday, April 24, 11:59 pm
• code due on Wednesday -- April 26, 11:59 pm
• See all announcements about the project report and demo on Piazza
• Please bring a computer to class on Wednesday
• We will take a 5-10 minute break to fill out course evaluations, as
advised by the Office of Assessments
• Google Cloud code
• Please redeem your code asap, by May 11
• Final Exam
• May 2 (next Tuesday), 2-5 pm, in class
• Everything covered in the class (up to last lecture) is included
• If you need special accommodation, please email me
Announcements (Mon., Apr. 24)
• Final Project Report
• should be “comprehensive”
• if you had more material in MS1 or MS2, please include it
• but not the “work in progress” part

• The “overview of your application” part should include


• description of the user interface of your system – screenshots,
features, how the user will be able to interact with the system
• Sample execution (input and output)
• Your approach (algorithm, indexes, storage, optimization,
normalization)
• Any interesting observation
Where are we now?
We learnt:
✓ Relational Model, Query Languages, and Database Design
  ✓ SQL
  ✓ RA overview
  ✓ E/R diagram
  ✓ Normalization
✓ DBMS Internals
  ✓ Storage
  ✓ Indexing
  ✓ Query Evaluation
    ✓ External sort
    ✓ Join Algorithms
  ✓ Query Optimization
✓ Transactions
  ✓ Basic concepts
  ✓ Concurrency control
  ✓ Recovery
✓ XML
  ✓ DTD and XML schema
  ✓ XPath and XQuery
  ✓ Relational Mapping
✓ Advanced/Research topics
  ✓ Parallel DBMS
  ✓ MapReduce
  ✓ Data Mining
  ✓ Data Warehousing
• Today
  – Distributed DBMS
  – NOSQL
• Next lecture
  – Overview of other areas database researchers work on
  – Practice problems for the final
Parallel vs. Distributed DBMS

Parallel DBMS:
• Parallelization of various operations
  • e.g. loading data, building indexes, evaluating queries
• Data may or may not be distributed initially
• Distribution is governed by performance considerations

Distributed DBMS:
• Data is physically stored across different sites
  – each site is typically managed by an independent DBMS
• Location of data and autonomy of sites have an impact on query
optimization, concurrency control, and recovery
• Also governed by other factors:
  – increased availability in case of system crashes
  – local ownership and access
Distributed Databases

Architecture
Data Storage
Query Execution
Transactions
Two desired properties and recent trends
• Data is stored at several sites, each managed by a DBMS that can run
independently
1. Distributed Data Independence
• Users should not have to know where data is located
2. Distributed Transaction Atomicity
• Users should be able to write transactions accessing multiple sites just
like local transactions
• These two properties are generally desirable, but not always efficiently
achievable
  • e.g. when sites are connected by a slow long-distance network
• Sometimes they are not even desirable for globally distributed sites
  • too much administrative overhead in making the location of data
transparent (not visible to the user)
• Therefore they are not always supported
  • users then have to be aware of where data is located
Distributed DBMS Architecture
• Three alternative approaches
1. Client-Server
• One or more client processes (e.g. on personal computers) and one or
more server processes (e.g. on a mainframe)
• Clients are responsible for user interfaces; servers manage
data and execute queries
2. Collaborating Server
• Queries can be submitted and can span multiple sites
• No distinction between clients and servers
3. Middleware
• needs just one server (the “middleware”) capable of
managing queries and transactions spanning multiple servers
• the remaining servers handle only local queries and
transactions
• can integrate legacy systems, with limited flexibility and power
Storing Data in a Distributed DBMS
• Relations are stored across several sites
• Accessing data at a remote site incurs message-passing costs
• To reduce this overhead, a single relation may be
partitioned or fragmented across several sites
  • typically at the sites where it is most often accessed
• The data can be replicated as well
  • when the relation is in high demand
Fragmentation
• Break a relation into smaller relations or fragments
  – store them at different sites as needed

[Figure: a relation with tuples t1-t4 and a TID column, split horizontally (by tuple) and vertically (by attribute)]

• Horizontal fragmentation:
  • fragments are usually disjoint
  • each fragment can often be identified by a selection query (e.g. employees in a city, giving locality of reference)
  • to retrieve the full relation, take the union of the fragments
• Vertical fragmentation:
  • fragments are identified by projection queries
  • typically a unique TID is added to each tuple
  • the TIDs are replicated in each fragment
  • ensures that we have a lossless join
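
To make the two schemes concrete, here is a minimal Python sketch; the Employee relation, the city-based split, and the TID values are all illustrative. Horizontal fragments are disjoint selections whose union recovers the relation; vertical fragments are projections that both keep the TID, so the join is lossless.

```python
employees = [{"tid": 1, "name": "Ann", "city": "Durham", "salary": 90},
             {"tid": 2, "name": "Bob", "city": "Raleigh", "salary": 80}]

# Horizontal: disjoint selections; a union recovers the full relation.
durham = [e for e in employees if e["city"] == "Durham"]
other = [e for e in employees if e["city"] != "Durham"]
assert sorted(durham + other, key=lambda e: e["tid"]) == employees

# Vertical: projections that both keep the TID, making the join lossless.
frag1 = [{"tid": e["tid"], "name": e["name"]} for e in employees]
frag2 = [{"tid": e["tid"], "city": e["city"], "salary": e["salary"]}
         for e in employees]
rejoined = [{**a, **b} for a in frag1 for b in frag2
            if a["tid"] == b["tid"]]
assert sorted(rejoined, key=lambda e: e["tid"]) == employees
```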
Replication
• We store several copies of a relation or relation fragments
  • a relation or fragment can be replicated at one or more sites
  • e.g. R is fragmented into R1, R2, R3; one copy each of R2 and R3, but two
copies of R1 at two sites
• Advantages
  • increased availability, e.g. when a site or communication
link goes down
  • faster query evaluation, e.g. using a local copy
• Synchronous and asynchronous replication (later)
  • vary in how current the different copies are when the relation is modified

[Figure: R1 stored at both SITE A and SITE B; R2 and R3 each stored at a single site]
Distributed Query Processing:
Non-Join Distributed Queries

Example query:
SELECT AVG(S.age)
FROM Sailors S
WHERE S.rating > 3 AND S.rating < 7

[Example instance: tuple T1 (rating 4) stored at Shanghai; tuples T2 (rating 5) and T3 (rating 9) stored at Tokyo]

• Horizontally Fragmented: tuples with rating < 5 at Shanghai, rating >= 5 at Tokyo
  • Must compute SUM(age) and COUNT(age) at both sites
  • If the WHERE clause contained just S.rating > 6, only one site would be involved
• Vertically Fragmented: sid and rating at Shanghai, sname and age at Tokyo,
tid at both
  • Must reconstruct the relation by a join on tid, then evaluate the query
  • if there were no tid, the decomposition would be lossy
• Replicated: Sailors copies at both sites
  • Choice of site based on local costs (e.g. index availability) and shipping costs
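
To see why AVG must be assembled from per-site partial aggregates, here is a small Python sketch; the site names and ages are made up, and the "shipping" is just ordinary function calls.

```python
shanghai_ages = [25, 31]        # ages of tuples with rating < 5 (made up)
tokyo_ages = [42, 28]           # ages of tuples with rating >= 5 (made up)

def partial_agg(ages):
    # What each site computes locally and ships to the query site.
    return sum(ages), len(ages)

partials = [partial_agg(shanghai_ages), partial_agg(tokyo_ages)]
total = sum(s for s, _ in partials)
count = sum(c for _, c in partials)
print(total / count)            # global AVG(age) from per-site (SUM, COUNT)
```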
Joins in a Distributed DBMS
• Can be very expensive if relations are stored at
different sites
• Semi-join (we will cover only this)
• Other join methods:
• Fetch as needed
• Ship to one site
• Bloom join (similar approach to semi-join but uses hashing)

[Setup: Sailors (S), 500 pages, stored at LONDON; Reserves (R), 1000 pages, stored at PARIS]

Semijoin – 1/2
• Suppose we want to ship R to London and then do the join with S at
London. Instead:
1. At London, project S onto the join columns and ship this projection to Paris
   • here the join columns are foreign keys, but it could be an arbitrary join
2. At Paris, join the S-projection with R
   • the result is called the reduction of Reserves w.r.t. Sailors (only these tuples are
needed)
3. Ship the reduction of R back to London
4. At London, join S with the reduction of R

Semijoin – 2/2
• Trade off the cost of computing and shipping the
projection against the cost of shipping the full R relation
• Especially useful if there is a selection on Sailors, and the
answer is desired at London
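
A minimal Python sketch of the four semi-join steps, with in-memory lists standing in for the Sailors (London) and Reserves (Paris) relations and ordinary assignments standing in for shipping; the tuples are made up.

```python
sailors = [{"sid": 1, "sname": "Ann", "rating": 9},   # at LONDON
           {"sid": 2, "sname": "Bob", "rating": 3}]
reserves = [{"sid": 1, "bid": 101},                   # at PARIS
            {"sid": 3, "bid": 102}]

# Step 1 (London): project Sailors onto the join column; ship it to Paris.
join_keys = {s["sid"] for s in sailors}

# Step 2 (Paris): the reduction of Reserves w.r.t. Sailors.
reduction = [r for r in reserves if r["sid"] in join_keys]

# Step 3: ship the (hopefully much smaller) reduction back to London.
# Step 4 (London): complete the join locally.
result = [{**s, **r} for s in sailors for r in reduction
          if s["sid"] == r["sid"]]
print(result)   # [{'sid': 1, 'sname': 'Ann', 'rating': 9, 'bid': 101}]
```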


Updating Distributed Data
• Synchronous Replication:
• All copies of a modified relation (or fragment) must be updated before
the modifying transaction commits
• Data distribution is made totally “transparent” (not visible!) to users
• Before an update transaction can commit, it must obtain locks on all
modified copies – slow!
  • implemented via majority voting or “read any, write all”
• Asynchronous Replication:
• Copies of a modified relation are only periodically updated; different
copies may get out of sync in the meantime
• Users must be aware of data distribution
• More efficient – many current products follow this approach
• Update “master” copy and propagate (primary site), or update “any”
copy and propagate (peer to peer)
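
The following toy Python sketch contrasts the two approaches for a single replicated item; the list of copies, the master-copy convention, and propagate() are illustrative stand-ins for real replication machinery.

```python
copies = [{"x": 0}, {"x": 0}, {"x": 0}]   # three replicas of the same fragment
pending = []                              # updates not yet propagated

def write_synchronous(key, value):
    # "Read any, write all": every copy is updated before the commit returns.
    for c in copies:
        c[key] = value

def write_asynchronous(key, value):
    # Primary-site style: update the "master" copy now, propagate later.
    copies[0][key] = value
    pending.append((key, value))

def propagate():
    # Periodic refresh; until it runs, the copies may be out of sync.
    for key, value in pending:
        for c in copies[1:]:
            c[key] = value
    pending.clear()

write_asynchronous("x", 1)
print(copies)    # [{'x': 1}, {'x': 0}, {'x': 0}] -- stale replicas
propagate()
print(copies)    # [{'x': 1}, {'x': 1}, {'x': 1}]
```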
Distributed Transactions
• Distributed CC
• How can locks for objects stored across several sites be
managed?
• How can deadlocks be detected in a distributed
database?
• Distributed Recovery
• When a transaction commits, all its actions, across all the
sites at which it executes, must persist
• When a transaction aborts, none of its actions must be
allowed to persist
Distributed Deadlock Detection
[Figure: local waits-for graphs over T1 and T2 at SITE A and SITE B, and the combined GLOBAL graph; each local graph is acyclic, but the global graph contains a cycle]

• Locking can be managed:
• Centrally – one site manages all locks
• Primary site – primary copy is locked
• Distributed – by different sites storing a copy
• Each site maintains a local waits-for graph
• A global deadlock might exist even if the local graphs contain no cycles
• Further, phantom deadlocks may be created while communicating
• due to delay in propagating local information
• might lead to unnecessary aborts
Three Distributed Deadlock Detection Approaches

1. Centralized
• send all local graphs to one site periodically
• A global waits-for graph is generated
2. Hierarchical
• organize sites into a hierarchy and send local graphs to parent in the
hierarchy
• e.g. sites (every 10 sec)-> sites in a state (every min)-> sites in a
country (every 10 min) -> global waits for graph
• intuition: more deadlocks are likely across closely related sites
3. Timeout
• abort transaction if it waits too long (low overhead)
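
A small Python sketch of the centralized approach: each site ships its local waits-for graph as a set of edges, and the coordinator unions them and runs cycle detection. The edges chosen reproduce the classic case where both local graphs are acyclic but the global graph is not.

```python
site_a = {("T1", "T2")}   # at SITE A: T1 waits for T2
site_b = {("T2", "T1")}   # at SITE B: T2 waits for T1

def has_cycle(edges):
    """DFS-based cycle detection over a waits-for graph given as edge pairs."""
    graph = {}
    for u, v in edges:
        graph.setdefault(u, set()).add(v)
    visited, on_stack = set(), set()
    def dfs(node):
        visited.add(node)
        on_stack.add(node)
        for nxt in graph.get(node, ()):
            if nxt in on_stack or (nxt not in visited and dfs(nxt)):
                return True
        on_stack.discard(node)
        return False
    return any(dfs(n) for n in list(graph) if n not in visited)

# Neither local graph has a cycle, but their union (the global graph) does.
print(has_cycle(site_a), has_cycle(site_b))   # False False
print(has_cycle(site_a | site_b))             # True: global deadlock
```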
Distributed Recovery
• Two new issues:
• New kinds of failure, e.g., links and remote sites
• If “sub-transactions” of a transaction execute
at different sites, all or none must commit
• Need a commit protocol to achieve this
• Most widely used: Two Phase Commit (2PC)

• A log is maintained at each site
  • as in a centralized DBMS
  • commit protocol actions are additionally logged
Two-Phase Commit (2PC)
• The site at which the transaction originates is the coordinator
• The other sites at which it executes are subordinates
  • w.r.t. coordination of this transaction
• Two rounds of communication (overview only)
• Phase 1:
• Prepare from Coordinator to all subs asking if ready to commit
• Yes/No – from subs to coordinator
• To commit, all subs must say yes
• Phase 2:
• If yes from all subs, coordinator sends out Commit
• Subs do so, and send back ack
• Before each message is sent, the log is updated with the decision
• On a timeout waiting for the next message, ping the other
machines, restart, or abort according to the state of the log
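
A toy Python sketch of the coordinator's control flow in 2PC; the Subordinate class stands in for remote sites and a list stands in for the forced log, so messaging, timeouts, and recovery are deliberately omitted.

```python
class Subordinate:
    """Illustrative stand-in for a remote site participating in 2PC."""
    def __init__(self, vote):
        self.vote = vote
    def prepare(self):            # phase 1: reply to the coordinator's Prepare
        return self.vote
    def receive(self, decision):  # phase 2: apply the decision (and "ack")
        self.decision = decision

def two_phase_commit(subs, log):
    votes = [s.prepare() for s in subs]        # phase 1: collect yes/no votes
    decision = "commit" if all(v == "yes" for v in votes) else "abort"
    log.append(decision)                       # force the decision to the log first
    for s in subs:                             # phase 2: broadcast the decision
        s.receive(decision)
    return decision

log = []
print(two_phase_commit([Subordinate("yes"), Subordinate("yes")], log))  # commit
print(two_phase_commit([Subordinate("yes"), Subordinate("no")], log))   # abort
```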
NoSQL

• Optional reading:
• Cattell’s paper (2010-11)
• Warning: some information may be outdated
• see webpage http://cattell.net/datastores/ for
updates and more pointers
So far -- RDBMS
• Relational Data Model
• Relational Database Systems (RDBMS)
• RDBMSs have
• a complete pre-defined fixed schema
• a SQL interface
• and ACID transactions
NOSQL
• Many of the new systems are referred to as “NoSQL” data
stores
  • MongoDB, CouchDB, VoltDB, Dynamo, Membase, ….
• NoSQL stands for “Not Only SQL” or “Not Relational”
  • the exact meaning is not entirely agreed upon
• NoSQL = “new” database systems
  • typically not RDBMSs
  • relax some requirements to gain efficiency and scalability
• New systems choose to use or not use several concepts we
learnt so far
  • e.g. System X does not use locks but uses multi-version CC (MVCC), or
System Y uses asynchronous replication
Applications of New Systems
• Designed to scale simple “OLTP”-style application loads
  • to do updates as well as reads
    • in contrast to traditional DBMSs and data warehouses
  • to provide good horizontal scalability for simple
read/write database operations distributed over many
servers
• Originally motivated by Web 2.0 applications
  • these systems are designed to scale to thousands or
millions of users
NoSQL: Six Key Features
1. the ability to horizontally scale “simple operations”
throughput over many servers
2. the ability to replicate and to distribute (partition) data
over many servers
3. a simple call level interface or protocol (in contrast to SQL
binding)
4. a weaker concurrency model than the ACID transactions
of most relational (SQL) database systems
5. efficient use of distributed indexes and RAM for data
storage
6. the ability to dynamically add new attributes to data
records
BASE (not ACID)
• Recall the ACID properties desired of RDBMS transactions:
  • Atomicity, Consistency, Isolation, and Durability
• NOSQL systems typically do not provide ACID; instead they aim for BASE:
• Basically Available
• Soft state
• Eventually consistent
ACID vs. BASE
• The idea is that by giving up ACID constraints, one
can achieve much higher performance and
scalability

• The systems differ in how much they give up


• e.g. most of the systems call themselves “eventually
consistent”, meaning that updates are eventually
propagated to all nodes
• but many of them provide mechanisms for some degree
of consistency, such as multi-version concurrency control
(MVCC)
“CAP” Theorem
• Eric Brewer’s CAP theorem is often cited for NoSQL
• A distributed system can have only two out of three of the following
properties:
  • Consistency
  • Availability
  • Partition-tolerance
• The NoSQL systems generally give up consistency
  • however, the trade-offs are complex
What is different in NOSQL systems
• When you study a new NOSQL system, notice how
it differs from RDBMS in terms of

1. Concurrency Control
2. Data Storage Medium
3. Replication
4. Transactions
Choices in NOSQL systems:
1. Concurrency Control
a) Locks
• some systems provide one-user-at-a-time read or
update locks
• MongoDB provides locking at a field level
b) MVCC
c) None
• do not provide atomicity
• multiple users can edit in parallel
• no guarantee which version you will read
d) ACID
• pre-analyze transactions to avoid conflicts
• no deadlocks and no waits on locks
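
A minimal sketch of the MVCC idea from option (b): writers append new versions rather than locking, and each reader sees the latest version no newer than its snapshot timestamp. The version-list representation and timestamps here are illustrative, not any particular system's implementation.

```python
versions = {"x": [(0, "v0")]}   # key -> list of (commit_timestamp, value)

def write(key, value, commit_ts):
    # Writers append a new version instead of overwriting (or locking).
    versions.setdefault(key, []).append((commit_ts, value))

def read(key, snapshot_ts):
    # Readers see the latest version no newer than their snapshot.
    visible = [(ts, v) for ts, v in versions[key] if ts <= snapshot_ts]
    return max(visible)[1]

write("x", "v1", commit_ts=5)
print(read("x", snapshot_ts=3))  # 'v0' -- old snapshot unaffected by the write
print(read("x", snapshot_ts=9))  # 'v1'
```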
Choices in NOSQL systems:
2. Data Storage Medium

a) Storage in RAM
• snapshots or replication to disk
• poor performance when overflows RAM
b) Disk storage
• caching in RAM
Choices in NOSQL systems:
3. Replication
• whether mirror copies are always in sync
a) Synchronous
b) Asynchronous
• faster, but updates may be lost in a crash
c) Both
• local copies synchronously, remote copies
asynchronously
Choices in NOSQL systems:
4. Transaction Mechanisms

a) support
b) do not support
c) in between
• support local transactions only within a single object or
“shard”
• shard = a horizontal partition of data in a database
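
A toy sketch of option (c): with hash-based sharding, a transaction that touches a single key stays within one shard and needs no cross-site commit protocol. The shard count and key format are arbitrary choices for illustration.

```python
NUM_SHARDS = 4
shards = [dict() for _ in range(NUM_SHARDS)]   # each dict models one shard

def shard_for(key):
    return shards[hash(key) % NUM_SHARDS]      # hash partitioning

def local_txn(key, update):
    # A "local transaction": it touches exactly one shard, so it can be
    # executed there without any cross-site commit protocol.
    shard = shard_for(key)
    shard[key] = update(shard.get(key))

local_txn("user:42", lambda v: (v or 0) + 1)
print(shard_for("user:42")["user:42"])         # 1
```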
Comparison from Cattell’s paper (2011) [comparison table omitted]
Data Store Categories
• The data stores are grouped according to their data model
• Key-value Stores:
• store values and an index to find them based on a programmer-
defined key
• e.g. Project Voldemort, Riak, Redis, Scalaris, Tokyo Cabinet,
Memcached/Membrain/Membase
• Document Stores:
• store documents, which are indexed, with a simple query mechanism
• e.g. Amazon SimpleDB, CouchDB, MongoDB, Terrastore
• Extensible Record Stores:
• store extensible records that can be partitioned vertically and
horizontally across nodes (“wide column stores”)
• e.g. HBase, HyperTable, Cassandra, Yahoo’s PNUTS
• Relational Databases:
• store (and index and query) tuples, e.g. the new RDBMSs that provide
horizontal scaling
• e.g. MySQL Cluster, VoltDB, Clustrix, ScaleDB, ScaleBase, NimbusDB,
Google Megastore (a layer on BigTable)
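
To make the key-value category concrete, here is a minimal sketch of the kind of call-level interface such stores expose: get/put/delete on a programmer-defined key with opaque values. The class is illustrative and not the API of any system listed above.

```python
class KeyValueStore:
    """Illustrative key-value store: an index over programmer-defined keys."""
    def __init__(self):
        self._index = {}                 # key -> opaque value

    def put(self, key, value):
        self._index[key] = value

    def get(self, key, default=None):
        return self._index.get(key, default)

    def delete(self, key):
        self._index.pop(key, None)

kv = KeyValueStore()
kv.put("user:42", {"name": "Ann", "cart": ["book"]})
print(kv.get("user:42"))                 # values are opaque: no joins, no SQL
```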
RDBMS benefits
• Relational DBMSs have taken and retained majority market
share over other competitors in the past 30 years
• While there is no “one size fits all” among the SQL products themselves,
the common interface of SQL, transactions, and relational schema
gives advantages in training, continuity, and data interchange
• Successful relational DBMSs have been built to handle specific
application loads in the past:
  • read-only or read-mostly data warehousing, OLTP on multi-core
multi-disk CPUs, in-memory databases, distributed databases, and
now horizontally scaled databases
NoSQL benefits
• We haven’t yet seen good benchmarks showing that RDBMSs can
achieve scaling comparable with NoSQL systems like Google’s BigTable
• If you only require lookups of objects based on a single key, then a
key-value or document store may be adequate and probably easier to
understand than a relational DBMS
• Some applications require a flexible schema
• A relational DBMS makes “expensive” (multi-node, multi-table)
operations “too easy”
  • NoSQL systems make them impossible or obviously expensive for
programmers
• The new systems are slowly gaining market share too
Column Store
Row vs. Column Store

• Row store
• store all attributes of a tuple together
• storage like “row-major order” in a matrix
• Column store
• store all rows for an attribute (column) together
• storage like “column-major order” in a matrix
• e.g. MonetDB, Vertica (earlier, C-Store), SAP/Sybase IQ,
Google Bigtable (with column groups)
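
A small Python sketch of the two layouts for the same table; analytic scans of one attribute read only that attribute's array in the column layout, which is the main appeal of column stores. The data is made up.

```python
# The same two-tuple Sailors-like table in both layouts.
rows = [("Ann", 9, 35),                   # row store: whole tuple together
        ("Bob", 3, 28)]

columns = {                               # column store: whole column together
    "sname":  ["Ann", "Bob"],
    "rating": [9, 3],
    "age":    [35, 28],
}

# AVG(age) in the column layout touches only the age column, not the rest.
print(sum(columns["age"]) / len(columns["age"]))   # 31.5
```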
Ack: Slides from the VLDB 2009 tutorial on column stores (slide images omitted)