TechVault Distributed Databases
Challenges
• Server failures
• Scaling storage and processing
Server Failure
[Diagram: Users → Application → Database]
Distributed Database
[Diagram: a cluster with one leader (also called master or coordinator) and several followers (also called segments)]
Sharding
[Diagram: a centralized database compared with a distributed database whose data is split into shards]
SELECT count(*)
FROM orders
[Diagram: the same query runs on every shard; the numbers shown are 20, 15, and 30, with 65 as their sum]
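One common way to decide which shard holds a row is to hash its key. The sketch below assumes three shards and hypothetical order ids of the form order-N; neither the hashing scheme nor the names come from the slides, and real systems may shard by range or by a lookup table instead.

```python
import hashlib

NUM_SHARDS = 3  # assumed shard count, for illustration only


def shard_for(key: str, num_shards: int = NUM_SHARDS) -> int:
    """Map a row key to a shard index using a stable hash."""
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_shards


def distribute(order_ids, num_shards: int = NUM_SHARDS):
    """Place each order id on exactly one shard."""
    shards = [[] for _ in range(num_shards)]
    for order_id in order_ids:
        shards[shard_for(order_id, num_shards)].append(order_id)
    return shards


order_ids = [f"order-{i}" for i in range(65)]  # hypothetical keys
shards = distribute(order_ids)
print([len(s) for s in shards])  # per-shard counts; together they cover all 65 rows
```

Because the hash is stable, the same key always lands on the same shard, so a lookup by key only needs to contact one node.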
Replication
[Diagram: Master → Standby; Segments → Replica]
• Synchronous
• Asynchronous
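The difference between the two replication modes can be sketched with a toy in-memory model. The classes Primary and Replica and the flush method are hypothetical names for this illustration, not the API of any particular database.

```python
class Replica:
    """A standby copy that applies changes shipped from the primary."""

    def __init__(self):
        self.data = {}

    def apply(self, key, value):
        self.data[key] = value


class Primary:
    def __init__(self, standby, synchronous):
        self.data = {}
        self.standby = standby
        self.synchronous = synchronous
        self.pending = []  # changes not yet shipped to the standby

    def write(self, key, value):
        self.data[key] = value
        if self.synchronous:
            # Synchronous: acknowledge only after the standby has the change.
            self.standby.apply(key, value)
        else:
            # Asynchronous: acknowledge now, ship the change later.
            self.pending.append((key, value))
        return "ack"

    def flush(self):
        """Ship queued changes to the standby (asynchronous mode only)."""
        while self.pending:
            self.standby.apply(*self.pending.pop(0))


sync_standby = Replica()
sync_primary = Primary(sync_standby, synchronous=True)
sync_primary.write("order-1", "paid")

async_standby = Replica()
async_primary = Primary(async_standby, synchronous=False)
async_primary.write("order-1", "paid")
lagging = "order-1" not in async_standby.data  # standby is behind until flush
async_primary.flush()
```

In this model the synchronous standby is never behind, at the cost of doing extra work before each acknowledgement. The asynchronous standby can lag, so a failover before flush would lose the queued write.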
Distributed Databases
Advantages
• Scalability
• Improved performance
• High availability
• Geographic distribution
Query Processing
SELECT count(*)
FROM orders
[Diagram: the query is run against each segment and the results are combined; the numbers shown are 15, 65, and 30]
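A count over a distributed table can be computed by running the query on every segment and adding up the partial results. The sketch below simulates three segments with in-memory SQLite databases. The helpers make_segment and distributed_count are hypothetical, and the per-segment row counts (20, 15, 30) are chosen so the total matches the 65 shown on the slides; treating those numbers as per-segment counts is an assumption.

```python
import sqlite3


def make_segment(num_orders: int) -> sqlite3.Connection:
    """Create one in-memory segment holding its own slice of the orders table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY)")
    conn.executemany(
        "INSERT INTO orders (id) VALUES (?)",
        [(i,) for i in range(num_orders)],
    )
    return conn


def distributed_count(segments) -> int:
    """Run the same count on every segment, then combine the partial results."""
    partials = [
        conn.execute("SELECT count(*) FROM orders").fetchone()[0]
        for conn in segments
    ]
    return sum(partials)


segments = [make_segment(n) for n in (20, 15, 30)]
total = distributed_count(segments)
print(total)  # 65
```

Counts combine by simple addition. Other aggregates need more care: an average, for example, has to be rebuilt from per-segment sums and counts rather than by averaging the per-segment averages.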
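Consistency
• Every read returns the most recent write (or an error)
Availability
• Every request receives a response (but not necessarily the latest data)
Partition Tolerance
• The system continues to operate even if a network failure cuts off communication between some nodes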
CAP Theorem
• CA: Centralized Databases
• CP, AP: Distributed Databases
When a query arrives during a network partition:
• CP (prioritize consistency)
  • No stale results
  • Return error (unavailable)
• AP (prioritize availability)
  • Return stale result (not consistent)
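The two behaviours can be sketched with a toy node that has been cut off from the rest of the cluster. The Node class, the Unavailable exception, and the values "v1" and "v2" are hypothetical and exist only for this illustration.

```python
class Unavailable(Exception):
    """Raised by a CP node that cannot confirm it has the latest data."""


class Node:
    def __init__(self, mode):
        self.mode = mode          # "CP" or "AP"
        self.value = "v1"         # last value replicated to this node
        self.partitioned = False  # True when cut off from the rest of the cluster

    def read(self):
        if self.partitioned and self.mode == "CP":
            # Prioritize consistency: refuse rather than risk a stale answer.
            raise Unavailable("cannot reach the cluster")
        # Prioritize availability: answer with whatever this node has.
        return self.value


cp_node, ap_node = Node("CP"), Node("AP")
# Assume the rest of the cluster has moved on to "v2" while these nodes are cut off.
cp_node.partitioned = ap_node.partitioned = True

try:
    cp_result = cp_node.read()
except Unavailable:
    cp_result = "error"

ap_result = ap_node.read()
print(cp_result, ap_result)  # error v1
```

Neither choice is wrong in itself. Which one fits depends on whether an error or an out-of-date answer is more harmful to the application.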