Week 6 - Global Serialization
Week 6 - Global Serialization
ke
OUTLINE
• 1. Global Serialization
1. Global Serialization:
Definition: Global Serialization in the context of distributed databases refers to the process
of ensuring that transactions are executed in a globally serializable order, even when
different nodes. Global Serialization is essential to maintain data consistency and integrity
words, it ensures that transactions appear to be executed one after the other, even
serializable order.
• Conflict Resolution: Conflicts can arise when multiple transactions access and
modify the same data concurrently. Conflict resolution mechanisms determine how
Multiple customers place orders concurrently from different regions. To maintain order
consistency, Global Serialization ensures that each order transaction is processed in a way
that maintains the global order history, even though orders are placed from different
locations.
Explanation: Data Distribution strategies are employed to optimize the performance and
resilience of distributed database systems. Replication and partitioning are two common
approaches:
• Data Replication: In data replication, copies of the same data are stored on multiple
nodes or servers. This strategy improves data availability and fault tolerance since
data can still be accessed even if one node fails. It also reduces data access latency.
• Data Partitioning (Sharding): Data partitioning involves dividing a large dataset into
Abdulrahman A. Mohamed Mobile: +254 713 500 814 Email: [email protected]
different node, distributing the data processing load. This strategy enhances
Real-World Example: A social media platform uses data replication for user profiles. Each
user's profile information is replicated across multiple servers to ensure fast access and
availability. However, for the vast amount of user-generated content, such as posts and
based on user IDs, and each partition is stored on different servers to distribute the
processing load.
distributed and potentially exposed nature of data. Here are key aspects:
to protect data from eavesdropping during transit. Secure Socket Layer (SSL) and
• Authorization: Access control is critical. Role-based access control (RBAC) and access
control lists (ACLs) define who can access which parts of the database and what
• Data Privacy Compliance: Distributed databases often store sensitive data, such as
HIPAA, is crucial.
access and changes to data, helping detect and respond to security breaches.
authentication for healthcare providers accessing patient data, and role-based access
control. The database is audited regularly to ensure compliance with healthcare privacy