0% found this document useful (0 votes)

151 views49 pages

Design Key-Value Database + Real Usecase

Key-value databases are commonly used for caching, real-time recommendations, and queueing systems due to their fast read/write capabilities. The document discusses designing keys, structuring values, limitations of key-value databases, and common design patterns including time to live, emulating tables, aggregates, atomic aggregates, enumerable keys, and indexes. It also provides examples of when to use key-value databases such as for user sessions in web applications or powering recommendations and advertising.

Uploaded by

CNTT-2C-18 Vũ Thu hoài

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

151 views49 pages

Design Key-Value Database + Real Usecase

Uploaded by

CNTT-2C-18 Vũ Thu hoài

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 49

Design Key-value database

Nguyen Huu Cam

Table of contents
• Key design and partitioning

• Designing structured values

• Limitation of Key-value database

• Design pattern for key-value database

• Use cases to use and when not to use
1. Key design and partitioning
• How you design your keys can impact the ease of working with your key-value
database.

• Avoid meaning less name, such as ‘laklsjfdjjd’

• Key should have logical structure to make code more readable and extensible, but
also be designed with storage efficiency in mind
General guideline to name key
• Use meaningful and unambiguous naming components

• Use range-based components when would like to retrieve ranges of values

• Use common delimiter when appending components to make a key. Normally, use
“:” as delimiter

• Keep key as short as possible without sacrifice other characteristics mentioned

Keys Must Take into Account Implementation
Limitations
• Different key-value databases have different limitations. Consider those limitations
when choosing your key-value database

• Some key-value database restricts size of keys. For example, FoundationDB limits the
size of keys to 10,000 bytes
Valid datatypes in Redis
Redis datatypes
• The variety of data types supported by Redis allows you more flexibility when
creating keys

• Redis keys can be up to 512MB in length

How keys are used in partitioning
• Key is used in range partitioning

• Range partitioning works by grouping contiguous values and sending them to the
same node in a cluster

• Sort order must be defined over keys. For example, partition by customer number,
date, part identifier.
Range partitioning example
If you decide to use range partitioning, carefully consider how your data volumes may grow. If you need to
restructure your partitioning scheme, some keys may be reassigned to different nodes and data will have to
migrate between nodes.
2. Designing structure values
• Store frequently used values in memory. But it is limited by the amount of room in
memory allocated to caching

• Store commonly used attribute values together

as this reduce the number of disk seeks that must be performed to read all the needed
data than store name and address separately.
Designing structured values
• If customer name is used many times but not the address, store key name separately.
This would duplicate the customer name in your key-value database, but that should
not be considered a problem.

• However, there are many cases that use only key “name” to lookup name when only
that attribute is needed. This is beneficial to our application. Or use key “name” and
“address” when name and address are needed.
Large values can lead to inefficient Read/Write
Operations

• Using structured data types such as lists and sets can improve overall efficiency of
applications

• It is important to also consider how increasing the size of a value can adversely
impact read and write operations

• Consider data structure in the next image (don’t do that on your database)
Customer
info.

Cart items

Key: ordID:781379
How this structure is built
• When the customer adds her first item to her cart, the list is created and the
customer name and address are copied from the customer database.

• An order array is created and a list with the item identifier, quantity, description, and
price is added to the array. The key value is hashed and the entire data structure is
written to disk.

• The customer then adds another item to the cart, and a new entry is added to the
array of ordered items. Because the value is treated as an atomic unit, the entire list
(for example, customer information and ordered items) is written to the disk again.

• This process continues for each of the additional items.

3. Limitation of key-value
• The only way to lookup value is by key: If you want to lookup information in the
value, but only using key, this is not convenient.

• Some key-value databases do not support range queries. For example: searching data
within date range.

• There is no standard query language comparable to SQL: Only GET and SET are
enough.
3. Design pattern for Key-Value database
• There are some aspects of Key-Value database need considerations

• Time to Live (TTL)

• Emulating tables
• Aggregates
• Atomic aggregates
• Enumerable keys
• Indexes
3.1. Time to live
• Is a term to describe a temporary object which has start time and expiration time.

• Used in caching data in limited memory server when keys are used to hold a resource
for some specified period of time

• When current time exceeds expiration time, data will be removed to free memory to
store something else.
3.2. Emulating table (Table simulation)
• Implement data structure like a table, with only 2 methods SET and GET

• Emulating tables streamlines the getting and setting of multiple attributes related to
a single instance of an entity, but should not be overused

• Frequent use of emulating tables can indicate a misuse of a key-value database

• It is helpful when you routinely get or set a related set of attributes. This pattern is
useful when you are dealing with a small number of emulated tables.
Sample implementation
SET GET
3.3. Aggregate
• Aggregation is a pattern that supports different attributes for different subtypes of an
entity

• In a relational database, you can handle subtypes in a couple of different ways. You
could create a single table with all attributes across all subtypes.

• You could also create a table with the attributes common to all subtypes and then
create an additional table for each of the subtypes.
Example data
Design in Relational database
Possible design
If it’s stadium

If it’s small venue

If it’s festival
• Each of these ticket may be assigned by single namespace such as ConcertApp, such as
3.4. Atomic Aggregate
• Atomic aggregates contain all values that must be updated together or not at all.

• The atomic aggregate pattern uses a single assignment statement to save multiple values

• For example, if the concert ticket application logged a record each time a stadium ticket is
purchased, it should record the date, location, and seat assignment.

This will save all three values or none at all. Critical data will be missed if only 1 attr is saved
Atomic Aggregate (2)
• If trying to log each attribute separately, some attributes will be updated while others are not

• The best way is to store all information in a single key

• What if the server writing this data to disk failed after writing the locDescr attribute but
before writing the assgnSeat attribute, then you would lose a critical piece of data.
3.5. Enumerable Keys
• Keys that use counters or sequences to generate new keys

• This on its own would not be too useful; however, when combined with other
attributes, this can be helpful when working with groups of keys
Example
• Entity “ticketLog” and counter is used as key

• Counter starts at 1 and increases each time a ticket is sold. For

example: 'ticketLog:20140617:1', 'ticketLog:20140617:2’,
'ticketLog:20140617:3'

• However, with ranged values such as from date… to date…, this key naming
convention doesn’t fully function well.
Example (2)
• If day is required for logging, then this key format is accepted

• 20140617: Date sold on June 17th 2014

• 10: Tenth ticket

• Of course, range of tickets can be retrieved by generating series of keys, such as until
reaches the number of keys specified
3.6. Index
• Inverted indexes are sets of key-value pairs that allow for looking up keys or values by
other attribute values of the same entity

• For example

• This is useful for tracking all seats assigned across concerts, but it is not easy to list
only seats assigned in a particular location unless your key-value database provides
search capabilities
Perform searching
• In order to do that, write a function to search by key and if matches, append to the end
of list and return result

• If the function is initially called as the following,

• It would set the value of ConcertApp['Springfield Civic Center'] to

{'J38’} and so on with other values
When to use design patterns
• The Time to Live pattern is useful when you have operations that may be disrupted
and can be safely ignored after some period of inactivity or inability to finish the
operation

• Emulating tables streamlines the getting and setting of multiple attributes related to
a single instance of an entity, but should not be overused. Frequent use of emulating
tables can indicate a misuse of a key-value database

• Aggregates provide a means for working with entities that need to manage subtypes
and different attributes associated with each subtype.
When to use design patterns(2)
• The atomic aggregate pattern is used when you have multiple attributes that should
be set together

• Enumerable keys provide a crude range functionality by allowing a program to

generate and test for the existence of keys

• Indexes allow you to look up attribute values starting with something other than a
key
5. Common Use-cases
• Web applications may store user session details and preference in a key-value store.
All the information is accessible via user key, and key-value stores lend themselves to
fast reads and writes.
• Real-time recommendations and advertising are often powered by key-value stores
because the stores can quickly access and present new recommendations or ads as a
web visitor moves throughout a site.
• On the technical side, key-value stores are commonly used for in-memory data
caching to speed up applications by minimizing reads and writes to slower disk-based
systems.
Common use cases (2)
• Queues: Any application that deals with traffic congestion, messaging, data
gathering, job management, or packer routing

• Ip address tracking: Redis Sets are a great tool for developers who want to analyze all
of the IP addresses that visited a specific website page or blog post

• Scoreboards: Redis Sorted Sets to maintain their high score lists, as scores can be
repeated, but the strings which contain the unique user details cannot.
5. Applications using Redis
• Handing user information

• Handing user session

• Supporting personalization
Handling user session
• Generally, every web session is unique and is assigned a unique sessionId value after
user successfully logs in.

• Applications that store the sessionId on disk or in an RDBMS will greatly benefit from
moving to a Key-Value DB since everything about the session can be stored by a single
PUT request or retrieved using GET.

• This single-request operation makes it very fast, as everything about the session is
stored in a single object as we use in key-value
Example session storage
• Store sessionId in key
Supporting personalization
• User receives different personalized views of same data from database. These view
preferences need to be saved somewhere

• No need to store personalization settings in database, it’s specific to frontend

application

• Using key-value store contains userId and the service allows to store
personalization settings as value. This makes system responds very quick and prevent
slow performance.
Example personalization storage

User:<userId>:preferences = {
item_per_page: 10,
theme: light,
notifications: {
comments: [‘push’, ‘email’, ‘sms’],
tags: [‘push’,’email’, ‘sms]
}
…
}
5.2. High-speed data caching
• Should cache data that takes times to complete, or can reuse, instead of querying
same data over and over again, such as querying authenticated users info.

• High‐speed in‐memory caching provides this caching capability without the need for
a separate application level caching layer.

• This reduces total cost of ownership and makes developing well‐performing

applications quicker and easier.
Example
• Laravel_news:posts = [
{
Id: 1,
Name: ‘abc’,
},
{
Id: 2,
Name: ‘def’
}
]
Redis use case: Queues
• For example: Send email to all subscribers. We can use queue to handle sending
email for each person who subscribed
Queues(2)
• Each time user subscribed to a channel, a new user data will be serialized and will be
recorded to Redis and DB for persistent by appending to the end of LIST using
RPUSH. After sending email, LPOP user that has been sent.
When should not use
• Relationships among data
• If you need to have relationships between different sets of data, or correlate the data
between different sets of keys, key-value stores are not the best solution to use.

• Multi-operation transaction
• If you’re saving multiple keys and there is a failure to save any one of them, and you want
to revert or roll back the rest of the operations, key-value stores are not the best solution
to be used.
When should not use (2)
• Query by data
• If you need to search the keys based on something found in the value part of the key-
value pairs, then key-value stores are not going to perform well for you.

• Operations by Sets
• Since operations are limited to one key at a time, there is no way to operate upon multiple
keys at the same time. If you need to operate upon multiple keys, you have to handle this
from the client side

Se Unit 3
100% (1)
Se Unit 3
21 pages
50 TOP SOA Multiple Choice Questions and Answers PDF Free Download - Multiple Choice Questions and Answers PDF
40% (5)
50 TOP SOA Multiple Choice Questions and Answers PDF Free Download - Multiple Choice Questions and Answers PDF
7 pages
NOTES Agile - Unit 1
No ratings yet
NOTES Agile - Unit 1
40 pages
1.fundamentals of Agile
No ratings yet
1.fundamentals of Agile
42 pages
Case Study On Dbms & Rdbms
No ratings yet
Case Study On Dbms & Rdbms
36 pages
Samuel - Object-Oriented Systems Analysis and Design
No ratings yet
Samuel - Object-Oriented Systems Analysis and Design
317 pages
Mastering Blockchain: Chapter 2, Decentralization
No ratings yet
Mastering Blockchain: Chapter 2, Decentralization
15 pages
PST UnitV
No ratings yet
PST UnitV
14 pages
Identifying Use Cases - Object Analysis - Classification
No ratings yet
Identifying Use Cases - Object Analysis - Classification
147 pages
Agile Technologies 21CS641 Module 1
No ratings yet
Agile Technologies 21CS641 Module 1
19 pages
Cloud Computing Unit 1
No ratings yet
Cloud Computing Unit 1
23 pages
Object Oriented Analysis and Design Two Mark and Sixteen Mark Q & A Part - A Questions and Answers Unit-I
No ratings yet
Object Oriented Analysis and Design Two Mark and Sixteen Mark Q & A Part - A Questions and Answers Unit-I
39 pages
Sample Questions:: Section I: Subjective Questions
100% (2)
Sample Questions:: Section I: Subjective Questions
7 pages
SD Unit 1
No ratings yet
SD Unit 1
30 pages
Se Module 2 PPT
No ratings yet
Se Module 2 PPT
86 pages
DBMS Assignment 1
No ratings yet
DBMS Assignment 1
11 pages
DBMS Question DBMS
100% (1)
DBMS Question DBMS
14 pages
Distributed Deadlocks and Transaction Recovery
100% (1)
Distributed Deadlocks and Transaction Recovery
22 pages
CS403 - Database Management Systems
No ratings yet
CS403 - Database Management Systems
4 pages
Software Risk, Configuration Management
No ratings yet
Software Risk, Configuration Management
35 pages
Ooad MCQ
100% (1)
Ooad MCQ
7 pages
NORMALIZATION
No ratings yet
NORMALIZATION
51 pages
Luyện Đọc Tiếng Anh - p3
No ratings yet
Luyện Đọc Tiếng Anh - p3
8 pages
Introduction To AngularJS
No ratings yet
Introduction To AngularJS
74 pages
Comprehensive 1 PDF
No ratings yet
Comprehensive 1 PDF
154 pages
OOAD Viva Question
No ratings yet
OOAD Viva Question
2 pages
ECOPS
100% (1)
ECOPS
39 pages
MVC Design Pattern PPT Presented by QuontraSolutions
No ratings yet
MVC Design Pattern PPT Presented by QuontraSolutions
35 pages
Storage in Cloud
No ratings yet
Storage in Cloud
51 pages
MCQ Final
No ratings yet
MCQ Final
46 pages
The Good Shadowing Guide by Julian Northbrook
67% (3)
The Good Shadowing Guide by Julian Northbrook
8 pages
Introduction of DBMS
No ratings yet
Introduction of DBMS
83 pages
Feedback: Your Answer Is Correct
No ratings yet
Feedback: Your Answer Is Correct
46 pages
Use Case Quiz Q
No ratings yet
Use Case Quiz Q
8 pages
Atm Uml Diagram
No ratings yet
Atm Uml Diagram
7 pages
Industrial Extreme Programming: Submitted By: Group 3 Submitted To
No ratings yet
Industrial Extreme Programming: Submitted By: Group 3 Submitted To
7 pages
Agile Methodology
No ratings yet
Agile Methodology
6 pages
DeltaX - Fresher Sample App - Spotify
No ratings yet
DeltaX - Fresher Sample App - Spotify
5 pages
L04-The Components of Software Quality Assurance System
No ratings yet
L04-The Components of Software Quality Assurance System
47 pages
Requirements Analysis and Specification PDF
No ratings yet
Requirements Analysis and Specification PDF
77 pages
Component Diagram Tutorial
No ratings yet
Component Diagram Tutorial
6 pages
TE7265 - Introduction To Data Science
No ratings yet
TE7265 - Introduction To Data Science
4 pages
Final Project Report: Locate A Tutor
No ratings yet
Final Project Report: Locate A Tutor
65 pages
Enterprise Java Unit 5
No ratings yet
Enterprise Java Unit 5
10 pages
Unit I - Relational Database Part - A (2 Marks) : 1. Define Database Management System
No ratings yet
Unit I - Relational Database Part - A (2 Marks) : 1. Define Database Management System
27 pages
Java Model Paper Answer
No ratings yet
Java Model Paper Answer
18 pages
SPM 2 Marks Refer
No ratings yet
SPM 2 Marks Refer
13 pages
Serializability
No ratings yet
Serializability
26 pages
#Shuaib OODJ Assignment
No ratings yet
#Shuaib OODJ Assignment
46 pages
Requirement Modeling - 1
No ratings yet
Requirement Modeling - 1
12 pages
Chap 2 Emerging Database Landscape
No ratings yet
Chap 2 Emerging Database Landscape
10 pages
CHAPTER 3 Software Engineering
No ratings yet
CHAPTER 3 Software Engineering
4 pages
Unit IV Ooad
No ratings yet
Unit IV Ooad
68 pages
Design Document Database
No ratings yet
Design Document Database
62 pages
Employee Performance Portal
No ratings yet
Employee Performance Portal
5 pages
Partitioning Strategy
No ratings yet
Partitioning Strategy
17 pages
Horizontal and Vertical Silos
No ratings yet
Horizontal and Vertical Silos
3 pages
What Is Microsoft Access
No ratings yet
What Is Microsoft Access
5 pages
Best Practices of Software Engineering: Obj I Objectives
No ratings yet
Best Practices of Software Engineering: Obj I Objectives
31 pages
Unit 9 - Retrieve Data Using Subqueries
No ratings yet
Unit 9 - Retrieve Data Using Subqueries
20 pages
Simplified Guide To Learning SQL For Product Managers
No ratings yet
Simplified Guide To Learning SQL For Product Managers
6 pages
Cit208 Calculus Educational Consult Eze-Ego QQQZZZW Updated
No ratings yet
Cit208 Calculus Educational Consult Eze-Ego QQQZZZW Updated
31 pages
Unit 2 - Question Bank
No ratings yet
Unit 2 - Question Bank
18 pages
Lecture 2 - Spring Data and Spring Data Rest
No ratings yet
Lecture 2 - Spring Data and Spring Data Rest
47 pages
XIIInfo Pract S E 435
0% (1)
XIIInfo Pract S E 435
5 pages
SQL02 DML
No ratings yet
SQL02 DML
18 pages
Data Preparation
No ratings yet
Data Preparation
28 pages
Using Oracle Autonomous Database Serverless 21 30
No ratings yet
Using Oracle Autonomous Database Serverless 21 30
10 pages
Answer
No ratings yet
Answer
21 pages
SAP HANA Distributed In-Memory Database System Transaction Session and Metadata Management
No ratings yet
SAP HANA Distributed In-Memory Database System Transaction Session and Metadata Management
9 pages
DBMS SQL Delete - Javatpoint
No ratings yet
DBMS SQL Delete - Javatpoint
5 pages
DBMS Material Module-1 & 2
No ratings yet
DBMS Material Module-1 & 2
67 pages
6 SQL Post 4
No ratings yet
6 SQL Post 4
42 pages
DMS Annexure 2 (Group 18) Deepak Gaund
No ratings yet
DMS Annexure 2 (Group 18) Deepak Gaund
12 pages
Oracle E-Business Suite Developer's Guide
No ratings yet
Oracle E-Business Suite Developer's Guide
2 pages
API
No ratings yet
API
13 pages
Azure Data
No ratings yet
Azure Data
6 pages
Igcse Compsci 2ed TR Worksheet 9.2
No ratings yet
Igcse Compsci 2ed TR Worksheet 9.2
4 pages
DBMS Assigment - 4
No ratings yet
DBMS Assigment - 4
10 pages
Relational Databases
No ratings yet
Relational Databases
18 pages
SQL MCQ
No ratings yet
SQL MCQ
5 pages
Keerthana C-3
No ratings yet
Keerthana C-3
1 page
Dbms Lab 7 19IT030
No ratings yet
Dbms Lab 7 19IT030
10 pages
Coding Powerbuilder
No ratings yet
Coding Powerbuilder
6 pages
Daatabase Assignment
No ratings yet
Daatabase Assignment
4 pages
Case Study
No ratings yet
Case Study
3 pages
SQL Tutorial - Commit: Insert Into Values
No ratings yet
SQL Tutorial - Commit: Insert Into Values
4 pages
12 Step Query Tuning Oracle IG
No ratings yet
12 Step Query Tuning Oracle IG
1 page
Agile Scrum Handbook – 3rd edition
From Everand
Agile Scrum Handbook – 3rd edition
Nader K. Rad
No ratings yet
Software Asset Management: What Is It and Why Do I Need It?: A Textbook on the Fundamentals in Software License Compliance, Audit Risks, Optimizing Software License ROI, Business Practices and Life Cycle Management
From Everand
Software Asset Management: What Is It and Why Do I Need It?: A Textbook on the Fundamentals in Software License Compliance, Audit Risks, Optimizing Software License ROI, Business Practices and Life Cycle Management
Carl A. Bolton
No ratings yet
Microsoft Dynamics CRM 2011: Dashboards Cookbook
From Everand
Microsoft Dynamics CRM 2011: Dashboards Cookbook
Mark AuCoin
No ratings yet
SnapLogic Second Edition
From Everand
SnapLogic Second Edition
Gerardus Blokdyk
No ratings yet
system integrator Second Edition
From Everand
system integrator Second Edition
Gerardus Blokdyk
No ratings yet
Cloud API A Clear and Concise Reference
From Everand
Cloud API A Clear and Concise Reference
Gerardus Blokdyk
No ratings yet

Design Key-Value Database + Real Usecase

Uploaded by

Design Key-Value Database + Real Usecase

Uploaded by

Design Key-value database

Nguyen Huu Cam

• Designing structured values

• Design pattern for key-value database

• Avoid meaning less name, such as ‘laklsjfdjjd’

• Use range-based components when would like to retrieve ranges of values

• Keep key as short as possible without sacrifice other characteristics mentioned

• Redis keys can be up to 512MB in length

• Store commonly used attribute values together

• This process continues for each of the additional items.

• Time to Live (TTL)

• Frequent use of emulating tables can indicate a misuse of a key-value database

If it’s small venue

• The best way is to store all information in a single key

• Counter starts at 1 and increases each time a ticket is sold. For

• 20140617: Date sold on June 17th 2014

• If the function is initially called as the following,

• It would set the value of ConcertApp['Springfield Civic Center'] to

• Enumerable keys provide a crude range functionality by allowing a program to

• Handing user session

• No need to store personalization settings in database, it’s specific to frontend

• This reduces total cost of ownership and makes developing well‐performing

You might also like