Dynamo DB Insights

Uploaded by

walia_raman89

DYNAMODB INSIGHTS
Antti Stenvall
P2P/AP Senior product architect
Contents
• What and why
• From SQL design to DynamoDB design – Key differences
• Indices, partitions, data duplication
• Pricing
• Examples:
• Consumption manager with conditional updates
• Relational data (single table design)
What and Why?
• An AWS-managed NoSQL database
• Can be used as a key-value store, but also in (substantially) more complex situations
• Not as flexible as an SQL database: no built-in aggregation, updates and deletes need the full primary key, transaction support is more limited than in SQL, and arbitrary queries are not feasible
• Queries happen in two ways, and only in two ways
1. Two exactly known keys (this is get, which returns at most one item)
2. One exactly known key and part of the other (or nothing); the sort order comes from the other key
• What AWS manages for you is actually a table, not a database, but you can use a single table as a whole database, which makes it awesome
• You can't cheaply know the total number of rows matching a condition (a count still reads every matching item); pagination means browsing an index (backwards or forwards) starting from a given item
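
The pagination described above can be sketched as follows. This is a minimal illustration, not the deck's own code: `iter_partition` is a hypothetical helper, `table` is assumed to behave like a boto3 DynamoDB Table resource, and the key attributes are assumed to be named PK and SK.

```python
from typing import Any, Dict, Iterator, Optional


def iter_partition(table: Any, pk: str, page_size: int = 25,
                   newest_first: bool = False) -> Iterator[Dict[str, Any]]:
    """Yield every item of one partition, fetching one page per request.

    `table` is assumed to be something like
    boto3.resource("dynamodb").Table("my-table") (table name is made up).
    """
    start_key: Optional[Dict[str, Any]] = None
    while True:
        kwargs: Dict[str, Any] = {
            "KeyConditionExpression": "PK = :pk",
            "ExpressionAttributeValues": {":pk": pk},
            "Limit": page_size,
            # False walks the sort key backwards (descending SK order).
            "ScanIndexForward": not newest_first,
        }
        if start_key is not None:
            # Continue right after the last item of the previous page.
            kwargs["ExclusiveStartKey"] = start_key
        page = table.query(**kwargs)
        yield from page.get("Items", [])
        # LastEvaluatedKey is present only when more data may follow.
        start_key = page.get("LastEvaluatedKey")
        if start_key is None:
            return
```

There is no page number and no total count here: the only cursor is the key of the last item seen, which is why "jump to page 7" style navigation does not map well to DynamoDB.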

• Why would I use something that is not as flexible as SQL?

• Fully serverless, zero maintenance, easy to provision, encryption at rest is easy to set up, easy backups
• Super fast, scales practically without limit, global tables for multi-region replication
• Easy to use, but challenging to design, and it is not suitable for all use cases (just as SQL isn't)
• Very cheap (low to zero cost at rest), reliable
• Automatic row expiration (data removal at a given time)
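
As a rough sketch of automatic row expiration: DynamoDB's time-to-live feature reads a Number attribute holding a Unix timestamp in seconds. The attribute name `expires_at` and both helper names below are assumptions for illustration; the attribute also has to be configured as the TTL attribute on the table.

```python
import time
from typing import Any, Dict, Optional


def with_expiry(item: Dict[str, Any], ttl_seconds: int,
                now: Optional[float] = None) -> Dict[str, Any]:
    """Return a copy of `item` carrying an epoch-seconds expiry attribute."""
    current = time.time() if now is None else now
    # "expires_at" is a made-up name; TTL expects whole epoch seconds.
    return {**item, "expires_at": int(current + ttl_seconds)}


def is_live(item: Dict[str, Any], now: Optional[float] = None) -> bool:
    """Expired rows are removed in the background, not instantly,
    so readers may still want to filter them out."""
    current = time.time() if now is None else now
    return "expires_at" not in item or item["expires_at"] > current
```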
From SQL design to DynamoDB design
• Indexing requires substantially more thought
• ALL access patterns are tightly coupled to indices and must be known beforehand
• DynamoDB limitations must be considered from the beginning
• Do not count on SQL-style transactions: a batch write takes at most 25 items and is not atomic, and a transactional write is capped at 100 items per request (25 until 2022), so support for atomic operations is limited
• Aggregation is missing (if needed, consider maintaining aggregated objects yourself)
• Arbitrary querying is not possible (only one-or-two key queries)
• Note! You may scan the table with any conditions, BUT that is like a full table scan over a whole SQL database → use it only for maintenance-related purposes, e.g. when a key schema update is needed
• An update can only target a known primary key (PK+SK)
• DynamoDB special features may be considered
• You may want to react to changes via DynamoDB Streams and decouple logic
• Conditional updates can (should) be used for optimistic locking
• Automatic row removal at a defined time
• Automatic backups
• With Python, the boto3 Table API should be used
• 400 KB row size limit
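
Optimistic locking with a conditional update might look like the sketch below. It assumes each row carries a numeric `version` attribute (an assumption, not something the slides specify); `versioned_update_kwargs` is a hypothetical helper that only builds the arguments for the Table API's `update_item`.

```python
from typing import Any, Dict


def versioned_update_kwargs(pk: str, sk: str, expected_version: int,
                            changes: Dict[str, Any]) -> Dict[str, Any]:
    """Build update_item arguments that succeed only if nobody else
    has bumped the row's version since it was read."""
    names = {"#v": "version"}
    values: Dict[str, Any] = {":expected": expected_version,
                              ":next": expected_version + 1}
    sets = ["#v = :next"]
    # Alias every attribute name so reserved words (e.g. "status") are safe.
    for i, (attr, value) in enumerate(sorted(changes.items())):
        names[f"#a{i}"] = attr
        values[f":a{i}"] = value
        sets.append(f"#a{i} = :a{i}")
    return {
        "Key": {"PK": pk, "SK": sk},
        "UpdateExpression": "SET " + ", ".join(sets),
        "ConditionExpression": "#v = :expected",
        "ExpressionAttributeNames": names,
        "ExpressionAttributeValues": values,
    }
```

With a real table this would be used as `table.update_item(**versioned_update_kwargs(...))`. If another writer got there first, the call fails with a `ConditionalCheckFailedException`, and the caller re-reads the row and retries.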
Indices – Primary index
• Primary key consists of two parts
• Partition key: PK (also known as hash key, not as primary key)
• Sort key: SK (also known as range key)
• The (PK, SK) pair is unique. A put operation overwrites an existing entry with the same pair. The primary index supports get and query operations.
• In a get operation PK and SK must be given exactly
• In a query operation PK must be given exactly; SK can be constrained with e.g. begins_with or between (or left out entirely)
• Always name your index columns PK and SK and fix their type to string
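
The two operations can be sketched as argument builders for the boto3 Table API. `get_kwargs` and `query_kwargs` are hypothetical helper names; the expression strings and parameter names are the ones `get_item` and `query` accept.

```python
from typing import Any, Dict, Optional, Tuple


def get_kwargs(pk: str, sk: str) -> Dict[str, Any]:
    """get: both keys exactly known, at most one item comes back."""
    return {"Key": {"PK": pk, "SK": sk}}


def query_kwargs(pk: str, sk_prefix: Optional[str] = None,
                 sk_between: Optional[Tuple[str, str]] = None) -> Dict[str, Any]:
    """query: PK exact, SK optional (prefix or range)."""
    condition = "PK = :pk"
    values: Dict[str, Any] = {":pk": pk}
    # A key condition allows only one constraint on the sort key.
    if sk_prefix is not None:
        condition += " AND begins_with(SK, :prefix)"
        values[":prefix"] = sk_prefix
    elif sk_between is not None:
        condition += " AND SK BETWEEN :low AND :high"
        values[":low"], values[":high"] = sk_between
    return {"KeyConditionExpression": condition,
            "ExpressionAttributeValues": values}
```

Typical use would be `table.get_item(**get_kwargs(pk, sk)).get("Item")` and `table.query(**query_kwargs(pk, sk_prefix="IL#"))["Items"]`, where `table` is a boto3 Table resource.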
What do PK (partition key) and SK (sort key) do?
• PK partitions the data and dictates the performance
• SK sorts the data within a partition; queries like ends_with or contains are therefore not supported, because a sorted index cannot answer them efficiently
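
A small in-memory analogy (plain Python, not DynamoDB code) shows why a sorted key supports prefix queries but not suffix queries: a prefix maps to one contiguous slice found by binary search, while a suffix match has to look at every key.

```python
import bisect
from typing import List


def begins_with(sorted_keys: List[str], prefix: str) -> List[str]:
    """Prefix match on sorted keys: two binary searches, one slice."""
    lo = bisect.bisect_left(sorted_keys, prefix)
    # Upper bound: the prefix followed by the largest code point
    # (assumes no key contains U+10FFFF itself).
    hi = bisect.bisect_left(sorted_keys, prefix + "\U0010ffff")
    return sorted_keys[lo:hi]


def ends_with(sorted_keys: List[str], suffix: str) -> List[str]:
    """Suffix match: sorting does not help, every key is inspected."""
    return [key for key in sorted_keys if key.endswith(suffix)]


keys = sorted(["IL#1", "IL#2", "CR#1", "INVOICE#9"])
print(begins_with(keys, "IL#"))  # ['IL#1', 'IL#2']
print(ends_with(keys, "#1"))     # ['CR#1', 'IL#1']
```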

You can't query how many rows there are! You can only browse the index in the order defined by SK.

In addition to keys, rows also contain attributes. These are like columns in a relational database table,
but they are fully schema-free: in one row col_a can be a number, in another a nested JSON object.
Indices – Secondary index
• Two kinds of secondary indices: local and global
• Local (don't use: it complicates your design for no benefit)
• Only the sort key differs (the partition key is the same as the table's) → local because it's in the same partition
• Strongly consistent reads
• With the same PK, up to 10 GB of data max
• Global
• Use index names GSI1, GSI2, ... and column names GSI1PK (partition) and GSI1SK (sort)
• Different items can have the same values for the pair (GSI1PK, GSI1SK) → no get operation on the index, only query
• Eventually consistent reads
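
Querying a global secondary index differs from querying the table only in the `IndexName` argument and the key attribute names. A minimal sketch using the naming convention above (`gsi_query_kwargs` is a hypothetical helper):

```python
from typing import Any, Dict, Optional


def gsi_query_kwargs(index_no: int, pk: str,
                     sk_prefix: Optional[str] = None) -> Dict[str, Any]:
    """Build Table.query arguments for GSI<n> with keys GSI<n>PK / GSI<n>SK."""
    pk_attr, sk_attr = f"GSI{index_no}PK", f"GSI{index_no}SK"
    condition = f"{pk_attr} = :pk"
    values: Dict[str, Any] = {":pk": pk}
    if sk_prefix is not None:
        condition += f" AND begins_with({sk_attr}, :prefix)"
        values[":prefix"] = sk_prefix
    # No ConsistentRead here: global indices only offer eventually
    # consistent reads.
    return {"IndexName": f"GSI{index_no}",
            "KeyConditionExpression": condition,
            "ExpressionAttributeValues": values}
```

Because several items may share the same index key pair, the result is always a list of items, never a single row.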
Indices – Partitions and data duplication
• PK (or GSI PK) defines the partition where the data is physically located → speed
• GSIs duplicate data (attributes); you can define which attributes are copied, or copy all (use all: disk space is cheap and it keeps the design easier)
• Always design for the minimum number of secondary indices; write down and revisit your access patterns
• Some rows may not be relevant for GSI3; if GSI3PK or GSI3SK is missing from a row, that row is not duplicated into the partition defined by GSI3PK
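
The "sparse index" behaviour in the last bullet can be illustrated in memory. Both helpers are hypothetical; the point they make is that index key attributes should be left out of the item entirely rather than stored as empty strings, since DynamoDB does not accept empty strings as key values.

```python
from typing import Any, Dict, Iterable, List


def drop_empty_index_keys(
        item: Dict[str, Any],
        indexes: Iterable[str] = ("GSI1", "GSI2", "GSI3", "GSI4"),
) -> Dict[str, Any]:
    """Remove both key attributes of any index whose pair is incomplete."""
    cleaned = dict(item)
    for index in indexes:
        pk_attr, sk_attr = f"{index}PK", f"{index}SK"
        if not cleaned.get(pk_attr) or not cleaned.get(sk_attr):
            cleaned.pop(pk_attr, None)
            cleaned.pop(sk_attr, None)
    return cleaned


def visible_in_index(items: List[Dict[str, Any]], index: str) -> List[Dict[str, Any]]:
    """Mimic which rows an index with a PK+SK key schema would contain."""
    return [i for i in items if f"{index}PK" in i and f"{index}SK" in i]
```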
Pricing
• Read request: up to 4 KB consumes one unit (strongly consistent operation), or half a unit (eventually consistent operation)
• Write request: up to 1 KB consumes one unit
• Two capacity modes: on-demand and provisioned (use on-demand)
• On-demand: pay for what you use (read $0.25 / M units, write $1.25 / M units; check the current prices for your region)
• Provisioned: provision capacity, billed hourly
• Data storage: first 25 GB free, then $0.25 / GB per month
• Plus other costs for other features: https://aws.amazon.com/dynamodb/pricing/
• When designing or working with AWS, it is always important to understand the costs
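
The unit arithmetic above can be worked through in a few lines. The prices are the ones quoted on this slide and are used only as default arguments; the function names are made up for the example.

```python
import math


def read_units(item_bytes: int, strongly_consistent: bool = True) -> float:
    """One unit per started 4 KB; eventually consistent reads cost half."""
    units = max(1, math.ceil(item_bytes / 4096))
    return units if strongly_consistent else units / 2


def write_units(item_bytes: int) -> int:
    """One unit per started 1 KB."""
    return max(1, math.ceil(item_bytes / 1024))


def on_demand_cost(reads: float, writes: float,
                   read_price_per_million: float = 0.25,
                   write_price_per_million: float = 1.25) -> float:
    """Request cost in dollars for the given unit counts (slide prices)."""
    return (reads / 1_000_000 * read_price_per_million
            + writes / 1_000_000 * write_price_per_million)
```

For example, a 5000-byte item costs 2 units to read with strong consistency, 1 unit with eventual consistency, and 5 units to write, so item size matters far more on the write side.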
Example: Consumption manager
• Use case: we want to limit the usage of a service
• Service A can be, for example, an "infinitely scalable" Lambda and Service B a managed AWS service that has a quota; we want Service A to consume only part of that quota
• What can possibly go wrong here?
Consumption manager – Database design
• One database, two different kinds of items: Consumption, Queued job
• Access patterns
• Give me current consumption
• Update current consumption
• Add job to queue
• Get next job in queue
• Remove job from queue

TYPE         PK             SK                               Attributes
CONSUMPTION  CONSUMPTION    CONSUMPTION#SERVICE-A            consumption, service_name, max_capacity?
JOB          JOB#SERVICE-A  QUEUED-AT#1659943176149#c474e63  id, service, queued_at, payload
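
The three queue access patterns could be served by the JOB key layout roughly as follows. This is a sketch under assumptions: the helper names are invented, the job id format is arbitrary, and each function only builds the item or the arguments for `put_item`, `query`, and `delete_item` on a boto3 Table resource.

```python
import time
import uuid
from typing import Any, Dict, Optional


def job_item(service: str, payload: Any, queued_at_ms: Optional[int] = None,
             job_id: Optional[str] = None) -> Dict[str, Any]:
    """Add job to queue: the item to pass to put_item(Item=...)."""
    queued_at_ms = int(time.time() * 1000) if queued_at_ms is None else queued_at_ms
    job_id = job_id or uuid.uuid4().hex[:7]
    return {
        "TYPE": "JOB",
        "PK": f"JOB#{service}",
        # Millisecond timestamps sort correctly as strings while they
        # all have the same number of digits.
        "SK": f"QUEUED-AT#{queued_at_ms}#{job_id}",
        "id": job_id, "service": service,
        "queued_at": queued_at_ms, "payload": payload,
    }


def next_job_kwargs(service: str) -> Dict[str, Any]:
    """Get next job: the oldest item in the partition (ascending SK)."""
    return {
        "KeyConditionExpression": "PK = :pk AND begins_with(SK, :prefix)",
        "ExpressionAttributeValues": {":pk": f"JOB#{service}",
                                      ":prefix": "QUEUED-AT#"},
        "Limit": 1,
        "ScanIndexForward": True,
    }


def remove_job_kwargs(job: Dict[str, Any]) -> Dict[str, Any]:
    """Remove job: conditional, so only one worker can claim it."""
    return {"Key": {"PK": job["PK"], "SK": job["SK"]},
            "ConditionExpression": "attribute_exists(PK)"}
```

The condition on the delete matters when several workers poll at once: two of them may read the same job, but only the first delete succeeds.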
Updating consumption table
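
A conditional update of the consumption row might be sketched like this. Assumptions: the row already exists with `consumption` set to a number, the caller knows `max_capacity`, and `table` is a boto3 Table resource. Condition expressions cannot do arithmetic, so the allowed limit is computed on the client.

```python
from typing import Any, Dict


def reserve_kwargs(service: str, amount: int, max_capacity: int) -> Dict[str, Any]:
    """update_item arguments: add `amount` only if it still fits the quota."""
    return {
        "Key": {"PK": "CONSUMPTION", "SK": f"CONSUMPTION#{service}"},
        "UpdateExpression": "ADD #c :amount",
        # Succeeds only if consumption + amount <= max_capacity.
        "ConditionExpression": "#c <= :limit",
        "ExpressionAttributeNames": {"#c": "consumption"},
        "ExpressionAttributeValues": {":amount": amount,
                                      ":limit": max_capacity - amount},
    }


def try_reserve(table: Any, service: str, amount: int, max_capacity: int) -> bool:
    """Return True if capacity was reserved, False if the quota is full."""
    try:
        table.update_item(**reserve_kwargs(service, amount, max_capacity))
    except table.meta.client.exceptions.ConditionalCheckFailedException:
        return False
    return True
```

The check and the increment happen in one request, so two concurrent Lambdas cannot both pass the check and overshoot the quota, which is what goes wrong with a separate read followed by a write.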
Example: Relational data in DynamoDB
• Simplified view of invoice, order and matching data
• Design a schema for this in DynamoDB (exclude user-initiated search cases)
DynamoDB design
Key columns: TYPE, PK, SK, GSI1PK, GSI1SK, GSI2PK, GSI2SK, GSI3PK, GSI3SK, GSI4PK, GSI4SK

TYPE              PK                    SK            Index key pairs (PK, SK), in column order
INVOICE           INVOICE#{id}          INVOICE#{id}
INVOICE_LINE      INVOICE#{invoice_id}  IL#{id}       (INVOICE#{invoice_id}, MATCHING_STATUS#{status})
CODING_ROW        INVOICE#{invoice_id}  CR#{id}       (INVOICE#{invoice_id}, MATCHING_STATUS#{status})
ORDER             ORDER#{id}            ORDER#{id}    (ORDER_NUMBER#{order_number}, DUMMY)
ORDER_ROW         ORDER#{order_id}      OR#{id}       (ORDER#{order_id}, MATCHING_STATUS#{status})
GOODS_RECEIPT     ORDER#{order_id}      GR#{id}       (ORDER#{order_id}, MATCHING_STATUS#{status}), (OR#{or_id}, GR)
GR_MATCHING_DATA  ORDER#{order_id}      GRMD#{id}     (OR#{or_id}, GRMDATA), (GR#{gr_id}, GRMDATA), (INVOICE#{inv_id}, IL#{il-id}), (CR#{cr_id}, CR)

• Always include TYPE; ids are in SKs; remember that the (PK, SK) pair is unique
• Add all attributes appearing in keys as individual attributes as well
• Use a GSI only when needed
• All invoice data can be fetched with a single query (for a given id)
• All invoice / order data with a given matching status can be fetched with a single query (for a given header id + matching status)
• All order data can be fetched with a single query (for a given id)
• Not possible to fetch, for a given GR, e.g. all the invoices' header data (which would be trivial in SQL)
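
One way to keep such a key schema in a single place is a small model class per item type. The sketch below covers only INVOICE_LINE; the class layout is an illustration, and placing the matching-status pair in GSI1 is an assumption made for the example.

```python
from dataclasses import dataclass
from typing import Any, Dict


@dataclass
class InvoiceLine:
    id: str
    invoice_id: str
    status: str

    def to_item(self) -> Dict[str, Any]:
        """Full row: keys plus the same values as plain attributes."""
        return {
            "TYPE": "INVOICE_LINE",
            "PK": f"INVOICE#{self.invoice_id}",
            "SK": f"IL#{self.id}",
            # Assumption: the matching-status pair lives in GSI1.
            "GSI1PK": f"INVOICE#{self.invoice_id}",
            "GSI1SK": f"MATCHING_STATUS#{self.status}",
            "id": self.id,
            "invoice_id": self.invoice_id,
            "status": self.status,
        }


def invoice_data_query(invoice_id: str) -> Dict[str, Any]:
    """All rows of one invoice (header, lines, coding rows) in one query."""
    return {"KeyConditionExpression": "PK = :pk",
            "ExpressionAttributeValues": {":pk": f"INVOICE#{invoice_id}"}}


def invoice_rows_by_status_query(invoice_id: str, status: str) -> Dict[str, Any]:
    """Rows of one invoice with a given matching status, via the index."""
    return {"IndexName": "GSI1",
            "KeyConditionExpression": "GSI1PK = :pk AND GSI1SK = :sk",
            "ExpressionAttributeValues": {
                ":pk": f"INVOICE#{invoice_id}",
                ":sk": f"MATCHING_STATUS#{status}"}}
```

Repository functions then call `InvoiceLine(...).to_item()` and the query builders, and never format a key string themselves.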
How to document object?
[Figure: an annotated object definition showing the key schema of the table, the key definitions of the object (SK optional for querying), and the attributes]
Key takeaways
• Plan your access patterns, revisit them continuously
• Querying is done with a key pair; there is no searching (forget scanning)
• There are no joins
• Transactions are simple or absent; use conditional updates and optimistic locking
• Aggregation is not built in; use DynamoDB Streams or aggregate on create/update/delete (CUD)
• Name one attribute TYPE (it helps in debugging and development)
• Use the convention PK, SK, GSI1PK, GSI1SK, GSI2PK, ... for naming keys
• Don't take DynamoDB for granted, but consider it as an alternative to other databases
• Sometimes more than one table is better (e.g. a lot of CUD, but only little use for streams)
• Centralize your key schema in models; don't let it leak into repository functions
• Plan your access patterns, revisit them continuously
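
As an illustration of aggregating through a stream, the sketch below turns one batch of DynamoDB Streams records into per-TYPE row count changes. It assumes the stream is configured to include new and old images; `count_deltas` is a hypothetical function that a Lambda handler could call before writing the totals to an aggregate row.

```python
from collections import Counter
from typing import Any, Dict


def count_deltas(event: Dict[str, Any]) -> Counter:
    """Map TYPE -> change in row count for one batch of stream records."""
    deltas: Counter = Counter()
    for record in event.get("Records", []):
        change = record.get("dynamodb", {})
        if record.get("eventName") == "INSERT":
            image, step = change.get("NewImage", {}), 1
        elif record.get("eventName") == "REMOVE":
            image, step = change.get("OldImage", {}), -1
        else:
            # MODIFY does not change the number of rows.
            continue
        # Stream images use DynamoDB's typed JSON, e.g. {"S": "INVOICE"}.
        item_type = image.get("TYPE", {}).get("S")
        if item_type:
            deltas[item_type] += step
    return deltas
```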
Thank you!
