NoSQL Database DataStax
Databases
Your presenters
@SonicDMG (@david-gilardi)
@clun (@clunven)
@RyanWelford (@ryanwelford)
@hadesarchitect
Housekeeping
Livestream (YouTube, Twitch): youtube.com/DataStaxDevs
Runtime: astra.new/intro-nosql
Achievement Unlocked!
dtsx.io/badges
Repository:
- https://github.com/datastaxdevs/workshop-introduction-to-nosql
Agenda
01 Definitions and objectives of NoSQL
02 Tabular Databases
03 Document Databases
04 Key/values Databases
05 Graph Databases
06 Games, TakeAways
Get Ready = Hands-on #1
Repository:
- https://github.com/datastaxdevs/workshop-introduction-to-nosql
Databases
EXECUTION (parser, analyzer, dispatcher)
OLTP / OLAP
● OnLine Transaction Processing
● OnLine Analytical Processing
Traditional RDBMS
[Figure: a relational DB positioned against the workload axes Streaming (Events), Throughput (Transactions), and CPU Bound (Compute)]
Relational Databases Limits
At some point it does not fit a single machine: you need to scale out.
[Figure: the relational DB surrounded by specialized systems: Distributed Storage on the IO Bound (Capacity, Storage) axis, Streaming Platforms on the Streaming (Events) axis, XTP on the Throughput (Transactions) axis, GPU/Parallel Programming on the CPU Bound (Compute) axis]
3V: Volume, Velocity, Variety
Copes with new requirements in volume (capacity), velocity (throughput), and format (variety).
[Figure: "NOT ONLY SQL DATABASES" at the center of the same axes: IO Bound (Capacity, Storage) with Distributed Storage, Streaming (Events) with Streaming Platforms, Throughput (Transactions) with XTP, CPU Bound (Compute) with GPU/Parallel Programming]
Introduction to the C.A.P Theorem (Eric Brewer)
NoSQL databases are Distributed Systems
Clouds like Distributed Systems
Main NoSQL Databases Types
Key/value Graph
$25/month credit
Launch a database in the cloud with a few clicks, no credit card required.
User Interface: Web based Developer Tools
Tools: Swagger UI, GraphQL Playground
OSS Stargate.io: A data gateway to allow multiple usages
Tabular or Column Type
Model: Tables stored sharded on keys so that data is distributed across nodes
Query
Use Cases
Apache Cassandra™ = NoSQL Distributed Database
1 Installation = 1 NODE
✔ Capacity = ~ 2-4TB
✔ Throughput = LOTS Tx/sec/core
DataCenter | Ring
Communication:
✔ Gossiping
[Figure: NODEs arranged in a ring]
Data is Distributed
Partition Key: Country

Country  City         Population
USA      New York     8.000.000
USA      Los Angeles  4.000.000
FR       Paris        2.230.000
DE       Berlin       3.350.000
FR       Toulouse     1.100.000
DE       Nuremberg    500.000

RF = ?
Replication Factor means the number of nodes used to store each partition.
[Figure: ring of nodes labelled 17, 33, 50, 67, 83]
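The idea that rows sharing a partition key land on the same node can be sketched as follows. This is a minimal illustration, not Cassandra's implementation: it assumes the numbers on the ring figure (0, 17, 33, 50, 67, 83) are node tokens in a 0-99 token space, and it uses MD5 for hashing, whereas Cassandra's default partitioner is Murmur3 over a much larger range. The names `token_for` and `owner_of` are hypothetical.

```python
import hashlib
from bisect import bisect_left

# Assumption: the labels on the ring figure are node tokens in a 0-99 space.
NODE_TOKENS = [0, 17, 33, 50, 67, 83]
TOKEN_SPACE = 100


def token_for(partition_key):
    """Hash a partition key to a token (MD5 is used only for illustration)."""
    digest = hashlib.md5(partition_key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % TOKEN_SPACE


def owner_of(token):
    """Return the first node token >= the data token, wrapping around the ring."""
    index = bisect_left(NODE_TOKENS, token)
    return NODE_TOKENS[index % len(NODE_TOKENS)]


rows = [
    ("USA", "New York", 8_000_000),
    ("USA", "Los Angeles", 4_000_000),
    ("FR", "Paris", 2_230_000),
    ("DE", "Berlin", 3_350_000),
    ("FR", "Toulouse", 1_100_000),
    ("DE", "Nuremberg", 500_000),
]

# Group cities by the node that owns their partition (the Country column).
placement = {}
for country, city, population in rows:
    placement.setdefault(owner_of(token_for(country)), []).append(city)

for node, cities in sorted(placement.items()):
    print(f"node {node}: {cities}")
```

Because the node is chosen from the hash of Country alone, every city of a given country is stored together, which is what makes a query by partition key a single-node read.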
Data is Replicated
RF = 1: Replication Factor 1 means that every partition is stored on 1 node.
RF = 2: Replication Factor 2 means that every partition is stored on 2 nodes.
RF = 3: Replication Factor 3 means that every partition is stored on 3 nodes.
[Figure: the USA partition (New York 8.000.000, Los Angeles 4.000.000) copied onto 1, 2, then 3 nodes of the ring labelled 0, 17, 33, 50, 67, 83]
Replication within the Ring
RF = 3
[Figure: the USA rows (New York 8.000.000, Los Angeles 4.000.000) map to "59 (data)"; with RF = 3, three nodes of the ring labelled 0, 17, 33, 50, 67, 83 each receive a copy of "59 (data)"]
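How a token such as 59 could be mapped to RF replicas can be sketched like this. It is a simplified model under stated assumptions: a single datacenter, the ring labels treated as node tokens, and replicas chosen by walking the ring clockwise from the owning node (similar in spirit to Cassandra's SimpleStrategy). The function name `replicas_for` is hypothetical.

```python
from bisect import bisect_left

# Assumption: the labels on the ring figure are node tokens.
NODE_TOKENS = [0, 17, 33, 50, 67, 83]


def replicas_for(token, replication_factor):
    """Pick the owning node, then the next nodes clockwise, RF nodes in total."""
    if not 1 <= replication_factor <= len(NODE_TOKENS):
        raise ValueError("replication factor must be between 1 and the node count")
    # The owner is the first node whose token is >= the data token (wrapping).
    first = bisect_left(NODE_TOKENS, token) % len(NODE_TOKENS)
    return [
        NODE_TOKENS[(first + step) % len(NODE_TOKENS)]
        for step in range(replication_factor)
    ]


for rf in (1, 2, 3):
    print(f"RF = {rf}: token 59 stored on nodes {replicas_for(59, rf)}")
```

With these assumptions, data at token 59 is owned by node 67, and the additional copies go to the next nodes on the ring, wrapping past the highest token back to 0.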
Node Failure
RF = 3
[Figure: one of the three replica nodes for "59 (data)" is down; a Hint is kept for the missed write]
Node Failure Recovered
[Figure: the node is back; the Hint is delivered and all three replicas hold "59 (data)" again]
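The Hint shown on the failure slides can be illustrated with a small in-memory model. This is a sketch under assumptions, not Cassandra's hinted handoff code: the `Node` and `Coordinator` classes are hypothetical, hints are kept in a plain list, and which replica fails (83 here) is chosen arbitrarily.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    token: int
    up: bool = True
    data: dict = field(default_factory=dict)


class Coordinator:
    """Writes to every replica and keeps a hint for each replica that is down."""

    def __init__(self, replicas):
        self.replicas = replicas
        self.hints = []  # pending (node token, key, value) writes

    def write(self, key, value):
        acknowledged = 0
        for node in self.replicas:
            if node.up:
                node.data[key] = value
                acknowledged += 1
            else:
                # Remember the write so it can be replayed after recovery.
                self.hints.append((node.token, key, value))
        return acknowledged

    def replay_hints(self):
        nodes_by_token = {node.token: node for node in self.replicas}
        still_pending = []
        for token, key, value in self.hints:
            node = nodes_by_token[token]
            if node.up:
                node.data[key] = value
            else:
                still_pending.append((token, key, value))
        self.hints = still_pending


replicas = [Node(67), Node(83), Node(0)]
coordinator = Coordinator(replicas)

replicas[1].up = False                  # node failure
acks = coordinator.write(59, "data")    # two replicas acknowledge, one hint stored
replicas[1].up = True                   # node failure recovered
coordinator.replay_hints()              # the missed write is delivered
```

The write still succeeds on the healthy replicas while one node is down, and the recovered node catches up once its hints are replayed.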
Data Distributed Everywhere
Geographic Distribution Hybrid-Cloud and Multi-Cloud
On-premise
Understanding Use Cases
Scalability
● High Throughput
● High Volume
● Heavy Writes
● Heavy Reads
● Event Streaming
● Internet of Things
● Log Analytics
● Other Time Series
HandsOn #2 Tabular Databases
Repository:
- https://github.com/datastaxdevs/workshop-introduction-to-nosql
Document-Oriented Database
Model: Structured Objects identified by a key
Queries
Use Cases
Document Shredding
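One way to picture document shredding is flattening a nested document into one row per leaf value, so that a tabular store can hold it. The sketch below assumes that meaning of "shredding"; the row layout (document id, path, leaf value) and the `shred` function are illustrative and do not reproduce any particular product's storage schema.

```python
def shred(document_id, value, path=()):
    """Yield one (document_id, path, leaf_value) row per leaf of the document."""
    if isinstance(value, dict):
        for key, child in value.items():
            yield from shred(document_id, child, path + (key,))
    elif isinstance(value, list):
        for index, child in enumerate(value):
            # Array positions become path segments such as "[0]".
            yield from shred(document_id, child, path + (f"[{index}]",))
    else:
        yield (document_id, path, value)


document = {
    "name": "New York",
    "country": "USA",
    "tags": ["city", "coast"],
    "stats": {"population": 8000000},
}

rows = list(shred("doc1", document))
for row in rows:
    print(row)
```

Each row carries the full path to its value, so a single field can be read or updated without rewriting the whole document.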
HandsOn #3 Documents DB
Repository:
- https://github.com/datastaxdevs/workshop-introduction-to-nosql
Key Value Database
Model: Like a distributed HashTable
Queries
Use Cases
● Distributed cache
● Cached user data, user sessions
● Data deduplication
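The use cases above map onto two basic operations: a put with an expiry (cache entries, sessions) and a put-if-absent (deduplication). The sketch below is a single-process stand-in for a key/value database, written only to show the access pattern; the `KeyValueStore` class and its method names are hypothetical, and a real distributed store would also hash each key to a node.

```python
import time

_MISSING = object()


class KeyValueStore:
    """In-memory hash table with optional per-key time to live."""

    def __init__(self, clock=time.monotonic):
        self._items = {}  # key -> (value, expiry time or None)
        self._clock = clock

    def put(self, key, value, ttl_seconds=None):
        expires_at = None if ttl_seconds is None else self._clock() + ttl_seconds
        self._items[key] = (value, expires_at)

    def get(self, key, default=None):
        entry = self._items.get(key)
        if entry is None:
            return default
        value, expires_at = entry
        if expires_at is not None and self._clock() >= expires_at:
            del self._items[key]  # expired entries are dropped lazily on read
            return default
        return value

    def put_if_absent(self, key, value):
        """Store the value only if the key is new; return True when stored."""
        if self.get(key, _MISSING) is not _MISSING:
            return False
        self.put(key, value)
        return True


store = KeyValueStore()
store.put("session:42", {"user": "alice"}, ttl_seconds=1800)  # user session
first_time = store.put_if_absent("event:abc", True)           # new event
duplicate = store.put_if_absent("event:abc", True)            # already seen
```

Every operation addresses exactly one key, which is why this model is simple to spread over many nodes.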
HandsOn #4 Key-Value DB
Repository:
- https://github.com/datastaxdevs/workshop-introduction-to-nosql
Graph Database
Model: Data structured as Vertices and Edges
Queries
Use Cases
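The vertices-and-edges model, and the kind of relationship query it makes natural, can be sketched with a small in-memory structure. This is an illustration only: the `Graph` class, its method names, and the "knows" data are hypothetical and are not the API of any graph database.

```python
from collections import deque


class Graph:
    """Vertices with properties, plus labelled directed edges."""

    def __init__(self):
        self.vertices = {}  # vertex id -> properties
        self.edges = {}     # source id -> list of (label, target id)

    def add_vertex(self, vertex_id, **properties):
        self.vertices[vertex_id] = properties

    def add_edge(self, source, label, target):
        if source not in self.vertices or target not in self.vertices:
            raise KeyError("both vertices must exist before adding an edge")
        self.edges.setdefault(source, []).append((label, target))

    def out(self, vertex_id, label):
        """Vertices reachable over one outgoing edge with the given label."""
        return [t for edge_label, t in self.edges.get(vertex_id, []) if edge_label == label]

    def within_hops(self, start, label, max_hops):
        """Breadth-first traversal: vertices reachable in at most max_hops edges."""
        seen = {start}
        found = []
        frontier = deque([(start, 0)])
        while frontier:
            vertex, depth = frontier.popleft()
            if depth == max_hops:
                continue
            for neighbor in self.out(vertex, label):
                if neighbor not in seen:
                    seen.add(neighbor)
                    found.append(neighbor)
                    frontier.append((neighbor, depth + 1))
        return found


graph = Graph()
for name in ("alice", "bob", "carol", "dave"):
    graph.add_vertex(name, kind="person")
graph.add_edge("alice", "knows", "bob")
graph.add_edge("bob", "knows", "carol")
graph.add_edge("carol", "knows", "dave")

friends_of_friends = graph.within_hops("alice", "knows", 2)
```

A "friends of friends" question is a two-hop traversal here, where a relational model would typically need a self-join per hop.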
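Positioning graphs: Scalability and flexibility
[Figure: chart with Scalability (Low to High) on the vertical axis and Value in Relationships (Low to High) on the horizontal axis, positioning Key Value, Tabular, Document, Relational, and Graph databases]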
HandsOn #5 Graph Databases
Repository:
- https://github.com/datastaxdevs/workshop-introduction-to-nosql
menti.com
Developer Resources
Join community.datastax.com
ASK/SHARE
Ask/answer community user questions - share your expertise
Follow us @DataStaxDevs
CONNECT
We are on YouTube - Twitter - Twitch!
Homework (datastax.com/dev)
Certifications
https://www.datastax.com/dev/certifications
Weekly Workshops https://www.datastax.com/workshops
Join our 10k Discord Community https://dtsx.io/discord
The Fellowship of the RINGS
Thank you!
@hadesarchitect
@clun (@clunven)
@SonicDMG (@david-gilardi)