Intro To Cassandra For Developers

Uploaded by

Adithya ghost

Intro to Cassandra for Developers

Housekeeping
● Courses: youtube.com/DataStaxDevs
● Runtime: dtsx.io/workshop
● Questions: bit.ly/cassandra-workshop
● Quiz: menti.com
● Streams and chat: YouTube, Twitch, Discord
Achievement Unlocked! - “Introduction to Cassandra”
Homework
==
DataStax Astra
Fully managed Cassandra, without the ops!

● Global Scale: Put your data where you need it without compromising performance, availability, or accessibility.
● No Operations: Eliminate the overhead to install, operate, and scale Cassandra.
● 25 Gig Free Tier: Launch a database in the cloud with a few clicks, no credit card required.
menti.com
Apache Cassandra™ = NoSQL Distributed Database

1 Installation = 1 NODE
✔ Capacity = ~2-4 TB
✔ Throughput = LOTS Tx/sec/core

DataCenter | Ring
[Diagram: NODEs arranged in a ring]

Communication:
✔ Gossiping
Apache Cassandra™ = NoSQL Distributed Database

- Big Data Ready

- Highest Availability
- Geographical Distribution
- Read/Write Performance
- Vendor Independent
Data is Distributed
Country  City         Population
USA      New York     8.000.000
USA      Los Angeles  4.000.000
FR       Paris        2.230.000
DE       Berlin       3.350.000
UK       London       9.200.000
AU       Sydney       4.900.000
DE       Nuremberg    500.000
CA       Toronto      6.200.000
CA       Montreal     4.200.000
FR       Toulouse     1.100.000
JP       Tokyo        37.430.000
IN       Mumbai       20.200.000

Partition Key: Country
Data is Distributed
Grouped by partition key (Country):

Country  City         Population
USA      New York     8.000.000
USA      Los Angeles  4.000.000

FR       Paris        2.230.000
FR       Toulouse     1.100.000

DE       Berlin       3.350.000
DE       Nuremberg    500.000

UK       London       9.200.000

JP       Tokyo        37.430.000

AU       Sydney       4.900.000

CA       Toronto      6.200.000
CA       Montreal     4.200.000

IN       Mumbai       20.200.000
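
The grouping by Country can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the four node names are hypothetical, MD5 is a stand-in hash (Cassandra's default partitioner is Murmur3-based), and a real cluster maps token ranges to nodes rather than taking a modulo.

```python
import hashlib

# Hypothetical four-node cluster; real clusters assign token ranges to nodes.
NODES = ["node-a", "node-b", "node-c", "node-d"]

def node_for(partition_key):
    """Hash the partition key and map the hash onto one of the nodes."""
    digest = hashlib.md5(partition_key.encode("utf-8")).digest()
    token = int.from_bytes(digest[:8], "big")
    return NODES[token % len(NODES)]

cities = [("USA", "New York"), ("USA", "Los Angeles"),
          ("FR", "Paris"), ("FR", "Toulouse")]

# Rows that share a partition key always land on the same node.
placement = {}
for country, city in cities:
    placement.setdefault(node_for(country), []).append(city)
print(placement)
```

Because only the partition key is hashed, New York and Los Angeles always end up together, whichever node that turns out to be.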
Data is Replicated

RF = 3
Replication Factor 3 means that every row is stored on 3 different nodes.

Replication within the Ring

[Ring diagram, three animation steps: nodes at tokens 0, 17, 33, 50, 67, 83 with RF = 3; a row with token 59 (data) arrives at the ring and ends up stored on three nodes]
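
A minimal sketch of how the three replicas for token 59 could be chosen on this ring. It assumes the simplest placement rule (the first node whose token is at or after the row token, then the next nodes clockwise) and ignores racks, data centers and virtual nodes; the function name `replicas` is hypothetical.

```python
from bisect import bisect_left

# Node tokens from the ring diagram (a simplified 0-100 token space).
RING = [0, 17, 33, 50, 67, 83]

def replicas(token, rf=3):
    """Return the tokens of the rf nodes that would store a row with this token."""
    # First replica: the first node whose token is >= the row token, wrapping around.
    start = bisect_left(RING, token) % len(RING)
    # Remaining replicas: the next nodes clockwise on the ring.
    return [RING[(start + i) % len(RING)] for i in range(rf)]

print(replicas(59))  # [67, 83, 0]
```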
Node Failure

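[Ring diagram: RF = 3, nodes at tokens 0, 17, 33, 50, 67, 83; one replica of 59 (data) is unavailable, so a Hint is kept for it]
Node Failure Recovered

[Ring diagram: the node is back; the Hint is delivered and all three replicas hold 59 (data)]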
Immediate Consistency – A Better Way

● Client 1: Write, CL = QUORUM
● Client 2: Read, CL = QUORUM
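
The arithmetic behind writing and reading at QUORUM can be sketched as follows. QUORUM means a majority of the replicas, and a read is guaranteed to see the latest write when the replicas acknowledging the write plus the replicas answering the read exceed the replication factor. The helper names are hypothetical.

```python
def quorum(rf):
    """Number of replicas that must respond at CL = QUORUM (a majority)."""
    return rf // 2 + 1

def overlaps(rf, write_acks, read_acks):
    """True when every read set shares at least one replica with every write set."""
    return write_acks + read_acks > rf

rf = 3
q = quorum(rf)
print(q)                   # 2
print(overlaps(rf, q, q))  # True: 2 + 2 > 3
print(overlaps(rf, 1, 1))  # False: one replica each may miss the latest write
```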
Data Distributed Everywhere

• Geographic Distribution
• Hybrid-Cloud and Multi-Cloud
• On-premise
Understanding Use Cases
● Scalability (High Throughput, High Volume): Heavy Writes, Heavy Reads; Event Streaming, Log Analytics, Internet of Things, Other Time Series
● Availability (No Data Loss, Always-on): Mission-Critical; Caching, Pricing, Market Data, Inventory
● Distributed (Global Presence, Workload Mobility): Compliance / GDPR; Banking, Retail, Tracking / Logistics, Customer Experience
● Cloud-native (Modern Cloud Applications): API Layer, Enterprise Data Layer; Hybrid-cloud, Multi-cloud
https://github.com/DataStax-Academy/Intro-to-Cassandra-for-Developers
Intro to Cassandra for Developers

1. Tables, Partitions

2. The Art of Data Modelling

3. What’s NEXT?
1. Tables, Partitions
Data Structure: a Cell

An intersection of a row and a column, stores data.
Data Structure: a Row

A single, structured data item in a table.
Data Structure: a Partition

A group of rows having the same partition token, a base unit of access in Cassandra.

IMPORTANT: stored together, all the rows are guaranteed to be neighbors.

ID   First Name  Last Name  Department
1    John        Doe        Wizardry
399  Marisha     Chapez     Wizardry
415  Maximus     Flavius    Wizardry
Data Structure: a Table

A group of columns and rows storing partitions.

ID  First Name  Last Name  Department
1   John        Doe        Wizardry
2   Mary        Smith      Dark Magic
3   Patrick     McFadin    DevRel

Data Structure: Overall
● Tabular data model, with one twist
● Tables are organized in rows and columns
● Groups of related rows called partitions are stored together on the same node (or nodes)
● Each row contains a partition key
  ○ One or more columns that are hashed to determine which node(s) store that data

[Diagram: a Keyspace contains a Table made of columns and rows; the rows are grouped into partitions x, y, z by their Partition key]
Example Data: Users organized by city

Keyspace: killrvideo
Table: users_by_city

City     Last Name  First Name  Address         Email
Phoenix  Hellson    Kevin       23 Jackson St.  [email protected]
Phoenix  Lastfall   Norda       3 Stone St      [email protected]
Phoenix  Smith      Jana        3 Stone St      [email protected]
Seattle  Franklin   George      2 Star St       [email protected]
Seattle  Jackson    Jane        2 Star St       [email protected]
Seattle  Jasons     Judy        2 Star St       [email protected]

Each City value is one partition; each line is one row.
Columns, left to right: partition key column, clustering columns, data columns.

Creating a Table in CQL

-- keyspace: killrvideo, table: users_by_city
CREATE TABLE killrvideo.users_by_city (
    -- column definitions
    city text,
    last_name text,
    first_name text,
    address text,
    email text,
    -- primary key: partition key (city), clustering columns last_name, first_name, email
    PRIMARY KEY ((city), last_name, first_name, email));

Primary Key

An identifier for a row. Consists of at least one Partition Key and zero or more Clustering Columns.

MUST ENSURE UNIQUENESS.
MAY DEFINE SORTING.

In users_by_city: PRIMARY KEY ((city), last_name, first_name, email)
Partition key: (city). Clustering columns: last_name, first_name, email.

Good Examples:

PRIMARY KEY ((city), last_name, first_name, email);

PRIMARY KEY (user_id);

Bad Example:
PRIMARY KEY ((city), last_name, first_name);
Partition Key

An identifier for a partition. Consists of at least one column, may have more if needed.

PARTITIONS ROWS.

In users_by_city: PRIMARY KEY ((city), last_name, first_name, email)
Partition key: (city). Clustering columns: last_name, first_name, email.

Good Examples:

PRIMARY KEY (user_id);

PRIMARY KEY ((video_id), comment_id);

Bad Example:
PRIMARY KEY ((sensor_id), logged_at);
Clustering Column(s)

Used to ensure uniqueness and sorting order. Optional.

In users_by_city: PRIMARY KEY ((city), last_name, first_name, email)
Partition key: (city). Clustering columns: last_name, first_name, email.

PRIMARY KEY ((city), last_name, first_name);          -- Not Unique

PRIMARY KEY ((city), last_name, first_name, email);

PRIMARY KEY ((video_id), comment_id);                 -- Not Sorted

PRIMARY KEY ((video_id), created_at, comment_id);
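
Why the shorter key is "Not Unique" can be illustrated with a small in-memory model. It treats one partition as a dictionary keyed by the clustering columns and assumes that writing a row with an existing primary key overwrites it, which is how CQL inserts behave. The e-mail addresses and the second street address are made-up placeholders.

```python
def write(partition, clustering_key, row):
    """Model a CQL INSERT: a row with the same clustering key is overwritten."""
    partition[clustering_key] = row

def read(partition):
    """Rows come back ordered by clustering key."""
    return [partition[key] for key in sorted(partition)]

# PRIMARY KEY ((city), last_name, first_name): two different people collide.
not_unique = {}
write(not_unique, ("Smith", "Jana"), {"email": "jana.one@example.com"})
write(not_unique, ("Smith", "Jana"), {"email": "jana.two@example.com"})
print(len(not_unique))  # 1

# PRIMARY KEY ((city), last_name, first_name, email): all rows survive.
unique = {}
write(unique, ("Smith", "Jana", "jana.one@example.com"), {"address": "3 Stone St"})
write(unique, ("Smith", "Jana", "jana.two@example.com"), {"address": "9 Hill Rd"})
write(unique, ("Hellson", "Kevin", "kevin@example.com"), {"address": "23 Jackson St."})
print(len(unique))                 # 3
print(read(unique)[0]["address"])  # 23 Jackson St.
```

The last line also shows the sorting role of clustering columns: rows are returned in clustering-key order, so Hellson comes before Smith.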

The Slide of the Year Award!
Rules of a Good Partition
● Store together what you retrieve together
● Avoid big partitions
● Avoid hot partitions

Example: open a video? Get the comments in a single query!

PRIMARY KEY ((video_id), created_at, comment_id);

PRIMARY KEY ((comment_id), created_at);

The Slide of the Year Award!
Rules of a Good Partition
● Store together what you retrieve together
● Avoid big partitions
● Avoid hot partitions

PRIMARY KEY ((video_id), created_at, comment_id);

PRIMARY KEY ((country), user_id);

● Up to 2 billion cells per partition

● Up to ~100k rows in a partition
● Up to ~100MB in a Partition
The Slide of the Year Award!
Rules of a Good Partition
● Store together what you retrieve together
● Avoid big and constantly growing partitions!
● Avoid hot partitions

Example: a huge IoT infrastructure, hardware all over the world, different sensors reporting their state every 10 seconds. Every sensor reports its UUID, timestamp of the report, sensor’s value.

● Sensor ID: UUID
● Timestamp: Timestamp
● Value: float

PRIMARY KEY ((sensor_id), reported_at);
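
A quick back-of-the-envelope check of how fast such a partition grows, assuming exactly one report every 10 seconds and comparing against the "~100k rows in a partition" rule of thumb from the earlier slide. The function and constant names are hypothetical.

```python
REPORT_INTERVAL_SECONDS = 10  # from the example: one report every 10 seconds
ROW_GUIDELINE = 100_000       # the "~100k rows in a partition" rule of thumb

def rows_per_partition(days):
    """Rows one sensor writes into its single partition over the given number of days."""
    return days * 24 * 60 * 60 // REPORT_INTERVAL_SECONDS

for label, days in [("1 day", 1), ("30 days", 30), ("365 days", 365)]:
    rows = rows_per_partition(days)
    status = "over guideline" if rows > ROW_GUIDELINE else "ok"
    print(label, rows, status)
```

With the sensor ID alone as the partition key, the partition never stops growing, which is the problem the slide points at.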

The Slide of the Year Award!
Rules of a Good Partition
● Store together what you retrieve together
● Avoid big and constantly growing partitions! → BUCKETING
● Avoid hot partitions

Example: a huge IoT infrastructure, hardware all over the world, different sensors reporting their state every 10 seconds. Every sensor reports its UUID, timestamp of the report, sensor’s value.

● Sensor ID: UUID
● MonthYear: Integer or String
● Timestamp: Timestamp
● Value: float

PRIMARY KEY ((sensor_id), reported_at);

PRIMARY KEY ((sensor_id, month_year), reported_at);
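
A minimal sketch of how an application might derive the bucket column before writing. It assumes month_year is stored as an integer such as 202405 (the slide allows Integer or String); the helper names and the sensor ID placeholder are hypothetical.

```python
from datetime import datetime, timezone

def month_year(reported_at):
    """Derive the bucket value, here as an integer such as 202405."""
    return reported_at.year * 100 + reported_at.month

def partition_key(sensor_id, reported_at):
    """Composite partition key (sensor_id, month_year) for one reading."""
    return (sensor_id, month_year(reported_at))

sensor = "sensor-1"  # placeholder; the slide uses a UUID
may = datetime(2024, 5, 31, 23, 59, 50, tzinfo=timezone.utc)
june = datetime(2024, 6, 1, 0, 0, 0, tzinfo=timezone.utc)

print(partition_key(sensor, may))   # ('sensor-1', 202405)
print(partition_key(sensor, june))  # ('sensor-1', 202406)
```

Readings taken ten seconds apart across a month boundary fall into different partitions, so each partition stops growing once its month is over. The trade-off is that a query has to supply both sensor_id and month_year.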

The Slide of the Year Award!
Rules of a Good Partition
● Store together what you retrieve together
● Avoid big partitions
● Avoid hot partitions

PRIMARY KEY (user_id);

PRIMARY KEY ((video_id), created_at, comment_id);

PRIMARY KEY ((country), user_id);

https://github.com/DataStax-Academy/Intro-to-Cassandra-for-Developers#2-create-a-table
2. The Art of Data Modelling
Normalization
“Database normalization is the process of structuring a relational database in accordance with a series of so-called normal forms in order to reduce data redundancy and improve data integrity. It was first proposed by Edgar F. Codd as part of his relational model.”

Employees

userId  deptId  firstName  lastName
1       1       Edgar      Codd
2       1       Raymond    Boyce

Departments

departmentId  department
1             Engineering
2             Math

PROS: Simple write, Data Integrity
CONS: Slow read, Complex Queries
Denormalization
“Denormalization is a strategy used on a database to increase performance. In computing, denormalization is the process of trying to improve the read performance of a database, at the expense of losing some write performance, by adding redundant copies of data”

Employees

userId  firstName  lastName  department
1       Edgar      Codd      Engineering
2       Raymond    Boyce     Engineering
3       Sage       Lahja     Math
4       Juniper    Jones     Botany

PROS: Quick Read, Simple Queries
CONS: Multiple Writes, Manual Integrity
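
The trade-off between the two layouts can be sketched with plain dictionaries standing in for tables. This is an illustration only: the data comes from the slides, while the function names and the new department name "R&D" are hypothetical.

```python
# Normalized: two tables, the department name is stored once.
employees = {
    1: {"firstName": "Edgar", "lastName": "Codd", "deptId": 1},
    2: {"firstName": "Raymond", "lastName": "Boyce", "deptId": 1},
}
departments = {1: "Engineering", 2: "Math"}

def read_normalized(user_id):
    """Two lookups (a join) to assemble one employee."""
    emp = employees[user_id]
    return emp["firstName"], emp["lastName"], departments[emp["deptId"]]

# Denormalized: the department name is copied into every employee row.
employees_denorm = {
    1: {"firstName": "Edgar", "lastName": "Codd", "department": "Engineering"},
    2: {"firstName": "Raymond", "lastName": "Boyce", "department": "Engineering"},
}

def read_denormalized(user_id):
    """One lookup, no join."""
    emp = employees_denorm[user_id]
    return emp["firstName"], emp["lastName"], emp["department"]

def rename_department(old, new):
    """Multiple writes: every redundant copy has to be updated by the application."""
    touched = 0
    for emp in employees_denorm.values():
        if emp["department"] == old:
            emp["department"] = new
            touched += 1
    return touched

print(read_normalized(1) == read_denormalized(1))  # True
print(rename_department("Engineering", "R&D"))     # 2
```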
Relational Data Modelling
1. Analyze raw data
2. Identify entities, their properties and relations
3. Design tables, using normalization and foreign keys.
4. Use JOIN when doing queries to join normalized data from multiple tables

Flow: Data → Models → Application
NoSQL Data Modelling
1. Analyze user behaviour (customer first!)
2. Identify workflows, their dependencies and needs
3. Define Queries to fulfill these workflows
4. Knowing the queries, design tables, using denormalization.
5. Use BATCH when inserting or updating denormalized data of multiple tables

Flow: Application → Models → Data
Designing Process: Step by Step
Entities & Relationships

Queries
Designing Process: Conceptual Data Model
Designing Process: Application Workflow

Use-Case I:
● A User opens a Profile

WF2: Find comments related to target user using its identifier, get most recent first

Use-Case II:
● A User opens a Video Page

WF1: Find comments related to target video using its identifier, most recent first
Designing Process: Mapping

Query I: Find comments posted for a user with a known id (show most recent first) → comments_by_user

Query II: Find comments for a video with a known id (show most recent first) → comments_by_video
Designing Process: Mapping

comments_by_user:
SELECT * FROM comments_by_user
WHERE userid = <some UUID>

comments_by_video:
SELECT * FROM comments_by_video
WHERE videoid = <some UUID>
Designing Process: Logical Data Model

comments_by_user
userid        K
creationdate  C↑
commentid     C↑
videoid
comment

comments_by_video
videoid       K
creationdate  C↑
commentid     C↑
userid
comment
Designing Process: Physical Data Model

comments_by_user
userid     UUID      K
commentid  TIMEUUID  C↑
videoid    UUID
comment    TEXT

comments_by_video
videoid    UUID      K
commentid  TIMEUUID  C↑
userid     UUID
comment    TEXT

Designing Process:
Schema DDL
CREATE TABLE IF NOT EXISTS comments_by_user (
    userid uuid,
    commentid timeuuid,
    videoid uuid,
    comment text,
    PRIMARY KEY ((userid), commentid)
) WITH CLUSTERING ORDER BY (commentid DESC);

CREATE TABLE IF NOT EXISTS comments_by_video (
    videoid uuid,
    commentid timeuuid,
    userid uuid,
    comment text,
    PRIMARY KEY ((videoid), commentid)
) WITH CLUSTERING ORDER BY (commentid DESC);
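
The idea of one table per query can be mimicked in memory to see how the two tables serve the two workflows. This is a sketch under simplifying assumptions: dictionaries stand in for the tables, commentid is a plain integer instead of a timeuuid, and all IDs and comment texts are made up.

```python
from collections import defaultdict

# One in-memory "table" per query, keyed by its partition key.
comments_by_user = defaultdict(list)
comments_by_video = defaultdict(list)

def add_comment(userid, videoid, commentid, comment):
    """Write the same comment to both tables (the slides suggest BATCH for this)."""
    comments_by_user[userid].append((commentid, videoid, comment))
    comments_by_video[videoid].append((commentid, userid, comment))

def latest_first(rows):
    """Mimic CLUSTERING ORDER BY (commentid DESC)."""
    return sorted(rows, key=lambda row: row[0], reverse=True)

# commentid is a plain integer here; the schema uses a timeuuid.
add_comment("user-1", "video-1", 1, "First!")
add_comment("user-2", "video-1", 2, "Nice video")
add_comment("user-1", "video-2", 3, "Also good")

print([c[2] for c in latest_first(comments_by_video["video-1"])])  # ['Nice video', 'First!']
print([c[2] for c in latest_first(comments_by_user["user-1"])])    # ['Also good', 'First!']
```

Each read touches exactly one partition of one table, at the cost of writing every comment twice.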
https://github.com/DataStax-Academy/Intro-to-Cassandra-for-Developers#3-execute-crud-operations
menti.com
3. What’s NEXT?
Homework
MORE LEARNING!!!!
Developer site: datastax.com/dev

● Developer Stories
● New hands-on learning scenarios with Katacoda
● Try it Out
● Cassandra Fundamentals: https://www.datastax.com/learn/cassandra-fundamentals
● New Data Modeling course: https://www.datastax.com/dev/modeling

Classic courses available at DataStax Academy

✔ Academy.datastax.com
✔ datastax.com/dev
✔ community.datastax.com
✔ DataStax Developers YouTube Channel

Weekly Workshops: https://www.datastax.com/workshops

Join our 10k Discord Community: https://bit.ly/cassandra-workshop
The Fellowship of the RINGS

Thank you!
