Introduction To NOSQL and Cassandra: @rantav @outbrain

The document provides an introduction to NoSQL and Cassandra. It discusses some of the challenges of modern web applications that have led to the development of NoSQL databases, such as large data sizes, high read/write rates, and frequent schema changes. It then summarizes Cassandra, describing it as a column-oriented distributed database modeled after Bigtable that provides eventual consistency. The document also covers Cassandra's data model and basic operations through its API.

Uploaded by

chrisjaure

Introduction to NOSQL

And Cassandra
@rantav
@outbrain
SQL is good

• Rich language
• Easy to use and integrate
• Rich toolset
• Many vendors

• The promise: ACID

o Atomicity
o Consistency
o Isolation
o Durability
SQL Rules
BUT
HOWEVER...
The Challenge: Modern web apps

• Internet-scale data size

• High read-write rates
• Frequent schema changes

• "social" apps - not banks

o They don't need the same level of ACID

SCALING
Scaling Solutions - Replication

Scales Reads
Scaling Solutions - Sharding

Scales also Writes

Brewer's CAP Theorem:

You can only choose two

CAP
Availability + Consistency (no Partition Tolerance)

• Single master SQL server

• Or an array of SQL servers
Consistency + Partition Tolerance (no Availability)
Availability + Partition Tolerance (no Consistency)
Consistency Levels

• Strong Consistency (RDBMS, Local Disk, RAM, ...)

• Weak Consistency - no guarantees

• Eventual Consistency (Cassandra, DNS, etc.)

o Causal consistency. A writes, then tells B "I wrote"; B's subsequent reads see A's write.
o Read-your-writes consistency (a special case of causal consistency).
o Monotonic read consistency. A reads x. In future reads, A will never read older values of x.
o Monotonic write consistency. Writes by the same process are serialized.
Existing NOSQL Solutions

Cassandra

• Developed at Facebook

• Follows the BigTable data model - column oriented

• Follows the Dynamo eventual consistency model

• Open-sourced at Apache

• Implemented in Java
CONSISTENCY DOWN TO EARTH
N/R/W

• N - Number of replicas (nodes) for any data item

• W - Number of nodes a write operation blocks on

• R - Number of nodes a read operation blocks on

N/R/W - Typical Values

• W=1 => Block until first node written successfully

• W=N => Block until all nodes written successfully
• W=0 => Async writes

• R=1 => Block until the first node returns an answer

• R=N => Block until all nodes return an answer
• R=0 => Doesn't make sense

• QUORUM:
o R = N/2+1 (integer division)
o W = N/2+1 (integer division)
o => R + W > N, so every read overlaps the latest write: fully consistent
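
The quorum arithmetic above can be checked with a few lines of Java. This is only a sketch: `QuorumMath`, `quorum` and `overlaps` are hypothetical names chosen for illustration, not part of any Cassandra API.

```java
// Hypothetical helper for illustration; not a Cassandra class.
class QuorumMath {
    /** Quorum size for N replicas: N/2 + 1, using integer division. */
    static int quorum(int n) {
        return n / 2 + 1;
    }

    /** True when any R readers must share at least one node with any W writers. */
    static boolean overlaps(int n, int r, int w) {
        return r + w > n;
    }

    public static void main(String[] args) {
        int n = 3;
        int q = quorum(n);
        System.out.println("quorum for N=3: " + q);
        System.out.println("QUORUM reads + QUORUM writes overlap: " + overlaps(n, q, q));
        System.out.println("R=1, W=1 overlap: " + overlaps(n, 1, 1));
    }
}
```

With N=3, reading and writing at QUORUM touches two nodes each, so at least one node sees both operations; R=1 with W=1 gives no such guarantee.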
Data Model - Forget SQL

Do you know SQL?

Data Model - Vocabulary

• Keyspace – like a namespace for unique keys.

• Column Family – very much like a table… but not quite.

• Key – a key that represents a row (of columns)

• Column – representation of a value with:

o Column name
o Value
o Timestamp

• Super Column – a column that holds a list of columns inside

Data Model - Columns

struct Column {
1: binary name,
2: binary value,
3: i64 timestamp,
}

JSON-ish notation:
{
"name": "emailAddress",
"value": "[email protected]",
"timestamp": 123456789 }
Data Model - Column Family

• Similar to SQL tables

• Has many columns
• Has many rows
Data Model - Rows

• Primary key for objects

• All keys are arbitrary-length strings
{
"Users": {
   "ran":{
       {"name":"emailAddress", "value":"[email protected]"},
       {"name":"webSite", "value":"http://bar.com"}
   },
   "f.rat":{
       {"name":"emailAddress", "value":"[email protected]"}
   }
},
"Stats":{
   "ran":{
       {"name":"visits", "value":"243"}
   }
}
}
Data Model - Short Notation

Users: CF
   ran: ROW
       emailAddress: [email protected],   COLUMN
       webSite: http://bar.com COLUMN
   f.rat:   ROW
       emailAddress: [email protected] COLUMN
Stats:   CF
   ran: ROW
       visits: 243   COLUMN
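
One way to picture the short notation is as nested maps: column family -> row key -> sorted columns. The Java sketch below is an analogy only, assuming string names and values; `DataModelSketch`, `build` and `put` are hypothetical names, and timestamps are left out for brevity.

```java
import java.util.Map;
import java.util.TreeMap;

// Hypothetical illustration of the CF -> ROW -> COLUMN nesting; not Cassandra code.
class DataModelSketch {
    /** Inserts one column, creating the column family and row on first use. */
    static void put(Map<String, Map<String, TreeMap<String, String>>> keyspace,
                    String columnFamily, String rowKey, String column, String value) {
        keyspace.computeIfAbsent(columnFamily, k -> new TreeMap<>())
                .computeIfAbsent(rowKey, k -> new TreeMap<>())
                .put(column, value);
    }

    /** Builds a small keyspace with values taken from the slides above. */
    static Map<String, Map<String, TreeMap<String, String>>> build() {
        Map<String, Map<String, TreeMap<String, String>>> keyspace = new TreeMap<>();
        put(keyspace, "Users", "ran", "webSite", "http://bar.com");
        put(keyspace, "Stats", "ran", "visits", "243");
        return keyspace;
    }

    public static void main(String[] args) {
        // Columns within a row come back sorted by name because of the TreeMap.
        System.out.println(build());
    }
}
```

Rows in the same column family do not need the same set of columns, which is why each row gets its own map.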
Data Model - Songs example

Songs:
   Meir Ariel:
       Shir Keev: 6:13,
       Tikva: 4:11,
       Erol: 6:17
       Suetz: 5:30
       Dr Hitchakmut: 3:30
   Mashina:
       Rakevet Layla: 3:02
       Optikai: 5:40
Data Model - Super Columns

Songs:
   Meir Ariel:
       Shirey Hag:
           Shir Keev: 6:13,
           Tikva: 4:11,
           Erol: 6:17
       Vegluy Eynaim:
           Suetz: 5:30
           Dr Hitchakmut: 3:30
   Mashina:
       ...
Data Model - Super Columns

• Columns whose values are lists of columns

The API

get
get_slice
multiget
multiget_slice
get_count
get_range_slice
get_range_slices
insert
remove
batch_insert
batch_mutate
The True API

get(keyspace, key, column_path, consistency)

get_slice(ks, key, column_parent, predicate, consistency)
multiget(ks, keys, column_path, consistency)
multiget_slice(ks, keys, column_parent,
predicate, consistency)
...
Consistency Model

• N - per keyspace
• R - per read request
• W - per write request
Consistency Model

Cassandra defines:

enum ConsistencyLevel {
   ZERO = 0,
   ONE = 1,
   QUORUM = 2,
   DCQUORUM = 3,
   ALL = 5,
}
Java Code

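// Open a Thrift connection to a local Cassandra node
TTransport tr = new TSocket("localhost", 9160);
TProtocol proto = new TBinaryProtocol(tr);
Cassandra.Client client = new Cassandra.Client(proto);
tr.open();

String key_user_id = "1";

// The timestamp is used to order competing writes to the same column
long timestamp = System.currentTimeMillis();

// Write column "name" of row "1" in column family "Standard1" (no super column)
client.insert("Keyspace1",
              key_user_id,
              new ColumnPath("Standard1",
                             null,
                             "name".getBytes("UTF-8")),
              "Chris Goffinet".getBytes("UTF-8"),
              timestamp,
              ConsistencyLevel.ONE);

tr.close();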
Java Client - Hector
http://github.com/rantav/hector
• The de-facto Java client for Cassandra

• Encapsulates Thrift
• Adds JMX (Monitoring)
• Connection pooling
• Failover
• Open-sourced at GitHub and has a growing community of developers and users.
Java Client - Hector - cont

  /**
   * Insert a new value keyed by key
   *
   * @param key Key for the value
   * @param value the String value to insert
   */
  public void insert(final String key, final String value) {
    Mutator m = createMutator(keyspaceOperator);
    m.insert(key,
             CF_NAME,
             createColumn(COLUMN_NAME, value));
  }
Java Client - Hector - cont

  /**
   * Get a string value.
   *
   * @return The string value; null if no value exists for the given key.
   */
  public String get(final String key) throws HectorException {
    ColumnQuery<String, String> q = createColumnQuery(keyspaceOperator, serializer, serializer);
    Result<HColumn<String, String>> r = q.setKey(key).
        setName(COLUMN_NAME).
        setColumnFamily(CF_NAME).
        execute();
    HColumn<String, String> c = r.get();
    return c == null ? null : c.getValue();
  }
Extra

If you're not snoring yet...

Sorting

Columns are sorted by their type

• BytesType
• UTF8Type
• AsciiType
• LongType
• LexicalUUIDType
• TimeUUIDType
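
The choice of column type changes the order in which columns come back. The sketch below mimics the difference between a text comparator and a numeric one using plain Java comparators; it does not use Cassandra's comparator classes, and `ColumnOrderSketch` and its methods are hypothetical names.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Hypothetical illustration: the same column names ordered as text vs. as longs.
class ColumnOrderSketch {
    /** Orders names lexicographically, roughly what a text comparator would do. */
    static List<String> sortedAsText(List<String> names) {
        List<String> copy = new ArrayList<>(names);
        copy.sort(Comparator.naturalOrder());
        return copy;
    }

    /** Orders names by numeric value, roughly what a long comparator would do. */
    static List<String> sortedAsLong(List<String> names) {
        List<String> copy = new ArrayList<>(names);
        copy.sort(Comparator.comparingLong((String s) -> Long.parseLong(s)));
        return copy;
    }
}
```

For the names "9", "10" and "100", text ordering yields 10, 100, 9 while numeric ordering yields 9, 10, 100, so the type should match how the column names are meant to be read.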

Rows are sorted by their Partitioner

• RandomPartitioner
• OrderPreservingPartitioner
• CollatingOrderPreservingPartitioner
Thrift

Cross-language protocol
Compiles to: C++, Java, PHP, Ruby, Erlang, Perl, ...

struct UserProfile {
   1: i32 uid,
   2: string name,
   3: string blurb
}

service UserStorage {
void store(1: UserProfile user),
UserProfile retrieve(1: i32 uid)
}
Thrift

Generating sources:

thrift --gen java cassandra.thrift

thrift --gen py cassandra.thrift

Internals
Required Reading ;-)

BigTable http://labs.google.com/papers/bigtable.html

Dynamo http://www.allthingsdistributed.com/2007/10/amazons_dynamo.html
From Dynamo:

• Symmetric p2p architecture
• Gossip based discovery and error detection
• Distributed key-value store
o Pluggable partitioning
o Pluggable topology discovery
• Eventually consistent, tunable per operation
From BigTable

• Sparse, column-oriented array

• SSTable disk storage
o Append-only commit log
o Memtable (buffering and sorting)
o Immutable sstable files
o Compactions
o High write performance
Architecture Layers

• Cluster Management
o Messaging service
o Gossip
o Failure detection
o Cluster state
o Partitioner
o Replication

• Single Host
o Commit log
o Memtable
o SSTable
o Indexes
o Compaction

• Consistency
o Tombstones
o Hinted handoff
o Read repair
o Bootstrap
o Monitoring
o Admin tools
Gossip

• p2p
• Enables seamless addition of nodes.
• Rebalancing of keys
• Fast detection of nodes that go down.
• Every node knows about all others - no master.
Internals - Consistent Hashing

Memtables

• In-memory representation of recently written data

• When the table is full, it's sorted and then flushed to disk -> sstable
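
The buffer-then-flush behaviour can be sketched in memory as follows. This is a simplification under stated assumptions: "full" is a fixed entry count, a `TreeMap` keeps entries sorted as they arrive instead of sorting at flush time, and the "disk" is just a list of read-only maps. `MemtableSketch` and its members are hypothetical names.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.SortedMap;
import java.util.TreeMap;

// Hypothetical in-memory model of a memtable that flushes to immutable sorted runs.
class MemtableSketch {
    private final int capacity;
    private TreeMap<String, String> memtable = new TreeMap<>();
    private final List<SortedMap<String, String>> sstables = new ArrayList<>();

    MemtableSketch(int capacity) {
        this.capacity = capacity;
    }

    /** Buffers a write; flushes once the memtable reaches its capacity. */
    void write(String key, String value) {
        memtable.put(key, value);
        if (memtable.size() >= capacity) {
            flush();
        }
    }

    /** Freezes the current memtable as a read-only sorted run and starts a fresh one. */
    private void flush() {
        sstables.add(Collections.unmodifiableSortedMap(memtable));
        memtable = new TreeMap<>();
    }

    int memtableSize() {
        return memtable.size();
    }

    List<SortedMap<String, String>> sstables() {
        return Collections.unmodifiableList(sstables);
    }
}
```

Because a flushed run is never modified again, later writes to the same key land in a newer run, and compaction reconciles them afterwards.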
SSTables

Sorted Strings Tables

• Immutable
• On-disk
• Sorted by a string key
• In-memory index of elements
• Binary search (in memory) to find element location
• Bloom filter to reduce number of unneeded binary searches.
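
The in-memory index lookup can be sketched as a binary search over sorted keys that returns a position in the on-disk file. The layout here (parallel arrays of keys and byte offsets) is an assumption made for illustration, and `SSTableIndexSketch` and `locate` are hypothetical names.

```java
import java.util.Arrays;

// Hypothetical in-memory index: sorted keys mapped to byte offsets in an SSTable file.
class SSTableIndexSketch {
    private final String[] sortedKeys;
    private final long[] offsets;

    /** Both arrays must have the same length, and sortedKeys must already be sorted. */
    SSTableIndexSketch(String[] sortedKeys, long[] offsets) {
        if (sortedKeys.length != offsets.length) {
            throw new IllegalArgumentException("keys and offsets must have the same length");
        }
        this.sortedKeys = sortedKeys.clone();
        this.offsets = offsets.clone();
    }

    /** Returns the file offset for the key, or -1 if the key is not in this SSTable. */
    long locate(String key) {
        int i = Arrays.binarySearch(sortedKeys, key);
        return i >= 0 ? offsets[i] : -1L;
    }
}
```

A bloom filter check placed in front of `locate` would let most lookups for absent keys skip the binary search entirely.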
Write Path

Write Path

Compactions

Write Properties

• No Locks in the critical path

• Always available to writes, even if there are failures.

• No reads
• No seeks
• Fast
• Atomic within ColumnFamily
Read Path
Reads
Read Properties

• Read multiple SSTables

• Slower than writes (but still fast)
• Seeks can be mitigated with more RAM
• Uses probabilistic bloom filters to reduce lookups.
Bloom Filters

• Space efficient probabilistic data structure

• Test whether an element is a member of a set
• Allows false positives, but not false negatives
• k hash functions
• Union and intersection are implemented as bitwise OR, AND
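
A minimal bloom filter along these lines might look as follows. It is a sketch, not Cassandra's implementation: the k indexes are derived from `String.hashCode()` by double hashing, which is an assumption chosen for brevity, and `BloomFilterSketch` is a hypothetical name.

```java
import java.util.BitSet;

// Hypothetical bloom filter with k derived hash functions over a fixed-size bit set.
class BloomFilterSketch {
    private final BitSet bits;
    private final int size;
    private final int k;

    BloomFilterSketch(int size, int k) {
        this.size = size;
        this.k = k;
        this.bits = new BitSet(size);
    }

    /** i-th bit index for an item, using double hashing: h1 + i * h2 (mod size). */
    private int index(String item, int i) {
        int h1 = item.hashCode();
        int h2 = (h1 >>> 16) | 1; // forced odd so successive indexes differ
        return Math.floorMod(h1 + i * h2, size);
    }

    void add(String item) {
        for (int i = 0; i < k; i++) {
            bits.set(index(item, i));
        }
    }

    /** False means definitely absent; true means possibly present. */
    boolean mightContain(String item) {
        for (int i = 0; i < k; i++) {
            if (!bits.get(index(item, i))) {
                return false;
            }
        }
        return true;
    }

    /** Union via bitwise OR; both filters must use the same size and k. */
    void union(BloomFilterSketch other) {
        if (other.size != size || other.k != k) {
            throw new IllegalArgumentException("filters must share size and k");
        }
        bits.or(other.bits);
    }
}
```

An added item always reports true, so there are no false negatives; an item that was never added can still report true if other items happened to set all of its bits.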
Compactions

• Merge keys
• Combine columns
• Discard tombstones
• Merge bloom filters with a bitwise OR operation

• Large and Small compactions
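
The merge step can be sketched for two SSTables as below. This is a simplified model: each key holds a single cell, the newest timestamp wins, and tombstones are dropped immediately, whereas a real system would wait for a configurable delay before discarding them. `CompactionSketch`, `Cell` and `compact` are hypothetical names.

```java
import java.util.Map;
import java.util.SortedMap;
import java.util.TreeMap;

// Hypothetical compaction of two sorted runs into one.
class CompactionSketch {
    /** A value with its write timestamp; tombstone marks a deletion. */
    static final class Cell {
        final String value;
        final long timestamp;
        final boolean tombstone;

        Cell(String value, long timestamp, boolean tombstone) {
            this.value = value;
            this.timestamp = timestamp;
            this.tombstone = tombstone;
        }
    }

    /** Merges keys, keeps the newest cell per key, then discards tombstones. */
    static SortedMap<String, Cell> compact(SortedMap<String, Cell> first,
                                           SortedMap<String, Cell> second) {
        TreeMap<String, Cell> merged = new TreeMap<>(first);
        for (Map.Entry<String, Cell> e : second.entrySet()) {
            merged.merge(e.getKey(), e.getValue(),
                    (a, b) -> b.timestamp >= a.timestamp ? b : a);
        }
        merged.values().removeIf(c -> c.tombstone);
        return merged;
    }
}
```

The tombstone has to survive until it has been merged with every older run that still holds the deleted value; otherwise the old value would reappear on the next read.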

Deletions

• Deletion marker (tombstone) necessary to suppress data in older SSTables, until compaction
• Read repair complicates things a little
• Eventual consistency complicates things more
• Solution: configurable delay before tombstone GC, after which tombstones are not repaired
Extra Long list of subjects

SEDA
anti entropy
hinted handoff
repair on read
timestamps -> vector clocks
consistent hashing
merkle trees
References

• http://horicky.blogspot.com/2009/11/nosql-patterns.html
• http://s3.amazonaws.com/AllThingsDistributed/sosp/amazon-dynamo-sosp2007.pdf
• http://labs.google.com/papers/bigtable.html
• https://nosqleast.com/2009/
• http://bret.appspot.com/entry/how-friendfeed-uses-mysql
• http://www.julianbrowne.com/article/viewer/brewers-cap-theorem
• http://www.allthingsdistributed.com/2008/12/eventually_consistent.html
• http://wiki.apache.org/cassandra/DataModel
• http://incubator.apache.org/thrift/
