Cassandra vs MongoDB
Independent benchmark analyses of various NoSQL platforms under big data and
production-level workloads have been performed over the years. Most of them, including recent
ones, show that Apache Cassandra performs significantly better than Couchbase 3.0 and MongoDB 3.0
(with the WiredTiger storage engine) in both throughput and latency.
Cassandra advantages:
Peer-to-Peer Architecture:
Every node in a Cassandra cluster plays an identical role; there is no master node, so there is no
single point of failure and any node can serve any request.
Elastic Scalability:
One of the biggest advantages of using Cassandra is its elastic scalability. A Cassandra cluster can be
easily scaled up or scaled down: any number of nodes can be added to or removed from the cluster
with little disruption. You don't have to restart the cluster or change the application's queries while
scaling. This is why Cassandra is known for sustaining very high throughput at large node counts. As
scaling happens, read and write throughput both increase, with zero downtime and no pause to the
applications.
Data Replication:
Another striking feature of Cassandra is data replication, which makes Cassandra highly available and
fault-tolerant. Replication means each piece of data is stored in more than one location, so even if
one node fails, the user can still retrieve the data from another location. In a Cassandra cluster, each
row is replicated based on its row key, and you can set the number of replicas you want to create. Just
like scaling, data replication can also span multiple data centers, which gives Cassandra strong
backup and recovery capabilities.
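As a minimal sketch of how this is configured, using the DataStax Python driver (the keyspace name,
data center names, and replica counts here are illustrative assumptions):

    from cassandra.cluster import Cluster

    cluster = Cluster(["127.0.0.1"])
    session = cluster.connect()

    # Keep 3 replicas of every row in dc1 and 2 replicas in dc2.
    session.execute("""
        CREATE KEYSPACE IF NOT EXISTS app_data
        WITH replication = {
            'class': 'NetworkTopologyStrategy',
            'dc1': 3,
            'dc2': 2
        }
    """)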
High Performance:
Cassandra provides very fast writes, which are actually faster than reads; a node can move data at
about 80-360 MB/sec. It achieves this using two techniques:
Cassandra keeps most of the data in memory at the responsible node, and any updates are
made in memory and written to persistent storage (the file system) in a lazy fashion. To avoid
losing data, however, Cassandra writes all transactions to a commit log on disk. Unlike
in-place updates of data items on disk, writes to the commit log are append-only and therefore
avoid rotational delay while writing to the disk.
Unless a write requests full consistency, Cassandra writes data to enough nodes without
resolving any data inconsistencies; inconsistencies are resolved only at the first read. This
process is called "read repair."
Tunable Consistency:
Cassandra supports two consistency models:
Eventual consistency - the client is acknowledged as soon as the cluster accepts the
write
Strong consistency - any update is broadcast to all machines or nodes on which the
particular data is situated
You can adopt either of these, based on your requirements. You also have the freedom to blend
eventual and strong consistency. For instance, you can use eventual consistency for remote
data centers, where latency is quite high, and strong consistency for local data centers, where
latency is low.
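As a minimal sketch of tunable consistency with the DataStax Python driver (the users table is an
illustrative assumption), the consistency level can be chosen per statement:

    from cassandra import ConsistencyLevel
    from cassandra.cluster import Cluster
    from cassandra.query import SimpleStatement

    session = Cluster(["127.0.0.1"]).connect("app_data")

    # Eventual consistency: acknowledge once a single replica accepts the write.
    fast_write = SimpleStatement(
        "INSERT INTO users (id, name) VALUES (%s, %s)",
        consistency_level=ConsistencyLevel.ONE,
    )
    session.execute(fast_write, (1, "alice"))

    # Strong consistency: a majority of replicas must acknowledge the write.
    safe_write = SimpleStatement(
        "INSERT INTO users (id, name) VALUES (%s, %s)",
        consistency_level=ConsistencyLevel.QUORUM,
    )
    session.execute(safe_write, (2, "bob"))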
Replication:
Cassandra has much more advanced support for replication because it is aware of the network
topology. The server can be set to use a specific consistency level to ensure that queries are replicated
locally or to remote data centers. This means you can let Cassandra handle redundancy across nodes
while it knows which rack and data center each node is in. Cassandra can also monitor nodes and
route queries away from slow-responding nodes.
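A minimal sketch of this topology awareness on the client side, assuming the DataStax Python driver
and a local data center named dc1:

    from cassandra.cluster import Cluster, ExecutionProfile, EXEC_PROFILE_DEFAULT
    from cassandra.policies import DCAwareRoundRobinPolicy, TokenAwarePolicy

    # Route each query to a replica that owns the requested data (token-aware),
    # preferring nodes in the local data center.
    profile = ExecutionProfile(
        load_balancing_policy=TokenAwarePolicy(
            DCAwareRoundRobinPolicy(local_dc="dc1")
        )
    )
    cluster = Cluster(["127.0.0.1"],
                      execution_profiles={EXEC_PROFILE_DEFAULT: profile})
    session = cluster.connect("app_data")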
Idempotency:
Idempotency is easy to maintain (there is no need to run a query before an insertion), which prevents
duplication of data.
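This falls out of Cassandra's write model: an INSERT is an upsert on the primary key. A minimal
sketch (the users table is an illustrative assumption):

    from cassandra.cluster import Cluster

    session = Cluster(["127.0.0.1"]).connect("app_data")

    insert = "INSERT INTO users (id, name) VALUES (%s, %s)"
    session.execute(insert, (1, "alice"))
    session.execute(insert, (1, "alice"))  # same primary key: overwrite, not a duplicate

    row = session.execute("SELECT count(*) FROM users WHERE id = 1").one()
    print(row[0])  # 1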
Memory requirements:
Cassandra is much lighter on memory requirements, especially if you don't need to keep a lot of
data in cache.
Hadoop advantages:
Hadoop Distributed File System (HDFS) - can store massive, distributed, unstructured data sets.
Data can be stored directly in HDFS, or in a semi-structured format in HBase,
which allows rapid record-level data access
MapReduce capabilities are very strong
Hadoop disadvantages:
HDFS is extremely complex to set up
Has single points of failure
MongoDB advantages:
Easier development and much better documentation
Better fit for a single server
Stores BSON (a binary form of JSON), which is easy to manage and extremely useful when working
with web applications
Strongly consistent by default
Scalability – MongoDB has a number of functions related to scalability (see the sketch after this list)
o automatic sharding (auto-partitioning of data across servers)
o reads and writes distributed over shards
o eventually-consistent reads that can be distributed over replicated servers
Availability - data is spread across several shards (replica sets).
Typically, each shard consists of multiple Mongo Daemon instances, including an
arbiter node, a master node, and multiple slaves. If a slave node fails, its
workload is automatically redistributed to the remaining slaves. If the master
node crashes, the surviving nodes, with the arbiter casting a vote, elect a new master.
A replica set can span multiple data centers, but writes can only go to the one primary instance
in one data center.
Simple and powerful indexing - indexes work very much like in relational databases. You can
create single-field or compound indexes at the collection level, and every document inserted
into that collection has those fields indexed. Querying by index is extremely fast as long as all
your indexes fit in memory (see the sketch after this list).
Dynamic queries, sorting, rich updates…
MapReduce can be used for batch processing of data and for aggregation operations. The
aggregation framework enables users to obtain the kinds of results for which the SQL GROUP BY
clause is used.
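As a minimal pymongo sketch of the sharding, indexing, and aggregation features above (database,
collection, and field names are illustrative, and the sharding commands assume a connection to the
mongos router of an already configured sharded cluster):

    from pymongo import ASCENDING, MongoClient

    client = MongoClient("mongodb://localhost:27017")
    db = client.shop

    # Auto-sharding: partition shop.orders across shards by customer_id.
    client.admin.command("enableSharding", "shop")
    client.admin.command("shardCollection", "shop.orders", key={"customer_id": 1})

    # Compound index at the collection level; every inserted document
    # has these fields indexed.
    db.orders.create_index([("customer_id", ASCENDING), ("created_at", ASCENDING)])

    # Aggregation framework: the MongoDB counterpart of SQL GROUP BY.
    pipeline = [
        {"$group": {"_id": "$customer_id", "total": {"$sum": "$amount"}}},
        {"$sort": {"total": -1}},
    ]
    for doc in db.orders.aggregate(pipeline):
        print(doc["_id"], doc["total"])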
MongoDB disadvantages:
A global write lock limits its use for big data applications. (When you perform a write operation in
MongoDB, it takes a lock on the entire database, not just the affected entries, and not just for
a particular connection. This lock blocks not only other write operations but also read
operations.)
Writes in MongoDB are "unsafe" by default.
Data isn't written to disk right away, so a write operation can return
success yet be lost if the server fails before the data is flushed to disk. This is how Mongo attains
high performance. If you need increased durability, you can request a safe write, which
guarantees the data is written to disk before returning (see the sketch after this list).
Memory usage - MongoDB naturally tends to use more memory because it has to
store the key names within each document, since the data structure is not
necessarily consistent among the data objects.
Increasing the cluster size in Mongo involves a lot of manual operations done through the command
line, so a highly skilled system administrator is practically mandatory for this database.
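A minimal pymongo sketch of trading durability for speed per collection (the default write concern
has changed across MongoDB versions, so the levels are spelled out explicitly here; names are
illustrative):

    from pymongo import MongoClient
    from pymongo.write_concern import WriteConcern

    db = MongoClient("mongodb://localhost:27017").shop

    # Fire-and-forget write (w=0): fastest, but may be silently lost if the
    # server fails before the data reaches disk.
    fast = db.orders.with_options(write_concern=WriteConcern(w=0))
    fast.insert_one({"customer_id": 1, "amount": 9.99})

    # "Safe" write: wait for acknowledgement and for the journal to be
    # written to disk before returning.
    safe = db.orders.with_options(write_concern=WriteConcern(w=1, j=True))
    safe.insert_one({"customer_id": 2, "amount": 19.99})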
Couchbase advantages:
Couchbase is really user/developer/admin friendly. You can easily see what's going on in your
cluster through the web console, and when things go wrong, the web console is a huge advantage.
Built-in caching mechanism - Couchbase includes a Memcached component that can operate
independently (if you wish) of the document storage components.
Low-latency read and write operations
No single point of failure
Document (key-value) access in Couchbase is strongly consistent, while query access is eventually
consistent (see the sketch after this list)
Scalability - it is easy to scale out the cluster, and live cluster topology changes are supported (all
nodes are identical, easy to set up, and can be added or removed with no changes to the application)
Cross-datacenter replication makes it possible to scale a cluster across datacenters for better
data locality and faster data access.
Availability - Couchbase Server maintains multiple copies (up to three replicas) of
each document in a cluster. Each server is identical and serves both active and
replica documents. Data is uniformly distributed across all the nodes, and the
clients are aware of the topology. If a node in the cluster fails, Couchbase Server
detects the failure and promotes replica documents on other live nodes to active.
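A minimal sketch of that consistency split, assuming the Couchbase Python SDK 4.x (module paths
moved between SDK versions) and an illustrative bucket named travel with a primary index in place:

    from couchbase.auth import PasswordAuthenticator
    from couchbase.cluster import Cluster
    from couchbase.n1ql import QueryScanConsistency
    from couchbase.options import ClusterOptions, QueryOptions

    cluster = Cluster("couchbase://localhost",
                      ClusterOptions(PasswordAuthenticator("user", "password")))
    collection = cluster.bucket("travel").default_collection()

    # Key-value (document) access is strongly consistent: a get after an
    # upsert always observes the write.
    collection.upsert("user::1", {"name": "alice"})
    print(collection.get("user::1").content_as[dict])

    # Query (N1QL) access is eventually consistent by default; REQUEST_PLUS
    # forces the index to catch up with this client's mutations first.
    result = cluster.query(
        "SELECT t.* FROM travel AS t WHERE t.name = $1",
        QueryOptions(scan_consistency=QueryScanConsistency.REQUEST_PLUS,
                     positional_parameters=["alice"]),
    )
    for row in result:
        print(row)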