0% found this document useful (0 votes)
27 views7 pages

Wide Column Stores (Module 5)

Uploaded by

dhiyajose555
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
27 views7 pages

Wide Column Stores (Module 5)

Uploaded by

dhiyajose555
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 7

WIDE COLUMN STORES -

EXAMPLES AND MAIN CHARECTERISTICS


PRESENTED BY
GROUP 3 – ARUNDHATHI ,ARUN ,ASWIN ,ATHIRA , ATMAJ
INTRODUCTION
Wide column stores are a type of NoSQL database
designed to efficiently handle vast amounts of data
while offering flexibility in schema design. Unlike
traditional relational databases, wide column stores
organize data in columns rather than rows, allowing
for fast and scalable storage and retrieval of
information. This architecture is particularly well-
suited for applications dealing with large-scale data
sets and semi-structured or unstructured data types
The basis of the architecture of wide column stores is
that data is stored in columns instead of rows as in a
conventional relational database management system
(RDBMS). And the names and format of the columns
can vary from row to row in the same table.
Subsequently, a wide column database can be
interpreted as a two-dimensional key-value. Wide
column databases do often support the notion of column
families that are stored separately. However, each such
column family typically contains multiple columns that
are used together, like traditional RDBMS tables.
Within a given column family, all data is stored in a
row-by-row fashion, such that the columns for a given
row are stored together, rather than each column being
stored separately.
EXAMPLES
Apache Cassandra: One of the most popular wide column stores,
known for its decentralized architecture, high availability, and
scalability .Cassandra's decentralized architecture and eventual
consistency model enable it to deliver fast read and write
performance while ensuring data resilience and fault tolerance.
making it suitable for use cases such as real-time analytics. IoT,
and online transaction processing (OLTP).

Apache HBase: Built on top of Hadoop and modeled after


Google's Bigtable, HBase provides real-time read/ write access
to large datasets. It is designed to provide real-time random
access to large volumes of structured and semi-structured data
stored in Hadoop clusters. HBase offers scalability, fault
tolerance, and high availability. making it suitable for
applications that require low-latency access to massive datasets,
such as social media analytics, time-series data, and
recommendation systems.
MAIN CHARECTERISTICS OF WIDE COLUMN STORES
1. Schema flexibility: Unlike traditional relational databases, wide column
stores do not require a predefined schema. Each row can have different
columns, and new columns can be added dynamically without affecting
existing data.

2. Scalability: Wide column stores are designed to scale horizontally, meaning


they can efficiently handle large volumes of data across multiple nodes or
servers. This makes them suitable for big data applications.

3. High availability: Most wide column stores offer built-in replication and
fault-tolerance mechanisms, ensuring high availability of data even in the
event of node failures.
MAIN CHARECTERISTICS OF WIDE COLUMN STORES
4. Distributed architecture: Wide column stores typically use a distributed
architecture, which allows them to achieve fault tolerance and high availability
by replicating data across multiple nodes.

5. Query flexibility: They support flexible querying through various


mechanisms, such as secondary indexes, allowing for efficient retrieval of data
based on different criteria.

6. Column-oriented storage: Data is stored in columns rather than rows, which


can lead to better performance for certain types of queries, especially those that
involve aggregating or analyzing subsets of columns.

You might also like