Best Distributed Databases for Linux

Compare the Top Distributed Databases for Linux as of April 2025

Sort By:

Distributed Databases Linux Clear Filters

What are Distributed Databases for Linux?

Distributed databases store data across multiple physical locations, often across different servers or even geographical regions, allowing for high availability and scalability. Unlike traditional databases, distributed databases divide data and workloads among nodes in a network, providing faster access and load balancing. They are designed to be resilient, with redundancy and data replication ensuring that data remains accessible even if some nodes fail. Distributed databases are essential for applications that require quick access to large volumes of data across multiple locations, such as global eCommerce, finance, and social media. By decentralizing data storage, they support high-performance, fault-tolerant operations that scale with an organization’s needs. Compare and read user reviews of the best Distributed Databases for Linux currently available using the table below. This list is updated regularly.

1

InterSystems IRIS

InterSystems

InterSystems IRIS is a complete cloud-first data platform that includes a multi-model transactional data management engine, an application development platform, and interoperability engine, and an open analytics platform. It is the next generation of our proven data management software.It includes the capabilities of InterSystems Cache and Ensemble, plus a wealth of exciting new capabilities to make it easy to build and deploy cloud based, analytics-intensive enterprise applications with even greater performance and scalability. InterSystems IRIS provides a set of APIs to operate with transactional persistent data simultaneously: key-value, relational, object, document, multidimensional. Data can be managed by SQL, Java, node.js, .NET, C++, Python, and native server-side ObjectScript language. InterSystems IRIS includes

23 Ratings

View Software
2

Objectivity/DB

Objectivity, Inc.

Objectivity/DB is a massively scalable, high performance, distributed Object Database (ODBMS). It is extremely good at handling complex data, where there are many types of connections between objects and many variants. Objectivity/DB can also serve as a massively scalable, high performance graph database. Its DO query language supports standard data retrieval queries as well as high-performance path-based navigational queries. Objectivity/DB is a distributed database, presenting a Single Logical View of its managed data. Data can be hosted on a single machine or distributed across up to 65,000 machines. Connected items can span machines. Objectivity/DB runs on 32 or 64-bit processors running Windows, Linux, and Mac OS X. APIs include: C++, C#, Java and Python. All platform and language combinations are interoperable. For example, objects stored by a program using C++ on Linux can be read by a C# program on Windows and a Java program on Mac OS X.

1 Rating

Starting Price: See Pricing Details...

View Software
3

eXtremeDB

McObject

How is platform independent eXtremeDB different? - Hybrid data storage. Unlike other IMDS, eXtremeDB can be all-in-memory, all-persistent, or have a mix of in-memory tables and persistent tables - Active Replication Fabric™ is unique to eXtremeDB, offering bidirectional replication, multi-tier replication (e.g. edge-to-gateway-to-gateway-to-cloud), compression to maximize limited bandwidth networks and more - Row & Columnar Flexibility for Time Series Data supports database designs that combine row-based and column-based layouts, in order to best leverage the CPU cache speed - Embedded and Client/Server. Fast, flexible eXtremeDB is data management wherever you need it, and can be deployed as an embedded database system, and/or as a client/server database system -A hard real-time deterministic option in eXtremeDB/rt Designed for use in resource-constrained, mission-critical embedded systems. Found in everything from routers to satellites to trains to stock markets worldwide

View Software
4

RavenDB

RavenDB

RavenDB is the pioneer NoSQL Document Database that is fully transactional (ACID) across your database and throughout your cluster. At a fraction of the total cost of ownership (TCO), our open source distributed database offers high availability and high performance with zero administration. It is designed as an easy to use all-in-one database which minimizes the need for third party addons, tools, or support to boost developer productivity and get your project into production fast. You can setup and secure a data cluster in minutes and deploy in the cloud, on-premise or in a hybrid environment. RavenDB offers a Database as a Service solution, allowing you to pass on all your database operations to us so you can focus exclusively on your application. RavenDB has a built-in storage engine, Voron, that operates at speeds up to 1 million reads per second and 150,000 writes per second on a single node using simple commodity hardware to increase your application’s performance.

View Software
5

Apache Trafodion

Apache Software Foundation

Apache Trafodion is a webscale SQL-on-Hadoop solution enabling transactional or operational workloads on Apache Hadoop. Trafodion builds on the scalability, elasticity, and flexibility of Hadoop. Trafodion extends Hadoop to provide guaranteed transactional integrity, enabling new kinds of big data applications to run on Hadoop. Full-functioned ANSI SQL language support. JDBC/ODBC connectivity for Linux/Windows clients. Distributed ACID transaction protection across multiple statements, tables, and rows. Performance improvements for OLTP workloads with compile-time and run-time optimizations. Support for large data sets using a parallel-aware query optimizer. Reuse existing SQL skills and improve developer productivity. Distributed ACID transactions guarantee data consistency across multiple rows and tables. Interoperability with existing tools and applications. Hadoop and Linux distribution neutral. Easy to add to your existing Hadoop infrastructure.

Starting Price: Free

View Software
6

AntDB

Antdb AsiaInfo

AntDB is a cloud-native, distributed relational database developed by AsiaInfo Technologies, designed to handle high-performance online transaction processing and online analytical processing workloads. AntDB has been serving over 1 billion subscribers across 24 provinces in China, supporting massive business data related to calls, internet access, payments, and billing. AntDB's cloud-native distributed architecture supports online scalability, data consistency, and high availability across data centers. It is compatible with SQL2016 standards and integrates seamlessly with various domestic ecosystems, including mainstream CPUs and operating systems. The platform offers features such as automatic high availability, online elastic capacity expansion, and read/write splitting at the kernel level to efficiently manage traffic loads during peak periods. AntDB has been successfully commercialized in industries like telecommunications, finance, transportation, and energy.

Starting Price: Free

View Software
7

OrbitDB

OrbitDB

OrbitDB is a serverless, distributed, peer-to-peer database that utilizes IPFS for data storage and Libp2p Pubsub for automatic synchronization across peers. It employs Merkle-CRDTs to ensure conflict-free database writes and merges, making it suitable for decentralized applications, blockchain integrations, and local-first web apps. OrbitDB offers various database types tailored to different use cases: 'events' for immutable append-only logs, 'documents' for JSON document storage indexed by a specified key, 'keyvalue' for traditional key-value pairs, and 'keyvalue-indexed' for LevelDB-indexed key-value data. All these databases are built atop OpLog, an immutable, cryptographically verifiable, operation-based CRDT structure. The JavaScript implementation supports both browser and Node.js environments, with a Go version maintained by the Berty project.

Starting Price: Free

View Software
8

ScyllaDB

ScyllaDB

ScyllaDB is the database for data-intensive apps that require high performance and low latency. It enables teams to harness the ever-increasing computing power of modern infrastructures – eliminating barriers to scale as data grows. Unlike any other database, ScyllaDB is a distributed NoSQL database fully compatible with Apache Cassandra and Amazon DynamoDB, yet is built with deep architectural advancements that enable exceptional end-user experiences at radically lower costs. Over 400 game-changing companies like Disney+ Hotstar, Expedia, FireEye, Discord, Zillow, Starbucks, Comcast, and Samsung use ScyllaDB for their toughest database challenges. ScyllaDB is available as free open source software, a fully-supported enterprise product, and a fully managed database-as-a-service (DBaaS) on multiple cloud providers.

View Software
9

Greenplum

Greenplum Database

Greenplum Database® is an advanced, fully featured, open source data warehouse. It provides powerful and rapid analytics on petabyte scale data volumes. Uniquely geared toward big data analytics, Greenplum Database is powered by the world’s most advanced cost-based query optimizer delivering high analytical query performance on large data volumes. Greenplum Database® project is released under the Apache 2 license. We want to thank all our current community contributors and are interested in all new potential contributions. For the Greenplum Database community no contribution is too small, we encourage all types of contributions. An open-source massively parallel data platform for analytics, machine learning and AI. Rapidly create and deploy models for complex applications in cybersecurity, predictive maintenance, risk management, fraud detection, and many other areas. Experience the fully featured, integrated, open source analytics platform.

View Software
10

CrateDB

CrateDB

The enterprise database for time series, documents, and vectors. Store any type of data and combine the simplicity of SQL with the scalability of NoSQL. CrateDB is an open source distributed database running queries in milliseconds, whatever the complexity, volume and velocity of data.

View Software
11

Tarantool

Tarantool

Corporations need a way to ensure uninterrupted operation of their systems, high speed of data processing, and reliability of storage. The in-memory technologies have proven themselves well in solving these problems. For more than 10 years, Tarantool has been helping companies all over the world build smart caches, data marts, and golden client profiles while saving server capacity. Reduce the cost of storing credentials compared to siloed solutions and improve the service and security of client applications. Reduce data management costs of maintaining a large number of disparate systems that store customer identities. Increase sales by improving the speed and quality of customer recommendations for goods or services through the analysis of user behavior and user data. Improve mobile and web channel service by accelerating frontends to reduce user outflow. IT systems of large organizations operate in a closed loop of a local network, where data circulates unprotected.

View Software
12

Couchbase

Couchbase

Unlike other NoSQL databases, Couchbase provides an enterprise-class, multicloud to edge database that offers the robust capabilities required for business-critical applications on a highly scalable and available platform. As a distributed cloud-native database, Couchbase runs in modern dynamic environments and on any cloud, either customer-managed or fully managed as-a-service. Couchbase is built on open standards, combining the best of NoSQL with the power and familiarity of SQL, to simplify the transition from mainframe and relational databases. Couchbase Server is a multipurpose, distributed database that fuses the strengths of relational databases such as SQL and ACID transactions with JSON’s versatility, with a foundation that is extremely fast and scalable. It’s used across industries for things like user profiles, dynamic product catalogs, GenAI apps, vector search, high-speed caching, and much more.

View Software
13

Neo4j

Neo4j

Neo4j’s graph data platform is purpose-built to leverage not only data but also data relationships. Using Neo4j, developers build intelligent applications that traverse today's large, interconnected datasets in real time. Powered by a native graph storage and processing engine, Neo4j’s graph database delivers an intuitive, flexible and secure database for unique, actionable insights.

View Software
14

JanusGraph

JanusGraph

JanusGraph is a scalable graph database optimized for storing and querying graphs containing hundreds of billions of vertices and edges distributed across a multi-machine cluster. JanusGraph is a project under The Linux Foundation, and includes participants from Expero, Google, GRAKN.AI, Hortonworks, IBM and Amazon. Elastic and linear scalability for a growing data and user base. Data distribution and replication for performance and fault tolerance. Multi-datacenter high availability and hot backups. All functionality is totally free. No need to buy commercial licenses. JanusGraph is fully open source under the Apache 2 license. JanusGraph is a transactional database that can support thousands of concurrent users executing complex graph traversals in real time. Support for ACID and eventual consistency. In addition to online transactional processing (OLTP), JanusGraph supports global graph analytics (OLAP) with its Apache Spark integration.

View Software
15

Nebula Graph

vesoft

The graph database built for super large-scale graphs with milliseconds of latency. We are continuing to collaborate with the community to prepare, popularize and promote the graph database. Nebula Graph only allows authenticated access via role-based access control. Nebula Graph supports multiple storage engine types and the query language can be extended to support new algorithms. Nebula Graph provides low latency read and write , while still maintaining high throughput to simplify the most complex data sets. With a shared-nothing distributed architecture , Nebula Graph offers linear scalability. Nebula Graph's SQL-like query language is easy to understand and powerful enough to meet complex business needs. With horizontal scalability and a snapshot feature, Nebula Graph guarantees high availability even in case of failures. Large Internet companies like JD, Meituan, and Xiaohongshu have deployed Nebula Graph in production environments.

View Software
16

Apache Kudu

The Apache Software Foundation

A Kudu cluster stores tables that look just like tables you're used to from relational (SQL) databases. A table can be as simple as a binary key and value, or as complex as a few hundred different strongly-typed attributes. Just like SQL, every table has a primary key made up of one or more columns. This might be a single column like a unique user identifier, or a compound key such as a (host, metric, timestamp) tuple for a machine time-series database. Rows can be efficiently read, updated, or deleted by their primary key. Kudu's simple data model makes it a breeze to port legacy applications or build new ones, no need to worry about how to encode your data into binary blobs or make sense of a huge database full of hard-to-interpret JSON. Tables are self-describing, so you can use standard tools like SQL engines or Spark to analyze your data. Kudu's APIs are designed to be easy to use.

View Software
17

rqlite

rqlite

The lightweight, user-friendly, distributed relational database built on SQLite. Fault tolerance and high availability with zero hassle. rqlite is a distributed relational database that combines the simplicity of SQLite with the robustness of a fault-tolerant, highly available system. It's developer-friendly, its operation is straightforward, and it's designed for reliability with minimal complexity. Deploy in seconds, with no complex configurations. Seamlessly integrates with modern cloud infrastructures. Built on SQLite, the world’s most popular database. Supports full-text search, Vector Search, and JSON documents. Access controls and encryption for secure deployments. Rigorous, automated testing ensures high quality. Clustering provides high availability and fault tolerance. Automatic node discovery simplifies clustering.

View Software