Sharding in MongoDB

Sharding in MongoDB is a method for distributing data across multiple machines to support large datasets and high throughput operations, addressing the limitations of vertical scaling. A sharded cluster consists of shards, mongos query routers, and config servers, with shard keys determining data distribution. MongoDB manages data through chunks, which can be split and migrated to ensure even distribution across shards, with a balancer process overseeing chunk migrations.

Uploaded by

khatoonsabiya159

Sharding is a method for distributing data across multiple machines. MongoDB uses sharding to
support deployments with very large data sets and high-throughput operations.

Database systems with large data sets or high-throughput applications can challenge the capacity
of a single server. For example, high query rates can exhaust the CPU capacity of the server.
Working set sizes larger than the system's RAM stress the I/O capacity of disk drives.

There are two methods for addressing system growth: vertical and horizontal scaling.

Vertical Scaling involves increasing the capacity of a single server, such as using a more
powerful CPU, adding more RAM, or increasing the amount of storage space. Limitations in
available technology may restrict a single machine from being sufficiently powerful for a given
workload. Additionally, cloud-based providers have hard ceilings based on available hardware
configurations. As a result, there is a practical maximum for vertical scaling.

Horizontal Scaling involves dividing the system dataset and load over multiple servers, adding
additional servers to increase capacity as required. While the overall speed or capacity of a single
machine may not be high, each machine handles a subset of the overall workload, potentially
providing better efficiency than a single high-speed, high-capacity server. Expanding the capacity
of the deployment only requires adding additional servers as needed, which can be a lower
overall cost than high-end hardware for a single machine. The trade-off is increased complexity
in infrastructure and maintenance for the deployment.

MongoDB supports horizontal scaling through sharding.

Sharded Cluster
A MongoDB sharded cluster consists of the following components:
• shard: Each shard contains a subset of the sharded data. Each shard can be deployed as
a replica set.
• mongos: The mongos acts as a query router, providing an interface between client
applications and the sharded cluster. Starting in MongoDB 4.4, mongos can
support hedged reads to minimize latencies.
• config servers: Config servers store metadata and configuration settings for the cluster.
The following graphic describes the interaction of components within a sharded cluster:

MongoDB shards data at the collection level, distributing the collection data across the shards in
the cluster.

Shard Keys
MongoDB uses the shard key to distribute the collection's documents across shards. The shard
key consists of a field or multiple fields in the documents.
• Starting in version 4.4, documents in sharded collections can be missing the shard key
fields. Missing shard key fields are treated as having null values when distributing the
documents across shards but not when routing queries. For more information,
see Missing Shard Key Fields.
• In version 4.2 and earlier, shard key fields must exist in every document for a sharded
collection.
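
The missing-field rule for distribution can be illustrated with a small Python sketch. It is not MongoDB code: the function name shard_key_value and the sample fields region and customer_id are hypothetical, and None stands in for null. It models only how a value is derived for distribution, not how queries are routed.

```python
from typing import Any, Mapping, Sequence, Tuple


def shard_key_value(doc: Mapping[str, Any], key_fields: Sequence[str]) -> Tuple[Any, ...]:
    """Return the shard key value of a document as a tuple.

    Fields that are absent from the document are reported as None (null),
    mirroring how missing shard key fields are treated for distribution.
    """
    return tuple(doc.get(field) for field in key_fields)


# Hypothetical compound shard key on (region, customer_id).
key = ("region", "customer_id")
complete = {"_id": 1, "region": "eu", "customer_id": 42}
partial = {"_id": 2, "region": "eu"}  # customer_id is missing

print(shard_key_value(complete, key))  # ('eu', 42)
print(shard_key_value(partial, key))   # ('eu', None)
```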
You select the shard key when sharding a collection.
• Starting in MongoDB 5.0, you can reshard a collection by changing a collection's shard
key.
• Starting in MongoDB 4.4, you can refine a shard key by adding a suffix field or fields to
the existing shard key.
• In MongoDB 4.2 and earlier, the choice of shard key cannot be changed after sharding.
A document's shard key value determines its distribution across the shards.
• Starting in MongoDB 4.2, you can update a document's shard key value unless your
shard key field is the immutable _id field. See Change a Document's Shard Key
Value for more information.
• In MongoDB 4.0 and earlier, a document's shard key field value is immutable.

Shard Key Index

To shard a populated collection, the collection must have an index that starts with the shard key.
When sharding an empty collection, MongoDB creates the supporting index if the collection
does not already have an appropriate index for the specified shard key. See Shard Key Indexes.
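
The index rule above can be sketched as a simple precheck in Python. This is an illustration under stated assumptions, not MongoDB's implementation: indexes are modelled as plain lists of field names (index direction and hashed indexes are ignored), and the function names find_supporting_index and ensure_shard_key_index are hypothetical.

```python
from typing import List, Optional, Sequence


def find_supporting_index(
    indexes: Sequence[Sequence[str]], shard_key: Sequence[str]
) -> Optional[List[str]]:
    """Return the first index whose leading fields equal the shard key, if any."""
    key = list(shard_key)
    for index in indexes:
        if list(index[: len(key)]) == key:
            return list(index)
    return None


def ensure_shard_key_index(
    indexes: Sequence[Sequence[str]],
    shard_key: Sequence[str],
    collection_is_empty: bool,
) -> List[List[str]]:
    """Return the index list a collection would have once it passes the precheck.

    An empty collection gets the supporting index created for it; a populated
    collection without a suitable index is rejected.
    """
    result = [list(index) for index in indexes]
    if find_supporting_index(result, shard_key) is not None:
        return result
    if collection_is_empty:
        result.append(list(shard_key))
        return result
    raise ValueError("a populated collection needs an index that starts with the shard key")


# A compound index that starts with the shard key is sufficient.
print(ensure_shard_key_index([["_id"], ["customer_id", "order_date"]], ["customer_id"], False))
# An empty collection gets the index added.
print(ensure_shard_key_index([["_id"]], ["customer_id"], True))
```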

Shard Key Strategy

The choice of shard key affects the performance, efficiency, and scalability of a sharded cluster.
A cluster with the best possible hardware and infrastructure can be bottlenecked by the choice of
shard key. The choice of shard key and its backing index can also affect the sharding
strategy that your cluster can use.
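
To make the bottleneck concrete, the following Python sketch compares two ways of placing documents whose shard key increases monotonically (for example, a counter). It is a simplified model, not MongoDB's algorithm: the shard names, the chunk boundaries, and the use of an MD5 digest with a modulo are all assumptions made for illustration. Real hashed sharding assigns ranges of hash values to chunks rather than taking a modulo.

```python
import hashlib
from collections import Counter

SHARDS = ["shardA", "shardB", "shardC", "shardD"]  # hypothetical shard names


def ranged_shard(key: int, boundaries=(1000, 2000, 3000)) -> str:
    """Place a key by range: below 1000 on shardA, [1000, 2000) on shardB, and so on."""
    for shard, upper in zip(SHARDS, boundaries):
        if key < upper:
            return shard
    return SHARDS[-1]  # every key at or above the last boundary


def hashed_shard(key: int) -> str:
    """Place a key by a hash of its value (stand-in hash, for illustration only)."""
    digest = hashlib.md5(str(key).encode()).digest()
    return SHARDS[int.from_bytes(digest[:8], "big") % len(SHARDS)]


# New documents arrive with ever-increasing key values.
new_keys = range(5000, 6000)
print(Counter(ranged_shard(k) for k in new_keys))  # every insert lands on shardD
print(Counter(hashed_shard(k) for k in new_keys))  # inserts spread across the shards
```

In this model the ranged placement sends every new insert to the shard holding the highest range, regardless of how powerful the other shards are, while hashing the same keys spreads the writes.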
---------------------------------------------------------

MongoDB: Chunks
MongoDB partitions sharded data into chunks. Each chunk has an inclusive lower and exclusive
upper range based on the shard key.
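
The range semantics can be sketched in Python as a lookup over sorted chunk boundaries. This is a minimal model, assuming numeric shard key values, with float infinities standing in for the minimum and maximum key; the Chunk class, find_chunk function, and shard names are hypothetical.

```python
import bisect
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Chunk:
    lower: float  # inclusive lower bound
    upper: float  # exclusive upper bound
    shard: str    # shard that currently owns the chunk


def find_chunk(chunks: List[Chunk], key: float) -> Chunk:
    """Return the chunk whose range [lower, upper) contains key.

    chunks must be sorted by lower bound.
    """
    lowers = [chunk.lower for chunk in chunks]
    i = bisect.bisect_right(lowers, key) - 1
    if i < 0 or not (chunks[i].lower <= key < chunks[i].upper):
        raise KeyError(f"no chunk covers key {key!r}")
    return chunks[i]


chunks = [
    Chunk(float("-inf"), 100, "shardA"),
    Chunk(100, 200, "shardB"),
    Chunk(200, float("inf"), "shardC"),
]

print(find_chunk(chunks, 99).shard)   # shardA
print(find_chunk(chunks, 100).shard)  # shardB: the lower bound is inclusive
print(find_chunk(chunks, 200).shard)  # shardC: 200 is excluded from [100, 200)
```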

Chunk Splits
Splitting is a process that keeps chunks from growing too large. When a chunk grows beyond
a specified chunk size, or if the number of documents in the chunk exceeds Maximum Number
of Documents Per Chunk to Migrate, MongoDB splits the chunk based on the shard key values
the chunk represents. A chunk may be split into multiple chunks where necessary. Inserts and
updates may trigger splits. Splits are an efficient metadata change. To create splits, MongoDB
does not migrate any data or affect the shards.
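
A split can be pictured with the following Python sketch. It is a simplified model, not MongoDB's split logic: it uses a hypothetical document-count limit (max_docs), always splits into two chunks at the median key, and tracks shard key values in a list in place of documents. Both resulting chunks stay on the original shard, which reflects that a split changes only metadata.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Chunk:
    lower: float  # inclusive lower bound
    upper: float  # exclusive upper bound
    shard: str
    keys: List[float] = field(default_factory=list)  # shard key values in the chunk


def split_if_needed(chunk: Chunk, max_docs: int) -> List[Chunk]:
    """Split a chunk in two at its median key once it holds more than max_docs keys."""
    if len(chunk.keys) <= max_docs:
        return [chunk]
    ordered = sorted(chunk.keys)
    split_point = ordered[len(ordered) // 2]
    if split_point == ordered[0]:
        # The lower half shares a single key value, so this sketch leaves it whole.
        return [chunk]
    left = Chunk(chunk.lower, split_point, chunk.shard,
                 [k for k in ordered if k < split_point])
    right = Chunk(split_point, chunk.upper, chunk.shard,
                  [k for k in ordered if k >= split_point])
    return [left, right]


big = Chunk(0, 100, "shardA", list(range(10)))
for part in split_if_needed(big, max_docs=4):
    print(part.lower, part.upper, part.shard, len(part.keys))
# 0 5 shardA 5
# 5 100 shardA 5
```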

Splits may lead to an uneven distribution of the chunks for a collection across the shards. In such
cases, the balancer redistributes chunks across shards. See Cluster Balancer for more details on
balancing chunks across shards.

Chunk Migration
MongoDB migrates chunks in a sharded cluster to distribute the chunks of a sharded collection
evenly among shards. Migrations may be either:
• Manual. Only use manual migration in limited cases, such as to distribute data during
bulk inserts. See Migrating Chunks Manually for more details.
• Automatic. The balancer process automatically migrates chunks when there is an uneven
distribution of a sharded collection's chunks across the shards. See Migration
Thresholds for more details.

Balancing
The balancer is a background process that manages chunk migrations. If the difference in
the number of chunks between the shards holding the most and the fewest chunks exceeds the
migration thresholds, the balancer begins migrating chunks across the cluster to ensure an even
distribution of data.
You can manage certain aspects of the balancer. The balancer also respects any zones created as
a part of configuring zones in a sharded cluster.
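
The balancing rule can be sketched in Python as a loop over per-shard chunk counts. This is a simplified model under stated assumptions: a single hypothetical threshold value, one chunk moved per step, and no zones. The function name balance and the shard names are illustrative, and the actual balancer's thresholds and stopping condition are more detailed than this.

```python
from typing import Dict, List, Tuple


def balance(
    chunk_counts: Dict[str, int], threshold: int
) -> Tuple[List[Tuple[str, str]], Dict[str, int]]:
    """Move one chunk at a time from the fullest shard to the emptiest one.

    Stops once the difference in chunk counts no longer exceeds threshold.
    Returns the migrations as (from_shard, to_shard) pairs and the final counts.
    """
    if threshold < 1:
        raise ValueError("threshold must be at least 1")
    counts = dict(chunk_counts)
    migrations: List[Tuple[str, str]] = []
    if not counts:
        return migrations, counts
    while True:
        donor = max(counts, key=counts.get)
        recipient = min(counts, key=counts.get)
        if counts[donor] - counts[recipient] <= threshold:
            break
        counts[donor] -= 1
        counts[recipient] += 1
        migrations.append((donor, recipient))
    return migrations, counts


moves, final = balance({"shardA": 10, "shardB": 2, "shardC": 3}, threshold=2)
print(len(moves))  # 4
print(final)       # {'shardA': 6, 'shardB': 5, 'shardC': 4}
```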

Balancer and Even Chunk Distribution

In an attempt to achieve an even distribution of chunks across all shards in the cluster,
a balancer runs in the background to migrate chunks across the shards.
