Best Practices For Time Series Collections - MongoDB Manual v8.0

Uploaded by

nhienduyvu

Best Practices for Time Series Collections

This page describes best practices to improve performance and data usage for time series collections.

Compression Best Practices

To optimize data compression for time series collections, perform the following actions:

Omit Fields Containing Empty Objects and Arrays from Documents

If your data contains empty objects, arrays, or strings, omit the empty fields from your documents to
optimize compression.

For example, consider the following documents:

{
  timestamp: ISODate("2020-01-23T00:00:00.441Z"),
  coordinates: [1.0, 2.0]
},
{
  timestamp: ISODate("2020-01-23T00:00:10.441Z"),
  coordinates: []
},
{
  timestamp: ISODate("2020-01-23T00:00:20.441Z"),
  coordinates: [3.0, 5.0]
}

Alternating between coordinates fields with populated values and coordinates fields with an empty array
results in a schema change for the compressor. The schema change causes the second and third documents
in the sequence to remain uncompressed.

Optimize compression by omitting the fields with empty values, as shown in the following documents:

{
  timestamp: ISODate("2020-01-23T00:00:00.441Z"),
  coordinates: [1.0, 2.0]
},
{
  timestamp: ISODate("2020-01-23T00:00:10.441Z")
},
{
  timestamp: ISODate("2020-01-23T00:00:20.441Z"),
  coordinates: [3.0, 5.0]
}
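
One way to apply this on the client side is to strip empty fields before inserting. The following is a minimal sketch in plain JavaScript, not part of the MongoDB API: the helper names isEmptyValue and omitEmptyFields are hypothetical, and the sketch only inspects top-level fields.

```javascript
// Returns true for values this sketch treats as "empty":
// empty arrays, empty strings, and plain objects with no keys.
function isEmptyValue(value) {
  if (Array.isArray(value)) return value.length === 0;
  if (typeof value === "string") return value.length === 0;
  if (value !== null && typeof value === "object") {
    // Only plain objects count; Date, ObjectId, and similar types are kept.
    return Object.getPrototypeOf(value) === Object.prototype &&
      Object.keys(value).length === 0;
  }
  return false;
}

// Returns a shallow copy of doc without its empty top-level fields.
function omitEmptyFields(doc) {
  return Object.fromEntries(
    Object.entries(doc).filter(([, value]) => !isEmptyValue(value))
  );
}

// The second document from the example above, with its empty array removed.
const cleaned = omitEmptyFields({
  timestamp: new Date("2020-01-23T00:00:10.441Z"),
  coordinates: []
});
```

Here cleaned keeps only the timestamp field, matching the second document in the optimized example.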

Round Numeric Data to Few Decimal Places

Round numeric data to the precision that your application requires. Rounding numeric data to fewer
decimal places improves the compression ratio.
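
As an illustration, rounding can be done in the application before the write. This is a minimal sketch under stated assumptions: roundTo and roundField are hypothetical helper names, and the choice of one decimal place is only an example of "the precision that your application requires".

```javascript
// Rounds value to the given number of decimal places.
// Binary floating point can make some midpoint cases round unexpectedly.
function roundTo(value, decimals) {
  const factor = 10 ** decimals;
  return Math.round(value * factor) / factor;
}

// Returns a copy of doc with one numeric field rounded.
function roundField(doc, field, decimals) {
  return { ...doc, [field]: roundTo(doc[field], decimals) };
}

// A hypothetical measurement rounded to one decimal place.
const rounded = roundField({ temperature: 21.4567 }, "temperature", 1);
```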

Inserts Best Practices

To optimize insert performance for time series collections, perform the following actions:

Batch Document Writes

When inserting multiple documents:

To avoid network roundtrips, use a single insertMany() statement as opposed to multiple insertOne()
statements.

If possible, insert data that contains identical metaField values in the same batches.

Set the ordered parameter to false.

For example, if you have two sensors that correspond to two metaField values, sensor A and sensor B, a
batch that contains multiple measurements from a single sensor incurs the cost of one insert, rather than
one insert per measurement.

The following operation inserts six documents, but only incurs the cost of two inserts (one
per metaField value), because the documents are ordered by sensor. The ordered parameter is set
to false to improve performance:

db.temperatures.insertMany(
  [
    {
      metaField: {
        sensor: "sensorA"
      },
      timestamp: ISODate("2021-05-18T00:00:00.000Z"),
      temperature: 10
    },
    {
      metaField: {
        sensor: "sensorA"
      },
      timestamp: ISODate("2021-05-19T00:00:00.000Z"),
      temperature: 12
    },
    {
      metaField: {
        sensor: "sensorA"
      },
      timestamp: ISODate("2021-05-20T00:00:00.000Z"),
      temperature: 13
    },
    {
      metaField: {
        sensor: "sensorB"
      },
      timestamp: ISODate("2021-05-18T00:00:00.000Z"),
      temperature: 20
    },
    {
      metaField: {
        sensor: "sensorB"
      },
      timestamp: ISODate("2021-05-19T00:00:00.000Z"),
      temperature: 25
    },
    {
      metaField: {
        sensor: "sensorB"
      },
      timestamp: ISODate("2021-05-20T00:00:00.000Z"),
      temperature: 26
    }
  ],
  { "ordered": false }
)
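
If measurements arrive interleaved across sensors, they can be regrouped before the batch is sent. The following is a minimal client-side sketch, assuming the metaField values serialize consistently with JSON.stringify (same key order for the same sensor); groupByMeta is a hypothetical helper, not a MongoDB function.

```javascript
// Returns a new array in which documents that share the same metaField value
// are adjacent. Groups appear in order of first occurrence, and the original
// order is preserved within each group.
function groupByMeta(docs, metaKey) {
  const groups = new Map();
  for (const doc of docs) {
    // Assumption: equal metaField values produce equal JSON strings.
    const key = JSON.stringify(doc[metaKey]);
    if (!groups.has(key)) groups.set(key, []);
    groups.get(key).push(doc);
  }
  return [...groups.values()].flat();
}

// Hypothetical interleaved readings from two sensors.
const readings = [
  { metaField: { sensor: "sensorA" }, temperature: 10 },
  { metaField: { sensor: "sensorB" }, temperature: 20 },
  { metaField: { sensor: "sensorA" }, temperature: 12 },
  { metaField: { sensor: "sensorB" }, temperature: 25 }
];

const batch = groupByMeta(readings, "metaField");

// In mongosh, the regrouped batch would then be written with:
// db.temperatures.insertMany(batch, { ordered: false })
```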

Use Consistent Field Order in Documents

Using a consistent field order in your documents improves insert performance.

For example, inserting the following documents, all of which have the same field order, results in optimal
insert performance.

{
  _id: ObjectId("6250a0ef02a1877734a9df57"),
  timestamp: ISODate("2020-01-23T00:00:00.441Z"),
  name: "sensor1",
  range: 1
},
{
  _id: ObjectId("6560a0ef02a1877734a9df66"),
  timestamp: ISODate("2020-01-23T01:00:00.441Z"),
  name: "sensor1",
  range: 5
}

In contrast, the following documents do not achieve optimal insert performance, because their field orders
differ:

{
  range: 1,
  _id: ObjectId("6250a0ef02a1877734a9df57"),
  name: "sensor1",
  timestamp: ISODate("2020-01-23T00:00:00.441Z")
},
{
  _id: ObjectId("6560a0ef02a1877734a9df66"),
  name: "sensor1",
  timestamp: ISODate("2020-01-23T01:00:00.441Z"),
  range: 5
}
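
When documents come from sources that build fields in different orders, the application can normalize the order before inserting. The following is a minimal sketch: withFieldOrder is a hypothetical helper, and it relies on JavaScript preserving insertion order for non-numeric string keys.

```javascript
// Returns a copy of doc whose fields follow fieldOrder.
// Fields not listed in fieldOrder are appended afterwards in their original order.
function withFieldOrder(doc, fieldOrder) {
  const has = (obj, key) => Object.prototype.hasOwnProperty.call(obj, key);
  const ordered = {};
  for (const field of fieldOrder) {
    if (has(doc, field)) ordered[field] = doc[field];
  }
  for (const [field, value] of Object.entries(doc)) {
    if (!has(ordered, field)) ordered[field] = value;
  }
  return ordered;
}

// The field order used by the first pair of example documents.
const FIELD_ORDER = ["_id", "timestamp", "name", "range"];

// A document built in a different order (placeholder values for illustration).
const normalized = withFieldOrder(
  { range: 1, _id: "placeholder-id", name: "sensor1", timestamp: new Date("2020-01-23T00:00:00.441Z") },
  FIELD_ORDER
);
```

After normalization, Object.keys(normalized) follows FIELD_ORDER regardless of how the input object was built.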

Increase the Number of Clients

Increasing the number of clients that write data to your collections can improve performance.

Sharding Best Practices

To optimize sharding on your time series collection, perform the following action:

Use the metaField as your Shard Key

Using the metaField to shard your collection provides sufficient cardinality as a shard key for time
series collections.

NOTE

Starting in MongoDB 8.0, the use of the timeField as a shard key in time series collections is
deprecated.

Query Best Practices

To optimize queries on your time series collection, perform the following actions:

Set a Strategic metaField When Creating the Collection

Your choice of metaField has the biggest impact on optimizing queries in your application.

Select fields that rarely or never change as part of your metaField.

If possible, select identifiers or other stable values that are common in filter expressions as part of
your metaField.

Avoid selecting fields that are not used for filtering as part of your metaField. Instead, use those fields
as measurements.

For more information, see metaField Considerations.

Set Appropriate Bucket Granularity

When you create a time series collection, MongoDB groups incoming time series data into buckets. By
accurately setting granularity, you control how frequently data is bucketed based on the ingestion rate of
your data.

Starting in MongoDB 6.3, you can use the custom bucketing parameters bucketMaxSpanSeconds and
bucketRoundingSeconds to specify bucket boundaries and more precisely control how time series data is
bucketed.

You can improve performance by setting the granularity or custom bucketing parameters to the best
match for the time span between incoming measurements from the same data source. For example, if you
are recording weather data from thousands of sensors but only record data from each sensor once per 5
minutes, you can either set granularity to "minutes" or set the custom bucketing parameters
to 300 (seconds).

In this case, setting the granularity to hours groups up to a month's worth of data ingest events into a
single bucket, resulting in longer traversal times and slower queries. Setting it to seconds leads to multiple
buckets per polling interval, many of which might contain only a single document.

The following table shows the maximum time interval included in one bucket of data when using a
given granularity value:

granularity    granularity bucket limit

seconds        1 hour
minutes        24 hours
hours          30 days
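
For the 5-minute weather example, the two alternatives could be expressed as collection options along the following lines. This is a sketch under stated assumptions: the field names timestamp and metaField and the collection name weather are borrowed from other examples on this page, customBucketing is a hypothetical helper, and both custom parameters are set to the same value of 300 seconds.

```javascript
// Option 1: a predefined granularity.
const granularityOptions = {
  timeseries: {
    timeField: "timestamp",
    metaField: "metaField",
    granularity: "minutes"
  }
};

// Option 2 (MongoDB 6.3 and later): custom bucketing parameters,
// both set to the same span in seconds, with no granularity field.
function customBucketing(timeField, metaField, spanSeconds) {
  return {
    timeseries: {
      timeField,
      metaField,
      bucketMaxSpanSeconds: spanSeconds,
      bucketRoundingSeconds: spanSeconds
    }
  };
}

const customOptions = customBucketing("timestamp", "metaField", 300);

// In mongosh, either object would be passed when creating the collection:
// db.createCollection("weather", granularityOptions)
// db.createCollection("weather", customOptions)
```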

Create Secondary Indexes

To improve query performance, create one or more secondary indexes on
your timeField and metaField to support common query patterns. In versions 6.3 and higher,
MongoDB creates a secondary index on the timeField and metaField automatically.

Additional Index Best Practices

Use the metaField index for filtering and equality.

Use the timeField and other indexed fields for range queries.

General indexing strategies also apply to time series collections. For more information, see Indexing
Strategies.

Query the metaField on Sub-Fields

MongoDB reorders the metaField of time-series collections, which may cause servers to store data in a
different field order than applications. If a metaField is an object, queries on the metaField may
produce inconsistent results because metaField order may vary between servers and applications. To
optimize queries on a time-series metaField, query the metaField on scalar sub-fields rather than the
entire metaField.

The following example inserts two measurements into the weather time series collection:

db.weather.insertMany( [
  {
    metaField: { sensorId: 5578, type: "temperature" },
    timestamp: ISODate( "2021-05-18T00:00:00.000Z" ),
    temp: 12
  },
  {
    metaField: { sensorId: 5578, type: "temperature" },
    timestamp: ISODate( "2021-05-18T04:00:00.000Z" ),
    temp: 11
  }
] )

The following query on the sensorId and type scalar sub-fields returns the first document that matches
the query criteria:

db.weather.findOne( {
  "metaField.sensorId": 5578,
  "metaField.type": "temperature"
} )

Example output:

{
  _id: ObjectId("6572371964eb5ad43054d572"),
  metaField: { sensorId: 5578, type: 'temperature' },
  timestamp: ISODate( "2021-05-18T00:00:00.000Z" ),
  temp: 12
}
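
To see why sub-field queries are more robust, the following plain-JavaScript sketch simulates the two matching styles in memory. It is only an illustration of order sensitivity, not the server's matching logic: matchesWholeObject and matchesSubFields are hypothetical helpers, and JSON.stringify stands in for an order-sensitive comparison of the entire metaField.

```javascript
// Order-sensitive comparison, standing in for a match on the entire metaField object.
function matchesWholeObject(stored, query) {
  return JSON.stringify(stored) === JSON.stringify(query);
}

// Per-field comparison, standing in for dot-notation queries on scalar sub-fields.
function matchesSubFields(stored, query) {
  return Object.entries(query).every(([field, value]) => stored[field] === value);
}

// The same logical metaField, written with two different field orders.
const storedMeta = { sensorId: 5578, type: "temperature" };
const queryMeta = { type: "temperature", sensorId: 5578 };

const wholeObjectMatch = matchesWholeObject(storedMeta, queryMeta); // false
const subFieldMatch = matchesSubFields(storedMeta, queryMeta);      // true
```

The whole-object comparison fails as soon as the field order differs, while the per-field comparison does not depend on order.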

Use $group Instead of distinct()

Due to the unique data structure of time series collections, MongoDB can't efficiently index them for
distinct values. Avoid using the distinct command or db.collection.distinct() helper method
on time series collections. Instead, use a $group aggregation to group documents by distinct values.

For example, to query for distinct meta.type values on documents where meta.project = 10, instead
of:

db.foo.distinct("meta.type", {"meta.project": 10})

Use:

db.foo.createIndex({"meta.project": 1, "meta.type": 1})
db.foo.aggregate([
  {$match: {"meta.project": 10}},
  {$group: {_id: "$meta.type"}}
])

This works as follows:

1. Creating a compound index on meta.project and meta.type supports the aggregation.
2. The $match stage filters for documents where meta.project = 10.

3. The $group stage uses meta.type as the group key to output one document per unique value.
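
The effect of the $match and $group stages can be mimicked on an in-memory array to show the shape of the output. This is an illustrative sketch only: distinctViaGroup is a hypothetical helper, the sample documents are made up, and the server does not guarantee any particular order for $group results.

```javascript
// Mimics {$match: {"meta.project": projectId}} followed by
// {$group: {_id: "$meta.type"}} on an array of documents.
function distinctViaGroup(docs, projectId) {
  // $match: keep only documents for the requested project.
  const matched = docs.filter((doc) => doc.meta.project === projectId);

  // $group: one output document per unique meta.type value.
  const groups = new Map();
  for (const doc of matched) {
    groups.set(doc.meta.type, { _id: doc.meta.type });
  }
  return [...groups.values()];
}

// Made-up sample data for illustration.
const sampleDocs = [
  { meta: { project: 10, type: "a" } },
  { meta: { project: 10, type: "b" } },
  { meta: { project: 10, type: "a" } },
  { meta: { project: 11, type: "c" } }
];

const distinctTypes = distinctViaGroup(sampleDocs, 10);
```

Each element of distinctTypes has the form { _id: <type> }, which is the same shape the aggregation returns.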
