0% found this document useful (0 votes)

36 views1 page

Mongodb Schema Design Part 1

Uploaded by

Javier Morales

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

36 views1 page

Mongodb Schema Design Part 1

Uploaded by

Javier Morales

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 1

| Blog Home News Applied Developer QuickStart Updates Culture Events Mark Loves Tech All  Search

6 Rules of Thumb for MongoDB Schema

Design: Part 1
Get Started with MongoDB Atlas

MongoDB
May 29, 2014 | Updated: June 17, 2020
#Technical

By William Zola, Lead Technical Support Engineer at MongoDB

“I have lots of experience with SQL, but I’m just a beginner with MongoDB. How do I model a one-to-
N relationship?” This is one of the more common questions I get from users attending MongoDB
office hours.

I don’t have a short answer to this question, because there isn’t just one way, there’s a whole
rainbow’s worth of ways. MongoDB has a rich and nuanced vocabulary for expressing what, in SQL,
gets flattened into the term “One-to-N”. Let me take you on a tour of your choices in modeling One-
to-N relationships.

There’s so much to talk about here, I’m breaking this up into three parts. In this first part, I’ll talk about
the three basic ways to model One-to-N relationships. In the second part I’ll cover more sophisticated
schema designs, including denormalization and two-way referencing. And in the final part, I’ll review
the entire rainbow of choices, and give you some suggestions for choosing among the thousands
(really – thousands) of choices that you may consider when modeling a single One-to-N relationship.

Many beginners think that the only way to model “One-to-N” in MongoDB is to embed an array of
sub-documents into the parent document, but that’s just not true. Just because you can embed a
document, doesn’t mean you should embed a document.

When designing a MongoDB schema, you need to start with a question that you’d never consider
when using SQL: what is the cardinality of the relationship? Put less formally: you need to
characterize your “One-to-N” relationship with a bit more nuance: is it “one-to-few”, “one-to-many”, or
“one-to-squillions”? Depending on which one it is, you’d use a different format to model the
relationship.

Basics: Modeling One-to-Few

An example of “one-to-few” might be the addresses for a person. This is a good use case for
embedding – you’d put the addresses in an array inside of your Person object:

> db.person.findOne()
{
name: 'Kate Monster',
ssn: '123-456-7890',
addresses : [
{ street: '123 Sesame St', city: 'Anytown', cc: 'USA' },
{ street: '123 Avenue Q', city: 'New York', cc: 'USA' }
]
}

This design has all of the advantages and disadvantages of embedding. The main advantage is that
you don’t have to perform a separate query to get the embedded details; the main disadvantage is
that you have no way of accessing the embedded details as stand-alone entities.

For example, if you were modeling a task-tracking system, each Person would have a number of
Tasks assigned to them. Embedding Tasks inside the Person document would make queries like
“Show me all Tasks due tomorrow” much more difficult than they need to be. I will cover a more
appropriate design for this use case in the next post.

Basics: One-to-Many

An example of “one-to-many” might be parts for a product in a replacement parts ordering system.
Each product may have up to several hundred replacement parts, but never more than a couple
thousand or so. (All of those different-sized bolts, washers, and gaskets add up.) This is a good use
case for referencing – you’d put the ObjectIDs of the parts in an array in product document. (For
these examples I’m using 2-byte ObjectIDs because they’re easier to read: real-world code would use
12-byte ObjectIDs.)

Each Part would have its own document:

> db.parts.findOne()
{
_id : ObjectID('AAAA'),
partno : '123-aff-456',
name : '#4 grommet',
qty: 94,
cost: 0.94,
price: 3.99

Each Product would have its own document, which would contain an array of ObjectID references to
the Parts that make up that Product:

> db.products.findOne()
{
name : 'left-handed smoke shifter',
manufacturer : 'Acme Corp',
catalog_number: 1234,
parts : [ // array of references to Part documents
ObjectID('AAAA'), // reference to the #4 grommet above
ObjectID('F17C'), // reference to a different Part
ObjectID('D2AA'),
// etc
]

You would then use an application-level join to retrieve the parts for a particular product:

// Fetch the Product document identified by this catalog number

> product = db.products.findOne({catalog_number: 1234});
// Fetch all the Parts that are linked to this Product
> product_parts = db.parts.find({_id: { $in : product.parts } } ).toArray() ;

For efficient operation, you’d need to have an index on ‘products.catalog_number’. Note that there will
always be an index on ‘parts._id’, so that query will always be efficient.

This style of referencing has a complementary set of advantages and disadvantages to embedding.
Each Part is a stand-alone document, so it’s easy to search them and update them independently.
One trade off for using this schema is having to perform a second query to get details about the Parts
for a Product. (But hold that thought until we get to denormalizing in part 2.)

As an added bonus, this schema lets you have individual Parts used by multiple Products, so your
One-to-N schema just became an N-to-N schema without any need for a join table!

Basics: One-to-Squillions

An example of “one-to-squillions” might be an event logging system that collects log messages for
different machines. Any given host could generate enough messages to overflow the 16 MB
document size, even if all you stored in the array was the ObjectID. This is the classic use case for
“parent-referencing” – you’d have a document for the host, and then store the ObjectID of the host in
the documents for the log messages.

> db.hosts.findOne()
{
_id : ObjectID('AAAB'),
name : 'goofy.example.com',
ipaddr : '127.66.66.66'
}

>db.logmsg.findOne()
{
time : ISODate("2014-03-28T09:42:41.382Z"),
message : 'cpu is on fire!',
host: ObjectID('AAAB') // Reference to the Host document
}

You’d use a (slightly different) application-level join to find the most recent 5,000 messages for a
host:

// find the parent ‘host’ document

> host = db.hosts.findOne({ipaddr : '127.66.66.66'}); // assumes unique index
// find the most recent 5000 log message documents linked to that host
> last_5k_msg = db.logmsg.find({host: host._id}).sort({time : -1}).limit(5000).toArray
()

Recap

So, even at this basic level, there is more to think about when designing a MongoDB schema than
when designing a comparable relational schema. You need to consider two factors:

Will the entities on the “N” side of the One-to-N ever need to stand alone?

What is the cardinality of the relationship: is it one-to-few; one-to-many; or one-to-squillions?

Based on these factors, you can pick one of the three basic One-to-N schema designs:

Embed the N side if the cardinality is one-to-few and there is no need to access the embedded object outside
the context of the parent object

Use an array of references to the N-side objects if the cardinality is one-to-many or if the N-side objects
should stand alone for any reasons

Use a reference to the One-side in the N-side objects if the cardinality is one-to-squillions

Next time we’ll see how to use two-way relationship and denormalizing to enhance the performance
of these basic schemas.

Part 2: Two-way referencing and denormalization

Part 3: Your guide through the rainbow

More Information
Schema Design Consulting Services

Thinking in Documents (recorded webinar)

Schema Design for Time-Series Data (recorded webinar)

Socialite, the Open Source Status Feed - Storing a Social Graph (recorded webinar)

This post was updated in January 2015 to include additional resources and updated links.

Get Started with MongoDB Atlas

Try Free
Run MongoDB in the cloud for free with MongoDB Atlas. No
credit card required.

   

← Previous Next →

Dwight Merriman Named EY Accelerate App Delivery with

Entrepreneur Of The Year™ 2014 Cognizant's Next Gen Continuous
Award finalist in New York Integrator

EY recently announced that Co-founder The phrase “digital transformation” is

and Chairman Dwight Merriman of ubiquitous these days. But what does it
MongoDB, Inc. is a finalist for the EY… actually mean? Often, the heart of a…

May 27, 2014 September 29, 2021

Resources Education & Support Popular Topics About Follow Us

NoSQL Database Explained View Course Catalog MongoDB on AWS MongoDB, Inc. Facebook

MongoDB Architecture Guide Certification MongoDB on Google Cloud Leadership Github

MongoDB Enterprise Advanced MongoDB Manual Run MongoDB on Multiple Clouds with MongoDB Press Room Youtube
Atlas
MongoDB Atlas Installation Careers Twitter
Migrate to MongoDB Atlas
MongoDB Realm Support Investors LinkedIn
What is a Cloud Database?
MongoDB Engineering Blog Community Legal Notices StackOverflow
Building a REST API with MongoDB Realm
FAQ Privacy Notice Twitch

Security
Information

Trust Center

Office Locations

Code of Conduct

Mongo, MongoDB, and the MongoDB leaf logo are registered trademarks of MongoDB, Inc.

03_Chapter_Relationships_DataModeling_Mongodb_New
No ratings yet
03_Chapter_Relationships_DataModeling_Mongodb_New
60 pages
FSD5
No ratings yet
FSD5
43 pages
The Little Mongo DB Schema Design Book by Christian Amor Kvalheim
No ratings yet
The Little Mongo DB Schema Design Book by Christian Amor Kvalheim
153 pages
Module-05_FSD(BIS601) Search Creators
No ratings yet
Module-05_FSD(BIS601) Search Creators
43 pages
Embedded Documents
No ratings yet
Embedded Documents
10 pages
Data-Modeling
No ratings yet
Data-Modeling
36 pages
INS Assignments
No ratings yet
INS Assignments
4 pages
PPT Lecture 2.4 and 2.5 Relationship Types and Diff Between Data Models (1)
No ratings yet
PPT Lecture 2.4 and 2.5 Relationship Types and Diff Between Data Models (1)
39 pages
Chapter 3-Database Modelling
No ratings yet
Chapter 3-Database Modelling
59 pages
WEBX IAT2 QB SOLN
No ratings yet
WEBX IAT2 QB SOLN
13 pages
01_Chapter_Introducing Data Modeling
No ratings yet
01_Chapter_Introducing Data Modeling
50 pages
1ST
No ratings yet
1ST
6 pages
fundamental of database group work
No ratings yet
fundamental of database group work
15 pages
12 Steps To Enabling Audit in PostgreSQL
No ratings yet
12 Steps To Enabling Audit in PostgreSQL
15 pages
Unit 2
No ratings yet
Unit 2
85 pages
Dbms Unit5 Notes
No ratings yet
Dbms Unit5 Notes
81 pages
SQL101
No ratings yet
SQL101
54 pages
Fsd Unit III
No ratings yet
Fsd Unit III
22 pages
Docker Inc Docker Fundamentals Course PDF
0% (1)
Docker Inc Docker Fundamentals Course PDF
193 pages
Csis 3300 w5 9 Nosql
No ratings yet
Csis 3300 w5 9 Nosql
27 pages
lec_7_part_2
No ratings yet
lec_7_part_2
7 pages
Mysql Constraints
No ratings yet
Mysql Constraints
24 pages
Se CH 3
No ratings yet
Se CH 3
16 pages
DPA Lecture 6
No ratings yet
DPA Lecture 6
69 pages
Wa0009.
No ratings yet
Wa0009.
1 page
slides01-31-45
No ratings yet
slides01-31-45
15 pages
Mongodb Session 2
100% (1)
Mongodb Session 2
47 pages
Mongo DB Cheat Sheet KKJHG
No ratings yet
Mongo DB Cheat Sheet KKJHG
9 pages
SQL To MongoDB Mapping Chart
No ratings yet
SQL To MongoDB Mapping Chart
17 pages
Composite Datatypes: Types: PL/SQL Records PL/SQL Tables Contain Internal Components Are Reusable
No ratings yet
Composite Datatypes: Types: PL/SQL Records PL/SQL Tables Contain Internal Components Are Reusable
14 pages
Chapitre 4 MongoDB
No ratings yet
Chapitre 4 MongoDB
27 pages
Class 12 Competency Based Question - Computer Science Chap 8 (2024-25)
No ratings yet
Class 12 Competency Based Question - Computer Science Chap 8 (2024-25)
25 pages
Nouveau Document Microsoft Word (3) (AutoRecovered)
No ratings yet
Nouveau Document Microsoft Word (3) (AutoRecovered)
7 pages
No SQLData Modeling
No ratings yet
No SQLData Modeling
22 pages
HANA Traces PerformanceTrace 2.00.040+
No ratings yet
HANA Traces PerformanceTrace 2.00.040+
3 pages
What is a Document Database (1)
No ratings yet
What is a Document Database (1)
7 pages
Grade 7 Fa2 Worksheet 2023-24
No ratings yet
Grade 7 Fa2 Worksheet 2023-24
3 pages
Project Work
No ratings yet
Project Work
6 pages
SQL Cheat Sheet - Basics - SELECT, INSERT, UPDATE, DELETE, COUNT, DISTINCT, LIMIT
No ratings yet
SQL Cheat Sheet - Basics - SELECT, INSERT, UPDATE, DELETE, COUNT, DISTINCT, LIMIT
2 pages
Cse249-Database-Management-System - QB
No ratings yet
Cse249-Database-Management-System - QB
55 pages
4 - Explore Concepts of Non-Relational Data
No ratings yet
4 - Explore Concepts of Non-Relational Data
14 pages
3 - Relationships in Data
No ratings yet
3 - Relationships in Data
62 pages
Mongodb Relationships1
No ratings yet
Mongodb Relationships1
2 pages
Chapter 11 - ABAP Native SQL
No ratings yet
Chapter 11 - ABAP Native SQL
9 pages
RecPgm4 10
No ratings yet
RecPgm4 10
19 pages
10.2 Oracle Statement-Level Triggers by Practical Examples
No ratings yet
10.2 Oracle Statement-Level Triggers by Practical Examples
4 pages
G8-HBase 2
No ratings yet
G8-HBase 2
100 pages
Mongodb Schema Design Part 2
No ratings yet
Mongodb Schema Design Part 2
1 page
Schema Chalk Talk
No ratings yet
Schema Chalk Talk
36 pages
Introducing DocumentDB Chappell v1.1
No ratings yet
Introducing DocumentDB Chappell v1.1
13 pages
Case Study 3
No ratings yet
Case Study 3
2 pages
1-MongoDB (3 Files Merged)
No ratings yet
1-MongoDB (3 Files Merged)
7 pages
Reporting With Reports Viewer in Visual Studio 2005: C# Corner Authors Team
No ratings yet
Reporting With Reports Viewer in Visual Studio 2005: C# Corner Authors Team
25 pages
MongoDB CheatSheet
No ratings yet
MongoDB CheatSheet
9 pages
Drillhole Database Creation
No ratings yet
Drillhole Database Creation
8 pages
Data Modeling With MongoDB
No ratings yet
Data Modeling With MongoDB
59 pages
Mean Stack Technologies Unit-5
No ratings yet
Mean Stack Technologies Unit-5
9 pages
DBMS 1: Introduction + ER Diagram + Functional Dependency
No ratings yet
DBMS 1: Introduction + ER Diagram + Functional Dependency
9 pages
Final Examsql
No ratings yet
Final Examsql
243 pages
AFUN20
No ratings yet
AFUN20
46 pages
Mongodb Schema Design Part 3
No ratings yet
Mongodb Schema Design Part 3
1 page
Whats The Difference of Majority Committed Data and The Snapshot of Majority
No ratings yet
Whats The Difference of Majority Committed Data and The Snapshot of Majority
1 page
Mongodb
No ratings yet
Mongodb
9 pages
Mongodb (Cont.) : Excerpts From "The Little Mongodb Book" Karl Seguin
No ratings yet
Mongodb (Cont.) : Excerpts From "The Little Mongodb Book" Karl Seguin
37 pages
Ipv6 Hardening Guide For Windows Servers
No ratings yet
Ipv6 Hardening Guide For Windows Servers
21 pages
M03 - HOL Complex Data Relationships
No ratings yet
M03 - HOL Complex Data Relationships
35 pages
A. Im, G. Cai, H. Tunc, J. Stevens, Y. Barve, S. Hei Vanderbilt University
No ratings yet
A. Im, G. Cai, H. Tunc, J. Stevens, Y. Barve, S. Hei Vanderbilt University
81 pages
Subprograms and Packages Subprograms (Procedures and Functions)
No ratings yet
Subprograms and Packages Subprograms (Procedures and Functions)
4 pages
Android Content Providers
No ratings yet
Android Content Providers
31 pages
Exercise11 DocumentStores
No ratings yet
Exercise11 DocumentStores
11 pages
Database Terminology: Relational Databases Terms Glossary-Bonus Resource
No ratings yet
Database Terminology: Relational Databases Terms Glossary-Bonus Resource
8 pages
02-Common Data Service Lab Manual
No ratings yet
02-Common Data Service Lab Manual
51 pages
04-Power Automate Lab Manual
0% (1)
04-Power Automate Lab Manual
29 pages
M02 - HOL Reusable Components
No ratings yet
M02 - HOL Reusable Components
28 pages
M04 - HOL Embedded Canvas
No ratings yet
M04 - HOL Embedded Canvas
21 pages
Prefer Embedding: Document Schema Design Cheatsheet
No ratings yet
Prefer Embedding: Document Schema Design Cheatsheet
1 page
Database Upgrade Process
No ratings yet
Database Upgrade Process
3 pages
Performance of Graph Query Languages: Comparison of Cypher, Gremlin and Native Access in Neo4j
No ratings yet
Performance of Graph Query Languages: Comparison of Cypher, Gremlin and Native Access in Neo4j
10 pages
Richardson Maturity Model
No ratings yet
Richardson Maturity Model
12 pages
hw4 mongoDB
No ratings yet
hw4 mongoDB
5 pages
01-Power Apps Canvas App Lab Manual
No ratings yet
01-Power Apps Canvas App Lab Manual
49 pages
Fundamentals of Continuous Integration: Jenkins
No ratings yet
Fundamentals of Continuous Integration: Jenkins
7 pages
Mongo DB Exercise
No ratings yet
Mongo DB Exercise
45 pages
Python Code: Mysql - Connector Time Datetime
No ratings yet
Python Code: Mysql - Connector Time Datetime
5 pages
How To Reduce DB File Sequential Read Wait
No ratings yet
How To Reduce DB File Sequential Read Wait
5 pages
Assignment On SQL
No ratings yet
Assignment On SQL
6 pages
00-AppInADay Lab Overview
No ratings yet
00-AppInADay Lab Overview
8 pages
Role of Software Readability On Software Development Cost: Collare@wcsu - Edu rvalerdi@MIT - EDU
No ratings yet
Role of Software Readability On Software Development Cost: Collare@wcsu - Edu rvalerdi@MIT - EDU
3 pages
MongoDB Data Modeling - Sample Chapter
No ratings yet
MongoDB Data Modeling - Sample Chapter
40 pages
DB2 For i5/OS: V6R1 Overview
No ratings yet
DB2 For i5/OS: V6R1 Overview
24 pages
FoxPro Tutorial Santosh Sir
No ratings yet
FoxPro Tutorial Santosh Sir
5 pages
Mongo DB
No ratings yet
Mongo DB
8 pages
How To Create A Stellar: UX/UI Portfolio
No ratings yet
How To Create A Stellar: UX/UI Portfolio
14 pages
ICT Lesson 7 Notes
No ratings yet
ICT Lesson 7 Notes
14 pages
hw4 Mongodb
No ratings yet
hw4 Mongodb
5 pages
Little Mongodb Schema Book
No ratings yet
Little Mongodb Schema Book
27 pages
Employee Table: Practical QUERIES (1-10)
No ratings yet
Employee Table: Practical QUERIES (1-10)
7 pages
MongoDB Schema Design Basics
100% (2)
MongoDB Schema Design Basics
51 pages
Microsoft Zero Trust Maturity Model - Oct 2019
No ratings yet
Microsoft Zero Trust Maturity Model - Oct 2019
7 pages
DBMS Assignment
100% (1)
DBMS Assignment
11 pages
The Art of Debugging with GDB, DDD, and Eclipse
From Everand
The Art of Debugging with GDB, DDD, and Eclipse
Norman Matloff
3.5/5 (6)
Prompt to Profit: AI Patterns That Give Solo Builders an Unfair Advantage
From Everand
Prompt to Profit: AI Patterns That Give Solo Builders an Unfair Advantage
Lucas Merritt
No ratings yet
JavaScript for Kids: Start Your Coding Adventure
From Everand
JavaScript for Kids: Start Your Coding Adventure
Abdelfattah Ragab
No ratings yet
Mastering Node.js Web Development: Go on a comprehensive journey from the fundamentals to advanced web development with Node.js
From Everand
Mastering Node.js Web Development: Go on a comprehensive journey from the fundamentals to advanced web development with Node.js
Adam Freeman
No ratings yet
Learn MongoDB in 24 Hours
From Everand
Learn MongoDB in 24 Hours
Alex Nordeen
5/5 (2)
SQLite Database Programming for Xamarin: Cross-platform C# database development for iOS and Android using SQLite.XM
From Everand
SQLite Database Programming for Xamarin: Cross-platform C# database development for iOS and Android using SQLite.XM
Anthony Serpico
No ratings yet
The Easiest Way to Learn Design Patterns
From Everand
The Easiest Way to Learn Design Patterns
Fiodar Sazanavets
No ratings yet
Elements of Android Room
From Everand
Elements of Android Room
Mark Murphy
No ratings yet

Mongodb Schema Design Part 1

Uploaded by

Mongodb Schema Design Part 1

Uploaded by

| Blog Home News Applied Developer QuickStart Updates Culture Events Mark Loves Tech All  Search

6 Rules of Thumb for MongoDB Schema

By William Zola, Lead Technical Support Engineer at MongoDB

Basics: Modeling One-to-Few

Each Part would have its own document:

// Fetch the Product document identified by this catalog number

// find the parent ‘host’ document

What is the cardinality of the relationship: is it one-to-few; one-to-many; or one-to-squillions?

Part 2: Two-way referencing and denormalization

Part 3: Your guide through the rainbow

Thinking in Documents (recorded webinar)

Schema Design for Time-Series Data (recorded webinar)

Get Started with MongoDB Atlas

Dwight Merriman Named EY Accelerate App Delivery with

EY recently announced that Co-founder The phrase “digital transformation” is

May 27, 2014 September 29, 2021

Resources Education & Support Popular Topics About Follow Us

MongoDB Architecture Guide Certification MongoDB on Google Cloud Leadership Github

© 2021 MongoDB, Inc.

You might also like