FSD Unit III

This document provides an overview of MongoDB, a NoSQL database that utilizes a document model for data storage, emphasizing its advantages over traditional SQL databases. It covers essential concepts such as data modeling, collections, documents, data types, and operational practices including connecting from Node.js, data normalization, and performance considerations. Additionally, it discusses MongoDB's features like sharding, replication, and indexing to enhance performance and reliability.

Uploaded by

Adiba Fatima

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

112 views22 pages

FSD Unit III

Uploaded by

Adiba Fatima

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 22

Full Stack Development

UNIT - III

MongoDB:

Need of NoSQL, Understanding MongoDB, MongoDB Data Types, Planning

Your Data Model, Building the MongoDB Environment, Administering User
Accounts, Configuring Access Control, Administering Databases, Managing
Collections, Adding the MongoDB Driver to Node.js, Connecting to MongoDB
from Node.js, Understanding the Objects Used in the MongoDB Node.js Driver,
Accessing and Manipulating Databases, Accessing and Manipulating
Collections
Need of NoSQL
• Most large-scale web applications and services is a high-performance data storage
solution.

• The backend data store is responsible for storing everything from user account
information to shopping cart items to blog and comment data.

• Good web applications must store and retrieve data with accuracy, speed, and
reliability. Therefore, the data storage mechanism must perform at a level that
satisfies user demand.

• Several different data storage solutions are available to store and retrieve data
needed by web applications.

• The three most common are direct file system storage in files, relational databases,
and NoSQL databases.
Need of NoSQL
• The concept of NoSQL (Not Only SQL) consists of technologies that provide
storage and retrieval without the tightly constrained models of traditional SQL
relational databases.
• The motivation behind NoSQL is mainly simplified designs, horizontal scaling,
and finer control of the availability of data.
• NoSQL breaks away from the traditional structure of relational databases and
allows developers to implement models in ways that more closely fit the data
flow needs of their systems.
• This allows NoSQL databases to be implemented in ways that traditional
relational databases could never be structured.
• There are several different NoSQL technologies, such as HBase’s column
structure, Redis’s key/value structure, and Neo4j’s graph structure.
• MongoDB and the document model were chosen because of great flexibility and
scalability when it comes to implementing backend storage for web applications
and services.
• MongoDB is one of the most popular and well supported NoSQL databases
Understanding MongoDB
MongoDB
• MongoDB is a NoSQL database based on a document model where data objects
are stored as separate documents inside a collection.
• The motivation of the MongoDB language is to implement a data store that
provides high performance, high availability, and automatic scaling.
• MongoDB is simple to install and implement
Understanding Collections
• MongoDB groups data together through collections.
• A collection is simply a grouping of documents that have the same or a similar
purpose.
• A collection acts similarly to a table in a traditional SQL database, with one major
difference.
• In MongoDB, a collection is not enforced by a strict schema; instead, documents in
a collection can have a slightly different structure from one another as needed.
This reduces the need to break items in a document into several different tables,
which is often done in SQL implementations.
Understanding
Understanding Documents
MongoDB
• A document is a representation of a single entity of data in the MongoDB database.
• A collection is made up of one or more related objects. A major difference between
MongoDB and SQL is that documents are different from rows. Row data is flat, meaning
there is one column for each value in the row. In MongoDB, documents can contain
embedded subdocuments, thus providing a much closer inherent data model to your
applications.
• The records in MongoDB that represent documents are stored as BSON, which is a
lightweight binary form of JSON, with field:value pairs corresponding to JavaScript
property:value pairs. These field:value pairs define the values stored in the document.
For example, a document in MongoDB may be structured with the following fields:
{
name: "New Project",
version: 1,
languages: ["JavaScript", "HTML", "CSS"],
admin: {name: “CSE", password: ""},
paths: {temp: "/tmp", project: "/opt/project", html: "/opt/project/html"}
MongoDB Data Types
The document structure contains fields/properties that are strings, integers, arrays,
and objects.
The field names cannot contain null characters, . (dots), or $ (dollar signs). Also, the
_id field name is reserved for the Object ID. The _id field is a unique ID for the system
that consists of the following parts:
• A 4-byte value representing the seconds since the last epoch
• A 3-byte machine identifier
• A 2-byte process ID
• A 3-byte counter, starting with a random value
The maximum size of a document in MongoDB is 16MB
MongoDB Data Types
• The BSON data format provides several different types that are used when storing
the JavaScript objects to binary form. These types match the JavaScript type as
closely as possible.
• MongoDB assigns each of the data types an integer ID number from 1 to 255 that is
Type Number
Double 1
MongoDB Data Types
String 2
Object 3
Array 4
Binary data 5
Object id 7
Boolean 8
Date 9 MongoDB data types and corresponding ID number
Null 10
Regular Expression 11
JavaScript 13
JavaScript (with scope) 15
32-bit integer 16
Timestamp 17
64-bit integer 18
Decimal126 19
Min key -1
Max key 127
Planning Your Data Model
• Before you begin implementing a MongoDB database, you need to understand the
nature of the data being stored, how that data is going to get stored, and how it is
going to be accessed.
• What are the basic objects that my application will be using?
• What is the relationship between the different object types: one-to-one, one-
tomany,
• or many-to-many?
• How often will new objects be added to the database?
• How often will objects be deleted from the database?
• How often will objects be changed?
• How often will objects be accessed?
• How will objects be accessed: by ID, property values, comparisons, and so on?
• How will groups of object types be accessed: by common ID, common property
• value, and so on?
Normalizing Data with Document
References
• Data normalization is the process of organizing documents and collections to
minimize redundancy and dependency.

• Typically, this is used for objects that have a one-to many or many-to-many
relationship with subobjects.
• The advantage of normalizing data is that the database size will be smaller
because only a single copy of an object will exist in its own collection instead of
being duplicated on multiple objects in a single collection.

• Also, if you modify the information in the subobject frequently, you only need to
modify a single instance rather than every record in the object’s collection that
has that subobject.
• Major Disadvantage of Normalizing Data: Performance Hit
• When you normalize data in MongoDB (or any database), you separate
related information into different collections.For example:
• You store user info in the Users collection And store info in the FavoriteStores
collection .The Users collection just has a reference ID to link to the store
• If you want to see:
"Alice's name and her favorite store name and address"
MongoDB has to do two jobs:
• Look up Alice’s record in the Users collection
• Then use the favoriteStore ID to go fetch that store’s info from FavoriteStores
This second step is called a lookup (or a join in SQL terms).
If your app needs to show full user info very frequently, these extra lookups:
• Take more time, Use more memory, Slow down performance, especially if
there are thousands or millions of users
Denormalizing Data with Embedded
Documents
Denormalizing data means finding smaller parts of a main object and storing them directly inside that
main object’s document.
this is done on objects that have a mostly one-to-one relationship or are relatively small and do not get
updated frequently.
The major advantage of denormalized documents is that you can get the full object back in a single
lookup without the need to do additional lookups to combine subobjects from other collections.

Since all the information is already packed together, MongoDB doesn’t need to do extra lookups. This
means: Faster performance ,Fewer queries, You get everything in one go
Main Downside: More Space & Slower Writes
If many users share the same sub-data (like a company’s contact info), you are copying that data into
every user document.
More disk space used Slower insert/update operations (because the same data lives in many places)
eg)Let’s say you have a User who has both home and work contact info.
You could store it like this (denormalized – all in one document):
home and work are both embedded inside the user.
No separate collections. No reference IDs. Everything is right there.
capped collection,Understanding Atomic Write
Operations-textbook

• ADVANTAGES
• Capped collections keep documents in the same order they were
added.
• You don’t need an index to get documents in the order they were
stored — this saves extra work for the database.
• They don’t allow updates that make documents bigger, so the
documents stay in the same place on disk.
• This avoids the extra effort of moving documents around and tracking
their new locations.
When you update a document, think about whether the new data will make it
bigger.
MongoDB gives some extra space (padding) to handle small changes.
But if the document grows too much, MongoDB must move it to a new place on
the disk.
This slows down performance and can cause disk fragmentation (messy
storage).
Example: If you keep adding items to an array, the document might grow too
big.
To avoid this:
Use normalized objects for parts that grow a lot.
Instead of putting all cart items in an array inside a Cart document,
Create a separate CartItems collection.
Each cart item is a new document linked to the user's cart.
Indexes make frequent searches faster by creating a quick lookup system.
MongoDB automatically creates an index on the \_id field because it's commonly
used to find data.
You should also create extra indexes based on how users search your data.

Sharding means splitting big collections across different MongoDB servers (called
shards).
This helps handle large amounts of data and traffic, improving performance by
sharing the load (horizontal scaling).
Use sharding if your data is too big or gets lots of requests.

Replication means copying data to multiple MongoDB servers.

It makes sure your data is safe and available even if one server fails.
Replication helps with reliability and backup of important data.
🔹 Starting MongoDB
• After installing MongoDB, you use a program called mongod (or mongod.exe on Windows) to start it.
• This program runs the database and listens for requests from applications.
--help or -h Shows basic help info
• --auth Turns on user login security (authentication)
• --dbpath <path> Sets the folder where MongoDB stores data
🔹 Stopping MongoDB
• Best way: Stop it safely using the MongoDB shell client.
• First, switch to the admin database:
db.shutdownServer()
the two ways to run a MongoDB shell
1. Using `--eval` from the Command Line
* You can run a JavaScript command directly from your terminal (cmd or shell) using `--eval`.
Example:
mongo test --eval "printjson(db.getCollectionNames())"
2. Using `load()` from Inside the Mongo Shell
* You can run a `.js` file from inside the MongoDB shell using `load()`.
load("/tmp/db_update.js")
Connecting to MongoDB from Node.js
• Before connecting to and updating data on a MongoDB server, you need to decide
what level of write concern you want to implement on your connection.
• Write concern describes the guarantee that the MongoDB connection provides when
reporting on the success of a write operation. The strength of the write concern
determines the level of guarantee.
• A stronger write concern tells MongoDB to wait until the write has successfully been
written to disk completely before responding back,A weaker write concern means
MongoDB just plans to save the data soon, but doesn't actually wait to finish saving it
before saying "Done" and replying to you.

• Stronger write concern = More reliable, but slower

• Weaker write concern = Faster, but riskier
Connecting to MongoDB from Node.js Using the
MongoClient Object
MongoDB Connection URL Format
When connecting to MongoDB, you write a special kind of URL like this:
mongodb://username:password@host:port/database?
Using MongoClient with Options
You use the `MongoClient.connect()` method to connect with extra options like this:
MongoClient.connect('mongodb://localhost:27017/myDB',
{
connectTimeoutMS: 1000, // Wait max 1 sec before giving up
reconnectInterval: 500 // Try to reconnect every 500ms if lost
},
function(err, db) {
// Handle result here
});
Two Ways to Authenticate and Connect 2.Using the `db` Object After Connecting

1. In the Connection URL You can also connect without giving username and password in the
You can provide `username`, `password`, and `database` URL, and instead use `.authenticate()` after connecting:
right in the URL like this: client.connect(
client.connect( 'mongodb://localhost:27017',
{ poolSize: 5, reconnectInterval: 500 },
'mongodb://dbadmin:test@localhost:27017/testDB', function(err, db) {
{ poolSize: 5, reconnectInterval: 500 }, if (err) {
console.log("Failed");
function(err, db) {
} else {
if (err) console.log("Failed"); const testDB = db.db("testDB");
else { testDB.authenticate("dbadmin", "test", function(err, result) {
if (err) {
console.log("Connected!");
console.log("Authentication Failed");
db.logout(() => { db.close();
console.log("Logged out"); } else { Output
console.log("Authenticated!"); Connected Via Client
db.close(); db.logout(() => { Object ...
}); console.log("Logged out"); Authenticated Via
} db.close(); Client Object ...
}); Logged out Via Client
} } Object ...
); }); Connection closed ...

FIOT Unit-4
No ratings yet
FIOT Unit-4
36 pages
ML Unit - 3
No ratings yet
ML Unit - 3
23 pages
Hadoop Ecosystem and Their Components
No ratings yet
Hadoop Ecosystem and Their Components
19 pages
FIOT Unit-5
No ratings yet
FIOT Unit-5
24 pages
Unit1 ML
No ratings yet
Unit1 ML
23 pages
STM Unit-2
No ratings yet
STM Unit-2
72 pages
ML Unit-3
No ratings yet
ML Unit-3
24 pages
High Performance Techniques For Microsoft SQL Server PDF
100% (1)
High Performance Techniques For Microsoft SQL Server PDF
307 pages
Two Stage Job Title Identification-1
No ratings yet
Two Stage Job Title Identification-1
77 pages
Chap 11 12 - Practical Methodology and Applications - Heechul Lim
100% (1)
Chap 11 12 - Practical Methodology and Applications - Heechul Lim
60 pages
MC4411 Project Work - Format
No ratings yet
MC4411 Project Work - Format
65 pages
ML Unit 4
No ratings yet
ML Unit 4
34 pages
Machine Learning Unit 4
No ratings yet
Machine Learning Unit 4
28 pages
Android Interview Questions PDF
No ratings yet
Android Interview Questions PDF
24 pages
Oracle NOTES & Queries
No ratings yet
Oracle NOTES & Queries
605 pages
ML UNIT 2 Sir
No ratings yet
ML UNIT 2 Sir
46 pages
OS - Module 5 - Memory Management
No ratings yet
OS - Module 5 - Memory Management
81 pages
SQL Server Interview Questions Developers PDF
No ratings yet
SQL Server Interview Questions Developers PDF
142 pages
Mongodb Tutorial: Database Collection
No ratings yet
Mongodb Tutorial: Database Collection
36 pages
Unit 3
100% (1)
Unit 3
11 pages
BCT Techknowledge Want All Subjects Notes Pls
No ratings yet
BCT Techknowledge Want All Subjects Notes Pls
193 pages
DBMS Notes
No ratings yet
DBMS Notes
17 pages
UNIT-3 Javascript: Introduction Java Script
No ratings yet
UNIT-3 Javascript: Introduction Java Script
45 pages
Unit-3 (NLP)
No ratings yet
Unit-3 (NLP)
28 pages
Dbms Lab Manual Reg2021 24-25
No ratings yet
Dbms Lab Manual Reg2021 24-25
73 pages
Wa0001.
No ratings yet
Wa0001.
129 pages
Unit 3 1
No ratings yet
Unit 3 1
20 pages
ADA - Greedy Method
No ratings yet
ADA - Greedy Method
45 pages
ML-3-Decision Tree
No ratings yet
ML-3-Decision Tree
17 pages
UNIT V Application Layer
100% (1)
UNIT V Application Layer
18 pages
ML - CSA 301 - ML Perspective and Issues
No ratings yet
ML - CSA 301 - ML Perspective and Issues
34 pages
Applications of Context Free Grammars
No ratings yet
Applications of Context Free Grammars
23 pages
R Language
No ratings yet
R Language
59 pages
Concept Learning
No ratings yet
Concept Learning
85 pages
DBMS Notes From Rejinpaul
No ratings yet
DBMS Notes From Rejinpaul
195 pages
Unit-5 DS Notes
No ratings yet
Unit-5 DS Notes
19 pages
Practical Questions (Python PGM NO 1 To 21)
No ratings yet
Practical Questions (Python PGM NO 1 To 21)
61 pages
SM 6th-Sem Cse Internet-Of-Things
No ratings yet
SM 6th-Sem Cse Internet-Of-Things
76 pages
Ch-4 Ensemble Learning
No ratings yet
Ch-4 Ensemble Learning
18 pages
Unit 5
No ratings yet
Unit 5
23 pages
ML Module 2 New
No ratings yet
ML Module 2 New
36 pages
Unit-5 Alt
No ratings yet
Unit-5 Alt
15 pages
Unit 5
No ratings yet
Unit 5
17 pages
SQL Dbms
No ratings yet
SQL Dbms
20 pages
Dbms Practical Codes
No ratings yet
Dbms Practical Codes
26 pages
DBMS Record (16-05-2024)
No ratings yet
DBMS Record (16-05-2024)
41 pages
Relational Mapping
No ratings yet
Relational Mapping
56 pages
Streams
No ratings yet
Streams
37 pages
DL Unit-2 Notes PPT
No ratings yet
DL Unit-2 Notes PPT
39 pages
Unit 4 Knowledge Representation
No ratings yet
Unit 4 Knowledge Representation
13 pages
Chandigarh Group of Colleges College of Engineering Landran, Mohali
No ratings yet
Chandigarh Group of Colleges College of Engineering Landran, Mohali
47 pages
Data Analytics Unit III
No ratings yet
Data Analytics Unit III
15 pages
Data Mining and Business Intelligence Lab Manual
No ratings yet
Data Mining and Business Intelligence Lab Manual
52 pages
Oracle Backup & Recovery MCQs
No ratings yet
Oracle Backup & Recovery MCQs
20 pages
Lab Program
100% (1)
Lab Program
15 pages
WT Unit 3
No ratings yet
WT Unit 3
57 pages
CN Unit-3
No ratings yet
CN Unit-3
32 pages
NNDL Unit-1
No ratings yet
NNDL Unit-1
28 pages
Database Notes
No ratings yet
Database Notes
18 pages
Iot Lab NB
No ratings yet
Iot Lab NB
26 pages
Unit 2 AI
No ratings yet
Unit 2 AI
22 pages
STM Unit 5
No ratings yet
STM Unit 5
31 pages
Neural Network Unit - 4 - 221210 - 134739
No ratings yet
Neural Network Unit - 4 - 221210 - 134739
15 pages
Deep Learning r18 Jntuh Lab Manual
No ratings yet
Deep Learning r18 Jntuh Lab Manual
20 pages
Lecture 2.1.2activation Function
No ratings yet
Lecture 2.1.2activation Function
15 pages
Studocu DAA Unit 5 Notes
No ratings yet
Studocu DAA Unit 5 Notes
23 pages
Inserttt
No ratings yet
Inserttt
25 pages
Git 203 Assignment 1
No ratings yet
Git 203 Assignment 1
2 pages
NEURAL NETWORKS and Deep Learning: Going Deep About Neural Network
No ratings yet
NEURAL NETWORKS and Deep Learning: Going Deep About Neural Network
4 pages
S.E Notes
No ratings yet
S.E Notes
29 pages
Da Unit-2
No ratings yet
Da Unit-2
23 pages
Unit 5 - Compiler Design - WWW - Rgpvnotes.in
No ratings yet
Unit 5 - Compiler Design - WWW - Rgpvnotes.in
20 pages
Module 3 Games Optimal Decisions in Games Minimax Algorithm
No ratings yet
Module 3 Games Optimal Decisions in Games Minimax Algorithm
18 pages
Database Backup And: Recovery
No ratings yet
Database Backup And: Recovery
19 pages
Healthcare
No ratings yet
Healthcare
10 pages
BDA Unit 1
No ratings yet
BDA Unit 1
10 pages
MongoDB Why Documents
No ratings yet
MongoDB Why Documents
15 pages
ML Unit 1
No ratings yet
ML Unit 1
44 pages
Agriculture
No ratings yet
Agriculture
9 pages
Test Questions
No ratings yet
Test Questions
1 page
Formated Search
100% (2)
Formated Search
3 pages
Compte Rendu TP2: Gestion Des Droits Et Des Utilisateurs
No ratings yet
Compte Rendu TP2: Gestion Des Droits Et Des Utilisateurs
7 pages
Unit - IV - DIMENSIONALITY REDUCTION AND GRAPHICAL MODELS
No ratings yet
Unit - IV - DIMENSIONALITY REDUCTION AND GRAPHICAL MODELS
59 pages
Subiecte Examen Baze de Date Csie
No ratings yet
Subiecte Examen Baze de Date Csie
10 pages
Unit Iv Web Retrieval and Web Crawling 9
No ratings yet
Unit Iv Web Retrieval and Web Crawling 9
1 page
Conte Edited
No ratings yet
Conte Edited
2 pages
10987C - Performance Tuning and Optimising SQL Databases
No ratings yet
10987C - Performance Tuning and Optimising SQL Databases
4 pages
ROLAP
No ratings yet
ROLAP
4 pages
3.1 - BBDD No Relacionales para Web (Temario)
No ratings yet
3.1 - BBDD No Relacionales para Web (Temario)
2 pages
CP5191 Machine Learning Techniques L T P C3 0 0 3
No ratings yet
CP5191 Machine Learning Techniques L T P C3 0 0 3
7 pages
Guideline in Configuring Ozeki
No ratings yet
Guideline in Configuring Ozeki
2 pages
12 Step Query Tuning Oracle IG
No ratings yet
12 Step Query Tuning Oracle IG
1 page
QTP Database Scripting
No ratings yet
QTP Database Scripting
7 pages
Db2 History File
No ratings yet
Db2 History File
4 pages
Consultas de BD Northwind Por Jrodrigo Ramirez - FISI - UNAP
100% (1)
Consultas de BD Northwind Por Jrodrigo Ramirez - FISI - UNAP
2 pages
Textbook of Engineering Chemistry
From Everand
Textbook of Engineering Chemistry
C. Parameswara Murthy
No ratings yet

FSD Unit III

Uploaded by

FSD Unit III

Uploaded by

Full Stack Development

Need of NoSQL, Understanding MongoDB, MongoDB Data Types, Planning

Replication means copying data to multiple MongoDB servers.

• Stronger write concern = More reliable, but slower

You might also like