Module-3
Syllabus :
I. MongoDB :
• Querying and Indexing: MongoDB supports rich query capabilities, including ad-hoc
queries, indexing, and aggregation pipelines. It has a flexible query language that allows
for complex searches and supports various types of indexes for optimizing query
performance.
• High Availability and Fault Tolerance: MongoDB provides built-in replication,
which allows data to be automatically synchronized across multiple servers, ensuring
data redundancy and high availability. In case of server failures, MongoDB can
seamlessly failover to a replica set member.
• Flexible Data Model: MongoDB's document-oriented data model allows for flexible
and dynamic schemas, accommodating evolving data structures without requiring
downtime or complex migrations. It is well-suited for scenarios where data has varied
attributes or evolving requirements.
• Integration and Ecosystem: MongoDB has a rich ecosystem with support for various
programming languages and frameworks. It provides drivers and connectors for
popular programming languages, making it easy to integrate MongoDB with
applications.
RDBMS vs. MongoDB
Row vs. Document: In an RDBMS, each row in a table represents a single record with a fixed set of columns and data types. In MongoDB, each document in a collection is a JSON-like object with a dynamic schema, allowing for varying fields within the same collection.
Column vs. Field: In an RDBMS, columns represent the individual data attributes or fields of a table. In MongoDB, fields are the equivalent data attributes of a document in a collection.
JOIN: In an RDBMS, a JOIN operation combines rows from two or more tables based on a related column between them. MongoDB does not support JOINs directly since it follows a denormalized data model; instead, data is often embedded within documents to avoid JOIN operations.
• A MongoDB query is a way to get data from the MongoDB database. MongoDB queries make the process of fetching data from the database simple, and they are similar in purpose to SQL queries in an SQL database.
• While performing a query operation, one can also use criteria or conditions to retrieve specific data from the database.
• MongoDB stores data as documents made up of field:value pairs rather than in tabular form.
• It stores data in BSON (Binary JSON) format, which is structured just like JSON. A sample document looks like this:
{
    "_id" : ObjectId("6009585d35cce6b7b8f087f1"),
    "title" : "Math",
    "author" : "Aditya",
    "level" : "basic",
    "length" : 230,
    "example" : 11
}
example
> use sampleDB
switched to db sampleDB
> db.createCollection("mycol")
{ "ok" : 1 }
>
1. Find:
The find() method is used to retrieve data from a collection based on specified criteria.
You can specify the filter conditions as a JSON object in the find() method to retrieve
documents that match the criteria.
Syntax
> db.COLLECTION_NAME.find()
• > db.mycol.find()
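For example, filter conditions go inside find() as a document; assuming the documents carry a "by" field (illustrative, not from the source), the following returns only the matching documents:
> db.mycol.find({"by": "tutorials point"})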
pretty()
To display the results in a formatted way, you can use pretty() method.
Syntax
>db.COLLECTION_NAME.find().pretty()
• > db.mycol.find().pretty()
findOne()
o Apart from the find() method, there is findOne() method, that returns only one
document.
Syntax
o >db.COLLECTION_NAME.findOne()
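For example, assuming a "title" field, the following returns the first matching document:
> db.mycol.findOne({"title": "Math"})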
Equal filter query
o The equality operator($eq) is used to match the documents where the value of the field
is equal to the specified value. In other words, the $eq operator is used to specify the
equality condition.
Syntax:
{ <field>: { $eq: <value> } }
Example :
o >db.article.find({author:{$eq:"devil"}}).pretty()
Comparison Operators:
MongoDB supports various comparison operators such as $eq, $ne, $gt, $lt, $gte, and $lte to
compare values in queries.
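As an illustration, assuming documents with a numeric "length" field (as in the sample document above), comparison operators can be combined in a single filter:
> db.mycol.find({"length": {$gt: 100, $lte: 230}})
This matches documents whose length is greater than 100 and at most 230.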
Logical Operators:
MongoDB provides logical operators like $and, $or, and $not to combine multiple conditions
in a query.
AND in MongoDB
In MongoDB, the AND logical operator ($and) is used to combine multiple conditions in a query to retrieve documents that satisfy all of the specified conditions simultaneously.
Syntax
{ $and: [ { <expression1> }, { <expression2> }, ... ] }
The following example will show all the tutorials written by 'tutorials point' and whose title is 'MongoDB Overview' (assuming the author is stored in a "by" field):
> db.mycol.find({$and: [{"by": "tutorials point"}, {"title": "MongoDB Overview"}]}).pretty()
OR in MongoDB
In MongoDB, the OR logical operator ($or) is used to retrieve documents that satisfy at least one of the specified conditions.
Syntax
{ $or: [ { <expression1> }, { <expression2> }, ... ] }
The following example will show all the tutorials written by 'tutorials point' or whose title is 'MongoDB Overview':
> db.mycol.find({$or: [{"by": "tutorials point"}, {"title": "MongoDB Overview"}]}).pretty()
NOT in MongoDB
In MongoDB, the NOT logical operator ($not) inverts the effect of an operator expression and retrieves documents that do not match it.
Syntax
{ <field>: { $not: { <operator-expression> } } }
The following example will retrieve the document(s) whose age is not greater than 25 (assuming an "age" field):
> db.mycol.find({"age": {$not: {$gt: 25}}})
NOR operator
In MongoDB, the NOR logical operator is used to combine multiple conditions in a query to
retrieve documents that do not satisfy any of the specified conditions.
Syntax
{ $nor: [ { <expression1> }, { <expression2> }, ... ] }
• Now, let's say we want to find products that do not have the name "Product A" and whose price is not less than 20. Assuming a products collection with "name" and "price" fields, we can use the $nor operator as follows:
> db.products.find({$nor: [{"name": "Product A"}, {"price": {$lt: 20}}]})
Projection:
Projection specifies which fields to include or exclude from the query result. It is passed as the second argument to the find() method (in the aggregation pipeline, the equivalent stage is $project), with 1 marking a field for inclusion and 0 for exclusion.
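For example, the following returns only the title of each document, suppressing the default _id field:
> db.mycol.find({}, {"title": 1, "_id": 0})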
Sorting:
The sort() method is used to sort the query result based on one or more fields.
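For example, 1 selects ascending and -1 descending order:
> db.mycol.find().sort({"title": -1})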
Limiting:
The limit() method is used to restrict the number of documents returned by a query.
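For example, to return at most two documents:
> db.mycol.find().limit(2)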
Aggregation:
MongoDB supports the Aggregation Framework, which allows you to perform advanced data
processing and transformation tasks, including grouping, filtering, and computing aggregate
functions.
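As a small sketch (assuming each document stores its author in a "by" field, which is illustrative), the following pipeline counts documents per author:
> db.mycol.aggregate([{$group: {_id: "$by", total: {$sum: 1}}}])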
Insert:
To insert data into a collection, you use the insertOne() or insertMany() method.
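For example:
> db.mycol.insertOne({"title": "MongoDB Overview", "by": "tutorials point"})
> db.mycol.insertMany([{"title": "NoSQL Basics"}, {"title": "Aggregation"}])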
Update:
The updateOne() and updateMany() methods are used to update existing documents in a
collection based on specified criteria.
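For example, $set updates only the listed fields of the first matching document:
> db.mycol.updateOne({"title": "MongoDB Overview"}, {$set: {"level": "advanced"}})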
Delete:
To delete documents from a collection, you can use the deleteOne() or deleteMany() method.
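For example:
> db.mycol.deleteOne({"title": "MongoDB Overview"})
> db.mycol.deleteMany({"level": "basic"})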
The MongoDB query language is designed to be intuitive and easy to use. It provides a wide
range of capabilities for retrieving and manipulating data, making it suitable for various use
cases, including real-time applications, analytics, and big data processing. MongoDB's query
language allows developers to efficiently work with both simple and complex data structures,
providing the flexibility needed for modern data-driven applications.
RDBMS Vs MongoDB

Retrieving Data
  RDBMS:   SELECT column1, column2 FROM table_name WHERE condition;
  MongoDB: db.collection_name.find({ field1: value1, field2: value2 }, { field1: 1, field2: 1 });

Inserting Data
  RDBMS:   INSERT INTO table_name (column1, column2) VALUES (value1, value2);
  MongoDB: db.collection_name.insertOne({ field1: value1, field2: value2 });

Updating Data
  RDBMS:   UPDATE table_name SET column1 = value1, column2 = value2 WHERE condition;
  MongoDB: db.collection_name.updateOne({ field: value }, { $set: { field1: value1, field2: value2 } });

Deleting Data
  RDBMS:   DELETE FROM table_name WHERE condition;
  MongoDB: db.collection_name.deleteOne({ field: value });

Aggregation (Grouping and Aggregating Data)
  RDBMS:   SELECT column1, COUNT(column2) FROM table_name GROUP BY column1;
  MongoDB: db.collection_name.aggregate([
             { $group: { _id: "$field1", count: { $sum: 1 } } },
             { $project: { _id: 0, field1: "$_id", count: 1 } }
           ]);

Sorting
  RDBMS:   SELECT column1, column2 FROM table_name ORDER BY column1 DESC, column2 ASC;
  MongoDB: db.collection_name.find({}).sort({ field1: -1, field2: 1 });

Limiting
  RDBMS:   SELECT column1, column2 FROM table_name LIMIT 10;
  MongoDB: db.collection_name.find({}).limit(10);
MapReduce: Mapper – Reducer – Combiner – Partitioner – Searching – Sorting –
Compression
The MapReduce algorithm contains two important tasks, namely Map and Reduce.
The Mapper class takes the input, tokenizes it, maps it, and sorts it. The output of the Mapper class is used as input by the Reducer class, which in turn searches for matching pairs and reduces them.
MapReduce implements various mathematical algorithms to divide a task into small parts and assign them to multiple systems. In technical terms, the MapReduce algorithm helps in sending the Map and Reduce tasks to appropriate servers in a cluster.
Mapper:
Input: The input data is divided into smaller chunks called input splits.
Mapper Function: The Mapper is responsible for processing these input splits independently.
It applies a user-defined function to each input split to produce a set of intermediate key-
value pairs. The key is typically used for grouping and sorting in the next phase (Shuffle and
sort).
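A minimal Hadoop Mapper sketch for illustration, assuming hypothetical input lines of the form "name,salary" (the class name and record layout are not from the source):

import java.io.IOException;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;

// Emits one <employee name, salary> pair per input line such as "gopal,50000"
public class SalaryMapper extends Mapper<LongWritable, Text, Text, IntWritable> {
    @Override
    protected void map(LongWritable key, Text value, Context context)
            throws IOException, InterruptedException {
        String[] parts = value.toString().split(",");
        if (parts.length == 2) { // skip malformed lines
            context.write(new Text(parts[0].trim()),
                    new IntWritable(Integer.parseInt(parts[1].trim())));
        }
    }
}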
Combiner:
Optional: The Combiner is an optional intermediate step that can be used to reduce the volume
of data transferred between the Mapper and Reducer phases. It performs a local aggregation of
the output from Mappers on each node before sending it to the Reducer. This is particularly
useful when the same key appears multiple times in the Mapper output.
The primary purpose of the combiner is to reduce the volume of data that needs to be transferred between the Mapper and the Reducer, improving the overall efficiency of the MapReduce job.
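Because taking a maximum is associative and commutative, the Reducer class itself can usually be reused as the combiner. A sketch of the driver-side wiring, assuming the hypothetical SalaryReducer sketched in the Reducer section below:

job.setCombinerClass(SalaryReducer.class); // local per-node aggregation before the shuffle

Note that Hadoop may run the combiner zero, one, or several times, so the job must produce correct results even if the combiner never runs.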
Partitioner:
Partitioning Logic: The Partitioner decides which Reducer will receive each intermediate key-
value pair from the Mapper. It is responsible for ensuring that all key-value pairs for a given
key are sent to the same Reducer. This is crucial for correct aggregation and computation during
the Reducer phase.
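By default, Hadoop uses HashPartitioner; the following sketch mirrors its standard behavior (a simplified illustration, not the source's code):

import org.apache.hadoop.mapreduce.Partitioner;

// Routes each key to a reducer; equal keys always land on the same reducer
public class SketchPartitioner<K, V> extends Partitioner<K, V> {
    @Override
    public int getPartition(K key, V value, int numReduceTasks) {
        return (key.hashCode() & Integer.MAX_VALUE) % numReduceTasks;
    }
}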
Reducer:
Grouping and Aggregation: The Reducer takes the intermediate key-value pairs produced by
the Mappers, groups them by key (based on the output of the Partitioner), and then applies a
user-defined Reduce function to each group. The result is typically written to an output file.
Shuffle and Sort: After the Mapper phase, there's a shuffle and sort step where the
MapReduce framework sorts and groups the intermediate key-value pairs before sending
them to the Reducers. This ensures that a Reducer processes all values associated with the same key together.
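A matching Reducer sketch for the hypothetical salary example: all salaries observed for one employee name arrive together, and keeping only the maximum also eliminates duplicate records:

import java.io.IOException;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Reducer;

// Receives <name, [salary, salary, ...]> and emits <name, max salary>
public class SalaryReducer extends Reducer<Text, IntWritable, Text, IntWritable> {
    @Override
    protected void reduce(Text key, Iterable<IntWritable> values, Context context)
            throws IOException, InterruptedException {
        int max = Integer.MIN_VALUE;
        for (IntWritable v : values) {
            max = Math.max(max, v.get());
        }
        context.write(key, new IntWritable(max));
    }
}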
Commonly implemented MapReduce algorithms include:
• Sorting
• Searching
• Indexing
• TF-IDF
Sorting:
Sorting is one of the basic MapReduce algorithms used to process and analyze data. MapReduce implements a sorting algorithm to automatically sort the output key-value pairs from the mapper by their keys.
Searching:
Searching plays an important role in MapReduce algorithm. It helps in the combiner phase
(optional) and in the Reducer phase. Let us try to understand how Searching works with the
help of an example.
Compression :
Data Compression: To optimize data transfer and storage, MapReduce frameworks often
support data compression. Intermediate data and output data can be compressed to reduce
disk I/O and network bandwidth usage.
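As a sketch of how this looks in Hadoop (the codec choices are illustrative; SnappyCodec needs the native Snappy library, and GzipCodec is a common alternative):

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.io.compress.CompressionCodec;
import org.apache.hadoop.io.compress.GzipCodec;
import org.apache.hadoop.io.compress.SnappyCodec;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class CompressionConfigSketch {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        // Compress intermediate map output to cut shuffle traffic
        conf.setBoolean("mapreduce.map.output.compress", true);
        conf.setClass("mapreduce.map.output.compress.codec",
                SnappyCodec.class, CompressionCodec.class);
        Job job = Job.getInstance(conf, "max-salary");
        // Compress the final job output to save disk space
        FileOutputFormat.setCompressOutput(job, true);
        FileOutputFormat.setOutputCompressorClass(job, GzipCodec.class);
    }
}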
Example
The following example shows how MapReduce employs the searching algorithm to find the details of the employee who draws the highest salary in a given employee dataset.
Let us assume we have employee data in four different files: A, B, C, and D. Let us also assume there are duplicate employee records in all four files because the employee data was imported repeatedly from all the database tables.
The Map phase processes each input file and provides the employee data in key-value pairs (<k, v> : <employee name, salary>).
The combiner phase (searching technique) will accept the input from the Map phase as key-value pairs of employee name and salary. Using the searching technique, the combiner will check all the employee salaries to find the highest-salaried employee in each file.
Reducer phase − From each file, you will find the highest-salaried employee. To avoid redundancy, check all the <k, v> pairs and eliminate duplicate entries, if any. The same algorithm is used between the four <k, v> pairs coming from the four input files. The final output should be as follows:
<gopal, 50000>
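A driver sketch that wires this example together, using the hypothetical SalaryMapper and SalaryReducer classes sketched earlier (files A, B, C, and D would all sit in the input directory):

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class MaxSalaryJob {
    public static void main(String[] args) throws Exception {
        Job job = Job.getInstance(new Configuration(), "max-salary");
        job.setJarByClass(MaxSalaryJob.class);
        job.setMapperClass(SalaryMapper.class);
        job.setCombinerClass(SalaryReducer.class); // per-file/per-node maxima
        job.setReducerClass(SalaryReducer.class);  // global maxima, duplicates removed
        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(IntWritable.class);
        FileInputFormat.addInputPath(job, new Path(args[0]));
        FileOutputFormat.setOutputPath(job, new Path(args[1]));
        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}

As written, this job yields the highest salary recorded for each employee; collapsing the result to the single overall winner (the <gopal, 50000> output above) would take one more step, for example mapping every record to a single constant key.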
Indexing
Normally, indexing is used to point to a particular piece of data and its address. It performs batch indexing on the input files for a particular Mapper. The indexing technique normally used in MapReduce is known as an inverted index. Search engines like Google and Bing use the inverted indexing technique. Let us try to understand how indexing works with the help of a simple example.
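As a small illustration (the documents are invented for this sketch): given T[0] = "it is what it is", T[1] = "what is it", and T[2] = "it is a banana", an inverted index maps each term to the documents that contain it, for example "banana" → {2}, "what" → {0, 1}, and "is" → {0, 1, 2}. In MapReduce terms, the Mapper emits <term, document id> pairs, and the Reducer collects, for each term, the list of documents in which it appears.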