NoSql Module 1 Part2


MODULE 1

NoSQL 24-06-2022
More Details on Data Models

Relationships

The connections between entities in a data model are called relationships, and relationships
reflect business rules.
Relationships between entities can be one-to-one, one-to-many, or many-to-many. The
relationship between vendors and products can illustrate the one-to-many case, for example
when each product comes from one vendor and a vendor supplies many products.

A common misconception is that NoSQL (non-relational) databases don’t store relationship
data well.

NoSQL databases can store relationship data; they just store it differently than relational
databases do.

In fact, many find modeling relationship data in NoSQL databases easier than in relational
databases, because related data doesn’t have to be split between tables.

NoSQL data models allow related data to be nested within a single data structure. An
important aspect of relationships between aggregates is how they handle updates.
Aggregate-oriented databases treat the aggregate as the unit of data retrieval. Consequently,
atomicity is supported only within the contents of a single aggregate. If you update multiple
aggregates at once, you have to handle a failure partway through yourself. Relational
databases help you with this by allowing you to modify multiple records in a single
transaction, providing ACID guarantees while altering many rows. All of this means that
aggregate-oriented databases become more awkward as you need to operate across multiple
aggregates.
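
The failure case described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `store`, `put`, and `transfer` are hypothetical names for an in-memory stand-in, not the API of any real database, and the customer data is invented.

```python
import copy

# Hypothetical in-memory stand-in for an aggregate-oriented store:
# each key maps to one aggregate, and a single put() is the only atomic step.
store = {
    "customer:1": {"name": "Ann", "balance": 100},
    "customer:2": {"name": "Dan", "balance": 50},
}


def put(key, aggregate):
    """Replace one whole aggregate; assumed atomic, like a single-key write."""
    store[key] = copy.deepcopy(aggregate)


def transfer(src, dst, amount, fail_midway=False):
    """Update two aggregates; nothing ties the two writes together."""
    source = copy.deepcopy(store[src])
    target = copy.deepcopy(store[dst])
    source["balance"] -= amount
    target["balance"] += amount

    put(src, source)                      # first aggregate is now updated
    if fail_midway:
        raise RuntimeError("simulated crash between the two writes")
    put(dst, target)                      # never reached when the crash occurs


try:
    transfer("customer:1", "customer:2", 30, fail_midway=True)
except RuntimeError:
    # The application has to detect and repair the half-applied update itself.
    pass

print(store["customer:1"]["balance"], store["customer:2"]["balance"])
```

After the simulated crash the first aggregate has changed and the second has not, which is exactly the inconsistency a multi-row relational transaction would have rolled back.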

Graph databases
Graph databases are an odd fish in the NoSQL pond.
Most NoSQL databases were inspired by the need to run on clusters, which led to
aggregate-oriented data models of large records with simple connections.
Graph databases are motivated by a different frustration with relational databases and thus
have an opposite model: small records with complex interconnections.

By:Yojana Kiran Kumar,Asst. Professor,Dept of BVOC ,SDM


Graph databases specialize in capturing information made up of many small, interconnected
records, and on a much larger scale than a readable diagram could capture.
This is ideal for any data consisting of complex relationships, such as social
networks, product preferences, or eligibility rules.
The fundamental data model of a graph database is very simple: nodes connected by edges
(also called arcs).
Beyond this essential characteristic there is a lot of variation in data models, in particular
in the mechanisms you have for storing data in your nodes and edges.
A quick sample of some current capabilities illustrates this variety:
∙ FlockDB is simply nodes and edges with no mechanism for additional attributes.
∙ Neo4J allows you to attach Java objects as properties to nodes and edges in a schemaless
fashion.
∙ Infinite Graph stores your Java objects, which are subclasses of its built-in types, as nodes
and edges.
Once the graph is built, a graph database allows you to query that network with query
operations designed for graphs. Graph databases are purpose-built to store and navigate
relationships. Relationships are first-class citizens in graph databases, and most of the value
of graph databases is derived from these relationships. Graph databases use nodes to store
data entities, and edges to store relationships between entities. An edge always has a start
node, end node, type, and direction, and an edge can describe parent-child relationships,
actions, ownership, and the like. There is no limit to the number and kind of relationships a
node can have.

A graph in a graph database can be traversed along specific edge types or across the entire
graph. In graph databases, traversing the joins or relationships is very fast because the
relationships between nodes are not calculated at query time but are persisted in the database.
Graph databases have advantages for use cases such as social networking, recommendation
engines, and fraud detection, where you need to create relationships between data and quickly
query these relationships.

Nodes are the entities in the graph.

∙ Nodes can be tagged with labels, representing their different roles in your domain
(for example, Person).
∙ Nodes can hold any number of key-value pairs, or properties (for example, name).
∙ Node labels may also attach metadata (such as index or constraint information) to
certain nodes.

Relationships provide directed, named connections between two node entities (e.g.
Person LOVES Person).

∙ Relationships always have a direction, a type, a start node, and an end node, and they
can have properties, just like nodes.
∙ Nodes can have any number or type of relationships without sacrificing performance.
∙ Although relationships are always directed, they can be navigated efficiently in any
direction.
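
The node and relationship structure listed above can be sketched in plain Python. This is a minimal illustration, not how any particular graph database stores its data: the `Node` and `Relationship` classes, the `relate` helper, and the sample people are all assumptions made for the example.

```python
from dataclasses import dataclass, field


@dataclass(eq=False)
class Node:
    labels: set                                      # roles, e.g. {"Person"}
    properties: dict = field(default_factory=dict)   # key-value pairs
    outgoing: list = field(default_factory=list)     # relationships that start here
    incoming: list = field(default_factory=list)     # relationships that end here


@dataclass(eq=False)
class Relationship:
    type: str                                        # e.g. "LOVES"
    start: Node
    end: Node
    properties: dict = field(default_factory=dict)


def relate(start, rel_type, end, **properties):
    """Create a directed relationship and register it on both end points."""
    rel = Relationship(rel_type, start, end, properties)
    start.outgoing.append(rel)
    end.incoming.append(rel)
    return rel


ann = Node({"Person"}, {"name": "Ann"})
dan = Node({"Person"}, {"name": "Dan"})
relate(ann, "LOVES", dan, since=2020)

# Follow the relationship in its own direction ...
loved_by_ann = [r.end.properties["name"] for r in ann.outgoing if r.type == "LOVES"]
# ... and against it, without scanning the whole graph.
who_loves_dan = [r.start.properties["name"] for r in dan.incoming if r.type == "LOVES"]
print(loved_by_ann, who_loves_dan)
```

Registering each relationship on both of its end nodes is what makes navigation cheap in either direction, even though the relationship itself has a single direction.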

Schemaless Databases

Because NoSQL databases are designed to store and query unstructured data, they do not
require the same rigid schemas used by relational databases.

Although a schema can be applied at the application level, NoSQL databases retain all of
your unstructured data in its original raw format.

This means that full granularity is retained even if you later change your application
schema, which is much harder with a traditional SQL database, where a schema change
usually requires migrating the stored data.

As a NoSQL database, MongoDB is considered schemaless because it does not require a
rigid, pre-defined schema like a relational database.

The database management system (DBMS) enforces a partial schema as data is written,
explicitly listing collections and indexes.

The applications you use to leverage data stored in MongoDB will enforce a much stricter
dynamically typed schema as documents are read from the database.
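
The idea of a schema enforced by the application at read time can be sketched as follows. This is an illustration only: the `customers` list stands in for a collection, `read_customer` is a hypothetical helper, and no MongoDB driver is involved.

```python
# Hypothetical "collection": a plain list of documents with no enforced schema.
customers = [
    {"_id": 1, "name": "Ann", "email": "ann@example.com"},
    {"_id": 2, "name": "Dan", "phones": ["555-0100"]},   # different fields
    {"_id": 3, "full_name": "Eve"},                       # older field name
]


def read_customer(doc):
    """Apply the application's implicit schema when a document is read."""
    name = doc.get("name") or doc.get("full_name")
    if not isinstance(name, str):
        raise ValueError(f"document {doc.get('_id')} has no usable name")
    return {
        "id": doc["_id"],
        "name": name,
        "email": doc.get("email"),               # optional, may be None
        "phones": list(doc.get("phones", [])),   # optional, defaults to empty
    }


parsed = [read_customer(doc) for doc in customers]
print([c["name"] for c in parsed])
```

The store accepted three differently shaped documents without complaint; the stricter expectations live in `read_customer`, which is where a document with no usable name would be rejected.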

∙ Greater flexibility over data types

By operating without a schema, schemaless databases can store, retrieve, and query
any data type, which suits big data analytics and similar operations that are powered
by unstructured data. Relational databases apply rigid schema rules to data, limiting
what can be stored.

∙ No pre-defined database schemas

The lack of schema means that your NoSQL database can accept any data type —
including those that you do not yet use. This future-proofs your database, allowing it
to grow and change as your data-driven operations change and mature.

∙ No data truncation

A schemaless database makes almost no changes to your data; each item is saved in its
own document with a partial schema, leaving the raw information untouched. This
means that every detail is always available and nothing is stripped to match the current
schema. This is particularly valuable if your analytics needs change at some point in
the future.

∙ Suitable for real-time analytics functions

With the ability to process unstructured data, applications built on NoSQL databases
are better able to process real-time data, such as readings and measurements from IoT
sensors. Schemaless databases are also ideal for use with machine learning and
artificial intelligence operations, helping to accelerate automated actions in your
business.

∙ Enhanced scalability and flexibility

With NoSQL, you can use whichever data model is best suited to the job. Graph
databases allow you to view relationships between data points, or you can use
traditional wide table views with an exceptionally large number of columns. You can
query, report, and model information however you choose. And as your requirements
grow, you can keep adding nodes to increase capacity and power.

By contrast, when a record is saved to a relational database, anything (particularly
metadata) that does not match the schema is truncated or removed. Because these details are
discarded at write time, they cannot be recovered later.

Materialized Views

A materialized view is a database object that contains the results of a query.
For example, it may be a local copy of data located remotely, a subset of the rows
and/or columns of a table or join result, or a summary computed with an aggregate function.
Views:
A view is a virtual relation that acts as an actual relation. It is not part of the logical
relational model of the database system. The tuples of a view are not stored in the database
system; they are generated every time the view is accessed. Only the query expression of the
view is stored in the database system.
Views can be used wherever an actual relation can be used. Views can be used to
create custom virtual relations according to the needs of a specific user. We can create as
many views as we want in a database system.

Materialized Views:
When the results of a view expression are stored in the database system, they are called
materialized views. SQL does not provide a standard way of defining materialized views;
however, some database management systems provide custom extensions for them.
The process of keeping materialized views up to date is known as view maintenance.
A database system uses one of three ways to keep a materialized view updated:

∙ Update the materialized view as soon as the relation on which it is defined is
updated.
∙ Update the materialized view every time the view is accessed.
∙ Update the materialized view periodically.

A materialized view is useful when the view is accessed frequently, because the results are
stored in the database beforehand and the computation time is saved. A materialized view can
also help when the relation on which the view is defined is very large and the resulting
relation of the view is very small. A materialized view has storage cost and update overhead
associated with it.
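
The difference between a view and a materialized view can be tried out with Python's built-in `sqlite3` module. SQLite has no materialized views, so this sketch emulates one with an ordinary table that is refreshed by full recomputation (the periodic strategy); the table names, the helper functions, and the sample rows are assumptions made for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE orders (customer TEXT, amount INTEGER);
    INSERT INTO orders VALUES ('ann', 10), ('ann', 20), ('dan', 5);

    -- Ordinary view: only the query expression is stored.
    CREATE VIEW totals_view AS
        SELECT customer, SUM(amount) AS total FROM orders GROUP BY customer;

    -- Emulated materialized view: the result tuples are stored in a table.
    CREATE TABLE totals_mat AS SELECT * FROM totals_view;
""")


def refresh_materialized(connection):
    """View maintenance by full recomputation of the stored result."""
    with connection:  # commits on success, rolls back on error
        connection.execute("DELETE FROM totals_mat")
        connection.execute("INSERT INTO totals_mat SELECT * FROM totals_view")


def total(connection, relation, customer):
    """Read one customer's total; `relation` is a trusted, hard-coded name."""
    row = connection.execute(
        f"SELECT total FROM {relation} WHERE customer = ?", (customer,)
    ).fetchone()
    return row[0]


conn.execute("INSERT INTO orders VALUES ('ann', 70)")

fresh = total(conn, "totals_view", "ann")      # recomputed on access
stale = total(conn, "totals_mat", "ann")       # still the stored result
refresh_materialized(conn)
refreshed = total(conn, "totals_mat", "ann")   # stored result after maintenance
print(fresh, stale, refreshed)
```

The view reflects the new order immediately because it is recomputed on every access, while the stored copy stays out of date until `refresh_materialized` runs. That is the storage and update cost the text describes.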
Differences between Views and Materialized Views:

| Views | Materialized Views |
|---|---|
| The query expression is stored in the database system, not the resulting tuples of the query expression. | The resulting tuples of the query expression are stored in the database system. |
| Views need not be updated every time the relation on which the view is defined is updated, because the tuples of the view are computed every time the view is accessed. | Materialized views must be updated because their tuples are stored in the database system. They are updated in one of the three ways mentioned above, depending on the database system. |
| It does not have any storage cost associated with it. | It does have a storage cost associated with it. |
| It does not have any update cost associated with it. | It does have an update cost associated with it. |
| There is an SQL standard for defining a view. | There is no SQL standard for defining a materialized view; the functionality is provided by some database systems as an extension. |
| Views are useful when the view is accessed infrequently. | Materialized views are efficient when the view is accessed frequently, as they save computation time by storing the results beforehand. |

Modeling for Data Access

When modeling data aggregates, we need to consider how the data is going to be read as well
as the side effects on data related to those aggregates. Let’s start with the model
where all the data for the customer is embedded in a single value in a key-value store.

In this scenario, the application can read the customer’s information and all the related data
by using the key. If the requirements are to read the orders or the products sold in each order,
the whole object has to be read and then parsed on the client side to build the results.
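
A small Python sketch of the embedded model, assuming a hypothetical key-value store (`kv_store`) whose values are opaque JSON strings. The helper name and the sample customer, orders, and prices are invented for illustration.

```python
import json

# Hypothetical key-value store: the store sees each value as an opaque string.
kv_store = {
    "customer:1": json.dumps({
        "name": "Ann",
        "orders": [
            {"orderId": 99,
             "items": [{"product": "Refactoring Databases", "price": 40}]},
            {"orderId": 100,
             "items": [{"product": "NoSQL Distilled", "price": 35}]},
        ],
    })
}


def products_per_order(key):
    """Read the whole aggregate by key, then parse it on the client side."""
    customer = json.loads(kv_store[key])      # the entire value is fetched
    return {
        order["orderId"]: [item["product"] for item in order["items"]]
        for order in customer["orders"]
    }


print(products_per_order("customer:1"))
```

Even though only product names are wanted, the full customer value is fetched and decoded, because the store can look up by key but cannot look inside the value.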

When references are needed, we could switch to document stores and then query inside the
documents, or even change the data for the key-value store to split the value object into
Customer and Order objects and then maintain these objects’ references to each other.

With the references we can now find the orders independently from the Customer, and with
the orderId reference in the Customer we can find all Orders for the Customer. Using
aggregates this way allows for read optimization, but we have to push the orderId reference
into the Customer every time a new Order is created.
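
The split into Customer and Order aggregates with references can be sketched like this. The `kv` dictionary, the key naming scheme, and both helper functions are assumptions made for the example, not part of any particular store.

```python
# Hypothetical key-value store with Customer and Order as separate aggregates.
kv = {
    "customer:1": {"name": "Ann", "orders": [99]},
    "order:99": {"customerId": 1, "items": ["Refactoring Databases"]},
}


def place_order(customer_id, order_id, items):
    """Write the Order, then push its id into the Customer aggregate."""
    kv[f"order:{order_id}"] = {"customerId": customer_id, "items": list(items)}
    # Second write: without it the Customer would not know about the Order.
    kv[f"customer:{customer_id}"]["orders"].append(order_id)


def orders_for(customer_id):
    """Follow the orderId references stored in the Customer."""
    customer = kv[f"customer:{customer_id}"]
    return [kv[f"order:{order_id}"] for order_id in customer["orders"]]


place_order(1, 100, ["NoSQL Distilled"])
print(kv["order:100"]["customerId"])             # an Order read on its own
print([order["items"] for order in orders_for(1)])
```

Reads become cheaper because an Order can be fetched directly by its own key, but every new order now costs two writes, and those two writes touch different aggregates.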

Aggregates can also be used to obtain analytics; for example, an aggregate update may fill in
information on which Orders have a given Product in them.

This denormalization of the data allows for fast access to the data we are interested in and is
the basis for Real Time BI or Real Time Analytics, where enterprises don’t have to rely on
end-of-the-day batch runs to populate data warehouse tables and generate analytics; now they
can fill in this type of data, for multiple types of requirements, when the order is placed by the
customer.
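
As a rough sketch of such an aggregate update, the following Python keeps a product-to-orders aggregate current at the moment an order is placed. The dictionaries, the `place_order` helper, and the sample orders are hypothetical.

```python
from collections import defaultdict

orders = {}                               # orderId -> order aggregate
orders_by_product = defaultdict(list)     # product name -> [orderId, ...]


def place_order(order_id, customer_id, products):
    """Store the order and update the analytics aggregate in the same step."""
    orders[order_id] = {"customerId": customer_id, "products": list(products)}
    for product in products:
        orders_by_product[product].append(order_id)


place_order(99, 1, ["Refactoring Databases"])
place_order(100, 2, ["Refactoring Databases", "NoSQL Distilled"])

# The answer is already materialized; no batch job has to scan all orders.
print(orders_by_product["Refactoring Databases"])
```

The same order data is written twice, once per access pattern, which is the denormalization trade-off: more work at write time in exchange for a direct lookup at read time.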

In document stores, since we can query inside documents, removing references to Orders
from the Customer object is possible. This change allows us to not update the Customer
object when new orders are placed by the Customer.

Since document data stores allow you to query by attributes inside the document, searches
such as “find all orders that include the Refactoring Databases product” are possible, but the
decision to create an aggregate of items and orders they belong to is not based on the
database’s query capability but on the read optimization desired by the application.
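
Querying by attributes inside documents can be imitated in plain Python. This sketch uses a list of dictionaries in place of a document store; the two query helpers and the sample orders are assumptions, and a real document database would evaluate such queries on the server, usually with the help of indexes.

```python
# Hypothetical order documents; the Customer holds no references to them.
order_documents = [
    {"orderId": 99, "customerId": 1,
     "items": [{"product": "Refactoring Databases", "qty": 1}]},
    {"orderId": 100, "customerId": 1,
     "items": [{"product": "NoSQL Distilled", "qty": 2}]},
    {"orderId": 101, "customerId": 2,
     "items": [{"product": "Refactoring Databases", "qty": 3}]},
]


def orders_for_customer(documents, customer_id):
    """Find a customer's orders by an attribute stored inside each order."""
    return [d["orderId"] for d in documents if d["customerId"] == customer_id]


def orders_with_product(documents, product_name):
    """Find all orders that include the given product."""
    return [
        d["orderId"]
        for d in documents
        if any(item["product"] == product_name for item in d["items"])
    ]


print(orders_for_customer(order_documents, 1))
print(orders_with_product(order_documents, "Refactoring Databases"))
```

Because the customer's orders are found through the `customerId` attribute, placing a new order only adds one document and leaves the Customer object untouched.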

When modeling for column-family stores, we have the benefit of the columns being ordered,
allowing us to name columns that are frequently used so that they are fetched first. When
using the column families to model the data, it is important to remember to do it per your
query requirements and not for the purpose of writing; the general rule is to make it easy to
query and denormalize the data during write.

As you can imagine, there are multiple ways to model the data; one way is to store the
Customer and Order in different column families.

Here, it is important to note that the references to all the orders placed by the customer are
in the Customer column family. Other similar denormalizations are generally done so that
query (read) performance is improved.
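
One way to picture this layout is with nested dictionaries, where each column family maps a row key to its columns. This is a simplified sketch, not the data model of any specific column-family store: the row keys, the `order:` column-name prefix, and the helper functions are assumptions made for the example.

```python
# Hypothetical layout: column family -> row key -> {column name: value}.
customer_cf = {
    "cust-1": {
        "name": "Ann",
        "order:0099": "",          # reference columns, one per order placed
        "order:0100": "",
    },
}
order_cf = {
    "0099": {"customerId": "cust-1", "product": "Refactoring Databases"},
    "0100": {"customerId": "cust-1", "product": "NoSQL Distilled"},
}


def column_slice(row, prefix):
    """Return the columns whose names start with prefix, in sorted order."""
    return [(name, row[name]) for name in sorted(row) if name.startswith(prefix)]


def orders_of(customer_key):
    """Follow the order references kept in the Customer column family."""
    references = column_slice(customer_cf[customer_key], "order:")
    return [order_cf[name.split(":", 1)[1]] for name, _ in references]


print([order["product"] for order in orders_of("cust-1")])
```

Naming the reference columns with a shared prefix keeps them adjacent in the sorted column order, so they can be read together as one slice of the customer's row.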

When using graph databases to model the same data, we model all objects as nodes and the
relations between them as relationships; these relationships have types and directional
significance. Each node has independent relationships with other nodes.

These relationships have names like PURCHASED, PAID_WITH, or BELONGS_TO;
these relationship names let you traverse the graph. Let’s say you want to find all the
Customers who PURCHASED a product with the name Refactoring Databases.

All we need to do is query for the product node Refactoring Databases and look for all the
Customers with the incoming PURCHASED relationship.
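
The query just described can be sketched in Python with an index of incoming relationships per node. The tuple representation, the node identifiers, and the sample customers are assumptions; a graph database would express the same traversal in its own query language.

```python
from collections import defaultdict

# (start node, relationship type, end node); all sample data is hypothetical.
relationships = [
    ("customer:Ann", "PURCHASED", "product:Refactoring Databases"),
    ("customer:Dan", "PURCHASED", "product:Refactoring Databases"),
    ("customer:Dan", "PURCHASED", "product:NoSQL Distilled"),
    ("customer:Ann", "PAID_WITH", "card:1"),
]

# Index relationships by their end node so incoming edges are a direct lookup.
incoming = defaultdict(list)
for start, rel_type, end in relationships:
    incoming[end].append((rel_type, start))


def customers_who_purchased(product_name):
    """Start at the product node and follow incoming PURCHASED relationships."""
    node = f"product:{product_name}"
    return sorted(
        start for rel_type, start in incoming.get(node, []) if rel_type == "PURCHASED"
    )


print(customers_who_purchased("Refactoring Databases"))
```

The lookup starts at one product node and touches only the relationships attached to it, so its cost depends on how many customers bought that product rather than on the size of the whole graph.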

This type of relationship traversal is very easy with graph databases.

It is especially convenient when you need to use the data to recommend products to users or
to find patterns in actions taken by users.

Key Points
• Aggregate-oriented databases make inter-aggregate relationships more difficult to handle
than intra-aggregate relationships.

• Graph databases organize data into node and edge graphs; they work best for data that has
complex relationship structures.

• Schemaless databases allow you to freely add fields to records, but there is usually an
implicit schema expected by users of the data.

• Aggregate-oriented databases often compute materialized views to provide data organized
differently from their primary aggregates. This is often done with map-reduce computations.

**********
