Module 5 - Nosql

The document discusses the differences between SQL and NoSQL databases, highlighting key characteristics such as schema definitions, scalability, and query capabilities. It explains the CAP theorem, which outlines the trade-offs between consistency, availability, and partition tolerance in distributed systems. Additionally, various types of NoSQL databases are described, including key-value, document-based, column-based, and graph-based models, along with their advantages, disadvantages, and applications.

Uploaded by

Pranav Vasu

NoSQL and Its Understanding

By,
Dr. Bhargava R
SQL vs NoSQL

SQL                         NoSQL
--------------------------  -----------------------------------
Relational database         Distributed database
Predefined schema           Dynamic schema
Vertically scalable         Horizontally scalable
Lower availability          Highly available
Supports complex queries    Limited support for complex queries
CAP Theorem

The three letters in CAP refer to three desirable properties of
distributed systems with replicated data:
• Consistency (among replicated copies)
• Availability (of the system for read and write operations)
• Partition tolerance (in the face of the nodes in the system being
partitioned by a network fault).
Consistency

• Consistency means that the nodes will have the same copies of
a replicated data item visible for various transactions.
• It is a guarantee that every node in a distributed cluster returns
the same, most recent successful write.
• Consistency refers to every client having the same view of the
data.
Availability

• Availability means that each read or write request for a data
item will either be processed successfully or will receive a
message that the operation cannot be completed.
• Every non-failing node returns a response for all the read and
write requests in a reasonable amount of time. The key word
here is “every”.
• In simple terms, every node (on either side of a network
partition) must be able to respond in a reasonable amount of
time.
Partition Tolerance

• Partition tolerance means that the system can continue
operating even if the network connecting the nodes has a fault
that results in two or more partitions, where the nodes in each
partition can communicate only with each other.
• That is, the system continues to function and upholds its
consistency guarantees in spite of network partitions. Network
partitions are a fact of life.
• Distributed systems that guarantee partition tolerance can
recover gracefully once the partition heals.
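The trade-off between these three properties can be illustrated with a toy model. The sketch below is only an illustration under stated assumptions, not how any real database works: `Replica`, `Cluster`, and the "CP"/"AP" mode names are hypothetical. During a partition, the CP cluster refuses writes (giving up availability), while the AP cluster accepts them and lets the replicas diverge (giving up consistency) until the partition heals.

```python
class Replica:
    """One node holding a copy of a single replicated value (hypothetical)."""

    def __init__(self, name):
        self.name = name
        self.value = None


class Cluster:
    """Two replicas plus a flag saying whether the network link is cut."""

    def __init__(self, mode):
        assert mode in ("CP", "AP")
        self.mode = mode
        self.a = Replica("a")
        self.b = Replica("b")
        self.partitioned = False

    def write(self, replica, value):
        """Write through one replica; return True if the write is accepted."""
        other = self.b if replica is self.a else self.a
        if not self.partitioned:
            replica.value = value
            other.value = value   # replication succeeds on a healthy link
            return True
        if self.mode == "CP":
            return False          # refuse the write to stay consistent
        replica.value = value     # AP: accept locally; replicas now diverge
        return True

    def heal(self, winner):
        """End the partition and copy the winner's value to the other node."""
        self.partitioned = False
        other = self.b if winner is self.a else self.a
        other.value = winner.value


cp = Cluster("CP")
cp.write(cp.a, "v1")
cp.partitioned = True
cp_accepted = cp.write(cp.a, "v2")   # rejected: availability is given up

ap = Cluster("AP")
ap.write(ap.a, "v1")
ap.partitioned = True
ap_accepted = ap.write(ap.a, "v2")   # accepted: consistency is given up
diverged = ap.a.value != ap.b.value
ap.heal(winner=ap.a)                 # replicas agree again after healing
```

Picking a single "winner" on heal is the simplest possible reconciliation rule and is chosen here only to keep the sketch short.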
Types of NoSQL
Key-Value Store

• A key-value data model or database is also referred to as a
key-value store.
• An associative array (map or dictionary) is the basic data
structure: each key is linked with exactly one value in a collection.
• Keys are unique identifiers for their values.
• A collection of key-value pairs stored as separate records is
called a key-value database; it has no predefined structure.
When to use a key-value database:
Here are a few situations in which you can use a key-value database:
• User session attributes in an online application such as finance or gaming, which
need real-time random data access.
• Caching of repeatedly accessed data, or any key-based design.
• Applications whose queries are based on keys.

Features:
• One of the simplest kinds of NoSQL data models.
• Key-value databases use simple functions for storing, getting, and removing data.
• Key-value databases have no query language.
• Built-in redundancy makes this database more reliable.
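The "simple functions" for storing, getting, and removing data can be sketched with an in-memory dictionary. This is a minimal illustration, assuming a hypothetical `KeyValueStore` class with `put`, `get`, and `delete` methods; real key-value databases add persistence, replication, and networking on top of the same idea.

```python
class KeyValueStore:
    """A toy in-memory key-value store (hypothetical, for illustration)."""

    def __init__(self):
        self._data = {}

    def put(self, key, value):
        """Store a value under a key, overwriting any previous value."""
        self._data[key] = value

    def get(self, key, default=None):
        """Return the value for a key, or a default if the key is absent."""
        return self._data.get(key, default)

    def delete(self, key):
        """Remove a key; return True if it existed, False otherwise."""
        if key in self._data:
            del self._data[key]
            return True
        return False


# Example: user session attributes looked up by session id.
sessions = KeyValueStore()
sessions.put("session:42", {"user": "asha", "cart_items": 3})
current = sessions.get("session:42")
missing = sessions.get("session:99")      # no such key, so the default
was_deleted = sessions.delete("session:42")
```

Every operation goes through the key; there is no way to ask for "all sessions with three cart items" without scanning, which matches the limitation that the database cannot be queried without a key.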
Advantages:
• It is very easy to use.
• Its response time is fast.
• Key-value store databases are scalable vertically as well as horizontally.
• Built-in redundancy makes this database more reliable.
Disadvantages:
• Because key-value databases have no query language, queries cannot be
ported from one database to another.
• Key-value stores are not sophisticated: you cannot query the database
without a key.
Document-Based
• A document data model differs from other data models because it
stores data in JSON, BSON, or XML documents.
• It is a semi-structured data model: a record and the data associated
with it are stored together in a single document, so the model is not
completely unstructured.
• The main point is that the data is stored in documents.
Features:
• Document Type Model: Data is stored in documents rather than tables or graphs, so it maps
easily to objects in many programming languages.
• Flexible Schema: The schema is flexible; not all documents in a collection need to have the
same fields.
• Distributed and Resilient: Document data models are distributed, which enables horizontal
scaling and distribution of data.
• Manageable Query Language: The query language lets developers perform CRUD (Create, Read,
Update, Delete) operations on the data model.
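The flexible schema and the CRUD operations listed above can be sketched in a few lines. This is an illustration only: the collection is modeled as a plain Python list of dictionaries, and the function names `create`, `read`, `update`, and `delete` are hypothetical helpers, not the API of any particular document database.

```python
import json

# A "collection" is modeled here as a plain list of dict documents.
collection = []


def create(document):
    """Create: add a document to the collection."""
    collection.append(document)


def read(**criteria):
    """Read: return documents whose fields match every given criterion."""
    return [d for d in collection
            if all(d.get(k) == v for k, v in criteria.items())]


def update(criteria, changes):
    """Update: apply changes to each matching document; return the count."""
    matched = read(**criteria)
    for d in matched:
        d.update(changes)
    return len(matched)


def delete(**criteria):
    """Delete: remove matching documents; return how many were removed."""
    matched = read(**criteria)
    for d in matched:
        collection.remove(d)
    return len(matched)


# Documents in the same collection need not share the same fields,
# and a document may nest lists and sub-documents.
create({"_id": 1, "title": "Intro to NoSQL", "author": "Asha"})
create({"_id": 2, "title": "CAP Notes", "author": "Ravi",
        "tags": ["cap", "distributed"], "reviews": [{"stars": 5}]})

update({"_id": 1}, {"pages": 45})     # add a field to one document only
removed = delete(author="Ravi")
as_json = json.dumps(collection[0], sort_keys=True)
```

Because each document is a self-describing dictionary, it serializes directly to JSON, which is the sense in which the model maps easily to programming-language objects.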
Applications of Document Data Model:
• Content Management: Widely used for video streaming platforms, blogs, and similar services,
because each item is stored as a single document and the database is easier to maintain as the
service evolves over time.
• Book Database: Useful for book databases, because this data model lets us nest data.
• Catalog: Widely used for storing and reading catalog files, because reads stay fast even when
catalogs have thousands of attributes stored.
• Analytics Platform: Widely used in analytics platforms.
Advantages:
• Schema-less
• Faster creation and maintenance of documents
• Open formats
• Built-in versioning

Disadvantages:
• Weak Atomicity: Support for multi-document ACID transactions is lacking. A change involving
two collections requires two separate queries, one for each collection, which breaks atomicity
requirements.
• Consistency Check Limitations: One can search for collections and documents that are not
connected to an author collection, but doing so may degrade database performance.
• Security: Many web applications lack security, which results in the leakage of sensitive
data, so one must pay attention to web application vulnerabilities.
Column-Based
• A relational database stores data in rows and reads it row by row, whereas a
column store is organized as a set of columns.
• To run analytics on a small number of columns, one can read those columns
directly without consuming memory on unwanted data.
• Values within a column are of the same type and so compress more
efficiently, which makes reads faster.
• Examples of Columnar Data Model: Cassandra and Apache HBase.
• The columnar data model organizes information into columns instead of rows,
while otherwise functioning much like tables in relational databases.
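The difference between row and column organization can be sketched with the same small table laid out both ways. The table contents and the helper `run_length_encode` are hypothetical; run-length encoding is used here only as one simple example of why same-typed, repetitive column values compress well, not as the scheme any particular column store uses.

```python
from itertools import groupby

# Row layout: each record keeps all of its fields together.
rows = [
    {"order_id": 1, "city": "Pune", "amount": 120},
    {"order_id": 2, "city": "Pune", "amount": 80},
    {"order_id": 3, "city": "Delhi", "amount": 200},
]

# Column layout: each column's values are kept together instead.
columns = {name: [row[name] for row in rows] for name in rows[0]}

# An aggregate over one column touches only that column's values.
total_amount = sum(columns["amount"])


def run_length_encode(values):
    """Collapse runs of equal adjacent values into (value, count) pairs."""
    return [(value, len(list(group))) for value, group in groupby(values)]


# Repeated values in a column collapse into short runs.
encoded_city = run_length_encode(columns["city"])
```

In the row layout, computing the same total would require walking every record and skipping over `order_id` and `city`, which is the "unwanted data" the column layout avoids reading.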
Advantages of Columnar Data Model :
• Well structured
• Flexibility
• Fast aggregation queries
• Scalability
• Load times
Disadvantages of Columnar Data Model:
• Designing an Indexing Schema: Designing an effective, working schema is difficult and very
time-consuming.
• Suboptimal Data Loading: Incremental data loading is suboptimal and should be avoided, though this
might not be an issue for some users.
• Security Vulnerabilities: If security is a priority, note that the columnar data model lacks
built-in security features; in that case, one must look into relational databases.
• Online Transaction Processing (OLTP): OLTP applications are not compatible with columnar data
models because of the way data is stored.
Applications of Columnar Data Model:
• It is widely used in blogging platforms.
• It is used in content management systems such as WordPress and Joomla.
• It is used in systems that maintain counters.
• It is used in systems that handle heavy write loads.
• It is used in services that have expiring usage.
Graph-Based
The graph-based data model in NoSQL focuses on the relationships between data elements.
As the name suggests, each element is stored as a node, and the associations between
these elements are known as links.
Associations are stored directly, as they are first-class elements of the data model.
These data models give us a conceptual view of the data.
They are based on a topological network structure.
Nodes: Instances of data that represent the objects to be tracked.
Edges: Relationships between nodes.
Properties: Information associated with nodes.
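Nodes, edges, and properties can be sketched with ordinary dictionaries and tuples. The data and the `related` helper below are hypothetical and kept deliberately small; a real graph database stores these structures on disk and offers its own query language.

```python
# Nodes: instances of data, each carrying its own properties.
nodes = {
    "alice": {"label": "Person", "age": 30},
    "bob": {"label": "Person", "age": 25},
    "neo4j_talk": {"label": "Event", "city": "Pune"},
}

# Edges: relationships stored directly as (source, type, target) records.
edges = [
    ("alice", "FOLLOWS", "bob"),
    ("alice", "ATTENDED", "neo4j_talk"),
    ("bob", "ATTENDED", "neo4j_talk"),
]


def related(source, relationship):
    """Return the targets linked from a source node by a relationship type."""
    return [dst for src, rel, dst in edges
            if src == source and rel == relationship]


followed_by_alice = related("alice", "FOLLOWS")
attendees = [src for src, rel, dst in edges
             if rel == "ATTENDED" and dst == "neo4j_talk"]
```

The relationship records exist in their own right rather than being derived from matching column values, which is what "first-class elements" means in the list above.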
• In these data models, connected nodes are linked physically, and the
physical connection between them is itself treated as a piece of data.
• Connecting data in this way makes relationships easy to query.
• The data model reads a relationship directly from storage instead of
computing it through query-time connection steps.
• Like many NoSQL databases, these data models have no fixed schema,
which keeps the model flexible and easy to edit.
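Reading relationships directly from storage can be sketched by keeping each node's links next to the node and following them hop by hop. This is a simplified illustration under assumed data: the `adjacency` map and the `within_hops` function are hypothetical, and the traversal is a plain breadth-first search.

```python
from collections import deque

# Each node stores its outgoing links directly, so following a
# relationship is a lookup rather than a search over all records.
adjacency = {
    "alice": ["bob", "carol"],
    "bob": ["dave"],
    "carol": ["dave"],
    "dave": [],
}


def within_hops(start, max_hops):
    """Return nodes reachable from start in at most max_hops links."""
    seen = {start}
    frontier = deque([(start, 0)])
    reached = []
    while frontier:
        node, depth = frontier.popleft()
        if depth == max_hops:
            continue  # do not expand beyond the hop limit
        for neighbor in adjacency[node]:
            if neighbor not in seen:
                seen.add(neighbor)
                reached.append(neighbor)
                frontier.append((neighbor, depth + 1))
    return reached


direct = within_hops("alice", 1)
two_hops = within_hops("alice", 2)
```

A query such as "friends of friends" is then just a traversal with `max_hops=2`, with no join step between separate tables.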
Advantages of Graph Data Model :
• Structure: The structures are agile and workable.
• Explicit Representation: Relationships between entities are represented
explicitly.
• Real-time O/P Results: Queries return results in real time.
Disadvantages of Graph Data Model:
• No standard query language: The query language depends on the platform
used, so there is no single standard query language.
• Unprofessional Graphs: Graphs are poorly suited to transaction-based
systems.
• Small User Base: The user base is small, which makes it difficult to get
support when running into a problem.
Applications of Graph Data Model:
• Graph data models are widely used in fraud detection.
• They are used in digital asset management, where they provide a scalable
database model for keeping track of digital assets.
• They are used in network management, to alert a network administrator about
problems in a network.
• They are used in context-aware services, for example to give traffic updates.
• They are used in real-time recommendation engines, which provide a better
user experience.
Neo4J
Thank You
By,
Dr. Bhargava R
