No SQL

The document discusses different types of NoSQL databases and why they emerged as an alternative to relational databases. It covers key reasons like handling large datasets, scaling to clusters of machines, and impedance mismatch between object models and relational models. It also discusses some example NoSQL databases like those created by Google and Amazon.

Uploaded by

HELLO World

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

71 views

No SQL

Uploaded by

HELLO World

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 36

Types of NoSQL Databases

Introduction
• It’s born out of a need to handle larger data
volumes which forced a fundamental shift to
building large hardware platforms through
clusters of commodity servers.
• Advocates of NoSQL databases claim that they
can build systems that are more performant,
scale much better, and are easier to program
with.
Why Are NoSQL Databases Interesting?
• Application development productivity. A lot
of application development effort is spent on
mapping data between in-memory data
structures and a relational database.
• A NoSQL database may provide a data model
that better fits the application’s needs, thus
simplifying that interaction and resulting in
less code to write, debug, and evolve.
Cont’d
• Large-scale data. Organizations are finding it valuable
to capture more data and process it more quickly.
• They are finding it expensive, if even possible, to do so
with relational databases.
• The primary reason is that a relational database is
designed to run on a single machine, but it is usually
more economic to run large data and computing loads
on clusters of many smaller and cheaper machines.
• Many NoSQL databases are designed explicitly to run
on clusters, so they make a better fit for big data
scenarios.
The Value of Relational Databases
• Getting at Persistent Data – provide a “backing” store
for volatile memory
– Two areas of memory:
• Fast, small, volatile main memory
• Larger, slower, non volatile backing store
• Since main memory is volatile to keep data around, we
write it to a backing store, commonly seen a disk which
can be persistent memory.
The backing store can be: • File system • Database
The database allows more flexibility than a file system in
storing large amounts of data in a way that allows an
application program to get information quickly and easily.
Concurrency
• Multiple applications accessing shared data
– Transactions
• Enterprise applications tend to have many people using
same data at once, possibly modifying that data.
• We have to worry about coordinating interactions
between them to avoid things like double booking of
hotel rooms
• Since enterprise applications can have lots of users and
other systems all working concurrently, there’s a lot of
room for bad things to happen.
• Relational databases help to handle this by controlling
all access to their data through transactions..
Integration
• Enterprise requires multiple applications, written by
different teams, to collaborate in order to get things done.
• Applications often need to use the same data and updates
made through one application have to be visible to others.
• A common way to do this is shared database integration
where multiple applications store their data in a single
database.
• Using a single database allows all the applications to use
each others’ data easily, while the database’s concurrency
control handles multiple applications in the same way as it
handles multiple users in a single application.
Impedance Mismatch
• Impedance mismatch is a term used in computer science to
describe the problem that arises when two systems or
components that are supposed to work together have
different data models, structures, or interfaces that make
communication difficult or inefficient.
• In the context of databases, impedance mismatch refers to the
discrepancy between the object-oriented programming (OOP)
model used in application code and the relational model used
in database management systems (DBMS).
• While OOP models are designed to represent data as objects
with properties and methods, relational models represent data
as tables with columns and rows.
• This impedance mismatch can create challenges when it comes
to mapping objects in code to tables in a database or vice
versa.
Impedance Mismatch
• The difference between the relational model
and the in-memory data structures.
• The relational data model organizes data into
a structure of tables.
– Where a tuple is a set of name-value pairs and a
relation is a set of tuples.
• Structure and relationships have to be
mapped
– Rich, in-memory structures have to be translated
to relational representation to be stored on disk
– Translation: impedance mismatch
Cont’d
Cont’d
• Impedance mismatch has been made much
easier to deal with by the wide availability of
object relational mapping frameworks.
• Impedance mismatch has been made much
easier to deal with by the wide availability of
object relational mapping frameworks, such as
Hibernate and iBATIS that implement well-
known mapping patterns but the mapping
problem is still an issue.
Application and Integration Databases
• Data integration is the process of taking data from
different sources and formats and combining it into a
single data set.
• Integration database - with multiple applications, usually
developed by separate teams, storing their data in a
common database.
• This improves communication because all the applications
are operating on a consistent set of persistent data.
Or
• An integration database is a database which acts as the
data store for multiple applications, and thus integrates
data across these applications .
Cont’d
Cont’d
Integrate many applications becomes (dramatically)
more complexthan any single application needs
−Changes to the data model must be
coordinated
−Different structural and performance needs for
different applications
−Database integrity becomes an issue
Instead, treat the database as an application
database
−Single application, single development team
−Provide alternate integration mechanisms
Cont’d
• Data integration platforms are an efficient
approach to data utilization and storage.

• Rather than replicating data across locations

or environments, the integration database
serves as a single source of truth.
Alternate Integration Mechanism: Services
During the 2000s we saw a distinct shift to web
services where applications would communicate over
HTTP.
More recent push to use Web Services where applications
integrate over HTTP communications
−XML-RPC, SOAP, REST
∙Results in more flexibility for exchange data structure
−XML, JSON, etc.
−Text-based protocols
∙Results in letting application developers choose database
−Application databases
−Relational databases are often still an appropriate
choice
Application Database
• Application Database for a database that is
controlled and accessed by a single application.
• With an application database, only the team using
the application needs to know about the database
structure, which makes it much easier to maintain
and evolve the schema.
• Since the application team controls both the
database and the application code, the
responsibility for database integrity can be put in
the application code.
The Attack of the Clusters
The 2000s saw the web grow enormously
−Web use tracking data, social networks, activity logs,
mapping data, etc.
−Huge websites serving huge numbers of visitors
∙To handle the increase in data and traffic required more
computing resources
∙Instead of building bigger machines with more
processors, storage, and memory, use clusters of small,
commodity machines
−Cheaper, more resilient
∙But relational databases are not designed to be run on
clusters
Cont’d
• Coping with the increase in data and traffic required
more computing resources.
• To handle this kind of increase, you have two choices:
• 1. Scaling up implies:
– bigger machines
– more processors
– more disk storage
– more memory
• Scaling up disadvantages:
– But bigger machines get more and more expensive.
– There are real limits as size increases.
Cont’d
• Use lots of small machines in a cluster:
– A cluster of small machines can use commodity
hardware and ends up being cheaper at these
kinds of scales.
– It can also be more resilient—while individual
machine failures are common, the overall cluster
can be built to keep going despite such failures,
providing high reliability.
Clustered Relational Databases
• Relational databases are not designed to be run on
Clusters.
• Clustered relational databases, such as the Oracle RAC
or Microsoft SQL Server, work on the concept of a
shared disk subsystem where cluster still has the disk
subsystem as a single point of failure.
• Relational databases could also be run as separate
servers for different sets of data, effectively sharding
the database.
• Even though this separates the load, all the sharding
has to be controlled by the application which has to
keep track of which database server to talk to for each
bit of data.
Cont’d
• We lose any querying, referential integrity, transactions,
or consistency controls that cross shards.
• Commercial relational databases (licensed) are usually
priced on a single-server assumption, so running on a
cluster raised prices.
• This mismatch between relational databases and
clusters led some organization to consider an alternative
route to data storage. Two companies in particular
– 1. Google
– 2.Amazon
• Both were running large clusters
• They were capturing huge amounts of data
The Emergence of NoSQL
• Historical note: ‘NoSQL’ was first used to name an open-
source relational database development led by Carlo
Strozzi.
• Current use of the phrase came from a conference meet
up discussing “open-source, distributed, nonrelational
databases.
• The name NoSQL comes from the fact that the NoSQL
databases doesn’t use SQL as a query language.
• Instead, the database is manipulated through shell
scripts that can be combined into the usual UNIX
pipelines.
Cont’d
• Most NoSQL databases are driven by the need to run on
clusters.
• Relational databases use ACID transactions to handle
consistency across the whole database.
• This inherently clashes with a cluster environment, so
NoSQL databases offer a range of options for consistency
and distribution.
• Not all NoSQL databases are strongly oriented towards
running on clusters.
• Graph databases are one style of NoSQL databases that
uses a distribution model similar to relational databases
but offers a different data model that makes it better at
handling data with complex relationships.
Cont’d
• NoSQL databases operate without a schema,
allowing you to freely add fields to database
records without having to define any changes
in structure first.
• Two primary reasons for considering NoSQL:
– 1) To handle data access with sizes and
performance that demand a cluster
– 2) To improve the productivity of application
development by using a more convenient data
interaction style.
Cont’d
• A NoSQL is a database that provides a
mechanism for storage and retrieval of data,
they are used in real-time web applications
and big data and their use are increasing over
time.
• Many NoSQL stores compromise consistency
in favor of availability, speed and partition
tolerance.
Advantages of NoSQL
• 1. High Scalability
– NoSQL databases use sharding for horizontal
scaling.
– It can handle huge amount of data because of
scalability, as the data grows NoSQL scale itself to
handle that data in efficient manner.
• 2. High Availability
– Auto replication feature in NoSQL databases
makes it highly available.
Disadvantages of NoSQL
1. Narrow Focus: It is mainly designed for storage, but it
provides very little functionality.
2. Open Source: NoSQL is open-source database that is two
database systems are likely to be unequal.
3. Management Challenge: Big data management in NoSQL
is much more complex than a relational database.
4. GUI is not available: GUI mode tools to access the
database is not flexibly available in the market.
5. Backup: it is a great weak point for some NoSQL
databases like MongoDB.
6. Large Document size: Data in JSON format increases the
document size.
When should NoSQL be used
• When huge amount of data need to be stored and
retrieved.
• The relationship between data you store is not
that important.
• The data changing over time and is not structured.
• Support of constraint and joins is not required at
database level.
• The data is growing continuously and you need to
scale the database regular to handle the data.
Characteristics of NoSQL Databases
They do not use SQL and the relational model
• Some do have query languages which are similar to SQL to
be easy to learn and use.
∙ Mostly open-source projects
∙Designed to be distributed –clustered
−No expectation of ACID properties
−Range of options for consistency and distribution
∙Schema free
−Freely add fields to records without having to define any
changes in structure first
−Non-uniform data and custom fields
∙A no Definition of NoSQL: An ill-defined set of mostly open-
source databases, mostly developed in the early 21stcentury, and
mostly not using SQL
Polyglot Persistence
• Polyglot persistence is a conceptual term that refers to the use of
different data storage approaches and technologies to support the
unique storage requirements of various data types that live within
enterprise applications.

• Polyglot persistence refers to using different data storage technologies

to handle varying data storage needs.

• Polyglot Persistence is a fancy term to mean that when storing data, it is

best to use multiple data storage technologies, chosen based upon the
way data is being used by individual applications or components of a
single application.

• Different kinds of data are best dealt with different data stores. In
short, it means picking the right tool for the right use case.
Example
• Looking at a Polyglot Persistence example, an e-
commerce platform will deal with many types
of data (i.e. shopping cart, inventory, completed
orders, etc). Instead of trying to store all this
data in one database, which would require a lot
of data conversion to make the format of the
data all the same, store the data in the
database best suited for that type of data. So
the e-commerce platform might look like this:
Cont’d
Cont’d
Cont’d

Introduction To Nosql
No ratings yet
Introduction To Nosql
73 pages
AWR25092002 Gurvinder DATA4000 Assessment 3.edited - Edited
No ratings yet
AWR25092002 Gurvinder DATA4000 Assessment 3.edited - Edited
10 pages
MODULE 1 -ppt -7B
No ratings yet
MODULE 1 -ppt -7B
70 pages
NoSql Mod 1 C
No ratings yet
NoSql Mod 1 C
16 pages
4.2 NoSQL Databases UNIT-1
No ratings yet
4.2 NoSQL Databases UNIT-1
35 pages
NOSQL
No ratings yet
NOSQL
64 pages
NOSQL_MOD1
No ratings yet
NOSQL_MOD1
31 pages
Nosqlmodule 1
100% (1)
Nosqlmodule 1
102 pages
Relational DB
No ratings yet
Relational DB
32 pages
ADBMS-Module 2
No ratings yet
ADBMS-Module 2
33 pages
BGD Mod 2 QB Solns
No ratings yet
BGD Mod 2 QB Solns
11 pages
Bda CHP 3
No ratings yet
Bda CHP 3
75 pages
Data Science v No SQL Databases
No ratings yet
Data Science v No SQL Databases
61 pages
NoSql Intro
No ratings yet
NoSql Intro
24 pages
Module 1 Nosql
No ratings yet
Module 1 Nosql
16 pages
Database Advice Guide
No ratings yet
Database Advice Guide
19 pages
Intro 2 DB
No ratings yet
Intro 2 DB
126 pages
UNIT 2 - Part1
No ratings yet
UNIT 2 - Part1
53 pages
Why No SQL wpCOUCHBASE2022
No ratings yet
Why No SQL wpCOUCHBASE2022
14 pages
BDA - M 3 - NoSQL
No ratings yet
BDA - M 3 - NoSQL
81 pages
UDBMS NOTES
No ratings yet
UDBMS NOTES
18 pages
DBMS PPT 1 ENG
No ratings yet
DBMS PPT 1 ENG
74 pages
DBMS PPT 1
No ratings yet
DBMS PPT 1
27 pages
BDA Unit-3
No ratings yet
BDA Unit-3
13 pages
DBMS (UNIT-6) (Advances in Databases and Big Data)
No ratings yet
DBMS (UNIT-6) (Advances in Databases and Big Data)
103 pages
Unit 1 Notes in NoSQL
No ratings yet
Unit 1 Notes in NoSQL
20 pages
NOSQL Database
No ratings yet
NOSQL Database
10 pages
NO SQL Unit 1
No ratings yet
NO SQL Unit 1
66 pages
Lecture#02 FileSystemAndDB
No ratings yet
Lecture#02 FileSystemAndDB
39 pages
Nosql Databases: P.Krishna Reddy Iiit Hyderabad
No ratings yet
Nosql Databases: P.Krishna Reddy Iiit Hyderabad
30 pages
UNIT 3 -BDA
No ratings yet
UNIT 3 -BDA
36 pages
2014 Ieee Computer Nosql
No ratings yet
2014 Ieee Computer Nosql
4 pages
Unit 4: Big Data Tehnology Landscape Two Inportant Technologies
No ratings yet
Unit 4: Big Data Tehnology Landscape Two Inportant Technologies
42 pages
Unit 6
No ratings yet
Unit 6
143 pages
Introduction To NoSQL
No ratings yet
Introduction To NoSQL
16 pages
nosql-technology (1)
No ratings yet
nosql-technology (1)
8 pages
Mysql PDF
No ratings yet
Mysql PDF
188 pages
Big Data
No ratings yet
Big Data
53 pages
Duda
No ratings yet
Duda
13 pages
Data Anal
No ratings yet
Data Anal
53 pages
1 - The Databases Revolutions
No ratings yet
1 - The Databases Revolutions
46 pages
Chapter 1: Introduction
No ratings yet
Chapter 1: Introduction
39 pages
NOs QL
No ratings yet
NOs QL
14 pages
009 Databases
No ratings yet
009 Databases
51 pages
WP SQL To Nosql Architectur Differences Considerations Migration 1+ (6) - 1641371845027
No ratings yet
WP SQL To Nosql Architectur Differences Considerations Migration 1+ (6) - 1641371845027
13 pages
Enter The Purpose-Built Database Era:: Finding The Right Database Type For The Right Job
No ratings yet
Enter The Purpose-Built Database Era:: Finding The Right Database Type For The Right Job
24 pages
BDA GTU Study Material Presentations Unit-3 29092021094744AM
No ratings yet
BDA GTU Study Material Presentations Unit-3 29092021094744AM
37 pages
Database Lec 1
No ratings yet
Database Lec 1
17 pages
NOSQL Data Management
No ratings yet
NOSQL Data Management
21 pages
Fdocuments - in Nosql-Seminar
No ratings yet
Fdocuments - in Nosql-Seminar
40 pages
Introduction To NoSQL
No ratings yet
Introduction To NoSQL
29 pages
BDA Unit2 Complete
No ratings yet
BDA Unit2 Complete
56 pages
Chapter_4 - NoSQL_1676181987
No ratings yet
Chapter_4 - NoSQL_1676181987
85 pages
102-COPIES-ADV-LESSON-1
No ratings yet
102-COPIES-ADV-LESSON-1
5 pages
05 Database Management Systems
No ratings yet
05 Database Management Systems
37 pages
S-Advance Database Management System 1
No ratings yet
S-Advance Database Management System 1
68 pages
Data Base System Assignment
No ratings yet
Data Base System Assignment
4 pages
CloudComputing DATABASE
No ratings yet
CloudComputing DATABASE
27 pages
DBS-C01-S02-B-03-Relational Databases
No ratings yet
DBS-C01-S02-B-03-Relational Databases
3 pages
Database And Computer Management: SERIES 1, #3
From Everand
Database And Computer Management: SERIES 1, #3
Elias Mutegi
No ratings yet
Practical Data Strategies and Recipes
From Everand
Practical Data Strategies and Recipes
Tom Henricksen
No ratings yet
Advanced Git For Beginners: Derrick Stolee Microsoft @stolee
No ratings yet
Advanced Git For Beginners: Derrick Stolee Microsoft @stolee
34 pages
Github
No ratings yet
Github
37 pages
Git VSC
No ratings yet
Git VSC
22 pages
Git Intro
No ratings yet
Git Intro
15 pages
IOB Seminar - Report
No ratings yet
IOB Seminar - Report
14 pages
ACM Spot Purchase: System Access Information
100% (1)
ACM Spot Purchase: System Access Information
8 pages
CIB
No ratings yet
CIB
18 pages
Q5 OS PROJECT
No ratings yet
Q5 OS PROJECT
4 pages
Chapter 4 EER
No ratings yet
Chapter 4 EER
35 pages
Aishwarya
No ratings yet
Aishwarya
2 pages
Systems Analysis and Design 6th Edition Dennis Test Bank - 2025 Version Is Available With All Chapters
100% (1)
Systems Analysis and Design 6th Edition Dennis Test Bank - 2025 Version Is Available With All Chapters
48 pages
s13222-024-00490-5
No ratings yet
s13222-024-00490-5
5 pages
Wa0006.
No ratings yet
Wa0006.
16 pages
BDA simple 1 to 4
No ratings yet
BDA simple 1 to 4
11 pages
Introduction To Statistics
No ratings yet
Introduction To Statistics
12 pages
Sharda Bia10e Tif 01
No ratings yet
Sharda Bia10e Tif 01
11 pages
Database Management Systems 1
No ratings yet
Database Management Systems 1
7 pages
CA - Case Study - Insurance Claims Dashboard-Design Pass 01
No ratings yet
CA - Case Study - Insurance Claims Dashboard-Design Pass 01
43 pages
Cit 3201 Database Systems
No ratings yet
Cit 3201 Database Systems
3 pages
Sorting & Aggregations: Intro To Database Systems Andy Pavlo
No ratings yet
Sorting & Aggregations: Intro To Database Systems Andy Pavlo
57 pages
May Jun 2024 Full Solutions
No ratings yet
May Jun 2024 Full Solutions
24 pages
Fundamentals of DB System
No ratings yet
Fundamentals of DB System
62 pages
Data Preprocessing: Why Preprocess The Data? Why Preprocess The Data?
No ratings yet
Data Preprocessing: Why Preprocess The Data? Why Preprocess The Data?
48 pages
CHAPTER 11 Slides
No ratings yet
CHAPTER 11 Slides
69 pages
Naukri DikshaGabha (5y 0m)
No ratings yet
Naukri DikshaGabha (5y 0m)
1 page
Chapter 03 Test Bank Version1
No ratings yet
Chapter 03 Test Bank Version1
40 pages
system_admin
No ratings yet
system_admin
588 pages
Chapter 9 - BDMT
No ratings yet
Chapter 9 - BDMT
61 pages
Data Mining - IMT Nagpur-Manish
No ratings yet
Data Mining - IMT Nagpur-Manish
82 pages
Java Data Mining Strategy Standard and Practice A Practical Guide for architecture design and implementation 1st Edition Mark F. Hornick instant download
100% (3)
Java Data Mining Strategy Standard and Practice A Practical Guide for architecture design and implementation 1st Edition Mark F. Hornick instant download
62 pages
Principles of Archives
No ratings yet
Principles of Archives
5 pages
SRC 7
No ratings yet
SRC 7
11 pages
ER Diagram Representation 1
No ratings yet
ER Diagram Representation 1
9 pages
Lecture 5 - SQL Part III
No ratings yet
Lecture 5 - SQL Part III
60 pages

No SQL

Uploaded by

No SQL

Uploaded by

Types of NoSQL Databases

• Rather than replicating data across locations

• Polyglot persistence refers to using different data storage technologies

• Polyglot Persistence is a fancy term to mean that when storing data, it is

You might also like