0% found this document useful (0 votes)

18 views18 pages

04-2 Intro Nosql

The document discusses the evolution of databases from RDBMS to NoSQL. It describes that traditional RDBMS were designed for business data processing but not suitable for modern web applications with different needs like scalability, flexibility and high availability. This led to the emergence of NoSQL databases with different data models like key-value, column-family, and document stores to address the shortcomings of RDBMS for large scale web and cloud applications. Popular NoSQL databases discussed include DynamoDB, HBase, Cassandra, Redis and MongoDB.

Uploaded by

Trần Tuấn Phong

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views18 pages

04-2 Intro Nosql

Uploaded by

Trần Tuấn Phong

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 18

28/11/2022

NoSQL

Eras of Databases

1
28/11/2022

Eras of Databases

DB engines ranking according to their

popularity (2019)

2
28/11/2022

Before NoSQL

Star schema

OLTP
OLAP cube
5

RDBMS: one size fits all needs

3
28/11/2022

ICDE 2005 conference

The last 25 years of commercial DBMS development can be summed up in a single phrase:
"one size fits all". This phrase refers to the fact that the traditional DBMS architecture
(originally designed and optimized for business data processing) has been used to support
many data-centric applications with widely varying characteristics and requirements. In this
paper, we argue that this concept is no longer applicable to the database market, and that the
commercial world will fracture into a collection of independent database engines ...
7

After is NoSQL

4
28/11/2022

NoSQL landscape

How to write a CV

5
28/11/2022

Why NoSQL

• Web applications have different needs

• Horizontal scalability – lowers cost
• Geographically distributed
• Elasticity
• Schema less, flexible schema for semi-structured data
• Easier for developers
• Heterogeneous data storage
• High Availability/Disaster Recovery
• Web applications do not always need
• Transaction
• Strong consistency
• Complex queries

SQL vs NoSQL

SQL NoSQL
Gigabytes to Terabytes Petabytes(1kTB) to Exabytes(1kPB) to
Zetabytes(1kEB)
Centralized Distributed
Structured Semi structured and Unstructured
Structured Query Language No declarative query language
Stable Data Model Schema less
Complex Relationships Less complex relationships
ACID Property Eventual Consistency
Transaction is priority High Availability, High Scalability
Joins Tables Embedded structures

6
28/11/2022

NoSQL use cases

• Massive data volume at scale (Big volume)

• Google, Amazon, Yahoo, Facebook – 10-100K servers
• Extreme query workload (Big velocity)
• High availability
• Flexible, schema evolution

Relational data model revisited

• Data is usually stored in row by row

manner (row store)
• Standardized query language (SQL)
• Data model defined before you add data
• Joins merge data from multiple tables
• Results are tables
• Pros: Mature ACID transactions with fine-grain
security controls, widely used
Oracle, MySQL, PostgreSQL,
• Cons: Requires up front data modeling, does not Microsoft SQL Server, IBM
scale well DB/2

7
28/11/2022

Key/value data model

• Simple key/value interface

• GET, PUT, DELETE
• Value can contain any kind of data
• Super fast and easy to scale (no joins)
• Examples
• Berkley DB, Memcache, DynamoDB, Redis, Riak

Key/value vs. table

• A table with two columns and a simple

interface
• Add a key-value
• For this key, give me the value
• Delete a key

8
28/11/2022

Key/value vs. Relational data model

Memcached

• Open source in-memory key-value caching system

• Make effective use of RAM on many distributed web servers
• Designed to speed up dynamic web applications by alleviating
database load
• Simple interface for highly distributed RAM caches
• 30ms read times typical
• Designed for quick deployment, ease of development
• APIs in many languages

9
28/11/2022

Redis

• Open source in-memory key-value store with optional

durability
• Focus on high speed reads and writes of common data
structures to RAM
• Allows simple lists, sets and hashes to be stored within the
value and manipulated
• Many features that developers like expiration, transactions,
pub/sub, partitioning

Amazon DynamoDB

• Scalable key-value store

• Fastest growing product in Amazon's history
• Focus on throughput on storage and predictable read and
write times
• Strong integration with S3 and Elastic MapReduce

10
28/11/2022

Column family store

• Dynamic schema, column-oriented data model

• Sparse, distributed persistent multi-dimensional sorted map
• (row, column (family), timestamp) -> cell contents

Column families

• Group columns into "Column families"

• Group column families into "Super-Columns"
• Be able to query all columns with a family or super family
• Similar data grouped together to improve speed

11
28/11/2022

Column family data model vs. relational

• Sparse matrix, preserve table structure

• One row could have millions of columns but can be very sparse
• Hybrid row/column stores
• Number of columns is extendible
• New columns to be inserted without doing an "alter table"

Bigtable

• ACM TOCS 2008

• Fault-tolerant, persistent
• Scalable
• Thousands of servers
• Terabytes of in-memory data
• Petabyte of disk-based data
• Millions of reads/writes per
second, efficient scans
• Self-managing
• Servers can be added/removed
dynamically
• Servers adjust to load imbalance

12
28/11/2022

Apache Hbase

• Open-source Bigtable, written in JAVA

• Part of Apache Hadoop project

Apache Cassandra

• Apache open source column family database

• Supported by DataStax
• Peer-to-peer distribution model
• Strong reputation for linear scale out (millions of
writes/second)
• Written in Java and works well with HDFS and MapReduce

13
28/11/2022

Graph data model

• Core abstractions: Nodes, Relationships, Properties on both

Graph database store

• A database stored data in an explicitly graph structure

• Each node knows its adjacent nodes
• Queries are really graph traversals

14
28/11/2022

Linking open data

Neo4j

• Graph database designed to be easy to use by Java

developers
• Disk-based (not just RAM)
• Full ACID
• High Availability (with Enterprise Edition)
• 32 Billion Nodes, 32 Billion Relationships,
64 Billion Properties
• Embedded java library
• REST API

15
28/11/2022

Document store

• Documents, not value, not tables

• JSON or XML formats
• Document is identified by ID
• Allow indexing on properties

MongoDB

• Open Source JSON data store created by 10gen

• Master-slave scale out model
• Strong developer community
• Sharding built-in, automatic
• Implemented in C++ with many APIs (C++, JavaScript, Java,
Perl, Python etc.)

16
28/11/2022

MongoDB architecture

• Replica set
• Copies of the data on each node
• Data safety
• High availability
• Disaster recovery
• Maintenance
• Read scaling
• Sharding
• “Partitions” of the data
• Horizontal scale

Apache CouchDB

• Apache project
• Open source JSON data store
• Written in ERLANG
• RESTful JSON API
• B-Tree based indexing, shadowing b-tree versioning
• ACID fully supported
• View model
• Data compaction
• Security

17
28/11/2022

Thank you for your attention!

Q&A

DBMS (UNIT-6) (Advances in Databases and Big Data)
No ratings yet
DBMS (UNIT-6) (Advances in Databases and Big Data)
103 pages
14 Types of Databases and Data Stores You Should Know
No ratings yet
14 Types of Databases and Data Stores You Should Know
16 pages
Emerging Trends in Database
No ratings yet
Emerging Trends in Database
4 pages
Designing Data Intensive Applications
25% (4)
Designing Data Intensive Applications
61 pages
Unit 6
No ratings yet
Unit 6
143 pages
BDA (2) Merged
No ratings yet
BDA (2) Merged
29 pages
CS8091-BIG DATA ANALYTICS UNIT V Notes
100% (4)
CS8091-BIG DATA ANALYTICS UNIT V Notes
31 pages
Top 18 Free and Widely Used, Open Source NoSQL Databases
No ratings yet
Top 18 Free and Widely Used, Open Source NoSQL Databases
4 pages
BDA Unit 2
No ratings yet
BDA Unit 2
30 pages
Assignment 4 Rdbms
No ratings yet
Assignment 4 Rdbms
18 pages
Slide 6 NoSQL Database and HBase Tutorial
No ratings yet
Slide 6 NoSQL Database and HBase Tutorial
110 pages
Big Data Complete Notes
No ratings yet
Big Data Complete Notes
9 pages
Bda CHP 3
No ratings yet
Bda CHP 3
75 pages
4.1 Intro Nosql-Converted-133751863122661863
No ratings yet
4.1 Intro Nosql-Converted-133751863122661863
43 pages
Databases in Computer World
No ratings yet
Databases in Computer World
71 pages
NoSQL Database Technology - A Survey and Comparison of Systems
No ratings yet
NoSQL Database Technology - A Survey and Comparison of Systems
44 pages
Chapter 14
No ratings yet
Chapter 14
35 pages
4.1 Intro Nosql
No ratings yet
4.1 Intro Nosql
43 pages
BDA Module 5 - Part1 (No SQL) 2023
No ratings yet
BDA Module 5 - Part1 (No SQL) 2023
32 pages
NOSQL, Graph Databases & Cypher
No ratings yet
NOSQL, Graph Databases & Cypher
78 pages
No SQL
No ratings yet
No SQL
32 pages
Module 1
No ratings yet
Module 1
34 pages
Lecture 10 - Interactive Querying
No ratings yet
Lecture 10 - Interactive Querying
27 pages
4.1 Intro Nosql
No ratings yet
4.1 Intro Nosql
45 pages
Lecture 1 - NoSQL
No ratings yet
Lecture 1 - NoSQL
31 pages
Introduction To NoSQL
No ratings yet
Introduction To NoSQL
29 pages
DBMS Unit2
No ratings yet
DBMS Unit2
26 pages
Bcse302l Dbms Module-7 Nosql
No ratings yet
Bcse302l Dbms Module-7 Nosql
30 pages
4.1 Intro Nosql
No ratings yet
4.1 Intro Nosql
43 pages
Bda Notes (Unit-2)
No ratings yet
Bda Notes (Unit-2)
26 pages
Fdocuments - in Nosql-Seminar
No ratings yet
Fdocuments - in Nosql-Seminar
40 pages
Udbms Notes
No ratings yet
Udbms Notes
18 pages
2 Big Data Analytics-Hadoop R21 A7902 ABP
No ratings yet
2 Big Data Analytics-Hadoop R21 A7902 ABP
16 pages
CloudComputing DATABASE
No ratings yet
CloudComputing DATABASE
27 pages
No SQL
No ratings yet
No SQL
12 pages
5.1 Intro Nosql
No ratings yet
5.1 Intro Nosql
22 pages
Case Study About Database Tools
No ratings yet
Case Study About Database Tools
13 pages
Database Types
No ratings yet
Database Types
9 pages
Dbms File
100% (4)
Dbms File
37 pages
Ijeme V13 N4 5
No ratings yet
Ijeme V13 N4 5
9 pages
Unit 3
No ratings yet
Unit 3
7 pages
Bda Ass Azlaan
No ratings yet
Bda Ass Azlaan
10 pages
Big Data Pyq 21-22
No ratings yet
Big Data Pyq 21-22
9 pages
Intro-Databases For Big Data
No ratings yet
Intro-Databases For Big Data
10 pages
Nosql Technology
No ratings yet
Nosql Technology
8 pages
Index: Mlbase Component, 100
No ratings yet
Index: Mlbase Component, 100
8 pages
Data Analytics
No ratings yet
Data Analytics
6 pages
App Dev Finals
No ratings yet
App Dev Finals
7 pages
Database Assignment
No ratings yet
Database Assignment
5 pages
Database
No ratings yet
Database
4 pages
ACS233025 M Talha
No ratings yet
ACS233025 M Talha
4 pages
Seminar Nosql
No ratings yet
Seminar Nosql
56 pages
They Come in Various Types
No ratings yet
They Come in Various Types
3 pages
20 - 04 - 2024 Cheatsheet
No ratings yet
20 - 04 - 2024 Cheatsheet
3 pages
Assignment SCM222 1581 2024 Stephen Maina GRP B
No ratings yet
Assignment SCM222 1581 2024 Stephen Maina GRP B
3 pages
IT Practical File CLASS 10 Project 11-15
No ratings yet
IT Practical File CLASS 10 Project 11-15
17 pages
Database Types
No ratings yet
Database Types
4 pages
NOSQL Databases
No ratings yet
NOSQL Databases
18 pages
1.mysql: Aim: To Study The Top 10 Databases Management Systems
No ratings yet
1.mysql: Aim: To Study The Top 10 Databases Management Systems
9 pages
Class 8 - MongoDB, Neo4j, InfluxDB, Cassandra
No ratings yet
Class 8 - MongoDB, Neo4j, InfluxDB, Cassandra
2 pages
Andhra Education Society: Dr. K.R.B.M Sr. Sec School
No ratings yet
Andhra Education Society: Dr. K.R.B.M Sr. Sec School
20 pages
Unit 4 MCQ
No ratings yet
Unit 4 MCQ
48 pages
Oracle 11g SQL Training - As Per Industry Standards: Course Highlights
No ratings yet
Oracle 11g SQL Training - As Per Industry Standards: Course Highlights
6 pages
DBMS Practice Questions
No ratings yet
DBMS Practice Questions
11 pages
Gartner Reprint
No ratings yet
Gartner Reprint
42 pages
Structured Query Language
No ratings yet
Structured Query Language
29 pages
Informix 4GL
No ratings yet
Informix 4GL
33 pages
SQL02 DML
No ratings yet
SQL02 DML
18 pages
Visvesvaraya Technological University: Data Base Management System
No ratings yet
Visvesvaraya Technological University: Data Base Management System
7 pages
1.purpose of Database System
No ratings yet
1.purpose of Database System
5 pages
Big Data Question Bank
No ratings yet
Big Data Question Bank
5 pages
ANL252 SU6 Jul2022
No ratings yet
ANL252 SU6 Jul2022
51 pages
Database - MySQL
No ratings yet
Database - MySQL
5 pages
SQL Injection Cheat Sheet
No ratings yet
SQL Injection Cheat Sheet
1 page
Outline: Multidatabase Query Processing
No ratings yet
Outline: Multidatabase Query Processing
41 pages
Already Table Exists in Oracle How To Append Table Row Using Data Pump - Google Search
No ratings yet
Already Table Exists in Oracle How To Append Table Row Using Data Pump - Google Search
2 pages
Dbms Mid-2 Unit-5 Longs Answers
No ratings yet
Dbms Mid-2 Unit-5 Longs Answers
50 pages
Dbms
No ratings yet
Dbms
11 pages
Database Fundamentals
No ratings yet
Database Fundamentals
31 pages
Train Ticket Booking System
No ratings yet
Train Ticket Booking System
53 pages
Logical Database Design
No ratings yet
Logical Database Design
20 pages
NoorLal 221370151
No ratings yet
NoorLal 221370151
3 pages
Trinity Institute of Professional Studies: Sec-9, DWARKA, NEW DELHI-110075
No ratings yet
Trinity Institute of Professional Studies: Sec-9, DWARKA, NEW DELHI-110075
10 pages
DuckDB - What's The Hype About - This Was A Blog Post That I Already - by Oliver Molander - Better Programming
No ratings yet
DuckDB - What's The Hype About - This Was A Blog Post That I Already - by Oliver Molander - Better Programming
19 pages
20BSC Worksheet 3.1
No ratings yet
20BSC Worksheet 3.1
8 pages
SQL - Data Definition and Data Manipulation Exercise
No ratings yet
SQL - Data Definition and Data Manipulation Exercise
9 pages
AlGhushaimi. نهائي قواعد بيانات 2020
No ratings yet
AlGhushaimi. نهائي قواعد بيانات 2020
9 pages
Normalization Dbms Int 306
No ratings yet
Normalization Dbms Int 306
2 pages
DBA's Guide to NoSQL
From Everand
DBA's Guide to NoSQL
The Enlightened DBA
5/5 (1)
The DynamoDB Handbook: Practical Solutions for Modern NoSQL Database Management
From Everand
The DynamoDB Handbook: Practical Solutions for Modern NoSQL Database Management
Robert Johnson
No ratings yet

04-2 Intro Nosql

Uploaded by

04-2 Intro Nosql

Uploaded by

28/11/2022

DB engines ranking according to their

RDBMS: one size fits all needs

ICDE 2005 conference

• Web applications have different needs

NoSQL use cases

• Massive data volume at scale (Big volume)

Relational data model revisited

• Data is usually stored in row by row

Key/value data model

• Simple key/value interface

Key/value vs. table

• A table with two columns and a simple

Key/value vs. Relational data model

• Open source in-memory key-value caching system

• Open source in-memory key-value store with optional

• Scalable key-value store

Column family store

• Dynamic schema, column-oriented data model

• Group columns into "Column families"

Column family data model vs. relational

• Sparse matrix, preserve table structure

• ACM TOCS 2008

• Open-source Bigtable, written in JAVA

• Apache open source column family database

Graph data model

• Core abstractions: Nodes, Relationships, Properties on both

Graph database store

• A database stored data in an explicitly graph structure

Linking open data

• Graph database designed to be easy to use by Java

• Documents, not value, not tables

• Open Source JSON data store created by 10gen

Thank you for your attention!

You might also like