0% found this document useful (0 votes)

162 views27 pages

Introduction To: Nosql

NoSQL databases are non-relational databases designed for large scale data storage needs. They are more flexible than traditional SQL databases as they do not require fixed schemas and typically scale horizontally. The four main categories of NoSQL databases are key-value stores, column-oriented databases, graph databases, and document databases. Each has their own advantages for different data storage needs. NoSQL databases sacrifice consistency to achieve high availability and partition tolerance as stated by the CAP theorem.

Uploaded by

Dileepp Choudhary

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

162 views27 pages

Introduction To: Nosql

Uploaded by

Dileepp Choudhary

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 27

INTRODUCTION TO

NOSQL
COMPUTER SCIENCE AND ENGINEERING
(DATA SCIENCE)

Presented by
V. Nagarjuna
HISTORY OF NOSQL

 The term NoSQL was coined by Carlo Strozzi in the year 1998. He used this term to name his
Open Source, Light Weight, Database which did not have an SQL interface.

 In the early 2009, when last.fm wanted to organize an event on open-source distributed
databases, Eric Evans, a Rackspace employee, reused the term to refer databases which are
non-relational, distributed, and does not conform to atomicity, consistency, isolation,
durability - four obvious features of traditional relational database systems.

 In the same year, the "no:sql(east)" conference held in Atlanta, USA, NoSQL was discussed
and debated a lot.

 And then, discussion and practice of NoSQL got a momentum, and NoSQL saw an
unprecedented growth.
HISTORY OF NOSQL
NOSQL……?

NoSQL is a non-relational database management systems, different from

traditional relational database management systems in some significant ways.

NoSQL is designed for distributed data stores where very large scale of data

storing needs (for example Google or Facebook which collects terabits of data

every day for their users). These type of data storing may not require fixed

schema, avoid join operations and typically scale horizontally.

WHY NOSQL?

In today’s time data is becoming easier to access and capture through third

parties such as Facebook, Google+ and others. Personal user information, social

graphs, geo location data, user-generated content and machine logging data are

just a few examples where the data has been increasing exponentially. To avail the

above service properly, it is required to process huge amount of data. Which SQL

databases were never designed. The evolution of NoSql databases is to handle

these huge data properly.

RDBMS VS NOSQL
RDBMS
Structured and organized data
Structured query language (SQL)
Data and its relationships are stored in separate tables.
Data Manipulation Language, Data Definition Language
Tight Consistency
NoSQL
Stands for Not Only SQL
No declarative query language
No predefined schema
Key-Value pair storage, Column Store, Document Store, Graph databases
Eventual consistency rather ACID property
Unstructured and unpredictable data
CAP THEOREM (BREWER’S THEOREM)
Understand the CAP theorem when you talk about NoSQL databases or in fact when
designing any distributed system. CAP theorem states that there are three basic
requirements which exist in a special relation when designing applications for a
distributed architecture.
Consistency:

This means that the data in the database remains consistent after the execution of an operation. For
example after an update operation all clients see the same data.

Availability :

This means that the system is always on (service guarantee availability), no downtime.

Partition Tolerance :

This means that the system continues to function even the communication among the servers is unreliable,
i.e. the servers may be partitioned into multiple groups that cannot communicate with one another.
CA - Single site cluster, all nodes are always in contact. When a partition occurs, the system blocks.

CP-Some data may not be accessible, but the rest is still consistent/accurate.

AP -System is still available under partitioning, but some of the data returned may be inaccurate.
NOSQL PROS/CONS
Advantages :

• High scalability
• Distributed Computing
• Lower cost
• Schema flexibility, semi-structure data
• No complicated Relationships

Disadvantages

• No standardization
• Limited query capabilities (so far)
THE BASE

The CAP theorem states that a distributed computer system cannot guarantee all of the
following three properties at the same time:
Consistency
Availability
Partition tolerance
A BASE system gives up on consistency.
o Basically Available indicates that the system does guarantee availability, in terms of
the CAP theorem.
o Soft state indicates that the state of the system may change over time, even without
input. This is because of the eventual consistency model.
o Eventual consistency indicates that the system will become consistent over time, given
that the system doesn't receive input during that time.
ACID VS BASE

ACID BASE

Atomic Basically Available

Consistency Soft state

Isolation Eventual consistency

Durable
NOSQL CATEGORIES
There are four general types (most common categories) of NoSQL databases.
Each of these categories has its own specific attributes and limitations. There is not
a single solutions which is better than all the others, however there are some
databases that are better to solve specific problems. To clarify the NoSQL databases,
lets discuss the most common categories :

• Key-value stores

• Column-oriented

• Graph

• Document oriented
KEY-VALUE STORES

Key-value stores are most basic types of NoSQL databases.

Designed to handle huge amounts of data.

Based on Amazon’s Dynamo paper.

Key value stores allow developer to store schema-less data.

In the key-value storage, database stores data as hash table where each key is

unique and the value can be string, JSON, BLOB (Binary Large OBjec) etc.
KEY-VALUE STORES

 A key may be strings, hashes, lists, sets, sorted sets and values are stored against these keys.

 For example a key-value pair might consist of a key like "Name" that is associated with a

value like "Robin".

 Key-Value stores can be used as collections, dictionaries, associative arrays etc.

 Key-Value stores follow the 'Availability' and 'Partition' aspects of CAP theorem.

 Key-Values stores would work well for shopping cart contents, or individual values like

color schemes, a landing page URI, or a default account number.

 Example of Key-value store DataBase : Redis, Dynamo, Riak. etc.

PICTORIAL PRESENTATION
PICTORIAL PRESENTATION
COLUMN-ORIENTED DATABASES
 Column-oriented databases primarily work on columns and every column is treated
individually.
 Values of a single column are stored contiguously.
 Column stores data in column specific files.
 In Column stores, query processors work on columns too.
 Alldata within each column datafile have the same type which makes it ideal for
compression.
 Column stores can improve the performance of queries as it can access specific column
data.
 High performance on aggregation queries (e.g. COUNT, SUM, AVG, MIN, MAX).
 Workson data warehouses and business intelligence, customer relationship
management (CRM), Library card catalogs etc.
 Example of Column-oriented databases : BigTable, Cassandra, SimpleDB etc.
PICTORIAL PRESENTATION
GRAPH DATABASES

 A graph data structure consists of a finite (and possibly mutable) set of ordered pairs,
called edges or arcs, of certain entities called nodes or vertices.

 The following picture presents a labeled graph of 6 vertices and 7 edges.

GRAPH DATABASES
Graph Databases…?
 A graph database stores data in a graph.
 It is capable of elegantly representing any kind of data in a highly accessible way.
 A graph database is a collection of nodes and edges
 Each node represents an entity (such as a student or business) and each edge represents
a connection or relationship between two nodes.
 Every node and edge are defined by a unique identifier.
 Each node knows its adjacent nodes.
 As the number of nodes increases, the cost of a local step (or hop) remains the same.
 Index for lookups.
 Here is a comparison between the classic relational model and the graph model :
COMPARISON BETWEEN THE CLASSIC RELATIONAL MODEL AND THE
GRAPH MODEL

Relational model Graph model

Tables Vertices and Edges set

Rows Vertices

Columns Key/value pairs

Joins Edges

Example of Graph databases : OrientDB, Neo4J, Titan.etc.

PICTORIAL PRESENTATION
DOCUMENT ORIENTED DATABASES
 A collection of documents
 Data in this model is stored inside documents.
 A document is a key value collection where the key allows access to its
value.
 Documents are not typically forced to have a schema and therefore are
flexible and easy to change.
 Documents are stored into collections in order to group different kinds
of data.
 Documents can contain many different key-value pairs, or key-array
pairs, or even nested documents.
COMPARISON BETWEEN THE CLASSIC RELATIONAL MODEL AND
THE DOCUMENT MODEL

Relational model Document model

Tables Collections

Rows Documents

Columns Key/value pairs

Joins not available

Example of Document Oriented databases : MongoDB, CouchDB etc.

PICTORIAL PRESENTATION
PRODUCTION DEPLOYMENT
 There is a large number of companies using NoSQL. To name a few :

 Google

 Facebook

 Mozilla

 Adobe

 Foursquare

 LinkedIn

 Digg

 McGraw-Hill Education

 Vermont Public Radio

Thank You

Unit 5
No ratings yet
Unit 5
27 pages
1734787260059cloud Computing AKTU Notes Password Chaudhary - Unlocked
No ratings yet
1734787260059cloud Computing AKTU Notes Password Chaudhary - Unlocked
55 pages
Web Design
No ratings yet
Web Design
67 pages
Cassandra: Types of Nosql Databases
No ratings yet
Cassandra: Types of Nosql Databases
6 pages
NoSQL Databases UNIT-3
No ratings yet
NoSQL Databases UNIT-3
20 pages
Installation and Configuration of Virtualization Using KVM
100% (1)
Installation and Configuration of Virtualization Using KVM
7 pages
SQL Program Practic
100% (2)
SQL Program Practic
13 pages
Unit II Ui Design
No ratings yet
Unit II Ui Design
28 pages
Unit 1 Bda Complete Notes
No ratings yet
Unit 1 Bda Complete Notes
15 pages
Data Mining and Data Warehousing
No ratings yet
Data Mining and Data Warehousing
12 pages
CCS334 BIG DATA ANALYTICS Session 1 Intr
No ratings yet
CCS334 BIG DATA ANALYTICS Session 1 Intr
18 pages
Distributed System
100% (1)
Distributed System
119 pages
Unit-V: Database Management System
No ratings yet
Unit-V: Database Management System
5 pages
FDP Brochure PDF
100% (1)
FDP Brochure PDF
2 pages
Data Mining Models - GeeksforGeeks
No ratings yet
Data Mining Models - GeeksforGeeks
4 pages
Triggers Lecture
100% (1)
Triggers Lecture
27 pages
DBMS Unit 1 Notes
100% (1)
DBMS Unit 1 Notes
22 pages
DBMS Notes
No ratings yet
DBMS Notes
141 pages
BCS304-DSA Notes M-5
100% (1)
BCS304-DSA Notes M-5
22 pages
Nosql - Journey Ahead!: Origin: Punch Cards To Dbms
No ratings yet
Nosql - Journey Ahead!: Origin: Punch Cards To Dbms
54 pages
Nosql Databases: by Amy Alexander and Tanya Christina
No ratings yet
Nosql Databases: by Amy Alexander and Tanya Christina
14 pages
BDA Unit2 Complete
No ratings yet
BDA Unit2 Complete
56 pages
Ai Notes
No ratings yet
Ai Notes
31 pages
Semantic Web SN
No ratings yet
Semantic Web SN
22 pages
Module 4 Nosql
No ratings yet
Module 4 Nosql
8 pages
Bda Unit 1
No ratings yet
Bda Unit 1
32 pages
Software Engineering Notes (Unit-III)
No ratings yet
Software Engineering Notes (Unit-III)
21 pages
CC Module 5
No ratings yet
CC Module 5
26 pages
An Introduction To Microsoft Azure AI - Azure AI Essentials
No ratings yet
An Introduction To Microsoft Azure AI - Azure AI Essentials
3 pages
Mc5502 Bda Unit I Notes
No ratings yet
Mc5502 Bda Unit I Notes
106 pages
Big Data, Map Reduce & Hadoop: By: Surbhi Vyas (7) Varsha
No ratings yet
Big Data, Map Reduce & Hadoop: By: Surbhi Vyas (7) Varsha
40 pages
Data Views Lecture Note PDF
No ratings yet
Data Views Lecture Note PDF
34 pages
NOSQL
No ratings yet
NOSQL
23 pages
DBMS - Unit 4
No ratings yet
DBMS - Unit 4
22 pages
4.2 NoSQL Databases UNIT-1
No ratings yet
4.2 NoSQL Databases UNIT-1
35 pages
Dbms Lab Manual RGPV
No ratings yet
Dbms Lab Manual RGPV
38 pages
MCA - BigData Notes
No ratings yet
MCA - BigData Notes
136 pages
HBase
No ratings yet
HBase
36 pages
E Commerce Security Protocols: Presentation By: Jyotsna Mishra Id: 618057 BSC 6 Semester
No ratings yet
E Commerce Security Protocols: Presentation By: Jyotsna Mishra Id: 618057 BSC 6 Semester
8 pages
BE02000041 Funda of AI Unit 1 Introduction
No ratings yet
BE02000041 Funda of AI Unit 1 Introduction
63 pages
Cloud Computing Unit-1
No ratings yet
Cloud Computing Unit-1
61 pages
Hadoop Report
No ratings yet
Hadoop Report
110 pages
Unit - III
No ratings yet
Unit - III
34 pages
Database Management Systems Nov
No ratings yet
Database Management Systems Nov
6 pages
Views in SQL
No ratings yet
Views in SQL
10 pages
Business Data Analytics Question Bank
No ratings yet
Business Data Analytics Question Bank
2 pages
NoSql Notes
No ratings yet
NoSql Notes
4 pages
No SQL
No ratings yet
No SQL
12 pages
Notes For Unit 5 Mobile Computing Architecture
No ratings yet
Notes For Unit 5 Mobile Computing Architecture
29 pages
Big Data
No ratings yet
Big Data
22 pages
Unit 3
No ratings yet
Unit 3
28 pages
Mrcet R20 Iv 1 QB
No ratings yet
Mrcet R20 Iv 1 QB
79 pages
Class 11 Notes Computer Science Chap 10 (2024-25)
No ratings yet
Class 11 Notes Computer Science Chap 10 (2024-25)
25 pages
An Investigation of NoSQL Database Performance From A MYSQL Perspective
No ratings yet
An Investigation of NoSQL Database Performance From A MYSQL Perspective
3 pages
ADF
No ratings yet
ADF
54 pages
NoSql 2024 Assign2
No ratings yet
NoSql 2024 Assign2
189 pages
Introduction To Database Systems: Database Management Systems, R. Ramakrishnan and J. Gehrke 1
No ratings yet
Introduction To Database Systems: Database Management Systems, R. Ramakrishnan and J. Gehrke 1
21 pages
Rdbms Assignments 15
No ratings yet
Rdbms Assignments 15
41 pages
Project - 0x00. AirBnB Clone - The Console - ALX Africa Intranet
100% (1)
Project - 0x00. AirBnB Clone - The Console - ALX Africa Intranet
29 pages
DataWarehouseMining Complete Notes
No ratings yet
DataWarehouseMining Complete Notes
55 pages
Sri Aurobindo Institute of Technology: RDBMS LAB (ME-606)
No ratings yet
Sri Aurobindo Institute of Technology: RDBMS LAB (ME-606)
20 pages
Redis
No ratings yet
Redis
8 pages
Python Interview Questions
No ratings yet
Python Interview Questions
28 pages
Udemy, Inc. Is An American Massive Open Online
No ratings yet
Udemy, Inc. Is An American Massive Open Online
39 pages
Complete Java
No ratings yet
Complete Java
121 pages
Bits F232
No ratings yet
Bits F232
3 pages
L Glib PDF
100% (1)
L Glib PDF
46 pages
Cbse Python Language Basics - Unlocked
No ratings yet
Cbse Python Language Basics - Unlocked
11 pages
Python Bootcamp Slides
No ratings yet
Python Bootcamp Slides
35 pages
Advanced Applications of Python Data Structures and Algorithms
No ratings yet
Advanced Applications of Python Data Structures and Algorithms
318 pages
1
No ratings yet
1
49 pages
Python UNIT 2
No ratings yet
Python UNIT 2
31 pages
4 Btech - Cse - 13.03
No ratings yet
4 Btech - Cse - 13.03
52 pages
Computer Science File Kavya
No ratings yet
Computer Science File Kavya
34 pages
12 - CS Practical File 2024-25
No ratings yet
12 - CS Practical File 2024-25
44 pages
B Tech-CSBS
No ratings yet
B Tech-CSBS
44 pages
Python Book
No ratings yet
Python Book
70 pages
Appunti Data
No ratings yet
Appunti Data
61 pages
Python Dictionary
No ratings yet
Python Dictionary
62 pages
Raghav Pract..
No ratings yet
Raghav Pract..
36 pages
Python Lab Manual 2024
No ratings yet
Python Lab Manual 2024
20 pages
Internship Report - Anusha Shekar
No ratings yet
Internship Report - Anusha Shekar
28 pages
Python Practicals - 2022 - 23 - Qs
No ratings yet
Python Practicals - 2022 - 23 - Qs
5 pages
Lecture 1, Part 2: Introduction To Computing - Problem Solving and Data Manipulation
No ratings yet
Lecture 1, Part 2: Introduction To Computing - Problem Solving and Data Manipulation
46 pages
Class 12 Computer Science Practical File
No ratings yet
Class 12 Computer Science Practical File
30 pages
Dictionery
No ratings yet
Dictionery
9 pages
40 LINQ Methods
No ratings yet
40 LINQ Methods
12 pages
Python - Dictionary Exercises
No ratings yet
Python - Dictionary Exercises
1 page
Touchpad Plus Ver. 1.1 Class 7
From Everand
Touchpad Plus Ver. 1.1 Class 7
Nisha Batra
No ratings yet
AppDynamics Third Edition
From Everand
AppDynamics Third Edition
Gerardus Blokdyk
No ratings yet

Introduction To: Nosql

Uploaded by

Introduction To: Nosql

Uploaded by

INTRODUCTION TO

NoSQL is a non-relational database management systems, different from

traditional relational database management systems in some significant ways.

schema, avoid join operations and typically scale horizontally.

databases were never designed. The evolution of NoSql databases is to handle

these huge data properly.

Consistency Soft state

Isolation Eventual consistency

Key-value stores are most basic types of NoSQL databases.

Designed to handle huge amounts of data.

Based on Amazon’s Dynamo paper.

Key value stores allow developer to store schema-less data.

value like "Robin".

 Key-Value stores can be used as collections, dictionaries, associative arrays etc.

color schemes, a landing page URI, or a default account number.

 Example of Key-value store DataBase : Redis, Dynamo, Riak. etc.

 The following picture presents a labeled graph of 6 vertices and 7 edges.

Relational model Graph model

Columns Key/value pairs

Example of Graph databases : OrientDB, Neo4J, Titan.etc.

Relational model Document model

Columns Key/value pairs

Joins not available

Example of Document Oriented databases : MongoDB, CouchDB etc.

 Vermont Public Radio

You might also like