4. NoSQL
Structured Data
Structured data conforms to a fixed, well-defined schema and fits neatly into the rows and columns of a relational table. (The example figure from the original slide is omitted.)
Unstructured Data
Unstructured data is defined as data that cannot be classified and does not fit into a fixed record format.
Such data cannot be stored in relational databases, as it does not possess any well-known structure.
It generally allows keyword-based queries or sophisticated conceptual queries.
It is written in a format that is easy for humans to understand.
It is becoming increasingly valuable and available.
It is difficult to unbox, understand, and analyze.
It refers to free-text data.
Examples of unstructured data include personal messages, business documents, web content, and so on.
Distributed system
A distributed system is a network that stores data on more than one node (physical or virtual machine) at the same time.
Scaling of Database
Vertical Scaling
It is also known as scaling up.
It is achieved by upgrading the hardware of an existing machine to fulfill the processing demands.
It is generally confined to a single machine.
It may hamper availability of resources at the time of the upgrade.
Scaling of Database
Horizontal Scaling
It is also known as scaling out.
It is achieved by adding hardware in parallel to the currently available hardware.
It is generally configured across multiple machines.
It does not hamper availability of resources.
It involves database sharding and database replication.
Database sharding refers to partitioning the data and storing it across multiple machines to allow concurrent access to the database.
Database replication refers to storing copies of the database on multiple machines to ensure availability and prevent a single point of failure.
CAP theorem
CAP stands for Consistency, Availability and Partition
Tolerance.
CAP theorem is used to describe the limitations of
distributed databases.
Consistency refers to maintaining the same state of data in all the replicas at any instant.
Availability refers to the ability to operate successfully at any instant, even if some node crashes.
Partition tolerance refers to the ability of the system to operate in the presence of a network partition.
The CAP theorem states that any distributed database with shared data can guarantee at most two of the three desirable properties (Consistency, Availability, and Partition Tolerance).
CAP theorem
The CAP theorem can be summarized as follows. For any distributed database, one of the following holds:
If a database guarantees availability and partition tolerance, it must forfeit consistency. E.g., Cassandra, CouchDB, and so on.
If a database guarantees consistency and partition tolerance, it must forfeit availability. E.g., HBase, MongoDB, and so on.
If a database guarantees availability and consistency, there is no possibility of a network partition. E.g., RDBMSs such as MySQL, Postgres, and so on.
Consistency: means that all clients see the same data at
the same time, no matter which node they connect to in a
distributed system. To achieve consistency, whenever data
is written to one node, it must be instantly forwarded or
replicated to all the other nodes in the system before the
write is deemed successful.
Availability: means that every non-failing node returns a
response for all read and write requests in a reasonable
amount of time, even if one or more nodes are down.
Stated another way: all working nodes in the distributed system return a valid response for any request, without failing or throwing an exception.
Partition Tolerance: means that the system continues to
operate despite arbitrary message loss or failure of part of
the system. In other words, even if there is a network
outage in the data center and some of the computers are
unreachable, still the system continues to perform.
Distributed systems guaranteeing partition tolerance can
gracefully recover from partitions once the partition heals.
The CAP theorem categorizes systems into three
categories:
CP (Consistent and Partition Tolerant) database: A
CP database delivers consistency and partition tolerance
at the expense of availability. When a partition occurs
between any two nodes, the system has to shut down the
non-consistent node (i.e., make it unavailable) until the
partition is resolved.
A partition refers to a communication break between nodes within a distributed system: if a node cannot receive any messages from another node in the system, there is a partition between the two nodes. A partition could be caused by a network failure, a server crash, or any other reason.
AP (Available and Partition Tolerant) database: An AP
database delivers availability and partition tolerance at the
expense of consistency. When a partition occurs, all nodes
remain available but those at the wrong end of a partition
might return an older version of data than others. When
the partition is resolved, the AP databases typically resync
the nodes to repair all inconsistencies in the system.
CA (Consistent and Available) database: A CA database delivers
consistency and availability in the absence of any network
partition. Often a single node’s DB servers are categorized
as CA systems. Single node DB servers do not need to deal
with partition tolerance and are thus considered CA
systems.
A diagram (omitted here) shows the classification of different databases based on the CAP theorem.
Why is absolute consistency generally sacrificed?
Large companies generally scale out horizontally, which means the system is spread over thousands of nodes and network partitions are effectively always present.
Per the CAP theorem, such companies must then choose between consistency and availability.
Because of the large amount of network partitioning, there is a high chance that some nodes will fail.
If data does not become available on time, the company suffers a huge loss.
So companies choose availability, both for financial gain and to retain client trust in their Big Data services.
In this sense, strict consistency is generally sacrificed.
To still maintain consistency, companies follow an eventual consistency model instead of strict or absolute consistency.
Technically, consistency refers to a situation where all the replica nodes have the exact same data at the exact same point in time.
Consistency Level (CL): the number of replica nodes that must acknowledge a read or write request for the whole operation/query to be successful.
The write CL controls how many replica nodes must acknowledge that they received and wrote the partition.
The read CL controls how many replica nodes must send their most recent copy of the partition to the coordinator.
Immediate consistency: having identical data on all replica nodes at any given point in time.
Eventual consistency: by controlling our read and write consistencies, we allow data to differ temporarily across replica nodes, while queries still return the most recent correct version of the partition data.
Tunable consistency: you can set the CL for each read and write request, so you control how consistent your data is. You can allow some queries to be immediately consistent and other queries to be eventually consistent.
Quorum consistency: a majority of the replica nodes, i.e. floor(n/2) + 1 of n replicas, must respond for the read or write to succeed.
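As a concrete illustration, here is a minimal Python sketch of the quorum rule (not tied to any particular database driver; the overlap remark in the final comment is the standard R + W > N argument):

# Minimal sketch of the quorum rule: a read or write succeeds once a
# majority of the n replicas acknowledge it.

def quorum(replication_factor: int) -> int:
    """Number of replica acknowledgements needed for QUORUM."""
    return replication_factor // 2 + 1

assert quorum(3) == 2   # with 3 replicas, 2 must respond
assert quorum(5) == 3   # with 5 replicas, 3 must respond

# If write CL + read CL exceeds the replication factor (e.g. QUORUM + QUORUM),
# every read overlaps at least one replica that holds the latest write.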
BASE: Basically Available, Soft
state, Eventual consistency
Basically available means the DB is available all the time, as per the CAP theorem.
Soft state means the state of the system may change over time.
Eventual consistency means that the system will become consistent over time.
NoSQL
A NoSQL database is a non-relational data management system that does not require a fixed schema. It avoids joins and is easy to scale.
The major purpose of using a NoSQL database is for
distributed data stores with humongous data storage needs.
NoSQL is used for Big data and real-time web apps. For
example, companies like Twitter, Facebook and Google
collect terabytes of user data every single day.
NoSQL stands for “Not Only SQL” or “Not SQL” (though a better term would be “NoREL”, NoSQL caught on).
Carlo Strozzi introduced the NoSQL concept in 1998.
NoSQL Taxonomy
NoSQL databases are mainly categorized into four types: key-value pair, column-oriented, graph-based, and document-oriented.
Every category has its unique attributes and limitations.
None of these databases is best for solving every problem.
Users should select the database based on their product needs.
Types of NoSQL databases:
Key-value pair based
Column-oriented
Graph-based
Document-oriented
Key-Value Pair Based
Data is stored in key/value pairs. This model is designed to handle lots of data and heavy load.
Key-value pair storage databases store data as a hash table where each key is unique, and the value can be JSON, a BLOB (Binary Large Object), a string, etc.
Redis, Dynamo, and Riak are some NoSQL examples of key-value store databases. They are all based on Amazon’s Dynamo paper.
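As a hedged illustration, a key-value interaction with the redis-py client might look like this (assumes the `redis` package and a local Redis server; the key name is made up):

import redis

# Connect to a local Redis server (host/port are the illustrative defaults).
r = redis.Redis(host="localhost", port=6379)

# Each key is unique; the value here is a JSON string, but it could just as
# well be a plain string or a binary blob.
r.set("user:42", '{"name": "Alice", "plan": "pro"}')

print(r.get("user:42"))  # b'{"name": "Alice", "plan": "pro"}'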
Column-based
Column-oriented databases work on columns and are based on the BigTable paper by Google. Every column is treated separately, and the values of a single column are stored contiguously.
Column-based NoSQL databases are widely used for data warehouses, business intelligence, CRM, and library card catalogs.
HBase, Cassandra, and Hypertable are examples of column-based databases.
Document-Oriented
A document-oriented NoSQL DB stores and retrieves data as key-value pairs, but the value part is stored as a document in JSON or XML format. The value is understood by the DB and can be queried.
Amazon SimpleDB, CouchDB, MongoDB, Riak, and Lotus Notes are popular document-oriented DBMSs.
Graph-Based
A graph database stores entities as well as the relations among those entities. Each entity is stored as a node, with relationships as edges; an edge captures a relationship between nodes. Every node and edge has a unique identifier.
Graph databases are mostly used for social networks, logistics, and spatial data.
Neo4J, Infinite Graph, OrientDB, and FlockDB are some popular graph-based databases.
HBase
Storage Mechanism in HBase
HBase is a column-oriented database and the tables in it are
sorted by row.
The table schema defines only column families, which are the
key value pairs.
A table can have multiple column families, and each column family can have any number of columns. Subsequent column values are stored contiguously on disk. Each cell value in the table has a timestamp.
In short, in HBase:
Table is a collection of rows.
Row is a collection of column families.
Column family is a collection of columns.
Column is a collection of key value pairs.
Given below is an example schema of a table in HBase (the original figure is omitted; see the sketch below).
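As a stand-in for the figure, here is a sketch using the HappyBase Python client (assumes the `happybase` package and a local HBase Thrift server; the table and family names are made up). It shows the key idea: column families are declared up front, while the columns inside them are created on write.

import happybase

conn = happybase.Connection("localhost")

# The schema names only the column families; columns within them are dynamic.
conn.create_table("employee", {"personal": {}, "professional": {}})

table = conn.table("employee")
table.put(b"row1", {
    b"personal:name": b"Alice",
    b"personal:city": b"Kathmandu",
    b"professional:designation": b"Engineer",
})

print(table.row(b"row1"))  # all cells of the row, keyed by family:qualifier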
Features of HBase
HBase is linearly scalable.
It has automatic failure support.
It provides consistent reads and writes.
It integrates with Hadoop, both as a source and a
destination.
It has an easy Java API for clients.
It provides data replication across clusters.
Automatic Recovery from Failure using the Write-Ahead Log (WAL)
HFile: stores the rows of data as sorted key-values on disk.
MemStore: a write cache that stores new data that has not yet been written to disk; there is one MemStore per column family.
An HBase Store hosts a MemStore and zero or more StoreFiles (HFiles). A Store corresponds to a column family of a table for a given region.
The Write-Ahead Log (WAL) records all changes to data in HBase to file-based storage. If a RegionServer crashes or becomes unavailable before the MemStore is flushed, the WAL ensures that the changes to the data can be replayed.
HBase Architecture
HBase has three crucial components:
ZooKeeper, used for monitoring and coordination.
HMaster server, which assigns regions and handles load balancing.
Region Servers, which serve data for reads and writes; they run on the different computers of the Hadoop cluster. Each Region Server holds regions along with an HLog, Stores, and MemStores.
HMaster
HMaster in HBase is the implementation of a Master
server in HBase architecture.
It acts as a monitoring agent to monitor all Region Server
instances present in the cluster and acts as an interface for
all the metadata changes.
In a distributed cluster environment, Master runs on
NameNode. Master runs several background threads.
HMaster
The following are important roles performed by HMaster in HBase:
It plays a vital role in terms of performance and in maintaining the nodes of the cluster.
It provides admin functions and distributes services to different region servers.
It assigns regions to region servers.
It controls load balancing and failover to handle the load over the nodes present in the cluster.
When a client wants to change a schema or any metadata, HMaster takes responsibility for these operations.
HMaster
Some of the methods exposed by the HMaster interface are primarily metadata-oriented:
Table (createTable, removeTable, enable, disable)
ColumnFamily (addColumn, modifyColumn)
Region (move, assign)
The client communicates bi-directionally with both HMaster and ZooKeeper. For read and write operations, it contacts HRegion servers directly. HMaster assigns regions to region servers and, in turn, checks the health status of the region servers.
The architecture contains multiple region servers; the HLog present in each region server stores all the log files.
HBase Region Servers
When an HBase Region Server receives write and read requests from a client, it assigns the request to the specific region where the actual column family resides.
The client can contact HRegion servers directly; HMaster's permission is not required for the client to communicate with HRegion servers.
The client requires HMaster's help only for operations related to metadata and schema changes.
HRegionServer is the Region Server implementation. It is
responsible for serving and managing regions or data that is
present in a distributed cluster. The region servers run on Data
Nodes present in the Hadoop cluster.
HBase Region Servers
HMaster stays in contact with multiple HRegion servers, which perform the following functions:
Hosting and managing regions
Splitting regions automatically
Handling read and writes requests
Communicating with the client directly
HBase Regions
HRegions are the basic building elements of an HBase cluster; they hold the distributed portions of tables and are composed of column families.
A region contains multiple Stores, one for each column family.
It consists of mainly two components: the MemStore and HFiles.
ZooKeeper
HBase's ZooKeeper is a centralized monitoring server which maintains configuration information and provides distributed synchronization, i.e. coordination services between the nodes of the distributed applications running across the cluster. If a client wants to communicate with regions, it has to approach ZooKeeper first.
ZooKeeper is an open-source project and provides many important services.
ZooKeeper
Services provided by ZooKeeper
Maintains configuration information
Provides distributed synchronization
Establishes client communication with region servers
Provides ephemeral nodes, which represent the different region servers
Lets master servers use the ephemeral nodes to discover available servers in the cluster
Tracks server failures and network partitions
ZooKeeper
The master and the HBase slave nodes (region servers) register themselves with ZooKeeper. The client needs access to the ZK (ZooKeeper) quorum configuration to connect with the master and region servers.
When nodes in the HBase cluster fail, the ZK quorum triggers error messages and starts repairing the failed nodes.
The META table holds the locations of the regions in the cluster.
HBase: Read Mechanism
The client sends a request to ZooKeeper to get the region server that hosts the META table.
The client then queries that META server to get the region server corresponding to the row key it wants to access.
The client caches this information along with the META table location.
Finally, the client contacts the region server responsible for that row key, and the region server answers with the requested row or rows. (A toy model of the whole read path is sketched below.)
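A self-contained, purely illustrative Python model of this read path (all names and the dict-based "servers" are made up; ZooKeeper's role of locating META itself is elided):

# META maps row keys to region servers (real HBase maps key *ranges*).
META = {"t1": {"row1": "regionserver-a", "row2": "regionserver-b"}}
DATA = {"regionserver-a": {"row1": {"cf:name": "Alice"}},
        "regionserver-b": {"row2": {"cf:name": "Bob"}}}

cache = {}  # the client caches META lookups alongside the META location

def get(table: str, row_key: str) -> dict:
    if (table, row_key) not in cache:
        # Steps 1-2: ask META which region server hosts this row key.
        cache[(table, row_key)] = META[table][row_key]
    # Step 3: read the row directly from that region server.
    return DATA[cache[(table, row_key)]][row_key]

print(get("t1", "row1"))  # {'cf:name': 'Alice'}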
HBase Write Mechanism
1. Write the data to the write-ahead log (WAL); HBase always has the WAL to look into if any error occurs while writing data.
2. Once the data is written to the WAL, it is copied to the MemStore.
3. Once the data is placed in the MemStore, the client receives the acknowledgement (ACK).
4. When the MemStore reaches its threshold, it flushes (commits) the data into an HFile.
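The same four steps as a toy Python model (illustrative only, not HBase code):

class ToyRegionServer:
    """Models the WAL -> MemStore -> HFile write path."""

    def __init__(self, flush_threshold: int = 3):
        self.wal = []        # write-ahead log: survives a crash
        self.memstore = {}   # in-memory write cache (per column family in real HBase)
        self.hfiles = []     # immutable sorted files on disk
        self.flush_threshold = flush_threshold

    def put(self, row, value):
        self.wal.append((row, value))   # 1. log first, for recovery
        self.memstore[row] = value      # 2. then buffer in memory
        # 3. the client would receive its ACK here
        if len(self.memstore) >= self.flush_threshold:
            self.flush()                # 4. spill to an HFile at the threshold

    def flush(self):
        self.hfiles.append(sorted(self.memstore.items()))  # sorted key-values
        self.memstore.clear()

    def recover(self):
        # Replay the WAL to rebuild a MemStore lost in a crash.
        for row, value in self.wal:
            self.memstore[row] = value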
Cassandra
Apache Cassandra is an open-source, distributed, NoSQL
database. It presents a partitioned wide column storage
model with eventually consistent semantics.
The reasons for choosing Cassandra are as follows:
Value availability over consistency
Require high write throughput
High scalability required
No single point of failure
Features of Cassandra Architecture
No masters and slaves (peer-to-peer).
Ring-type architecture
Automatic data distribution across all nodes
Replication of data across nodes
Data kept in memory and written to disk in a lazy
fashion
Hash values of the keys are used to distribute data
among nodes
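As a rough illustration of the last point, here is a simplified placement sketch in Python (Cassandra's real ring uses partitioner tokens and virtual nodes; the node names are made up):

import hashlib
from bisect import bisect

NODES = ["node-a", "node-b", "node-c", "node-d"]
RING_SIZE = 2**32

def token(value: str) -> int:
    # Stable hash of a key or node name, mapped onto the ring.
    return int(hashlib.md5(value.encode()).hexdigest(), 16) % RING_SIZE

# Each node owns a position on the ring.
ring = sorted((token(n), n) for n in NODES)

def replicas(key: str, rf: int = 3) -> list:
    """The first `rf` nodes clockwise from the key's token."""
    start = bisect(ring, (token(key),)) % len(ring)
    return [ring[(start + i) % len(ring)][1] for i in range(rf)]

print(replicas("user:42"))  # e.g. ['node-c', 'node-d', 'node-a']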
The key components of Cassandra are as follows:
Node - the place where data is stored.
Data center - a collection of related nodes.
Cluster - a collection of one or more data centers.
Commit log - every write operation is written here first, for crash recovery.
Mem table - after the commit log, data is written to the mem table.
SSTable (Sorted Strings Table) - the disk file to which data is flushed from the mem table when its contents reach a threshold value.
Bloom filter - a quick algorithm for testing whether an element is a member of a set. (A minimal sketch follows this list.)
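Since a Bloom filter is just a probabilistic membership test, a minimal Python sketch looks like this (illustrative; Cassandra's actual implementation differs):

# Minimal Bloom filter: k hash functions set/check bits in a bit array.
# False positives are possible; false negatives are not.
import hashlib

class BloomFilter:
    def __init__(self, size=1024, hashes=3):
        self.size, self.hashes = size, hashes
        self.bits = [False] * size

    def _positions(self, item: str):
        for i in range(self.hashes):
            digest = hashlib.sha256(f"{i}:{item}".encode()).hexdigest()
            yield int(digest, 16) % self.size

    def add(self, item: str):
        for pos in self._positions(item):
            self.bits[pos] = True

    def might_contain(self, item: str) -> bool:
        return all(self.bits[pos] for pos in self._positions(item))

bf = BloomFilter()
bf.add("row-17")
print(bf.might_contain("row-17"))  # True
print(bf.might_contain("row-99"))  # almost certainly False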
Cassandra: Ring-Type Architecture
Cassandra: Write Process
Cassandra Data Model
Cassandra is a NoSQL database, which is a key-value store.
Some of the features of the Cassandra data model are as follows:
Data in Cassandra is stored as a set of rows that are
organized into tables.
Tables are also called column families.
Each Row is identified by a primary key value.
Data is partitioned by the primary key.
We can get the entire data or some data based on the
primary key.
Tunable Consistency – Write CL (Consistency
Level)
Write consistency means having consistent data (immediate or
eventual) after your write query to your Cassandra cluster.
Write CL controls how many replica nodes must acknowledge
that they received and wrote the partition.
You can tune the write consistency for performance (by setting the write CL to ONE) or for immediate consistency of critical pieces of data (by setting the write CL to ALL). Here is how it works:
A client sends a write request to the coordinator.
The coordinator forwards the write request (INSERT, UPDATE, or DELETE) to all replica nodes, whatever write CL you have set.
The coordinator waits for n replica nodes to respond, where n is set by the write CL.
The coordinator sends the response back to the client.
Tunable Consistency – Read CL (Consistency Level)
Read CL controls how many replica nodes must send their most
recent copy of partition to the coordinator.
Read consistency refers to having the same data on all replica nodes for any read request. Here is how it works:
A client sends a read request to the coordinator.
The coordinator forwards the read (SELECT) request
to n number of replica nodes. n is set by the read CL.
The coordinator waits for n number of replica nodes to
respond.
The coordinator then merges (finds out most recent copy of
written data) the n number of responses to a single response
and sends response to the client.
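With the DataStax Python driver (`cassandra-driver`), per-request consistency levels can be set roughly like this (the keyspace, table, and addresses are illustrative):

from cassandra import ConsistencyLevel
from cassandra.cluster import Cluster
from cassandra.query import SimpleStatement

cluster = Cluster(["127.0.0.1"])
session = cluster.connect("my_keyspace")

# Critical write: wait for a majority of replicas to acknowledge.
write = SimpleStatement(
    "INSERT INTO users (id, name) VALUES (%s, %s)",
    consistency_level=ConsistencyLevel.QUORUM)
session.execute(write, (42, "Alice"))

# Fast read: one replica's answer is good enough here.
read = SimpleStatement(
    "SELECT name FROM users WHERE id = %s",
    consistency_level=ConsistencyLevel.ONE)
print(session.execute(read, (42,)).one())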
MongoDB
MongoDB is a document-oriented NoSQL database used
for high volume data storage.
Instead of using tables and rows as in the relational
databases, MongoDB makes use of collections and
documents.
Documents consist of key-value pairs which are the basic
unit of data in MongoDB.
Collections contain sets of documents and function as the equivalent of relational database tables.
MongoDB Example
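The original example figure is omitted; as a substitute, a short PyMongo sketch (assumes the `pymongo` package and a local server; database, collection, and field names are made up) shows documents, a collection, and the dynamic schema:

from pymongo import MongoClient

client = MongoClient("mongodb://localhost:27017")
db = client["school"]        # database: a physical container for collections
students = db["students"]    # collection: the analogue of an RDBMS table

# Documents are key-value pairs; two documents in the same collection
# do not need the same fields.
students.insert_one({"name": "Alice", "age": 21, "courses": ["DB", "ML"]})
students.insert_one({"name": "Bob", "email": "bob@example.com"})  # no age

print(students.find_one({"name": "Alice"}))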
Database: A database is a physical container for collections.
Each database gets its own set of files on the file system. A
single MongoDB server typically has multiple databases.
Collection: A collection is a group of documents and is
similar to an RDBMS table. A collection exists within a
single database. Collections do not enforce a schema.
Documents within a collection can have different fields.
Document: A document is a set of key-value pairs.
Documents have dynamic schema. Dynamic schema means
that documents in the same collection do not need to have
the same set of fields or structure, and common fields in a
collection’s documents may hold different types of data.
Sharding
Sharding is a method for distributing data across multiple
machines. MongoDB uses sharding to support
deployments with very large data sets and high throughput
operations.
Database systems with large data sets or high throughput
applications can challenge the capacity of a single server.
For example, high query rates can exhaust the CPU
capacity of the server. Working set sizes larger than the
system’s RAM stress the I/O capacity of disk drives.
MongoDB supports horizontal scaling through sharding.
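A toy hashed-shard-key router in Python (illustrative only; real MongoDB routes through mongos using chunk metadata held on the config servers, described below):

import hashlib

SHARDS = ["shard0", "shard1", "shard2"]

def route(shard_key_value: str) -> str:
    """Pick the shard that owns a document, by hashing its shard key."""
    h = int(hashlib.sha256(shard_key_value.encode()).hexdigest(), 16)
    return SHARDS[h % len(SHARDS)]

print(route("user:42"))  # the same key always routes to the same shard
print(route("user:43"))  # a different key likely lands elsewhere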
Sharding cluster
Shard: Each shard contains a subset of the sharded data.
Each shard can be deployed as a replica set to provide
redundancy and high availability. Together, the cluster’s
shards hold the entire data set for the cluster.
Mongos: The mongos acts as a query router, providing an
interface between client applications and the sharded
cluster.
Config Servers: Config servers store metadata and
configuration settings for the cluster. They are also
deployed as a replica set.