0% found this document useful (0 votes)
115 views17 pages

Nosql Database: New Era of Databases For Big Data Analytics - Classification, Characteristics and Comparison

This document discusses NoSQL databases and provides classifications and characteristics. It begins with an introduction to NoSQL databases and their uses for large datasets. It then classifies NoSQL databases into four main categories - key-value stores, document databases, wide-column stores, and graph databases - and describes their primary uses. The document also compares NoSQL databases, discusses their adoption trends, and concludes with speculating their major uses and providing an overview of non-relational NoSQL databases.

Uploaded by

james smith
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
115 views17 pages

Nosql Database: New Era of Databases For Big Data Analytics - Classification, Characteristics and Comparison

This document discusses NoSQL databases and provides classifications and characteristics. It begins with an introduction to NoSQL databases and their uses for large datasets. It then classifies NoSQL databases into four main categories - key-value stores, document databases, wide-column stores, and graph databases - and describes their primary uses. The document also compares NoSQL databases, discusses their adoption trends, and concludes with speculating their major uses and providing an overview of non-relational NoSQL databases.

Uploaded by

james smith
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 17

NOSQL DATABASE: NEW ERA OF DATABASES

FOR BIG DATA ANALYTICS -


CLASSIFICATION, CHARACTERISTICS AND
COMPARISON
GROUP MEMBERS
 DILIP BDR CHHETRI
 MAHESH BHANDARI
 SAGUN SHRESTHA
 NIRAJAN POUDEL
1.INTRODUCTION:
 NoSQL, for “Not Only SQL,” refers to an eclectic and increasingly familiar group of non-
relational data management systems; where databases are not built primarily on tables, and
generally do not use SQL for data manipulation.
 NoSQL database management systems are useful when working with a huge quantity of data when
the data's nature does not require a relational model.
 NoSQL database systems arose alongside major Internet companies, such as Google, Amazon, and
Facebook; which had challenges in dealing with huge quantities of data with conventional RDBMS
solutions could not cope.
 Originally motivated by Web 2.0 applications, these systems are designed to scale to thousands or
millions of users doing updates as well as reads, in contrast to traditional DBMSs and data
warehouses.
BACKGROUND:

There are two trends that bringing these problems to the attention of the international software community:
 The exponential growth of the volume of data generated by users, systems and sensors, further
accelerated by the concentration of large part of this volume on big distributed systems like Amazon,
Google and other cloud services.
 The increasing interdependency and complexity of data accelerated by the Internet, Web2.0, social
networks and open and standardized access to data sources from a large number of different systems.
CHARACTERISTICS OF NOSQL DATABASES:

 Strong Consistency: all clients see the same version of the data, even on updates to the dataset - e. g. by
means of the two-phase commit protocol (XA transactions), and ACID,
 High Availability: all clients can always find at least one copy of the requested data, even if some of the
machines in a cluster is down,
 Partition-tolerance: the total system keeps its characteristic even when being deployed on different
servers, transparent to the client.
 The CAP-Theorem postulates that only two of the three different aspects of scaling out are can be
achieved fully at the same time.
CLASSIFICATION OF NOSQL DATABASES

4.1 Key-Value stores:


 Typically, these DMS store items as alpha-numeric identifiers (keys) and associated values in simple,
standalone tables (referred to as hash tables‖).
 The values may be simple text strings or more complex lists and sets.
• primary use :
 The simplicity of Key-Value Stores makes them ideally suited to lightning-fast, highlyscalable retrieval
of the values needed for application tasks like managing user profiles or sessions or retrieving product
names.
• 4.2 Document databases:
 Inspired by Lotus Notes, document databases were, as their name implies, designed to manage and store
documents.
 These documents are encoded in a standard data exchange format such as XML, JSON (Javascript
Option Notation) or BSON (Binary JSON).
. Primary use:
> Document databases are good for storing and managing Big Data-size collections of literal documents,
like text documents, email messages, and XML documents, as well as conceptual ‖documents‖ like de-
normalized (aggregate) representations of a database entity such as a product or customer.
• 4.3 Wide-Column (or Column-Family) Stores (BigTable-implementations):

• Primary use:
 This type of DMS is great for Distributed data storage, Large-scale, batch-oriented data processing
and Exploratory and predictive analytics performed by expert statisticians and programmers.
• 4.4 Graph Databases:

Primary use:
 graph databases are useful when users are more interested in relationships between data than in the data
itself.
5. COMPARISON OF NOSQL DATABASE:

 In this section, some of NoSQL databases (four categories) with a matrix on basis of few attributes-
design, integrity, indexing, distribution, system are evaluated.
6. ADOPTION OF NOSQL DATABASE:
 In companies with more than 250 developers, nearly 70% will fund NoSQL projects over the course of
2012.
 Lack of scalability and high latency/low performance also ranked highly among the reasons given for
migrating to NoSQL
 0% overall say that NoSQL is very important or critical to their daily operations, with another 37%
indicating it is becoming more important.
7. CONCLUSION:

 Computational and storage requirements of applications such as for Big Data Analytics, Business
Intelligence and social networking over peta-byte datasets have pushed sql-like centralized databases to
their limits.
 We speculate some of the major (primarily) uses of NoSQL Databeses: Large-scale data processing
(parallel processing over distributed systems); Embedded IR (basic machine-to-machine information
look-up & retrieval); Exploratory analytics on semi-structured data (expert level); Large volume data
storage (unstructured, semi-structured, small-packet structured).
 This study report motivation to provide an independent understanding of the strengths and weaknesses
of various NoSQL database approaches to supporting applications that process huge volumes of data; as
well as to provide a global overview of this non-relational NoSQL databases.
REFERENCES:
• [1] http://en.wikipedia.org/wiki/NoSQL
• [2] Hecht, R., & Jablonski, S. (2011, December). NoSQL evaluation: A use case oriented survey. In Cloud and Service Computing (CSC),
2011 International Conference on (pp. 336-341). IEEE.
• [3] Use relational DBMS, N. (2009). Saying good-bye to DBMSs, designing effective interfaces. Communications of the ACM, 52(9).
• [4] Leavitt, N. (2010). Will NoSQL databases live up to their promise?. Computer,43(2), 12-14.
• [5] Abadi, D. J. (2009). Data management in the cloud: Limitations and opportunities. IEEE Data Eng. Bull, 32(1), 3-12.
• [6] Chang, Fay, et al. "Bigtable: A distributed storage system for structured data."ACM Transactions on Computer Systems (TOCS) 26.2
(2008): 4.
• [7] Lakshman, A., & Malik, P. (2010). Cassandra—A decentralized structured storage system. Operating systems review, 44(2), 35.
• [8] Konstantinou, I., Angelou, E., Boumpouka, C., Tsoumakos, D., & Koziris, N. (2011, October). On the elasticity of nosql databases over
cloud management platforms. In Proceedings of the 20th ACM international conference on Information and knowledge management
(pp. 2385-2388). ACM.
• [9] Russom, P. (2011). big data analytics. TDWI Best Practices Report, 4 th Quarter 2011.
• [10] Luhn, H. P. (1958). A business intelligence system. IBM Journal of Research and Development, 2(4), 314-319.
• [11] G. DeCandia, et al.,(2007) "Dynamo: amazon's highly available key-value store," in SOSP '07 Proceedings of twenty-first ACM SIGOPS,
New York, USA, pp. 205-220.
• [12] K. Orend, (2010) "Analysis and Classification of NoSQL Databases and Evaluation of their Ability to Replace an Object-relational
Persistence Layer," Master Thesis, Technical University of Munich, Munich.
• [13] R. Cattell, (2010) "Scalable SQL and NoSQL Data Stores," ACM SIGMODRecord, vol. 39.
• [14] Han, J., Haihong, E., Le, G., & Du, J. (2011, October). Survey on nosql database. In Pervasive Computing and
Applications (ICPCA), 2011 6th International Conference on (pp. 363-366). IEEE.
• [15] Tudorica, B. G., & Bucur, C. (2011, June). A comparison between several NoSQL databases with comments
and notes. In Roedunet International Conference (RoEduNet), 2011 10th (pp. 1-5). IEEE.
• [16] Padhy, R. P., Patra, M. R., & Satapathy, S. C. (2011). RDBMS to NoSQL: Reviewing Some Next-Generation Non-
Relational Database‗s‖. International Journal of Advanced Engineering Science and Technologies, 11(1), 15-30
• [17] Pokorny, J. (2011, December). NoSQL Databases: a step to database scalability in Web environment. In
Proceedings of the 13th International Conference on Information Integration and Web-based Applications and
Services (pp. 278-283). ACM.
• [18] Tiwari, S. (2011). Professional NoSQL. Wrox. (Book)
• [19] Thantriwatte, T. A. M. C., & Keppetiyagama, C. I. (2011, September). NoSQL query processing system for
wireless ad-hoc and sensor networks. InAdvances in ICT for Emerging Regions (ICTer), 2011 International
Conference on (pp. 78-82). IEEE.
• [20] Find InfiniteGraph from: http://objectivity.com/
• [21] Find Neo4j from: http://www.neo4j.org/
• [22] Find CouchDB from: http://couchdb.apache.org/
• [23] Find MongoDB from: http://www.mongodb.org/
• [24] Find Hypertable from: http://hypertable.org/
• [25] Find Riak from: http://basho.com/products/riak-overview/
• [26] Find Redis from: http://redis.io/
• [27] Find DynamoDB from: http://aws.amazon.com/dynamodb/
• [28] Find SimpleDB from: http://aws.amazon.com/simpledb/
• [29] Find Cassandra from: http://cassandra.apache.org/
• [30] Find Accumulo from: http://accumulo.apache.org/
• [31] Find neo4j from: http://www.neo4j.org/
• [32] ASID detail from web: http://en.wikipedia.org/wiki/ACID
• [33] HBase Databases from web: http://hbase.apache.org/
• [34] Microsoft SQL Server Databases from web: http://www.microsoft.com/enus/sqlserver/default.aspx
• [35] MySQL Databases from web: http://www.mysql.com/
• [36] Oracle Databases from web: http://www.oracle.com/us/products/database/overview/index.html
• [37] Couchbase Survey from web: http://
www.couchbase.com/press-releases/couchbase-surveyshows-accelerated-adoption-nosql-2012
• [38] Brewer's CAP Theorem, By Julian Browne on January 11, 2009 web: http://
www.julianbrowne.com/article/viewer/brewers-cap-theorem
• [39] Graph Databases, NOSQL and Neo4j from: http://www.infoq.com/articles/graph-nosql-neo4j
• [40] Potter, J. L. (1985). The Massively Parallel Processor.
• [41] http://hortonworks.com/blog/7-key-drivers-for-the-big-data-market/
• [42] Baxter, W. F., Gelinas, R. G., Guyer, J. M., Huck, D. R., Hunt, M. F., Keating, D. L., ... & Yeung, S. N. (1999). U.S. Patent No.
5,887,146. Washington, DC: U.S. Patent and Trademark Office.
• [43] Petrides, M., Alivisatos, B., Evans, A. C., & Meyer, E. (1993). Dissociation of human middorsolateral from posterior dorsolateral
frontal cortex in memory processing. Proceedings of the National Academy of Sciences, 90(3), 873-877.
• [44] http://www.slideshare.net/adorepump/voldemort-nosql
ANY QUESTION?
If yes
Google it
Else
Clap
THANK
YOU !!

You might also like