Nosql Database: New Era of Databases For Big Data Analytics - Classification, Characteristics and Comparison
Nosql Database: New Era of Databases For Big Data Analytics - Classification, Characteristics and Comparison
There are two trends that bringing these problems to the attention of the international software community:
The exponential growth of the volume of data generated by users, systems and sensors, further
accelerated by the concentration of large part of this volume on big distributed systems like Amazon,
Google and other cloud services.
The increasing interdependency and complexity of data accelerated by the Internet, Web2.0, social
networks and open and standardized access to data sources from a large number of different systems.
CHARACTERISTICS OF NOSQL DATABASES:
Strong Consistency: all clients see the same version of the data, even on updates to the dataset - e. g. by
means of the two-phase commit protocol (XA transactions), and ACID,
High Availability: all clients can always find at least one copy of the requested data, even if some of the
machines in a cluster is down,
Partition-tolerance: the total system keeps its characteristic even when being deployed on different
servers, transparent to the client.
The CAP-Theorem postulates that only two of the three different aspects of scaling out are can be
achieved fully at the same time.
CLASSIFICATION OF NOSQL DATABASES
• Primary use:
This type of DMS is great for Distributed data storage, Large-scale, batch-oriented data processing
and Exploratory and predictive analytics performed by expert statisticians and programmers.
• 4.4 Graph Databases:
Primary use:
graph databases are useful when users are more interested in relationships between data than in the data
itself.
5. COMPARISON OF NOSQL DATABASE:
In this section, some of NoSQL databases (four categories) with a matrix on basis of few attributes-
design, integrity, indexing, distribution, system are evaluated.
6. ADOPTION OF NOSQL DATABASE:
In companies with more than 250 developers, nearly 70% will fund NoSQL projects over the course of
2012.
Lack of scalability and high latency/low performance also ranked highly among the reasons given for
migrating to NoSQL
0% overall say that NoSQL is very important or critical to their daily operations, with another 37%
indicating it is becoming more important.
7. CONCLUSION:
Computational and storage requirements of applications such as for Big Data Analytics, Business
Intelligence and social networking over peta-byte datasets have pushed sql-like centralized databases to
their limits.
We speculate some of the major (primarily) uses of NoSQL Databeses: Large-scale data processing
(parallel processing over distributed systems); Embedded IR (basic machine-to-machine information
look-up & retrieval); Exploratory analytics on semi-structured data (expert level); Large volume data
storage (unstructured, semi-structured, small-packet structured).
This study report motivation to provide an independent understanding of the strengths and weaknesses
of various NoSQL database approaches to supporting applications that process huge volumes of data; as
well as to provide a global overview of this non-relational NoSQL databases.
REFERENCES:
• [1] http://en.wikipedia.org/wiki/NoSQL
• [2] Hecht, R., & Jablonski, S. (2011, December). NoSQL evaluation: A use case oriented survey. In Cloud and Service Computing (CSC),
2011 International Conference on (pp. 336-341). IEEE.
• [3] Use relational DBMS, N. (2009). Saying good-bye to DBMSs, designing effective interfaces. Communications of the ACM, 52(9).
• [4] Leavitt, N. (2010). Will NoSQL databases live up to their promise?. Computer,43(2), 12-14.
• [5] Abadi, D. J. (2009). Data management in the cloud: Limitations and opportunities. IEEE Data Eng. Bull, 32(1), 3-12.
• [6] Chang, Fay, et al. "Bigtable: A distributed storage system for structured data."ACM Transactions on Computer Systems (TOCS) 26.2
(2008): 4.
• [7] Lakshman, A., & Malik, P. (2010). Cassandra—A decentralized structured storage system. Operating systems review, 44(2), 35.
• [8] Konstantinou, I., Angelou, E., Boumpouka, C., Tsoumakos, D., & Koziris, N. (2011, October). On the elasticity of nosql databases over
cloud management platforms. In Proceedings of the 20th ACM international conference on Information and knowledge management
(pp. 2385-2388). ACM.
• [9] Russom, P. (2011). big data analytics. TDWI Best Practices Report, 4 th Quarter 2011.
• [10] Luhn, H. P. (1958). A business intelligence system. IBM Journal of Research and Development, 2(4), 314-319.
• [11] G. DeCandia, et al.,(2007) "Dynamo: amazon's highly available key-value store," in SOSP '07 Proceedings of twenty-first ACM SIGOPS,
New York, USA, pp. 205-220.
• [12] K. Orend, (2010) "Analysis and Classification of NoSQL Databases and Evaluation of their Ability to Replace an Object-relational
Persistence Layer," Master Thesis, Technical University of Munich, Munich.
• [13] R. Cattell, (2010) "Scalable SQL and NoSQL Data Stores," ACM SIGMODRecord, vol. 39.
• [14] Han, J., Haihong, E., Le, G., & Du, J. (2011, October). Survey on nosql database. In Pervasive Computing and
Applications (ICPCA), 2011 6th International Conference on (pp. 363-366). IEEE.
• [15] Tudorica, B. G., & Bucur, C. (2011, June). A comparison between several NoSQL databases with comments
and notes. In Roedunet International Conference (RoEduNet), 2011 10th (pp. 1-5). IEEE.
• [16] Padhy, R. P., Patra, M. R., & Satapathy, S. C. (2011). RDBMS to NoSQL: Reviewing Some Next-Generation Non-
Relational Database‗s‖. International Journal of Advanced Engineering Science and Technologies, 11(1), 15-30
• [17] Pokorny, J. (2011, December). NoSQL Databases: a step to database scalability in Web environment. In
Proceedings of the 13th International Conference on Information Integration and Web-based Applications and
Services (pp. 278-283). ACM.
• [18] Tiwari, S. (2011). Professional NoSQL. Wrox. (Book)
• [19] Thantriwatte, T. A. M. C., & Keppetiyagama, C. I. (2011, September). NoSQL query processing system for
wireless ad-hoc and sensor networks. InAdvances in ICT for Emerging Regions (ICTer), 2011 International
Conference on (pp. 78-82). IEEE.
• [20] Find InfiniteGraph from: http://objectivity.com/
• [21] Find Neo4j from: http://www.neo4j.org/
• [22] Find CouchDB from: http://couchdb.apache.org/
• [23] Find MongoDB from: http://www.mongodb.org/
• [24] Find Hypertable from: http://hypertable.org/
• [25] Find Riak from: http://basho.com/products/riak-overview/
• [26] Find Redis from: http://redis.io/
• [27] Find DynamoDB from: http://aws.amazon.com/dynamodb/
• [28] Find SimpleDB from: http://aws.amazon.com/simpledb/
• [29] Find Cassandra from: http://cassandra.apache.org/
• [30] Find Accumulo from: http://accumulo.apache.org/
• [31] Find neo4j from: http://www.neo4j.org/
• [32] ASID detail from web: http://en.wikipedia.org/wiki/ACID
• [33] HBase Databases from web: http://hbase.apache.org/
• [34] Microsoft SQL Server Databases from web: http://www.microsoft.com/enus/sqlserver/default.aspx
• [35] MySQL Databases from web: http://www.mysql.com/
• [36] Oracle Databases from web: http://www.oracle.com/us/products/database/overview/index.html
• [37] Couchbase Survey from web: http://
www.couchbase.com/press-releases/couchbase-surveyshows-accelerated-adoption-nosql-2012
• [38] Brewer's CAP Theorem, By Julian Browne on January 11, 2009 web: http://
www.julianbrowne.com/article/viewer/brewers-cap-theorem
• [39] Graph Databases, NOSQL and Neo4j from: http://www.infoq.com/articles/graph-nosql-neo4j
• [40] Potter, J. L. (1985). The Massively Parallel Processor.
• [41] http://hortonworks.com/blog/7-key-drivers-for-the-big-data-market/
• [42] Baxter, W. F., Gelinas, R. G., Guyer, J. M., Huck, D. R., Hunt, M. F., Keating, D. L., ... & Yeung, S. N. (1999). U.S. Patent No.
5,887,146. Washington, DC: U.S. Patent and Trademark Office.
• [43] Petrides, M., Alivisatos, B., Evans, A. C., & Meyer, E. (1993). Dissociation of human middorsolateral from posterior dorsolateral
frontal cortex in memory processing. Proceedings of the National Academy of Sciences, 90(3), 873-877.
• [44] http://www.slideshare.net/adorepump/voldemort-nosql
ANY QUESTION?
If yes
Google it
Else
Clap
THANK
YOU !!