Aggregate Models in Big Data

Uploaded by

vishal.gahlot14

Aggregate Models In Big Data

Submitted By: Gurmohit Singh

SID: 18205002

Model: Key-value

Description: A key-value database (also known as a key-value store) is a type of NoSQL
database that stores data as a collection of key/value pairs, where each unique key
identifies exactly one value.

This is a simple method of storing data, and it is known to scale well.

The key-value pair is a well-established concept in many programming languages, where a
collection of such pairs is typically called an associative array, map, dictionary, or hash.
Pros: Simple data format makes write and read operations fast.

Values can be anything, including JSON documents, which allows flexible schemas.

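Cons: Optimized only for data with a single key and a single value. The value is opaque to
the database, so storing multiple fields in one value requires the application to
serialize and parse it.

Not optimized for lookup by value. Finding entries by anything other than the key requires
scanning the whole collection or maintaining separate index entries.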
Good For:
• User profiles
• Session information
• Article/blog comments
• Emails
• Status messages

Supported DBMS:
• Redis
• Oracle NoSQL Database
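
The key-value trade-offs described above can be illustrated with a minimal sketch. It uses a plain Python dictionary as a stand-in for a key-value store; the helper names (put, get, find_by_field) and the sample keys are hypothetical and do not correspond to the API of Redis or any other product.

```python
import json

# In-memory stand-in for a key-value store: each key maps to one opaque string value.
store = {}

def put(key, value):
    """Serialize the value to JSON; the store itself never inspects its contents."""
    store[key] = json.dumps(value)

def get(key):
    """Direct lookup by key, then parse the stored JSON back into Python data."""
    raw = store.get(key)
    return None if raw is None else json.loads(raw)

def find_by_field(field, wanted):
    """Lookup by value: every entry must be parsed and checked (a full scan)."""
    return sorted(
        key for key, raw in store.items()
        if json.loads(raw).get(field) == wanted
    )

put("user:1", {"name": "Asha", "city": "Delhi"})
put("user:2", {"name": "Ravi", "city": "Pune"})
put("session:9", {"user": "user:1", "city": "Delhi"})

profile = get("user:1")                      # fast: one lookup by key
delhi_keys = find_by_field("city", "Delhi")  # slow: scans the whole store
```

Reading by key touches a single entry, while the search by field has to parse every value, which is why such stores suit data that is always fetched by a known key (sessions, profiles).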

Model: Document

Description: A document database is a type of nonrelational database that is designed to
store and query data as JSON-like documents.
Document databases make it easier for developers to store and query data in a database by
using the same document-model format they use in their application code. The flexible,
semistructured, and hierarchical nature of documents and document databases allows
them to evolve with applications’ needs.

The document model works well with use cases such as catalogs, user profiles, and
content management systems where each document is unique and evolves over time.
Document databases enable flexible indexing, powerful ad hoc queries, and analytics over
collections of documents.
Pros:
• Nodes can be added on the fly for scalability (MongoDB detects them as they are added)
• Rich set of client libraries
• Uses BSON, a binary JSON-like format that is easy to work with
• Very fast inserts when write acknowledgement is relaxed (useful for workloads such as logging)
• Indexing fields is easy (if you need faster queries on a field, just index it; MongoDB
  makes that simple)
• Geospatial indexing and querying
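Cons:
• No relational-style joins (MongoDB offers only limited join support through the $lookup
  aggregation stage)
• Less flexible queries than SQL
• Complex jobs need Map-Reduce or the aggregation pipeline
• Unexpected failures can occur (generally not MongoDB's fault, but the result of wrong setup
  or configuration)
• Performance is good only as long as the indexes fit into memory
• Running a single node is dangerous (you may lose your data)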
Good For:
• Content management
• Catalogs

Supported DBMS:
• MongoDB
• Apache Cassandra (more accurately classified as a column-family store)
• Amazon DynamoDB
• Couchbase

Model: Column-family

Description: A columnar database is a database management system (DBMS) that stores data in
columns instead of rows.

The goal of a columnar database is to write and read data to and from disk storage
efficiently, in order to reduce the time it takes to return a query result.

In a columnar database, all the column 1 values are physically together, followed by all
the column 2 values, etc. The data is stored in record order, so the 100th entry for
column 1 and the 100th entry for column 2 belong to the same input record. This allows
individual data elements, such as customer name for instance, to be accessed in columns
as a group, rather than individually row-by-row.
Pros: Columnar databases have traditionally been developed with horizontal scalability
as a primary design goal. As such, they are particularly suited to "Big
Data" problems, living on clusters of tens, hundreds, or thousands of nodes.
They also tend to have built-in support for features such as compression and
versioning. The canonical example of a good columnar data storage problem
is indexing web pages. Pages on the Web are highly textual (which benefits from
compression), somewhat interrelated, and change over time (which benefits from
versioning).

Cons: Different columnar databases have different features and therefore different
drawbacks. But one thing they have in common is that it is best to design
your schema based on how you plan to query the data. This means you should
have some idea in advance of how your data will be used, not just what it will
consist of. If data usage patterns cannot be defined in advance (for example,
fast ad hoc reporting), then a columnar database may not be the best fit.

Good For: Large organisations that need to make the most

Supported DBMS:
• C-Store
• MonetDB
• LucidDB
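
The column layout described above can be sketched in a few lines. The sample records and the helper names (to_columns, record_at, run_length_encode) are hypothetical, and real columnar engines use far more elaborate storage and compression schemes; this only shows why keeping a column's values together helps.

```python
# Row-oriented input: one dictionary per record.
rows = [
    {"customer": "Asha", "city": "Delhi", "amount": 120},
    {"customer": "Ravi", "city": "Delhi", "amount": 80},
    {"customer": "Meera", "city": "Pune", "amount": 200},
]

def to_columns(records):
    """Store all values of each column together, keeping record order."""
    return {name: [record[name] for record in records] for name in records[0]}

def record_at(columns, i):
    """Rebuild record i: the i-th entry of every column belongs to the same record."""
    return {name: values[i] for name, values in columns.items()}

def run_length_encode(values):
    """Simple compression that works well when equal values sit next to each other."""
    encoded = []
    for value in values:
        if encoded and encoded[-1][0] == value:
            encoded[-1][1] += 1
        else:
            encoded.append([value, 1])
    return [tuple(pair) for pair in encoded]

columns = to_columns(rows)
total_amount = sum(columns["amount"])        # reads one column only
second_record = record_at(columns, 1)
compressed_city = run_length_encode(columns["city"])
```

Summing the amounts touches a single list instead of every record, and repeated values in one column compress naturally, which matches the compression benefit mentioned under Pros.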

Model: Graph-based

Description: Graph databases are purpose-built to store and navigate relationships. Relationships are
first-class citizens in graph databases, and most of the value of graph databases is derived
from these relationships. Graph databases use nodes to store data entities, and edges to
store relationships between entities. An edge always has a start node, end node, type, and
direction, and an edge can describe parent-child relationships, actions, ownership, and the
like. There is no limit to the number and kind of relationships a node can have.

A graph in a graph database can be traversed along specific edge types or across the entire
graph. In graph databases, traversing relationships is very fast because the
relationships between nodes are not calculated at query time but are persisted in the
database.
Pros: Graph databases seem to be tailor-made for networking applications. The prototypical
example is a social network, where nodes represent users who have various kinds of
relationships to each other. Modeling this kind of data using any of the other styles is
often a tough fit, but a graph database handles it naturally.

They also map well onto object-oriented systems, where objects hold references to other
objects.

Cons: Because of the high degree of interconnectedness between nodes, graph data is
generally hard to partition across the machines of a network.

As a result, graph databases generally do not scale out well.

Good For:
• Fraud detection
• Recommendation engines

Supported DBMS:
• Neo4j
• OrientDB
• Dgraph
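
A minimal sketch of the graph model described above: nodes connected by typed, directed edges that are stored up front and then followed during traversal. The node names, edge types, and the helpers neighbours and reachable are hypothetical and are not the query language of Neo4j or any other product.

```python
from collections import deque

# Each edge has a start node, a type, and an end node (direction: start -> end).
edges = [
    ("alice", "FOLLOWS", "bob"),
    ("bob", "FOLLOWS", "carol"),
    ("alice", "OWNS", "laptop"),
    ("carol", "FOLLOWS", "dave"),
]

# Relationships are stored per node in advance, not computed at query time.
adjacency = {}
for start, edge_type, end in edges:
    adjacency.setdefault(start, []).append((edge_type, end))

def neighbours(node, edge_type=None):
    """Follow stored edges from a node, optionally restricted to one edge type."""
    return [
        end for kind, end in adjacency.get(node, [])
        if edge_type is None or kind == edge_type
    ]

def reachable(start, edge_type=None):
    """Breadth-first traversal along the chosen edge type (or all edges)."""
    seen = {start}
    queue = deque([start])
    order = []
    while queue:
        node = queue.popleft()
        for nxt in neighbours(node, edge_type):
            if nxt not in seen:
                seen.add(nxt)
                order.append(nxt)
                queue.append(nxt)
    return order

follow_chain = reachable("alice", "FOLLOWS")
everything = reachable("alice")
```

Traversing along FOLLOWS edges only, or across the whole graph, corresponds to the two traversal styles mentioned in the description; a recommendation engine could, for example, suggest accounts found two or more hops away.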

