More Details On Data Models

Graph databases are suited for data with complex interconnected relationships. They model data as nodes connected by edges, allowing for efficient traversal of relationships. In contrast, most NoSQL databases use a simpler aggregate-oriented model with references between large records. Materialized views can pre-compute and cache queries to provide alternative structures for accessing data organized in aggregates.

Uploaded by

chitraalavani

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

100 views23 pages

More Details On Data Models

Uploaded by

chitraalavani

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 23

More Details on Data Models

Relationships
• Aggregates are useful in that they put together
data that is commonly accessed together.
• But there are still lots of cases where data that’s
related is accessed differently.
• An important aspect of relationships between
aggregates is how they handle updates.
• If you update multiple aggregates at once, you
have to deal yourself with a failure partway
through.
Graph Databases
• Graph databases are an odd fish in the NoSQL pond.
• Most NoSQL databases were inspired by the need
to run on clusters, which led to aggregate-oriented
data models of large records with simple
connections.
• Graph databases are motivated by a different
frustration with relational databases and thus have
an opposite model—small records with complex
interconnections
Graph Databases
• Graph isn’t a bar chart or histogram; instead, we
refer to a graph data structure of nodes connected
by edges.
• The Figure have a web of information whose nodes
are very small (nothing more than a name) but there
is a rich structure of interconnections between them.
• With this structure, we can ask questions such as
“find the books in the Databases category that are
written by someone whom a friend of mine likes.”
Graph Databases
• Graph databases specialize in capturing this sort of
information—but on a much larger scale than a
readable diagram could capture
• The fundamental data model of a graph database is
very simple: nodes connected by edges (also called
arcs).
• Beyond this essential characteristic there is a lot of
variation in data models—in particular, what
mechanisms you have to store data in your nodes
and edges.
Graph Databases
• A quick sample of some current capabilities
illustrates this variety of possibilities:
– FlockDB is simply nodes and edges with no
mechanism for additional attributes
– Neo4J allows you to attach Java objects as
properties to nodes and edges in a schemaless
fashion
– Infinite Graph stores your Java objects,which are
subclasses of its built-in types, as nodes and edges.
Graph Databases
• Once you have built up a graph of nodes and edges,
a graph database allows you to query that network
with query operations designed with this kind of
graph in mind
• Important difference between relational and graph
database
– relational databases can implement relationships using
foreign keys, the joins required to navigate around can get
quite expensive
– Graph databases make traversal along the relationships
very cheap. A large part of this is because graph databases
shift most of the work of navigating relationships from
query time to insert time.
Graph Databases
Most of the time you find data by navigating
through the network of edges, with queries such
as “tell me all the things that both Anna and
Barbara like.” You do need a starting place,
however, so usually some nodes can be indexed
by an attribute such as ID. So you might start
with an ID lookup (i.e., look up the people
named “Anna” and “Barbara”) and then start
using the edges. Still, graph databases expect
most of your query work to be navigating
relationships.
Which Model to used when
Key Value
– We use it for : storing session information, user
profiles , preferences, shopping cart data.
– We would avoid it : when we need to query data
having relationships between entities.
Column based
– We use it for : content management systems, blogging
platforms, log aggregation.
– We would avoid it for : systems that are in early
development, changing query patterns.
Which Model to used when
Document Based
– We use it for : content management systems, blogging
platforms, web analytics, real-time analytics, e-commerce
applications.
– We would avoid it for : systems that need complex
transactions spanning multiple operations or queries
against varying aggregate structures.
Graph Based
– It is well suited for : connected data, such as social
networks, spatial data, routing information for goods and
supply.
Schemaless Databases
• Key-value store allows you to store any data you like under a
key
• Document databases make no restrictions on the structure of
the documents you store
• Column-family databases allow you to store any data under
any column you like
• Graph databases allow you to freely add new edges and freely
add properties to nodes and edges as you wish
Schemaless Databases
• NoSQL allows to easily change the data store
as we learn more about the project.
• NoSQL allows to add new things and stop
adding things not needed any more
• Schemaless store also make nonuniform data
– data where each record has a different set of
fields.
Pros and cons of schemaless data
• Pros:
– More freedom and flexibility
– You can easily change your data organization
– You can deal with non-uniform data
• Cons:
– A program that accesses data: .
• almost always relies on some form of implicit schema
• it assumes that certain fields are present
– The implicit schema is shifted into the application code that accesses
data
• To understand what data is present you have look at the application code
– The schema cannot be used to:
• decide how to store and retrieve data efficiently
• ensure data consistency
– Problems if multiple applications, developed by different people, access
the same database.
Schemaless Database
• Schemaless database shifts the schema into the
application code that accesses it.
• Encapsulate all database interaction within a single
application and integrate it with other applications
using web services. This fits in well with many
people’s current preference for using web services
for integration.
• Clearly define different areas of an aggregate for
access by different applications. These could be
different sections in a document database or
different column families in a column-family
database.
Schemaless Database
• Schema lessness does have a big impact on
changes of a database’s structure over time,
particularly for more uniform data.
• We have to exercise control when changing how
one store data in a schemaless database so that
one can easily access both old and new data.
• The flexibility that schemalessness gives you
only applies within an aggregate—if you need
to change your aggregate boundaries, the
migration is every bit as complex as it is in the
relational case.
Materialized Views
• A relational view is a table defined by computation over the
base tables
• Materialized views: computed in advance and cached on
disk
• NoSQL databases:
– do not have views
– have precomputed and cached queries usually called
“materialized view”
• Strategies to building a materialized view
– Eager approach
• the materialized view is updated at the same time of the base data .
good when you have more frequent reads than writes
– Detached approach
• batch jobs update the materialized views at regular intervals . good when
you don’t want to pay an overhead on each update
Modeling for Data Access
when modeling data aggregates we need to consider how the data is going to be read as
well as what are the side effects on data related to those aggregates.

• The application can read the customer’s

information and all the related data by
using the key
• If the requirements are to read the
orders or the products sold in each order,
the whole object has to be read and then
parsed on the client side to build the
results.
• When references are needed, we could
switch to document stores and then
query inside the documents, or even
change the data for the key-value store
to split the value object into Customer
and Order objects and then
maintain these objects’ references to
each other.
Modeling for Data Access
We can now find the orders
independently from the Customer, and
with the orderId reference in the
Customer we can find all Orders for the
Customer. Using aggregates this way
allows for read optimization, but we have
to push the orderId reference into
Customer every time with a new Order.
Key Points
• Aggregate-oriented databases make inter-aggregate
relationships more difficult to handle than intra-aggregate
relationships.
• Graph databases organize data into node and edge graphs;
they work best for data that has complex relationship
structures.
• Schemaless databases allow you to freely add fields to records,
but there is usually an implicit schema expected by users of
the data.
• Aggregate-oriented databases often compute materialized
views to provide data organized differently from their primary
aggregates. This is often done with map-reduce computations.

BW80S Factory Service Manual
100% (1)
BW80S Factory Service Manual
133 pages
Untitled
100% (2)
Untitled
728 pages
Data Mining Unit-IV
No ratings yet
Data Mining Unit-IV
37 pages
Unit 4 - Data Mining - WWW - Rgpvnotes.in
No ratings yet
Unit 4 - Data Mining - WWW - Rgpvnotes.in
12 pages
SACE Stage 2 Chemistry - Cation Exchange Capacity Deconstruct & Design
No ratings yet
SACE Stage 2 Chemistry - Cation Exchange Capacity Deconstruct & Design
7 pages
Velammal Vidyalaya: Section A (Objective Type)
No ratings yet
Velammal Vidyalaya: Section A (Objective Type)
7 pages
Chapter 5 JDBC Programming
No ratings yet
Chapter 5 JDBC Programming
25 pages
CCS334 BIG DATA ANALYTICS Session 1 Intr
No ratings yet
CCS334 BIG DATA ANALYTICS Session 1 Intr
18 pages
Aggregate Data Models
100% (1)
Aggregate Data Models
55 pages
Nosqlmodule 1
100% (1)
Nosqlmodule 1
102 pages
DBMS Ninja Notes
No ratings yet
DBMS Ninja Notes
134 pages
What Kind of Data Can Be Mined
No ratings yet
What Kind of Data Can Be Mined
6 pages
DBMS Architecture: 1-Tier, 2-Tier & 3-Tier: What Is Database Architecture?
100% (1)
DBMS Architecture: 1-Tier, 2-Tier & 3-Tier: What Is Database Architecture?
3 pages
UKZN Map - Westville
0% (1)
UKZN Map - Westville
1 page
Data Mining Concept Description: Characterization and Comparison
No ratings yet
Data Mining Concept Description: Characterization and Comparison
14 pages
Dataming T PDF
No ratings yet
Dataming T PDF
48 pages
Distribution Model
100% (1)
Distribution Model
24 pages
DBMS Unit 1 Notes
100% (1)
DBMS Unit 1 Notes
22 pages
DBMS Notes
No ratings yet
DBMS Notes
141 pages
Batch B DWM Experiments
No ratings yet
Batch B DWM Experiments
90 pages
Software Engineering Notes (Unit-III)
No ratings yet
Software Engineering Notes (Unit-III)
21 pages
Distributed Database Systems (DDBS)
No ratings yet
Distributed Database Systems (DDBS)
30 pages
Hbase PPT PDF
No ratings yet
Hbase PPT PDF
100 pages
Big Data, Map Reduce & Hadoop: By: Surbhi Vyas (7) Varsha
No ratings yet
Big Data, Map Reduce & Hadoop: By: Surbhi Vyas (7) Varsha
40 pages
Ch#22 TRANSACTION - MANAGEMENT
No ratings yet
Ch#22 TRANSACTION - MANAGEMENT
80 pages
Unit V Graph Structures
No ratings yet
Unit V Graph Structures
39 pages
Nosql Database Systems: M.Tech. (Iind, Sem Ce/Cn)
100% (1)
Nosql Database Systems: M.Tech. (Iind, Sem Ce/Cn)
135 pages
DW DM Notes
No ratings yet
DW DM Notes
107 pages
Weighted Moving Average Formula
No ratings yet
Weighted Moving Average Formula
25 pages
Unit V Big Data Analytics
No ratings yet
Unit V Big Data Analytics
47 pages
Dbms PDF
100% (1)
Dbms PDF
4 pages
Unit V
No ratings yet
Unit V
67 pages
Module 4 Nosql
No ratings yet
Module 4 Nosql
8 pages
Huntingthreat 2
0% (1)
Huntingthreat 2
3 pages
Big Data Analytics Unit-5
No ratings yet
Big Data Analytics Unit-5
28 pages
Lesson 39 - Transcript. Build Applications With Glide - Part 2
No ratings yet
Lesson 39 - Transcript. Build Applications With Glide - Part 2
112 pages
Mapreduce and Hadoop Distributed File System
No ratings yet
Mapreduce and Hadoop Distributed File System
36 pages
DDM 5
No ratings yet
DDM 5
46 pages
Implement - Column-Family Stores
No ratings yet
Implement - Column-Family Stores
37 pages
Cp4152 Database Practice Lab Manual R 2021
No ratings yet
Cp4152 Database Practice Lab Manual R 2021
48 pages
Carel Probes and Sensors Selection and Optimal Installation Guide 2021 06 26
No ratings yet
Carel Probes and Sensors Selection and Optimal Installation Guide 2021 06 26
40 pages
Unit 2 - Knowledge Delivery
No ratings yet
Unit 2 - Knowledge Delivery
31 pages
Data Mining: Concepts and Techniques: Jiawei Han and Micheline Kamber
No ratings yet
Data Mining: Concepts and Techniques: Jiawei Han and Micheline Kamber
46 pages
Nosql - Journey Ahead!: Origin: Punch Cards To Dbms
No ratings yet
Nosql - Journey Ahead!: Origin: Punch Cards To Dbms
54 pages
4.2 NoSQL Databases UNIT-1
No ratings yet
4.2 NoSQL Databases UNIT-1
35 pages
DataWarehouseMining Complete Notes
No ratings yet
DataWarehouseMining Complete Notes
55 pages
Introduction To Data Warehouse
No ratings yet
Introduction To Data Warehouse
34 pages
Dbms Aicte Lab
No ratings yet
Dbms Aicte Lab
42 pages
WT Unit 3
No ratings yet
WT Unit 3
57 pages
Distributed Database Concepts
No ratings yet
Distributed Database Concepts
52 pages
Hadoop Unit-4
No ratings yet
Hadoop Unit-4
44 pages
NoSQL Notes
No ratings yet
NoSQL Notes
5 pages
HBase
No ratings yet
HBase
36 pages
Nosql Databases: by Amy Alexander and Tanya Christina
No ratings yet
Nosql Databases: by Amy Alexander and Tanya Christina
14 pages
DBMS Unit 2
No ratings yet
DBMS Unit 2
19 pages
Shaft Requirements
No ratings yet
Shaft Requirements
4 pages
Consistency
No ratings yet
Consistency
42 pages
Stretch Wrap
No ratings yet
Stretch Wrap
18 pages
HBase
No ratings yet
HBase
31 pages
ER Practical 7r
No ratings yet
ER Practical 7r
5 pages
Implement - Graph Databases
No ratings yet
Implement - Graph Databases
40 pages
DMWH M1
No ratings yet
DMWH M1
25 pages
Stock Watson 3U ExerciseSolutions Chapter11 Instructors
No ratings yet
Stock Watson 3U ExerciseSolutions Chapter11 Instructors
12 pages
31b Syllabus
No ratings yet
31b Syllabus
7 pages
Distributed Database Systems: January 2002
No ratings yet
Distributed Database Systems: January 2002
25 pages
Statistical Mechanics Part 1
No ratings yet
Statistical Mechanics Part 1
17 pages
Daniel 9 Exegesis
No ratings yet
Daniel 9 Exegesis
6 pages
Sample Paper Q0503
No ratings yet
Sample Paper Q0503
20 pages
A New LC MS MS Method For Quantification of Gangliosides in Human Plasma PDF
No ratings yet
A New LC MS MS Method For Quantification of Gangliosides in Human Plasma PDF
32 pages
5.1 Mining Data Streams
No ratings yet
5.1 Mining Data Streams
16 pages
File Management
No ratings yet
File Management
14 pages
SQL NoSQL NewSQL
No ratings yet
SQL NoSQL NewSQL
12 pages
Journals Price List For 2022: No. Title Journal Abbreviation Issn (Print) Issn (E-Only)
No ratings yet
Journals Price List For 2022: No. Title Journal Abbreviation Issn (Print) Issn (E-Only)
24 pages
NoSQL Systems For Big Data Management
No ratings yet
NoSQL Systems For Big Data Management
8 pages
Lab Report 1
67% (3)
Lab Report 1
4 pages
Taco HS2 Quick Start Guide
No ratings yet
Taco HS2 Quick Start Guide
30 pages
Unit 3 - Data Mining - WWW - Rgpvnotes.in PDF
No ratings yet
Unit 3 - Data Mining - WWW - Rgpvnotes.in PDF
10 pages
Modelling of Preconditioning by Blasting in Block and Panel Caving
No ratings yet
Modelling of Preconditioning by Blasting in Block and Panel Caving
19 pages
Magnetic Fields 2
No ratings yet
Magnetic Fields 2
4 pages
Undercarriage
No ratings yet
Undercarriage
8 pages
Model Test Paper Dbms
No ratings yet
Model Test Paper Dbms
14 pages
A Modified Two-Step Sequential Spin-Coating Method For Perovskite Solar Cells Using CsI Containing Organic Salts in Mixed Ethanol Methanol Solvent
No ratings yet
A Modified Two-Step Sequential Spin-Coating Method For Perovskite Solar Cells Using CsI Containing Organic Salts in Mixed Ethanol Methanol Solvent
7 pages
PGG311T - PGM216D - Paper
No ratings yet
PGG311T - PGM216D - Paper
5 pages
Acta Paediatrica
No ratings yet
Acta Paediatrica
13 pages
Cassandra: Types of Nosql Databases
No ratings yet
Cassandra: Types of Nosql Databases
6 pages
P7650A/B/U: Differential Pressure Sensors
No ratings yet
P7650A/B/U: Differential Pressure Sensors
4 pages
Oop hw1
No ratings yet
Oop hw1
7 pages
Uc 3841
No ratings yet
Uc 3841
10 pages
Ug NX
No ratings yet
Ug NX
4 pages
Advantages of Data Warehouse
No ratings yet
Advantages of Data Warehouse
2 pages
Parallel Database Systems
No ratings yet
Parallel Database Systems
17 pages
CS964 Data Warehousing and Data Mining
No ratings yet
CS964 Data Warehousing and Data Mining
1 page
Data Engineering with Scala and Spark: Build streaming and batch pipelines that process massive amounts of data using Scala
From Everand
Data Engineering with Scala and Spark: Build streaming and batch pipelines that process massive amounts of data using Scala
Eric Tome
No ratings yet
The Datadog Handbook: A Guide to Monitoring, Metrics, and Tracing
From Everand
The Datadog Handbook: A Guide to Monitoring, Metrics, and Tracing
Robert Johnson
No ratings yet
Optimizing Hadoop for MapReduce
From Everand
Optimizing Hadoop for MapReduce
Khaled Tannir
No ratings yet

More Details On Data Models

Uploaded by

More Details On Data Models

Uploaded by

More Details on Data Models

• The application can read the customer’s

You might also like