Intro-Databases For Big Data

Uploaded by

Xenos Playground aka Boxman Studios

DATS310 d

Databases for Big Data

DR. RICHA SHARMA

COMMONWEALTH UNIVERSITY

Introduction
• Architecture for databases:
  - Focuses on the storage and organization of information so that data can be easily accessed and modified (insert, update, and delete operations).
• Database design and application development depend heavily on the database architecture.
• The architectural design of databases varies, just as network topologies vary.
• Knowing the architecture helps identify which database design is best suited to the problem at hand, i.e. the application to be developed.
Tools/Technologies for Big Data
• A few examples:
  - Apache Hadoop, Spark, Kafka, Hive, Storm
  - MongoDB and CouchDB
  - Redis, Cassandra and Neo4j
  - Druid and Google BigQuery
  - AWS DynamoDB
  - Tableau

Questions to explore
• Type of database: does the problem at hand require a relational database, a key-value database, a columnar database, a document-oriented database, or a graph database?
• Nature of the problem and usage of the database: does the problem require flexibility, or does it require parallel processing?
• Communication interface of the database: are we going to interact with the database through an interactive command-line interface, or through an application that requires database connectivity and a programming-language interface?

Questions to explore
• Unique characteristics of the database: any database will support writing data and reading it back again, but what makes this one unique? Some allow querying on arbitrary fields; some provide indexing for rapid lookup; some support ad hoc queries, while others require queries to be planned in advance.
• Performance: how does this database function, and at what cost? Does it support replication? Is this database tuned for reading, writing, or some other operation?
• Scalability: scalability is closely related to performance. The point to explore is whether the database is geared more towards horizontal scaling (MongoDB, HBase, DynamoDB), traditional vertical scaling (Postgres, Neo4j, Redis), or something in between.
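To make the horizontal-scaling question more concrete, here is a minimal sketch of hash-based partitioning across several nodes. It assumes a hypothetical cluster of three in-memory "nodes" (plain dictionaries); the names `ShardedStore` and `shard_for` are illustrative only and are not the API of any database named above. Real systems generally use more elaborate schemes for placement and rebalancing.

```python
import hashlib


class ShardedStore:
    """Toy horizontally scaled store: each key lives on exactly one node."""

    def __init__(self, num_nodes):
        # Each "node" is a dict here; in practice it would be a separate server.
        self.nodes = [{} for _ in range(num_nodes)]

    def shard_for(self, key):
        # Use a stable hash so the same key always maps to the same node
        # (the built-in hash() of str is randomized per process).
        digest = hashlib.md5(key.encode("utf-8")).hexdigest()
        return int(digest, 16) % len(self.nodes)

    def put(self, key, value):
        self.nodes[self.shard_for(key)][key] = value

    def get(self, key):
        return self.nodes[self.shard_for(key)].get(key)


# Hypothetical usage: four records spread over three nodes.
store = ShardedStore(num_nodes=3)
for user in ["alice", "bob", "carol", "dave"]:
    store.put(user, {"name": user})
```

Adding capacity in this model means adding nodes, whereas vertical scaling would mean making a single node bigger.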
RDBMS vs Big Databases

Key-Value Pair Database
• The simplest database model: data is stored as key-value (KV) pairs, just like a hash table.
• Some KV implementations provide a means of iterating through the keys, but not all do.
• A file system can be considered a key-value store, with the file path as the key and the file contents as the value.
• Since this model does not require complex data structures for storage, it can be incredibly performant in a number of scenarios, but it generally will not help when we have complex query and aggregation requirements.
• Examples: Redis, DynamoDB, Voldemort, Riak, etc.
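The two views of a key-value store described above can be sketched in a few lines. This is only an illustration under stated assumptions: `DictKV` and `FileKV` are hypothetical class names, the first wraps a Python dictionary (a hash table), and the second treats a temporary directory as the store, with the relative file path as the key and the file contents as the value.

```python
import tempfile
from pathlib import Path


class DictKV:
    """In-memory key-value store backed by a hash table (a Python dict)."""

    def __init__(self):
        self._data = {}

    def put(self, key, value):
        self._data[key] = value

    def get(self, key, default=None):
        return self._data.get(key, default)

    def delete(self, key):
        self._data.pop(key, None)

    def keys(self):
        # Key iteration is offered here, but not every KV store provides it.
        return iter(self._data)


class FileKV:
    """File system viewed as a KV store: path is the key, contents the value."""

    def __init__(self, root):
        self.root = Path(root)

    def put(self, key, value):
        path = self.root / key
        path.parent.mkdir(parents=True, exist_ok=True)
        path.write_bytes(value)

    def get(self, key, default=None):
        path = self.root / key
        return path.read_bytes() if path.is_file() else default


# Hypothetical usage of both stores.
mem = DictKV()
mem.put("user:1", "Ada")

with tempfile.TemporaryDirectory() as tmp:
    fs = FileKV(tmp)
    fs.put("notes/todo.txt", b"buy milk")
    fs_value = fs.get("notes/todo.txt")
```

Neither class offers anything beyond lookup by exact key, which is why aggregation or multi-field queries have to be built on top by the application.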

Columnar Database
• Columnar, or column-oriented, databases are so named because they store the data from a given column (in the two-dimensional table sense) together, as opposed to row-oriented databases (RDBMS), which keep each row's values together.
• These databases make adding columns to a table quite inexpensive, and this is done on a row-by-row basis.
• Each row can have a different set of columns, or none at all, allowing tables to remain sparse without incurring a storage cost for null values.
• With respect to structure, columnar is about midway between relational and key-value.
• Examples: HBase, Cassandra, etc.
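The difference between the two layouts, and the way sparse rows avoid storing nulls, can be illustrated with plain Python dictionaries. This is a simplified sketch, not how HBase or Cassandra lay out data on disk; the sample rows and the helper `to_columns` are made up for illustration.

```python
# Row-oriented view: each record keeps all of its own values together.
rows = [
    {"id": 1, "name": "Ada", "city": "London"},
    {"id": 2, "name": "Linus"},  # no "city": nothing is stored for it
    {"id": 3, "name": "Grace", "email": "grace@example.com"},
]


def to_columns(rows):
    """Regroup the same data by column: column name -> {row id: value}."""
    columns = {}
    for row in rows:
        row_id = row["id"]
        for col, value in row.items():
            if col == "id":
                continue
            columns.setdefault(col, {})[row_id] = value
    return columns


columns = to_columns(rows)

# Reading one column touches only that column's values.
names = list(columns["name"].values())

# Adding a new column for a single row does not affect any other row.
columns.setdefault("phone", {})[2] = "555-0100"
```

Rows 2 and 3 simply have no entry under `city`, so the table stays sparse without placeholder nulls.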
Document Database
• Meant to store documents, where a document is like a hash with a unique ID field and values that may be of a variety of types, including more hashes.
• Documents can contain nested structures, so they exhibit a high degree of flexibility, allowing for variable domains.
• The system imposes few restrictions on incoming data, as long as it meets the basic requirement of being expressible as a document.
• Different document databases take different approaches with respect to indexing, ad hoc querying, replication, consistency, and other design decisions.
• Examples: MongoDB, CouchDB, etc.
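As a rough illustration of the document model, the sketch below stores nested dictionaries under a unique `_id` and supports a simple ad hoc query on a dotted field path. `DocumentStore`, `find`, and the dotted-path convention are assumptions made for this example; they mimic the general idea and are not the MongoDB or CouchDB API.

```python
import uuid

_MISSING = object()  # sentinel for "this path does not exist in the document"


def _lookup(doc, path):
    """Follow a dotted path such as 'address.city' through nested dicts."""
    current = doc
    for part in path.split("."):
        if not isinstance(current, dict) or part not in current:
            return _MISSING
        current = current[part]
    return current


class DocumentStore:
    """Toy document store: documents are dicts keyed by a unique _id."""

    def __init__(self):
        self._docs = {}

    def insert(self, doc):
        # Generate an ID when the caller does not supply one.
        doc_id = doc.get("_id") or uuid.uuid4().hex
        self._docs[doc_id] = {**doc, "_id": doc_id}
        return doc_id

    def get(self, doc_id):
        return self._docs.get(doc_id)

    def find(self, path, value):
        """Ad hoc query: all documents whose field at `path` equals `value`."""
        return [d for d in self._docs.values() if _lookup(d, path) == value]


# Hypothetical usage: two documents with different shapes in the same store.
store = DocumentStore()
store.insert({"_id": "u1", "name": "Ada", "address": {"city": "London"}})
store.insert({"_id": "u2", "name": "Linus", "tags": ["kernel", "git"]})
londoners = store.find("address.city", "London")
```

The two documents share no schema beyond `_id`, which is the flexibility the slide refers to; the query scans every document because this sketch has no index.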
Graph Database
• One of the less commonly used database styles, but graph databases excel at working with highly interconnected data.
• A graph database consists of nodes and relationships between nodes.
• Both nodes and relationships can have properties (key-value pairs) that store data.
• The real strength of graph databases is traversing through the nodes by following relationships.
• Example: Neo4j, etc.
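The node, relationship, and traversal ideas above can be sketched with an adjacency list and a breadth-first search. This is a minimal in-memory illustration; the `Graph` class, the `FRIEND` relationship type, and the sample people are hypothetical and do not reflect Neo4j's API or query language.

```python
from collections import deque


class Graph:
    """Toy property graph: nodes and relationships both carry properties."""

    def __init__(self):
        self.nodes = {}  # node id -> properties
        self.edges = {}  # node id -> list of (relationship type, target id, properties)

    def add_node(self, node_id, **props):
        self.nodes[node_id] = props
        self.edges.setdefault(node_id, [])

    def relate(self, source, rel_type, target, **props):
        self.edges[source].append((rel_type, target, props))

    def traverse(self, start, rel_type, max_depth):
        """Breadth-first: ids reachable from start via rel_type within max_depth hops."""
        seen = {start}
        queue = deque([(start, 0)])
        found = []
        while queue:
            node, depth = queue.popleft()
            if depth == max_depth:
                continue  # do not expand beyond the hop limit
            for kind, target, _props in self.edges[node]:
                if kind == rel_type and target not in seen:
                    seen.add(target)
                    found.append(target)
                    queue.append((target, depth + 1))
        return found


# Hypothetical usage: a small chain of friendships.
g = Graph()
for name in ["alice", "bob", "carol", "dave"]:
    g.add_node(name, label="Person")
g.relate("alice", "FRIEND", "bob", since=2019)
g.relate("bob", "FRIEND", "carol", since=2021)
g.relate("carol", "FRIEND", "dave", since=2022)

friends_of_friends = g.traverse("alice", "FRIEND", max_depth=2)
```

Answering "who is within two hops of alice" is a direct walk over relationships here, whereas a relational design would typically need a self-join per hop.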
