The document discusses NoSQL databases and big data technologies, highlighting the explosive growth of data since 1994 and the various types of NoSQL databases including document-based, key-value stores, and graph-based stores. It covers the characteristics of big data, the differences between SQL and NoSQL databases, and introduces Hadoop and MapReduce for processing large datasets. The document also provides examples of NoSQL databases like MongoDB and DynamoDB, along with their operational details and applications.

Lecture 9.

NoSQL Databases, Big


Data Technologies
(Chapters 24 and 25)
[email protected]
Outline
• Data and Big Data
• NoSQL Databases
• Document-Based (MongoDB)
• Key-Value Stores
• Graph-based Stores (Neo4j, OrientDB)
• Break 10 min
• Big Data Technology: MapReduce and Hadoop

2(34)
Data
• The amount of data worldwide has been growing rapidly since 1994, resulting in an
explosive growth in the amount of data generated and communicated over networks.
• Applications that collect/generate information every day:
• Social media websites (LinkedIn with more than 250 million users; Facebook with
1.3 billion users, 800 million of them active every day; Twitter with ca. 980 million
users and ca. 1 billion tweets per day)
• Satellite imagery
• Communication networks (Telenor, Telia, etc.)
• Banking
• Other

3(34)
Data Examples
• Network data:
• Facebook: 500 million users
• Twitter: 300 million users
• Telecommunication data
• Transport data
• Document data:
• Web as a document repository: ca. 50 billion web pages
• Wikipedia: 4 million articles
• Archives
• Financial data:
• Banking, accounting
• Transaction data:
• Credit card companies: billions of transactions per day
• Queries in search engines (e.g., Google)
• Membership cards allow collecting information about customer preferences/needs
• Sensor data:
• Mobile sensors
• Internet of Things (IoT) sensor networks
• Climate data: thousands of stations
• Linked data:
• Subtype of network data with semantics
• Geographical data:
• Maps, geodata
• Event data:
• App log data of user interaction with an App
• Video data:
• Human movements in sport
• Image data:
• Satellite imagery
• Medical images

4(34)
Characteristics of Data
• Dependencies:
• non-dependency-oriented data (e.g., text), and
• dependency-oriented data with relationships in time or space (time series,
sequential data, spatial data)
• Data structure:
• Structured: tables (columns, rows) or CSV files, networks/graphs (nodes, edges),
objects with nested objects (JSON files)
• Unstructured: images (pixels in rows and columns), voice data, text data

5(34)
Characteristics of Big Data
The Gartner Group introduced the "V" characteristics of Big Data (originally three V's, commonly extended to five):
• Volume: refers to the size of the data stored and managed by the system. Examples:
sensors, social media, environmental recording devices, credit card readers (transactional
data), and more.
• Velocity: refers to the frequency or speed at which data is generated, stored, and
processed. For example, streaming data (sensors, telecommunication data, health
vital-sign data, stock exchanges)
• Variety: refers to the structure/type of data: event data (clickstream, social media), location
data (e.g., geospatial data, maps), images (surveillance, satellites, medical scanning),
supply chain data, sensor data, video data (movies, YouTube streaming, etc.)
• Veracity: refers to the credibility of the source and the suitability of the data for its target
audience (trust and availability)
• Value: refers to what we can do with the data (solve a problem or need, statistics,
quality)

6(34)
SQL-based Data and NoSQL-based Data
• Data suited for SQL-based databases:
• University database
• Hospital database
• Travel agency database
• Accounting database
• Banking databases
• Other…
• Data suited for NoSQL-based databases:
• Social media data (network structure + document-based structure)
• Archives (text data), images, videos (stored as files + document-based database)
• Event data or user-interaction data from an App (usually stored in JSON format, thus a
document-based database)
• Sensor data (stored in files or in time-series databases, TSDB (e.g., InfluxData))

7(34)
NoSQL Databases

• NoSQL (Not Only SQL) databases are alternative databases designed to suit particular data
and its characteristics (5 V's), and the application domain.
• NoSQL characteristics:
• Scalability: usually horizontal scalability, adding more nodes for data storage and processing
as the volume of data grows
• Availability: guarantees high availability thanks to the distributed approach. In addition, two
access techniques are used: hashing and range partitioning
• Replication: supports master-slave and master-master replication.
• Consistency: horizontal partitioning of file records is usually used to access records
concurrently. In addition, many NoSQL applications do not require serializability.
• No required schema: allows semi-structured, self-describing data (JSON objects). All constraints
must be enforced in the application program
• Less powerful query language: only a subset of an SQL-like language is supported (no JOIN operations)
• Versioning: some NoSQL databases allow storing multiple versions of a data item, with
timestamps of when each version was created.
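The two access techniques named above can be sketched in a few lines. This is a minimal illustration, not production code: real systems typically use consistent hashing and replicated partition maps; the node counts and boundary values here are made up.

```python
import hashlib

def hash_partition(key: str, n_nodes: int) -> int:
    """Hash partitioning: a hash of the key picks the node."""
    digest = hashlib.md5(key.encode()).hexdigest()
    return int(digest, 16) % n_nodes

def range_partition(key: str, boundaries: list[str]) -> int:
    """Range partitioning: node i holds keys below boundaries[i]."""
    for i, upper in enumerate(boundaries):
        if key < upper:
            return i
    return len(boundaries)  # last node takes the remaining keys

keys = ["alice", "bob", "carol", "zoe"]
print([hash_partition(k, 4) for k in keys])            # spread across nodes
print([range_partition(k, ["h", "p"]) for k in keys])  # [0, 0, 0, 2]
```

Hash partitioning spreads keys evenly but destroys key order; range partitioning keeps keys ordered within a node, which makes range scans cheap.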

8(34)
NoSQL Databases Categories
• 1. Document-based NoSQL databases
• 2. NoSQL key-value stores
• 3. Column-based or wide-column NoSQL databases
• 4. Graph-based NoSQL databases

9(34)
1. Document-based NoSQL database

• Stores data in the form of collections of similar documents/objects
• A document is self-describing data, usually stored in BSON format (Binary JSON,
Binary JavaScript Object Notation)
• Documents are accessible via their document id, or via indexes.
• Example JSON document/object:
{
  "id": this.gameID,
  "type": "playmode",
  "event": "point_selection",
  "state": {
    "game_progress": {"fields": {"Money": 10, "Joy": 50, "Health": 30}, "score": 10},
    "value": {"name": "Banana", "times": 5},
    "event_count": 4
  },
  "timestamp": 1667736467
}

• Examples of well-known databases: MongoDB, CouchDB, DocumentDB, and
others
10(34)
MongoDB

Image taken from: https://medium.com/zenofai/scaling-dynamodb-for-big-data-using-parallel-scan-1b3baa3df0d8

11(34)
MongoDB Data Model (Flexible Schema)

Model relationships in MongoDB: https://devopedia.org/data-modelling-with-mongodb


12(34)
Example MongoDB Schema (Model)
Web application, JavaScript, Mongoose library (DB-API)

An example MongoDB data model using various design patterns.


Source: Genkina 2020, 31:38
13(34)
MongoDB Operations
Using the MongoDB CLI (command-line interface)
• Create/switch database:
• use db_name
• Create collection:
• db.createCollection(name, options)
• For example: db.createCollection("project", {capped: boolean,
size: int, max: int})
• CRUD operations:
• db.collection_name.insert(<document(s)>)
• db.collection_name.remove(<condition>)
• db.collection_name.find(<condition>)
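The semantics of these CRUD operations can be sketched with a tiny in-memory collection. This is an illustrative model only, not the real MongoDB client API; the collection name and documents are made up.

```python
# A minimal in-memory sketch of document-store CRUD semantics.
class Collection:
    def __init__(self):
        self.docs = []

    def insert(self, *documents):
        self.docs.extend(documents)

    def find(self, condition=None):
        """Return documents whose fields match every key in `condition`."""
        if condition is None:
            return list(self.docs)
        return [d for d in self.docs
                if all(d.get(k) == v for k, v in condition.items())]

    def remove(self, condition):
        self.docs = [d for d in self.docs
                     if not all(d.get(k) == v for k, v in condition.items())]

projects = Collection()
projects.insert({"name": "nosql-demo", "lang": "js"},
                {"name": "hadoop-demo", "lang": "java"})
print(projects.find({"lang": "js"}))  # [{'name': 'nosql-demo', 'lang': 'js'}]
projects.remove({"name": "hadoop-demo"})
print(len(projects.docs))             # 1
```

As in MongoDB, the condition is itself a document: find matches by equality on each field, and no schema constrains what fields a document may carry.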

14(34)
MongoDB
• A MongoDB server should be installed, OR
• MongoDB Atlas cloud can be used (no need to install a MongoDB server)
• A DB-API is used to access MongoDB from an application
• For example, for web applications the JavaScript-based Mongoose
library is used

15(34)
2. Key-Value Stores
• These systems have a simple data model based on fast access by key to the value
associated with that key
• The key is a unique identifier associated with a data item (the value)
• The value can be a record, an object, a document, or an even more complex data
structure; different data types are supported (strings of bytes, arrays of bytes, tuples, JSON
objects)
• No query language
• A set of operations is available to application programmers (GET, PUT, DELETE).
• Main characteristic: every value (data item) must be associated with a unique key, and
retrieving the value by its key must be very fast.
• Usability/applicability examples: streaming data, real-time data processing and
analysis.
• Databases: Redis, Apache Kafka, Apache Cassandra, DynamoDB, and others
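The GET/PUT/DELETE interface can be captured in a few lines. A minimal in-memory sketch, assuming nothing beyond the three operations named above; real stores add persistence, replication, and key expiry, and the session key used here is invented.

```python
# Minimal in-memory key-value store exposing the three core operations.
class KVStore:
    def __init__(self):
        self._data = {}

    def put(self, key, value):
        self._data[key] = value

    def get(self, key, default=None):
        return self._data.get(key, default)

    def delete(self, key):
        self._data.pop(key, None)

store = KVStore()
store.put("session:42", {"user": "alice", "cart": ["banana"]})
print(store.get("session:42"))  # {'user': 'alice', 'cart': ['banana']}
store.delete("session:42")
print(store.get("session:42"))  # None
```

Note there is no query language: the only way to reach a value is through its key, which is exactly what makes lookups so fast.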

16(34)
DynamoDB (1)
• Provided by Amazon Web Services (AWS)
• Uses the concepts of tables, items, and attributes
• An item is a value (a collection of attributes)

17(34)
DynamoDB (2)
• A table has a name and a primary
key
• The primary key consists of two
attributes (partition key, sort
key).
• The partition key is used for hashing,
and because several items may share the
same partition key, an additional
sort key is used for ordering
records within the same partition.
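How a composite primary key organizes items can be sketched as follows: hashing on the partition key picks the partition, and the sort key orders items inside it. This is an illustrative model only, not the DynamoDB API; the table and attribute names are made up.

```python
from collections import defaultdict
from bisect import insort

class Table:
    def __init__(self):
        self.partitions = defaultdict(list)  # partition key -> sorted items

    def put(self, pk, sk, attrs):
        insort(self.partitions[pk], (sk, attrs))  # keep sort-key order

    def query(self, pk):
        """All items sharing a partition key, in sort-key order."""
        return self.partitions[pk]

orders = Table()
orders.put("customer#7", "2023-03-02", {"total": 40})
orders.put("customer#7", "2023-01-15", {"total": 25})
print(orders.query("customer#7"))
# [('2023-01-15', {'total': 25}), ('2023-03-02', {'total': 40})]
```

A query by partition key alone returns the whole partition already ordered by sort key, which is why this key design makes "all orders for one customer, by date" cheap.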

18(34)
Graph-based Databases
• A graph database represents data as a graph: a collection of
vertices (nodes) and edges.
• Nodes and edges can be labeled to indicate the types of entities and
relationships they represent
• Uses graph theory and algorithms for optimizing data search
• Has its own query language (e.g., Cypher)
• Applications: analyzing social network data, recommendations,
geospatial data, postal delivery networks
• Databases: Neo4j, OrientDB, and others
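The labeled-graph data model can be sketched with an adjacency structure and a simple traversal. An illustrative sketch only, not the Neo4j API; the node labels, relationship names, and traversal function are invented for the example.

```python
# Minimal labeled property graph with a breadth-first traversal.
from collections import deque

nodes = {  # node id -> (label, properties)
    1: ("Person", {"name": "Alice"}),
    2: ("Person", {"name": "Bob"}),
    3: ("City",   {"name": "Oslo"}),
}
edges = [  # (start node, relationship label, end node)
    (1, "KNOWS", 2),
    (2, "LIVES_IN", 3),
]

def neighbours(n):
    return [e for (s, r, e) in edges if s == n]

def reachable(start):
    """Breadth-first traversal: every node reachable from `start`."""
    seen, queue = {start}, deque([start])
    while queue:
        for nxt in neighbours(queue.popleft()):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen

print(reachable(1))  # {1, 2, 3}
```

Traversals like this follow edges directly instead of joining tables, which is the core advantage graph databases exploit for social-network and recommendation queries.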

19(34)
Neo4j
• Uses the concepts of nodes and relationships (edges)
• Keeps separate structures for the data and for the graph structure.
• Every node has a label (name) and properties (attributes)
• Relationships are edges
• Paths are used for traversal in a graph (a path has a start and an end node)
• Indexing and node identifiers: each node has a unique identifier; in
addition, the user can create indexes for collections of nodes that share a
particular label.
• https://console.neo4j.org/
20(34)
NoSQL Playgrounds
• MongoDB:
• https://mongoplayground.net/
• MongoDB, Neo4j, Cassandra:
• https://bitbucket.prodyna.com/projects/NOS/repos/nosql-playground/browse
• Kafka:
• https://kafka-docker-playground.io/#/
• https://www.conduktor.io/blog/kafka-playground-two-free-kafka-clusters-without-
operational-hassles/
• Redis:
• https://try.redis.io/
• Neo4j:
• https://neo4j.com/sandbox/
• https://console.neo4j.org/

21(34)
10 min break

22(34)
Hadoop
• Hadoop is open-source Apache software for solving problems
involving massive amounts of data and computation.
• Was created to find a fast and scalable approach for web search
engines.
• Consists of three main modules:
• MapReduce: a programming model for parallel processing of large data sets
• Hadoop Distributed File System (HDFS): for storage; provides high-
throughput access to data
• Hadoop YARN: a framework for job scheduling and cluster resource
management

23(34)
MapReduce
• Developed by Dean and Ghemawat at Google in 2004
• Fault-tolerant implementation and runtime environment
• Programming style: map and reduce tasks
• Allows programmers to analyze very large datasets
• Underlying data model assumed: key-value pairs

24(34)
MapReduce Programming Model
• The general form of the map and reduce functions:
• map(K1, V1) : List[(K2, V2)], and
• reduce(K2, List[V2]) : List[(K3, V3)]
• where map is a generic function that takes a key of type K1 and a
value of type V1 and returns a list of key-value pairs of types K2 and
V2, and
• reduce is a generic function that takes a key of type K2 and a list of
values of type V2 and returns a list of key-value pairs of type (K3, V3)
• In general, the key types K1, K2, K3, etc. are different, with the only
requirement that the output types of the map function must match
the input types of the reduce function.
25(34)
MapReduce Example
Count word frequency in a document
• Input splitting: divides the input into n fixed-size jobs or input splits.
• Mapping: in this example, the job of the mapping phase is to count the number
of occurrences of each word in its input split and prepare a list of
(word, frequency) pairs — List[(K2, V2)].
• Shuffling: consolidates (sorts/orders) the relevant records from the
mapping phase into List[(K2, [V2])].
• Reducing: in this example, performs an aggregation (sum) over the values
combined in shuffling — List[(K3, V3)].
• Final output: a single combined output.

Image taken from: https://www.guru99.com/introduction-to-mapreduce.html 26(34)


MapReduce Pseudocode
Example: count word frequency in a document
map(String key, String value):
  for each word w in value:
    EmitIntermediate(w, "1");
reduce(String key, Iterator values):
  int result = 0;
  for each v in values:
    result += ParseInt(v);
  Emit(key, AsString(result));
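The word-count pseudocode can be made runnable in plain Python: map emits (word, 1) pairs, shuffle groups them by key, and reduce sums each group. A sequential sketch of the model only; the real framework runs the phases in parallel across machines, and the sample documents here are invented.

```python
from collections import defaultdict

def map_fn(key, value):
    """Emit an intermediate (word, 1) pair for each word in the split."""
    for word in value.split():
        yield (word, 1)

def shuffle(pairs):
    """Group intermediate pairs by key: List[(K2, V2)] -> (K2, [V2])."""
    groups = defaultdict(list)
    for k, v in pairs:
        groups[k].append(v)
    return groups.items()

def reduce_fn(key, values):
    """Aggregate the grouped values for one key."""
    return (key, sum(values))

splits = {"doc1": "deer bear river", "doc2": "car car river"}
mapped = [p for k, v in splits.items() for p in map_fn(k, v)]
result = dict(reduce_fn(k, vs) for k, vs in shuffle(mapped))
print(result)  # {'deer': 1, 'bear': 1, 'river': 2, 'car': 2}
```

Note how the types line up with the general form on the previous slide: map produces List[(K2, V2)], shuffle produces (K2, List[V2]), and reduce consumes exactly that.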

27(34)
General MapReduce Architecture
• It is beneficial to have multiple splits of an
appropriate size (default 128 MB).
• One map task is created for each split.
• After mapping, a sort operation is executed so
that all tuples with the same key are sent to the
corresponding reducer.
• The key-value list is a tuple type
• The output of mapping is stored on local disk
and removed automatically after the reducer has
finished its task
• The map can run on one machine and the
reducer on another to balance the load
• Shuffling (copy and merge) is done on another
machine (where the reducer is located) to create
a list of (key, [value]) pairs, where each reducer
handles one unique key
• Map and reduce execute the user-defined
functions (e.g., sum, count, etc.)
• The reduce output is stored in HDFS

28(34)
MapReduce Runtime Environment
• The complete execution of map
and reduce tasks is controlled by two
types of entities:
• Job Tracker: acts as the master
• Multiple Task Trackers: act as
slaves, each performing part of the
job; they run on the DataNodes of the
cluster
• For every job submitted for execution in
the system, there is one JobTracker
• Overall flow of a MapReduce job:
• Job submission
• Job initialization
• Task assignment
• Task execution
• Job completion

29(34)
Hadoop Distributed File System (HDFS)
• HDFS:
• is designed to run on a cluster of commodity hardware
• Commodity hardware involves using large numbers of already-available
computing components for parallel computing, to get the greatest amount of useful
computation at low cost.
• provides high-throughput access to large datasets
• is built in Java and requires Java to be installed in order to use this
storage system.
• stores file system metadata and application data separately on different
servers (NameNode and DataNodes)
• All servers are fully connected and communicate with each other using a TCP-
based protocol
• Replication is done across DataNodes

30(34)
HDFS architecture
• Master-slave architecture
• NameNode and DataNodes are software designed to run
on commodity machines (with GNU/Linux OS).
• The NameNode runs on a dedicated machine
• Every other machine in the cluster runs one instance of the
DataNode software.
• User data never flows through the NameNode.
• Files are broken into block-size chunks called data
blocks
• A rack is a collocation of 30-40 DataNodes
• The NameNode stores the filesystem metadata, such as file
names, information about the blocks of a file, block
locations, permissions, etc., and manages the
DataNodes
• DataNodes store the application data and serve the
clients' read/write requests based on the NameNode's
instructions.
• A cluster can have thousands of DataNodes, and tens of
thousands of HDFS clients simultaneously connected
• Each block is replicated (default: three copies) to a
number of nodes in the cluster.
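The block splitting and replication described above can be sketched numerically. An illustrative sketch only: real HDFS placement is rack-aware (one replica local, two on a remote rack), and the DataNode names and round-robin policy here are invented.

```python
BLOCK_SIZE = 128 * 1024 * 1024  # default block size: 128 MB
REPLICATION = 3                 # default replication factor

def blocks_needed(file_size: int) -> int:
    """Number of data blocks a file of `file_size` bytes is broken into."""
    return max(1, -(-file_size // BLOCK_SIZE))  # ceiling division

def place_replicas(block_id: int, datanodes: list[str]) -> list[str]:
    """Pick REPLICATION distinct DataNodes for one block (round robin)."""
    n = len(datanodes)
    return [datanodes[(block_id + i) % n] for i in range(REPLICATION)]

nodes = ["dn1", "dn2", "dn3", "dn4"]
n_blocks = blocks_needed(300 * 1024 * 1024)  # a 300 MB file
print(n_blocks)  # 3 blocks: 128 MB + 128 MB + 44 MB
for b in range(n_blocks):
    print(b, place_replicas(b, nodes))
```

With three replicas per block, losing any single DataNode never loses data, and the NameNode only has to track which nodes hold which blocks.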

31(34)
File I/O operations in HDFS
• HDFS supports a traditional hierarchical file organization. The
NameNode maintains the file system namespace; any change to the
file system namespace or its properties is recorded by the
NameNode.
• Provides a single-writer, multiple-reader model
• Files cannot be updated, only appended to or removed
• A file consists of blocks
• Block placement:
• Nodes of a Hadoop cluster are typically spread across many racks
• Nodes on a rack share a switch

32(34)
The Hadoop Ecosystem

33(34)
References
• https://hadoop.apache.org/docs/stable/hadoop-project-dist/hadoop-
hdfs/HdfsDesign.html

34(34)
