Big Data Unit II

Uploaded by

DEVA PRASATH R
Copyright
© All Rights Reserved

UNIT-II

Question 1
The Hadoop Distributed File System (HDFS) manages and supports the analysis of very large
volumes of data (petabytes and zettabytes) and works alongside NoSQL databases. Assume that
you are a database architect for a growing e-commerce company that needs to manage a
vast product catalog. The company is exploring database options and is interested in the
flexibility and scalability provided by NoSQL databases.

i) If NoSQL databases are used to manage the product catalog for the e-commerce
company, specify the characteristics that make them suitable for the given scenario.

Characteristics of NoSQL Databases for Managing Product Catalogs


1. Schema-less and Flexible Data Models: Easily store and retrieve complex, nested objects
without predefined schemas.
2. Full Index Support: Efficient querying and retrieval based on various product attributes.
3. Horizontal Scalability: Distribute data across multiple nodes, allowing seamless growth
and maintaining performance.
4. High Availability and Fault Tolerance: Ensure data redundancy and accessibility through
replication and distribution.
5. Support for Large-Scale Data Processing: Handle vast volumes of data by fragmenting
(sharding) it across nodes, which supports cloud-level expansion.
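As a rough illustration of horizontal scalability (item 3) and data fragmentation (item 5), the sketch below spreads product IDs across a fixed number of shards with a stable hash. The names `shard_for`, `distribute`, and the `sku-N` IDs are hypothetical, and real databases typically use more elaborate schemes (for example ranged or consistent hashing), so treat this as a minimal sketch rather than how any particular product does it.

```python
import hashlib


def shard_for(product_id: str, num_shards: int) -> int:
    """Map a product ID to a shard index using a stable hash."""
    digest = hashlib.sha256(product_id.encode("utf-8")).hexdigest()
    return int(digest, 16) % num_shards


def distribute(product_ids, num_shards):
    """Group product IDs by the shard that would store them."""
    shards = {index: [] for index in range(num_shards)}
    for product_id in product_ids:
        shards[shard_for(product_id, num_shards)].append(product_id)
    return shards


# Hypothetical catalog of 1000 product IDs spread over 4 shards.
catalog = [f"sku-{n}" for n in range(1000)]
placement = distribute(catalog, 4)
```

Because the hash is deterministic, the same product ID always maps to the same shard, so a lookup only needs to contact one node.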

ii) Fill in the missing portions of the following list of features of MongoDB, a
NoSQL-based distributed document data store.

Set-oriented storage that makes it easy to store object-type data

__________________________

Full index support

__________________________

Processing of data fragments in order to support expansion at the cloud level

__________________________

__________________________

ANS:

Question 2
Assume that you are tasked with designing a social networking platform that focuses on building
meaningful connections among users. Users can have various relationships such as friends,
colleagues, and the platform needs an efficient way to represent and navigate these relationships.
The value is a blob that the data store just stores, without caring or knowing what's inside; it's the
responsibility of the application to understand what was stored.

i) Assess how graph databases benefit the management of relationships in the social
networking platform, citing any two characteristics of graph databases.

Key Characteristics:

1. Efficient Relationship Handling:
○ Directly stores relationships, allowing for fast traversal and querying of
connections, such as friends of friends or community detection.
2. Flexible Schema:
○ Easily adapts to changes in the network structure without requiring extensive
schema redesigns, supporting the dynamic nature of social networks.
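
The friends-of-friends traversal mentioned above can be sketched with a plain adjacency map and a breadth-first search. The sample users and the helper names `within_hops` and `friends_of_friends` are hypothetical; a graph database would store and index the relationships itself, so this only illustrates the idea.

```python
from collections import deque

# Hypothetical sample graph: each user maps to the set of users they are connected to.
friends = {
    "alice": {"bob", "carol"},
    "bob": {"alice", "dave"},
    "carol": {"alice", "dave", "erin"},
    "dave": {"bob", "carol"},
    "erin": {"carol"},
}


def within_hops(graph, start, max_hops):
    """Return {user: hop_count} for every user reachable within max_hops of start."""
    distance = {start: 0}
    queue = deque([start])
    while queue:
        user = queue.popleft()
        if distance[user] == max_hops:
            continue  # do not expand beyond the hop limit
        for neighbour in graph.get(user, ()):
            if neighbour not in distance:
                distance[neighbour] = distance[user] + 1
                queue.append(neighbour)
    del distance[start]
    return distance


def friends_of_friends(graph, user):
    """Users exactly two hops away: friends of friends who are not already friends."""
    return {u for u, hops in within_hops(graph, user, 2).items() if hops == 2}
```

For example, `friends_of_friends(friends, "alice")` follows two relationship hops and skips users who are already direct friends.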

ii) Given that modern databases are valued for real-time analytics and flexible data
models, specify the prominent differences between SQL and NoSQL databases.

iii) Does MongoDB support the ACID properties? If so, justify your answer with
respect to its database classification.

Yes, MongoDB supports ACID properties.

Justification:

● Transactions: Operations on a single document have always been atomic. Starting from
version 4.0, MongoDB also supports multi-document transactions on replica sets
(extended to sharded clusters in version 4.2), ensuring ACID compliance within these
transactions. This makes it suitable for applications requiring complex transactions.
● Database Classification: MongoDB is classified as a NoSQL document database, a category
typically known for flexibility and scalability rather than strict transactional
guarantees. Its support for ACID properties within transactions adds robustness,
making it a versatile choice for both traditional and modern applications.
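
To make the atomicity part of ACID concrete, here is a toy in-memory sketch of an all-or-nothing multi-document update. `ToyStore`, `debit`, and `credit` are hypothetical names invented for illustration; this is not MongoDB's API, only a model of the commit-or-roll-back behaviour a transaction provides.

```python
import copy


class ToyStore:
    """In-memory stand-in for a document store (hypothetical, not MongoDB's API)."""

    def __init__(self, documents):
        self.documents = documents

    def transaction(self, operations):
        """Apply every operation or none of them (atomicity)."""
        working_copy = copy.deepcopy(self.documents)
        try:
            for operation in operations:
                operation(working_copy)
        except Exception:
            return False  # roll back: the committed state is left untouched
        self.documents = working_copy  # commit all changes at once
        return True


def debit(account_id, amount):
    """Build an operation that subtracts amount, failing on insufficient funds."""
    def apply(docs):
        if docs[account_id]["balance"] < amount:
            raise ValueError("insufficient funds")
        docs[account_id]["balance"] -= amount
    return apply


def credit(account_id, amount):
    """Build an operation that adds amount to an account document."""
    def apply(docs):
        docs[account_id]["balance"] += amount
    return apply


store = ToyStore({"a": {"balance": 100}, "b": {"balance": 0}})

# The debit fails, so the earlier credit in the same transaction is discarded too.
failed = store.transaction([credit("b", 150), debit("a", 150)])
balances_after_failure = copy.deepcopy(store.documents)

# Both operations succeed, so both documents change together.
succeeded = store.transaction([debit("a", 40), credit("b", 40)])
```

The failed transaction leaves both documents exactly as they were, which is the property that matters when two documents must change together.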

Question 3

i) Give an example of a schema-less database

One example of a schema-less database is MongoDB. MongoDB allows documents
within a collection to have different fields and structures without requiring a predefined
schema.
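
The schema-less idea can be pictured with ordinary Python dictionaries standing in for documents. The product data and the helpers `find` and `field_names` are hypothetical and only mimic the behaviour of a collection; they are not a database driver.

```python
# Hypothetical product documents: each one carries only the fields it needs.
products = [
    {"_id": 1, "name": "T-shirt", "sizes": ["S", "M", "L"], "color": "blue"},
    {"_id": 2, "name": "Laptop", "specs": {"ram_gb": 16, "cpu": "8-core"}},
    {"_id": 3, "name": "E-book", "download_url": "https://example.com/book"},
]


def find(collection, **criteria):
    """Return documents whose top-level fields equal all given criteria."""
    return [
        doc for doc in collection
        if all(doc.get(key) == value for key, value in criteria.items())
    ]


def field_names(collection):
    """Collect every field name that appears in any document."""
    names = set()
    for doc in collection:
        names.update(doc)
    return names
```

No field list is declared up front: a new product type can introduce new fields without touching the existing documents.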

ii) Specify the features of the key-value and document data models

Features of Key-Value and Document Data Models

Key-Value Data Model:

● Stores data as a collection of key-value pairs.
● Simple and efficient for basic storage and retrieval operations.
● Examples: Redis, DynamoDB.

Document Data Model:

● Stores data in flexible, self-describing documents (e.g., JSON, BSON).
● Supports nested structures and arrays.
● Facilitates complex querying and indexing.
● Examples: MongoDB, Couchbase.
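
The difference between the two models can be sketched with two toy classes. In the key-value store the value is an opaque blob that only the application can interpret, while the document store understands the structure and can query a nested field. `KeyValueStore`, `DocumentStore`, and `find_by_path` are hypothetical names for illustration, not the API of Redis, DynamoDB, MongoDB, or Couchbase.

```python
import json

_MISSING = object()  # sentinel for "this field path does not exist"


class KeyValueStore:
    """Toy key-value store: values are opaque bytes, looked up only by key."""

    def __init__(self):
        self._data = {}

    def put(self, key, value: bytes):
        self._data[key] = value

    def get(self, key):
        return self._data.get(key)


class DocumentStore:
    """Toy document store: it sees document structure, so it can query by field."""

    def __init__(self):
        self._docs = {}

    def insert(self, doc_id, document: dict):
        self._docs[doc_id] = document

    def find_by_path(self, path, value):
        """Return IDs of documents whose dotted field path equals value."""
        matches = []
        for doc_id, doc in self._docs.items():
            current = doc
            for part in path.split("."):
                if not isinstance(current, dict) or part not in current:
                    current = _MISSING
                    break
                current = current[part]
            if current is not _MISSING and current == value:
                matches.append(doc_id)
        return matches


product = {"name": "Laptop", "specs": {"ram_gb": 16}}

kv = KeyValueStore()
kv.put("sku-2", json.dumps(product).encode("utf-8"))
blob = kv.get("sku-2")  # the application must decode the blob itself

docs = DocumentStore()
docs.insert("sku-2", product)
docs.insert("sku-3", {"name": "Phone", "specs": {"ram_gb": 8}})
```

With the key-value store, finding "all products with 16 GB of RAM" would mean fetching and decoding every value in the application; the document store can answer it directly with `docs.find_by_path("specs.ram_gb", 16)`.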

iii) Why does Facebook use a graph database?

● Efficient Relationship Management: Graph databases excel at managing
complex relationships and connections, which are fundamental in social
networking platforms like Facebook, where relationships between users, posts, and
other entities are critical.

● Scalability: Facebook's graph data store, TAO, is designed to scale horizontally,
allowing Facebook to handle billions of connections and interactions efficiently.

● Real-time Querying: Graph databases provide fast traversal of relationships,
enabling real-time queries for things like friend recommendations, news feed
customization, and content delivery.
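
As a minimal sketch of how relationship traversal can drive friend recommendations, the code below ranks non-friends by the number of mutual friends. The sample graph and the function `recommend_friends` are hypothetical; this is not Facebook's algorithm or TAO's interface, only an illustration of the kind of query a graph model makes cheap.

```python
from collections import Counter

# Hypothetical sample graph: each user maps to the set of their direct friends.
connections = {
    "alice": {"bob", "carol"},
    "bob": {"alice", "carol", "dave"},
    "carol": {"alice", "bob", "dave", "erin"},
    "dave": {"bob", "carol"},
    "erin": {"carol"},
}


def recommend_friends(graph, user, limit=3):
    """Rank users who are not yet friends by how many friends they share with user."""
    mutual = Counter()
    for friend in graph.get(user, ()):
        for candidate in graph.get(friend, ()):
            if candidate != user and candidate not in graph[user]:
                mutual[candidate] += 1
    # Sort by mutual-friend count (descending), then by name for a stable order.
    ranked = sorted(mutual.items(), key=lambda item: (-item[1], item[0]))
    return ranked[:limit]
```

Each recommendation only touches the user's friends and their friends, so the cost depends on the local neighbourhood rather than on the size of the whole network.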