Big Data Unit II
Big Data Unit II
Question 1
The Hadoop Distributed File System (HDFS) manages and supports analysis of very large
volumes; petabytes and zetabytes of data and deal with NOSQL databases.Assume that
you are a database architect for a growing e-commerce company that needs to manage
vast product catalog. The company is exploring database options and is interested in the
flexibility and scalability provided by NoSQL database.
i) If the NoSQL databases provide sources in managing the product catalog for the
e-commerce company, Specify the specific characteristics that make them suitable for
the given scenario.
__________________________
__________________________
Data fragments processing order to support the expansion of the cloud level
__________________________
__________________________
Fill out the missing portion on MongoDB - NoSQL-based distributed document data storage.
ANS:
Question 2
Assume that you are tasked with designing a social networking platform that focuses on building
meaningful connections among users. Users can have various relationships such as friends,
colleagues and the platform needs an efficient way to represent and navigate these relationship.
The value is a blob that the data store just stores, without caring or knowing what's inside; it's the
responsibility of the application to understand what was stored.
Key Characteristics:
iii) Does the MongoDB supports the ACID properties? If so, provide your
justification with respect to database classification.
Justification:
ii) Specify the features of key value and document data models
● Scalability: Graph databases like TAO are designed to scale horizontally, allowing
Facebook to handle billions of connections and interactions efficiently.