Nptel Assignment 1
Nptel Assignment 1
Ans- Big Data is characterized by the "Three Vs": variety (different types of data),
volume (large amounts of data), and velocity (speed at which data is generated
and processed). This definition captures the essence of Big Data, distinguishing
it from merely large or structured datasets.
2. Which technology is commonly used for processing and analyzing Big Data in
distributed computing environments?
a. MySQL
b. Hadoop
c. Excel
d. SQLite
3. What is a primary limitation of traditional RDBMS when dealing with Big Data?
a. They cannot handle structured data
b. They are too expensive to implement
c. They struggle with scaling to manage very large datasets
d. They are not capable of performing complex queries
5. Which Hadoop ecosystem tool is primarily used for querying and analyzing
large datasets stored in Hadoop's distributed storage?
a. HBase
b. Hive
c. Kafka
d. Sqoop
Ans- Hive is a data warehousing and SQL-like query language tool used to query
and analyze large datasets in Hadoop. HBase is a NoSQL database, Kafka is a
messaging system, and Sqoop is used for data transfer between Hadoop and
relational databases.