MCQ Questions
MCQ Questions
(Hadoop&mapreduce)
MCQ Questions:-
A) HDFS.
B) MapReduce.
C) Spark.
D) YARN.
A) Tables in a database.
B) JSON files.
C) Video files.
D) E-mail headers.
A) Hive.
B) Pig.
C) TensorFlow.
D) HDFS.
A) 32 MB.
B) 64 MB.
C) 128 MB.
D) 256 MB.
A) Splitting.
B) Mapping.
C) Reducing.
D) Combining.
A) Centralized storage.
B) Storing large files across distributed nodes.
C) Performing real-time analytics.
D) Encrypting small datasets.
A) Backup servers.
B) Data replication.
C) RAID arrays.
D) Cloud storage integration.
14. Which of the following tools in the Hadoop ecosystem supports SQL-like queries?
A) Pig.
B) Hive.
C) HDFS.
D) Spark Streaming.
16. Which of the following tools is used for large-scale data storage in Hadoop?
A) MapReduce.
B) YARN.
C) HDFS.
D) Oozie.
A) Splitting.
B) Shuffling.
C) Mapping.
D) Indexing.
A) A storage system.
B) A resource management framework.
C) A database query tool.
D) A data transformation tool.
A) Log files.
B) Relational database tables.
C) Social media posts.
D) Video streams.
26. What is the primary function of the "Reducer" in MapReduce?
A) XML.
B) CSV.
C) JSON.
D) All of the above.
A) Java.
B) Python.
C) R.
D) HTML.
29. Which of the following describes the "Shuffle and Sort" phase in MapReduce?
1. NameNode:
o Role: Manages the filesystem namespace and metadata (file directory, block locations).
o Responsibilities:
Keeps track of where blocks are stored across the cluster.
Handles client requests for file operations (read/write).
2. DataNode:
o Role: Stores the actual data blocks.
o Responsibilities:
Performs read and write operations as instructed by the NameNode.
Sends regular heartbeat signals to the NameNode to indicate it’s functional.
3. Secondary NameNode:
o Role: Maintains a backup of the NameNode's metadata and periodically updates it.
o Note: It is NOT a failover for the NameNode.
4. Blocks:
o Files are divided into smaller chunks called Blocks.
o Example: A 512 MB file is split into four 128 MB blocks, distributed across DataNodes.
Summary