
Lecture 4

(Hadoop & MapReduce)
MCQ Questions:

1. Which of the following best describes Hadoop?

 A) A relational database system.
 B) A distributed framework for storing and processing large datasets.
 C) A cloud storage service.
 D) A SQL-based querying platform.

2. What is the main function of the "NameNode" in HDFS?

 A) To store data blocks.
 B) To manage metadata and the directory structure.
 C) To process MapReduce jobs.
 D) To replicate data across DataNodes.

3. In Hadoop, which component is responsible for processing data in parallel?

 A) HDFS.
 B) MapReduce.
 C) Spark.
 D) YARN.

4. What is an example of unstructured data that Hadoop can process?

 A) Tables in a database.
 B) JSON files.
 C) Video files.
 D) E-mail headers.

5. Which of the following is NOT part of the Hadoop ecosystem?

 A) Hive.
 B) Pig.
 C) TensorFlow.
 D) HDFS.

6. What does the "Map" phase in MapReduce do?

 A) It combines intermediate results.
 B) It processes input data and transforms it into key-value pairs.
 C) It merges data from different nodes.
 D) It reduces the size of data stored in HDFS.

7. Why is MapReduce scalable?

 A) It uses expensive hardware.
 B) It supports SQL queries.
 C) It divides tasks into smaller jobs that can run on multiple nodes.
 D) It relies on in-memory processing.

8. Which of the following is NOT a use case for MapReduce?

 A) Word count in large text files.
 B) Data sorting.
 C) Relational database updates.
 D) Log analysis.

9. What is the default block size in HDFS?

 A) 32 MB.
 B) 64 MB.
 C) 128 MB.
 D) 256 MB.

10. Which phase of MapReduce is responsible for aggregating data?

 A) Splitting.
 B) Mapping.
 C) Reducing.
 D) Combining.

11. Which of the following is a benefit of HDFS?

 A) Centralized storage.
 B) Storing large files across distributed nodes.
 C) Performing real-time analytics.
 D) Encrypting small datasets.

12. What does Hadoop use for fault tolerance?

 A) Backup servers.
 B) Data replication.
 C) RAID arrays.
 D) Cloud storage integration.

13. In MapReduce, what is a Combiner?

 A) A required step for reducing data.
 B) A local reducer that processes intermediate data on the same node.
 C) A function to map data into key-value pairs.
 D) A secondary task for YARN.

14. Which of the following tools in the Hadoop ecosystem supports SQL-like queries?

 A) Pig.
 B) Hive.
 C) HDFS.
 D) Spark Streaming.

15. What is the purpose of the JobTracker in Hadoop 1.x?

 A) Managing distributed data.
 B) Tracking metadata.
 C) Coordinating MapReduce jobs across nodes.
 D) Processing SQL queries.

16. Which of the following tools is used for large-scale data storage in Hadoop?

 A) MapReduce.
 B) YARN.
 C) HDFS.
 D) Oozie.

17. Which of the following is true about Pig?

 A) It is used for real-time streaming.
 B) It provides a high-level scripting language for data analysis.
 C) It processes relational database queries.
 D) It stores data on HDFS.

18. What is a DataNode in Hadoop?

 A) A node that manages metadata.
 B) A node that processes MapReduce jobs.
 C) A node that stores blocks of data.
 D) A node that tracks task execution.

19. What is a key feature of HDFS that ensures reliability?

 A) Data replication across multiple nodes.
 B) Encryption of stored data.
 C) Automated SQL query optimization.
 D) In-memory data processing.

20. Which of the following best describes Spark compared to MapReduce?

 A) It is slower but simpler.
 B) It processes data in memory, making it faster.
 C) It supports only unstructured data.
 D) It does not integrate with HDFS.

21. What is the role of the ResourceManager in Hadoop 2.x (YARN)?

 A) Managing storage blocks.
 B) Scheduling resources for applications.
 C) Tracking job progress.
 D) Replicating data across nodes.

22. Which of the following is NOT a phase in MapReduce?

 A) Splitting.
 B) Shuffling.
 C) Mapping.
 D) Indexing.

23. What is "YARN" in Hadoop?

 A) A storage system.
 B) A resource management framework.
 C) A database query tool.
 D) A data transformation tool.

24. In Hadoop, what is a Block Report?

 A) A list of corrupted blocks.
 B) Metadata sent by a DataNode to the NameNode.
 C) A summary of HDFS storage usage.
 D) A report detailing completed MapReduce jobs.

25. What is an example of structured data?

 A) Log files.
 B) Relational database tables.
 C) Social media posts.
 D) Video streams.

26. What is the primary function of the "Reducer" in MapReduce?

 A) Transforming input data into key-value pairs.
 B) Aggregating intermediate results from the Mappers.
 C) Storing data on HDFS.
 D) Dividing tasks into smaller jobs.

27. What is a typical file format supported by Hadoop?

 A) XML.
 B) CSV.
 C) JSON.
 D) All of the above.

28. Which programming language is NOT commonly used with Hadoop?

 A) Java.
 B) Python.
 C) R.
 D) HTML.

29. Which of the following describes the "Shuffle and Sort" phase in MapReduce?

 A) Splitting data into smaller chunks.
 B) Sorting and grouping intermediate results by key.
 C) Transforming key-value pairs into final results.
 D) Writing data to HDFS.

30. Which of the following is a real-world use case for MapReduce?

 A) Banking transactions processing.
 B) Search engine indexing.
 C) Real-time weather analysis.
 D) Video game rendering.
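Several of the questions above (6, 10, 26, 29) revolve around the classic word-count flow: map input into key-value pairs, shuffle and sort the intermediate pairs by key, then reduce to aggregate. A minimal sketch in plain Python that simulates the three phases locally (an illustration only, not an actual Hadoop job):

```python
from collections import defaultdict

def map_phase(lines):
    # Map: emit a (word, 1) pair for every word in the input
    for line in lines:
        for word in line.split():
            yield (word.lower(), 1)

def shuffle_and_sort(pairs):
    # Shuffle and Sort: group intermediate values by key, sorted by key
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return sorted(grouped.items())

def reduce_phase(grouped):
    # Reduce: aggregate the list of values for each key
    return {key: sum(values) for key, values in grouped}

lines = ["the quick brown fox", "the lazy dog"]
counts = reduce_phase(shuffle_and_sort(map_phase(lines)))
print(counts["the"])  # 2
```

On a real cluster, each phase runs in parallel across nodes; a Combiner would apply the same summing logic locally on each mapper's output before the shuffle, cutting network traffic.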
Applications of NLP

o Sentiment Analysis: determining whether a text is positive or negative.
o Information extraction and text classification using NLP and Machine Learning.
o Text Classification.
o Named Entity Recognition.
o Recommendation systems, like the ones Netflix and Amazon use to suggest movies or products.
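Sentiment analysis at its simplest can be sketched as a cue-word lookup (a toy illustration only; the word lists below are invented for the example, and real systems use trained models):

```python
# Hypothetical cue-word lists for the toy example
POSITIVE = {"good", "great", "excellent", "love"}
NEGATIVE = {"bad", "terrible", "awful", "hate"}

def classify_sentiment(text):
    # Count positive vs. negative cue words and compare the totals
    words = text.lower().split()
    score = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"

print(classify_sentiment("I love this great movie"))  # positive
```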

3. Key Components of HDFS

1. NameNode:
o Role: Manages the filesystem namespace and metadata (file directory, block locations).
o Responsibilities:
 Keeps track of where blocks are stored across the cluster.
 Handles client requests for file operations (read/write).
2. DataNode:
o Role: Stores the actual data blocks.
o Responsibilities:
 Performs read and write operations as instructed by the NameNode.
 Sends regular heartbeat signals to the NameNode to indicate it’s functional.
3. Secondary NameNode:
o Role: Periodically merges the NameNode's edit log into its filesystem image (checkpointing), keeping a copy of the metadata.
o Note: It is NOT a failover for the NameNode.
4. Blocks:
o Files are divided into smaller chunks called Blocks.
o Example: A 512 MB file is split into four 128 MB blocks, distributed across DataNodes.
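The block arithmetic in the example above can be checked directly (a sketch assuming the default 128 MB block size; the last block of a file may be smaller):

```python
import math

BLOCK_SIZE_MB = 128  # HDFS default block size

def num_blocks(file_size_mb):
    # A file is split into ceil(size / block_size) blocks
    return math.ceil(file_size_mb / BLOCK_SIZE_MB)

print(num_blocks(512))  # 4 blocks, matching the example above
print(num_blocks(200))  # 2 blocks: one 128 MB, one 72 MB
```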

Summary

 HDFS is a distributed file system for storing large datasets.
 Key components: NameNode (manages metadata), DataNode (stores data), and Blocks (small parts of a file distributed across nodes).