
HOMEWORK-2(22-11-2022)

Q1. What are HDFS and YARN?


HDFS is the basic storage component of Hadoop; it stands for Hadoop Distributed File System. YARN is the acronym for Yet Another Resource Negotiator; in MapReduce version 1, scalability became a bottleneck once a cluster grew beyond 4,000 nodes, and YARN was introduced to remove that limitation.
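
As a minimal sketch of how a client reaches HDFS through the Hadoop Java API (hdfs://namenode:9000 is a hypothetical address; substitute your cluster's fs.defaultFS):

import java.net.URI;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;

public class HdfsConnect {
    public static void main(String[] args) throws Exception {
        // Hypothetical NameNode address; replace with your cluster's fs.defaultFS.
        Configuration conf = new Configuration();
        try (FileSystem fs = FileSystem.get(URI.create("hdfs://namenode:9000"), conf)) {
            System.out.println("Connected to " + fs.getUri());
        }
    }
}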

Q2. What are the various Hadoop daemons and their roles in a Hadoop cluster?
The NameNode is the master daemon: it maintains and manages the DataNodes, records metadata, and receives heartbeats and block reports from the DataNodes. The DataNode is the slave daemon: it stores the actual data and serves read and write requests from clients.
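
This master/slave split is visible from a client: because the NameNode learns about every DataNode through heartbeats and block reports, it can serve a live-node report on request. A minimal sketch using the Hadoop Java API (exact class locations vary slightly between Hadoop versions):

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.hdfs.DistributedFileSystem;
import org.apache.hadoop.hdfs.protocol.DatanodeInfo;
import org.apache.hadoop.hdfs.protocol.HdfsConstants.DatanodeReportType;

public class ListDataNodes {
    public static void main(String[] args) throws Exception {
        // Assumes fs.defaultFS in the classpath configuration points at the NameNode.
        FileSystem fs = FileSystem.get(new Configuration());

        // The NameNode tracks every DataNode via heartbeats/block reports,
        // so a client can ask it for the list of live slave daemons.
        if (fs instanceof DistributedFileSystem) {
            DistributedFileSystem dfs = (DistributedFileSystem) fs;
            for (DatanodeInfo dn : dfs.getDataNodeStats(DatanodeReportType.LIVE)) {
                System.out.println(dn.getHostName() + " capacity=" + dn.getCapacity());
            }
        }
    }
}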

Q3. Why does one remove or add nodes in a Hadoop cluster frequently?
A striking feature of the Hadoop framework is the ease with which it scales in step with rapid growth in data volume. For this reason, one of the most common tasks of a Hadoop administrator is to commission (add) and decommission (remove) DataNodes in a Hadoop cluster, as sketched below.
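
Commissioning and decommissioning are driven by the include/exclude files named by the dfs.hosts and dfs.hosts.exclude properties in hdfs-site.xml; after editing those files, the administrator tells the NameNode to re-read them, usually with the hdfs dfsadmin -refreshNodes command. A hedged sketch of issuing the same refresh programmatically:

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hdfs.tools.DFSAdmin;
import org.apache.hadoop.util.ToolRunner;

public class RefreshNodes {
    public static void main(String[] args) throws Exception {
        // After adding/removing hostnames in the files referenced by
        // dfs.hosts / dfs.hosts.exclude, ask the NameNode to re-read them.
        Configuration conf = new Configuration();
        int rc = ToolRunner.run(new DFSAdmin(conf), new String[] {"-refreshNodes"});
        System.exit(rc);
    }
}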

Q4. What happens when two clients try to access the same file in the HDFS?
HDFS provides support only for exclusive writes, so when one client is already writing a file, another client cannot open the same file in write mode.
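
A sketch of this behavior with the Hadoop Java API (single JVM for brevity; in a real cluster the two writers would be separate clients, and /tmp/demo.txt is a hypothetical path):

import java.io.IOException;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataOutputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class ExclusiveWriteDemo {
    public static void main(String[] args) throws Exception {
        FileSystem fs = FileSystem.get(new Configuration());
        Path p = new Path("/tmp/demo.txt"); // hypothetical path

        // The first create() acquires the write lease on the file.
        FSDataOutputStream writer = fs.create(p);

        // While that lease is held, a second create() on the same path is
        // rejected by the NameNode (an AlreadyBeingCreatedException surfaces
        // as the IOException caught below).
        try {
            fs.create(p).close();
        } catch (IOException e) {
            System.out.println("Second writer rejected: " + e.getMessage());
        }

        writer.close();
    }
}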

Q5. How does Name Node tackle Data Node failures?


Data blocks on the failed DataNode are replicated onto other DataNodes according to the replication factor specified in the hdfs-site.xml file. Once the failed DataNode comes back, the NameNode manages the replication factor again, removing the now-surplus replicas. This is how the NameNode handles DataNode failure.
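
That same replication factor can be inspected and adjusted per file from the Java API; a minimal sketch (assumes /tmp/demo.txt, a hypothetical path, already exists):

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class ReplicationCheck {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        // Mirrors the dfs.replication value from hdfs-site.xml (HDFS default: 3).
        System.out.println("Configured factor: " + conf.getInt("dfs.replication", 3));

        FileSystem fs = FileSystem.get(conf);
        Path p = new Path("/tmp/demo.txt"); // hypothetical, assumed to exist
        System.out.println("Current factor: " + fs.getFileStatus(p).getReplication());

        // Raising or lowering the factor makes the NameNode re-replicate
        // or delete block copies until the new target is met.
        fs.setReplication(p, (short) 2);
    }
}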

Q6. What will you do when Name Node is down?


When the NameNode goes down, the file system goes offline. There is an optional Secondary NameNode that can be hosted on a separate machine, but it only creates checkpoints of the namespace by merging the edits file into the fsimage file; it does not provide any real redundancy. The NameNode must be restarted, or restored on new hardware from the latest checkpoint, before the cluster is usable again.

Q7. How is HDFS fault tolerant?


HDFS is highly fault tolerant. It uses replication to handle faults: client data is replicated multiple times (the default replication factor is 3) on different DataNodes in the HDFS cluster, so if any DataNode goes down, the data can still be accessed from the other DataNodes, as in the sketch below.
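
A minimal sketch of requesting that factor explicitly when writing a file (assumes a reachable cluster; /tmp/replicated.txt is a hypothetical path):

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataOutputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class ReplicatedWrite {
    public static void main(String[] args) throws Exception {
        FileSystem fs = FileSystem.get(new Configuration());
        // Ask for 3 replicas at create time (the HDFS default). Every block
        // of this file is stored on 3 different DataNodes, so losing any
        // single DataNode still leaves 2 readable copies.
        try (FSDataOutputStream out = fs.create(new Path("/tmp/replicated.txt"), (short) 3)) {
            out.writeUTF("survives a DataNode failure");
        }
    }
}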
Q8. Why do we use HDFS for applications having large data sets and not when
there are a lot of small files?
HDFS is more efficient for large data sets maintained in a single file than for the same data stored as small chunks across many files. Because the NameNode stores the file system's metadata in RAM, the amount of memory limits the number of files an HDFS file system can hold, so a very large number of small files exhausts NameNode memory long before the cluster's disks fill up.
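
A back-of-the-envelope sketch of the effect, using the common rule of thumb of roughly 150 bytes of NameNode heap per file, directory, or block object (an assumed figure; the exact cost varies by Hadoop version):

public class NameNodeMemoryEstimate {
    public static void main(String[] args) {
        // Rule-of-thumb heap cost per file/directory/block object (assumption).
        final long BYTES_PER_OBJECT = 150;

        // Scenario A (hypothetical): ~1 TB as one million 1 MB files
        // -> 2 million objects (one file entry plus one block each).
        long smallFiles = 1_000_000L * 2 * BYTES_PER_OBJECT;

        // Scenario B: the same ~1 TB as eight 128 GB files of 128 MB blocks
        // -> 8 file entries + 8 * 1024 = 8192 blocks.
        long largeFiles = (8 + 8192) * BYTES_PER_OBJECT;

        System.out.printf("small files: ~%d MB of NameNode heap%n", smallFiles / (1024 * 1024));
        System.out.printf("large files: ~%d KB of NameNode heap%n", largeFiles / 1024);
    }
}

The same volume of data costs about 286 MB of NameNode heap as small files versus about 1.2 MB as large files, which is why HDFS favors few large files.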

Q9. How do you define “block” in HDFS? What is the default block size in
Hadoop 1 and in Hadoop 2? Can it be changed?
Each file in HDFS is stored as one or more "blocks". The default block size is 64 MB in Hadoop 1 and 128 MB in Hadoop 2. Yes, it can be changed, via the dfs.blocksize property (dfs.block.size in Hadoop 1) in hdfs-site.xml or per file at creation time.
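
A minimal sketch of changing and reading the block size from the Java API (setting the property in code is equivalent to setting it in hdfs-site.xml):

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class BlockSizeDemo {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        // Equivalent to setting dfs.blocksize in hdfs-site.xml;
        // affects files created from now on, not existing ones.
        conf.setLong("dfs.blocksize", 256L * 1024 * 1024); // 256 MB

        FileSystem fs = FileSystem.get(conf);
        System.out.println("Default block size: " + fs.getDefaultBlockSize(new Path("/")));
    }
}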
