
Introduction

The Hadoop Distributed File System (HDFS) is a distributed file system designed to run on commodity hardware. It has many similarities with existing distributed file systems. However, the differences from other distributed file systems are significant. HDFS is highly fault-tolerant and is designed to be deployed on low-cost hardware. HDFS provides high throughput access to application data and is suitable for applications that have large data sets. HDFS relaxes a few POSIX requirements to enable streaming access to file system data. HDFS was originally built as infrastructure for the Apache Nutch web search engine project. HDFS is now an Apache Hadoop subproject. The project URL is http://hadoop.apache.org/hdfs/.

NameNode and DataNodes


HDFS has a master/slave architecture. An HDFS cluster consists
of a single NameNode, a master server that manages the file
system namespace and regulates access to files by clients. In
addition, there are a number of DataNodes, usually one per node
in the cluster, which manage storage attached to the nodes that
they run on. HDFS exposes a file system namespace and allows
user data to be stored in files. Internally, a file is split into one or
more blocks and these blocks are stored in a set of DataNodes.
The NameNode executes file system namespace operations like
opening, closing, and renaming files and directories. It also
determines the mapping of blocks to DataNodes. The DataNodes
are responsible for serving read and write requests from the file
system's clients. The DataNodes also perform block creation,
deletion, and replication upon instruction from the NameNode.
The NameNode and DataNode are pieces of software designed to
run on commodity machines. These machines typically run a
GNU/Linux operating system (OS).
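
To make this concrete, the short Java sketch below uses the standard org.apache.hadoop.fs.FileSystem API to create a directory and write a small file. The NameNode executes the namespace operations (mkdirs, create), while the file's contents are written in blocks to DataNodes. The cluster address hdfs://namenode-host:8020 and the paths are placeholders, not values from this document.

import java.net.URI;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataOutputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class HdfsWriteExample {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        // Placeholder NameNode URI; use your cluster's fs.defaultFS value.
        FileSystem fs = FileSystem.get(URI.create("hdfs://namenode-host:8020"), conf);

        // Namespace operations such as mkdirs and create are handled by the NameNode.
        Path dir = new Path("/user/example");
        fs.mkdirs(dir);

        // The file's data is split into blocks and stored on DataNodes.
        try (FSDataOutputStream out = fs.create(new Path(dir, "hello.txt"))) {
            out.writeBytes("hello hdfs\n");
        }
        fs.close();
    }
}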

HDFS is built using the Java language; any machine that supports
Java can run the NameNode or the DataNode software.

Usage of the highly portable Java language means that HDFS can
be deployed on a wide range of machines. A typical deployment
has a dedicated machine that runs only the NameNode software.
Each of the other machines in the cluster runs one instance of the
DataNode software. The architecture does not preclude running
multiple DataNodes on the same machine, but in a real
deployment that is rarely the case. The existence of a single
NameNode in a cluster greatly simplifies the architecture of the
system. The NameNode is the arbitrator and repository for all
HDFS metadata. The system is designed in such a way that user
data never flows through the NameNode.
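
As a hedged illustration of that division of labor, the sketch below asks the NameNode (through the FileSystem API) for the block locations of a file. Only metadata comes back from the NameNode; the block contents themselves would be streamed directly from the listed DataNodes. The file path and cluster address are placeholders.

import java.net.URI;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.BlockLocation;
import org.apache.hadoop.fs.FileStatus;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class HdfsBlockLocations {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        // Placeholder NameNode URI; adjust for your deployment.
        FileSystem fs = FileSystem.get(URI.create("hdfs://namenode-host:8020"), conf);

        Path file = new Path("/user/example/hello.txt");
        FileStatus status = fs.getFileStatus(file);

        // Metadata query answered by the NameNode: which DataNodes hold each block.
        BlockLocation[] blocks = fs.getFileBlockLocations(status, 0, status.getLen());
        for (BlockLocation block : blocks) {
            System.out.println("offset=" + block.getOffset()
                    + " length=" + block.getLength()
                    + " hosts=" + String.join(",", block.getHosts()));
        }
        fs.close();
    }
}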
All HDFS communication protocols are layered on top of the
TCP/IP protocol. A client establishes a connection to a
configurable TCP port on the NameNode machine. It talks the
ClientProtocol with the NameNode. The DataNodes talk to the
NameNode using the DataNode Protocol. A Remote Procedure Call
(RPC) abstraction wraps both the ClientProtocol and the
DataNode Protocol. By design, the NameNode never initiates any
RPCs. Instead, it only responds to RPC requests issued by
DataNodes or clients.
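
As a rough sketch of how a client is pointed at that configurable TCP port: the fs.defaultFS property names the NameNode's RPC endpoint, and metadata calls made through the resulting FileSystem handle travel over the ClientProtocol wrapped in Hadoop RPC. The host name and port below (8020 is a commonly used default) are assumptions, not values from this document.

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;

public class HdfsClientConfig {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        // fs.defaultFS points clients at the NameNode's RPC endpoint;
        // 8020 is a common default port, but clusters may configure another.
        conf.set("fs.defaultFS", "hdfs://namenode-host:8020");

        // Metadata calls through this handle go to the NameNode over the
        // ClientProtocol; the NameNode only responds, never initiates RPCs.
        FileSystem fs = FileSystem.get(conf);
        System.out.println("Connected to " + fs.getUri());
        fs.close();
    }
}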
