The Main Components in Apache Hadoop YARN
The main components, their functionality, and how they communicate:
1. ResourceManager (RM):
Functionality: The ResourceManager is the central authority for resource management
and job scheduling in a YARN cluster.
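Main Functionality:
Allocates cluster resources among running applications and schedules their jobs.
Communication:
Receives status reports from NodeManagers and resource requests from
ApplicationMasters.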
2. NodeManager (NM):
Main Functionality:
Manages resources (CPU, memory, etc.) on a node and reports status to the
ResourceManager.
Communication:
Communicates with containers running on the node to monitor their status and
manage their execution.
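The bookkeeping described above can be illustrated with a small Python sketch. It is a toy model, not the real YARN API: the class name NodeManagerSketch, its methods, and the report fields are all hypothetical names chosen for illustration. It shows a node tracking the resources held by its containers and building the kind of status report it would send to the ResourceManager.

```python
from dataclasses import dataclass, field


@dataclass
class NodeManagerSketch:
    """Toy model of one node's resource bookkeeping (not the real YARN API)."""
    node_id: str
    total_memory_mb: int
    total_vcores: int
    # container_id -> (memory_mb, vcores) currently allocated on this node
    containers: dict = field(default_factory=dict)

    def used(self):
        """Return (memory_mb, vcores) currently held by running containers."""
        mem = sum(m for m, _ in self.containers.values())
        cores = sum(c for _, c in self.containers.values())
        return mem, cores

    def start_container(self, container_id, memory_mb, vcores):
        """Start a container only if the node has enough free resources."""
        used_mem, used_cores = self.used()
        if (used_mem + memory_mb > self.total_memory_mb
                or used_cores + vcores > self.total_vcores):
            return False
        self.containers[container_id] = (memory_mb, vcores)
        return True

    def stop_container(self, container_id):
        """Release the resources held by a container, if it exists."""
        self.containers.pop(container_id, None)

    def status_report(self):
        """Build the status message that would go to the ResourceManager."""
        used_mem, used_cores = self.used()
        return {
            "node_id": self.node_id,
            "free_memory_mb": self.total_memory_mb - used_mem,
            "free_vcores": self.total_vcores - used_cores,
            "running_containers": sorted(self.containers),
        }


nm = NodeManagerSketch("node-1", total_memory_mb=8192, total_vcores=4)
nm.start_container("c1", 2048, 1)
nm.start_container("c2", 4096, 2)
print(nm.status_report())
```

In a real cluster the NodeManager sends this kind of information to the ResourceManager periodically; the sketch only builds the report on demand.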
3. Container:
Functionality: Containers provide isolated execution environments for running
application tasks on nodes in the cluster.
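Main Functionality:
Bundles a fixed allocation of resources (CPU, memory, etc.) on a single node, within
which one task runs.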
Communication:
Receives instructions from the NodeManager to start, monitor, and stop tasks.
4. ApplicationMaster (AM):
Main Functionality:
Coordinates task execution, monitors progress, and handles task failures and
retries.
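Communication:
Negotiates containers with the ResourceManager and works with NodeManagers to launch
and monitor tasks.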
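The failure handling attributed to the ApplicationMaster can be sketched as a simple retry loop. This is a minimal illustration under stated assumptions, not YARN code: run_with_retries, flaky_runner, the task ids, and the limit of three attempts are all hypothetical.

```python
def run_with_retries(tasks, run_task, max_attempts=3):
    """Run each task, retrying failures up to max_attempts times.

    tasks: iterable of task ids
    run_task: callable(task_id, attempt) -> bool (True on success)
    Returns a dict mapping task id -> (status, attempts used).
    """
    results = {}
    for task_id in tasks:
        for attempt in range(1, max_attempts + 1):
            if run_task(task_id, attempt):
                results[task_id] = ("SUCCEEDED", attempt)
                break
        else:
            # The loop ended without a success: give up on this task.
            results[task_id] = ("FAILED", max_attempts)
    return results


def flaky_runner(task_id, attempt):
    # Hypothetical failure pattern: "map-1" fails once, "map-2" always fails.
    if task_id == "map-1":
        return attempt >= 2
    if task_id == "map-2":
        return False
    return True


results = run_with_retries(["map-0", "map-1", "map-2"], flaky_runner)
print(results)
```

A real ApplicationMaster would typically request a fresh container for each retry rather than call a function in-process; the loop only shows the progress-tracking and retry logic.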
In summary, the ResourceManager manages cluster-wide resource allocation and job scheduling, while
NodeManagers manage resources and execute containers on individual nodes. Containers provide
isolated execution environments for application tasks, and ApplicationMasters manage application
lifecycles and coordinate task execution. Communication between these components enables efficient
resource management, job scheduling, and task execution in a YARN cluster.
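To make the cluster-wide allocation role concrete, here is a minimal Python sketch of a ResourceManager placing container requests on nodes. It assumes a simple first-fit policy over memory only, which stands in for YARN's actual pluggable schedulers; the class ResourceManagerSketch and its methods are hypothetical names, not part of any YARN API.

```python
class ResourceManagerSketch:
    """Toy cluster-wide allocator using first-fit on memory (an assumption)."""

    def __init__(self):
        # node_id -> free memory in MB, taken from the latest node report
        self.free_memory = {}

    def node_report(self, node_id, free_memory_mb):
        """Record the free memory a node reported."""
        self.free_memory[node_id] = free_memory_mb

    def allocate(self, memory_mb):
        """Return the first node that can host the request, or None."""
        for node_id in sorted(self.free_memory):
            if self.free_memory[node_id] >= memory_mb:
                self.free_memory[node_id] -= memory_mb
                return node_id
        return None


rm = ResourceManagerSketch()
rm.node_report("node-1", 4096)
rm.node_report("node-2", 8192)

# Container requests (in MB) as an ApplicationMaster might submit them.
placements = [rm.allocate(mb) for mb in (3072, 2048, 6144, 1024)]
print(placements)
```

Once both nodes are full, further calls to allocate return None, which corresponds to a request that must wait until resources are released.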