On Master Nodes Nodes: Install and Edit Bashrc On All Nodes For JAVA and HADOOP
On Master Nodes Nodes: Install and Edit Bashrc On All Nodes For JAVA and HADOOP
Edit /etc/hosts file in all nodes (remove 127.0.1.1. on all slaves and master)
Hostname1:IP
Hostname2:IP
Hostname3:IP
Install and edit bashrc on all nodes for JAVA and HADOOP.
Check JPS
Edit env.sh and give JAVA path on both master and slave nodes.
On Master Node:
1.core-site.xml
<property>
<name>fs.default.name</name>
<value>hdfs://Master_hostname</value>
</property>
<property>
<name>dfs.permissions</name>
<value>false</value>
</property>
<property>
<name>hadoop.tmp.dir</name>
<value>/home/hadoop/tmp</value>
<description>Temporary Directory.</description>
</property>
<property>
<name>fs.defaultFS</name>
<value>hdfs:// Master_hostname:54310</value>
</property>
2.hdfs-site.xml
<property>
<name>dfs.namenode.name.dir</name>
<value>/home/hadoop/data/nameNode</value>
</property>
<property>
<name>dfs.datanode.data.dir</name>
<value>/home/hadoop/data/dataNode</value>
</property>
<property>
<name>dfs.replication</name>
<value>2</value>
</property>
3.mapred-site.xml :
<property>
<name>mapreduce.framework.name</name>
<value>yarn</value>
</property>
<property>
<name>mapred.job.tracker.address</name>
<value> Master_hostname:54311</value>
</property>
4.yarn-site.xml
<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle</value>
</property>
<property>
<name>yarn.nodemanager.aux-services.mapreduce.shuffle.class</name>
<value>org.apache.hadoop.mapred.ShuffleHandler</value>
</property>
<property>
<name>yarn.resourcemanager.scheduler.address</name>
<value> Master_hostname:8030</value>
</property>
<property>
<name>yarn.resourcemanager.address</name>
<value> Master_hostname:8032</value>
</property>
<property>
<name>yarn.resourcemanager.webapp.address</name>
<value> Master_hostname:8088</value>
</property>
<property>
<name>yarn.resourcemanager.resource-tracker.address</name>
<value> Master_hostname:8031</value>
</property>
<property>
<name>yarn.resourcemanager.admin.address</name>
<value> Master_hostname:8033</value>
</property>
5. Create masters file and do entry of maternode_hostname.
<property>
<name>fs.default.name</name>
</property>
<property>
<name>dfs.permissions</name>
<value>false</value>
</property>
<property>
<name>fs.defaultFS</name>
<value>hdfs:// Master_hostname:54310</value>
</property>
2. hdfs-site.xml
<property>
<name>dfs.namenode.name.dir</name>
<value>/home/hadoop/data/nameNode</value>
</property>
<property>
<name>dfs.datanode.data.dir</name>
<value>/home/hadoop/data/dataNode</value>
</property>
<property>
<name>dfs.replication</name>
<value>2</value>
</property>
3. yarn-site.xml
<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle</value>
</property>
<property>
<name>yarn.nodemanager.aux-services.mapreduce.shuffle.class</name>
<value>org.apache.hadoop.mapred.ShuffleHandler</value>
</property>
<property>
<name>yarn.resourcemanager.scheduler.address</name>
<value> Master_hostname:8030</value>
</property>
<property>
<name>yarn.resourcemanager.address</name>
<value> Master_hostname:8032</value>
</property>
<property>
<name>yarn.resourcemanager.webapp.address</name>
<value> Master_hostname:8088</value>
</property>
<property>
<name>yarn.resourcemanager.resource-tracker.address</name>
<value> Master_hostname:8031</value>
</property>
<property>
<name>yarn.resourcemanager.admin.address</name>
<value> Master_hostname:8033</value>
</property>
Running the Cluster
start-dfs.sh
start-yarn.sh
NodeManager
Jps
SecondaryNameNode
NameNode
DataNode
ResourceManager
Jps
NodeManager
DataNode