Ap21110010351 4
ASSIGNMENT IV
T.Naga Abhiram
AP21110010404
Explain each step with commands and proper screenshots on how to install and
configure Hadoop on your virtual machine.
• Run [sudo nano ~/.bashrc] and paste the following lines at the end of the file.
export JAVA_HOME=/usr/lib/jvm/java-11-openjdk-amd64
export PATH=$PATH:/usr/lib/jvm/java-11-openjdk-amd64/bin
export HADOOP_HOME=~/hadoop-3.3.6/
export PATH=$PATH:$HADOOP_HOME/bin
export PATH=$PATH:$HADOOP_HOME/sbin
export HADOOP_MAPRED_HOME=$HADOOP_HOME
export YARN_HOME=$HADOOP_HOME
export HADOOP_CONF_DIR=$HADOOP_HOME/etc/hadoop
export HADOOP_COMMON_LIB_NATIVE_DIR=$HADOOP_HOME/lib/native
export HADOOP_OPTS="-Djava.library.path=$HADOOP_HOME/lib/native"
export HADOOP_STREAMING=$HADOOP_HOME/share/hadoop/tools/lib/hadoop-streaming-3.3.6.jar
export HADOOP_LOG_DIR=$HADOOP_HOME/logs
export PDSH_RCMD_TYPE=ssh
Because you are overwriting an existing file, save it with ‘Ctrl+O’ (press Enter to confirm the file name), then exit nano with ‘Ctrl+X’.
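The new variables only take effect in a new terminal or after reloading the file with `source ~/.bashrc`. As a quick sanity check, a small helper along the following lines could confirm that the key variables are set. The name `check_hadoop_env` is hypothetical and used only for illustration; it is not part of Hadoop.

```shell
# Hypothetical helper: report whether the variables exported in .bashrc are set.
check_hadoop_env() {
    rc=0
    for var in JAVA_HOME HADOOP_HOME HADOOP_CONF_DIR; do
        # Indirect lookup of the variable whose name is stored in $var.
        eval "value=\${$var:-}"
        if [ -z "$value" ]; then
            echo "MISSING: $var"
            rc=1
        else
            echo "OK: $var=$value"
        fi
    done
    return "$rc"
}
```

Running `source ~/.bashrc` followed by `check_hadoop_env` should report each variable as OK. Running `hadoop version` afterwards is a further check that the Hadoop `bin` directory is on `PATH`.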
• From the $HADOOP_HOME/etc/hadoop directory, run [sudo nano hdfs-site.xml] and paste the following inside the <configuration> element.
<property>
<name>dfs.replication</name>
<value>1</value>
</property>
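To confirm that the property was saved as intended, the value can be read back from the file. The following is a minimal sketch, assuming the usual layout where each `<name>` is directly followed by its `<value>`. The function name `get_property` is hypothetical, and the dots in the property name are treated as regex wildcards, which is acceptable for a quick check but not a strict XML parse.

```shell
# Hypothetical helper: print the value of one property from a *-site.xml file.
# $1 = path to the XML file, $2 = property name
get_property() {
    # Join the file into a single line so name and value can sit on separate lines.
    tr -d '\n' < "$1" |
        sed -n "s:.*<name>$2</name>[[:space:]]*<value>\([^<]*\)</value>.*:\1:p"
}
```

For example, `get_property hdfs-site.xml dfs.replication` should print the replication factor set above. Once Hadoop itself is on `PATH`, `hdfs getconf -confKey dfs.replication` reports the value that Hadoop actually loads.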
• In the same directory, run [sudo nano mapred-site.xml] and paste the following inside the <configuration> element.
<property>
<name>mapreduce.framework.name</name>
<value>yarn</value>
</property>
<property>
<name>mapreduce.application.classpath</name>
<value>$HADOOP_MAPRED_HOME/share/hadoop/mapreduce/*:$HADOOP_MAPRED_HOME/share/hadoop/mapreduce/lib/*</value>
</property>
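The classpath value is a colon-separated list of directories, each ending in a `/*` wildcard. A rough way to check that those directories exist in the installation is sketched below. The function name `list_classpath_dirs` is hypothetical, and the check only looks at the directories, not at the jar files inside them.

```shell
# Hypothetical helper: report whether each directory in a classpath string exists.
# $1 = classpath string with ':' separators and trailing /* wildcards
list_classpath_dirs() {
    old_ifs=$IFS
    IFS=:
    set -f                      # keep the shell from expanding the * wildcards
    for entry in $1; do
        dir=${entry%/\*}        # strip the trailing /* to get the directory
        if [ -d "$dir" ]; then
            echo "found: $dir"
        else
            echo "missing: $dir"
        fi
    done
    set +f
    IFS=$old_ifs
}
```

It could be called as `list_classpath_dirs "$HADOOP_MAPRED_HOME/share/hadoop/mapreduce/*:$HADOOP_MAPRED_HOME/share/hadoop/mapreduce/lib/*"` after the variables from .bashrc have been loaded.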
• In the same directory, run [sudo nano yarn-site.xml] and paste the following inside the <configuration> element.
<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle</value>
</property>
<property>
<name>yarn.nodemanager.env-whitelist</name>
<value>JAVA_HOME,HADOOP_COMMON_HOME,HADOOP_HDFS_HOME,HADOOP_CONF_DIR,CLASSPATH_PREPEND_DISTCACHE,HADOOP_YARN_HOME,HADOOP_MAPRED_HOME</value>
</property>
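Because the whitelist is one long comma-separated value, a stray space or line break introduced while pasting is easy to miss. The sketch below flags any entry that is not a plain environment variable name. The function name `check_env_whitelist` is hypothetical and is not a Hadoop tool.

```shell
# Hypothetical helper: validate a comma-separated list of environment variable names.
# $1 = the value of yarn.nodemanager.env-whitelist
check_env_whitelist() {
    rc=0
    old_ifs=$IFS
    IFS=,
    set -f                      # no filename expansion while splitting the list
    for name in $1; do
        case $name in
            ''|*[!A-Za-z0-9_]*)
                # Empty entries or entries with spaces or other characters are suspect.
                echo "invalid: '$name'"
                rc=1
                ;;
            *)
                echo "ok: $name"
                ;;
        esac
    done
    set +f
    IFS=$old_ifs
    return "$rc"
}
```

Passing the `<value>` text as a single quoted argument prints one line per entry and returns a non-zero status if any entry looks malformed.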