0% found this document useful (0 votes)

62 views8 pages

Installation of Hadoop

The document provides steps to install and configure Hadoop on Ubuntu, including: 1) Installing Java and configuring JAVA_HOME. 2) Creating a dedicated Hadoop user "hduser" and generating an SSH key. 3) Downloading and extracting Hadoop before configuring core-site.xml, mapred-site.xml, and hdfs-site.xml files. 4) Formatting the NameNode and starting all Hadoop services using start-all.sh. 5) Verifying services are running using the jps tool.

Uploaded by

David Joseph

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

62 views8 pages

Installation of Hadoop

Uploaded by

David Joseph

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 8

1. Installing Sun JDK 1.6: Installing JDK is a required step to install Hadoop.

You can follow the steps in

my previous post.

1. Based on your linux architecture, download the proper version from Oracle website (Oracle
JDK 1.7)
2. Then, uncompress the jdk archive using the following command:
tar -xvf jdk-7u65-linux-i586.tar
Or using the following command for 64 bits:
tar -xvf jdk-7u65-linux-x64.tar
3. Create a folder named jvm under (if not exists) using the following command
sudo mkdir -p /usr/lib/jvm
4. Then, move the extracted directory to /usr/lib/jvm:
sudo mv ~/Downloads/jdk1.7.0_71 /usr/lib/jvm/
5. Run the following commands to update the execution alternatives:
sudo update-alternatives --install "/usr/bin/java" "java"
"/usr/lib/jvm/jdk1.7.0_71/bin/java" 1 sudo update-alternatives
--install "/usr/bin/javac" "javac"
"/usr/lib/jvm/jdk1.7.0_71/bin/javac" 1 sudo update-alternatives
--install "/usr/bin/javaws" "javaws"
"/usr/lib/jvm/jdk1.7.0_71/bin/javaws" 1
6. Finally, you need to export JAVA_HOME variable:
export JAVA_HOME=/usr/lib/jvm/jdk1.7.0_71
or it is better to set JAVA_HOME in .bashrc:
nano ~/.bashrc
then add the same line:
export JAVA_HOME=/usr/lib/jvm/jdk1.7.0_71

2. Adding a dedicated Hadoop system user: You will need a user for hadoop system you will install. To
create a new user "hduser" in a group called "hadoop", run the following commands in your terminal:
$sudo addgroup hadoop
$sudo adduser --ingroup hadoop hduser
3.ConfiguringSSH:inMichaelBlog,heassumedthattheSSHisalreadyinstalled.Butifyoudidn'tinstallSSH
serverbefore,youcanrunthefollowingcommandinyourterminal:Bythiscommand,youwillhaveinstalledssh

serveronyourmachine,theportis22bydefault.

$sudo apt-get install openssh-server

WehaveinstalledSSHbecauseHadooprequiresaccesstolocalhost(incasesinglenodecluster)or
communicateswithremotenodes(incasemultinodecluster).
Afterthisstep,youwillneedtogenerateSSHkeyforhduser(andtheusersyouneedtoadministerHadoopif
any)byrunningthefollowingcommands,butyouneedfirsttoswitchtohduser:
$su - hduser
$ssh-keygen -t rsa -P ""
TobesurethatSSHinstallationiswentwell,youcanopenanewterminalandtrytocreatesshsessionusing
hduserbythefollowingcommand:
$ssh localhost

InstallingHadoop
NowwecandownloadHadooptobegininstallation.GotoApacheDownloadsanddownloadHadoopversion
0.20.2.Toovercomethesecurityissues,youcandownloadthetarfileinhduserdirectory,for
example,/home/hduser.Checkthefollowingsnapshot:

Thenyouneedtoextractthetarfileandrenametheextractedfolderto'hadoop'.Openanewterminalandrunthe
followingcommand:
$ cd /home/hduser
$ sudo tar xzf hadoop-0.20.2.tar.gz
$ sudo mv hadoop-0.20.2 hadoop
Pleasenoteifyouwanttograntaccessforanotherhadoopadminuser(e.g.hduser2),youhavetogrant
readpermissiontofolder/home/hduserusingthefollowingcommand:
sudo chown -R hduser2:hadoop hadoop

Update$HOME/.bashrc
Youwillneedtoupdatethe.bachrcforhduser(andforeveryuseryouneedtoadministerHadoop).Toopen.bachrc
file,youwillneedtoopenitasroot:
$sudogedit/home/hduser/.bashrc
Thenyouwilladdthefollowingconfigurationsattheendof.bachrcfile

# Set Hadoop-# related environment variables

export HADOOP_HOME=/home/hduser/hadoop

# Set JAVA_HOME (we will also configure JAVA_HOME directly for Hadoop later
on)

export JAVA_HOME=/usr/lib/jvm/java-6-sun
# or you can write the following command if you used this post to install your java
# export JAVA_HOME=/usr/lib/jvm/jdk1.7.0_71
export JAVA_HOME=/usr/lib/jvm/java-7-openjdk-amd64
export PATH=$JAVA_HOME/bin:$PATH

# Some convenient aliases and functions for running Hadoop-related commands

unalias fs &> /dev/null
alias fs="hadoop fs"
unalias hls &> /dev/null
alias hls="fs -ls"
# If you have LZO compression enabled in your Hadoop cluster and
# compress job outputs with LZOP (not covered in this tutorial):
# Conveniently inspect an LZOP compressed file from the command
# line; run via:
#
# $ lzohead /hdfs/path/to/lzop/compressed/file.lzo
#
# Requires installed 'lzop' command.
#
lzohead () {
hadoop fs -cat $1 | lzop -dc | head -1000 | less
}
# Add Hadoop bin/ directory to PATH
export PATH=$PATH:$HADOOP_HOME/bin

HadoopConfiguration

Now,weneedtoconfigureHadoopframeworkonUbuntumachine.Thefollowingareconfigurationfileswecan
usetodotheproperconfiguration.Toknowmoreabouthadoopconfigurations,youcanvisitthissite

hadoopenv.sh
WeneedonlytoupdatetheJAVA_HOMEvariableinthisfile.Simplyyouwillopenthisfileusingatexteditor
usingthefollowingcommand:

$sudo gedit /home/hduser/hadoop/conf/hadoop-env.sh

Thenyouwillneedtochangethefollowingline
# export JAVA_HOME=/usr/lib/j2sdk1.5-sun
To
export JAVA_HOME=/usr/lib/jvm/java-6-sun
or you can write the following command if you used this post to install your java
# export JAVA_HOME=/usr/lib/jvm/jdk1.7.0_71
export JAVA_HOME=/usr/lib/jvm/java-7-openjdk-amd64
Note:ifyoufaced"Error:JAVA_HOMEisnotset"Errorwhilestartingtheservices,thenyouseemsthatyou
forgottoeuncommentthepreviousline(justremove#).

coresite.xml
First,weneedtocreateatempdirectoryforHadoopframework.Ifyouneedthisenvironmentfortestingoraquick
prototype(e.g.developsimplehadoopprogramsforyourpersonaltest...),Isuggesttocreatethisfolder
under/home/hduser/directory,otherwise,youshouldcreatethisfolderinasharedplaceundersharedfolder(like
/usr/local...)butyoumayfacesomesecurityissues.Buttoovercometheexceptionsthatmaycausedbysecurity
(likejava.io.IOException),Ihavecreatedthetmpfolderunderhduserspace.
Tocreatethisfolder,typethefollowingcommand:
$ sudo mkdir

/home/hduser/tmp

Pleasenotethatifyouwanttomakeanotheradminuser(e.g.hduser2inhadoopgroup),youshouldgranthimaread
andwritepermissiononthisfolderusingthefollowingcommands:

$ sudo chown hduser2:hadoop /home/hduser/tmp

$ sudo chmod 755 /home/hduser/tmp
Now,wecanopenhadoop/conf/coresite.xmltoeditthehadoop.tmp.direntry.
Wecanopenthecoresite.xmlusingtexteditor:
$sudogedit/home/hduser/hadoop/conf/coresite.xml
Thenaddthefollowingconfigurationsbetween<configuration>..</configuration>xmlelements:

<property>
<name>hadoop.tmp.dir</name>
<value>/home/hduser/tmp</value>
<description>A base for other temporary directories.</description>
</property>
<property>
<name>fs.default.name</name>
<value>hdfs://localhost:9000</value>
<description>The name of the default file system. A URI whose
scheme and authority determine the FileSystem implementation. The
uri's scheme determines the config property (fs.SCHEME.impl) naming
the FileSystem implementation class. The uri's authority is used to
determine the host, port, etc. for a filesystem.</description>
</property>

mapredsite.xml
Wewillopenthehadoop/conf/mapredsite.xmlusingatexteditorandaddthefollowingconfigurationvalues(like
coresite.xml)

<property>
<name>mapred.job.tracker</name>
<value>localhost:54311</value>
<description>The host and port that the MapReduce job tracker runs
at. If "local", then jobs are run in-process as a single map
and reduce task.
</description>
</property>

hdfssite.xml
Openhadoop/conf/hdfssite.xmlusingatexteditorandaddthefollowingconfigurations:

<property>
<name>dfs.replication</name>
<value>1</value>
<description>Default block replication.
The actual number of replications can be specified when the file is created.
The default is used if replication is not specified in create time.
</description>
</property>

FormattingNameNode
YoushouldformattheNameNodeinyourHDFS.Youshouldnotdothisstepwhenthesystemisrunning.Itis
usuallydoneonceatfirsttimeofyourinstallation.
Runthefollowingcommand
$/home/hduser/hadoop/bin/hadoop namenode -format

NameNode Formatting

StartingHadoopCluster
Youwillneedtonavigatetohadoop/bindirectoryandrun./startall.shscript.

Starting Hadoop Services using ./start-all.sh

Thereisanicetoolcalledjps.Youcanuseittoensurethatalltheservicesareup.

Using jps tool

The key feature of a Writable is that the framework knows how to serialize and deserialize
a Writable object. The WritableComparable adds the compareTo interface so the framework
knows how to sort the WritableComparable objects.

Biblical Meaning of Numbers From One To Forty by Dr. Stephen E. Jones
100% (3)
Biblical Meaning of Numbers From One To Forty by Dr. Stephen E. Jones
77 pages
213nt1306 - Big Data Analytics Lab Manual
No ratings yet
213nt1306 - Big Data Analytics Lab Manual
80 pages
Anurag 1-6 Merged
No ratings yet
Anurag 1-6 Merged
60 pages
Embedded Systems and Robotics
No ratings yet
Embedded Systems and Robotics
34 pages
Install Apache Hadoop Using Cloudera
No ratings yet
Install Apache Hadoop Using Cloudera
132 pages
Bca Project
25% (4)
Bca Project
18 pages
BDA Lab Manual UPDATED
No ratings yet
BDA Lab Manual UPDATED
45 pages
Exp 1 1
No ratings yet
Exp 1 1
24 pages
NCA-6.10 Nutanix Certified Associate (NCA) v6.10 Exam Free Dumps
No ratings yet
NCA-6.10 Nutanix Certified Associate (NCA) v6.10 Exam Free Dumps
4 pages
Mimas V2 Spartan 6 FPGA Development Board: User Guide
No ratings yet
Mimas V2 Spartan 6 FPGA Development Board: User Guide
33 pages
Lab Manual
No ratings yet
Lab Manual
27 pages
Bdamanual
No ratings yet
Bdamanual
8 pages
2023MCS320004 HEMANTH TARRA - Hadoop Installation - Assignment
No ratings yet
2023MCS320004 HEMANTH TARRA - Hadoop Installation - Assignment
9 pages
BDA LAB Programs
No ratings yet
BDA LAB Programs
56 pages
Hadoop Installation
No ratings yet
Hadoop Installation
5 pages
Hadoop Configuration
No ratings yet
Hadoop Configuration
12 pages
PRACTICAL 4 - Single and Multi Node Hadoop Install
No ratings yet
PRACTICAL 4 - Single and Multi Node Hadoop Install
11 pages
Experiment-2 BDA Lab
No ratings yet
Experiment-2 BDA Lab
13 pages
Installing Hadoop On Ubuntu
No ratings yet
Installing Hadoop On Ubuntu
29 pages
Hadoop Installation Guide
No ratings yet
Hadoop Installation Guide
18 pages
Hadoop
No ratings yet
Hadoop
5 pages
Hadoop Single Node Installation
No ratings yet
Hadoop Single Node Installation
7 pages
Installation of Hadoop in Ubuntu
No ratings yet
Installation of Hadoop in Ubuntu
15 pages
Hbase Installationn
No ratings yet
Hbase Installationn
12 pages
Hadoop 3 Installation
No ratings yet
Hadoop 3 Installation
10 pages
Experiment No - 1
No ratings yet
Experiment No - 1
13 pages
Edam5000 Manual v14
No ratings yet
Edam5000 Manual v14
159 pages
Hadoop
No ratings yet
Hadoop
4 pages
BDAO
No ratings yet
BDAO
23 pages
Hadoop Installation
No ratings yet
Hadoop Installation
6 pages
DataVisuaization Lab
No ratings yet
DataVisuaization Lab
5 pages
Install Hadoop
No ratings yet
Install Hadoop
8 pages
TP2 - 3IM - en
No ratings yet
TP2 - 3IM - en
7 pages
Seminar Report Sky X Technology
100% (1)
Seminar Report Sky X Technology
25 pages
Hadoop 2.6 Installing On Ubuntu 14.04 (Single-Node Cluster)
No ratings yet
Hadoop 2.6 Installing On Ubuntu 14.04 (Single-Node Cluster)
27 pages
Hadoop Install
No ratings yet
Hadoop Install
19 pages
EX. NO Date Program NO Sign
No ratings yet
EX. NO Date Program NO Sign
80 pages
Hadoop Installation
No ratings yet
Hadoop Installation
4 pages
How To Install Hadoop On Ubuntu 18.04 or 20.04
No ratings yet
How To Install Hadoop On Ubuntu 18.04 or 20.04
15 pages
Updated CMD
No ratings yet
Updated CMD
23 pages
BDA Practical1 MC18-23
No ratings yet
BDA Practical1 MC18-23
17 pages
Bda Lab
No ratings yet
Bda Lab
37 pages
Installing A Single Node Hadoop Cluster
No ratings yet
Installing A Single Node Hadoop Cluster
4 pages
Edureka Apache Hadoop Single Node Cluster On Ubuntu
No ratings yet
Edureka Apache Hadoop Single Node Cluster On Ubuntu
9 pages
Hadoop Installatio1
No ratings yet
Hadoop Installatio1
22 pages
Installationof Hadoop 3
No ratings yet
Installationof Hadoop 3
6 pages
Hadoop Installation Manual 2.odt
No ratings yet
Hadoop Installation Manual 2.odt
20 pages
Hadoop Installation Steps
100% (1)
Hadoop Installation Steps
6 pages
Hadoop Installation Step by Step
No ratings yet
Hadoop Installation Step by Step
8 pages
Hadoop Installation
No ratings yet
Hadoop Installation
12 pages
Unix Commands Part 2
No ratings yet
Unix Commands Part 2
37 pages
Hadoop Single Node Installation
No ratings yet
Hadoop Single Node Installation
4 pages
Experiment 1 Hadoop Installation
No ratings yet
Experiment 1 Hadoop Installation
6 pages
Hadoop 2.6.5 Installing On Ubuntu 16.04 and 18.04 (Single-Node Cluster)
No ratings yet
Hadoop 2.6.5 Installing On Ubuntu 16.04 and 18.04 (Single-Node Cluster)
7 pages
Single Node Hadoop Cluster
No ratings yet
Single Node Hadoop Cluster
9 pages
Step 1 - Install Oracle Java 8 On Ubuntu
No ratings yet
Step 1 - Install Oracle Java 8 On Ubuntu
7 pages
Map, Filter and Reduce Functions
No ratings yet
Map, Filter and Reduce Functions
149 pages
Hadoop/Hbase Installation: Install Java
No ratings yet
Hadoop/Hbase Installation: Install Java
11 pages
Exam 42
No ratings yet
Exam 42
9 pages
Online:: Setting Up The Environment
No ratings yet
Online:: Setting Up The Environment
9 pages
Hadoop Cluster Creation
No ratings yet
Hadoop Cluster Creation
8 pages
GST 04204 - Computer Applications
No ratings yet
GST 04204 - Computer Applications
226 pages
Sqoop Tutorial: Sqoop: "SQL To Hadoop and Hadoop To SQL"
No ratings yet
Sqoop Tutorial: Sqoop: "SQL To Hadoop and Hadoop To SQL"
11 pages
Opn Exam Study Guide RN Functional 2844181
100% (1)
Opn Exam Study Guide RN Functional 2844181
17 pages
Java-Hadoop 2.X Setting Up
No ratings yet
Java-Hadoop 2.X Setting Up
12 pages
Hadoop 2 - Pseudo Node Installation
No ratings yet
Hadoop 2 - Pseudo Node Installation
9 pages
Hadoop 2.7.3 Setup On Ubuntu 15.10
No ratings yet
Hadoop 2.7.3 Setup On Ubuntu 15.10
7 pages
Osa Eurecom Kaltenberger
No ratings yet
Osa Eurecom Kaltenberger
51 pages
Block Diagram of A Computer
No ratings yet
Block Diagram of A Computer
21 pages
Header and Footer Format
No ratings yet
Header and Footer Format
10 pages
Stack Manager and High Availability Configuration Guide, Cisco IOS XE Amsterdam 17.2.x (Catalyst 9300 Switches)
No ratings yet
Stack Manager and High Availability Configuration Guide, Cisco IOS XE Amsterdam 17.2.x (Catalyst 9300 Switches)
46 pages
The Project Analytics Framework: Oracle Business Intelligence
No ratings yet
The Project Analytics Framework: Oracle Business Intelligence
53 pages
Install Sqoop
No ratings yet
Install Sqoop
7 pages
SystemVerilog Meets C++
No ratings yet
SystemVerilog Meets C++
9 pages
HADOOP 1.X Installation Steps On Ubuntu
No ratings yet
HADOOP 1.X Installation Steps On Ubuntu
3 pages
$ Sudo Apt-Get Install Oracle-Java8-Installer
No ratings yet
$ Sudo Apt-Get Install Oracle-Java8-Installer
4 pages
ZAPI Flash TMX Slave
100% (1)
ZAPI Flash TMX Slave
2 pages
Micronics
No ratings yet
Micronics
11 pages
User's Manual: Notebook
No ratings yet
User's Manual: Notebook
58 pages
Aligning Oracle Fusion Financials To GST 2017
0% (1)
Aligning Oracle Fusion Financials To GST 2017
2 pages
Endoscope
No ratings yet
Endoscope
8 pages
Demo21 Installation Quick Guide January 2017 - V1.7
No ratings yet
Demo21 Installation Quick Guide January 2017 - V1.7
31 pages
User's Manual: 54M/ 150M/300Mbps
No ratings yet
User's Manual: 54M/ 150M/300Mbps
11 pages
ThinkPad - L14 - Gen - 3 - Intel GBC
No ratings yet
ThinkPad - L14 - Gen - 3 - Intel GBC
9 pages
10.detailed Results
No ratings yet
10.detailed Results
14 pages
Aircraft Trajectory Prediction Made Easy With Predictive Analytics
No ratings yet
Aircraft Trajectory Prediction Made Easy With Predictive Analytics
10 pages
IOT Based College Notice Board LED Display
No ratings yet
IOT Based College Notice Board LED Display
6 pages
Situational Intelligence:: The Missing Link in Emergency Notification
No ratings yet
Situational Intelligence:: The Missing Link in Emergency Notification
8 pages
Networking Group Assignment
No ratings yet
Networking Group Assignment
10 pages
APACHE REDIS Training: Trainer:David Joseph
No ratings yet
APACHE REDIS Training: Trainer:David Joseph
3 pages
Sas University Edition 107140 PDF
No ratings yet
Sas University Edition 107140 PDF
4 pages
Lesson 9 - Types of Computers
No ratings yet
Lesson 9 - Types of Computers
5 pages
Read-Only Memories & Programmable Logic Arrays
No ratings yet
Read-Only Memories & Programmable Logic Arrays
7 pages
445 - Mastering The Spring Framework
No ratings yet
445 - Mastering The Spring Framework
4 pages
Big Data All Stars
No ratings yet
Big Data All Stars
33 pages
01 Overview Hadoop
No ratings yet
01 Overview Hadoop
22 pages
Ore Trng5 Operatnlzgrscripts 1501640
No ratings yet
Ore Trng5 Operatnlzgrscripts 1501640
59 pages
SAP HYBRIS Knowledge-Tree
No ratings yet
SAP HYBRIS Knowledge-Tree
1 page
Software Architecture: Eucalyptus
No ratings yet
Software Architecture: Eucalyptus
3 pages
Guidechronologylatest1 PDF
No ratings yet
Guidechronologylatest1 PDF
1 page
Mails 4 Sree 09@
No ratings yet
Mails 4 Sree 09@
1 page
M.Tech II Semester Supplementary Examinations January/February 2019
No ratings yet
M.Tech II Semester Supplementary Examinations January/February 2019
1 page
Guide Calendar 2015-11-23
No ratings yet
Guide Calendar 2015-11-23
1 page
Guide Graphical Timeline 2015 10 25pm Nofooter PDF
No ratings yet
Guide Graphical Timeline 2015 10 25pm Nofooter PDF
1 page
Quick Configuration of Openldap and Kerberos In Linux and Authenicating Linux to Active Directory
From Everand
Quick Configuration of Openldap and Kerberos In Linux and Authenicating Linux to Active Directory
Dr. Hidaia Mahmood Alassouli
No ratings yet

Installation of Hadoop

Uploaded by

Installation of Hadoop

Uploaded by

1. Installing Sun JDK 1.6: Installing JDK is a required step to install Hadoop.

You can follow the steps in

$sudo apt-get install openssh-server

# Set Hadoop-# related environment variables

# Some convenient aliases and functions for running Hadoop-related commands

$sudo gedit /home/hduser/hadoop/conf/hadoop-env.sh

$ sudo chown hduser2:hadoop /home/hduser/tmp

Starting Hadoop Services using ./start-all.sh

Using jps tool

You might also like