
AMC ENGINEERING COLLEGE

Dept. Of Computer Science and Engineering


Big Data Analytics [21CS71] Assignment-2
Topic: Installation of Hadoop
Kalyan G V (1AM21CS077)
Installation of Hadoop

This document outlines the steps for installing Hadoop on a single Linux node: installing the prerequisites (Java and SSH), downloading Hadoop, configuring environment variables and the core configuration files, formatting HDFS, starting the services, and verifying the installation through the web UIs and a sample job. The instructions assume familiarity with the Linux command line.

Steps for Hadoop Installation:

1. Install Java Development Kit (JDK):

Hadoop requires Java to be installed on your system.

• To check if Java is installed:

java -version

• If Java is not installed, install it using:

sudo apt update

sudo apt install openjdk-8-jdk

• Verify the installation:

java -version

2. Install SSH:

Hadoop's control scripts use SSH to start and stop the daemons on each node, so SSH must be installed even on a single machine.

• Install SSH if it is not already present:

sudo apt install openssh-server

• Ensure SSH is running:

sudo systemctl start ssh

sudo systemctl enable ssh
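
• For a single-node setup, the start-dfs.sh and start-yarn.sh scripts log in to localhost over SSH, so passwordless login must work. A standard key-based setup (skip it if ssh localhost already connects without a password):

ssh-keygen -t rsa -P '' -f ~/.ssh/id_rsa

cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys

chmod 0600 ~/.ssh/authorized_keys

ssh localhost
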
3. Download Hadoop:

• Download Hadoop from the official Apache website (downloads.apache.org hosts only recent releases; if 3.3.1 has been archived, fetch it from https://archive.apache.org/dist/hadoop/common/ instead):

wget https://downloads.apache.org/hadoop/common/hadoop-3.3.1/hadoop-3.3.1.tar.gz
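
• Optionally, verify the integrity of the download; Apache publishes a .sha512 checksum file alongside each release, and the digest printed by the command below should match its contents:

sha512sum hadoop-3.3.1.tar.gz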

• Extract the downloaded tar file:

tar -xzvf hadoop-3.3.1.tar.gz

• Move the extracted folder to /usr/local/hadoop:

sudo mv hadoop-3.3.1 /usr/local/hadoop

4. Configure Hadoop Environment Variables:

• Open the .bashrc file to add Hadoop-related environment variables:

nano ~/.bashrc

• Add the following lines at the end of the file:

export HADOOP_HOME=/usr/local/hadoop
export PATH=$PATH:$HADOOP_HOME/bin:$HADOOP_HOME/sbin
export HADOOP_CONF_DIR=$HADOOP_HOME/etc/hadoop
export YARN_HOME=$HADOOP_HOME

• Save and close the file. Then, apply the changes:

source ~/.bashrc
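
• To confirm the variables are active in the current shell:

echo $HADOOP_HOME

This should print /usr/local/hadoop. Once JAVA_HOME is set in the next step (hadoop-env.sh), hadoop version should also run and report version 3.3.1.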

5. Configure Hadoop Files:

Hadoop requires several configuration files to be set up for proper functioning.
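
• hadoop-env.sh: daemons launched over SSH do not inherit your shell environment, so set JAVA_HOME explicitly in this file:

nano $HADOOP_HOME/etc/hadoop/hadoop-env.sh

Add (or uncomment and edit) the following line; the path shown assumes the openjdk-8-jdk package on amd64 Ubuntu, and readlink -f /usr/bin/java will confirm the actual location:

export JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64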

• core-site.xml: Navigate to $HADOOP_HOME/etc/hadoop and open core-site.xml:

nano $HADOOP_HOME/etc/hadoop/core-site.xml

Add the following configuration:

<configuration>
<property>
<name>fs.defaultFS</name>
<value>hdfs://localhost:9000</value>
</property>
</configuration>
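
Optionally, also set hadoop.tmp.dir so that HDFS data does not land under /tmp, which is cleared on reboot. This assumes you create a directory such as /usr/local/hadoop/tmp and make it writable by your user:

<property>
<name>hadoop.tmp.dir</name>
<value>/usr/local/hadoop/tmp</value>
</property>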

• hdfs-site.xml: Edit hdfs-site.xml:

nano $HADOOP_HOME/etc/hadoop/hdfs-site.xml

Add the following configuration:

<configuration>
<property>
<name>dfs.replication</name>
<value>1</value>
</property>
</configuration>
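
Optionally, pin the NameNode and DataNode storage directories instead of relying on the defaults under hadoop.tmp.dir; the paths below are illustrative and must exist and be writable by your user:

<property>
<name>dfs.namenode.name.dir</name>
<value>file:///usr/local/hadoop/hdfs/namenode</value>
</property>
<property>
<name>dfs.datanode.data.dir</name>
<value>file:///usr/local/hadoop/hdfs/datanode</value>
</property>
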
• mapred-site.xml: Edit mapred-site.xml (in Hadoop 3.x this file ships in etc/hadoop; only older 2.x releases require copying it from mapred-site.xml.template first):

nano $HADOOP_HOME/etc/hadoop/mapred-site.xml

Add the following configuration:

<configuration>
<property>
<name>mapreduce.framework.name</name>
<value>yarn</value>
</property>
</configuration>
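
On Hadoop 3.x, MapReduce jobs submitted to YARN also need the MapReduce libraries on their classpath; the official single-node setup guide adds a property for this. A sketch following that guide:

<property>
<name>mapreduce.application.classpath</name>
<value>$HADOOP_MAPRED_HOME/share/hadoop/mapreduce/*:$HADOOP_MAPRED_HOME/share/hadoop/mapreduce/lib/*</value>
</property>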

• yarn-site.xml: Edit yarn-site.xml:

nano $HADOOP_HOME/etc/hadoop/yarn-site.xml

Add the following configuration:

<configuration>
<property>
<name>yarn.resourcemanager.address</name>
<value>localhost:8032</value>
</property>
</configuration>
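
For MapReduce jobs to shuffle intermediate data on YARN, the NodeManager must also run the shuffle auxiliary service; this is the standard setting from the Hadoop docs:

<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle</value>
</property>
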
6. Format the Hadoop File System:

• Format the Hadoop Distributed File System (HDFS) before starting it for the first time (run this only once; reformatting later erases the NameNode's metadata and with it the references to any existing HDFS data):

hdfs namenode -format

7. Start Hadoop Services:

• Start the HDFS daemons (Namenode, Datanode):

start-dfs.sh

• Start YARN daemons (ResourceManager, NodeManager):

start-yarn.sh
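
• Confirm the daemons are running with the JDK's jps tool; on a single node you should see NameNode, DataNode, SecondaryNameNode, ResourceManager, and NodeManager listed:

jps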
8. Verify Hadoop Installation:

• Check if Hadoop is running by opening the ResourceManager and NameNode web UIs:
o ResourceManager UI: http://localhost:8088
o NameNode UI: http://localhost:9870
• Check the status of HDFS:

hdfs dfsadmin -report

• You can also run a sample MapReduce job; the pi example below estimates Pi with 2 map tasks of 5 samples each, and a successful run ends by printing the estimated value:

yarn jar $HADOOP_HOME/share/hadoop/mapreduce/hadoop-mapreduce-examples-3.3.1.jar pi 2 5
