0% found this document useful (0 votes)

15 views17 pages

Hadoop Installation

This installation guide provides detailed steps for setting up Apache Hadoop on a Windows system, starting with the installation of Java 8. It includes instructions for downloading, extracting, and configuring Hadoop version 3.3.6, as well as setting necessary environment variables and configuring XML files. The guide concludes with steps to format the NameNode, start Hadoop daemons, and access the web interfaces for monitoring.

Uploaded by

Jeya

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

15 views17 pages

Hadoop Installation

Uploaded by

Jeya

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 17

Installation Guide

This guide provides a comprehensive walkthrough for installing and setting up Apache Hadoop
on a Windows system. Hadoop is an open-source framework designed for distributed storage and
processing of large datasets using clusters of computers.

1. Install Java

Before installing Hadoop, it is essential to install Java, as Hadoop runs on the Java platform. We
will be using Java 8, which is widely supported and stable for Hadoop.

1.1 Download Java:

I. Visit the official Oracle Java download page:

https://www.oracle.com/java/technologies/downloads/#java8

II. Scroll to the Java SE Development Kit 8 section and click the link to download the
Windows x64 Installer.

III. Oracle requires you to sign in:

A. If you already have an Oracle account, log in.

B. If not, create a new account — it’s free.

Fig 1.1 Installation of Java

1.2 Install and Organize Java:

1) Create a folder named java directly in your C drive:

C:\java

2) During installation, change the destination path to the folder you created:

a) Set the path to: C:\java

3) After installation:

a) If Java was installed in C:\Program Files\Java, cut the entire JDK folder (e.g.,
jdk1.8.0_351) and paste it inside your C:\java folder for consistency.
Fig 1.2 Sample setup

1.3 Set Environment Variables:

1. Open the Start menu and search for Environment Variables.

2. Click “Edit the system environment variables”, then click the Environment Variables
button.

3. Under User Variables:

○ Click New and enter the following:

■ Variable Name: JAVA_HOME

■ Variable Value: C:\java\jdk1.8.0_351 (adjust based on your JDK folder

path)
Fig 1.3 Setting of java path in user variable

4. Still in System Variables, select the Path variable and click Edit:

○ Click Add and paste:

C:\java\jdk1.8.0_351\bin(your folder path)
Fig 1.4 System variable path

5. Click OK to close all dialogs and save changes.

Verify Java Installation

1. Open Command Prompt. Type the following command: java -version
2. You should see the installed Java version displayed.
Example output:
java version "1.8.0_351"
3. Java is now successfully installed and configured

2. Install Hadoop (Version 3.3.6)

Now that Java is installed, let's move on to installing Apache Hadoop. In this guide, we'll be
using Hadoop version 3.3.6, which is the latest stable release at the time of writing.

2.1Download Hadoop:
1. Visit the official Hadoop release page:
https://hadoop.apache.org/release/3.3.6.html

2. On the right-hand side, click the “Download tar.gz” button to download the Hadoop
compressed file.

Fig:2.1 Installation of hadoop

2.2 Extract and Organize

1. After the download is complete, extract the entire tar.gz file using tools like WinRAR or
7-Zip.

2. Create a new folder in the C drive and name it something like:
C:\Hadoop_test

3. Cut and paste the extracted Hadoop folder (e.g., hadoop-3.3.6) into this new
Hadoop_test folder:
Final path should be: C:\Hadoop_test\hadoop-3.3.6

Fig 2.2 Hadoop folder setup

2.3 Set Hadoop Environment Variable

Just like we did for Java, now it's time to configure the environment variables for Hadoop.

1. Open the Start menu and search for Environment Variables.

2. Click “Edit the system environment variables”, then click the Environment Variables
button.

3. Under User Variables:

○ Click New and enter the following:

■ Variable Name: HADOOP_HOME

■ Variable Value: C:\Hadoop_test\hadoop-3.3.6

(Make sure the path matches your actual folder)
4. Click OK to save and exit.

Fig 2.3 Creating variable and value in user variable

5. Under System Variable:

○ Click the path and click edit.
■ Copy and paste the path of the Java file and javabin.\
6. Click ok to save it.

Fig 2.4 Path in System variable

3. Configure Hadoop

After installing Hadoop and setting the HADOOP_HOME environment variable, you
now need to configure Hadoop by editing some XML configuration files and preparing system
folders.

3.1 Update Java Path in Hadoop Configuration

1. Go to:
C:\Hadoop_test\hadoop-3.3.6\etc\hadoop
2. Open the file hadoop-env.cmd in a text editor (Right-click → Edit).
3. Find the line that sets the JAVA_HOME and change it to your actual Java path. Example:
set JAVA_HOME=C:\java\jdk1.8.0_351(Your java jdk path)

Fig 3.1 Java path in hadoop-env.cmd

3.2 Set Hadoop Environment Variables

1. Open Environment Variables (Search → “Edit the system environment variables”).
2. Under User Variables:

○ Click New → Name: HADOOP_HOME, Value: C:\Hadoop_test\hadoop-3.3.6

3. Under System Variables:

○ Select Path → Click Edit → Click Add, and paste:

■ C:\Hadoop_test\hadoop-3.3.6\bin

■ C:\Hadoop_test\hadoop-3.3.6\sbin

3.3 Create Data Directories

Create folders to store NameNode and DataNode data:

1. Navigate to:

C:\Hadoop_test\hadoop-3.3.6

2. Create a new folder: data

3. Inside data, create two folders:

○ namenode

○ datanode

So the full paths will be:

● C:\Hadoop_test\hadoop-3.3.6\data\namenode
● C:\Hadoop_test\hadoop-3.3.6\data\datanode

Fig 3.2 Creation of datanode and namenode folder

3.4 Configure Hadoop XML Files

Go to C:\Hadoop_test\hadoop-3.3.6\etc\hadoop and update the following files:

3.4.1. core-site.xml

Replace the content with:

<configuration>
<property>
<name>fs.defaultFS</name>
<value>hdfs://localhost:9000</value>
</property>
</configuration>

3.4.2. hdfs-site.xml

Replace with:

<configuration>
<property>
<name>dfs.replication</name>
<value>1</value>
</property>
<property>
<name>dfs.namenode.name.dir</name>
<value>C:/Hadoop_test/hadoop-3.3.6/data/namenode</value>(Give your’s file
path)
</property>
<property>
<name>dfs.datanode.data.dir</name>
<value>C:/Hadoop_test/hadoop-3.3.6/data/datanode</value> (Give your’s file
path)
</property>
</configuration>
In the values give the path of the datanode and , namenode.
Fig 3.3 Hdfs-site.xml file
Repeat this for httpfs-site.xml file also with same changes.

3.4.3. mapred-site.xml

If the file doesn’t exist, copy mapred-site.xml.template and rename it to mapred-site.xml.

Then paste:

<configuration>
<property>
<name>mapreduce.framework.name</name>
<value>yarn</value>
</property>
</configuration>
3.4.4 yarn-site.xml

Paste the following:

<configuration>
<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle</value>
</property>
<property>
<name>yarn.nodemanager.auxservices.mapreduce.shuffle.class</name>
<value>org.apache.hadoop.mapred.shuffleHandler</value>
</property>
</configuration>

3.5 Fix Missing bin Files (winutils.exe)

The default Hadoop distribution does not include the necessary Windows binaries.
Follow these steps:

1. Delete the existing bin folder inside C:\Hadoop_test\hadoop-3.3.6.

2. Download the fixed bin folder for Windows from the following link:
Download Hadoop bin for Windows or

Link: https://drive.google.com/file/d/1nCN_jK7EJF2DmPUUxgOggnvJ6k6tksYz/view?pli=1

3. Extract and place the bin folder into C:\Hadoop_test\hadoop-3.3.6.

3.6 Fix winutils.exe Error (msvcr120.dll)

1. Try to run winutils.exe from:

C:\Hadoop_test\hadoop-3.3.6\bin\winutils.exe

2. If it shows an error like “msvcr120.dll missing”, download the DLL file from a trusted
site.

3. Copy the downloaded msvcr120.dll to:

C:\Windows\System32

4. Re-run winutils.exe – the error should be gone.

3.7 Install Microsoft C++ Redistributable

1. Visit the Microsoft VC++ download page:

https://learn.microsoft.com/en-us/cpp/windows/latest-supported-vc-redist?view=msvc-17
0

2. Download and install the x64 version.

3.8 Format the NameNode

1. Open Command Prompt.

2. Run the following command: hdfs namenode -format
3. You should see:
“Successfully formatted NameNode”
3.9 Start Hadoop Daemons
Navigate to:
C:\Hadoop_test\hadoop-3.3.6\sbin

1. Start HDFS:start-dfs.cmd

2. This will start the NameNode and DataNode.
Start YARN: start-yarn.cmd
3. This starts ResourceManager and NodeManager.

4. Access Web Interfaces

After starting the services, open your browser and check:

● NameNode UI → http://localhost:9870

● ResourceManager UI → http://localhost:8088

Fig 4.1 Localhost:8088

Fig 4.2 Localhost:9870

CCS334 BDA Lab Manual
No ratings yet
CCS334 BDA Lab Manual
35 pages
BDA Lab Manual R22
0% (1)
BDA Lab Manual R22
70 pages
Big Data & Analytics Lab Manual
No ratings yet
Big Data & Analytics Lab Manual
51 pages
Knowledge & Practical Interests PDF
100% (3)
Knowledge & Practical Interests PDF
204 pages
BDA Lab Manual by T.Naga Praveena
No ratings yet
BDA Lab Manual by T.Naga Praveena
40 pages
Bda Manual
No ratings yet
Bda Manual
80 pages
Hadoop Installation Steps
No ratings yet
Hadoop Installation Steps
16 pages
Install and Run Hadoop On Windows
No ratings yet
Install and Run Hadoop On Windows
29 pages
Processing - Options - For - Gold-Tellurides VIE 21 JUL 2017
No ratings yet
Processing - Options - For - Gold-Tellurides VIE 21 JUL 2017
9 pages
Spacer Fabric
100% (1)
Spacer Fabric
6 pages
Big Data Manual
No ratings yet
Big Data Manual
19 pages
HADOOP PPT
No ratings yet
HADOOP PPT
21 pages
Hadoop 3 Installation
No ratings yet
Hadoop 3 Installation
10 pages
How To Install Hadoop in Windows 10 & 11 - Hadoop Installation
No ratings yet
How To Install Hadoop in Windows 10 & 11 - Hadoop Installation
9 pages
Network Communication Types: by Ahmed El Hefny
100% (1)
Network Communication Types: by Ahmed El Hefny
15 pages
NICU Discharge Plan
No ratings yet
NICU Discharge Plan
58 pages
Script For Turn-Over and Installation Ceremonies
100% (15)
Script For Turn-Over and Installation Ceremonies
3 pages
Bda Record
No ratings yet
Bda Record
83 pages
English Paper 1 2025
No ratings yet
English Paper 1 2025
143 pages
Practical N0.2 AIM: Install Hadoop Hadoop Installation On Windows 10
No ratings yet
Practical N0.2 AIM: Install Hadoop Hadoop Installation On Windows 10
12 pages
New Bda Manual
No ratings yet
New Bda Manual
80 pages
Unit 1 Bdhall
No ratings yet
Unit 1 Bdhall
66 pages
Interview OFW
No ratings yet
Interview OFW
4 pages
Hadoopfile PP
No ratings yet
Hadoopfile PP
83 pages
User's Manual of Haiwell D Series IoT Cloud HMI
No ratings yet
User's Manual of Haiwell D Series IoT Cloud HMI
12 pages
Hadoop Installation For Windows
No ratings yet
Hadoop Installation For Windows
10 pages
Bda Lab Record
No ratings yet
Bda Lab Record
60 pages
CCS334-BDA LAB MANUAL Final
No ratings yet
CCS334-BDA LAB MANUAL Final
46 pages
Bigdatamanual
No ratings yet
Bigdatamanual
45 pages
Beeswax
100% (1)
Beeswax
4 pages
Hadoop Record 2024-Final
No ratings yet
Hadoop Record 2024-Final
59 pages
BIGDATA LAB MANUAL
No ratings yet
BIGDATA LAB MANUAL
27 pages
Experiment: - 1: Aim: Installing Hadoop, Configure HDFS, Configuring Hadoop
No ratings yet
Experiment: - 1: Aim: Installing Hadoop, Configure HDFS, Configuring Hadoop
67 pages
Step 1: Download Binary Package
No ratings yet
Step 1: Download Binary Package
50 pages
Bda 1
No ratings yet
Bda 1
54 pages
CS3481 DBMS
No ratings yet
CS3481 DBMS
55 pages
Final Copy - BDA LAB Record
No ratings yet
Final Copy - BDA LAB Record
44 pages
Anushka Shetty 35
No ratings yet
Anushka Shetty 35
34 pages
Big Data Journal
No ratings yet
Big Data Journal
50 pages
University of California, Los Angeles: UNDERGRADUATE Student Copy Transcript Report
No ratings yet
University of California, Los Angeles: UNDERGRADUATE Student Copy Transcript Report
4 pages
Bda Manual
No ratings yet
Bda Manual
33 pages
Hadoop 1
No ratings yet
Hadoop 1
39 pages
Lab Manual
No ratings yet
Lab Manual
34 pages
Big Data Lab Record
No ratings yet
Big Data Lab Record
30 pages
Big Data
No ratings yet
Big Data
28 pages
CHE134P FINAL EXAM 2013 14 4t
No ratings yet
CHE134P FINAL EXAM 2013 14 4t
10 pages
BIG Data File
No ratings yet
BIG Data File
28 pages
Big Data Manual Ai
No ratings yet
Big Data Manual Ai
33 pages
EX1-Installation of Hadoop
No ratings yet
EX1-Installation of Hadoop
6 pages
Forest Managemnet Assignment
No ratings yet
Forest Managemnet Assignment
3 pages
Big Data
No ratings yet
Big Data
32 pages
BDH Lab Manual FINAL (Hadoop)
No ratings yet
BDH Lab Manual FINAL (Hadoop)
29 pages
BDA Lab Manual
No ratings yet
BDA Lab Manual
26 pages
Null 001.2015.issue 273 en
No ratings yet
Null 001.2015.issue 273 en
26 pages
Sqoop Tutorial: Sqoop: "SQL To Hadoop and Hadoop To SQL"
No ratings yet
Sqoop Tutorial: Sqoop: "SQL To Hadoop and Hadoop To SQL"
11 pages
HDFS Installation Steps
No ratings yet
HDFS Installation Steps
17 pages
Analysis of Segment Reporting With Reference To Selected Software Companies
No ratings yet
Analysis of Segment Reporting With Reference To Selected Software Companies
18 pages
Hadoop Installation Process
No ratings yet
Hadoop Installation Process
16 pages
Program
No ratings yet
Program
25 pages
2 - Installation
No ratings yet
2 - Installation
15 pages
Hive INstallation
No ratings yet
Hive INstallation
13 pages
Hadoop On Windows
No ratings yet
Hadoop On Windows
13 pages
Hbase Installationn
No ratings yet
Hbase Installationn
12 pages
Meditation 5
No ratings yet
Meditation 5
12 pages
Amc Engineering College: Dept. of Computer Science and Engineering
No ratings yet
Amc Engineering College: Dept. of Computer Science and Engineering
6 pages
Hadoop Installation Guide
No ratings yet
Hadoop Installation Guide
10 pages
Hadoop Installation
No ratings yet
Hadoop Installation
6 pages
BDA1
No ratings yet
BDA1
7 pages
Database Development Life Cycle: Provided by Research Papers in Economics
No ratings yet
Database Development Life Cycle: Provided by Research Papers in Economics
10 pages
Data Analytics Lab
No ratings yet
Data Analytics Lab
9 pages
Hadoop 2.3 Installation For Windows
No ratings yet
Hadoop 2.3 Installation For Windows
6 pages
CS3481 DBMS-28-36
No ratings yet
CS3481 DBMS-28-36
9 pages
Accudemia For Tutors FA24
No ratings yet
Accudemia For Tutors FA24
7 pages
Jasmine B Resume Revised
No ratings yet
Jasmine B Resume Revised
2 pages
Different Types of Water According To USP
No ratings yet
Different Types of Water According To USP
9 pages
Hadoop On Windows
No ratings yet
Hadoop On Windows
6 pages
CC EXP 8 VBHV
No ratings yet
CC EXP 8 VBHV
8 pages
Setup Hadoop On Windows 10 Machines
No ratings yet
Setup Hadoop On Windows 10 Machines
4 pages
1-1. Location: 1. Background To Nairobi City
No ratings yet
1-1. Location: 1. Background To Nairobi City
9 pages
Smart M Air Connection Manual
No ratings yet
Smart M Air Connection Manual
6 pages
Steps of Hadoop Installation
No ratings yet
Steps of Hadoop Installation
3 pages
Business Ethics Case Study PDF
No ratings yet
Business Ethics Case Study PDF
5 pages
Alkem Introduction
No ratings yet
Alkem Introduction
5 pages
Hadoop Installation Report
No ratings yet
Hadoop Installation Report
5 pages
React Problem Statements
No ratings yet
React Problem Statements
3 pages
PROPOSED - Date Sheet For Mid-Term Examination. March 2024
No ratings yet
PROPOSED - Date Sheet For Mid-Term Examination. March 2024
5 pages
Memorandums
No ratings yet
Memorandums
2 pages
Big Data 1
No ratings yet
Big Data 1
2 pages
B.A (Hons) XIX (B) Literary Theory (I) Sem-V (1293)
No ratings yet
B.A (Hons) XIX (B) Literary Theory (I) Sem-V (1293)
4 pages
Hadoop Installation Steps
No ratings yet
Hadoop Installation Steps
3 pages
64c641a1f3760 Ecostrategist Casestudy Wipro
No ratings yet
64c641a1f3760 Ecostrategist Casestudy Wipro
2 pages
Destruction of Microorganisms
No ratings yet
Destruction of Microorganisms
3 pages
Healthy Snacks Outline
No ratings yet
Healthy Snacks Outline
2 pages
Recommendation Forms
No ratings yet
Recommendation Forms
1 page
Quick Configuration of Openldap and Kerberos In Linux and Authenicating Linux to Active Directory
From Everand
Quick Configuration of Openldap and Kerberos In Linux and Authenicating Linux to Active Directory
Dr. Hidaia Mahmood Alassouli
No ratings yet

Hadoop Installation

Uploaded by

Hadoop Installation

Uploaded by

Installation Guide

1.1 Download Java:

I.​ Visit the official Oracle Java download page:​

III.​ Oracle requires you to sign in:​

A.​ If you already have an Oracle account, log in.​

B.​ If not, create a new account — it’s free.

1.2 Install and Organize Java:

1)​ Create a folder named java directly in your C drive:​

a)​ Set the path to: C:\java​

3)​ After installation:​

1.3 Set Environment Variables:

3.​ Under User Variables:​

○​ Click New and enter the following:​

■​ Variable Name: JAVA_HOME​

■​ Variable Value: C:\java\jdk1.8.0_351 (adjust based on your JDK folder

○​ Click Add and paste:​

5.​ Click OK to close all dialogs and save changes.​

Verify Java Installation

2. Install Hadoop (Version 3.3.6)

​ ​ ​ Fig:2.1 Installation of hadoop

2.2 Extract and Organize

2.3 Set Hadoop Environment Variable

3.​ Under User Variables:​

○​ Click New and enter the following:​

■​ Variable Name: HADOOP_HOME​

■​ Variable Value: C:\Hadoop_test\hadoop-3.3.6​

​ ​ Fig 2.3 Creating variable and value in user variable

5.​ Under System Variable:

Fig 2.4 Path in System variable

3.1 Update Java Path in Hadoop Configuration

​ ​ Fig 3.1 Java path in hadoop-env.cmd

3.2 Set Hadoop Environment Variables

○​ Click New → Name: HADOOP_HOME, Value: C:\Hadoop_test\hadoop-3.3.6​

3.​ Under System Variables:​

○​ Select Path → Click Edit → Click Add, and paste:​

3.3 Create Data Directories

Create folders to store NameNode and DataNode data:

1.​ Navigate to:​

2.​ Create a new folder: data​

3.​ Inside data, create two folders:​

So the full paths will be:

​ ​ Fig 3.2 Creation of datanode and namenode folder

3.4 Configure Hadoop XML Files

Go to C:\Hadoop_test\hadoop-3.3.6\etc\hadoop and update the following files:

Replace the content with:

If the file doesn’t exist, copy mapred-site.xml.template and rename it to mapred-site.xml.

Paste the following:

3.5 Fix Missing bin Files (winutils.exe)

1.​ Delete the existing bin folder inside C:\Hadoop_test\hadoop-3.3.6.​

3.​ Extract and place the bin folder into C:\Hadoop_test\hadoop-3.3.6.

1.​ Try to run winutils.exe from:​

3.​ Copy the downloaded msvcr120.dll to:​

4.​ Re-run winutils.exe – the error should be gone.​

3.7 Install Microsoft C++ Redistributable

1.​ Visit the Microsoft VC++ download page:​

2.​ Download and install the x64 version.​

3.8 Format the NameNode

1.​ Open Command Prompt.

1.​ Start HDFS:start-dfs.cmd

4. Access Web Interfaces

After starting the services, open your browser and check:

​ ​ ​ ​ ​ Fig 4.1 Localhost:8088

You might also like

I. Visit the official Oracle Java download page:

III. Oracle requires you to sign in:

A. If you already have an Oracle account, log in.

B. If not, create a new account — it’s free.

1) Create a folder named java directly in your C drive:

a) Set the path to: C:\java

3) After installation:

3. Under User Variables:

○ Click New and enter the following:

■ Variable Name: JAVA_HOME

■ Variable Value: C:\java\jdk1.8.0_351 (adjust based on your JDK folder

○ Click Add and paste:

5. Click OK to close all dialogs and save changes.

Fig:2.1 Installation of hadoop

3. Under User Variables:

○ Click New and enter the following:

■ Variable Name: HADOOP_HOME

■ Variable Value: C:\Hadoop_test\hadoop-3.3.6

Fig 2.3 Creating variable and value in user variable

5. Under System Variable:

Fig 3.1 Java path in hadoop-env.cmd

○ Click New → Name: HADOOP_HOME, Value: C:\Hadoop_test\hadoop-3.3.6

3. Under System Variables:

○ Select Path → Click Edit → Click Add, and paste:

1. Navigate to:

2. Create a new folder: data

3. Inside data, create two folders:

Fig 3.2 Creation of datanode and namenode folder

1. Delete the existing bin folder inside C:\Hadoop_test\hadoop-3.3.6.

3. Extract and place the bin folder into C:\Hadoop_test\hadoop-3.3.6.

1. Try to run winutils.exe from:

3. Copy the downloaded msvcr120.dll to:

4. Re-run winutils.exe – the error should be gone.

1. Visit the Microsoft VC++ download page:

2. Download and install the x64 version.

1. Open Command Prompt.

1. Start HDFS:start-dfs.cmd

Fig 4.1 Localhost:8088