Department of Computer Science & Engineering
AIM:
Install Hadoop single node cluster and run applications like word count.
OBJECTIVE:
To install and test a Hadoop single node cluster.
PROCEDURE:
1) Install the JDK on your Ubuntu VM using:
Command: sudo apt install openjdk-11-jdk
4) Add the Hadoop and Java paths to the bash configuration file (.bashrc). Open the file with "nano .bashrc" and add the Hadoop and Java paths as shown below.
Fig 1
Then, save the config file and close it. To apply these changes in the current terminal, run the source command on .bashrc.
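As a sketch, the .bashrc additions would look like the lines below. The JDK path and Hadoop directory are assumptions (OpenJDK 11 in the default Ubuntu location, Hadoop extracted into the home directory) and must match your own installation.

```shell
# Assumed locations - adjust to where the JDK and Hadoop actually live
export JAVA_HOME=/usr/lib/jvm/java-11-openjdk-amd64
export HADOOP_HOME=$HOME/hadoop-3.4.0
export PATH=$PATH:$JAVA_HOME/bin:$HADOOP_HOME/bin:$HADOOP_HOME/sbin
```

After saving, "Command: source ~/.bashrc" applies the changes to the current terminal.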
Fig 2
Fig 3
7) Open core-site.xml and edit the property mentioned below inside the configuration tag:
core-site.xml tells the Hadoop daemons where the NameNode runs in the cluster. It contains configuration settings for the Hadoop core, such as I/O settings common to HDFS and MapReduce.
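For reference, a typical single-node core-site.xml configuration block looks like the following; the NameNode address hdfs://localhost:9000 is the conventional single-node value, an assumption rather than a requirement:

```xml
<configuration>
<property>
<name>fs.defaultFS</name>
<value>hdfs://localhost:9000</value>
</property>
</configuration>
```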
8) Edit hdfs-site.xml and edit the property mentioned below inside the configuration tag:
hdfs-site.xml contains the configuration settings of the HDFS daemons (NameNode, DataNode, Secondary NameNode). It also sets the replication factor and block size of HDFS.
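A minimal hdfs-site.xml for a single node could set the replication factor to 1, since there is only one DataNode; this value is an assumption appropriate only for the single-node case:

```xml
<configuration>
<property>
<name>dfs.replication</name>
<value>1</value>
</property>
</configuration>
```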
9) Edit the mapred-site.xml file and edit the property mentioned below inside the configuration tag:
mapred-site.xml contains configuration settings for MapReduce applications, such as the number of JVMs that can run in parallel, the memory size of the mapper and reducer processes, and the CPU cores available to a process.
In some releases the mapred-site.xml file is not present; in that case, create it from the mapred-site.xml.template file.
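A minimal mapred-site.xml that tells MapReduce jobs to run on YARN looks like:

```xml
<configuration>
<property>
<name>mapreduce.framework.name</name>
<value>yarn</value>
</property>
</configuration>
```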
10) Edit yarn-site.xml and edit the property mentioned below inside the configuration tag:
yarn-site.xml contains the configuration settings of the ResourceManager and NodeManager, such as application memory management sizes and the auxiliary services (for example, the MapReduce shuffle) that each NodeManager runs.
<?xml version="1.0"?>
<configuration>
<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle</value>
</property>
<property>
<name>yarn.nodemanager.aux-services.mapreduce_shuffle.class</name>
<value>org.apache.hadoop.mapred.ShuffleHandler</value>
</property>
</configuration>
11) Edit hadoop-env.sh and add the Java path as mentioned below:
hadoop-env.sh contains the environment variables used by the Hadoop scripts, such as the Java home path.
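Assuming the same OpenJDK 11 location used in .bashrc (an assumption; match your installed JDK), the line added to hadoop-env.sh would be:

```shell
# Assumed JDK location - must match the JAVA_HOME used in .bashrc
export JAVA_HOME=/usr/lib/jvm/java-11-openjdk-amd64
```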
12) Return to the home directory, move into the Hadoop directory, and format the NameNode.
Command: cd
Command: cd hadoop-3.4.0
Command: bin/hdfs namenode -format
This formats HDFS via the NameNode. The command is executed only once, before the cluster is started for the first time. Formatting the filesystem means initializing the directory specified by the dfs.namenode.name.dir property.
Never format an up-and-running Hadoop filesystem; you will lose all the data stored in HDFS.
13) Once the NameNode is formatted, go to the hadoop-3.4.0/sbin directory and start all the daemons.
Command: cd hadoop-3.4.0/sbin
You can either start all daemons with a single command or start them individually (./start-dfs.sh followed by ./start-yarn.sh).
Command: ./start-all.sh
14) To check that all the Hadoop services are up and running, run the below command.
Command: jps
Fig 4
15) If everything is done as mentioned above, open a web browser and go to http://localhost:8088/ to check the ResourceManager UI.
To check the NameNode UI, go to http://localhost:9870/ .
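16) To run the word count application named in the aim, the MapReduce examples jar bundled with Hadoop can be used. The commands below are a sketch to be run from the hadoop-3.4.0 directory; the HDFS paths and the input file name are illustrative assumptions.

```shell
# Create an HDFS input directory and copy a local text file into it
bin/hdfs dfs -mkdir -p /wordcount/input
bin/hdfs dfs -put ~/input.txt /wordcount/input
# Run the bundled word count example on the input directory
bin/hadoop jar share/hadoop/mapreduce/hadoop-mapreduce-examples-3.4.0.jar wordcount /wordcount/input /wordcount/output
# Print the word counts produced by the reducer
bin/hdfs dfs -cat /wordcount/output/part-r-00000
```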
Fig 5
Thus the Hadoop single node cluster was installed and a simple application was executed successfully. The word count application processes the input data and generates output; analysing that output reveals word frequencies, identifies common words, and gives insight into the dataset.