BDM Lab Manual 2

This document provides instructions for loading and executing sample WordCount MapReduce code in both Hadoop and IntelliJ IDE. It describes how to create a JAR file of Java code in IntelliJ, upload it to a Hadoop VM and execute it. It also explains how to view job status and logs through the Hadoop GUI and command line.


MANUAL for BIG DATA MODELLING LABORATORY – 2

Ex.No. 6
Load and Execute Wordcount MapReduce Java code in Hadoop
Load and Execute Wordcount MapReduce Python code in Hadoop

Sample Steps to Load and Execute Wordcount MapReduce code in Hadoop


To execute the example MapReduce code from the repository,
$ hadoop jar /home/hadoop/install/hadoop/hadoop-2.6.5/share/hadoop/mapreduce/hadoop-mapreduce-examples-2.6.5.jar wordcount /dir2/test2.txt /output

where wordcount is the MapReduce program, which runs on the test2.txt file and generates the
output file inside the /output folder.

To view the output,


$ hadoop fs -cat /output/part-r-00000 

Note: On a subsequent run of the program, Hadoop cannot create a new output file inside the
same existing /output folder. So either delete the existing output folder before running the
code again, or choose some other folder as the output folder.
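What the wordcount job computes can be sketched in plain Java. The following is a simplified, single-process illustration of the map and reduce phases (it does not use the Hadoop API; the class and method names are illustrative only):

```java
import java.util.*;

// Simplified, single-process sketch of the wordcount job.
// "map" emits (word, 1) pairs; "reduce" sums the counts per word.
public class WordCountSketch {
    // Map phase: emit (word, 1) for every token in every input line.
    static List<Map.Entry<String, Integer>> map(List<String> lines) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String line : lines)
            for (String token : line.trim().split("\\s+"))
                if (!token.isEmpty())
                    pairs.add(Map.entry(token, 1));
        return pairs;
    }

    // Reduce phase: sum the counts of each distinct word.
    static Map<String, Integer> reduce(List<Map.Entry<String, Integer>> pairs) {
        Map<String, Integer> counts = new TreeMap<>();
        for (Map.Entry<String, Integer> p : pairs)
            counts.merge(p.getKey(), p.getValue(), Integer::sum);
        return counts;
    }

    public static void main(String[] args) {
        List<String> input = List.of("big data big cluster", "data lab");
        Map<String, Integer> counts = reduce(map(input));
        // Same "word<TAB>count" layout that the real job writes to part-r-00000.
        counts.forEach((w, c) -> System.out.println(w + "\t" + c));
    }
}
```

In the real job, Hadoop runs the map phase in parallel over HDFS blocks and shuffles the (word, 1) pairs to the reducers; this sketch only mirrors the logic in one process.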

Ex.No. 7
Load and execute existing WordCount MapReduce code in IntelliJ

i) Download the following files


WordCount.java, WordCountMapper.java, and WordCountReducer.java files
Hadoop jar files

ii) Create a new project in IntelliJ Idea software


File Menu > New > Project > Java > Click on Next Button > Type Project name >
Finish > Ok

iii) Load the word count files into the current project
Copy all the three word count Java code files from its folder
Paste it inside the src folder of Project Explorer window in IntelliJ

The file WordCount.java contains the main() method, which can be executed by
choosing the menu option Run or the "Run WordCount.main()" option from the
right-click menu. At this point, however, it fails, because the supporting Hadoop
libraries are not available in IntelliJ by default.

iv) Add the Hadoop libraries into the current project


File Menu > Project Structure > Libraries > + > Java Libraries > Choose the
folder which contains Hadoop Jar files > Select all Jar files from the list > Ok >
Add it into the current project > Ok

v) Now select the menu option Run or the "Run WordCount.main()" option from the
right-click menu to execute the code.
Even on successful execution it shows no output, since the program expects the
input and output paths as command-line arguments.
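The argument handling that causes this can be sketched as follows (a hypothetical helper, not the actual lab code; a real WordCount driver would go on to configure and submit a Hadoop Job with these paths):

```java
// Sketch of the argument validation a WordCount driver typically performs.
// Without the two path arguments, the program stops before producing output.
public class ArgsCheck {
    static String validate(String[] args) {
        if (args.length != 2)
            return "Usage: WordCount <input path> <output path>";
        return "input=" + args[0] + " output=" + args[1];
    }

    public static void main(String[] args) {
        System.out.println(validate(args));
    }
}
```

This is why the IntelliJ run shows nothing useful: a default run configuration passes no program arguments, so the driver exits at this check.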

Ex.No.8
Create the Jar file of your sample Java code in IntelliJ, upload it into the VM and
execute that code in Hadoop

i) To create the Jar file, first tell IntelliJ where the Jar file should be created.

In IntelliJ, select the following options in the menu


Menu - File > Project Structure > Artifacts > + > Jar > From modules with
dependencies > select the Java file which contains main() method > Ok

It creates a META-INF folder inside the src folder (see the Project Explorer window)

Project Explorer > src folder > META-INF folder > manifest.mf file

ii) To create the Jar file,

Menu - Build > Build Artifact > Project Name


List of Actions > Select Build to create the Jar file

It creates the folder "out" at the same level as the "src" folder.

out > artifacts > Project-name_jar > Project-name.jar
> Production (shown in orange color)

iii) If any further modifications are made to the code, rebuild the artifact so that the
changes are reflected in the .jar file as well.

iv) To load the Jar file into the VM,


Open MobaXTerm > Start the session with VM
Sftp > Upload > Browse and get project-name.jar file

Alternatively, create a new folder in the VM's file system and then upload the .jar file into it.

v) To start Hadoop daemons,


$ start-all.sh 

Verify it by using,
$ jps 
... Jps
... ResourceManager
... NameNode
... SecondaryNameNode

vi) To run the .jar file in VM


$ hadoop jar project-name.jar 

vii) To run the WordCount.jar file in VM, upload the text file into HDFS
$ vi test.txt 
$ hadoop fs -put test.txt /test.txt 

Then run the WordCount.jar file,


$ hadoop jar WordCount.jar /test.txt /output 
where /test.txt is the input HDFS file and /output is the HDFS folder in which the output
file is created. The name of the output file is part-r-00000 on the very first run.

To view the output,


$ hadoop fs -cat /output/part-r-00000 
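Each line of part-r-00000 holds a word and its count separated by a tab. A small plain-Java sketch of parsing that format (a hypothetical helper for illustration, not part of the lab code):

```java
import java.util.*;

// Parse "word<TAB>count" lines, the format the wordcount reducer writes
// into part-r-00000.
public class OutputParser {
    static Map<String, Integer> parse(List<String> lines) {
        Map<String, Integer> counts = new LinkedHashMap<>();
        for (String line : lines) {
            String[] parts = line.split("\t");   // [0] = word, [1] = count
            counts.put(parts[0], Integer.parseInt(parts[1]));
        }
        return counts;
    }

    public static void main(String[] args) {
        List<String> sample = List.of("hadoop\t3", "hello\t1");
        System.out.println(parse(sample)); // prints {hadoop=3, hello=1}
    }
}
```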

viii) If the text file (the input file of the word count problem) exists in the Host OS file
system (i.e., Windows) of some other system in the same network, you can bring it into
the Guest OS file system in either of two ways in MobaXterm.

a) Upload the file


b) scp -v test.txt username@ipaddress-of-Hadoop-VM:. 

Alternatively, you can e-mail the file to yourself and download it into your VM.

ix) Check the daemons which are created during the execution of the MapReduce code,
During the execution of MapReduce WordCount program in the current terminal
window, open a new terminal window, and then give the command jps in that new
terminal. It will list out the following daemons.
... Jps
... ResourceManager
... NameNode
... SecondaryNameNode
... NodeManager
... MRAppMaster
... YarnChild
... YarnChild

The MRAppMaster daemon monitors and manages the currently running job in the
cluster. The YarnChild daemons host the Mapper and Reducer tasks.

x) View the status in GUI


Open a web browser (eg. FireFox) in VM
Give the web address as localhost:50070
Select Utilities > Browse the file system > Enter / > It will list all the files in the master machine

xi) View the number of blocks required to handle the input file of the Word Count Map
Reduce program.

Browser window > Double-click the test.txt file > Select Block >
Block 0
Slave 1
Slave 2
Details like these are shown if it is a multi-node cluster configuration.

xii) View the Job History in GUI


To start the job history,
$ mr-jobhistory-daemon.sh start historyserver 

In browser window:
localhost:19888/jobhistory 

It shows the log information of both Mapper code as well as the Reducer code.

xiii) View specific job details


localhost:19888
Click on specific job > Select Mapper > Select Log
