
Steps to create a jar file and execute the word count problem with Mapper and Reducer

1. First open Eclipse -> select File -> New -> Java Project -> name it WordCount -> then
Finish.

2. Create three Java classes in the project. Name them WCDriver (having the main
function), WCMapper, and WCReducer.

3. You have to add the Hadoop reference libraries to the project:

Right-click on the project -> select Build Path -> click Configure Build Path. You can see
the Add External JARs option on the right-hand side.
3.1 Go to C:\hadoop-3.3.6\share\hadoop\common and select all the jar files listed in this folder.
3.2 Go to C:\hadoop-3.3.6\share\hadoop\mapreduce and select all the jar files listed in this folder.
3.3 Click Apply.

4. Create a class file named WCMapper in the WordCount project.

Mapper code: copy and paste this program into the WCMapper Java class file.

// Importing libraries
import java.io.IOException;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapred.MapReduceBase;
import org.apache.hadoop.mapred.Mapper;
import org.apache.hadoop.mapred.OutputCollector;
import org.apache.hadoop.mapred.Reporter;

public class WCMapper extends MapReduceBase implements Mapper<LongWritable, Text, Text, IntWritable> {

    // Map function: emit (word, 1) for every word in the line
    public void map(LongWritable key, Text value,
                    OutputCollector<Text, IntWritable> output,
                    Reporter rep) throws IOException
    {
        String line = value.toString();

        // Splitting the line on spaces
        for (String word : line.split(" "))
        {
            if (word.length() > 0)
            {
                output.collect(new Text(word), new IntWritable(1));
            }
        }
    }
}
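Stripped of the Hadoop types, the map step above is plain tokenization. The following standalone sketch (plain Java, no Hadoop jars needed; the class name MapSketch and the "(word, 1)" string form are illustrative only) mimics what WCMapper emits for one input line:

```java
import java.util.ArrayList;
import java.util.List;

public class MapSketch {
    // Mimics WCMapper.map(): emit a (word, 1) pair for every non-empty token
    static List<String> map(String line) {
        List<String> pairs = new ArrayList<>();
        for (String word : line.split(" ")) {
            if (word.length() > 0) {
                pairs.add("(" + word + ", 1)");
            }
        }
        return pairs;
    }

    public static void main(String[] args) {
        // prints [(hello, 1), (hadoop, 1), (hello, 1)]
        System.out.println(map("hello hadoop hello"));
    }
}
```

Note that the mapper does not count anything itself; duplicate keys like "hello" are emitted separately, and the framework groups them before the reducer runs.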

5. Reducer code: copy and paste this program into the WCReducer Java class file.
// Importing libraries
import java.io.IOException;
import java.util.Iterator;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapred.MapReduceBase;
import org.apache.hadoop.mapred.OutputCollector;
import org.apache.hadoop.mapred.Reducer;
import org.apache.hadoop.mapred.Reporter;

public class WCReducer extends MapReduceBase implements Reducer<Text, IntWritable, Text, IntWritable> {

    // Reduce function: sum the counts collected for each word
    public void reduce(Text key, Iterator<IntWritable> value,
                       OutputCollector<Text, IntWritable> output,
                       Reporter rep) throws IOException
    {
        int count = 0;

        // Counting the frequency of each word
        while (value.hasNext())
        {
            IntWritable i = value.next();
            count += i.get();
        }

        output.collect(key, new IntWritable(count));
    }
}
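The reduce step is just a sum over the grouped values. Here is a minimal standalone sketch (plain Java, no Hadoop jars; the class name ReduceSketch is illustrative) of what WCReducer computes for a single key after the shuffle phase has grouped its values:

```java
import java.util.Iterator;
import java.util.List;

public class ReduceSketch {
    // Mimics WCReducer.reduce(): sum the 1s collected for one key
    static int reduce(Iterator<Integer> values) {
        int count = 0;
        while (values.hasNext()) {
            count += values.next();
        }
        return count;
    }

    public static void main(String[] args) {
        // After the shuffle, "hello" arrives with all of its 1s grouped together
        System.out.println(reduce(List.of(1, 1, 1).iterator()));  // prints 3
    }
}
```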

6. Driver code: copy and paste this program into the WCDriver Java class file.
// Importing libraries
import java.io.IOException;
import org.apache.hadoop.conf.Configured;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapred.FileInputFormat;
import org.apache.hadoop.mapred.FileOutputFormat;
import org.apache.hadoop.mapred.JobClient;
import org.apache.hadoop.mapred.JobConf;
import org.apache.hadoop.util.Tool;
import org.apache.hadoop.util.ToolRunner;

public class WCDriver extends Configured implements Tool {

    public int run(String args[]) throws IOException
    {
        if (args.length < 2)
        {
            System.out.println("Please give valid inputs");
            return -1;
        }

        JobConf conf = new JobConf(WCDriver.class);

        // ToolRunner strips the generic options, so the input and output
        // paths arrive here as args[0] and args[1]
        FileInputFormat.setInputPaths(conf, new Path(args[0]));
        FileOutputFormat.setOutputPath(conf, new Path(args[1]));
        conf.setMapperClass(WCMapper.class);
        conf.setReducerClass(WCReducer.class);
        conf.setMapOutputKeyClass(Text.class);
        conf.setMapOutputValueClass(IntWritable.class);
        conf.setOutputKeyClass(Text.class);
        conf.setOutputValueClass(IntWritable.class);
        JobClient.runJob(conf);
        return 0;
    }

    // Main method
    public static void main(String args[]) throws Exception
    {
        int exitCode = ToolRunner.run(new WCDriver(), args);
        System.exit(exitCode);
    }
}
7. Now you have to make a jar file:
Right-click on the project -> click Export -> select Jar File as the export destination -> name the jar
file (WordCount.jar) -> click Next -> at last click Finish. Now copy this file into
C:/hadoop-3.3.6/share/hadoop/mapreduce/

8. Create a txt file named test.txt containing some repeated words.
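For example, a small file with repeated words can be created from a Unix-style shell (such as Git Bash on Windows); on plain cmd you could instead type the same lines into Notepad. The file contents below are just an example:

```shell
printf "hello hadoop\nhello mapreduce\nhello world\n" > test.txt
cat test.txt
```

With this input, the finished job should report "hello" three times and each of the other words once.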

9. Copy that data file into the input directory:

C:\hadoop-3.3.6\sbin>hadoop fs -put C:/Users/IIITK/Documents/files/test.txt /input3

10. List the contents of HDFS:

C:\hadoop-3.3.6\sbin>hadoop fs -ls /input3/

11. Display the contents of the test.txt file:

C:\hadoop-3.3.6\sbin>hadoop fs -cat /input3/test.txt

12. Run the WordCount.jar file saved in the shared directory of Hadoop:
C:\hadoop-3.3.6\sbin>hadoop jar C:/hadoop-3.3.6/share/hadoop/mapreduce/WordCount.jar WCDriver /input3 /output3

13. Display the output stored in the /output3 directory:

14. C:\hadoop-3.3.6\sbin>hadoop fs -cat /output3/*

15. We can also see the output in the browser:

Open localhost:9870 and go to Utilities -> Browse the file system.
