
Execute a Java MapReduce sample using Eclipse


To create a new Eclipse project:

Create a new Java project in Eclipse.

Select “Java Project” and click the “Next” button.

Enter a project name and click the “Finish” button.

A new Java project will be created in Eclipse.

To create a new Java class:

Once the project is created, add a class file to the project by right-clicking “src” in the project and selecting “New” > “Class” from the menu.

Type the class name and click the “Finish” button.

This creates a class under the “src” folder. Add your code to the created class.
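As a reference point, a minimal skeleton of the new class might look like the following (the class name WordCount matches the driver code later in this article; everything in the sections below goes inside it):

public class WordCount {
    // The MapClass, Reduce, and main() shown in the sections below go here.
}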

To add the dependencies to the project:

Add the required dependencies to the project by right-clicking the project and selecting Build Path > Configure Build Path.

Click “Add External JARs”, browse to the required JAR files, and add them.

Add the following dependencies to build the project:

a. hadoop-common-*.*.*.jar

b. hadoop-mapreduce-client-core-*.*.*.jar

To create a MapReduce Java program:

A MapReduce program contains the map and reduce algorithms in the Mapper and Reducer classes respectively. Brief details about the two classes follow.

Mapper Class:

A mapper’s main job is to produce a list of key/value pairs to be processed later.
A mapper receives a key/value pair as parameters and produces a list of new key/value pairs.

For Example:
For each token in the input line, the mapper emits a key/value pair of the token and the count 1.
Input: (aaa bbb ccc aaa)
Output: List((aaa,1), (bbb,1), (ccc,1), (aaa,1))

Code:

public static class MapClass extends Mapper<LongWritable, Text, Text, IntWritable> {

    private final static IntWritable one = new IntWritable(1);
    private Text word = new Text();

    @Override
    public void map(LongWritable key, Text value, Context context)
            throws IOException, InterruptedException {
        // Tokenize the input line and emit (word, 1) for each token.
        String line = value.toString();
        StringTokenizer itr = new StringTokenizer(line);
        while (itr.hasMoreTokens()) {
            word.set(itr.nextToken());
            context.write(word, one);
        }
    }
}

Shuffle Phase:

After the mapper and before the reducer, the shuffle and grouping phases take place. The shuffle phase ensures that every key/value pair with the same key goes to the same reducer; the grouping step then collects all values of the same key into a (key, list(values)) pair, which is what the reducer ultimately receives. An optional combiner can also pre-aggregate map output at this stage, as sketched below.
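A minimal sketch of enabling a combiner, assuming the word-count Reduce class below is reused as the combiner (a common choice here because summing counts is associative and commutative):

// In the driver (main method): run the reducer locally on each mapper's
// output to shrink the data sent across the network during the shuffle.
job.setCombinerClass(Reduce.class);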

Reducer Class:

The reducer’s job is to take the (key, list(values)) pair, operate on the grouped values, and store the result somewhere. Here it loops
through the values, sums them, and sends the new (key, total) pair to the output.

For Example:
Input: [(aaa,List(1,1)),(bbb,List(1)),(ccc,List(1))]
Output: [(aaa,2),(bbb,1),(ccc,1)]

Code:
public static class Reduce extends Reducer<Text, IntWritable, Text, IntWritable> {

    @Override
    public void reduce(Text key, Iterable<IntWritable> values, Context context)
            throws IOException, InterruptedException {
        // Sum all the counts collected for this key and emit the total.
        int sum = 0;
        for (IntWritable value : values) {
            sum += value.get();
        }
        context.write(key, new IntWritable(sum));
    }
}
Main Method:

Create an instance of the Job class in the main() method, set the Mapper and Reducer classes on it, and execute the program.

Code:

public static void main(String[] args) throws Exception {

    String[] arguments = new String[2];

    // For a remote cluster, set remote host_name:port instead of localhost:9000
    arguments[0] = "hdfs://localhost:9000/Data/WarPeace.txt"; // Input HDFS file
    arguments[1] = "hdfs://localhost:9000/OutPut";            // Output directory (must not already exist)

    Configuration conf = new Configuration();
    Job job = Job.getInstance(conf, "WordCount"); // new Job(conf, ...) is deprecated
    job.setJarByClass(WordCount.class);
    job.setMapperClass(MapClass.class);
    job.setReducerClass(Reduce.class);
    job.setOutputKeyClass(Text.class);
    job.setOutputValueClass(IntWritable.class);
    job.setInputFormatClass(TextInputFormat.class);
    job.setOutputFormatClass(TextOutputFormat.class);
    FileInputFormat.addInputPath(job, new Path(arguments[0]));
    FileOutputFormat.setOutputPath(job, new Path(arguments[1]));

    // Submit the job and wait for it to finish; this must come after all configuration.
    System.exit(job.waitForCompletion(true) ? 0 : 1);
}
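For reference, these are the imports the complete WordCount class needs in order to compile against the new org.apache.hadoop.mapreduce API used above:

import java.io.IOException;
import java.util.StringTokenizer;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.input.TextInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
import org.apache.hadoop.mapreduce.lib.output.TextOutputFormat;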

Required Dependencies to execute the project:

1. The JAR files under the following folders are required to execute a MapReduce Java program:

a. HADOOP_HOME\share\hadoop\common

b. HADOOP_HOME\share\hadoop\common\lib

c. HADOOP_HOME\share\hadoop\hdfs

d. HADOOP_HOME\share\hadoop\yarn

e. HADOOP_HOME\share\hadoop\mapreduce

To Run the Project:

Once the build is successful, execute the project by right-clicking the class and selecting Run As > Run Configurations... from the menu.

Double-click “Java Application” in the window that opens.

Navigate to the “Arguments” tab and add the arguments in the space provided if your MapReduce program has to read arguments at runtime (a sketch of the matching code change follows these steps).

Navigate to the “Classpath” tab, select “User Entries”, click “Add External JARs”, and add the dependencies.
After adding the dependencies, click the “Advanced” button, select “Add External Folders”, and click the “OK” button.

Select the Hadoop configuration file directory and click the “OK” button.

Navigate to the “Environment” tab, set “HADOOP_HOME”, click “Apply”, and then click “Run”.
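If the input and output paths should come from the “Arguments” tab rather than being hard-coded, a minimal sketch of the corresponding change at the top of main() (keeping the hard-coded values as fallback defaults is an assumption for illustration):

// Prefer the program arguments from the run configuration when provided.
if (args.length == 2) {
    arguments[0] = args[0];
    arguments[1] = args[1];
}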

When execution completes, the MapReduce job logs appear in the Eclipse console.
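To check the result, the job's output can be read back from HDFS in a small standalone class. A minimal sketch, assuming a single reducer (so the output lands in part-r-00000) and the localhost:9000 namenode used above; the class name OutputReader is hypothetical:

import java.net.URI;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataInputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IOUtils;

public class OutputReader {
    public static void main(String[] args) throws Exception {
        // Read the word-count results back from HDFS and print them.
        Configuration conf = new Configuration();
        FileSystem fs = FileSystem.get(URI.create("hdfs://localhost:9000"), conf);
        try (FSDataInputStream in = fs.open(new Path("/OutPut/part-r-00000"))) {
            IOUtils.copyBytes(in, System.out, 4096, false);
        }
    }
}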
