0% found this document useful (0 votes)

5 views

Word Count Program

Uploaded by

harshith123cs

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views

Word Count Program

Uploaded by

harshith123cs

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

You are on page 1/ 3

1.

Map Reduce program to count the number of occurrences of each word in a

given input text.

driver.java
package wordcount;
import java.io. *;
import java.util.*;
import org.apache.hadoop.mapred.*;
import org.apache.hadoop.io.*;
import org.apache.hadoop.fs.Path;
public class driver
{
public static void main(String args[]) throws IOException
{
JobConf conf=new JobConf(driver.class);
conf.setMapperClass(mapper.class);
conf.setReducerClass(reducer.class);
conf.setOutputKeyClass(Text.class);
conf.setOutputValueClass(IntWritable.class);
FileInputFormat.addInputPath(conf, new Path(args[0]));
FileOutputFormat.setOutputPath(conf,new Path(args[1]));
JobClient.runJob(conf);
}
}

mapper.java
package wordcount;
import java.io.*;
import java.util.*;
import org.apache.hadoop.mapred.*;
import org.apache.hadoop.io.*;

public class mapper extends MapReduceBase implements Mapper<LongWritable, Text, Text,

IntWritable> {

// Static final variable for the count of 1

private final static IntWritable one = new IntWritable(1);

// Reusable Text object to hold each word

private Text word = new Text();

// The map function

public void map(LongWritable key, Text value, OutputCollector<Text, IntWritable> output,
Reporter reporter)
throws IOException {

// Convert the input value (line of text) to a string

String line = value.toString();

// Tokenize the line into words

StringTokenizer tokenizer = new StringTokenizer(line);
// Iterate through the tokens (words)
while (tokenizer.hasMoreTokens()) {
// Set the current word into the Text object
word.set(tokenizer.nextToken());

// Collect the word and emit (word, 1) as key-value pairs

output.collect(word, one);
}
}
}

reducer.java
package wordcount;
import java.io.*;
import java.util.*;
import org.apache.hadoop.mapred.*;
import org.apache.hadoop.io.*;

public class reducer extends MapReduceBase implements Reducer<Text, IntWritable, Text,

IntWritable> {

public void reduce(Text key, Iterator<IntWritable> values, OutputCollector<Text, IntWritable>

output,
Reporter reporter) throws IOException {

int sum = 0;

// Sum up the counts for each word

while (values.hasNext()) {
sum += values.next().get();
}

// Emit the word with the total count

output.collect(key, new IntWritable(sum));
}
}

Steps to run
1. Create a New File named Bash.sh
2. Copy the Below code and Paste inside Bash.sh and save that File.
export JAVA_HOME=$(readlink -f $(which javac) | awk 'BEGIN {FS="/bin"} {print $1}')
export PATH=$(echo $PATH):$(pwd)/bin
export CLASSPATH=$(hadoop classpath)
3. Execute the bash.sh File using following command source Bash.sh.
4. Verify JAVA_HOME variable to be set to Java Path and PATH variable has your USN
Hadoop Folder.
If any previous PATH set to Hadoop Folder remove that inside .bashrc file.
5. Verify Hadoop is Installed or not by executing hadoop command.if command gives
Information about
Hadoop command then Hadoop is Successfully Installed.
6. Create a folder word count and move to that folder.
7. Make the driver.java , mapper.java and reducer.java files.
8. Compile all java files (driver.java mapper.java reducer.java)
javac -d . *.java
9. Set driver class in manifest
echo Main-Class: wordcount.driver > Manifest.txt
10. Create an executable jar file
jar cfm wordcount.jar Manifest.txt word count/*.class
11. oe.txt is input file for Oddeven create Input File
echo “hello good morning, hello have a nice day” > input.txt
12. Run the jar file
hadoop jar wordcount.jar input.txt output
13. To see the Output
cat output/*

CPP Chapter 1 To 9 Assessment
100% (2)
CPP Chapter 1 To 9 Assessment
252 pages
G. Engebreth, S. Sahu - PHP 8 Basics. For Programming and Web Development (2023)
100% (2)
G. Engebreth, S. Sahu - PHP 8 Basics. For Programming and Web Development (2023)
335 pages
Experiment 6 BDA
No ratings yet
Experiment 6 BDA
4 pages
Experiment-4 BDA LAB
No ratings yet
Experiment-4 BDA LAB
7 pages
Run Wordcount
No ratings yet
Run Wordcount
3 pages
Part B Assignment - No - 1
No ratings yet
Part B Assignment - No - 1
6 pages
Ravikant_Hadoop_file
No ratings yet
Ravikant_Hadoop_file
22 pages
To Count Using Map and Reduce Program: Wordcount - Java
No ratings yet
To Count Using Map and Reduce Program: Wordcount - Java
2 pages
✅ PART 1- Install Java and Hadoop on Ubuntu
No ratings yet
✅ PART 1- Install Java and Hadoop on Ubuntu
4 pages
Word Count Program To Demonstrate The Use of Map and Reduce Tasks
No ratings yet
Word Count Program To Demonstrate The Use of Map and Reduce Tasks
5 pages
Wordcount
No ratings yet
Wordcount
3 pages
Practical 2c
No ratings yet
Practical 2c
2 pages
Practical 3bcbs
No ratings yet
Practical 3bcbs
5 pages
049
No ratings yet
049
2 pages
11. WordCountApp
No ratings yet
11. WordCountApp
2 pages
Exp 3-Word Count
No ratings yet
Exp 3-Word Count
4 pages
Codigo Haddop
No ratings yet
Codigo Haddop
3 pages
Source Code for Wordcount
No ratings yet
Source Code for Wordcount
3 pages
1WordCount
No ratings yet
1WordCount
2 pages
02-Wordcount Mapreduce
No ratings yet
02-Wordcount Mapreduce
5 pages
wc
No ratings yet
wc
13 pages
Steps to create jar file and execute word count problem in mapper reducer
No ratings yet
Steps to create jar file and execute word count problem in mapper reducer
5 pages
Word Count Program
No ratings yet
Word Count Program
2 pages
DSBDA GRP B Print
No ratings yet
DSBDA GRP B Print
21 pages
ExNo04
No ratings yet
ExNo04
4 pages
579 BDA Week-04
No ratings yet
579 BDA Week-04
1 page
Exp 4 Word Count
No ratings yet
Exp 4 Word Count
4 pages
WordCount Program Hadoop Task 2
No ratings yet
WordCount Program Hadoop Task 2
7 pages
BDC Output 3
No ratings yet
BDC Output 3
4 pages
Word Count Program With MapReduce and Java
No ratings yet
Word Count Program With MapReduce and Java
6 pages
ContarPalabras Java
No ratings yet
ContarPalabras Java
2 pages
Hadoop WordCount
No ratings yet
Hadoop WordCount
2 pages
Big Data Practical 2
No ratings yet
Big Data Practical 2
11 pages
BDA3
No ratings yet
BDA3
7 pages
Setting Up Eclipse:: Codelab 1 Introduction To The Hadoop Environment (Version 0.17.0)
No ratings yet
Setting Up Eclipse:: Codelab 1 Introduction To The Hadoop Environment (Version 0.17.0)
9 pages
ADA Lab Manual
No ratings yet
ADA Lab Manual
34 pages
Ravinder Big Data 4 PDF
No ratings yet
Ravinder Big Data 4 PDF
15 pages
Word Count Program With MapReduce and Java
No ratings yet
Word Count Program With MapReduce and Java
6 pages
Word Count Program With MapReduce and Java
No ratings yet
Word Count Program With MapReduce and Java
5 pages
Word Count Example
No ratings yet
Word Count Example
4 pages
BDA
No ratings yet
BDA
6 pages
3 MapReduce program ex code
No ratings yet
3 MapReduce program ex code
14 pages
DA Lab Program-2
No ratings yet
DA Lab Program-2
6 pages
CS246 TA Session: Hadoop Tutorial: Peyman Kazemian 1/11/2011
No ratings yet
CS246 TA Session: Hadoop Tutorial: Peyman Kazemian 1/11/2011
13 pages
Running Jar Program
No ratings yet
Running Jar Program
3 pages
Assignment 11 DSBDA
No ratings yet
Assignment 11 DSBDA
4 pages
CTBD Sol02
No ratings yet
CTBD Sol02
2 pages
Big Data 4 Vivek
No ratings yet
Big Data 4 Vivek
3 pages
Word_Count(2021)
No ratings yet
Word_Count(2021)
50 pages
Exp-11
No ratings yet
Exp-11
4 pages
Import Import Import Import Import Import Import Import Public Class Extends Implements
No ratings yet
Import Import Import Import Import Import Import Import Public Class Extends Implements
7 pages
DSBDA 11
No ratings yet
DSBDA 11
15 pages
Hadoop and Map Reduce
No ratings yet
Hadoop and Map Reduce
27 pages
Example - (Map Function in Word Count)
No ratings yet
Example - (Map Function in Word Count)
6 pages
Steps: /usr/lib/hadoop-0.20/ Usr/lib/hadoop-0.20/lib
No ratings yet
Steps: /usr/lib/hadoop-0.20/ Usr/lib/hadoop-0.20/lib
4 pages
MapReduce Word Count Example - Javatpoint
No ratings yet
MapReduce Word Count Example - Javatpoint
12 pages
Map Reduce Java Program
No ratings yet
Map Reduce Java Program
2 pages
Lab2 WC
No ratings yet
Lab2 WC
2 pages
6 - Simple Wordcount
No ratings yet
6 - Simple Wordcount
2 pages
Practical 2-1
No ratings yet
Practical 2-1
4 pages
50 Recipes for Programming Node.js
From Everand
50 Recipes for Programming Node.js
Jamie Munro
3/5 (4)
Core Java Programming Book
From Everand
Core Java Programming Book
Manish Soni
No ratings yet
Introduction To Compiler Design: B.Sc. (SE) - 3rd Year (Session-2017-18)
No ratings yet
Introduction To Compiler Design: B.Sc. (SE) - 3rd Year (Session-2017-18)
40 pages
Advantage of Functions in Python
No ratings yet
Advantage of Functions in Python
7 pages
IBM Knowledge Center - Charraymember
No ratings yet
IBM Knowledge Center - Charraymember
3 pages
7.5x-Input Validation Exercise
No ratings yet
7.5x-Input Validation Exercise
19 pages
Using The Policy Framework in Microsoft Dynamics AX 2012
No ratings yet
Using The Policy Framework in Microsoft Dynamics AX 2012
14 pages
Unit 1 - Introduction Notes
No ratings yet
Unit 1 - Introduction Notes
51 pages
Priority Queue, Comparator, Comparable - Notes
No ratings yet
Priority Queue, Comparator, Comparable - Notes
10 pages
Principles of Parallel Algorithm Design
No ratings yet
Principles of Parallel Algorithm Design
78 pages
Medruck Change
No ratings yet
Medruck Change
6 pages
Mesos Tech Report
No ratings yet
Mesos Tech Report
14 pages
Lecture 2 - Visualization and Programming in Matlab
No ratings yet
Lecture 2 - Visualization and Programming in Matlab
48 pages
Shell Sort
No ratings yet
Shell Sort
17 pages
Try To Use Constant Variables
No ratings yet
Try To Use Constant Variables
3 pages
Lesson 07 Data Manipulation With Pandas
No ratings yet
Lesson 07 Data Manipulation With Pandas
82 pages
Insertion Sort Vs Merge Sort in Matlab
No ratings yet
Insertion Sort Vs Merge Sort in Matlab
4 pages
Data Structure KCS301
No ratings yet
Data Structure KCS301
2 pages
Practical No 29
No ratings yet
Practical No 29
2 pages
SPPU High Performance Computing
No ratings yet
SPPU High Performance Computing
12 pages
Package XLSX': R Topics Documented
No ratings yet
Package XLSX': R Topics Documented
45 pages
Technic For Faster PL SQL
100% (2)
Technic For Faster PL SQL
45 pages
Course Outline PF 2021
No ratings yet
Course Outline PF 2021
4 pages
DMS K Scheme report
No ratings yet
DMS K Scheme report
8 pages
### Backend (Golang) .Zip
No ratings yet
### Backend (Golang) .Zip
3 pages
CS6456 Oop Rejinpaul ND15 PDF
No ratings yet
CS6456 Oop Rejinpaul ND15 PDF
4 pages
B.Sc. (Computer Science)
No ratings yet
B.Sc. (Computer Science)
168 pages
Abdul Nayeem - Java Groovy Grails
No ratings yet
Abdul Nayeem - Java Groovy Grails
6 pages
Youjun 04
No ratings yet
Youjun 04
100 pages
Chapter 3-Memory Management
No ratings yet
Chapter 3-Memory Management
40 pages

Word Count Program

Uploaded by

Word Count Program

Uploaded by

1.

Map Reduce program to count the number of occurrences of each word in a

public class mapper extends MapReduceBase implements Mapper<LongWritable, Text, Text,

// Static final variable for the count of 1

// Reusable Text object to hold each word

// The map function

// Convert the input value (line of text) to a string

// Tokenize the line into words

// Collect the word and emit (word, 1) as key-value pairs

public class reducer extends MapReduceBase implements Reducer<Text, IntWritable, Text,

public void reduce(Text key, Iterator<IntWritable> values, OutputCollector<Text, IntWritable>

// Sum up the counts for each word

// Emit the word with the total count

You might also like