Exp 11

Department of Computer Engineering Subject: DSBDAL

----------------------------------------------------------------------------------------------------------------

Group B
Assignment No: 1
----------------------------------------------------------------------------------------------------------------
Theory:
● Steps to set up and run WordCount on Hadoop
● Java Code for word count
● Input File

Steps to set up and run WordCount (Hadoop is assumed to be already installed):


Step 1) Create a working directory: mkdir words

Step 2) Download hadoop-core-1.2.1.jar, which is used to compile and execute the MapReduce
program, from the following link:
http://mvnrepository.com/artifact/org.apache.hadoop/hadoop-core/1.2.1

Step 3) Put the downloaded jar file into the words folder.
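For example, if the jar was saved to the Downloads folder (a hypothetical location):

mv ~/Downloads/hadoop-core-1.2.1.jar ~/words/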

Step 4) Implement the WordCount.java program (full source given below) in the words folder.

Step 5) Create input1.txt in the home directory with some sample text.
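For example, the sample input listed at the end of this write-up can be created from the terminal (any text file will do):

cat > ~/input1.txt << EOF
Pune
Mumbai
Nashik
Pune
Nashik
Kolapur
EOF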

Step 6) Go to the words directory and compile the program (the classpath must include the downloaded Hadoop core jar):

javac -classpath /home/vijay/words/hadoop-core-1.2.1.jar /home/vijay/words/WordCount.java
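Because no -d option is given, javac writes the class files next to the source, so the words directory should now contain WordCount.class, WordCount$Map.class, and WordCount$Reduce.class (one file per nested class). This can be checked with:

ls /home/vijay/words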

Step 7) Go back to the home directory (cd ..) and package the class files into a jar (note that jar's directory-change flag is an uppercase -C):

jar -cvf words.jar -C words/ .

Step 8) Load the input into HDFS and run the job with the following commands:

hadoop fs -mkdir /input

hadoop fs -put input1.txt /input

hadoop fs -ls /input


hadoop jar /home/vijay/words.jar WordCount /input/input1.txt /out321

hadoop fs -ls /out321

hadoop fs -cat /out321/part-r-00000

(Alternatively, browse the output in the NameNode web UI: Utilities -> Browse the file system -> /)

Java Code for word count:

import java.io.IOException;
import java.util.*;
import org.apache.hadoop.conf.*;
import org.apache.hadoop.fs.*;
import org.apache.hadoop.io.*;
import org.apache.hadoop.mapreduce.*;
import org.apache.hadoop.mapreduce.lib.input.*;
import org.apache.hadoop.mapreduce.lib.output.*;
import org.apache.hadoop.util.*;

public class WordCount extends Configured implements Tool
{
public static void main(String args[]) throws Exception
{
// ToolRunner parses generic Hadoop options (e.g. -D) before invoking run()
int res = ToolRunner.run(new WordCount(), args);
System.exit(res);
}
public int run(String[] args) throws Exception
{
Path inputPath = new Path(args[0]);
Path outputPath = new Path(args[1]);

Configuration conf = getConf();


Job job = new Job(conf, this.getClass().toString());


job.setJarByClass(WordCount.class);

FileInputFormat.setInputPaths(job, inputPath);
FileOutputFormat.setOutputPath(job, outputPath);

job.setJobName("WordCount");

// Reduce is reused as a combiner to pre-aggregate counts on the map side
job.setMapperClass(Map.class);
job.setCombinerClass(Reduce.class);
job.setReducerClass(Reduce.class);
job.setMapOutputKeyClass(Text.class);
job.setMapOutputValueClass(IntWritable.class);
job.setOutputKeyClass(Text.class);
job.setOutputValueClass(IntWritable.class);
job.setInputFormatClass(TextInputFormat.class);
job.setOutputFormatClass(TextOutputFormat.class);

return job.waitForCompletion(true) ? 0 : 1;
}

public static class Map extends Mapper<LongWritable, Text, Text, IntWritable>
{
private final static IntWritable one = new IntWritable(1);
private Text word = new Text();

// called once per input line; emits (word, 1) for each token
public void map(LongWritable key, Text value, Context context) throws IOException, InterruptedException
{
String line = value.toString();
StringTokenizer tokenizer = new StringTokenizer(line);
while (tokenizer.hasMoreTokens())
{
word.set(tokenizer.nextToken());
context.write(word, one);
}


}
}

public static class Reduce extends Reducer<Text, IntWritable, Text, IntWritable>
{

// called once per distinct word; sums the counts emitted for that word
public void reduce(Text key, Iterable<IntWritable> values, Context context) throws IOException, InterruptedException
{
int sum = 0;
for(IntWritable value : values)
{
sum += value.get();
}
context.write(key, new IntWritable(sum));
}
}
}
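Because WordCount extends Configured and implements Tool, ToolRunner's GenericOptionsParser accepts standard Hadoop options placed before the program arguments. For example, a hypothetical run with two reducers (mapred.reduce.tasks is the Hadoop 1.x property for the reducer count; /out322 is just an unused output path):

hadoop jar /home/vijay/words.jar WordCount -D mapred.reduce.tasks=2 /input/input1.txt /out322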
Input File
Pune
Mumbai
Nashik
Pune
Nashik
Kolapur
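With this input, the job output in /out321/part-r-00000 should list every distinct word with its count, sorted by key (key and value are tab-separated):

Kolapur	1
Mumbai	1
Nashik	2
Pune	2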

Assignment Questions
1. What is MapReduce? Explain with a small example.
2. Write down the steps to install Hadoop.

