
L4A - Running Hadoop Application with MapReduce Program

Objective:
To explore how to execute a Hadoop application based on a MapReduce program
To explore how to change the number of reducers when running a Hadoop application

Exercise 1

Application objective:
To count the frequency of each word in a file where the size of the file is less than 128 MB

Sample dataset: Download samplefile.txt (size: 28.6 KB)

The steps:

1) transfer or put the input file into HDFS
2) execute the command
3) check the results

Execute the command:

1) Execute the following command via SSH (assuming the txt file is not in a directory):

$hadoop jar /opt/cloudera/parcels/CDH/lib/hadoop-mapreduce/hadoop-mapreduce-examples.jar wordcount samplefile.txt countfromfile

Note: if the file is in a directory, such as "input", then you need to specify "input/samplefile.txt" as the input path.

2) The MapReduce program is executed and you should see output as follows:

Note: observe the number of mappers and reducers executed by default:

mapper = 1
reducer = 2
Check the output (via HUE):

1) You should see an output folder created named countfromfile.

2) Click on the folder to view its contents.

3) Click on one of the files.

Question: What is the function of the wordcount application?
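As a hint for the question above, the wordcount job's map, shuffle, and reduce phases can be roughly imitated locally with standard Unix tools (a sketch only — the real job runs JVM mappers and reducers over HDFS; the demo file and its contents are made up for illustration):

```shell
# Rough local imitation of wordcount:
# tr acts like the mapper (split each line into words),
# sort like the shuffle phase (group identical keys together),
# uniq -c like the reducer (sum the occurrences of each word).
printf 'big data\nbig hadoop\n' > demo.txt
tr -s ' \t' '\n' < demo.txt | sort | uniq -c
# prints a count next to each distinct word, e.g. "2 big"
```

The analogy also shows why the output is sorted by word: the shuffle phase sorts keys before they reach the reducers.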

Exercise 2

Application objective:
To count the frequency of each word in a file (.csv) where the size of the file is greater than 128 MB

Sample dataset: Download Selected_Task_sample.csv (size: 188.7 MB)

The steps:

1) transfer or put the input file into HDFS
2) execute the command
3) check the results

Execute the command:

1) Execute the following command via SSH:

$hadoop jar /opt/cloudera/parcels/CDH/lib/hadoop-mapreduce/hadoop-mapreduce-examples.jar wordcount Selected_Task_sample.csv countfromSample

2) The MapReduce program is executed and you should see output as follows:

Note: observe the number of mappers and reducers executed by default:

mapper = 2
reducer = 2

(The input file is 188.7 MB, larger than the default 128 MB block size, so it is divided into two input splits and therefore two mappers.)
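The jump from one mapper to two follows from the input size: Hadoop creates roughly one map task per input split, and the default split size equals the 128 MB HDFS block size. A quick sanity check of that arithmetic:

```shell
# Number of input splits = ceil(file size / block size);
# wordcount runs one mapper per split.
FILE_MB=189    # 188.7 MB rounded up to whole megabytes
BLOCK_MB=128   # default HDFS block size
SPLITS=$(( (FILE_MB + BLOCK_MB - 1) / BLOCK_MB ))
echo "splits (mappers): $SPLITS"
# prints: splits (mappers): 2
```

By the same arithmetic, the 28.6 KB file in Exercise 1 fits in a single block, hence the single mapper observed there.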

Check the output (via HUE):

1) You should see an output folder created named countfromSample.

2) Click on the folder to view its contents.

3) Click on one of the files to check the output.

Exercise 3

Application objective:
To count the frequency of each word in a file (.csv) where the size of the file is greater than 128 MB

Sample dataset: Download Selected_Task_sample.csv (size: 188.7 MB)

The steps:

1) transfer or put the input file into HDFS
2) execute the command with an additional setting to change the default number of reducers
3) check the results

Execute the command:

1) Execute the following command via SSH (assuming the csv file is in a directory named input2):

$hadoop jar /opt/cloudera/parcels/CDH/lib/hadoop-mapreduce/hadoop-mapreduce-examples.jar wordcount -D mapred.reduce.tasks=1 input2/Selected_Task_sample.csv countfromSample2

Note: mapred.reduce.tasks is the older property name; on newer Hadoop versions the equivalent property is mapreduce.job.reduces.
2) The MapReduce program is executed and you should see output as follows:

Note: observe the number of mappers and reducers executed (the -D setting overrides the default reducer count):

mapper = 2
reducer = 1
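With a single reducer there is only one output partition, so all word counts land in a single part-r-00000 file. Conceptually, Hadoop's default HashPartitioner sends each key to partition hash(key) mod numReduceTasks. A shell sketch of that idea, using cksum as a stand-in for the real hashCode() and made-up sample words:

```shell
# Every key maps to partition 0 when there is one reducer,
# because any value mod 1 is 0.
NUM_REDUCERS=1
for word in big data hadoop; do
  h=$(printf '%s' "$word" | cksum | cut -d' ' -f1)   # stand-in hash
  echo "$word -> partition $(( h % NUM_REDUCERS ))"  # always partition 0
done
```

With reducer = 2 (Exercises 1 and 2), keys are spread across two partitions, which is why those jobs produce two part-r-* output files.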

Check the output (via HUE):

1) You should see an output folder created named countfromSample2.

2) Click on the folder to view its contents.

3) Click on the file.
