L4A Running Hadoop With MR
MapReduce Program
Objectives:
1) To explore how to execute a Hadoop application based on a MapReduce program
2) To explore how to change the number of reducers when running a Hadoop application
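To make the role of the reducer count concrete, the following is a minimal local simulation of a word-count job, not actual Hadoop code. The names map_words, partition and run_job and the sample input are hypothetical, and zlib.crc32 is only a deterministic stand-in for the key hash that a hash-based partitioner would use. It sketches how each key is routed to exactly one reducer, so that a job run with N reducers produces N separate output parts.

```python
import zlib
from collections import defaultdict


def map_words(line):
    """Map step: emit a (word, 1) pair for every whitespace-separated word."""
    return [(word.lower(), 1) for word in line.split()]


def partition(key, num_reducers):
    """Pick the reducer for a key.

    crc32 is an assumed, deterministic stand-in for the key hash;
    the real hash function used by Hadoop differs.
    """
    return zlib.crc32(key.encode("utf-8")) % num_reducers


def run_job(lines, num_reducers):
    """Simulate map -> shuffle -> reduce and return one dict per reducer."""
    # Shuffle: group mapped values by key inside the bucket of the chosen reducer.
    buckets = [defaultdict(list) for _ in range(num_reducers)]
    for line in lines:
        for word, one in map_words(line):
            buckets[partition(word, num_reducers)][word].append(one)
    # Reduce: sum the grouped values for each key.
    return [{word: sum(values) for word, values in bucket.items()}
            for bucket in buckets]


if __name__ == "__main__":
    sample = ["big data big cluster", "data node data"]  # made-up input
    for index, counts in enumerate(run_job(sample, num_reducers=2)):
        # Each reducer writes its own part file, named like part-r-00000.
        print(f"part-r-{index:05d}", sorted(counts.items()))
```

Changing num_reducers in the call changes how many parts are printed, while the combined word counts stay the same. This is why an output folder may contain more than one part file when more than one reducer is used.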
*Note: if the file is in a directory, such as "input", you need to specify the path as "input/samplefile.txt".
mapper = 1
reducer = 2
Check the output (via HUE):
1) You should see an output folder created, named countfromfile.
mapper = 2
reducer = 2
Check the output (via HUE):
1) You should see an output folder created, named countfromSample.
2) Click on the folder, and you should get the following:
Exercise 3
Application objective: To count the frequency of each word in a file (.csv) whose size is greater than 128 MB
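The 128 MB threshold matters because it is the default HDFS block size in recent Hadoop versions, and the number of mappers normally follows the number of input splits. The sketch below is a simplified model of that relationship, not Hadoop's own code. The function name estimate_splits is hypothetical, and the slop factor of 1.1 is an assumption meant to mirror the slack that Hadoop's FileInputFormat allows on the last split of a splittable file.

```python
MB = 1024 * 1024


def estimate_splits(file_size, split_size=128 * MB, slop=1.1):
    """Estimate the number of input splits (and so mappers) for one splittable file.

    Assumes the split size equals the block size and that the last split
    may be up to `slop` times the split size.
    """
    splits = 0
    remaining = file_size
    # Carve off full-size splits while clearly more than one split remains.
    while remaining / split_size > slop:
        splits += 1
        remaining -= split_size
    # Whatever is left, if anything, forms the final split.
    if remaining > 0:
        splits += 1
    return splits


if __name__ == "__main__":
    for size_mb in (100, 130, 200, 300):
        print(size_mb, "MB ->", estimate_splits(size_mb * MB), "split(s)")
```

Under these assumptions a 200 MB file gives two splits, while a file only slightly larger than 128 MB (for example 130 MB) may still fit in a single split. The actual mapper count reported by a job depends on the cluster's configured block and split sizes.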
mapper = 2
reducer = 1
Check the output (via HUE):
1) You should see an output folder created, named countfromSample2.
2) Click on the folder, and you should get the following: