0% found this document useful (0 votes)
7 views4 pages

DSBDN

The document details a series of Hadoop MapReduce jobs executed on a Cloudera quickstart environment, including the submission of jobs and their successful completion. It provides logs showing the processing of input files, the number of map and reduce tasks, and the output generated in specified directories. The output from the jobs includes word counts and consumption data, stored in HDFS directories MRDir3 and MRDir4.

Uploaded by

4t6b3
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
7 views4 pages

DSBDN

The document details a series of Hadoop MapReduce jobs executed on a Cloudera quickstart environment, including the submission of jobs and their successful completion. It provides logs showing the processing of input files, the number of map and reduce tasks, and the output generated in specified directories. The output from the jobs includes word counts and consumption data, stored in HDFS directories MRDir3 and MRDir4.

Uploaded by

4t6b3
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

[cloudera@quickstart 33312]$ gedit wordcountFile.

txt
[cloudera@quickstart 33312]$ hadoop fs -put wordcountFile WordcountFile
[cloudera@quickstart 33312]$ hadoop jar MRProgramsDemo.jar PackageDemo.WordCount
WordcountFile MRDir3
25/01/21 22:17:28 INFO client.RMProxy: Connecting to ResourceManager at /0.0.0.0:8032
25/01/21 22:17:31 INFO input.FileInputFormat: Total input paths to process : 1
25/01/21 22:17:31 INFO mapreduce.JobSubmitter: number of splits:1
25/01/21 22:17:32 INFO mapreduce.JobSubmitter: Submitting tokens for job:
job_1737526005012_0001
25/01/21 22:17:33 INFO impl.YarnClientImpl: Submitted application
application_1737526005012_0001
25/01/21 22:17:33 INFO mapreduce.Job: The url to track the job:
http://quickstart.cloudera:8088/proxy/application_1737526005012_0001/
25/01/21 22:17:33 INFO mapreduce.Job: Running job: job_1737526005012_0001
25/01/21 22:17:58 INFO mapreduce.Job: Job job_1737526005012_0001 running in uber mode :
false
25/01/21 22:17:58 INFO mapreduce.Job: map 0% reduce 0%
25/01/21 22:18:18 INFO mapreduce.Job: map 100% reduce 0%
25/01/21 22:18:36 INFO mapreduce.Job: map 100% reduce 100%
25/01/21 22:18:37 INFO mapreduce.Job: Job job_1737526005012_0001 completed successfully
25/01/21 22:18:37 INFO mapreduce.Job: Counters: 49
File System Counters
FILE: Number of bytes read=194
FILE: Number of bytes written=221075
FILE: Number of read operations=0
FILE: Number of large read operations=0
FILE: Number of write operations=0
HDFS: Number of bytes read=221
HDFS: Number of bytes written=20
HDFS: Number of read operations=6
HDFS: Number of large read operations=0
HDFS: Number of write operations=2
Job Counters
Launched map tasks=1
Launched reduce tasks=1
Data-local map tasks=1
Total time spent by all maps in occupied slots (ms)=17580
Total time spent by all reduces in occupied slots (ms)=14780
Total time spent by all map tasks (ms)=17580
Total time spent by all reduce tasks (ms)=14780
Total vcore-seconds taken by all map tasks=17580
Total vcore-seconds taken by all reduce tasks=14780
Total megabyte-seconds taken by all map tasks=18001920
Total megabyte-seconds taken by all reduce tasks=15134720
Map-Reduce Framework
Map input records=1
Map output records=18
Map output bytes=152
Map output materialized bytes=194
Input split bytes=124
Combine input records=0
Combine output records=0
Reduce input groups=3
Reduce shuffle bytes=194
Reduce input records=18
Reduce output records=3
Spilled Records=36
Shuffled Maps =1
Failed Shuffles=0
Merged Map outputs=1
GC time elapsed (ms)=271
CPU time spent (ms)=1260
Physical memory (bytes) snapshot=332877824
Virtual memory (bytes) snapshot=3008630784
Total committed heap usage (bytes)=226365440
Shuffle Errors
BAD_ID=0
CONNECTION=0
IO_ERROR=0
WRONG_LENGTH=0
WRONG_MAP=0
WRONG_REDUCE=0
File Input Format Counters
Bytes Read=97
File Output Format Counters
Bytes Written=20
[cloudera@quickstart 33312]$ hadoop fs -ls MRDir3
Found 2 items
-rw-r--r-- 1 cloudera cloudera 0 2025-01-21 22:18 MRDir3/_SUCCESS
-rw-r--r-- 1 cloudera cloudera 20 2025-01-21 22:18 MRDir3/part-r-00000
[cloudera@quickstart 33312]$ hadoop fs -cat MRDir3/part-r-00000
BUS 7
CAR 7
TRAIN 4
[cloudera@quickstart 33312]$ hadoop fs -put sample ConsumData
[cloudera@quickstart 33312]$ hadoop jar ElectricMax.jar Electric.maxelctric ConsumData MRDir4
25/01/21 23:07:54 INFO client.RMProxy: Connecting to ResourceManager at /0.0.0.0:8032
25/01/21 23:07:58 INFO input.FileInputFormat: Total input paths to process : 1
25/01/21 23:07:58 INFO mapreduce.JobSubmitter: number of splits:1
25/01/21 23:07:59 INFO mapreduce.JobSubmitter: Submitting tokens for job:
job_1737526005012_0002
25/01/21 23:08:00 INFO impl.YarnClientImpl: Submitted application
application_1737526005012_0002
25/01/21 23:08:00 INFO mapreduce.Job: The url to track the job:
http://quickstart.cloudera:8088/proxy/application_1737526005012_0002/
25/01/21 23:08:00 INFO mapreduce.Job: Running job: job_1737526005012_0002
25/01/21 23:08:23 INFO mapreduce.Job: Job job_1737526005012_0002 running in uber mode :
false
25/01/21 23:08:23 INFO mapreduce.Job: map 0% reduce 0%
25/01/21 23:08:44 INFO mapreduce.Job: map 100% reduce 0%
25/01/21 23:09:05 INFO mapreduce.Job: map 100% reduce 100%
25/01/21 23:09:05 INFO mapreduce.Job: Job job_1737526005012_0002 completed successfully
25/01/21 23:09:06 INFO mapreduce.Job: Counters: 49
File System Counters
FILE: Number of bytes read=721
FILE: Number of bytes written=222135
FILE: Number of read operations=0
FILE: Number of large read operations=0
FILE: Number of write operations=0
HDFS: Number of bytes read=340
HDFS: Number of bytes written=40
HDFS: Number of read operations=6
HDFS: Number of large read operations=0
HDFS: Number of write operations=2
Job Counters
Launched map tasks=1
Launched reduce tasks=1
Data-local map tasks=1
Total time spent by all maps in occupied slots (ms)=18004
Total time spent by all reduces in occupied slots (ms)=17763
Total time spent by all map tasks (ms)=18004
Total time spent by all reduce tasks (ms)=17763
Total vcore-seconds taken by all map tasks=18004
Total vcore-seconds taken by all reduce tasks=17763
Total megabyte-seconds taken by all map tasks=18436096
Total megabyte-seconds taken by all reduce tasks=18189312
Map-Reduce Framework
Map input records=5
Map output records=65
Map output bytes=585
Map output materialized bytes=721
Input split bytes=121
Combine input records=0
Combine output records=0
Reduce input groups=5
Reduce shuffle bytes=721
Reduce input records=65
Reduce output records=5
Spilled Records=130
Shuffled Maps =1
Failed Shuffles=0
Merged Map outputs=1
GC time elapsed (ms)=321
CPU time spent (ms)=1250
Physical memory (bytes) snapshot=333582336
Virtual memory (bytes) snapshot=3007369216
Total committed heap usage (bytes)=226365440
Shuffle Errors
BAD_ID=0
CONNECTION=0
IO_ERROR=0
WRONG_LENGTH=0
WRONG_MAP=0
WRONG_REDUCE=0
File Input Format Counters
Bytes Read=219
File Output Format Counters
Bytes Written=40
[cloudera@quickstart 33312]$ hadoop fs -ls
Found 15 items
-rw-r--r-- 1 cloudera cloudera 219 2025-01-21 23:03 ConsumData
drwxr-xr-x - cloudera cloudera 0 2025-01-14 23:31 Elec
drwxr-xr-x - cloudera cloudera 0 2025-01-13 02:10 MRDir1
drwxr-xr-x - cloudera cloudera 0 2025-01-14 22:48 MRDir2
drwxr-xr-x - cloudera cloudera 0 2025-01-21 22:18 MRDir3
drwxr-xr-x - cloudera cloudera 0 2025-01-21 23:09 MRDir4
-rw-r--r-- 1 cloudera cloudera 154 2025-01-14 22:42 Sample
-rw-r--r-- 1 cloudera cloudera 63 2025-01-13 01:58 WordCountFile
-rw-r--r-- 1 cloudera cloudera 97 2025-01-21 22:15 WordcountFile
-rw-r--r-- 1 cloudera cloudera 219 2025-01-20 20:52 sample101
-rw-r--r-- 1 cloudera cloudera 219 2025-01-20 21:01 sample102
-rw-r--r-- 1 cloudera cloudera 219 2025-01-14 23:29 sampleE
-rw-r--r-- 1 cloudera cloudera 97 2025-01-14 22:21 wordCountFile
-rw-r--r-- 1 cloudera cloudera 171 2025-01-07 23:28 wordcountFile.txt
-rw-r--r-- 1 cloudera cloudera 63 2025-01-13 02:08 wordcountfile
[cloudera@quickstart 33312]$ hadoop fs -ls MRDir4
Found 2 items
-rw-r--r-- 1 cloudera cloudera 0 2025-01-21 23:09 MRDir4/_SUCCESS
-rw-r--r-- 1 cloudera cloudera 40 2025-01-21 23:09 MRDir4/part-r-00000
[cloudera@quickstart 33312]$ hadoop fs -cat MRDir4/part-r-00000
1979 July 62
1980 July 31
1981 August 36
1984 August 43
1985 December 45

You might also like