Hadoop Practical Commands
1. Start Hadoop
sbin/start-all.sh
(start-all.sh is deprecated in recent Hadoop releases; running sbin/start-dfs.sh followed by sbin/start-yarn.sh is equivalent.)
2. Check that all Hadoop daemons are running
jps
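On a healthy single-node setup, jps should list the Hadoop daemons alongside itself; the process IDs below are only illustrative:
2401 NameNode
2563 DataNode
2790 SecondaryNameNode
2988 ResourceManager
3190 NodeManager
3412 Jps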
3. ls: list files in HDFS. With no path it lists your HDFS home directory (/user/<username>); pass a path such as / to list elsewhere.
hdfs dfs -ls
4. mkdir: make a directory in HDFS
hdfs dfs -mkdir <folder name>
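For example, to create the /input directory used in the word-count steps below:
hdfs dfs -mkdir /input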
5. touchz: create an empty file in HDFS.
hdfs dfs -touchz <file_path>
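For example (the path here is just illustrative):
hdfs dfs -touchz /input/empty.txt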
6. copyFromLocal (or put): copy files/folders from the local file system (the files present on the OS) into HDFS. This is the command you use to load data into HDFS before processing it.
hdfs dfs -copyFromLocal <local file path> <dest(present on hdfs)>
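For example, to copy the local file inputdata.txt (created in step 7 below) into the /input directory on HDFS:
hdfs dfs -copyFromLocal inputdata.txt /input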
7. Use an editor to create a file on the local file system
sudo nano inputdata.txt
(sudo is only needed if you lack write permission in the current directory.)
8. cat: print the contents of a file stored in HDFS (copy it there first, as in step 6).
hdfs dfs -cat inputdata.txt
--- Execution of MapReduce Word Count with Java ----
Create a text input file as the hduser user
sudo nano t1.txt (enter some lines of text in it)
Now copy it from the local file system to HDFS
hdfs dfs -copyFromLocal t1.txt /input
Go to the directory where the example JARs are located
cd /usr/local/hadoop/share/hadoop/mapreduce
Execute the MapReduce word count
hadoop jar hadoop-mapreduce-examples-3.3.0.jar wordcount /input /output
(Give a new output directory name every time; the job fails if the output directory already exists.)
Check your output by executing 2 commands
hdfs dfs -ls /output
hdfs dfs -cat /output/part-r-00000
(-ls shows the output files; -cat prints the actual word counts.)
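For instance, if t1.txt contained the line "hello world hello", part-r-00000 would hold one tab-separated word/count pair per line:
hello	2
world	1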
--- Execution of MapReduce Word Count with Python ----
Run the following commands. First, test mapper.py and reducer.py locally with a shell pipeline (minimal versions of both scripts are sketched at the end of this section):
cat t1.txt | python3 mapper.py | sort -k1,1 | python3 reducer.py
Then submit the job to the cluster through Hadoop Streaming:
hadoop jar /home/hduser/Downloads/jar_files/hadoop-streaming-3.2.1.jar -input /input/t1.txt -output /output1 -mapper /home/hduser/mapper.py -reducer /home/hduser/reducer.py
(Note: change the name of the output directory every time you run this command.)
Check your output by executing 2 commands
hdfs dfs -ls /output1
hdfs dfs -cat /output1/part-00000
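For reference, here is a minimal word-count mapper.py and reducer.py of the kind the streaming job above assumes (your actual scripts may differ; these just follow the standard Hadoop Streaming pattern of tab-separated key/value lines on stdin/stdout):

mapper.py:
#!/usr/bin/env python3
# Emit "word<TAB>1" for every word read from stdin.
import sys

for line in sys.stdin:
    for word in line.strip().split():
        print(f"{word}\t1")

reducer.py:
#!/usr/bin/env python3
# Sum the counts for each word. The input is sorted by key,
# so all lines for a given word arrive consecutively.
import sys

current_word = None
current_count = 0

for line in sys.stdin:
    word, _, count = line.strip().partition("\t")
    if word == current_word:
        current_count += int(count)
    else:
        if current_word is not None:
            print(f"{current_word}\t{current_count}")
        current_word = word
        current_count = int(count)

if current_word is not None:
    print(f"{current_word}\t{current_count}")

Both scripts need the #!/usr/bin/env python3 line and execute permission (chmod +x mapper.py reducer.py) so that Hadoop Streaming can run them on the worker nodes.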