0% found this document useful (0 votes)
39 views8 pages

Important Linux and HDFS Commands For Hadoop 1703072726

The document provides an overview of commonly used Linux and HDFS commands in a Hadoop cluster, highlighting their functions for file and directory management. Key Linux commands include ls, cd, mkdir, and rm, while essential HDFS commands include hdfs dfs -ls, hdfs dfs -mkdir, and hdfs dfs -put. It also outlines the historical development of these commands through various milestones in Hadoop's evolution.

Uploaded by

mathanmohan97
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
39 views8 pages

Important Linux and HDFS Commands For Hadoop 1703072726

The document provides an overview of commonly used Linux and HDFS commands in a Hadoop cluster, highlighting their functions for file and directory management. Key Linux commands include ls, cd, mkdir, and rm, while essential HDFS commands include hdfs dfs -ls, hdfs dfs -mkdir, and hdfs dfs -put. It also outlines the historical development of these commands through various milestones in Hadoop's evolution.

Uploaded by

mathanmohan97
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 8

Introduction to

Linux and HDFS


Commands
Streamline Your Workflow Now
Frequently Used Linux &
HDFS Commands in Hadoop
Cluster
Most used Linux
commands
Linux commands are used to interact with the local
system of gateway node in hadoop cluster, perform
tasks, and manage files and directories in local system.
touch - file creation, vi filename - file creation &
insertion, pwd, whoami, ls -a - hidden files etc.,
ls - This command is used to list the contents of a directory.
cd - This command is used to change the current working directory.
mkdir - This command is used to create a new directory.
rm - This command is used to remove files or directories.
cp - This command is used to copy files or directories.
mv - This command is used to move files or directories.
cat - This command is used to view the contents of a file.
grep - This command is used to search for a specific pattern in a file.
ps - This command is used to display information about running
processes.
top - This command is used to display information about system
resources.
Most used HDFS
commands
HDFS is a distributed file system that provides high-
throughput access to application data. Some of the most
used HDFS commands include hadoop fs -ls, hadoop fs
-mkdir, hadoop fs -put, hadoop fs -get, hadoop fs -rm,
and hadoop fs -du, hadoop jar, hadoop fs -tail/-head
hdfs dfs -ls (list files and directories in HDFS)
hdfs dfs -mkdir (make directory in HDFS)
hdfs dfs -rm (remove file or directory in HDFS)
hdfs dfs -cp (copy file or directory in HDFS)
hdfs dfs -mv (move or rename file or directory in HDFS)
hdfs dfs -chmod (change file or directory permissions in HDFS)
hdfs dfs -put or -copyFromLocal (upload file to HDFS)
hdfs dfs -get or -copyToLocal (download file from HDFS)
hdfs jar <jarfile> <fileinput> <output>(To run Jar file in HDFS)
Linux/HDFS Command Timeline
01 02 03 04

Milestone 1: Early Milestone 2: Milestone 3: Milestone 4:


Commands Hadoop MapReduce Recent
Basic Linux commands
Distributed File Commands like hadoop
Advancements
like ls, cd, cp, mv were System jar, hadoop fsck (file Commands like hdfs dfs -
among the first to be Commands like hdfs dfs - system check) , and du -s, hdfs dfs -tail, and
developed. ls, hdfs dfs -rm, and hdfs hadoop job -list were hdfs dfs -touchz were
dfs -put were introduced introduced with the developed.
with the Hadoop MapReduce processing
Distributed File System. engine.
Linux and HDFS
Commands
Command Line Operations.

You might also like