Important Linux and HDFS Commands For Hadoop 1703072726
The document provides an overview of commonly used Linux and HDFS commands in a Hadoop cluster, highlighting their functions for file and directory management. Key Linux commands include ls, cd, mkdir, and rm, while essential HDFS commands include hdfs dfs -ls, hdfs dfs -mkdir, and hdfs dfs -put. It also outlines the historical development of these commands through various milestones in Hadoop's evolution.
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0 ratings0% found this document useful (0 votes)
39 views8 pages
Important Linux and HDFS Commands For Hadoop 1703072726
The document provides an overview of commonly used Linux and HDFS commands in a Hadoop cluster, highlighting their functions for file and directory management. Key Linux commands include ls, cd, mkdir, and rm, while essential HDFS commands include hdfs dfs -ls, hdfs dfs -mkdir, and hdfs dfs -put. It also outlines the historical development of these commands through various milestones in Hadoop's evolution.
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 8
Introduction to
Linux and HDFS
Commands Streamline Your Workflow Now Frequently Used Linux & HDFS Commands in Hadoop Cluster Most used Linux commands Linux commands are used to interact with the local system of gateway node in hadoop cluster, perform tasks, and manage files and directories in local system. touch - file creation, vi filename - file creation & insertion, pwd, whoami, ls -a - hidden files etc., ls - This command is used to list the contents of a directory. cd - This command is used to change the current working directory. mkdir - This command is used to create a new directory. rm - This command is used to remove files or directories. cp - This command is used to copy files or directories. mv - This command is used to move files or directories. cat - This command is used to view the contents of a file. grep - This command is used to search for a specific pattern in a file. ps - This command is used to display information about running processes. top - This command is used to display information about system resources. Most used HDFS commands HDFS is a distributed file system that provides high- throughput access to application data. Some of the most used HDFS commands include hadoop fs -ls, hadoop fs -mkdir, hadoop fs -put, hadoop fs -get, hadoop fs -rm, and hadoop fs -du, hadoop jar, hadoop fs -tail/-head hdfs dfs -ls (list files and directories in HDFS) hdfs dfs -mkdir (make directory in HDFS) hdfs dfs -rm (remove file or directory in HDFS) hdfs dfs -cp (copy file or directory in HDFS) hdfs dfs -mv (move or rename file or directory in HDFS) hdfs dfs -chmod (change file or directory permissions in HDFS) hdfs dfs -put or -copyFromLocal (upload file to HDFS) hdfs dfs -get or -copyToLocal (download file from HDFS) hdfs jar <jarfile> <fileinput> <output>(To run Jar file in HDFS) Linux/HDFS Command Timeline 01 02 03 04
Milestone 1: Early Milestone 2: Milestone 3: Milestone 4:
Commands Hadoop MapReduce Recent Basic Linux commands Distributed File Commands like hadoop Advancements like ls, cd, cp, mv were System jar, hadoop fsck (file Commands like hdfs dfs - among the first to be Commands like hdfs dfs - system check) , and du -s, hdfs dfs -tail, and developed. ls, hdfs dfs -rm, and hdfs hadoop job -list were hdfs dfs -touchz were dfs -put were introduced introduced with the developed. with the Hadoop MapReduce processing Distributed File System. engine. Linux and HDFS Commands Command Line Operations.