DA Lab Program-1
File management in Hadoop is primarily done through the Hadoop Distributed File
System (HDFS), which is designed to store large data sets reliably, and to stream
those data sets at high bandwidth to user applications. To manage files in HDFS,
you use the Hadoop file system shell (hadoop fs) or its HDFS-specific
equivalent (hdfs dfs); both provide a variety of commands for managing files
and directories.
Below are the steps and commands for performing the listed file management tasks
in Hadoop:
• Adding files to HDFS: To add a file from your local file system to HDFS,
use the put command.
• Retrieving files from HDFS: To copy a file from HDFS back to your local
file system, use the get command.
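The put and get commands described above can be sketched as the short script below. It is only an illustration: the file name sample.txt, the HDFS directory /user/hadoop/lab1, and the run helper are assumptions made for this example and are not part of the lab statement. The helper prints each command instead of executing it when DRY_RUN=1 or when no hdfs client is found on the PATH, so the script can be read through safely on a machine without Hadoop.

```shell
#!/bin/sh
# Sketch: upload a local file to HDFS, list it, then download it again.
# LOCAL_FILE and HDFS_DIR are hypothetical placeholders; the local file is
# assumed to exist already when the commands are really executed.
LOCAL_FILE="sample.txt"
HDFS_DIR="/user/hadoop/lab1"

# Fall back to dry-run mode when the hdfs client is not installed.
if [ -z "${DRY_RUN:-}" ] && ! command -v hdfs >/dev/null 2>&1; then
    DRY_RUN=1
fi

# run: print the command in dry-run mode, otherwise execute it.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        printf '%s\n' "+ $*"
    else
        "$@"
    fi
}

run hdfs dfs -mkdir -p "$HDFS_DIR"                # create the target directory
run hdfs dfs -put "$LOCAL_FILE" "$HDFS_DIR/"      # local file system -> HDFS
run hdfs dfs -ls "$HDFS_DIR"                      # confirm the upload
run hdfs dfs -get "$HDFS_DIR/$LOCAL_FILE" "downloaded_$LOCAL_FILE"   # HDFS -> local
```

On a working cluster, the same four commands can be typed directly at the prompt without the run prefix.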
c) Deleting files.
touch local_empty_file.txt
f) Upload and download files in HDFS, and perform the other file operations:
copying, moving, and removing files in HDFS.
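The copy, move, and remove operations can be sketched in the same way. The paths below are hypothetical, and the script assumes that /user/hadoop/lab1/sample.txt already exists in HDFS. As a precaution, the illustrative run helper prints the commands rather than executing them when DRY_RUN=1 or when the hdfs client is not available.

```shell
#!/bin/sh
# Sketch: copy, move (rename), and remove files inside HDFS.
# HDFS_DIR and the file names are hypothetical placeholders.
HDFS_DIR="/user/hadoop/lab1"

# Fall back to dry-run mode when the hdfs client is not installed.
if [ -z "${DRY_RUN:-}" ] && ! command -v hdfs >/dev/null 2>&1; then
    DRY_RUN=1
fi

# run: print the command in dry-run mode, otherwise execute it.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        printf '%s\n' "+ $*"
    else
        "$@"
    fi
}

run hdfs dfs -cp "$HDFS_DIR/sample.txt" "$HDFS_DIR/sample_copy.txt"          # copy within HDFS
run hdfs dfs -mv "$HDFS_DIR/sample_copy.txt" "$HDFS_DIR/sample_renamed.txt"  # move or rename
run hdfs dfs -rm "$HDFS_DIR/sample_renamed.txt"                              # delete one file
run hdfs dfs -rm -r "$HDFS_DIR"                                              # delete a directory recursively
```

Note that -rm -r deletes the directory and everything under it, so it is worth checking the path with hdfs dfs -ls before running it.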