How to Compress Files in Linux | Tar Command
Last Updated :
11 Apr, 2025
File compression is a fundamental task in managing and transferring data efficiently on a Linux system. The Tar command, short for Tape Archive, is a powerful tool that allows users to create compressed and archived files.
In this comprehensive guide, we will explore the various options and examples of using the Tar command to compress files on a Linux system.
tar Command to Compress Files in Linux
The tar
command is commonly used to compress files in Linux when combined with options like -z
(gzip) or -j
(bzip2).
The Linux 'tar' stands for tape archive, which is used to create Archive and extract the Archive files. tar command in Linux is one of the important commands that provides archiving functionality in Linux. We can use the Linux tar command to create compressed or uncompressed Archive files and also maintain and modify them.
Syntax of `tar` command in Linux
tar [options] [archive-file] [file or directory to be archived]
Here ,
tar
: The command itself.[options]
: Optional flags or settings that modify the behavior of the tar
command.[archive-file]
: The name of the archive file you are creating or working with.[file or directory to be archived]
: The file or directory you want to include in the archive.
An Archive file is a file that is composed of one or more files along with metadata. Archive files are used to collect multiple data files together into a single file for easier portability and storage, or simply to compress files to use less storage space.
Options | Description |
---|
-c | Creates an archive by bundling files and directories together. |
---|
-x | Extracts files and directories from an existing archive. |
---|
-f | Specifies the filename of the archive to be created or extracted. |
---|
-t | Displays or lists the files and directories contained within an archive. |
---|
-u | Archives and adds new files or directories to an existing archive. |
---|
-v | Displays verbose information, providing detailed output during the archiving or extraction process. |
---|
-A | Concatenates multiple archive files into a single archive. |
---|
-z | Uses gzip compression when creating a tar file, resulting in a compressed archive with the '.tar.gz' extension. |
---|
-j | Uses bzip2 compression when creating a tar file, resulting in a compressed archive with the '.tar.bz2' extension. |
---|
-W | Verifies the integrity of an archive file, ensuring its contents are not corrupted. |
---|
-r | Updates or adds files or directories to an already existing archive without recreating the entire archive. |
---|
Examples of tar Command to Compress Files in Linux
1. Creating an uncompressed tar Archive using option -cvf
This command creates a tar file called file.tar which is the Archive of all .c files in the current directory.
tar cvf file.tar *.c
- '-c': Creates a new archive.
- '-v': Displays verbose output, showing the progress of the archiving process.
- '-f': Specifies the filename of the archive
Output :
os2.c
os3.c
os4.c
2. Extracting files from Archive using option -xvf
This command extracts files from Archives.
tar xvf file.tar
- '-x': Extracts files from an archive.
- '-v': Displays verbose output during the extraction process.
- '-f': Specifies the filename of the archive.
Output :
os2.c
os3.c
os4.c
3. gzip compression on the tar Archive, using option -z
This command creates a tar file called file.tar.gz which is the Archive of .c files.
tar cvzf file.tar.gz *.c
- '-z': Uses gzip compression.
- '-j': Uses bzip2 compression.
- '-J': Uses xz compression.
4. Extracting a gzip tar Archive *.tar.gz using option -xvzf :
This command extracts files from tar archived file.tar.gz files.
tar xvzf file.tar.gz
5. Creating compressed tar archive file in Linux using option -j
This command compresses and creates archive files less than the size of the gzip. Both compress and decompress take more time than gzip.
tar cvfj file.tar.tbz example.cpp
Output:
tar cvfj file.tar.tbz example.cpp
example.cpp
tar tvf file.tar.tbz
-rwxrwxrwx root/root 94 2017-09-17 02:47 example.cpp
6. Untar single tar file or specified directory in Linux:
This command will Untar a file in current directory or in a specified directory using -C option.
tar xvfj file.tar
or
tar xvfj file.tar -C path of file in directory
7. Untar multiple .tar, .tar.gz, .tar.tbz file in Linux:
This command will extract or untar multiple files from the tar, tar.gz and tar.bz2 archive file. For example, the above command will extract “fileA” “fileB” from the archive files.
tar xvf file.tar "fileA" "fileB"
or
tar zxvf file1.tar.gz "fileA" "fileB"
or
tar jxvf file2.tar.tbz "fileA" "fileB"
8. Check size of existing tar, tar.gz, tar.tbz file in Linux:
The above command will display the size of archive file in Kilobytes (KB).
tar czf file.tar | wc -c
or
tar czf file1.tar.gz | wc -c
or
tar czf file2.tar.tbz | wc -c
9. Update existing tar file in Linux
tar rvf file.tar *.c
Output:
os1.c
10. List the contents and specify the tarfile using option -tf
This command will list the entire list of archived files. We can also list for specific content in a tarfile
tar tf file.tar
Output:
example.cpp
11. Applying pipe to through 'grep command' to find what we are looking for
This command will list only for the mentioned text or image in grep from archived file.
tar tvf file.tar | grep "text to find"
or
tar tvf file.tar | grep "filename.file extension"
12. We can pass a file name as an argument to search a tarfile:
This command views the archived files along with their details.
tar tvf file.tar filename
13. Viewing the Archive using option -tvf
tar tvf file.tar
Output:
-rwxrwxrwx root/root 191 2017-09-17 02:20 os2.c
-rwxrwxrwx root/root 218 2017-09-17 02:20 os3.c
-rwxrwxrwx root/root 493 2017-09-17 02:20 os4.c
What are wildcards in Linux
Alternatively referred to as a 'wild character' or 'wildcard character', a wildcard is a symbol used to replace or represent one or more characters. Wildcards are typically either an asterisk (*), which represents one or more characters or question mark (?), which represents a single character.
14. To search for an image in .png format:
This will extract only files with the extension .png from the archive file.tar. The -wildcards option tells tar to interpret wildcards in the name of the files to be extracted; the filename (*.png) is enclosed in single-quotes to protect the wildcard (*) from being expanded incorrectly by the shell.
tar tvf file.tar --wildcards '*.png'
Note: In the above commands " * " is used in place of file name to take all the files present in that particular directory.
Zip Command to Compress Files in Linux
In the realm of Linux file compression, the `zip
`
command emerges as a powerful and user-friendly tool. Distinct from the `tar`
command, `zip`
specializes in creating compressed files while preserving the integrity of the original content. The straightforward syntax of the `zip`
command simplifies the compression process:
Basic Syntax:
zip [options] zipfile files/directories
[
options
]
: Optional flags or settings for the zip
command.zipfile
: The name of the ZIP archive to be created.files/directories
: The files or directories to be included in the ZIP archive.
zip ./bigfile.zip bigfile
This single line of code encapsulates the essence of the `zip`
command, compressing the file `bigfile`
and generating a new file, `bigfile.zip`
, which holds the compressed version of the original content. Notably, the original file remains untouched, ensuring data safety during the compression process.
The user-friendly nature of the `zip
`
command makes it an accessible option for those seeking a straightforward solution for creating compressed archives. Whether you're compressing a single file or multiple files, the simplicity and efficiency of the` zip`
command contribute to its popularity among Linux users. As a versatile tool in the compression toolkit, `zip`
offers a seamless balance between ease of use and effective file compression.
Gzip Command to Compress Files in Linux
In the landscape of Linux compression utilities, the `gzip
`
command stands out as a straightforward and highly efficient tool. Its usage is elegantly simple—provide the filename, and `gzip`
seamlessly compresses it in place. This can be achieved with a single command
Basic Syntax:
gzip [options] filename
[options]
: Optional flags or settings you may want to apply.filename
: The name of the file you wish to compress.
gzip bigfile
Unlike certain compression commands that generate a separate compressed file, `gzip`
operates "in place." In other words, it encrypts the original file, replacing it with the compressed version. This intrinsic simplicity is a key attribute that positions `gzip`
as an excellent choice for users seeking a rapid and effective file compression solution.
The in-place compression feature not only reduces the need for managing multiple files but also ensures a smooth integration into various workflows. This makes `gzip`
particularly well-suited for scenarios where simplicity and speed are paramount. Whether you are compressing a single file or multiple files, the straightforward nature of the `gzip`
command contributes to its popularity among Linux users, making it a reliable and efficient compression tool in diverse contexts.
Bzip2 Command to Compress Files in Linux
In the intricate landscape of Linux file compression, the `bzip2
`
command presents itself as a potent and versatile tool, bearing similarities to its counterpart, `gzip`
. Functioning in a manner akin to `gzip`
, the `bzip2`
command compresses files in place, preserving only the original file. The simplicity of its application is illustrated by the following example:
Basic Syntax :
bzip2 [options] filename
Here, `options`
represent any additional flags or settings you may want to apply, and `filename`
is the name of the file you wish to compress. The basic command without options compresses the file in place, replacing the original file with the compressed version.
bzip2 bigfile
This concise command initiates the compression process, resulting in the creation of `bigfile.bz2`
—the compressed iteration of the original file. Much like the decision between `gzip`
and `bzip2`
often depends on specific use cases, the choice between these two commands is nuanced and influenced by factors such as the nature of the data being compressed and the desired compression ratio.
While both commands share the fundamental concept of in-place compression, `bzip2`
is known for its distinctive compression algorithm, which often achieves higher compression ratios compared to `gzip`
. Consequently, users may opt for `bzip2`
when prioritizing file size reduction and are willing to trade off some compression speed. This nuanced decision-making process underscores the flexibility of the `bzip2`
command in catering to diverse compression requirements within the Linux ecosystem.
XZ Command to Compress Files in Linux
A relative newcomer to the compression scene, the `xz
`
command is recognized for its impressive compression capabilities. While it might take longer for large files, the compression results are noteworthy:
Basic syntax:
xz [options] filename
[options]
: Optional flags or settings you may want to apply.filename
: The name of the file you wish to compress.
xz bigfile
The `bigfile.xz`
showcases the compressed version of the file. `xz`
has gained popularity for its ability to achieve significant compression ratios, making it a top choice for those prioritizing file size reduction.
Conclusion
This article is all about making files smaller in Linux, and it's like unlocking cool tricks with the Tape Archive (Tar) command. We learned how Tar can bundle files together and also squeeze them down using things like gzip, bzip2, and xz. We didn't stop there – we also talked about friendly tools like zip and easy commands like gzip, bzip2, and xz.
This helps you choose what's best for you, whether you want things to be super simple, work quickly, save lots of space, or do some impressive stuff. So, if you're using Linux, this guide helps you do smart things with your files, making them easy to handle.
Similar Reads
Linux/Unix Tutorial Linux is one of the most widely used open-source operating systems. It's fast, secure, stable, and powers everything from smartphones and servers to cloud platforms and IoT devices. Linux is especially popular among developers, system administrators, and DevOps professionals.Linux is:A Unix-like OS
10 min read
Getting Started with Linux
What is Linux Operating SystemLinux is based on the UNIX operating system. UNIX is a powerful, multi-user, multitasking operating system originally developed in the 1970s at AT&T Bell Labs. It laid the foundation for many modern operating systems, including Linux.Linux is free and open-source, accessible to everyone.Its sour
10 min read
LINUX Full Form - Lovable Intellect Not Using XPLINUX stands for Lovable Intellect Not Using XP. Linux was developed by Linus Torvalds and named after him. Linux is an open-source and community-developed operating system for computers, servers, mainframes, mobile devices, and embedded devices. Linux receives requests from system programs and it r
2 min read
Difference between Linux and WindowsLinux: Linux could be a free and open supply OS supported operating system standards. It provides programming interface still as programme compatible with operating system primarily based systems and provides giant selection applications. A UNIX operating system additionally contains several several
7 min read
What are Linux Distributions ?A Linux distribution, often shortened to âdistro,â is a packaged version of Linux that comes with the Linux kernel plus a collection of software and utilities that make the OS functional and user-friendly. Some distros are optimized for business environments, offering tools for productivity and ente
8 min read
Difference between Unix and LinuxUnix was created in the 1970s by Ken Thompson and Dennis Ritchie at Bell Labs. Dennis Ritchie was also the creator of the C programming language. Originally a command-line operating system, Unix has evolved to support graphical interfaces (GUI) as well. It became popular in universities, enterprises
5 min read
Installation with Linux
How to Install Arch Linux in VirtualBox?Installing Arch Linux on a virtual machine is an excellent way to experience this powerful and flexible Linux distribution without affecting your main system. If you're looking to install Arch Linux in VirtualBox, this guide will take you through the process step-by-step. Arch Linux is known for its
7 min read
Fedora Linux Operating SystemFedora Linux is a free and open-source operating system based on the Linux kernel and was developed by the community-supported Fedora Project. It is known for its fast release cycle, which keeps the operating system up to date with the latest software and technologies.What is the Fedora Linux Operat
12 min read
How to install Ubuntu on VirtualBox?Installing Ubuntu on VirtualBox is a great way to experience the powerful features of this popular Linux distribution without altering your main operating system. Whether youâre a developer, a student, or simply curious about Linux, setting up Ubuntu on VirtualBox allows you to test and explore in a
6 min read
How to Install Linux Mint?Linux Mint is the second-largest Linux-based distro used in the world. Linux Mint is a community-driven Linux distribution based on Ubuntu which itself is based on Debian and bundled with a variety of free and open-source applications. So here we discuss the installation of Linux mint. Installation
3 min read
How to Install Kali Linux on Windows?Kali Linux is an open-source Linux distribution based on Debian, designed for sophisticated penetration testing and security auditing. Kali Linux includes hundreds of tools for diverse information security activities such as penetration testing, security research, computer forensics, and reverse eng
2 min read
How to Install Linux on Windows PowerShell Subsystem?There are several ways to Install a Linux subsystem on your Windows PC Powershell Environment. It is good for learners, but it is recommended using original Linux OS if you are a developer as the Subsystem lacks the pre-installed Linux tools. Before we begin installing a Linux subsystem, we need to
2 min read
How to Find openSUSE Linux Version?openSUSE is well known for its GNU/Linux-based operating systems, mainly Tumbleweed, a tested rolling release, and Leap, a distribution with Long-Term-Support(LTS). MicroOS and Kubic are new transactional, self-contained distributions for use as desktop or container runtime. Here we figure out which
2 min read
How to Install CentOSCentOS is a popular open-source Linux distribution aimed at servers and provides compatibility with Red Hat's RPM package manager. It is built with the goal of providing a stable operating system that provided great compatibility with the upstream RHEL (Red hat enterprise Linux) CentOS is therefore
2 min read
Linux Commands
Linux CommandsLinux commands are essential for controlling and managing the system through the terminal. This terminal is similar to the command prompt in Windows. Itâs important to note that Linux/Unix commands are case-sensitive. These commands are used for tasks like file handling, process management, user adm
15+ min read
Essential Unix CommandsUnix commands are a set of commands that are used to interact with the Unix operating system. Unix is a powerful, multi-user, multi-tasking operating system that was developed in the 1960s by Bell Labs. Unix commands are entered at the command prompt in a terminal window, and they allow users to per
7 min read
How to Find a File in Linux | Find CommandThe find command in Linux is used to search for files and directories based on name, type, size, date, or other conditions. It scans the specified directory and its sub directories to locate files matching the given criteria.find command uses are:Search based on modification time (e.g., files edited
9 min read
Linux File System
Linux File SystemA file system is a structured method of storing and managing dataâincluding files, directories, and metadataâon your machine. Think of it like a library. If thousands of books were scattered around, finding one would be hard. But in an organized structure, like labeled shelves, locating a book becom
12 min read
Linux File Hierarchy StructureThe Linux File Hierarchy Structure or the Filesystem Hierarchy Standard (FHS) defines the directory structure and directory contents in Unix-like operating systems. It is maintained by the Linux Foundation. In the FHS, all files and directories appear under the root directory /, even if they are sto
6 min read
Linux Directory StructureIn Linux, everything is treated as a file even if it is a normal file, a directory, or even a device such as a printer or keyboard. All the directories and files are stored under one root directory which is represented by a forward slash /. The Linux directory layout follows the Filesystem Hierarchy
6 min read
Linux Kernel
Linux KernelLinux Kernel is the heart of Linux operating systems. It is an open-source (source code that can be used by anyone freely) software that is most popular and widely used in the industry as well as on a personal use basis. Who created Linux and why? Linux was created by Linus Torvalds in 1991 as a hob
4 min read
Kernel in Operating SystemA kernel is the core part of an operating system. It acts as a bridge between software applications and the hardware of a computer. The kernel manages system resources, such as the CPU, memory and devices, ensuring everything works together smoothly and efficiently. It handles tasks like running pro
9 min read
How Linux Kernel Boots?Many processes are running in the background when we press the system's power button. It is very important to learn the Linux boot process to understand the workings of any operating system. Knowing how the kernel boots is a must to solve the booting error. It is a very interesting topic to learn, l
11 min read
Difference between Operating System and KernelIn the world of computing, two terms that are frequently mentioned are Operating System (OS) and Kernel. In this article, we will explore the key differences between the OS and the Kernel, their functions, and how they work together to manage hardware and software.What is an Operating System?An Oper
3 min read
Linux Kernel Module Programming: Hello World ProgramKernel modules are pieces of code that can be loaded and unloaded into the kernel upon demand. They extend the functionality of the kernel without the need to reboot the system. Custom codes can be added to Linux kernels via two methods. The basic way is to add the code to the kernel source tree and
7 min read
Linux Loadable Kernel ModuleIf you want to add code to a Linux kit, the basic way to do that is to add source files to the kernel source tree and assemble the kernel. In fact, the process of setting up the kernel consists mainly of selecting which files to upload to the kernel will be merged. But you can also add code to the L
7 min read
Loadable Kernel Module - Linux Device Driver DevelopmentFor Linux device drivers, we can use only two languages: Assembler and C. Assembler implements the main parts of the Linux kernel, while C implements the architecture-dependent parts. Uploaded kernel modules are often referred to as kernel modules or modules, but those are misleading names because t
4 min read
Linux Networking Tools
Network configuration and troubleshooting commands in LinuxComputers are often connected to each other on a network. They send requests to each other in the form of packets that travel from the host to the destination. Linux provides various commands from network configuration and troubleshooting. Network Configuration and Troubleshooting Commands in Linux
5 min read
How to configure network interfaces in CentOS?A network interface is a link between a computer and another network(Private or Public). The network interface is basically a card which is known as NIC or Network Interface Card, this does not necessarily have to be in a physical form instead, it can be inbuilt into the software. If we take the exa
5 min read
Command-Line Tools and Utilities For Network Management in LinuxIf you are thinking of becoming a system administrator, or you are already a system admin, then this article is for you.As a system admin, your daily routine will include configuring, maintaining, troubleshooting, monitoring, securing networks, and managing servers within data centers. Network confi
8 min read
Linux - Network Monitoring ToolsNetwork monitoring is using a system (hardware or software) that continuously observes your network and the data flows through it, depending on how the monitoring solution actually functions and informs the network administrator. We can keep a check on all the activities of our network easily. While
4 min read
Linux Process
Linux Firewall
Shell Scripting & Bash Scripting
Introduction to Linux Shell and Shell ScriptingWhenever we use any modern operating system like Linux, macOS, or Windows we are indirectly interacting with a shell, the program that interprets and executes our commands. While running Ubuntu, Linux Mint, or any other Linux distribution, we are interacting with the shell by using the terminal. In
8 min read
What is Terminal, Console, Shell and Kernel?Understanding the terms terminal, console, shell, and kernel is crucial for anyone working with computers or learning about operating systems. These concepts are key components of how we interact with our devices and software. The terminal is a text-based interface used to interact with the computer
5 min read
How to Create a Shell Script in linuxShell is an interface of the operating system. It accepts commands from users and interprets them to the operating system. If you want to run a bunch of commands together, you can do so by creating a shell script. Shell scripts are very useful if you need to do a task routinely, like taking a backup
7 min read
Shell Scripting - Different types of VariablesThe shell is a command-line interpreter for Linux and Unix systems. It provides an interface between the user and the kernel and executes commands. A sequence of commands can be written in a file for execution in the shell. It is called shell scripting. It helps to automate tasks in Linux. Scripting
4 min read
Bash Scripting - Introduction to Bash and Bash ScriptingBash is a command-line interpreter or Unix Shell and it is widely used in GNU/Linux Operating System. It is written by Brian Jhan Fox. It is used as a default login shell for most Linux distributions. Scripting is used to automate the execution of the tasks so that humans do not need to perform them
12 min read
Bash Script - Define Bash Variables and its typesVariables are an important aspect of any programming language. Without variables, you will not be able to store any required data. With the help of variables, data is stored at a particular memory address and then it can be accessed as well as modified when required. In other words, variables let yo
12 min read
Shell Scripting - Shell VariablesA shell variable is a character string in a shell that stores some value. It could be an integer, filename, string, or some shell command itself. Basically, it is a pointer to the actual data stored in memory. We have a few rules that have to be followed while writing variables in the script (which
6 min read
Bash Script - Difference between Bash Script and Shell ScriptIn computer programming, a script is defined as a sequence of instructions that is executed by another program. A shell is a command-line interpreter of Linux which provides an interface between the user and the kernel system and executes a sequence of instructions called commands. A shell is capabl
4 min read
Shell Scripting - Difference between Korn Shell and Bash shellKorn Shell: Korn Shell or KSH was developed by a person named David Korn, which attempts to integrate the features of other shells like C shell, Bourne Shell, etc. Korn Shell allows developers to generate and create new shell commands whenever it is required. Korn shell was developed a long year bac
3 min read
Shell Scripting - Interactive and Non-Interactive ShellA shell gives us an interface to the Unix system. While using an operating system, we indirectly interact with the shell. On Linux distribution systems, each time we use a terminal, we interact with the shell. The job of the shell is to interpret or analyze the Unix commands given by users. A shell
3 min read
Shell Script to Show the Difference Between echo â$SHELLâ and echo â$SHELLâIn shell scripting and Linux, the echo command is used to display text on the terminal or console. When used with the $SHELL variable, which contains the path of the current user's shell program, the output of the echo command can be different depending on whether the variable is enclosed in single
4 min read