$ XZ - V Data - CSV

The document explains how to use xz for compressing single files and multiple files using tar, highlighting the command syntax and options available for both compression and decompression. It also discusses the benefits of multithreading in xz for faster compression and provides examples of using environment variables to set options. Additionally, it compares the compression effectiveness of xz against gzip and pigz, demonstrating that xz can create smaller archives at the cost of increased compression time.

Uploaded by

Paulo Almeida

3. Using xz for Single Files

Let’s use xz to compress a single file.

Apart from the program name, the usage is identical to that of gzip:
$ xz -v data.csv

This command compresses the file data.csv and replaces it with the file data.csv.xz. The -v option
makes xz display progress information.
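
As a small sketch of the command above, the following script builds a throwaway CSV file and compresses it. The scratch directory and the generated data are assumptions made for illustration. The -k (keep the input file) and -l (list archive information) options are standard xz options that this section doesn't otherwise cover.

```shell
#!/bin/sh
set -eu

# Hypothetical scratch directory and sample data, for illustration only.
workdir=$(mktemp -d)
awk 'BEGIN { for (i = 1; i <= 50000; i++) print i "," (i * 2) ",row" i }' \
    > "$workdir/data.csv"

# Compress verbosely; -k keeps data.csv so both files can be compared.
xz -kv "$workdir/data.csv"

# Show the compressed size, uncompressed size, and ratio of the result.
xz -l "$workdir/data.csv.xz"
```

Without -k, xz would remove data.csv once data.csv.xz has been written, as described above.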
xz accepts compression levels 0-9, one more than the 1-9 range of gzip. The default compression level is 6. However, unlike
gzip, that default compression level isn’t usually a good compromise between speed and compression
ratio.
So, let’s compress a file with the low compression level 1:
$ xz -v1 data.csv

Like gzip with gunzip, xz ships with a separate unxz command, but it’s simply equivalent to xz -d.

So, we use the -d option to decompress a single file:
$ xz -dv data.csv.xz

This decompresses the file data.csv.xz and replaces it with data.csv. Again, the -v option also displays
progress information.
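
To illustrate that compression followed by decompression restores the original content, here is a minimal round-trip sketch. The file names and sample data are hypothetical. The -t option, which tests archive integrity without writing any output, is a standard xz option not mentioned in the text.

```shell
#!/bin/sh
set -eu

# Hypothetical scratch directory and sample data, for illustration only.
workdir=$(mktemp -d)
awk 'BEGIN { for (i = 1; i <= 50000; i++) print i "," (i * 2) ",row" i }' \
    > "$workdir/data.csv"
cp "$workdir/data.csv" "$workdir/original.csv"

xz "$workdir/data.csv"         # data.csv is replaced by data.csv.xz
xz -t "$workdir/data.csv.xz"   # verify integrity without writing output
xz -d "$workdir/data.csv.xz"   # data.csv.xz is replaced by data.csv

# cmp exits with a non-zero status if the two files differ.
cmp "$workdir/data.csv" "$workdir/original.csv" && echo "round trip OK"
```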

4. Using tar With xz for Multiple Files and Directories

Just like gzip, xz compresses files one at a time: it can’t bundle several files into a single archive.

4.1. Compress Many Filesystem Objects

That’s why we usually leverage the tar archiving utility in combination with xz to compress
multiple files or entire directories:
$ tar cJvf archive.tar.xz *.csv

Let’s break down this command:

• c: create a new archive
• f archive.tar.xz: resulting archive name
• *.csv: compress all files with a csv extension in the current directory
• J: sets the compression algorithm to xz
• v: verbosity makes tar show each added and compressed file

Notably, unlike xz and gzip, tar doesn’t delete the input files after it creates the archive.
Which xz compression level does tar pick? It depends on the version of tar, but it’s usually the default
compression level 6.
Still, tar enables setting the compression program through the --use-compress-program option. We
use this option to set the compression level since it accepts command-line arguments. Here, we specify
the low compression level 1:
$ tar cvf archive.tar.xz --use-compress-program='xz -1' *.csv

Notably, we remove the J option because --use-compress-program already sets the compression
program.
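
The two ways of creating an archive can be tried side by side with a sketch like the one below. It assumes GNU tar, which accepts arguments inside --use-compress-program. The file names and the generated data are hypothetical.

```shell
#!/bin/sh
set -eu

# Hypothetical scratch directory with three small CSV files.
workdir=$(mktemp -d)
cd "$workdir"
for name in a b c; do
    awk -v tag="$name" 'BEGIN { for (i = 1; i <= 20000; i++) print tag "," i }' \
        > "$name.csv"
done

# Archive with the J option (tar picks the xz compression level).
tar cJf default.tar.xz *.csv

# Archive with an explicit level passed to xz (assumes GNU tar).
tar cf fast.tar.xz --use-compress-program='xz -1' *.csv

# The input files are still present next to both archives.
ls -l *.csv *.tar.xz

# List the archive members without extracting them.
tar tf fast.tar.xz
```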

4.2. Decompress Archive

Decompressing a tar archive with xz is also a single step and identical to gzip (except for the
different file extension):
$ tar xvf archive.tar.xz

Again, let’s see what each option does:

• f archive.tar.xz: archive for extraction
• x: extract (decompress)
• v: verbosity makes tar show each extracted file

Again, the archive isn’t deleted after the operation. Notably, we don’t have to tell tar to decompress
with xz as tar does this automatically by inspecting the file and detecting the xz compression.
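
As a sketch of the extraction step, the script below unpacks an archive into a separate directory and checks that the files match the originals. The directory name restored and the sample files are assumptions. The -C option, which tells tar to change to a directory before extracting, is a standard tar option not covered above.

```shell
#!/bin/sh
set -eu

# Hypothetical scratch directory and sample files, for illustration only.
workdir=$(mktemp -d)
cd "$workdir"
printf 'id,value\n1,10\n2,20\n' > a.csv
printf 'id,value\n3,30\n' > b.csv
tar cJf archive.tar.xz a.csv b.csv

# Extract into a separate directory so the originals stay untouched.
mkdir restored
tar xvf archive.tar.xz -C restored

# cmp exits with a non-zero status if an extracted file differs.
cmp a.csv restored/a.csv
cmp b.csv restored/b.csv

# The archive itself is still there after extraction.
ls -l archive.tar.xz
```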

5. Faster Compression With Multithreading

Unlike gzip, xz supports multithreading directly, which speeds up compression.

By default, xz 5.4 and older use just a single thread, while newer releases default to multithreading. We can specify the number of threads with the -T option. A
value of 0 tells xz to use one thread for every available CPU core. That’s generally a good default
value to use:
$ xz -vT0 data.csv

We can also set an explicit thread count, such as the 3 in this example:
$ xz -vT3 data.csv

Unlike unpigz, decompression with xz doesn’t benefit from multithreading by default. If we want
to employ faster decompression, we’d have to use multithreaded compression as we did above.
Even then, more than two or three threads don’t usually present much improvement, if any.
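
A rough way to see the effect of the -T option is to time the same compression with different thread counts. The sketch below is only an illustration: the helper time_xz, the sample data, and the choice of level 1 are all assumptions, and on a small input like this one the difference may be too small to measure in whole seconds.

```shell
#!/bin/sh
set -eu

# Hypothetical scratch directory and sample data, for illustration only.
workdir=$(mktemp -d)
awk 'BEGIN { for (i = 1; i <= 500000; i++) print i "," (i * 2) ",row" i }' \
    > "$workdir/data.csv"

# Hypothetical helper: compress to /dev/null with the given xz options
# and report the elapsed wall-clock time in whole seconds.
time_xz() {
    start=$(date +%s)
    xz -c "$@" "$workdir/data.csv" > /dev/null
    end=$(date +%s)
    echo "xz $*: $((end - start)) s"
}

time_xz -1 -T1   # single thread
time_xz -1 -T0   # one thread per available CPU core
```

For meaningful numbers, a much larger input is needed, since xz only spreads work across threads when the file is big enough to be split into several blocks.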

6. Using Multithreading With tar

There are two main ways to use multithreading with tar and xz.

6.1. The --use-compress-program Option

Previously, we specified the compression level with the --use-compress-program option. Now, we
enable multithreading through the same --use-compress-program option by setting the number of
threads with the command-line options.
Here, we again use one thread for every CPU core:
$ tar cvf archive.tar.xz --use-compress-program='xz -1T0' *.csv

While decompression with xz doesn’t benefit from multithreading by default, we can still use the same
options:
$ tar xvf archive.tar.xz --use-compress-program='xz -dT3'

Thus, we again use -d with a specific thread count (3).

6.2. Environment Variables

Another way to set the options for xz is to use the XZ_* environment variables, which xz reads even when tar launches it:

• XZ_DEFAULTS: sets the default options for xz globally
• XZ_OPT: passes options to xz when another executable, such as tar, runs it

So, in general, we use XZ_DEFAULTS in a .bashrc or similar initialization script, while XZ_OPT
generally helps in specific sessions or local scripts.
Let’s see the compression example from earlier with XZ_OPT:
$ XZ_OPT='-T0 -1' tar cJvf archive.tar.xz *.csv

Similarly, we can perform a decompression:

$ XZ_OPT='-d -T0' tar xJvf archive.tar.xz

Notably, we shouldn’t expect much improvement in either case due to the general way the
algorithm works when decompressing.
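
One way to check that XZ_OPT really reaches xz when tar runs it is to create the same archive with two different levels and compare the results. This is a sketch with hypothetical file names and data, and how much the sizes differ depends entirely on the input.

```shell
#!/bin/sh
set -eu

# Hypothetical scratch directory and sample data, for illustration only.
workdir=$(mktemp -d)
cd "$workdir"
awk 'BEGIN { for (i = 1; i <= 100000; i++) print i "," (i * 2) ",row" i }' \
    > data.csv

# Same tar command twice; only the environment variable changes.
XZ_OPT='-0' tar cJf fast.tar.xz data.csv
XZ_OPT='-6' tar cJf small.tar.xz data.csv

# Print both archive sizes in bytes for comparison.
wc -c fast.tar.xz small.tar.xz
```

Because the variable is set only in front of each command, it doesn't leak into the rest of the shell session.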

6.3. Decompression Considerations

Since version 5.4.1, xz provides support for parallel decompression with -T0. Yet, tar consumes the decompressed stream sequentially, so xz has to decode several blocks ahead in parallel. To do this, xz
needs an archive made of independent blocks with their sizes recorded in the block headers, which only multithreaded compression produces.
As a result, if multithreaded decompression is a must, we usually turn to algorithms like Zstd.
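
The difference between single-threaded and multithreaded output can be inspected with xz -l, which reports the number of blocks in each file. The sketch below uses hypothetical file names and forces a small block size with the standard --block-size option so that even a modest input is split into several blocks.

```shell
#!/bin/sh
set -eu

# Hypothetical scratch directory and sample data, for illustration only.
workdir=$(mktemp -d)
awk 'BEGIN { for (i = 1; i <= 400000; i++) print i "," (i * 2) ",row" i }' \
    > "$workdir/data.csv"

# Single-threaded mode: no block splitting is done by default.
xz -1 -T1 -c "$workdir/data.csv" > "$workdir/single.xz"

# Multithreaded mode with small blocks, so the input spans several blocks.
xz -1 -T2 --block-size=1MiB -c "$workdir/data.csv" > "$workdir/multi.xz"

# The Blocks column shows how many blocks each file contains.
xz -l "$workdir/single.xz" "$workdir/multi.xz"

# Both files decompress to the same content; only multi.xz can be
# decoded by several threads at once on xz versions that support it.
xz -dc -T0 "$workdir/multi.xz" | cmp - "$workdir/data.csv"
```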

7. Testing Archive Sizes With xz

As we already noted, xz usually creates smaller archives than gzip.

To test this claim, we used the same 818 MB CSV file, and the same computer with six CPU cores and
hyperthreading. This is the same setup we used to test gzip in Linux.
We compared xz to pigz, a gzip implementation that uses multithreading for faster compression and
decompression:
• both archiving tools saturated the CPU: pigz does this by default, xz because of the -T0 option
• at compression level 7 out of 9, pigz compressed the 818 MB CSV file down to 95 MB in 4
seconds: higher compression levels didn’t produce meaningfully smaller archives
• at compression level 1 out of 9, xz compressed the 818 MB CSV file down to 48 MB in 4
seconds: a 49% smaller result than pigz

With compression level 5, xz produced the smallest archive at 29 MB, which is 69% smaller than
pigz with the same setup. However, xz took nearly 18 times as long at 70 seconds. Compression levels
6 and beyond hugely increased the compression time for a negligible 1% reduction in archive size.
So, we’ve demonstrated that xz does indeed create much smaller archives than gzip, sometimes at the
price of time.
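
A comparison like this can be repeated on other data with a small loop. The script below is a sketch, not the benchmark used above: the generated sample file, the chosen levels, and the whole-second timing are all assumptions, and real measurements need a much larger input.

```shell
#!/bin/sh
set -eu

# Hypothetical scratch directory and sample data, for illustration only.
workdir=$(mktemp -d)
awk 'BEGIN { for (i = 1; i <= 300000; i++) print i "," (i * 2) ",row" i }' \
    > "$workdir/data.csv"
original=$(wc -c < "$workdir/data.csv" | tr -d ' ')

# Compress the same file at several levels and report size and time.
for level in 1 3 5; do
    start=$(date +%s)
    xz -T0 "-$level" -c "$workdir/data.csv" > "$workdir/data.$level.xz"
    end=$(date +%s)
    size=$(wc -c < "$workdir/data.$level.xz" | tr -d ' ')
    echo "level $level: $size of $original bytes in $((end - start)) s"
done
```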
