Optimizing Error-Bounded Lossy Compression for Scientific Data on GPUs

Tian, Jiannan; Di, Sheng; Yu, Xiaodong; Rivera, Cody; Zhao, Kai; Jin, Sian; Feng, Yunhe; Liang, Xin; Tao, Dingwen; Cappello, Franck

arXiv:2105.12912 (cs)

[Submitted on 27 May 2021 (v1), last revised 3 Sep 2021 (this version, v3)]

Abstract: Error-bounded lossy compression is a critical technique for significantly reducing scientific data volumes. With ever-emerging heterogeneous high-performance computing (HPC) architectures, GPU-accelerated error-bounded compressors (such as cuSZ and cuZFP) have been developed. However, they suffer from either low performance or low compression ratios. To this end, we propose cuSZ+ to target both high compression ratios and high throughputs. We identify that data sparsity and data smoothness are key factors for high compression throughputs. Our key contributions in this work are fourfold: (1) We propose an efficient compression workflow to adaptively perform run-length encoding and/or variable-length encoding. (2) We derive Lorenzo reconstruction in decompression as multidimensional partial-sum computation and propose a fine-grained Lorenzo reconstruction algorithm for GPU architectures. (3) We carefully optimize each of the cuSZ+ kernels by leveraging state-of-the-art CUDA parallel primitives. (4) We evaluate cuSZ+ using seven real-world HPC application datasets on V100 and A100 GPUs. Experiments show cuSZ+ improves the compression throughputs and ratios by up to 18.4X and 5.3X, respectively, over cuSZ on the tested datasets.
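
Contribution (2) rests on the observation that undoing a first-order Lorenzo prediction amounts to a prefix sum along each dimension. The following is a minimal CPU sketch of that identity in 2D, not the cuSZ+ GPU kernel: it assumes integer-valued data, skips error-bounded quantization entirely, and treats out-of-range neighbours as zero. The function names `lorenzo_residuals` and `reconstruct_by_partial_sums` are hypothetical and chosen for illustration only.

```python
from itertools import accumulate


def lorenzo_residuals(data):
    """Return first-order 2D Lorenzo prediction errors for a list of rows.

    Each value is predicted as up + left - up_left; neighbours outside the
    grid are treated as 0 (an assumption made for this sketch).
    """
    rows, cols = len(data), len(data[0])

    def at(i, j):
        return data[i][j] if i >= 0 and j >= 0 else 0

    return [
        [data[i][j] - (at(i - 1, j) + at(i, j - 1) - at(i - 1, j - 1))
         for j in range(cols)]
        for i in range(rows)
    ]


def reconstruct_by_partial_sums(residuals):
    """Recover the data by an inclusive prefix sum along each axis in turn."""
    # Inclusive scan along every row.
    row_scanned = [list(accumulate(row)) for row in residuals]
    # Inclusive scan down every column (transpose, scan, transpose back).
    col_scanned = [list(accumulate(col)) for col in zip(*row_scanned)]
    return [list(row) for row in zip(*col_scanned)]


grid = [
    [3, 4, 4, 5],
    [3, 5, 6, 6],
    [2, 4, 7, 9],
]
residuals = lorenzo_residuals(grid)
restored = reconstruct_by_partial_sums(residuals)
assert restored == grid  # the two scans invert the Lorenzo prediction
```

Because each scan is independent across rows (or columns), this formulation maps naturally onto parallel scan primitives, which is presumably what makes it attractive on GPUs; the sketch above does not attempt to reproduce the fine-grained algorithm the abstract refers to.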

Comments:	12 pages, 3 figures, 7 tables, accepted by IEEE Cluster'21
Subjects:	Distributed, Parallel, and Cluster Computing (cs.DC)
Cite as:	arXiv:2105.12912 [cs.DC]
	(or arXiv:2105.12912v3 [cs.DC] for this version)
	https://doi.org/10.48550/arXiv.2105.12912

Submission history

From: Dingwen Tao
[v1] Thu, 27 May 2021 02:17:04 UTC (10,467 KB)
[v2] Mon, 2 Aug 2021 05:00:28 UTC (457 KB)
[v3] Fri, 3 Sep 2021 18:18:35 UTC (1,386 KB)
