Generalized Gaussian Model for Learned Image Compression

Zhang, Haotian; Li, Li; Liu, Dong

arXiv:2411.19320 (eess)

[Submitted on 28 Nov 2024]

Abstract: In learned image compression, probabilistic models play an essential role in characterizing the distribution of latent variables. The Gaussian model with mean and scale parameters has been widely used for its simplicity and effectiveness. Probabilistic models with more parameters, such as Gaussian mixture models, can fit the distribution of latent variables more precisely, but at the cost of higher complexity. To balance compression performance and complexity, we extend the Gaussian model to the generalized Gaussian model for more flexible latent distribution modeling, introducing only one additional parameter, the shape parameter beta, compared with the Gaussian model. To enhance the performance of the generalized Gaussian model by alleviating the train-test mismatch, we propose improved training methods, including beta-dependent lower bounds for scale parameters and gradient rectification. Our proposed generalized Gaussian model, coupled with the improved training methods, is demonstrated to outperform the Gaussian and Gaussian mixture models on a variety of learned image compression methods.
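To make the role of the shape parameter concrete, here is a minimal sketch of a generalized Gaussian density and the bit cost of a rounded latent under it. It assumes the common textbook parameterization f(x) = beta / (2 * alpha * Gamma(1/beta)) * exp(-(|x - mu| / alpha)^beta), which may differ from the one used in the paper. The names `gg_pdf`, `bin_mass`, and `rate_bits` are hypothetical, and the bin probability is computed with Simpson's rule as a stand-in for a closed-form CDF. This is not the authors' implementation, and it does not cover their training methods.

```python
import math


def gg_pdf(x, mu=0.0, alpha=1.0, beta=2.0):
    """Generalized Gaussian density (assumed textbook parameterization).

    mu is the location, alpha the scale, beta the shape.
    beta = 2 with alpha = sqrt(2) * sigma gives a Gaussian;
    beta = 1 gives a Laplacian with scale alpha.
    """
    norm = beta / (2.0 * alpha * math.gamma(1.0 / beta))
    return norm * math.exp(-((abs(x - mu) / alpha) ** beta))


def bin_mass(k, mu=0.0, alpha=1.0, beta=2.0, steps=200):
    """Probability that a sample rounds to the integer k.

    Integrates the density over [k - 0.5, k + 0.5] with composite
    Simpson's rule; steps must be even.
    """
    a, b = k - 0.5, k + 0.5
    h = (b - a) / steps
    total = gg_pdf(a, mu, alpha, beta) + gg_pdf(b, mu, alpha, beta)
    for i in range(1, steps):
        weight = 4.0 if i % 2 else 2.0
        total += weight * gg_pdf(a + i * h, mu, alpha, beta)
    return total * h / 3.0


def rate_bits(k, mu=0.0, alpha=1.0, beta=2.0):
    """Ideal code length in bits for the rounded symbol k."""
    return -math.log2(bin_mass(k, mu, alpha, beta))


if __name__ == "__main__":
    # Compare the cost of the same symbols under different shapes.
    for beta in (1.0, 1.5, 2.0):
        costs = [rate_bits(k, beta=beta) for k in (0, 1, 3)]
        print(beta, [round(c, 3) for c in costs])
```

Under this parameterization, lowering beta below 2 produces a sharper peak and heavier tails than a Gaussian with the same alpha, which is the extra flexibility the single shape parameter provides.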

Comments:	13 pages, 12 figures
Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2411.19320 [eess.IV]
	(or arXiv:2411.19320v1 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2411.19320

Submission history

From: Haotian Zhang
[v1] Thu, 28 Nov 2024 18:51:55 UTC (13,793 KB)
