End-to-End Rate-Distortion Optimization for Bi-Directional Learned Video Compression

Yilmaz, M. Akin; Tekalp, A. Murat

doi:10.1109/ICIP40778.2020.9190881

arXiv:2008.05028 (eess)

[Submitted on 11 Aug 2020 (v1), last revised 26 May 2021 (this version, v2)]

Abstract: Conventional video compression methods employ a linear transform and a block motion model, and the steps of motion estimation, mode and quantization parameter selection, and entropy coding are optimized individually due to the combinatorial nature of the end-to-end optimization problem. Learned video compression allows end-to-end rate-distortion optimized training of all nonlinear modules, the quantization parameter, and the entropy model simultaneously. While previous work on learned video compression considered training a sequential video codec based on end-to-end optimization of a cost averaged over pairs of successive frames, it is well known in conventional video compression that hierarchical, bi-directional coding outperforms sequential compression. In this paper, we propose for the first time end-to-end optimization of a hierarchical, bi-directional motion compensated learned codec by accumulating the cost function over fixed-size groups of pictures (GOP). Experimental results show that the rate-distortion performance of our proposed learned bi-directional GOP coder outperforms the state-of-the-art end-to-end optimized learned sequential compression, as expected.
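
To make the two ideas in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It shows (1) one possible hierarchical, bi-directional coding order inside a fixed-size GOP and (2) a rate-distortion cost accumulated over all frames of that GOP. The function names, the depth-first traversal, and the per-frame cost form `rate + lam * distortion` are illustrative assumptions; the abstract does not specify them.

```python
from typing import List, Sequence, Tuple


def hierarchical_coding_order(gop_size: int) -> List[Tuple[int, int, int]]:
    """Return (frame, past_ref, future_ref) triples for the in-between frames.

    Assumption: frames 0 and gop_size are already decoded reference frames,
    and each remaining frame is predicted from the midpoint of an interval
    whose two end frames are available (dyadic hierarchy, depth-first).
    """
    order: List[Tuple[int, int, int]] = []

    def split(lo: int, hi: int) -> None:
        if hi - lo < 2:  # no frame strictly between lo and hi
            return
        mid = (lo + hi) // 2
        order.append((mid, lo, hi))  # mid is coded from both neighbours
        split(lo, mid)
        split(mid, hi)

    split(0, gop_size)
    return order


def gop_rd_cost(rates: Sequence[float], distortions: Sequence[float], lam: float) -> float:
    """Accumulate a per-frame cost over the whole GOP.

    Assumption: the per-frame cost is rate + lam * distortion; the actual
    weighting used in the paper may differ.
    """
    if len(rates) != len(distortions):
        raise ValueError("rates and distortions must have the same length")
    return sum(r + lam * d for r, d in zip(rates, distortions))


if __name__ == "__main__":
    for frame, past, future in hierarchical_coding_order(8):
        print(f"frame {frame}: predicted from frames {past} and {future}")
    print(gop_rd_cost([1.0, 2.0], [0.5, 0.25], lam=2.0))
```

In a training loop, the scalar returned by `gop_rd_cost` would play the role of the loss that is back-propagated once per GOP, in contrast to a sequential codec whose cost is averaged over pairs of successive frames.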

Comments:	This work is accepted for publication in IEEE ICIP 2020
Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2008.05028 [eess.IV]
	(or arXiv:2008.05028v2 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2008.05028
Related DOI:	https://doi.org/10.1109/ICIP40778.2020.9190881

Submission history

From: Akin Yilmaz
[v1] Tue, 11 Aug 2020 22:50:06 UTC (9,856 KB)
[v2] Wed, 26 May 2021 19:12:26 UTC (9,856 KB)
