Flow-Guided Sparse Transformer for Video Deblurring

Lin, Jing; Cai, Yuanhao; Hu, Xiaowan; Wang, Haoqian; Yan, Youliang; Zou, Xueyi; Ding, Henghui; Zhang, Yulun; Timofte, Radu; Van Gool, Luc

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2201.01893 (eess)

[Submitted on 6 Jan 2022 (v1), last revised 29 May 2022 (this version, v3)]

Title:Flow-Guided Sparse Transformer for Video Deblurring

Authors:Jing Lin, Yuanhao Cai, Xiaowan Hu, Haoqian Wang, Youliang Yan, Xueyi Zou, Henghui Ding, Yulun Zhang, Radu Timofte, Luc Van Gool

View PDF

Abstract:Exploiting similar and sharper scene patches in spatio-temporal neighborhoods is critical for video deblurring. However, CNN-based methods show limitations in capturing long-range dependencies and modeling non-local self-similarity. In this paper, we propose a novel framework, Flow-Guided Sparse Transformer (FGST), for video deblurring. In FGST, we customize a self-attention module, Flow-Guided Sparse Window-based Multi-head Self-Attention (FGSW-MSA). For each $query$ element on the blurry reference frame, FGSW-MSA enjoys the guidance of the estimated optical flow to globally sample spatially sparse yet highly related $key$ elements corresponding to the same scene patch in neighboring frames. Besides, we present a Recurrent Embedding (RE) mechanism to transfer information from past frames and strengthen long-range temporal dependencies. Comprehensive experiments demonstrate that our proposed FGST outperforms state-of-the-art (SOTA) methods on both DVD and GOPRO datasets and even yields more visually pleasing results in real video deblurring. Code and pre-trained models are publicly available at this https URL

Comments:	ICML 2022; The First Transformer-based method for Video Deblurring
Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2201.01893 [eess.IV]
	(or arXiv:2201.01893v3 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2201.01893

Submission history

From: Yuanhao Cai [view email]
[v1] Thu, 6 Jan 2022 02:05:32 UTC (27,123 KB)
[v2] Fri, 20 May 2022 13:06:41 UTC (27,131 KB)
[v3] Sun, 29 May 2022 07:58:48 UTC (26,882 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:Flow-Guided Sparse Transformer for Video Deblurring

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:Flow-Guided Sparse Transformer for Video Deblurring

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators