Learning to Estimate Hidden Motions with Global Motion Aggregation

Jiang, Shihao; Campbell, Dylan; Lu, Yao; Li, Hongdong; Hartley, Richard

Computer Science > Computer Vision and Pattern Recognition

arXiv:2104.02409 (cs)

[Submitted on 6 Apr 2021 (v1), last revised 29 Jul 2021 (this version, v3)]

Title:Learning to Estimate Hidden Motions with Global Motion Aggregation

Authors:Shihao Jiang, Dylan Campbell, Yao Lu, Hongdong Li, Richard Hartley

View PDF

Abstract:Occlusions pose a significant challenge to optical flow algorithms that rely on local evidences. We consider an occluded point to be one that is imaged in the first frame but not in the next, a slight overloading of the standard definition since it also includes points that move out-of-frame. Estimating the motion of these points is extremely difficult, particularly in the two-frame setting. Previous work relies on CNNs to learn occlusions, without much success, or requires multiple frames to reason about occlusions using temporal smoothness. In this paper, we argue that the occlusion problem can be better solved in the two-frame case by modelling image self-similarities. We introduce a global motion aggregation module, a transformer-based approach to find long-range dependencies between pixels in the first image, and perform global aggregation on the corresponding motion features. We demonstrate that the optical flow estimates in the occluded regions can be significantly improved without damaging the performance in non-occluded regions. This approach obtains new state-of-the-art results on the challenging Sintel dataset, improving the average end-point error by 13.6% on Sintel Final and 13.7% on Sintel Clean. At the time of submission, our method ranks first on these benchmarks among all published and unpublished approaches. Code is available at this https URL

Comments:	Accepted to ICCV 2021
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2104.02409 [cs.CV]
	(or arXiv:2104.02409v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2104.02409

Submission history

From: Shihao Jiang [view email]
[v1] Tue, 6 Apr 2021 10:32:03 UTC (47,500 KB)
[v2] Mon, 26 Jul 2021 22:56:24 UTC (25,606 KB)
[v3] Thu, 29 Jul 2021 20:59:31 UTC (25,605 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Learning to Estimate Hidden Motions with Global Motion Aggregation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Learning to Estimate Hidden Motions with Global Motion Aggregation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators