MATE: Masked Autoencoders are Online 3D Test-Time Learners

Mirza, M. Jehanzeb; Shin, Inkyu; Lin, Wei; Schriebl, Andreas; Sun, Kunyang; Choe, Jaesung; Possegger, Horst; Kozinski, Mateusz; Kweon, In So; Yoon, Kun-Jin; Bischof, Horst

arXiv:2211.11432 (cs)

[Submitted on 21 Nov 2022 (v1), last revised 20 Mar 2023 (this version, v3)]

Abstract: MATE is the first Test-Time-Training (TTT) method designed for 3D data. It makes deep networks trained for point cloud classification robust to distribution shifts occurring in test data. Like existing TTT methods from the 2D image domain, MATE leverages test data for adaptation. Its test-time objective is that of a Masked Autoencoder: a large portion of each test point cloud is removed before it is fed to the network, which is tasked with reconstructing the full point cloud. Once the network is updated, it is used to classify the point cloud. We test MATE on several 3D object classification datasets and show that it significantly improves the robustness of deep networks to several types of corruptions commonly occurring in 3D point clouds. We show that MATE is very efficient in terms of the fraction of points it needs for adaptation: it can adapt effectively given as few as 5% of the tokens of each test sample, making it extremely lightweight. Our experiments show that MATE also achieves competitive performance when adapting sparsely on the test data, which further reduces its computational overhead and makes it ideal for real-time applications.
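The per-sample procedure described in the abstract (mask most tokens, reconstruct, update, then classify) can be sketched roughly as follows. This is an illustrative outline in plain Python, not the authors' implementation. Several things here are assumptions that the abstract does not state: each token is taken to be a small group of 3D points, the reconstruction loss is a Chamfer distance, "adapting sparsely" is read as updating only on every stride-th test sample, and the names split_tokens, chamfer_distance, online_adapt and EchoModel are hypothetical.

```python
import random


def split_tokens(num_tokens, keep_ratio=0.05, rng=random):
    """Randomly split token indices into (visible, masked) lists.

    Roughly keep_ratio of the tokens stay visible; at least one token is
    kept and at least one is masked, so num_tokens must be >= 2.
    """
    num_keep = min(num_tokens - 1, max(1, round(num_tokens * keep_ratio)))
    order = rng.sample(range(num_tokens), num_tokens)  # random permutation
    return sorted(order[:num_keep]), sorted(order[num_keep:])


def chamfer_distance(a, b):
    """Symmetric Chamfer distance between two non-empty lists of 3D points.

    ASSUMPTION: the abstract does not name the reconstruction loss.
    """
    def sq_dist(p, q):
        return sum((pi - qi) ** 2 for pi, qi in zip(p, q))

    def one_way(src, dst):
        return sum(min(sq_dist(p, q) for q in dst) for p in src) / len(src)

    return one_way(a, b) + one_way(b, a)


def online_adapt(stream, model, keep_ratio=0.05, stride=1):
    """Adapt on the test stream, then classify each sample.

    `model` is a hypothetical object exposing reconstruct(), update() and
    classify(). ASSUMPTION: stride > 1 stands in for "adapting sparsely".
    """
    predictions = []
    for step, tokens in enumerate(stream):
        if step % stride == 0:
            visible, masked = split_tokens(len(tokens), keep_ratio)
            recon = model.reconstruct([tokens[i] for i in visible], masked)
            target = [p for i in masked for p in tokens[i]]
            model.update(chamfer_distance(recon, target))
        # Classification uses the (possibly just updated) model.
        predictions.append(model.classify(tokens))
    return predictions


class EchoModel:
    """Hypothetical stand-in for a network: counts updates, learns nothing."""

    def __init__(self):
        self.updates = 0

    def reconstruct(self, visible_tokens, masked_indices):
        # Predict one point per masked token: the centroid of visible points.
        points = [p for tok in visible_tokens for p in tok]
        centroid = tuple(sum(c) / len(points) for c in zip(*points))
        return [centroid] * len(masked_indices)

    def update(self, loss):
        # A real model would take a gradient step on the loss here.
        self.updates += 1

    def classify(self, tokens):
        return 0  # dummy class label
```

In a real setting, EchoModel would be replaced by a trained network whose update() takes a gradient step on the reconstruction loss. With keep_ratio=0.05, only a small share of each sample passes through the encoder during adaptation, which is where the abstract's efficiency claim comes from.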

Comments:	Code is available at this repository: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2211.11432 [cs.CV]
	(or arXiv:2211.11432v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2211.11432

Submission history

From: Muhammad Jehanzeb Mirza
[v1] Mon, 21 Nov 2022 13:19:08 UTC (29,817 KB)
[v2] Thu, 24 Nov 2022 10:52:59 UTC (29,817 KB)
[v3] Mon, 20 Mar 2023 09:44:58 UTC (31,758 KB)
