Towards the Identifiability in Noisy Label Learning: A Multinomial Mixture Approach

Nguyen, Cuong; Do, Thanh-Toan; Carneiro, Gustavo

Abstract:Learning from noisy labels plays an important role in the deep learning era. Despite numerous studies with promising results, identifying clean labels from a noisily-annotated dataset is still challenging since the conventional noisy label learning problem with single noisy label per instance is not identifiable, i.e., it does not theoretically have a unique solution unless one has access to clean labels or introduces additional assumptions. This paper aims to formally investigate such identifiability issue by formulating the noisy label learning problem as a multinomial mixture model, enabling the formulation of the identifiability constraint. In particular, we prove that the noisy label learning problem is identifiable if there are at least $2C - 1$ noisy labels per instance provided, with $C$ being the number of classes. In light of such requirement, we propose a method that automatically generates additional noisy labels per training sample by estimating the noisy label distribution based on nearest neighbours. Such additional noisy labels allow us to apply the Expectation - Maximisation algorithm to estimate the posterior of clean labels. We empirically demonstrate that the proposed method is not only capable of estimating clean labels without any heuristics in several challenging label noise benchmarks, including synthetic, web-controlled and real-world label noises, but also of performing competitively with many state-of-the-art methods.

Comments:	Under peer review
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2301.01405 [cs.LG]
	(or arXiv:2301.01405v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2301.01405

Computer Science > Machine Learning

Title:Towards the Identifiability in Noisy Label Learning: A Multinomial Mixture Approach

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators