Minority Class Oriented Active Learning for Imbalanced Datasets

Aggarwal, Umang; Popescu, Adrian; Hudelot, Céline

doi:10.1109/ICPR48806.2021.9412182

Computer Science > Machine Learning

arXiv:2202.00390 (cs)

[Submitted on 1 Feb 2022]

Title:Minority Class Oriented Active Learning for Imbalanced Datasets

Authors:Umang Aggarwal, Adrian Popescu, Céline Hudelot

View PDF

Abstract:Active learning aims to optimize the dataset annotation process when resources are constrained. Most existing methods are designed for balanced datasets. Their practical applicability is limited by the fact that a majority of real-life datasets are actually imbalanced. Here, we introduce a new active learning method which is designed for imbalanced datasets. It favors samples likely to be in minority classes so as to reduce the imbalance of the labeled subset and create a better representation for these classes. We also compare two training schemes for active learning: (1) the one commonly deployed in deep active learning using model fine tuning for each iteration and (2) a scheme which is inspired by transfer learning and exploits generic pre-trained models and train shallow classifiers for each iteration. Evaluation is run with three imbalanced datasets. Results show that the proposed active learning method outperforms competitive baselines. Equally interesting, they also indicate that the transfer learning training scheme outperforms model fine tuning if features are transferable from the generic dataset to the unlabeled one. This last result is surprising and should encourage the community to explore the design of deep active learning methods.

Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2202.00390 [cs.LG]
	(or arXiv:2202.00390v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2202.00390
Journal reference:	2020 25th International Conference on Pattern Recognition (ICPR)
Related DOI:	https://doi.org/10.1109/ICPR48806.2021.9412182

Submission history

From: Umang Aggarwal Mr [view email]
[v1] Tue, 1 Feb 2022 13:13:41 UTC (887 KB)

Computer Science > Machine Learning

Title:Minority Class Oriented Active Learning for Imbalanced Datasets

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Minority Class Oriented Active Learning for Imbalanced Datasets

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators