The Label Complexity of Active Learning from Observational Data

Yan, Songbai; Chaudhuri, Kamalika; Javidi, Tara

Statistics > Machine Learning

arXiv:1905.12791 (stat)

[Submitted on 29 May 2019 (v1), last revised 28 Oct 2019 (this version, v2)]

Title:The Label Complexity of Active Learning from Observational Data

Authors:Songbai Yan, Kamalika Chaudhuri, Tara Javidi

View PDF

Abstract:Counterfactual learning from observational data involves learning a classifier on an entire population based on data that is observed conditioned on a selection policy. This work considers this problem in an active setting, where the learner additionally has access to unlabeled examples and can choose to get a subset of these labeled by an oracle.
Prior work on this problem uses disagreement-based active learning, along with an importance weighted loss estimator to account for counterfactuals, which leads to a high label complexity. We show how to instead incorporate a more efficient counterfactual risk minimizer into the active learning algorithm. This requires us to modify both the counterfactual risk to make it amenable to active learning, as well as the active learning process to make it amenable to the risk. We provably demonstrate that the result of this is an algorithm which is statistically consistent as well as more label-efficient than prior work.

Comments:	NeurIPS 2019
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:1905.12791 [stat.ML]
	(or arXiv:1905.12791v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1905.12791

Submission history

From: Songbai Yan [view email]
[v1] Wed, 29 May 2019 23:48:16 UTC (30 KB)
[v2] Mon, 28 Oct 2019 03:03:17 UTC (97 KB)

Statistics > Machine Learning

Title:The Label Complexity of Active Learning from Observational Data

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:The Label Complexity of Active Learning from Observational Data

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators