Removing Spurious Features can Hurt Accuracy and Affect Groups Disproportionately

Khani, Fereshte; Liang, Percy

Computer Science > Machine Learning

arXiv:2012.04104 (cs)

[Submitted on 7 Dec 2020]

Title:Removing Spurious Features can Hurt Accuracy and Affect Groups Disproportionately

Authors:Fereshte Khani, Percy Liang

View PDF

Abstract:The presence of spurious features interferes with the goal of obtaining robust models that perform well across many groups within the population. A natural remedy is to remove spurious features from the model. However, in this work we show that removal of spurious features can decrease accuracy due to the inductive biases of overparameterized models. We completely characterize how the removal of spurious features affects accuracy across different groups (more generally, test distributions) in noiseless overparameterized linear regression. In addition, we show that removal of spurious feature can decrease the accuracy even in balanced datasets -- each target co-occurs equally with each spurious feature; and it can inadvertently make the model more susceptible to other spurious features. Finally, we show that robust self-training can remove spurious features without affecting the overall accuracy. Experiments on the Toxic-Comment-Detectoin and CelebA datasets show that our results hold in non-linear models.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Machine Learning (stat.ML)
Cite as:	arXiv:2012.04104 [cs.LG]
	(or arXiv:2012.04104v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2012.04104

Submission history

From: Fereshte Khani [view email]
[v1] Mon, 7 Dec 2020 23:08:59 UTC (1,359 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2020-12

Change to browse by:

cs
cs.AI
cs.CY
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Fereshte Khani
Percy Liang

export BibTeX citation

Computer Science > Machine Learning

Title:Removing Spurious Features can Hurt Accuracy and Affect Groups Disproportionately

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Removing Spurious Features can Hurt Accuracy and Affect Groups Disproportionately

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators