A Survey of Machine Unlearning

Nguyen, Thanh Tam; Huynh, Thanh Trung; Ren, Zhao; Nguyen, Phi Le; Liew, Alan Wee-Chung; Yin, Hongzhi; Nguyen, Quoc Viet Hung

Computer Science > Machine Learning

arXiv:2209.02299v2 (cs)

[Submitted on 6 Sep 2022 (v1), revised 7 Sep 2022 (this version, v2), latest version 17 Sep 2024 (v6)]

Title:A Survey of Machine Unlearning

Authors:Thanh Tam Nguyen, Thanh Trung Huynh, Zhao Ren, Phi Le Nguyen, Alan Wee-Chung Liew, Hongzhi Yin, Quoc Viet Hung Nguyen

View PDF

Abstract:Computer systems hold a large amount of personal data over decades. On the one hand, such data abundance allows breakthroughs in artificial intelligence (AI), especially machine learning (ML) models. On the other hand, it can threaten the privacy of users and weaken the trust between humans and AI. Recent regulations require that private information about a user can be removed from computer systems in general and from ML models in particular upon request (e.g. the "right to be forgotten"). While removing data from back-end databases should be straightforward, it is not sufficient in the AI context as ML models often "remember" the old data. Existing adversarial attacks proved that we can learn private membership or attributes of the training data from the trained models. This phenomenon calls for a new paradigm, namely machine unlearning, to make ML models forget about particular data. It turns out that recent works on machine unlearning have not been able to solve the problem completely due to the lack of common frameworks and resources. In this survey paper, we seek to provide a thorough investigation of machine unlearning in its definitions, scenarios, mechanisms, and applications. Specifically, as a categorical collection of state-of-the-art research, we hope to provide a broad reference for those seeking a primer on machine unlearning and its various formulations, design requirements, removal requests, algorithms, and uses in a variety of ML applications. Furthermore, we hope to outline key findings and trends in the paradigm as well as highlight new areas of research that have yet to see the application of machine unlearning, but could nonetheless benefit immensely. We hope this survey provides a valuable reference for ML researchers as well as those seeking to innovate privacy technologies. Our resources are at this https URL.

Comments:	arXiv admin note: text overlap with arXiv:2109.13398, arXiv:2109.08266 by other authors
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2209.02299 [cs.LG]
	(or arXiv:2209.02299v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2209.02299

Submission history

From: Thanh Tam Nguyen [view email]
[v1] Tue, 6 Sep 2022 08:51:53 UTC (734 KB)
[v2] Wed, 7 Sep 2022 10:36:35 UTC (734 KB)
[v3] Thu, 8 Sep 2022 16:52:04 UTC (735 KB)
[v4] Mon, 12 Sep 2022 12:49:14 UTC (658 KB)
[v5] Fri, 21 Oct 2022 12:34:14 UTC (658 KB)
[v6] Tue, 17 Sep 2024 11:55:58 UTC (684 KB)

Computer Science > Machine Learning

Title:A Survey of Machine Unlearning

Submission history

Access Paper:

References & Citations

1 blog link

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:A Survey of Machine Unlearning

Submission history

Access Paper:

References & Citations

1 blog link

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators