Information Theoretic Evaluation of Privacy-Leakage, Interpretability, and Transferability for Trustworthy AI

Kumar, Mohit; Moser, Bernhard A.; Fischer, Lukas; Freudenthaler, Bernhard

arXiv:2106.06046 (cs)

[Submitted on 6 Jun 2021 (v1), last revised 12 Apr 2022 (this version, v5)]

Abstract: In order to develop machine learning and deep learning models that take into account the guidelines and principles of trustworthy AI, a novel information theoretic trustworthy AI framework is introduced. A unified approach to "privacy-preserving interpretable and transferable learning" is considered for studying and optimizing the tradeoffs between privacy, interpretability, and transferability aspects. A variational membership-mapping Bayesian model is used for the analytical approximations of the defined information theoretic measures for privacy-leakage, interpretability, and transferability. The approach consists of approximating the information theoretic measures via maximizing a lower-bound using variational optimization. The study presents a unified information theoretic approach to study different aspects of trustworthy AI in a rigorous analytical manner. The approach is demonstrated through numerous experiments on benchmark datasets and a real-world biomedical application concerned with the detection of mental stress on individuals using heart rate variability analysis.

Comments:	arXiv admin note: text overlap with arXiv:2105.04615, arXiv:2104.07060
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
Cite as:	arXiv:2106.06046 [cs.LG]
	(or arXiv:2106.06046v5 [cs.LG] for this version)
 	https://doi.org/10.48550/arXiv.2106.06046

Submission history

From: Mohit Kumar
[v1] Sun, 6 Jun 2021 09:47:06 UTC (340 KB)
[v2] Mon, 14 Jun 2021 05:11:58 UTC (356 KB)
[v3] Tue, 13 Jul 2021 10:42:00 UTC (344 KB)
[v4] Mon, 7 Feb 2022 15:00:37 UTC (749 KB)
[v5] Tue, 12 Apr 2022 12:51:38 UTC (681 KB)
