Towards Quantifying The Privacy Of Redacted Text

Gusain, Vaibhav; Leith, Douglas

doi:10.1007/978-3-031-28238-6_32

Computer Science > Machine Learning

arXiv:2410.07772 (cs)

[Submitted on 10 Oct 2024]

Title:Towards Quantifying The Privacy Of Redacted Text

Authors:Vaibhav Gusain, Douglas Leith

View PDF HTML (experimental)

Abstract:In this paper we propose use of a k-anonymity-like approach for evaluating the privacy of redacted text. Given a piece of redacted text we use a state of the art transformer-based deep learning network to reconstruct the original text. This generates multiple full texts that are consistent with the redacted text, i.e. which are grammatical, have the same non-redacted words etc, and represents each of these using an embedding vector that captures sentence similarity. In this way we can estimate the number, diversity and quality of full text consistent with the redacted text and so evaluate privacy.

Comments:	Accepted in ECIR'23
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2410.07772 [cs.LG]
	(or arXiv:2410.07772v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2410.07772
Journal reference:	LNCS,volume 13981, 2023, 423-429
Related DOI:	https://doi.org/10.1007/978-3-031-28238-6_32

Submission history

From: Vaibhav Gusain [view email]
[v1] Thu, 10 Oct 2024 10:00:27 UTC (481 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2024-10

Change to browse by:

References & Citations

export BibTeX citation

Computer Science > Machine Learning

Title:Towards Quantifying The Privacy Of Redacted Text

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Towards Quantifying The Privacy Of Redacted Text

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators