Protecting Users From Themselves: Safeguarding Contextual Privacy in Interactions with Conversational Agents

Ngong, Ivoline; Kadhe, Swanand; Wang, Hao; Murugesan, Keerthiram; Weisz, Justin D.; Dhurandhar, Amit; Ramamurthy, Karthikeyan Natesan

Computer Science > Cryptography and Security

arXiv:2502.18509 (cs)

[Submitted on 22 Feb 2025 (v1), last revised 28 Jul 2025 (this version, v2)]

Title:Protecting Users From Themselves: Safeguarding Contextual Privacy in Interactions with Conversational Agents

Authors:Ivoline Ngong, Swanand Kadhe, Hao Wang, Keerthiram Murugesan, Justin D. Weisz, Amit Dhurandhar, Karthikeyan Natesan Ramamurthy

View PDF HTML (experimental)

Abstract:Conversational agents are increasingly woven into individuals' personal lives, yet users often underestimate the privacy risks associated with them. The moment users share information with these agents-such as large language models (LLMs)-their private information becomes vulnerable to exposure. In this paper, we characterize the notion of contextual privacy for user interactions with LLM-based Conversational Agents (LCAs). It aims to minimize privacy risks by ensuring that users (sender) disclose only information that is both relevant and necessary for achieving their intended goals when interacting with LCAs (untrusted receivers). Through a formative design user study, we observe how even "privacy-conscious" users inadvertently reveal sensitive information through indirect disclosures. Based on insights from this study, we propose a locally deployable framework that operates between users and LCAs, identifying and reformulating out-of-context information in user prompts. Our evaluation using examples from ShareGPT shows that lightweight models can effectively implement this framework, achieving strong gains in contextual privacy while preserving the user's intended interaction goals. Notably, about 76% of participants in our human evaluation preferred the reformulated prompts over the original ones, validating the usability and effectiveness of contextual privacy in our proposed framework. We opensource the code at this https URL.

Comments:	22 pages, 2 figures
Subjects:	Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2502.18509 [cs.CR]
	(or arXiv:2502.18509v2 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2502.18509

Submission history

From: Ivoline Ngong [view email]
[v1] Sat, 22 Feb 2025 09:05:39 UTC (1,056 KB)
[v2] Mon, 28 Jul 2025 02:41:49 UTC (1,054 KB)

Computer Science > Cryptography and Security

Title:Protecting Users From Themselves: Safeguarding Contextual Privacy in Interactions with Conversational Agents

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Cryptography and Security

Title:Protecting Users From Themselves: Safeguarding Contextual Privacy in Interactions with Conversational Agents

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators