Efficiently Computing Susceptibility to Context in Language Models

Liu, Tianyu; Du, Kevin; Sachan, Mrinmaya; Cotterell, Ryan

Computer Science > Computation and Language

arXiv:2410.14361 (cs)

[Submitted on 18 Oct 2024]

Title:Efficiently Computing Susceptibility to Context in Language Models

Authors:Tianyu Liu, Kevin Du, Mrinmaya Sachan, Ryan Cotterell

View PDF HTML (experimental)

Abstract:One strength of modern language models is their ability to incorporate information from a user-input context when answering queries. However, they are not equally sensitive to the subtle changes to that context. To quantify this, Du et al. (2024) gives an information-theoretic metric to measure such sensitivity. Their metric, susceptibility, is defined as the degree to which contexts can influence a model's response to a query at a distributional level. However, exactly computing susceptibility is difficult and, thus, Du et al. (2024) falls back on a Monte Carlo approximation. Due to the large number of samples required, the Monte Carlo approximation is inefficient in practice. As a faster alternative, we propose Fisher susceptibility, an efficient method to estimate the susceptibility based on Fisher information. Empirically, we validate that Fisher susceptibility is comparable to Monte Carlo estimated susceptibility across a diverse set of query domains despite its being $70\times$ faster. Exploiting the improved efficiency, we apply Fisher susceptibility to analyze factors affecting the susceptibility of language models. We observe that larger models are as susceptible as smaller ones.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2410.14361 [cs.CL]
	(or arXiv:2410.14361v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2410.14361

Submission history

From: Tianyu Liu [view email]
[v1] Fri, 18 Oct 2024 10:40:47 UTC (332 KB)

Computer Science > Computation and Language

Title:Efficiently Computing Susceptibility to Context in Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Efficiently Computing Susceptibility to Context in Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators