BibTeX

@inproceedings{bibal-etal-2022-attention,
    title = "Is Attention Explanation? An Introduction to the Debate",
    author = "Bibal, Adrien and
      Cardon, R{\'e}mi and
      Alfter, David and
      Wilkens, Rodrigo and
      Wang, Xiaoou and
      Fran{\c{c}}ois, Thomas and
      Watrin, Patrick",
    editor = "Muresan, Smaranda and
      Nakov, Preslav and
      Villavicencio, Aline",
    booktitle = "Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
    month = may,
    year = "2022",
    address = "Dublin, Ireland",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2022.acl-long.269",
    doi = "10.18653/v1/2022.acl-long.269",
    pages = "3889--3900",
    abstract = "The performance of deep learning models in NLP and other fields of machine learning has led to a rise in their popularity, and so the need for explanations of these models becomes paramount. Attention has been seen as a solution to increase performance, while providing some explanations. However, a debate has started to cast doubt on the explanatory power of attention in neural networks. Although the debate has created a vast literature thanks to contributions from various areas, the lack of communication is becoming more and more tangible. In this paper, we provide a clear overview of the insights on the debate by critically confronting works from these different areas. This holistic vision can be of great interest for future works in all the communities concerned by this debate. We sum up the main challenges spotted in these areas, and we conclude by discussing the most promising future avenues on attention as an explanation.",
}
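
For scripted reuse of the record, here is a minimal sketch using the third-party bibtexparser package (v1 API); the .bib filename is an assumption, standing in for the entry above saved verbatim to a file:

import bibtexparser

# Load the BibTeX entry above from a file (filename is hypothetical).
with open("bibal-etal-2022-attention.bib") as f:
    db = bibtexparser.load(f)

entry = db.entries[0]      # the single entry in the file, as a dict
print(entry["ID"])         # bibal-etal-2022-attention
print(entry["title"])      # Is Attention Explanation? An Introduction to the Debate
print(entry["pages"])      # 3889--3900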
MODS XML

<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
  <mods ID="bibal-etal-2022-attention">
    <titleInfo>
      <title>Is Attention Explanation? An Introduction to the Debate</title>
    </titleInfo>
    <name type="personal">
      <namePart type="given">Adrien</namePart>
      <namePart type="family">Bibal</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Rémi</namePart>
      <namePart type="family">Cardon</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">David</namePart>
      <namePart type="family">Alfter</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Rodrigo</namePart>
      <namePart type="family">Wilkens</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Xiaoou</namePart>
      <namePart type="family">Wang</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Thomas</namePart>
      <namePart type="family">François</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Patrick</namePart>
      <namePart type="family">Watrin</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <originInfo>
      <dateIssued>2022-05</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
      <titleInfo>
        <title>Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)</title>
      </titleInfo>
      <name type="personal">
        <namePart type="given">Smaranda</namePart>
        <namePart type="family">Muresan</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Preslav</namePart>
        <namePart type="family">Nakov</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Aline</namePart>
        <namePart type="family">Villavicencio</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <originInfo>
        <publisher>Association for Computational Linguistics</publisher>
        <place>
          <placeTerm type="text">Dublin, Ireland</placeTerm>
        </place>
      </originInfo>
      <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>The performance of deep learning models in NLP and other fields of machine learning has led to a rise in their popularity, and so the need for explanations of these models becomes paramount. Attention has been seen as a solution to increase performance, while providing some explanations. However, a debate has started to cast doubt on the explanatory power of attention in neural networks. Although the debate has created a vast literature thanks to contributions from various areas, the lack of communication is becoming more and more tangible. In this paper, we provide a clear overview of the insights on the debate by critically confronting works from these different areas. This holistic vision can be of great interest for future works in all the communities concerned by this debate. We sum up the main challenges spotted in these areas, and we conclude by discussing the most promising future avenues on attention as an explanation.</abstract>
    <identifier type="citekey">bibal-etal-2022-attention</identifier>
    <identifier type="doi">10.18653/v1/2022.acl-long.269</identifier>
    <location>
      <url>https://aclanthology.org/2022.acl-long.269</url>
    </location>
    <part>
      <date>2022-05</date>
      <extent unit="page">
        <start>3889</start>
        <end>3900</end>
      </extent>
    </part>
  </mods>
</modsCollection>
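
The MODS record can be read with Python's standard library alone; a minimal sketch, assuming the XML above is saved to a file (the filename is hypothetical):

import xml.etree.ElementTree as ET

NS = {"m": "http://www.loc.gov/mods/v3"}  # MODS v3 namespace

# Parse the modsCollection above (filename is hypothetical).
root = ET.parse("bibal-etal-2022-attention.xml").getroot()
mods = root.find("m:mods", NS)

title = mods.findtext("m:titleInfo/m:title", namespaces=NS)
doi = mods.findtext("m:identifier[@type='doi']", namespaces=NS)
# Direct <name> children of <mods> are the authors; the editors sit
# one level down, under <relatedItem>, so they are not matched here.
authors = [
    " ".join(part.text for part in name.findall("m:namePart", NS))
    for name in mods.findall("m:name[@type='personal']", NS)
]
print(title, doi, authors)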
Endnote

%0 Conference Proceedings
%T Is Attention Explanation? An Introduction to the Debate
%A Bibal, Adrien
%A Cardon, Rémi
%A Alfter, David
%A Wilkens, Rodrigo
%A Wang, Xiaoou
%A François, Thomas
%A Watrin, Patrick
%Y Muresan, Smaranda
%Y Nakov, Preslav
%Y Villavicencio, Aline
%S Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
%D 2022
%8 May
%I Association for Computational Linguistics
%C Dublin, Ireland
%F bibal-etal-2022-attention
%X The performance of deep learning models in NLP and other fields of machine learning has led to a rise in their popularity, and so the need for explanations of these models becomes paramount. Attention has been seen as a solution to increase performance, while providing some explanations. However, a debate has started to cast doubt on the explanatory power of attention in neural networks. Although the debate has created a vast literature thanks to contributions from various areas, the lack of communication is becoming more and more tangible. In this paper, we provide a clear overview of the insights on the debate by critically confronting works from these different areas. This holistic vision can be of great interest for future works in all the communities concerned by this debate. We sum up the main challenges spotted in these areas, and we conclude by discussing the most promising future avenues on attention as an explanation.
%R 10.18653/v1/2022.acl-long.269
%U https://aclanthology.org/2022.acl-long.269
%U https://doi.org/10.18653/v1/2022.acl-long.269
%P 3889-3900
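
The Endnote/refer record above is line-oriented: a two-character % tag, a space, then the value, with repeatable tags such as %A, %Y, and %U. A minimal parser sketch (the filename in the usage lines is hypothetical):

def parse_refer(text):
    """Collect %-tagged lines into a dict of lists (tags can repeat)."""
    record = {}
    for line in text.splitlines():
        if line.startswith("%") and len(line) > 2:
            tag, value = line[:2], line[3:]
            record.setdefault(tag, []).append(value)
    return record

record = parse_refer(open("bibal-etal-2022-attention.enw").read())
print(record["%T"][0])    # Is Attention Explanation? An Introduction to the Debate
print(len(record["%A"]))  # 7 authors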
Markdown (Informal)

[Is Attention Explanation? An Introduction to the Debate](https://aclanthology.org/2022.acl-long.269) (Bibal et al., ACL 2022)

ACL

Adrien Bibal, Rémi Cardon, David Alfter, Rodrigo Wilkens, Xiaoou Wang, Thomas François, and Patrick Watrin. 2022. Is Attention Explanation? An Introduction to the Debate. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 3889–3900, Dublin, Ireland. Association for Computational Linguistics.