Publications
Citing AttentionLens
If you use AttentionLens or any of this code in your work, please cite the following paper:
@article{sakarvadia2023attention,
title={Attention Lens: A Tool for Mechanistically Interpreting the Attention Head Information Retrieval Mechanism},
author={Sakarvadia, Mansi and Khan, Arham and Ajith, Aswathy and Grzenda, Daniel and Hudson, Nathaniel and Bauer, Andr{\'e} and Chard, Kyle and Foster, Ian},
journal={arXiv preprint arXiv:2310.16270},
year={2023},
note={To appear in the NeurIPS Workshop on Attributing Model Behavior at Scale.}
}