Democratizing Advanced Attribution Analyses of Generative Language Models with the Inseq Toolkit

Gabriele Sarti*, Nils Feldhus, Jirui Qi, Malvina Nissim, Arianna Bisazza

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

4 Downloads (Pure)

Abstract

Inseq1 is a recent toolkit providing an intuitive and optimized interface to conduct feature attribution analyses of generative language models. In this work, we present the latest improvements to the library, including efforts to simplify the attribution of large language models on consumer hardware, additional attribution approaches, and a new client command to detect and attribute context usage in language model generations. We showcase an online demo using Inseq as an attribution backbone for context reliance analysis, and we highlight interesting contextual patterns in language model generations. Ultimately, this release furthers Inseq’s mission of centralizing good interpretability practices and enabling fair and reproducible model evaluations.

Original languageEnglish
Title of host publicationxAI-2024 Late-breaking Work, Demos and Doctoral Consortium Joint Proceedings
EditorsLuca Longo, Weiru Liu, Grégoire Montavon
PublisherCEUR Workshop Proceedings (CEUR-WS.org)
Pages289-296
Number of pages8
Publication statusPublished - 2024
EventJoint of the 2nd World Conference on eXplainable Artificial Intelligence Late-Breaking Work, Demos and Doctoral Consortium, xAI-2024:LB/D/DC - Valletta, Malta
Duration: 17-Jul-202419-Jul-2024

Publication series

NameCEUR Workshop Proceedings
Volume3793
ISSN (Print)1613-0073

Conference

ConferenceJoint of the 2nd World Conference on eXplainable Artificial Intelligence Late-Breaking Work, Demos and Doctoral Consortium, xAI-2024:LB/D/DC
Country/TerritoryMalta
CityValletta
Period17/07/202419/07/2024

Keywords

  • Feature Attribution
  • Generative Language Models
  • Natural Language Processing
  • Python Toolkit

Fingerprint

Dive into the research topics of 'Democratizing Advanced Attribution Analyses of Generative Language Models with the Inseq Toolkit'. Together they form a unique fingerprint.

Cite this