After a year of restless development, I'm finally happy to announce Inseq, a new tool to democratize post-hoc interpretability of sequence generation models ๐ https://t.co/YlxGuW4bVO #nlproc#xai
Some highlights ๐ 1/
Model Internals-based RAG Evaluation (MIRAGE) ๐ด is accepted to #EMNLP2024 Main!
โก๏ธ To celebrate, here's our new MIRAGE demo combining @InseqLib and Transformers-specific LRP: https://t.co/wPgnFNAJYk.
Reach out if you want to catch up in Miami! ๐ค๐๏ธ
Very hyped for the new beautiful viz that just landed in the @InseqLib main branch! ๐ฅ This will empower users to explore attribution tensors more flexibly and intuitively.
h/t to @_ddjohnson for his awesome work on the treescope toolkit powering this release!
Thanks to the new treescope integration, @InseqLib now supports interactive visualizations for multidimensional attributions (show_granular), token highlights (show_tokens) and improved viz for attribute_context CLI! ๐ Install main, will appear in v0.7
https://t.co/O2PxXBIr4o
By popular demand, the Treescope pretty-printer from the Penzai neural net library can now be installed separately, and supports both JAX and PyTorch!
And that's not all: Penzai itself now has less boilerplate and includes more pretrained Transformer models!
๐ We're thrilled to host Gabriele Sarti (@gsarti_) in our PhD seminar series tomorrow, July 16th, from 12:00-13:00 in Oe67 BU 101! Join us for his talk on interpreting context usage in generative language models, featuring the Inseq toolkit and PECoRe framework. ๐Don't miss it!
The ๐ PECoRe / ๐ด MIRAGE demo on @huggingface Spaces is powered by our new attribute-context CLI command released in v0.6, and allows to export the code to reproduce your results locally with ๐ Inseq.
Check it out โก๏ธ https://t.co/txaCqxgyuP
โ ๏ธ Citations from prompting or NLI seem plausible, but may not faithfully reflect LLM reasoning.
๐๏ธ MIRAGE detects context dependence in generations via model internals, producing granular and faithful RAG citations.
๐ Demo: https://t.co/OMeM32TNoY
Fun collab w/ @Jirui_Qi, @AriannaBisazza & @raquel_dmg! Check it out โฌ๏ธ
Today, we had the first seminar of our #XAI course!
@gsarti_ presented the @InseqLib to interpret LMs and the PECORE framework to identify & attribute context dependence in LMs! ๐๐
Thank you, it was so interesting! ๐ค
Great start to our series!
https://t.co/YY8zpCcUMO
[1/4] Introducing โA Primer on the Inner Workings of Transformer-based Language Modelsโ, a comprehensive survey on interpretability methods and the findings into the functioning of language models they have led to.
ArXiv: https://t.co/UBh2ZLmYkr
@InseqLib v0.6 is out now on PyPI! ๐ฅ
New CLI command for context attribution (@gsarti_), new perturbation-based methods by @hmohebbi75
& @casszzx and optimizations incl. multi-gpu support! โก๏ธ
Huge shoutout to our contributors! โค๏ธ
Release notes โฌ๏ธ
https://t.co/onjSKzeQqY
The official ๐ PECoRe ๐ demo to detect and attribute context dependence in LM generations is now available on @huggingface Spaces! ๐
Includes code examples, a usage guide, useful presets for various dec-only & enc-dec models, and more! Check it out โฌ๏ธ https://t.co/ZgQm3LdqDq
๐ PECoRe repository is now public (https://t.co/cxoyWeMLql) and all model/datasets are available on @huggingface (https://t.co/h6QllPB1Xg)!
๐Interested in using PECoRe on your models? Have a look at the @InseqLib implementation (`inseq attribute-context`)!
Did someone say Mamba support? ๐
(Actually gradient-based methods are not working out-of-the-box because of some in-place variable overwriting, but should be fixable!)
`mamba` is now available in transformers. PEFT finetuning example: https://t.co/2vPQBgA9iu
Thanks @_albertgu and @tri_dao for this brilliant model! ๐ and the amazing `mamba-ssm` kernels powering this!
Excited to present my recent work on @InseqLib and the #ICLR2024 PECoRe interpretability framework for the @SheffieldNLP group this afternoon! Many thanks @casszzx for inviting me! ๐ค
๐ Inseq: https://t.co/YlxGuW4bVO
๐๐ PECoRe: https://t.co/e5takAk53n
Value Zeroing, a faithful approach for analyzing context mixing in Transformers, is now available on @InseqLib main branch for all @huggingface text generation models! ๐
๐Paper introducing VZ: https://t.co/lbSwujzWli
๐VZ in Inseq: https://t.co/sL76YxwdbO
After two years with Poetry, Inseq just moved to @astral_sh's new blazing fast package manager uv! Our CI installation step is now ~80% faster! ๐ฅ
Congrats to @charliermarsh and the team on the release, and godspeed for the cargo-like experience you are planning to build!
Mousavi et al. attribute the generation of dialogue models, finding increased influence for more refined versions of dialogue history https://t.co/tJNn5WahN6 (@mahedmousavi S. Caldarella G. Riccardi @sislab7) 3/
Wang et al. propose a metric for LMs factual reliability and show its relation to modelsโ sensitivity to in-context distractors https://t.co/1FhGXXxqDw (W. Wang @bazril@alexandrabirch1 W. Peng @EdinburghNLP) 7/