[1/4] Introducing “A Primer on the Inner Workings of Transformer-based Language Models”, a comprehensive survey on interpretability methods and the findings into the functioning of language models they have led to.
ArXiv: https://t.co/UBh2ZLmYkr
Honored to share that I’ve been granted the ICTWoman of the year 2022 award for university students, by the Govt. of Catalonia @tic. This award recognizes women with remarkable careers, to foster girls vocations in ICT. Big thanks to everyone that has helped me get here!☺️
Happy to share “On the Locality of Attention in Direct Speech Translation”💬🔍 accepted at the ACL-SRW!
We use interpretability techniques to
propose an efficient architecture that
matches the baseline performance while reducing computational cost.
https://t.co/nTduu8pcAB
Happy to share our work on interpretability of the Transformer. Our proposed method, ALTI (Aggregation of Layer-wise Token-to-token Interactions), accurately measures how contextual information is mixed across the Transformer. https://t.co/97cLypEH6e
We are so delighted that our paper 'Interpreting Gender Bias in Neural Machine Translation: The Multilingual Architecture Matters' has been accepted at #AAAI2022.
We are very pleased to announce that our paper “Attention Weights in Transformer NMT Fail Aligning Words Between Sequences but Largely Explain Model Predictions” by @javifer_96 and @costajussamarta has been accepted at #EMNLP2021 as a Findings paper
Do not miss our blogs about The Transformer, Neural Machine Translation, Multilingual MT, End-to-End Speech Translation, Unsupervised NMT, Factored NMT, Interpretability in NMT, Gender bias in NLP, Lifelong Learning and Knowledge Generation and Discovery. https://t.co/Fmgd5C5wbr