🎉 Exciting news! The new Adapters library for modular and parameter-efficient transfer learning is out! 🤖
Now simplified & disentangled from @huggingface
pip install adapters
pip install transformers
📄https://t.co/YUxmvjAf72
👾 https://t.co/GTekd4MEFS
#EMNLP2023
🧵👇
When you give an LLM a task, and a solution, point it to the solution, and then force it to read the solution...
...we still do not actually solve the task. Not even close to 100%.
Read @LeonEnglaender's important internship work @cohere investigating exploration for agents
LLM agents are assumed to integrate unexpected environmental observations into their reasoning. It turns out they don't.
We added the complete task solution into agent environments as a file or an API endpoint, and measured whether agents act on what they discover. They almost never do.
Starkest example: on AppWorld, gpt-oss-120b sees a CLI command documented as "returns the complete solution to this task" in 97.54% of runs. It calls it in 0.53%. Same pattern for GLM-4.7 and other models, across Terminal-Bench, SWE-Bench, and AppWorld.
📜 https://t.co/lqFuebkOBY
🧵👇
Took Claude up for a spin on the weekend and started a quick open-source self-hosted re-implementation Thinking Machines' Tinker API: https://t.co/AJmLBV2uqx
As always, a huge thanks to our community for the awesome PRs that helped shape this release!
🎉 Read all about v1.2 on our blog: https://t.co/BwySYdB7Lt
💻 Explore the code, try it out & star our repo ⭐: https://t.co/GTekd4MEFS
(5/5)
🚀Adapters v1.2 is out!🚀
We've made Adapters incredibly flexible: Add adapter support to ANY Transformer architecture with minimal code!
We used this to add 8 new models out-of-the-box, incl. ModernBERT, Gemma3 & Qwen3!
Explore this +2 new adapter methods in this thread👇(1/5)
Also new since v1.0:
✅ Added AdapterPlus
✅ Gradient Checkpointing support for memory efficiency
✅ Push & load complex adapter compositions (Stack, Fuse, etc.) directly via the Hugging Face Hub!
These additions make Adapters even more powerful & usable. (4/5)
I am hiring a Student Researcher for our Modularity team at the Google DeepMind office in Zurich🇨🇭
Please fill out the interest form if you would like to work with us! The role would start mid/end 2025 and would be in-person in Zurich with 80-100% at GDM
https://t.co/Vfypj91KHy
🎉M2QA has been accepted to #EMNLP Findings!🎉
M2QA is a new multilingual and multidomain QA dataset. We show that current transfer methods are insufficient and that language & domain transfer aren't independent!
📄 Paper: https://t.co/A23KymqS0b
👇👇👇
https://t.co/yHn5KWrCMQ
👏 Huge thanks to all contributors and our amazing community!
Adapters is an open-source project, and we're excited to see what you build with it and how you use it for your research.
If you have questions or ideas, join the discussion on GitHub!
https://t.co/GTekd4MEFS
🎉Adapters 1.0 is here!🚀
Our open-source library for modular and parameter-efficient fine-tuning got a major upgrade! v1.0 is packed with new features (ReFT, Adapter Merging, QLoRA, ...), new models & improvements!
Blog: https://t.co/Evp8kQG1je
Highlights in the thread! 🧵👇
📢 New preprint 🎉
We - the AdapterHub team - present the M2QA benchmark to evaluate joint domain and language transfer!
🔬 Key highlight: We show that adapter-based methods on small language models can reach the performance of Llama 3 on M2QA! 🚀
👇
📢 New preprint 🎉
We introduce "M2QA: Multi-domain Multilingual Question Answering", a benchmark for evaluating joint language and domain transfer.
We present 5 key findings - one of them: Current transfer methods are insufficient, even for LLMs!
📜https://t.co/PI2AitnxIp
🧵👇
📢 New preprint 🎉
We introduce "M2QA: Multi-domain Multilingual Question Answering", a benchmark for evaluating joint language and domain transfer.
We present 5 key findings - one of them: Current transfer methods are insufficient, even for LLMs!
📜https://t.co/PI2AitnxIp
🧵👇