AdapterHub @adapterhub - Twitter Profile

Pinned Tweet

over 2 years ago

🎉 Exciting news! The new Adapters library for modular and parameter-efficient transfer learning is out! 🤖 Now simplified & disentangled from @huggingface pip install adapters pip install transformers 📄https://t.co/YUxmvjAf72 👾 https://t.co/GTekd4MEFS #EMNLP2023 🧵👇

AdapterHub's tweet photo. 🎉 Exciting news! The new Adapters library for modular and parameter-efficient transfer learning is out! 🤖

Now simplified & disentangled from @huggingface

pip install adapters
pip install transformers

📄https://t.co/YUxmvjAf72

👾 https://t.co/GTekd4MEFS

#EMNLP2023

🧵👇 https://t.co/rbHW7hTeoG

7

460

101

264

123K

AdapterHub retweeted

Tom Sherborne @tomsherborne

about 2 months ago

When you give an LLM a task, and a solution, point it to the solution, and then force it to read the solution... ...we still do not actually solve the task. Not even close to 100%. Read @LeonEnglaender's important internship work @cohere investigating exploration for agents

0

9

4

3

1K

AdapterHub retweeted

Leon Engländer

@LeonEnglaender

about 2 months ago

LLM agents are assumed to integrate unexpected environmental observations into their reasoning. It turns out they don't. We added the complete task solution into agent environments as a file or an API endpoint, and measured whether agents act on what they discover. They almost never do. Starkest example: on AppWorld, gpt-oss-120b sees a CLI command documented as "returns the complete solution to this task" in 97.54% of runs. It calls it in 0.53%. Same pattern for GLM-4.7 and other models, across Terminal-Bench, SWE-Bench, and AppWorld. 📜 https://t.co/lqFuebkOBY 🧵👇

LeonEnglaender's tweet photo. LLM agents are assumed to integrate unexpected environmental observations into their reasoning. It turns out they don't.

We added the complete task solution into agent environments as a file or an API endpoint, and measured whether agents act on what they discover. They almost never do.

Starkest example: on AppWorld, gpt-oss-120b sees a CLI command documented as "returns the complete solution to this task" in 97.54% of runs. It calls it in 0.53%. Same pattern for GLM-4.7 and other models, across Terminal-Bench, SWE-Bench, and AppWorld.

📜 https://t.co/lqFuebkOBY

🧵👇

9

140

23

99

15K

AdapterHub retweeted

Clifton Poth @clifapt

3 months ago

Took Claude up for a spin on the weekend and started a quick open-source self-hosted re-implementation Thinking Machines' Tinker API: https://t.co/AJmLBV2uqx

0

7

3

2

352

Who to follow

Nils Reimers

@Nils_Reimers

VP AI Search @Cohere | ex-huggingface | Creator of SBERT (https://t.co/MKKOMfuQ4C)

EMNLP 2026

@emnlpmeeting

EMNLP 2026 - The 2026 Conference on Empirical Methods in Natural Language Processing Hashtag: #EMNLP2026 Dates: October 24 –29 Submission: ACL ARR March and May

EdinburghNLP

@EdinburghNLP

The Natural Language Processing Group at the University of Edinburgh.

AdapterHub @AdapterHub

about 1 year ago

As always, a huge thanks to our community for the awesome PRs that helped shape this release! 🎉 Read all about v1.2 on our blog: https://t.co/BwySYdB7Lt 💻 Explore the code, try it out & star our repo ⭐: https://t.co/GTekd4MEFS (5/5)

0

3

0

95

AdapterHub @AdapterHub

about 1 year ago

🚀Adapters v1.2 is out!🚀 We've made Adapters incredibly flexible: Add adapter support to ANY Transformer architecture with minimal code! We used this to add 8 new models out-of-the-box, incl. ModernBERT, Gemma3 & Qwen3! Explore this +2 new adapter methods in this thread👇(1/5)

AdapterHub's tweet photo. 🚀Adapters v1.2 is out!🚀
We've made Adapters incredibly flexible: Add adapter support to ANY Transformer architecture with minimal code!

We used this to add 8 new models out-of-the-box, incl. ModernBERT, Gemma3 & Qwen3!

Explore this +2 new adapter methods in this thread👇(1/5) https://t.co/IbiHihjzKH

1

22

3

6

3K

AdapterHub @AdapterHub

about 1 year ago

Also new since v1.0: ✅ Added AdapterPlus ✅ Gradient Checkpointing support for memory efficiency ✅ Push & load complex adapter compositions (Stack, Fuse, etc.) directly via the Hugging Face Hub! These additions make Adapters even more powerful & usable. (4/5)

1

0

98

AdapterHub retweeted

Jonas Pfeiffer @PfeiffJo

about 1 year ago

I am hiring a Student Researcher for our Modularity team at the Google DeepMind office in Zurich🇨🇭 Please fill out the interest form if you would like to work with us! The role would start mid/end 2025 and would be in-person in Zurich with 80-100% at GDM https://t.co/Vfypj91KHy

3

295

55

181

41K

AdapterHub @AdapterHub

over 1 year ago

🎁 A new update of the Adapters library is out! Check out all the novelties, changes & fixes here: https://t.co/muMqhP0XzA

0

5

4

0

642

AdapterHub retweeted

UKP Lab @UKPLab

over 1 year ago

🎉M2QA has been accepted to #EMNLP Findings!🎉 M2QA is a new multilingual and multidomain QA dataset. We show that current transfer methods are insufficient and that language & domain transfer aren't independent! 📄 Paper: https://t.co/A23KymqS0b 👇👇👇 https://t.co/yHn5KWrCMQ

0

15

2

1

857

AdapterHub retweeted

Jinghan Zhang @jinghan23

almost 2 years ago

Thank you @AdapterHub for implementing our #NeurIPS method (https://t.co/hW3Sn4IAVF) in your latest update! 🎉 Great to see our work being applied for practical advancements. Check out their work! #MachineLearning #AdapterMerging #ModelMerging

0

11

2

4

1K

AdapterHub @AdapterHub

almost 2 years ago

👏 Huge thanks to all contributors and our amazing community! Adapters is an open-source project, and we're excited to see what you build with it and how you use it for your research. If you have questions or ideas, join the discussion on GitHub! https://t.co/GTekd4MEFS

0

5

0

180

AdapterHub @AdapterHub

almost 2 years ago

🎉Adapters 1.0 is here!🚀 Our open-source library for modular and parameter-efficient fine-tuning got a major upgrade! v1.0 is packed with new features (ReFT, Adapter Merging, QLoRA, ...), new models & improvements! Blog: https://t.co/Evp8kQG1je Highlights in the thread! 🧵👇

2

44

7

18

5K

AdapterHub @AdapterHub

almost 2 years ago

🎙️ New Models Alert! Adapters now supports: - Whisper: Our first audio model! - Mistral - MT5 - PLBart With Whisper, we bring speech recognition capabilities to our library!🔊 Notebook: https://t.co/SjerNfhmRa

1

5

0

1

247

AdapterHub @AdapterHub

almost 2 years ago

📢 New preprint 🎉 We - the AdapterHub team - present the M2QA benchmark to evaluate joint domain and language transfer! 🔬 Key highlight: We show that adapter-based methods on small language models can reach the performance of Llama 3 on M2QA! 🚀 👇

AdapterHub's tweet photo. 📢 New preprint 🎉
We - the AdapterHub team - present the M2QA benchmark to evaluate joint domain and language transfer!

🔬 Key highlight: We show that adapter-based methods on small language models can reach the performance of Llama 3 on M2QA! 🚀

👇 https://t.co/3OoJ1jqmnI

Leon Engländer

@LeonEnglaender

almost 2 years ago

📢 New preprint 🎉 We introduce "M2QA: Multi-domain Multilingual Question Answering", a benchmark for evaluating joint language and domain transfer. We present 5 key findings - one of them: Current transfer methods are insufficient, even for LLMs! 📜https://t.co/PI2AitnxIp 🧵👇

LeonEnglaender's tweet photo. 📢 New preprint 🎉
We introduce "M2QA: Multi-domain Multilingual Question Answering", a benchmark for evaluating joint language and domain transfer.
We present 5 key findings - one of them: Current transfer methods are insufficient, even for LLMs!

📜https://t.co/PI2AitnxIp

🧵👇 https://t.co/ZHpqhRnHll

2

14

2

5

5K

0

8

2

617

AdapterHub retweeted

Leon Engländer

@LeonEnglaender

almost 2 years ago

📢 New preprint 🎉 We introduce "M2QA: Multi-domain Multilingual Question Answering", a benchmark for evaluating joint language and domain transfer. We present 5 key findings - one of them: Current transfer methods are insufficient, even for LLMs! 📜https://t.co/PI2AitnxIp 🧵👇

2

14

2

5

5K

AdapterHub

@AdapterHub

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users