Fabian David Schmidt @fdschmidt - Twitter Profile

Pinned Tweet

almost 2 years ago

Introducing NLLB-LLM2Vec! 🚀 We fuse the NLLB encoder & Llama 3 8B trained w/ LLM2Vec to create NLLB-LLM2Vec which supports cross-lingual NLU in 200+ languages🔥 Joint work w/ Philipp Borchert, @licwu, and @gg42554 during my great research stay at @cambridgeltl

fdschmidt's tweet photo. Introducing NLLB-LLM2Vec! 🚀

We fuse the NLLB encoder & Llama 3 8B trained w/ LLM2Vec to create NLLB-LLM2Vec which supports cross-lingual NLU in 200+ languages🔥

Joint work w/ Philipp Borchert, @licwu, and @gg42554 during my great research stay at @cambridgeltl https://t.co/qcwG7hqdXf

3

100

18

40

13K

fdschmidt retweeted

Tiancheng Hu @ ICLR 2026

@tiancheng_hu

about 2 months ago

SimBench accepted at #ICLR2026! A lot of the time in social simulations, the goal is not to predict what one specific person will say or do. It is to estimate how an entire group will respond, whether in pre-testing a real polling question, or in stress-testing a policy or intervention before running it in the real world.

4

49

7

24

4K

fdschmidt retweeted

Yanai Elazar @yanaiela

about 2 months ago

Are you interested in interning with me and my lab? A unique opportunity for a 4-month research stay, with generous funding as an Azrieli visiting PhD fellow! DM me if you're interested. https://t.co/JHYcGFABo5

9

274

49

130

29K

fdschmidt retweeted

Xing Han Lu @xhluca

about 2 months ago

Frontier LLMs can navigate complex websites, but are expensive and can't run locally. At the same time, small open models can't match the capabilities of commercial APIs. Can we close this gap with synthetic data? To answer this, we built Agent-as-Annotators (A3): a framework for agentic capability distillation, which is inspired by the human annotation process. Our new A3-Qwen3.5-9B model trained on just 2.3K trajectories matches the 3x larger Qwen3.5-27B on WebArena (41.5%) and nearly doubles the previous best open-weight SFT result (21.5%), despite never seeing WebArena tasks in during training. Paper: https://t.co/nLOQDUbt7x

xhluca's tweet photo. Frontier LLMs can navigate complex websites, but are expensive and can't run locally. At the same time, small open models can't match the capabilities of commercial APIs. Can we close this gap with synthetic data?

To answer this, we built Agent-as-Annotators (A3): a framework for agentic capability distillation, which is inspired by the human annotation process. Our new A3-Qwen3.5-9B model trained on just 2.3K trajectories matches the 3x larger Qwen3.5-27B on WebArena (41.5%) and nearly doubles the previous best open-weight SFT result (21.5%), despite never seeing WebArena tasks in during training.

Paper: https://t.co/nLOQDUbt7x

3

44

18

12

4K

Who to follow

Goran Glavaš

@gg42554

Professor for #NLProc @Uni_WUE. Moving to Bluesky: https://t.co/YN6lXxW6ND

Shanshan Xu

@shanshan_xu3

PhD student of Legal Tech @ TU Munich, @sxu3.bsky.social

Marlene Lutz

@mar_lutz

Phd student @ University of Mannheim | Social NLP | she/her

fdschmidt retweeted

Nick Frosst

@nickfrosst

2 months ago

@cohere transcribe Sota open source transcription model running in the browser :) Weights on @huggingface link below

61

1K

128

792

191K

fdschmidt retweeted

Andreea Iana @iana_andreea

2 months ago

📢 2nd Call for Papers 📢 Working on user-centered #news #recsys or their legal & ethical dimensions? 👉 Submit to the 14th @NewsRecWorkshop co-located w/ @UMAPconf in Gothenburg! 🗓️Paper deadline: April 9, 2026 More info: https://t.co/rxHEvq0tBX #INRA2026 #UMAP2026

iana_andreea's tweet photo. 📢 2nd Call for Papers 📢

Working on user-centered #news #recsys or their legal & ethical dimensions?

👉 Submit to the 14th @NewsRecWorkshop co-located w/ @UMAPconf in Gothenburg!

🗓️Paper deadline: April 9, 2026

More info: https://t.co/rxHEvq0tBX

#INRA2026 #UMAP2026 https://t.co/xo0j12PtJD

1

2

1

0

195

fdschmidt retweeted

Siva Reddy

@sivareddyg

3 months ago

LLM2Vec-Gen represents a major paradigm shift for embeddings/retrieval. Why encode the query when the LLM already knows what to look for and can directly produce an embedding for it? Best part: it’s self-supervised, and it does all of this while the LLM remains completely frozen. Think about it: "solve x² + 3x − 4 = 0" has zero reasoning in it. But the LLM's response does. By encoding the response, the embedding captures the reasoning --- and the better the LLM reasons, the better the embedding. This is why our results scale with model size. As LLMs get smarter, our embeddings automatically get better. LLM2Vec-Gen is also the first demonstration of the promise of @ylecun's JEPA for text embeddings. The alignment loss is JEPA — predict in representation space, not token space. The reconstruction loss goes beyond --- it keeps embeddings decodable. This paradigm shift opens new frontiers: 🔬 Can we build a full JEPA for language where the teacher and student are the same LLM? ⚡ Can LLMs reason in compressed space without ever generating text? 🤖 Can agents reason in compression tokens and carry that directly into retrieval? 💬 Can agents talk to each other in compression tokens instead of text --- dense, fast, and still human-readable? LLM2Vec-Gen is a first step toward all four.

sivareddyg's tweet photo. LLM2Vec-Gen represents a major paradigm shift for embeddings/retrieval. Why encode the query when the LLM already knows what to look for and can directly produce an embedding for it?

Best part: it’s self-supervised, and it does all of this while the LLM remains completely frozen.

Think about it: "solve x² + 3x − 4 = 0" has zero reasoning in it. But the LLM's response does. By encoding the response, the embedding captures the reasoning --- and the better the LLM reasons, the better the embedding. This is why our results scale with model size. As LLMs get smarter, our embeddings automatically get better.

LLM2Vec-Gen is also the first demonstration of the promise of @ylecun's JEPA for text embeddings. The alignment loss is JEPA — predict in representation space, not token space. The reconstruction loss goes beyond --- it keeps embeddings decodable.

This paradigm shift opens new frontiers:

🔬 Can we build a full JEPA for language where the teacher and student are the same LLM?

⚡ Can LLMs reason in compressed space without ever generating text?

🤖 Can agents reason in compression tokens and carry that directly into retrieval?

💬 Can agents talk to each other in compression tokens instead of text --- dense, fast, and still human-readable?

LLM2Vec-Gen is a first step toward all four.

7

171

27

131

22K

fdschmidt retweeted

Marius Mosbach @mariusmosbach

3 months ago

Checkout our latest work on building self-supervised text embeddings without relying on contrastive data. ☝️ The main idea behind LLM2Vec-Gen is trying to encode a model's answer to a query, rather than the query itself.

3

27

5

2

2K

fdschmidt retweeted

Vaibhav Adlakha

@vaibhav_adlakha

3 months ago

Your LLM already knows the answer. Why is your embedding model still encoding the question? 🚨Introducing LLM2Vec-Gen: your frozen LLM generates the answer's embedding in a single forward pass — without ever generating the answer. Not only that, the frozen LLM can decode the embedding back into text. 🏆 SOTA self-supervised embeddings 🛡️ Free transfer of instruction-following, safety, and reasoning

5

193

37

121

50K

fdschmidt retweeted

Andreea Iana @iana_andreea

3 months ago

📢 Call for Papers📢 Working on user-centered #news #recsys or their legal & ethical dimensions? 👉 Submit to the 14th @NewsRecWorkshop co-located w/ @UMAPconf in Gothenburg! 🗓️Paper deadline: April 9, 2026 More info: https://t.co/rxHEvq0tBX #INRA2026 #UMAP2026

iana_andreea's tweet photo. 📢 Call for Papers📢

Working on user-centered #news #recsys or their legal & ethical dimensions?

👉 Submit to the 14th @NewsRecWorkshop co-located w/ @UMAPconf in Gothenburg!

🗓️Paper deadline: April 9, 2026

More info: https://t.co/rxHEvq0tBX

#INRA2026 #UMAP2026 https://t.co/JugTCsNRLB

0

2

0

221

fdschmidt retweeted

Marius Mosbach @mariusmosbach

4 months ago

Check out our new preprint on the superficial alignment hypothesis (SAH). 👇 We operationalize the SAH via the length of the shortest program that achieves a certain performance on a task, unifying previous views on the SAH and showing how post-training affects "superficiality".

2

8

2

1

769

fdschmidt retweeted

Cohere Labs

@Cohere_Labs

4 months ago

Introducing ✨Tiny Aya✨, a family of massively multilingual small language models built to run where people actually are. Tiny Aya delivers strong multilingual performance in 70+ global languages in a 3.35B parameter model, efficient enough to run locally, even on a phone.

28

845

155

497

192K

fdschmidt retweeted

Desmond Elliott @delliott

4 months ago

📢I am hiring a highly-motivated Ph.D student at the University of Copenhagen, in Denmark🇩🇰, to work on tokenization-free NLP. See our previous work in this topic: https://t.co/bim6SIRmjF https://t.co/rcfHGbmOo0 https://t.co/xwt7tpI2n6 Apply by March 8: https://t.co/oxf8ACiMzL

delliott's tweet photo. 📢I am hiring a highly-motivated Ph.D student at the University of Copenhagen, in Denmark🇩🇰, to work on tokenization-free NLP.

See our previous work in this topic: https://t.co/bim6SIRmjF
https://t.co/rcfHGbmOo0
https://t.co/xwt7tpI2n6

Apply by March 8: https://t.co/oxf8ACiMzL https://t.co/4c1sr5pK29

3

220

49

100

23K

fdschmidt retweeted

Michael Rizvi-Martel @frisbeemortel

4 months ago

Excited to announce our work on multi-agent systems has been accepted to #ICLR2026! Looking forward to seeing everyone in Rio :) 🇧🇷

1

21

3

2

1K

fdschmidt retweeted

Desmond Elliott @delliott

6 months ago

I am grateful that the Carlsberg Foundation is supporting our basic research on tokenization-free language models at the University of Copenhagen. I will be hiring Ph.D students to start in September 2026. Feel free to reach out early if you want to express informal interest.

1

24

7

2

2K

fdschmidt retweeted

Cohere

@cohere

6 months ago

Introducing our latest breakthrough in AI search and retrieval: Rerank 4! It’s the most advanced set of reranking models on the market, with best-in-class performance across search relevance, speed, deployment flexibility, multilingual support, and domain-specific understanding.

cohere's tweet photo. Introducing our latest breakthrough in AI search and retrieval: Rerank 4!

It’s the most advanced set of reranking models on the market, with best-in-class performance across search relevance, speed, deployment flexibility, multilingual support, and domain-specific understanding. https://t.co/ABsLQq6wGO

11

168

50

34

42K

fdschmidt retweeted

Josip Jukic @chatruncata

6 months ago

Presenting our paper "Disentangling Latent Shifts of In-Context Learning with Weak Supervision" (with Jan Šnajder) at NeurIPS 2025, San Diego: 🗓 Fri, Dec 5 · 11:00–14:00 PST 📍 Exhibit Hall C/D/E · Poster #2615 Paper: https://t.co/q1pbChaHTq #NeurIPS2025

0

7

1

0

692

fdschmidt retweeted

Verna Dankers @vernadankers

7 months ago

Ready for day 3 of #EMNLP2025 🎉🎉 I've been on the lookout for memorization, unlearning, interp, memory module papers & more, chat w me if these topics fascinate you too😻 Looking forward to more of Suzhou, the conf & my BlackboxNLP keynote Sunday 1.45PM! https://t.co/JkJVjmNAm3

vernadankers's tweet photo. Ready for day 3 of #EMNLP2025 🎉🎉 I've been on the lookout for memorization, unlearning, interp, memory module papers & more, chat w me if these topics fascinate you too😻 Looking forward to more of Suzhou, the conf & my BlackboxNLP keynote Sunday 1.45PM! https://t.co/JkJVjmNAm3 https://t.co/4RQaGazbhO

0

57

12

6

5K

fdschmidt retweeted

Mehar Bhatia @bhatia_mehar

7 months ago

🚨How do LLMs acquire human values?🤔 We often point to preference optimization. However, in our new work, we trace how and when model values shift during post-training and uncover surprising dynamics. We ask: How do data, algorithms, and their interaction shape model values?🧵

2

129

49

67

40K

fdschmidt retweeted

Tiancheng Hu @ ICLR 2026

@tiancheng_hu

7 months ago

Instruction tuning unlocks incredible skills in LLMs, but at a cost: they become dangerously overconfident. You face a choice: a well-calibrated base model or a capable but unreliable instruct model. What if you didn't have to choose? What if you could navigate the trade-off? (1/8)

3

14

4

1K

fdschmidt retweeted

Catherine Arnett @linguist_cat

7 months ago

I’m so excited that Global PIQA is out! This has been a herculean effort by our 300+ contributors. The result is an extremely high-quality, culturally-specific benchmark for over 100 languages.

1

35

7

1

5K

Fabian David Schmidt

@fdschmidt

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users