Marie-Leontine Wörgötter @mlwfee - Twitter Profile

Pinned Tweet

24 days ago

Woke up to these shades of blue this morning 🌊 … and they immediately reminded me of the heatmaps in our paper “There is No Spoon: Existential Presupposition in Large Language Models.”

mlwfee's tweet photo. Woke up to these shades of blue this morning 🌊

… and they immediately reminded me of the heatmaps in our paper “There is No Spoon: Existential Presupposition in Large Language Models.” https://t.co/Nc9KrjWueT

4

3

0

116

Marie-Leontine Wörgötter @mlwfee

24 days ago

I’m excited to present this work today at #LREC2026 here in Mallorca, and I’m looking forward to talking to some of you who are around too! #LLMs #nlproc #pragmatics

0

3

1

0

1K

Marie-Leontine Wörgötter @mlwfee

24 days ago

Woke up to these shades of blue this morning 🌊 … and they immediately reminded me of the heatmaps in our paper “There is No Spoon: Existential Presupposition in Large Language Models.”

4

3

0

116

Marie-Leontine Wörgötter @mlwfee

24 days ago

Check it out: https://t.co/BlznaDEH4g Work with Shikai Lai and @sebschu

0

55

Marie-Leontine Wörgötter @mlwfee

24 days ago

Using an NLI-based probing setup, we compare zero-shot, few-shot and NLI-fine-tuned models, and find that while models show some sensitivity to existential presupposition, the most systematic and theoretically aligned projection patterns emerge after NLI fine-tuning.

mlwfee's tweet photo. Using an NLI-based probing setup, we compare zero-shot, few-shot and NLI-fine-tuned models, and find that while models show some sensitivity to existential presupposition, the most systematic and theoretically aligned projection patterns emerge after NLI fine-tuning. https://t.co/3WT8SQ3wkj

0

41

Marie-Leontine Wörgötter @mlwfee

24 days ago

In this work, we test whether LLMs infer existential presuppositions (implicit assumptions about the existence of discourse referents) and whether these inferences are modulated across syntactic embedding, determiner strength and discourse context.

0

34

mlwfee retweeted

Nicholas Edwards @nedwards99

about 2 months ago

RExBench is now available in Terminal Bench (@harborframework)! 🎉 We integrate 2 tasks (cogs, othello) along with a local testing framework so you can test if your agents can autonomously implement novel AI research extensions.

1

8

2

2K

mlwfee retweeted

Nicholas Edwards @nedwards99

2 months ago

🧵 Do coding agents know when to ask for help? Real-world coding tasks are rarely fully specified, yet most agents are optimized to execute autonomously rather than clarify.

1

7

3

1

1K

mlwfee retweeted

Sarah Breckner @hieristSarah

3 months ago

Diffusion LLMs can think EoS-by-EoS! The higher the generation length, the better the performance of Masked Diffusion LLMs, even though they generate the same amount of words and only augment them with more and more EoS tokens 👀

hieristSarah's tweet photo. Diffusion LLMs can think EoS-by-EoS!

The higher the generation length, the better the performance of Masked Diffusion LLMs, even though they generate the same amount of words and only augment them with more and more EoS tokens 👀 https://t.co/byyxMCZ4di

1

4

3

0

303

Marie-Leontine Wörgötter @mlwfee

6 months ago

@j_foerst Or rather linguistics 🥲

0

37

mlwfee retweeted

Rohan Pandey

@khoomeik

8 months ago

honestly surprising that you don’t see the linguists celebrating in the LLM era they didn’t make much progress but at least they were studying the right thing if i were a real linguist, i’d be bragging all the time about having modeled language for decades before LLMs

18

152

4

17

14K

mlwfee retweeted

Sebastien Bubeck

@SebastienBubeck

10 months ago

Claim: gpt-5-pro can prove new interesting mathematics. Proof: I took a convex optimization paper with a clean open problem in it and asked gpt-5-pro to work on it. It proved a better bound than what is in the paper, and I checked the proof it's correct. Details below.

SebastienBubeck's tweet photo. Claim: gpt-5-pro can prove new interesting mathematics.

Proof: I took a convex optimization paper with a clean open problem in it and asked gpt-5-pro to work on it. It proved a better bound than what is in the paper, and I checked the proof it's correct.

Details below. https://t.co/eNEGqyZG0L

305

8K

1K

3K

7M

Marie-Leontine Wörgötter @mlwfee

almost 2 years ago

@gunsnrosesgirl3 Spirit animal

0

17

Marie-Leontine Wörgötter

@mlwfee

Last Seen Users on Sotwe

Trends for you

Most Popular Users