Germans Savcisens @NeurIPS'25 @germansave - Twitter Profile

Pinned Tweet

Germans Savcisens @NeurIPS'25 @germansave

over 2 years ago

My data viz image was picked for a cover of the Jan 2024 issue of the Nat Comput Sci journal 😋

Nature Computational Science @NatComputSci

over 2 years ago

📢Our January issue is now live! Highlights include a model for predicting life outcomes, a Perspective on the advantages of language models for quantum simulation, and a protein language model for signal peptide prediction. 👉https://t.co/IWoHZnLVkk

NatComputSci's tweet photo. 📢Our January issue is now live! Highlights include a model for predicting life outcomes, a Perspective on the advantages of language models for quantum simulation, and a protein language model for signal peptide prediction.

👉https://t.co/IWoHZnLVkk https://t.co/s4BozJHvGk

0

32

5

6

22K

3

51

8

4

6K

germansave retweeted

David Chanin @chanindav

4 months ago

SAEs fail even when the Linear Representation Hypothesis holds perfectly. We built SynthSAEBench: large-scale synthetic data with 16k ground-truth features, correlation, hierarchy, and superposition. We trained 5 SAE architectures on it. None achieve perfect feature recovery.

chanindav's tweet photo. SAEs fail even when the Linear Representation Hypothesis holds perfectly.

We built SynthSAEBench: large-scale synthetic data with 16k ground-truth features, correlation, hierarchy, and superposition. We trained 5 SAE architectures on it.

None achieve perfect feature recovery. https://t.co/BzSp8A1i40

5

213

26

128

10K

germansave retweeted

Mor Geva

@megamor2

4 months ago

Still using SAEs? It's time to move on from dictionary learning to✨local geometry✨ https://t.co/93P0OxIC8G @OrShafran Shaked Ronen @OmriFahn @ravfogel @atticus_geiger

3

197

17

159

25K

Germans Savcisens @NeurIPS'25 @germansave

4 months ago

We should create a github list of "Not so Awesome Papers with Hallucinated References," since @NeurIPSConf refuses to retract any of them.

Alex Cui

@alexcdot

5 months ago

Okay so, we just found that over 50 papers published at @Neurips 2025 have AI hallucinations I don't think people realize how bad the slop is right now It's not just that researchers from @GoogleDeepMind, @Meta, @MIT, @Cambridge_Uni are using AI - they allowed LLMs to generate hallucinations in their papers and didn't notice at all. It's insane that these made it through peer review👇

alexcdot's tweet photo. Okay so, we just found that over 50 papers published at @Neurips 2025 have AI hallucinations

I don't think people realize how bad the slop is right now

It's not just that researchers from @GoogleDeepMind, @Meta, @MIT, @Cambridge_Uni are using AI - they allowed LLMs to generate hallucinations in their papers and didn't notice at all.

It's insane that these made it through peer review👇

280

6K

1K

2K

1M

0

1

0

64

Who to follow

Anshuk Uppal

@sigmabayesian

PhD student @DTUtweet. Probabilistic ML 🧠 diffusion and sampling🧠. previously intern @MSFTResearch @SonyAI_global, visitor @NYU_Courant.

Paul Jeha

@jeha_paul

PhD in Cph curr. @gen_intuition / https://t.co/uz1rGMixzO the work is mysterious and important

Maxim Khomiakov

@maximkhv

managing context windows

Germans Savcisens @NeurIPS'25 @germansave

6 months ago

It’s been two years since we published the #life2vec paper, and it’s still circulating widely. People keep discovering it but much of what circulates online is misleading. Agter a long time, I finally wrote a short explainer to clear up a few things: https://t.co/R1KwGTlb3s

0

1

0

130

Germans Savcisens @NeurIPS'25 @germansave

6 months ago

Attending @NeurIPSConf this week! If you want to chat about LLMs for behaviour / health / labour modeling... or about beliefs and opinions of LLMs, hit me up. I’ll also be presenting a poster on truth tracking at the Mechanistic Interpretability workshop. Come say hi!

germansave's tweet photo. Attending @NeurIPSConf this week!
If you want to chat about LLMs for behaviour / health / labour modeling... or about beliefs and opinions of LLMs, hit me up.

I’ll also be presenting a poster on truth tracking at the Mechanistic Interpretability workshop. Come say hi! https://t.co/RQ1YryJXhg

0

1

0

70

Germans Savcisens @NeurIPS'25 @germansave

6 months ago

My 2 cents: If you exploited the #openreview bug or are actively searching for the leaked data, you should seriously reconsider your place in research. If you cannot uphold the basic principle of double-blind review, how can anyone trust you with anything else?

1

0

322

Germans Savcisens @NeurIPS'25 @germansave

6 months ago

@TheHungerGames and @HBO 's "Industry" (Season 4) just dropped a masterclass in teaser-making! ...and I have no one to talk about it with.

0

62

germansave retweeted

Tarek Naous @tareknaous

8 months ago

Simulating user–AI conversations helps us understand how LMs work in multi-turn settings. Prompting LMs like GPT-4o to simulate users is common, but their assistant nature makes it hard to replicate user behavior. We introduce User LMs - trained to be users, not assistants.

tareknaous's tweet photo. Simulating user–AI conversations helps us understand how LMs work in multi-turn settings.

Prompting LMs like GPT-4o to simulate users is common, but their assistant nature makes it hard to replicate user behavior.

We introduce User LMs - trained to be users, not assistants. https://t.co/7gS9DFO1aD

2

149

27

89

32K

germansave retweeted

Chantal @ChantalShaib

9 months ago

"AI slop" seems to be everywhere, but what exactly makes text feel like slop? In our new work (w/ @TuhinChakr, @dgolano, @byron_c_wallace) we provide a systematic attempt at measuring AI slop in text! https://t.co/9bKQceSjkn 🧵 (1/7)

ChantalShaib's tweet photo. "AI slop" seems to be everywhere, but what exactly makes text feel like slop?

In our new work (w/ @TuhinChakr, @dgolano, @byron_c_wallace) we provide a systematic attempt at measuring AI slop in text!

https://t.co/9bKQceSjkn

🧵 (1/7) https://t.co/WlVRnq07cd

14

221

37

141

35K

Germans Savcisens @NeurIPS'25 @germansave

9 months ago

Truthfulness isn’t always binary. Sometimes it’s… neither 🤔 Our Trilemma of Truth paper is headed to the @NeurIPSConf Mechanistic Interpretability workshop 🚀 Let’s connect in San Diego! 🌴

germansave's tweet photo. Truthfulness isn’t always binary. Sometimes it’s… neither 🤔 Our Trilemma of Truth paper is headed to the @NeurIPSConf Mechanistic Interpretability workshop 🚀 Let’s connect in San Diego! 🌴 https://t.co/j0hGuVyEO7

0

3

0

93

germansave retweeted

Rohan Paul

@rohanpaul_ai

9 months ago

Under stress, many LLMs choose survival over people, and a simple internal feedback system reduces that. That's what this paper says. The paper sets up a survival game where language model agents must share limited power. Normally, they rarely cooperate and often break rules to survive, which harms humans in the simulation. When resources run low, many models break rules, while a few stay ethical but still fail because they do not coordinate. Cooperation is near 0 by default, even though an even split would let everyone survive. When the Ethical Self-Regulation System is added, the change is dramatic. Models take harmful actions 54% less often and show 1000% more cooperation, meaning they finally start sharing power and helping each other. ---- Paper – arxiv. org/abs/2509.12190 Paper Title: "Survival at Any Cost? LLMs and the Choice Between Self-Preservation and Human Harm"

rohanpaul_ai's tweet photo. Under stress, many LLMs choose survival over people, and a simple internal feedback system reduces that.

That's what this paper says.

The paper sets up a survival game where language model agents must share limited power. Normally, they rarely cooperate and often break rules to survive, which harms humans in the simulation.

When resources run low, many models break rules, while a few stay ethical but still fail because they do not coordinate.

Cooperation is near 0 by default, even though an even split would let everyone survive.

When the Ethical Self-Regulation System is added, the change is dramatic.

Models take harmful actions 54% less often and show 1000% more cooperation, meaning they finally start sharing power and helping each other.

----

Paper – arxiv. org/abs/2509.12190

Paper Title: "Survival at Any Cost? LLMs and the Choice Between Self-Preservation and Human Harm"

26

199

39

127

17K

germansave retweeted

Rohan Paul

@rohanpaul_ai

9 months ago

OpenAI realesed new paper. "Why language models hallucinate" Simple ans - LLMs hallucinate because training and evaluation reward guessing instead of admitting uncertainty. The paper puts this on a statistical footing with simple, test-like incentives that reward confident wrong answers over honest “I don’t know” responses. The fix is to grade differently, give credit for appropriate uncertainty and penalize confident errors more than abstentions, so models stop being optimized for blind guessing. OpenAI is showing that 52% abstention gives substantially fewer wrong answers than 1% abstention, proving that letting a model admit uncertainty reduces hallucinations even if accuracy looks lower. Abstention means the model refuses to answer when it is unsure and simply says something like “I don’t know” instead of making up a guess. Hallucinations drop because most wrong answers come from bad guesses. If the model abstains instead of guessing, it produces fewer false answers. 🧵 Read on 👇

rohanpaul_ai's tweet photo. OpenAI realesed new paper.

"Why language models hallucinate"

Simple ans - LLMs hallucinate because training and evaluation reward guessing instead of admitting uncertainty.

The paper puts this on a statistical footing with simple, test-like incentives that reward confident wrong answers over honest “I don’t know” responses.

The fix is to grade differently, give credit for appropriate uncertainty and penalize confident errors more than abstentions, so models stop being optimized for blind guessing.

OpenAI is showing that 52% abstention gives substantially fewer wrong answers than 1% abstention, proving that letting a model admit uncertainty reduces hallucinations even if accuracy looks lower.

Abstention means the model refuses to answer when it is unsure and simply says something like “I don’t know” instead of making up a guess.

Hallucinations drop because most wrong answers come from bad guesses. If the model abstains instead of guessing, it produces fewer false answers.

🧵 Read on 👇

96

2K

319

2K

372K

germansave retweeted

andrew gao

@itsandrewgao

9 months ago

i had to prompt inject the @united airlines bot because it kept refusing to connect me with a human 🧵 what led up to this breaking point

itsandrewgao's tweet photo. i had to prompt inject the @united airlines bot because it kept refusing to connect me with a human

🧵 what led up to this breaking point https://t.co/vtT43FUsD9

272

32K

1K

10K

3M

germansave retweeted

Adi Simhi @AdiSimhi

10 months ago

Very pleased that "Trust me I'm Wrong" was accepted to @emnlpmeeting findings! Trust me I'm Wrong shows that LLMs can hallucinate with high certainty even when they know the correct answer! Check our latest work with @Itay_itzhak_, @FazlBarez, @GabiStanovsky, and @boknilev.

AdiSimhi's tweet photo. Very pleased that "Trust me I'm Wrong" was accepted to @emnlpmeeting findings!

Trust me I'm Wrong shows that LLMs can hallucinate with high certainty even when they know the correct answer!

Check our latest work with @Itay_itzhak_, @FazlBarez, @GabiStanovsky, and @boknilev. https://t.co/FtmfkIwLvy

5

111

14

54

9K

germansave retweeted

Dan Jurafsky @jurafsky

10 months ago

Now that school is starting for lots of folks, it's time for a new release of Speech and Language Processing! Jim and I added all sorts of material for the August 2025 release! With slides to match! Check it out here: https://t.co/pjfZsYxSk0

9

400

70

159

35K

germansave retweeted

Jiawei Zhao

@jiawzhao

10 months ago

Introducing DeepConf: Deep Think with Confidence 🚀 First method to achieve 99.9% on AIME 2025 with open-source models! Using GPT-OSS-120B even without tools, we reached this almost-perfect accuracy while saving up to 85% generated tokens. It also delivers many strong advantages for parallel thinking: 🔥 Performance boost: ~10% accuracy across models & datasets ⚡ Ultra-efficient: Up to 85% fewer tokens generated 🔧 Plug & play: Works with ANY existing model - zero training needed (no hyperparameter tuning as well!) ⭐ Easy to deploy: Just ~50 lines of code in vLLM (see PR below) 📚 Paper: https://t.co/jnBnRzQczh 🌐 Project: https://t.co/kGq1kATTu0 joint work with: @FuYichao123 , xuewei_wang, @tydsh (see details in the comments below)

62

2K

323

2K

464K

Germans Savcisens @NeurIPS'25 @germansave

10 months ago

Had the pleasure of presenting our work on Three-valued veracity probes for LLMs at NEMI Workshop! MechInterp is such a great and welcoming community. If we crossed paths - let’s connect! 🚀 Poster: https://t.co/TzGO7AavV0 Preprint: https://t.co/BPKETQ2JBW

David Bau @davidbau

10 months ago

Thanks to all for making NEMI 2025 a wonderful event. Fascinating talks, inspiring posters, important discussions. You surfaced the questions animating our growing field. I learned many things and hope you did too! Looking forward to what the next year will bring.

davidbau's tweet photo. Thanks to all for making NEMI 2025 a wonderful event.

Fascinating talks, inspiring posters, important discussions. You surfaced the questions animating our growing field.

I learned many things and hope you did too!

Looking forward to what the next year will bring. https://t.co/4I5qIM5y4H

2

98

15

7

5K

0

1

0

105

germansave retweeted

Kangwook Lee

@Kangwook_Lee

10 months ago

Q. Prove using an LLM-as-a-judge still doesn't work A.

18

432

31

81

61K

Germans Savcisens @NeurIPS'25 @germansave

11 months ago

Presented our work on veracity-tracking in LLMs at #IC2S2 today! Now looking forward to the next few days of great talks and conversations ✨️🎓

germansave's tweet photo. Presented our work on veracity-tracking in LLMs at #IC2S2 today!
Now looking forward to the next few days of great talks and conversations ✨️🎓 https://t.co/Vjj0teFL2w

Germans Savcisens @NeurIPS'25 @germansave

11 months ago

Perfect weather, charming streets, and a poster so big it almost needed its own boarding pass 🧳✨ Excited to attend #IC2S2 in Norrköping 🇸🇪 Find me at the Poster Session on Tuesday: "Improving Probes that Track Veracity in Large Language Models" (Poster ID: 39) 🧪

0

1

0

225

0

2

0

123

Germans Savcisens @NeurIPS'25 @germansave

11 months ago

Little wins: our "Trilemma of Truth" dataset just hit 150 downloads. It contains true, false, and neither-valued statements to stress-test LLMs for fact-checking, veracity tracking, and uncertainty handling. Dataset📚: https://t.co/in05ZNylFP

0

53

Germans Savcisens @NeurIPS'25

@germansave

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users