Timo Kaufmann @timokauf - Twitter Profile

Pinned Tweet

Timo Kaufmann @timokauf

6 months ago

Presenting ResponseRank at #NeurIPS2025! Come by poster #405 at 4:30pm today if you're in San Diego 👋

Timo Kaufmann @timokauf

9 days ago

Very excited to start my Student Researcher position at @GoogleDeepMind next week! Flying out tomorrow, ready to wield my #GoogleInterns propeller head proudly

Artificial Intelligence and Machine Learning @ LMU @AIML_LMU

10 days ago

🎉We are proud of @timokauf, who is off to @GoogleDeepMind, London as a Student Researcher! On Monday he'll begin 6 months on the Amplified Oversight team, which tackles a central challenge in AI safety: how humans can reliably oversee models that are becoming ever more capable🚀

Timo Kaufmann @timokauf

about 2 months ago

@ZacKenton1 @DavidDAfrica @jacob_pfau Thanks for making me aware! Inverse constitutional learning has a lot of overlap with the ICAI + FF combination and understanding motivations would be great for alignment. Right now the method is more focused on expressed traits, but might be extensible (@arduinfindeis)

Timo Kaufmann @timokauf

6 months ago

Somebody recorded me at our poster. Cool to have a video! Don't expect a full explanation though, it's just a random excerpt and I had no idea I was being filmed 🙃

Starc

@Starc_Institute

6 months ago

#NeurIPS2025 has passed, but we hope the celebration will last forever. Here is a poster presentation we recorded at the event and we hope it can last forever in cyberspace. The paper: ResponseRank: Data-Efficient Reward Modeling through Preference Strength Learning. Thanks to the authors: @timokauf, @metz_yannick, Daniel Keim, Eyke Hüllermeier

Timo Kaufmann @timokauf

6 months ago

@Starc_Institute Thanks for posting this, nice to have a video of the session! I had no idea I was being filmed. The video starts sort of in the middle of the explanation, so take a look at the paper if you're interested. Paper: https://t.co/suPIUZpewO Poster etc.: https://t.co/5gYGIxNaem

Timo Kaufmann @timokauf

6 months ago

Had a good time presenting this! (Though who decided poster sessions should go until 7:30pm?) Thanks to Max Muschalik for helping me present even without being an author. Poster presentations really need two people!

Timo Kaufmann @timokauf

6 months ago

Joint work with @metz_yannick, Daniel Keim, and @eyke_hu.

Timo Kaufmann @timokauf

6 months ago

Benefits: improved reward model generalization, better data efficiency, and improved policies. Looking forward to seeing you at the poster! Paper and more: https://t.co/5gYGIxNaem

Timo Kaufmann @timokauf

6 months ago

The key insight is that these signals only need to be locally valid and relative (e.g., within one annotator's comparisons). No need to model the exact relationship to strength. Just rank which comparisons are stronger.

Timo Kaufmann @timokauf

6 months ago

The core idea: Not all preferences are equal. ResponseRank learns preference strength from implicit signals in your data, like inter-annotator agreement, stated confidence, or response times.

timokauf retweeted

Arduin Findeis @arduinfindeis

7 months ago

I think Gemini 3 Pro’s personality and style notably improved over 2.5. It uses fewer common shortcuts to bias human annotators, e.g. long responses or overly polite tone. That makes the strong LMArena performance quite a bit more impressive! Some examples of the differences 🧵

Timo Kaufmann @timokauf

10 months ago

@janleike Thanks for pointing this out! Do you think Anthropic will be able to offer visa sponsorship in future iterations? I'm a German PhD student and would love to apply, but cannot self-sponsor as far as I can tell.

Timo Kaufmann @timokauf

10 months ago

Nice demo of feedback forensics, check out our tool and Arduin's analysis!

Arduin Findeis @arduinfindeis

10 months ago

How is GPT-5's personality different to GPT-4o? A quantitative analysis using Feedback Forensics 🧵

Timo Kaufmann @timokauf

12 months ago

Just noticed the key deadlines for #ICLR2026 are out! PSA for everyone else who's been waiting. Full paper: Sept 24 AoE.

Timo Kaufmann @timokauf

about 1 year ago

This was a lot of fun!

Artificial Intelligence and Machine Learning @ LMU @AIML_LMU

about 1 year ago

@arduinfindeis and our lab member @timokauf presented Inverse Constitutional AI. Had a blast, the pictures speak for themselves! Joint work with @eyke_hu, @SamuelAlbanie, @RobDMullins. Paper📰https://t.co/49eGzYchQs GitHub: https://t.co/dMlHYlCl5r 🧵2/4

Timo Kaufmann @timokauf

about 1 year ago

Looking forward to presenting tomorrow. Come by our poster #520 if you're at ICLR!

Arduin Findeis @arduinfindeis

about 1 year ago

Excited to be in Singapore for ICLR! Keen to chat about interpreting feedback data and detecting model characteristics ⚖️ Reach out or come by our poster on Inverse Constitutional AI tomorrow, Friday 25 April from 10am-12.30pm (#520 in Hall 2B) - @timokauf and I will be there!

Timo Kaufmann @timokauf

about 1 year ago

Currently visiting @arduinfindeis in Cambridge. I didn't realize it's this beautiful! Do I know anyone here that I haven't met up with yet?

Timo Kaufmann @timokauf

about 1 year ago

Ever wondered how models on chatbot arena differ? Feedback forensics gives some answers, check it out!

Arduin Findeis @arduinfindeis

about 1 year ago

🤖 3. Discovering model strengths How is GPT-4o different to other models? → Uses more numbered lists, but Gemini is more friendly and polite https://t.co/bDCQXGtRFW

timokauf retweeted

Arduin Findeis @arduinfindeis

about 1 year ago

🕵🏻💬 Introducing Feedback Forensics: a new tool to investigate pairwise preference data. Feedback data is notoriously difficult to interpret and has many known issues – our app aims to help! Try it at https://t.co/4HubCg52Pi Three example use-cases 👇🧵

Timo Kaufmann @timokauf

over 1 year ago

Our paper on query-efficient reward learning will be at AAAI! Unfortunately I won’t be attending, but Xuening will be at the poster - stop by or reach out online!

Artificial Intelligence and Machine Learning @ LMU @AIML_LMU

over 1 year ago

🚀 Excited to share our latest work at #AAAI2025! How can we make Reinforcement Learning from Human Feedback (RLHF) more query-efficient? Introducing DUO: Diverse, Uncertain, On-policy query generation and selection. 🧵👇 1/6
