Maximilian Müller

5 days ago

💥I’m very excited to announce Exponential Security Labs: our new agent security startup based in Tübingen! Now everyone is deploying agents for increasingly high-stakes applications. Yet their reliability and security remain unsolved technical problems. Our mission is to secure agentic systems through self-improving red-teaming and guardrail agents. Tübingen is a special place to build a company in this space due to its incredibly high concentration of AI safety talent (ELLIS Institute, MPI-IS, University of Tübingen). Reach out if you're interested—we are hiring! We are also looking for customers—please get in touch!

23 days ago

@maxmbeck Let's go 🥖

2 months ago

@maxmbeck Congratulations!

mueller_mp retweeted

Christian Schlarmann @chs20_

3 months ago

New paper: We introduce Visual Memory Injection, a new attack on large vision-language models. A subtly perturbed image that remains in the chat context lets the model behave normally for many turns and later triggers a targeted harmful response to a topic-specific prompt.

https://t.co/9qrm5ApoMv

mueller_mp retweeted

6 months ago

📣 We are expanding our AI Safety and Alignment group at @ELLISInst_Tue and @MPI_IS!

We have:
- a great cluster at MPI with 50+ GB200s, 250+ H100s, and many, many A100 80GBs,
- outstanding colleagues (@jonasgeiping, @sahar_abdelnabi, etc.),
- competitive salaries (by academic standards),
- a fully English-speaking environment.

In particular, I'm looking for:
- one postdoc with a proven track record in AI safety,
- PhD students with a strong computer science background and ideally experience in cybersecurity, interpretability, or training dynamics,
- master’s thesis students (if you are already in Tübingen or can relocate to Tübingen for ~6 months),
- remote mentees for the Summer 2026 MATS cohort (apply directly via the MATS portal).

I'll be at NeurIPS in San Diego and would be glad to chat about these positions!

mueller_mp retweeted

francesco croce @fra__31

7 months ago

Happy to share that I've started as an assistant professor at @AaltoUniversity and ELLIS Institute Finland! I'll recruit students via the ELLIS PhD Program https://t.co/WXLbi7BiZD to work on multimodal learning, robustness, visual reasoning... feel free to reach out!

https://t.co/7Gms4utKqn

7 months ago

@fra__31 @AaltoUniversity Congratulations, Francesco!

mueller_mp retweeted

9 months ago

Very promising results on *robust* unlearning from colleagues at Tübingen and EPFL. (+ some general improvements to the standard evaluation by using an LLM judge and worst-case evaluation over paraphrases and input formats)

https://t.co/2IPMEYy1iB

10 months ago

This is a great opportunity for anyone who wants to work on AI safety. Congrats and all the best, Maksym!

10 months ago

🚨 Incredibly excited to share that I'm starting my research group focusing on AI safety and alignment at the ELLIS Institute Tübingen and Max Planck Institute for Intelligent Systems in September 2025! 🚨

Hiring. I'm looking for multiple PhD students: both those able to start in Fall 2025 (i.e., as soon as possible) and those applying through centralized programs like CLS, IMPRS, and ELLIS (the deadlines are in November) to start in Spring–Fall 2026. I'm also searching for postdocs, master's thesis students, and research interns. Fill out the Google form below if you're interested!

Research group. We will focus on developing algorithmic solutions to reduce harms from advanced general-purpose AI models. We're particularly interested in alignment of autonomous LLM agents, which are becoming increasingly capable and pose a variety of emerging risks. We're also interested in rigorous AI evaluations and informing the public about the risks and capabilities of frontier AI models. Additionally, we aim to advance our understanding of how AI models generalize, which is crucial for ensuring their steerability and reducing associated risks. For more information about research topics relevant to our group, please check the following documents:
- International AI Safety Report,
- An Approach to Technical AGI Safety and Security by DeepMind,
- Open Philanthropy’s 2025 RFP for Technical AI Safety Research.

Research style. We are not necessarily interested in getting X papers accepted at NeurIPS/ICML/ICLR. We are interested in making an impact: this can be papers (and NeurIPS/ICML/ICLR are great venues), but also open-source repositories, benchmarks, blog posts, even social media posts, literally anything that can be genuinely useful for other researchers and the general public.

Broader vision. Current machine learning methods are fundamentally different from what they used to be pre-2022. The Bitter Lesson summarized and predicted this shift very well back in 2019: "general methods that leverage computation are ultimately the most effective". Taking this into account, we are only interested in studying methods that are general and scale with intelligence and compute. Everything that helps to advance their safety and alignment with societal values is relevant to us. We believe getting this (some may call it "AGI") right is one of the most important challenges of our time.

Join us on this journey!

mueller_mp retweeted

about 1 year ago

Excited to announce FuseLIP: an embedding model that encodes image+text into a single vector. We achieve this by tokenizing images into discrete tokens, merging these with the text tokens and subsequently processing them with a single transformer.

https://t.co/Qr6gkcvP5E

mueller_mp retweeted

Václav Voráček @VaclavVoracekCZ

about 1 year ago

With @bremen79, we propose a new algorithm for constructing confidence intervals for means of bounded random variables using the "testing by betting" framework. It performs remarkably well even in the challenging very-small-sample regime (and, of course, it is great in the large-sample one).

about 1 year ago

This work now provides both an explanation and a (partial) solution to those observations. More insights in the paper: https://t.co/V64zJipIh7 Code: https://t.co/RPSJNwx89f

about 1 year ago

Mahalanobis++: Improving OOD Detection via Feature Normalization

Our latest work has been accepted to ICML and is now also on arXiv!

We explain why Mahalanobis-based OOD detection led to varied results and show that l2 normalization improves its performance consistently.

https://t.co/439sd0VC9A
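
The normalization step described above can be sketched as follows. This is a minimal pure-Python illustration under stated assumptions, not the paper's code: the function names (`l2_normalize`, `fit`, `ood_score`), the small `ridge` term, and the toy features are all hypothetical. It assumes the common formulation of the Mahalanobis detector (per-class means, one shared covariance matrix, score = smallest squared distance to any class mean) and simply l2-normalizes every feature vector before fitting and scoring.

```python
import math


def l2_normalize(v):
    """Scale a feature vector to unit Euclidean norm."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)


def solve(a, b):
    """Solve a @ x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [x - factor * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def fit(features, labels, normalize=True, ridge=1e-6):
    """Estimate per-class means and a shared covariance matrix.

    `ridge` is an assumption of this sketch: it keeps the matrix invertible.
    """
    if normalize:
        features = [l2_normalize(f) for f in features]
    dim = len(features[0])
    means = {}
    for c in sorted(set(labels)):
        rows = [f for f, y in zip(features, labels) if y == c]
        means[c] = [sum(col) / len(rows) for col in zip(*rows)]
    cov = [[ridge if i == j else 0.0 for j in range(dim)] for i in range(dim)]
    for f, y in zip(features, labels):
        d = [x - mu for x, mu in zip(f, means[y])]
        for i in range(dim):
            for j in range(dim):
                cov[i][j] += d[i] * d[j] / len(features)
    return means, cov


def ood_score(x, means, cov, normalize=True):
    """Smallest squared Mahalanobis distance to a class mean (higher = more OOD)."""
    if normalize:
        x = l2_normalize(x)
    best = math.inf
    for mu in means.values():
        d = [a - b for a, b in zip(x, mu)]
        z = solve(cov, d)  # z = cov^{-1} d
        best = min(best, sum(a * b for a, b in zip(d, z)))
    return best


# Toy features (hypothetical): two classes whose vectors differ a lot in norm.
TRAIN = [
    [1.0, 0.1, 0.0], [2.0, -0.2, 0.1], [3.0, 0.3, -0.3], [0.5, 0.0, 0.05],
    [0.1, 1.0, 0.0], [-0.2, 2.0, 0.1], [0.3, 3.0, -0.3], [0.0, 0.5, 0.05],
]
LABELS = [0, 0, 0, 0, 1, 1, 1, 1]

means, cov = fit(TRAIN, LABELS)
print(ood_score([4.0, 0.2, 0.0], means, cov))  # same direction as class 0
print(ood_score([0.0, 0.0, 1.0], means, cov))  # direction unlike either class
```

In this toy setup the second score should come out much larger than the first. Because of the normalization, rescaling an input (for example `[40.0, 2.0, 0.0]` instead of `[4.0, 0.2, 0.0]`) leaves its score essentially unchanged, so the feature norm alone cannot push a sample toward or away from the in-distribution classes.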
