Martin Gubri @framart1 - Twitter Profile

Pinned Tweet

about 2 years ago

🦹💥 How to detect if my LLM was stolen or leaked? 🤖💥 I am delighted to announce TRAP 🪤, our new #ACL2024 findings paper ☝️ We showcase how to use adversarial prompt as model fingerprint for LLM. A thread 🧵 ⬇️⬇️⬇️

framart1's tweet photo. 🦹💥 How to detect if my LLM was stolen or leaked? 🤖💥

I am delighted to announce TRAP 🪤, our new #ACL2024 findings paper ☝️ We showcase how to use adversarial prompt as model fingerprint for LLM.

A thread 🧵
⬇️⬇️⬇️ https://t.co/wZShlIFXNV

1

23

4

3

4K

Martin Gubri @framart1

about 1 month ago

@DamienTeney @a_rubique @coallaoh Thanks, Damien ☺️

0

16

Martin Gubri @framart1

about 1 month ago

🏆🪩 DISCO just won the best paper award at ICLR'26-CAO! I am so proud of my co-authors! Huge congrats to @a_rubique, Benjamin, and @coallaoh!

framart1's tweet photo. 🏆🪩 DISCO just won the best paper award at ICLR'26-CAO!

I am so proud of my co-authors! Huge congrats to @a_rubique, Benjamin, and @coallaoh! https://t.co/yUDxAI7YTd

Alexander Rubinstein @a_rubique

about 1 month ago

🎉 Our DISCO paper received a Best Paper Award at the ICLR 2026 “Catch, Adapt, and Operate” Workshop! Huge thanks to my coauthors Benjamin, @framart1, and @coallaoh, and to the CAO organizers, led by @sepidshs, and @RBCBorealis, for supporting the workshop!

2

17

3

0

1K

2

10

0

329

Martin Gubri @framart1

about 1 month ago

@HaritzPuerto @a_rubique @coallaoh Thanks a lot!!

0

8

Who to follow

Non Serviam

@Catowar2611

Πιστεύω εις έναν θεό, al dente, με σάλτσα και κεφτέδες! Είθε το ιερό μακαρονοτέρας να φωτίζει την ζωή σας! Αμήν!

Vikash Sehwag

@VSehwag_

Research Scientist @GoogleDeepMind (Gemini core post-training - Gemini 3, 2.5; RL and Reasoning); PhD @Princeton

A livr'ouvert

@Bookynette

Petite librairie de quartier ouverte du lundi au samedi de 10h à 19h. Des ateliers pour enfants et adultes sont proposés ainsi que des soirées thématiques!

Martin Gubri @framart1

about 1 month ago

🪩 If you are at #ICLR today, come the poster of our DISCO paper to discuss LLM efficient benchmarking with @a_rubique!

Alexander Rubinstein @a_rubique

about 1 month ago

Come check out our poster to learn a simple (but very effective) trick to speed up your LLM evaluation by 100×! ⏰ 15:15 to 17:45 today! 📍Pavilion 3 P3-#1015

0

8

3

0

714

0

8

0

177

Martin Gubri @framart1

about 1 month ago

6/ Thank you for the hard work, the patience, and the brilliance! I learned so much from all of you. More to come soon :)

0

5

0

84

Martin Gubri @framart1

about 1 month ago

1/ My contract at @parameterlab ended last week, after 2.5 years (since Sept 2023, with some collaboration before). I had the chance to lead research on trustworthy AI for LLMs alongside an incredible group of people (Neckarfront. All Tübingen researchers have to post it once!)

framart1's tweet photo. 1/ My contract at @parameterlab ended last week, after 2.5 years (since Sept 2023, with some collaboration before).

I had the chance to lead research on trustworthy AI for LLMs alongside an incredible group of people

(Neckarfront. All Tübingen researchers have to post it once!) https://t.co/pkNrlCuPEq

2

15

0

1

1K

Martin Gubri @framart1

about 1 month ago

5/ And to all my brilliant co-authors since 2023: @dnnslmr, @oodgnas, @hwaran_lee, Siwon, @HaritzPuerto, @tommasogreen, @anmgoel, @CorEmde, Ahmed, @esruzzetti, Salman, @amohamed264, @a_rubique, @adamdaviesnlp, @_elinguyen, @michael_simeone, Erik.

1

4

0

174

Martin Gubri @framart1

about 2 months ago

@CFGeek There is work on that: you can check model fingerprinting and model watermarking (black & white box methods). I recommend this survey & benchmark paper: https://t.co/xmxcG6zngs Which includes our TRAP paper: https://t.co/IO1rIGrBX5 Nonetheless, I wish we have such model lineage

1

0

111

Martin Gubri @framart1

about 2 months ago

🎉Our privacy collapse paper has been accepted at #ACL 2026 (main)! Contextual privacy is fragile: fine-tune an LLM on benign data, and it can overshare personal information. This is silent: suites don't usually measure contextual privacy, which is a big issue with LLM agents.

framart1's tweet photo. 🎉Our privacy collapse paper has been accepted at #ACL 2026 (main)!

Contextual privacy is fragile: fine-tune an LLM on benign data, and it can overshare personal information.

This is silent: suites don't usually measure contextual privacy, which is a big issue with LLM agents. https://t.co/0DOqxjHBeD

Anmol Goel @anmgoel

4 months ago

🚨 Fine-tuning your model to be more helpful or empathetic might be making it less private, without you noticing. In our latest work, we show that benign fine-tuning can silently break contextual privacy in language models while safety & general capabilities appear intact. ⬇️

anmgoel's tweet photo. 🚨 Fine-tuning your model to be more helpful or empathetic might be making it less private, without you noticing.

In our latest work, we show that benign fine-tuning can silently break contextual privacy in language models while safety & general capabilities appear intact.

⬇️ https://t.co/Dy27vohA6I

1

9

2

3

4K

1

4

0

215

Martin Gubri @framart1

about 2 months ago

@tom_doerr Our Neurips paper proposes a benchmark for these GEO methods and shows that none (that we tested) work: https://t.co/8n18q3NuHZ

0

27

Martin Gubri @framart1

2 months ago

Would love to hear thoughts from people working on watermarking :) cc @ni_jovanovic @sahar_abdelnabi @jonasgeiping

0

96

Martin Gubri @framart1

2 months ago

🌍We've made LLM watermarking equally robust across all languages we studied. Scaling to 100+ languages! Even sota watermarks can be removed by translating to another language, eg. Tamil. This hits hardest in low-resource languages, where moderation tools are already weak. 🧵

framart1's tweet photo. 🌍We've made LLM watermarking equally robust across all languages we studied. Scaling to 100+ languages!

Even sota watermarks can be removed by translating to another language, eg. Tamil. This hits hardest in low-resource languages, where moderation tools are already weak.
🧵 https://t.co/leDYIc1F1C

1

6

2

0

275

Martin Gubri @framart1

2 months ago

Kudos to Asim Mohamed for his first research paper! Paper: https://t.co/slv2Kxo3YC Code: https://t.co/V3Qo0UeaXB

2

0

95

Martin Gubri

@framart1

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users