Workshop on Large Language Model Memorization @l2m2_workshop - Twitter Profile

Pinned Tweet

Workshop on Large Language Model Memorization @l2m2_workshop

about 1 year ago

📢 @aclmeeting notifications have been sent out, making this the perfect time to finalize your commitment. Don't miss the opportunity to be part of the workshop!
🔗 Commit here: https://t.co/CrO6CYEnyI
🗓️ Deadline: May 20, 2025 (AoE)
#ACL2025 #NLProc

1

12

8

1

2K

Workshop on Large Language Model Memorization @l2m2_workshop

10 months ago

L2M2 will be tomorrow at VIC, room 1.31-32! We hope you will join us for a day of invited talks, orals, and posters on LLM memorization. The full schedule and accepted papers are now on our website: https://t.co/AH3aMn3ID6

0

12

5

1

8K

l2m2_workshop retweeted

Niloofar

@niloofar_mire

10 months ago

I'm psyched for my 2 *different* talks on Friday @aclmeeting:
1. @llm_sec (11:00): What does it mean for an AI agent to preserve privacy?
2. @l2m2_workshop (16:00): Emergent Misalignment thru the Lens of Non-verbatim Memorization (& phonetic to visual attacks!)
Join us!

1

85

12

16

6K

l2m2_workshop retweeted

Yanai Elazar @yanaiela

11 months ago

I'll be at #ACL2025 next week! Catch me at the poster sessions, eating sachertorte, schnitzel and speaking about distributional memorization at the @l2m2_workshop

1

90

11

27

5K

Workshop on Large Language Model Memorization @l2m2_workshop

10 months ago

L2M2 is happening this Friday in Vienna at @aclmeeting #ACL2025NLP! We look forward to the gathering of memorization researchers in the NLP community. Invited talks include: @yanaiela @niloofar_mire @rzshokri and see our website for the full program. https://t.co/AH3aMn3ID6

0

27

13

2

2K

l2m2_workshop retweeted

Ai2 @allen_ai

about 1 year ago

For years it’s been an open question — how much is a language model learning and synthesizing information, and how much is it just memorizing and reciting? Introducing OLMoTrace, a new feature in the Ai2 Playground that begins to shed some light. 🔦

17

619

133

306

177K

l2m2_workshop retweeted

Tom McCoy @RTomMcCoy

about 1 year ago

Do language models just copy text they've seen before, or do they have generalizable abilities?
⬇️This new tool from Ai2 will be very useful for such questions!
And allow me to plug our paper on this topic: We find that LLMs are mostly not copying! https://t.co/eNHNgyjLsQ
1/2

1

74

6

42

8K

l2m2_workshop retweeted

Jiacheng Liu @liujc1998

about 1 year ago

As infini-gram surpasses 500 million API calls, today we're announcing two exciting updates:
1. Infini-gram is now open-source under Apache 2.0!
2. We indexed the training data of OLMo 2 models. Now you can search in the training data of these strong, fully-open LLMs.
🧵 (1/4)

2

66

14

24

7K

Workshop on Large Language Model Memorization @l2m2_workshop

about 1 year ago

Hi all, reminder that our direct submission deadline is April 15th! We are co-located at ACL'25 and you can submit archival or non-archival. You can also submit work published elsewhere (non-archival). Hope to see your submission! https://t.co/AH3aMn4gsE

0

11

7

0

2K

l2m2_workshop retweeted

Abhilasha Ravichander @lasha_nlp

about 1 year ago

Want to know what training data has been memorized by models like GPT-4? We propose information-guided probes, a method to uncover memorization evidence in *completely black-box* models, without requiring access to
🙅‍♀️ Model weights
🙅‍♀️ Training data
🙅‍♀️ Token probabilities
🧵1/5

4

206

44

130

28K

Workshop on Large Language Model Memorization @l2m2_workshop

about 1 year ago

https://t.co/Iikjv8x2XM

0

66

Workshop on Large Language Model Memorization @l2m2_workshop

about 1 year ago

Hey all, we will be retweeting works on memorization. Please DM us if you want us to retweet your work. Our submission deadline is 4/15, consider submitting to one of our archival or non-archival tracks!

1

0

94

l2m2_workshop retweeted

Niloofar

@niloofar_mire

over 1 year ago

Adding or removing PII in LLM training can *unlock previously unextractable* info. Even if “John.Mccarthy” never reappears, enough Johns & Mccarthys during post-training can make it extractable later! New paper on PII memorization & n-gram overlaps: https://t.co/EDNPwhj1as

4

83

13

27

6K

l2m2_workshop retweeted

Ashwinee Panda

@PandaAshwinee

about 1 year ago

we show for the first time ever how to privacy audit LLM training. we give new SOTA methods that show how much models can memorize. by using our methods, you can know beforehand whether your model is going to memorize its training data, and how much, and when, and why! (1/n 🧵)

1

127

22

70

14K

Workshop on Large Language Model Memorization @l2m2_workshop

over 1 year ago

More details on our website: https://t.co/AH3aMn4gsE 📩Got questions? Reach out to us at [email protected]

0

3

0

138

Workshop on Large Language Model Memorization @l2m2_workshop

over 1 year ago

📢 The First Workshop on Large Language Model Memorization (L2M2) will be co-located with @aclmeeting in Vienna 🎉
💡 L2M2 brings together researchers to explore memorization from multiple angles. Whether it's text-only LLMs or Vision-language models, we want to hear from you! 🌍

1

13

4

3

5K

Workshop on Large Language Model Memorization @l2m2_workshop

over 1 year ago

📝 Key Deadlines:
- ARR submission: Feb 15, 2025
- Direct submission: Mar 25, 2025
- ARR commitment: Apr 17, 2025
- Notification: Apr 27, 2025
- Camera-ready: May 16, 2025

1

4

0

159

Workshop on Large Language Model Memorization @l2m2_workshop

over 1 year ago

🎉 Happy to announce that the L2M2 workshop has been accepted at @aclmeeting! #NLProc #ACL2025 More details will follow soon. Stay tuned and spread the word! 📣

0

35

5

4

14K
