Guy Davidson @guyd33 - Twitter Profile

Pinned Tweet

about 1 year ago

New preprint alert! We often prompt ICL tasks using either demonstrations or instructions. How much does the form of the prompt matter to the task representation formed by a language model? Stick around to find out 1/N

guyd33's tweet photo. New preprint alert! We often prompt ICL tasks using either demonstrations or instructions. How much does the form of the prompt matter to the task representation formed by a language model? Stick around to find out 1/N https://t.co/tvf9XXNwNX

1

275

34

263

49K

Guy Davidson @guyd33

about 2 months ago

@intothecrevasse Next time tot go, the paneer makhani and eggplant larger plates are fantastic, as is the pav bhaji

0

1

0

1

3K

Guy Davidson @guyd33

about 2 months ago

@nir_benz הנחתי שפאתוס זה המודל הבא אחרי מיתוס, אבל אולי הם הקדימו :)

1

0

44

Guy Davidson @guyd33

about 2 months ago

@nir_benz אני לא חושב שהועלם נגמר, ואני בטוח שאנטרופיק עושים אחלה יח״צ; אני מניח שהמציאות היא איפשהו באמצע, מצד אחד זו קפיצת מדרגה מרשימה, ומצד שני זה לא אפוקליפסות סייבר

1

0

120

Who to follow

Judy Fan

@judyefan

Cognitive scientist seeking to reverse engineer the human cognitive toolkit. Asst Prof of Psychology @Stanford.

Stephanie Chan

@scychan_brains

Staff Research Scientist at DeepMind. Artificial & biological brains 🤖 🧠 Societal impacts of AI + Science of AI. Views are my own.

Andrew Lampinen

@AndrewLampinen

Interested in cognition and artificial intelligence. MTS at @AnthropicAI. Previously @DeepMind, cognitive science @StanfordPsych. Tweets are mine.

Guy Davidson @guyd33

about 2 months ago

@nir_benz הנחתי שאם הם היו יוצאים מראש את אחת החולשות שאנטרופיק עשו מהן הייפ, הם היו אומרים, אבל אולי לא

1

0

9

Guy Davidson @guyd33

about 2 months ago

@nir_benz עכשיו, המחיר להריץ מודל ברמה מסוימת בגדול יורד, ואולי יורד מהר יותר משכמות הקוד הרלוונטי בעולם עולה, אבל לדעתי השאלה של איפה בכלל לחפש חולשה (ובטח כשמדובר על חולשה שאולי דורשת לחלוש על כמה מקומות שונים בקוד) היא ממש לא טריוויאלית, ולעשות לה סקיילינג עם חיפוש נאיבי זה קשה.

1

0

113

Guy Davidson @guyd33

about 2 months ago

@nir_benz המחקר הזה לא בלתי מעניין, אבל אני חושב שהוא קצת מוכר את עצמו יותר מדי. בהקבלה: אנטרופיק פרסמו שיש להם מודל ״איפה אפי״ מדהים שיכול לפתור איפה אפי בגודל של מגרש כדורגל, והמחקר הזה אומר ״סימנו למודל קטן יותר את המטר על מטר שבו אפי, וגם המודל הקטן מצא, תאכלו תחת אנטרופיק״.

1

0

189

Guy Davidson @guyd33

4 months ago

@_kobim אני משתמש ב-Strong וזה פותר את זה יפה

0

1

0

216

Guy Davidson @guyd33

4 months ago

@eyalFeder כל הסיפור מהמם, אבל הטענה שאין שווארמה מעל בינונית בניו יורק קצת מפוקפקת… היית ב-OMG על השביעית ורחוב עשר או שבזי באמסטרדם ו-93? (אני מניח שזה למטרות הסיפור, אבל אני גם תמיד בעד להרים לשווארמה מקומית)

1

4

0

84

Guy Davidson @guyd33

5 months ago

@_kobim מה המקום? מקווה שהקפה מצוין (התפריט לפחות עושה רושם טוב)

0

1

0

118

guyd33 retweeted

Dr. Karen Ullrich @karen_ullrich

6 months ago

If “getting started with agents” feels like setup hell — same. So we made a starter tutorial: First agent running in <14 minutes, no Docker/AWS. Laptop + API key only. 👇 https://t.co/xiac8r3cti

0

13

3

5

2K

Guy Davidson @guyd33

6 months ago

@sarahcat21 I almost brought an aeropress and coffee from home before I decided that’s a bit extra. I slightly regret the decision.

0

1

0

193

Guy Davidson @guyd33

6 months ago

@adinamwilliams @LakeBrenden @todd_gureckis @jcyhc_ai will present SAGE-Eval, our (w/ @LakeBrenden) systematic generalization safety benchmark at poster #1104 on Friday AM (11-2). John does fantastic work and he's open to RE/RS roles or PhD positions in AI Safety. If you're hiring, talk to him! https://t.co/SqbtLAjdy6

guyd33's tweet photo. @adinamwilliams @LakeBrenden @todd_gureckis @jcyhc_ai will present SAGE-Eval, our (w/ @LakeBrenden) systematic generalization safety benchmark at poster #1104 on Friday AM (11-2).

John does fantastic work and he's open to RE/RS roles or PhD positions in AI Safety. If you're hiring, talk to him!

https://t.co/SqbtLAjdy6 https://t.co/jsEmx2ZFrD

John (Yueh-Han) Chen

@jcyhc_ai

about 1 year ago

Do LLMs show systematic generalization of safety facts to novel scenarios? Introducing our work SAGE-Eval, a benchmark consisting of 100+ safety facts and 10k+ scenarios to test this! - Claude-3.7-Sonnet passes only 57% of facts evaluated - o1 and o3-mini passed <45%! 🧵

jcyhc_ai's tweet photo. Do LLMs show systematic generalization of safety facts to novel scenarios?

Introducing our work SAGE-Eval, a benchmark consisting of 100+ safety facts and 10k+ scenarios to test this!

- Claude-3.7-Sonnet passes only 57% of facts evaluated
- o1 and o3-mini passed <45%! 🧵 https://t.co/1iPAbWSLSc

2

40

16

20K

0

5

2

1

1K

Guy Davidson @guyd33

6 months ago

Like ~everyone, I'll also be at #NeurIPS this week! Please reach out to chat about past (goal representations, cognitive science, intrep) or current interests (LLM mental state inference, social environments for RL). Also if you have leads on great coffee, craft beer, or tacos.

3

53

3

16

4K

Guy Davidson @guyd33

6 months ago

We're also presenting some work! Our (@adinamwilliams @LakeBrenden @todd_gureckis ) interpretability work on task representations from different prompting forms will be poster #1016 on Friday's afternoon session (4:30-7:30, hall C/D/E) https://t.co/2xjCd3uafl

guyd33's tweet photo. We're also presenting some work! Our (@adinamwilliams @LakeBrenden @todd_gureckis ) interpretability work on task representations from different prompting forms will be poster #1016 on Friday's afternoon session (4:30-7:30, hall C/D/E)

https://t.co/2xjCd3uafl https://t.co/MlV72KTZZC

Guy Davidson @guyd33

about 1 year ago

New preprint alert! We often prompt ICL tasks using either demonstrations or instructions. How much does the form of the prompt matter to the task representation formed by a language model? Stick around to find out 1/N

1

275

34

263

49K

1

12

1

4

861

Guy Davidson @guyd33

6 months ago

@redtachyon Eh my read is more of a tongue in cheek “if you know you know” than trying and failing

0

1

0

14

guyd33 retweeted

Dr. Karen Ullrich @karen_ullrich

6 months ago

Stop by the Meta booth tomorrow, Wednesday Dec 3rd at #NeurIPS in San Diego! 🤖📱 We demo our new research environment, OpenApps, for digital agents. Generate thousands of app versions to train and evaluate multimodal agents to use apps like humans do. Not attending? Stay tuned

karen_ullrich's tweet photo. Stop by the Meta booth tomorrow, Wednesday Dec 3rd at #NeurIPS in San Diego! 🤖📱

We demo our new research environment, OpenApps, for digital agents. Generate thousands of app versions to train and evaluate multimodal agents to use apps like humans do.

Not attending? Stay tuned https://t.co/rt64Z5PdXC

1

9

2

0

933

Guy Davidson @guyd33

6 months ago

@marikgoldstein I went to Modern Times Coffee nearby today, quite nice + solid breakfast tacos

0

1

0

35

Guy Davidson @guyd33

6 months ago

@dianarycai I got here through the conference website, tried to upload a form tonight and we'll see if it works tomorrow: https://t.co/uN3noQO6e9

1

0

382

Guy Davidson @guyd33

6 months ago

@joannejang Absolutely, interesting and hard problem. Unclear what exactly to measure, how much of what good EQ looks like is user-dependent, and how aligned writing style/tone is with EQ (and/or the perception of it)

0

1

0

276

Guy Davidson

@guyd33

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users