Rotem Dror

about 2 months ago

@nlopitz lol😅 maybe...maybe not

0

1

0

28

2 months ago

🚨 New Paper(s) Alert 🚨 I’m excited to share papers I’ve recently published, exploring a different application of LLMs: https://t.co/qUwsuSGgry https://t.co/e3ybk50hvC https://t.co/IUtG2xfVPO https://t.co/XX1Dki2SDk (Sorry for the paper drop — I’ve been offline for a while.)

1

16

0

1

947

Senior Lecturer at @CseHuji. #NLPROC

2 months ago

All very different, but the common theme is: using LLMs in structured, evaluable, and purpose-driven ways.

1

5

0

124

Who to follow

Roy Schwartz

@royschwartzNLP

Yonatan Belinkov

@boknilev

Associate professor of computer science @TechnionLive; visiting scholar @KempnerInst 2025-2026.

Yanai Elazar

@yanaiela

Assistant Professor at Bar-Ilan University

2 months ago

• Adaptive surveys + LLMs for measuring extreme psychological constructs • Planning-based view + evaluation of LLM web agents • BET: detecting behavioral & emotional themes in narratives • Lilo: multi-agent system for children’s digital communication

1

4

0

156

DrorRotem retweeted

4 months ago

[1/7] Why do frontier LLMs make factual errors? Is it because they never learned the fact… or because they can’t access knowledge they already encoded? In our new paper, we show: The bottleneck is not encoding; it is recall. 🧵👇 Paper: https://t.co/mkkqr0KN4X Many thanks to @_galyo @bd_eyal @zorikgekhman @eran_ofek59358 @GoogleResearch

NitCal's tweet photo. [1/7] Why do frontier LLMs make factual errors?
Is it because they never learned the fact…
or because they can’t access knowledge they already encoded?

In our new paper, we show:
The bottleneck is not encoding; it is recall. 🧵👇

Paper: https://t.co/mkkqr0KN4X

Many thanks to @_galyo @bd_eyal @zorikgekhman @eran_ofek59358 @GoogleResearch

4

124

33

91

13K

DrorRotem retweeted

Yanai Elazar @yanaiela

10 months ago

A strange trend I've noticed at #ACL2025 is that people are hesitant to reach out to papers/"academic products" authors. This is unfortunate for both parties! A simple email can save a lot of time to the sender, but is also one of my favorite kind of email as the receiver!

0

33

1

2

2K

11 months ago

Tomorrow morning (9AM🌅) I'll be giving a keynote talk at LAW (linguistic annotation workshop) at #ACL2025 on annotations in the era of LLMs - see you there!!🌟🌟

0

17

3

0

906

DrorRotem retweeted

11 months ago

The Alternative Annotator Test (alt-test) is a new statistical procedure proposed in our ACL 2025 paper! 🇦🇹🇦🇹 @DrorRotem @roireichart The goal? To help justify using LLMs over humans. If the LLM passes the test, its annotations can be trusted 😎 https://t.co/Y2l8Eyv2mY

1

10

1

9

452

DrorRotem retweeted

Mike Erlihson, Math PhD, AI

11 months ago

Everyone uses LLMs to annotate data or evaluate models in their research. But how can we convince others (readers, collaborators, reviewers!!!) that LLMs are reliable? 🤖 Here’s a simple (and low-effort) solution: show the LLM is a *comparable alternative annotator* ✅

NitCal's tweet photo. Everyone uses LLMs to annotate data or evaluate models in their research.

But how can we convince others (readers, collaborators, reviewers!!!) that LLMs are reliable? 🤖

Here’s a simple (and low-effort) solution: show the LLM is a *comparable alternative annotator* ✅ https://t.co/D4Uu585Jio

3

69

19

32

6K

DrorRotem retweeted

@MikeE_3_14

12 months ago

🔥הסקירות ממשיכות לזרום ל-X🔥 🧵 המאמר היומי של מייק: 25.06.25 The Alternative Annotator Test for LLM-as-a-Judge: How to Statistically Justify Replacing Human Annotators with LLMs מאמר 🇮🇱 תפנית מעניינת מתרחשת בתקופה האחרונה בעולם של הערכת ביצועי מודלים. אנחנו כבר לא שואלים רק עד כמה המודל מצליח במבחן כלשהו, אלא שאלה מהותית יותר: האם ניתן לסמוך על מודל שפה שיחליף מתייג אנושי? זו לא שאלה שמדדים מסורתיים כמו דיוק, F1 או הסכמה בין מתייגים יכולים לענות עליה כראוי. תחת זאת, המאמר שנסקור היום מציג שיטה מבוססת סטטיסטיקה לפתרון ביעה זו. בלב המאמר עומדת קריאה להתרחק ממדדי התאמה שטחיים, ולעבור לנימוקים מבוססי השערות סטטיסטיות וניתוח עלות-תועלת.

3

24

7

8

4K

DrorRotem retweeted

about 1 year ago

Preferences drive modern LLM research and development: from model alignment to evaluation. But how well do we understand them? Excited to share our new preprint: Multi-domain Explainability of Preferences https://t.co/XuoN5Mz246 @roireichart @LiatEinDor 🧵👇 1/11

NitCal's tweet photo. Preferences drive modern LLM research and development: from model alignment to evaluation.
But how well do we understand them?

Excited to share our new preprint:
Multi-domain Explainability of Preferences
https://t.co/XuoN5Mz246

@roireichart @LiatEinDor
🧵👇
1/11 https://t.co/oU3cII1twM

2

36

17

6

2K

about 1 year ago

This is your chance to uncover revolutionary research, forge invaluable connections, and become part of the AI innovation wave. Conference dates: May 25-27, 2025 📷 Information and registration: https://t.co/rqgcPoUpzK

0

21

about 1 year ago

Join us at the University of Haifa for the HiAI Conference, a dynamic event at the forefront of Artificial Intelligence. Immerse yourself in groundbreaking discoveries, gain exclusive insights into the latest advancements, and navigate the future of AI with leading experts.

1

0

37

about 1 year ago

Keynote speakers: . Hod Lipson, Mechanical Engineering, Columbia University, USA Prof. Mor Naaman, Information, Cornell Tech, USA Prof. Tanya Berger-Wolf, Computer Science & Engineering, Ohio State University, USA

1

0

43

over 1 year ago

@VeredShwartz AI is also old...now it's agents😆

0

1

0

48

over 1 year ago

@jkkummerfeld @VeredShwartz @LChoshen @GabiStanovsky what load is that? Review load or AC load? or to be more clear - if I'm an AC and I submitted a paper, am I expected to be the AC of at least 4 papers, or, in addition to the AC load, also review 4 papers?

1

0

58