Lujain Ibrahim

Verified account

@lujainmibrahim

phd candidate @oiioxford @uniofoxford, currently visiting @stanfordnlp / previously @googledeepmind @govaiorg @schwarzmanorg @nyuniversity

Joined July 2019

798 Following

1.3K Followers

2K Posts

lujainmibrahim retweeted

Meryl Ye @merylyemerylye

3 days ago

Excellent paper on why anthropomorphic misalignment research needs more precise concepts and stronger evidence. One of their arguments, that a single label can obscure different operationalizations, is something we found empirically for sycophancy. https://t.co/65lLtiQZGE

0

7

1

1

542

lujainmibrahim retweeted

Myra Cheng @chengmyra1

3 days ago

Read @lujainmibrahim's and my paper here: https://t.co/zIDK2vfgPm

0

12

6

3

3K

lujainmibrahim retweeted

Myra Cheng @chengmyra1

3 days ago

This is a wonderful articulation of the limitations of anthropomorphism for understanding + addressing model behavior! Our position paper discusses these assumptions across the LLM development pipeline, to be presented at ACL next month!

https://t.co/WOkGiblV32

3

69

7

41

9K

lujainmibrahim retweeted

3 days ago

Really excited about this work! I think how models generalize alignment out of distribution will be increasingly important; positive alignment has the potential to create huge benefits; and the results here are both great and a bit surprising. Check it out!

https://t.co/x2TQOGHvbl

0

58

11

11

8K

lujainmibrahim retweeted

Joachim Baumann

5 days ago

Cool to see that the Claude Code sessions we collected and released are now helping Anthropic study Claude Code 😎 Check out SWE-chat if you haven't! (more data coming soon...)

https://t.co/0ykWCmf73B

6

34

13

14

12K

lujainmibrahim retweeted

Kobi Hackenburg

@KobiHackenburg

6 days ago

New w/ @AISecurityInst & @UniofOxford: Frontier AI can now out-persuade expert humans in conversation - incl. world-champ debaters and professional canvassers. This held even when humans chose their topics, prepared in advance, and competed for £1,000 prizes 🧵

58

929

223

595

204K

lujainmibrahim retweeted

Ben Tappin @Ben_Tappin

13 days ago

Are AI Chatbots Harmful or Beneficial? It Depends What Would’ve Happened Otherwise. When thinking about whether using AI chatbots is harmful or beneficial, we should always be asking: “Compared to what?” Link below 👇

https://t.co/HPhMX6CAES

1

9

3

8

6K

lujainmibrahim retweeted

about 1 month ago

Really excited to see this longitudinal study. So far there aren't that many studies of the effect of long-term LLM use on users.

2

24

5

7

8K

lujainmibrahim retweeted

17 days ago

We propose a new way to quantify AI overreliance: the Offloading Score 🧐 @vishakh_pk It measures the fraction of cognitive work you hand off to AI 🤖 via simulating how you'd have done each step without AI, then counting the steps the AI saved. It works directly from interaction traces (keystrokes, screenshots), so it's reusable across many tools!!

3

169

23

104

46K

@lujainmibrahim

19 days ago

Excited to share this paper, led by @vishakh_pk! A very creative way to measure reliance on AI tools by showing, via simulating counterfactual workflows, how much cognitive effort has moved from the person to the tool.

Vishakh Padmakumar

19 days ago

People are increasingly worried that AI tools make us overreliant. But how do we actually measure this? We introduce Offloading Score, a measure of reliance based on the fraction of cognitive effort offloaded to AI while completing a task. In a controlled user study, Offloading Score detects increased reliance under time pressure, while several common alternatives do not. (1/9)

7

213

75

99

77K

0

15

1

8

2K

lujainmibrahim retweeted

Vishakh Padmakumar

19 days ago

High reliance is not always undesirable. We examine the interaction between reliance and a desirable task outcome, code understanding. While in general high reliance leads to low code understanding, we also find a cluster of high-reliance + high-understanding users who are often _learning_ with AI and augmenting their own skills. This suggests that reliance should be interpreted alongside task outcomes, not in isolation. (8/9)

1

7

2

0

582

lujainmibrahim retweeted

Vishakh Padmakumar

19 days ago

People are increasingly worried that AI tools make us overreliant. But how do we actually measure this? We introduce Offloading Score, a measure of reliance based on the fraction of cognitive effort offloaded to AI while completing a task. In a controlled user study, Offloading Score detects increased reliance under time pressure, while several common alternatives do not. (1/9)

7

213

75

99

77K

lujainmibrahim retweeted

20 days ago

Training the models to be less sycophantic seems like a good idea, but I think the problem is that they’re not really smart enough for it yet, so they end up “pushing back” in ways that are awkward and annoying because they don’t make sense.

18

197

5

13

8K

lujainmibrahim retweeted

Lama Ahmad لمى احمد @_lamaahmad

24 days ago

We (@CedricWhitney, @SandhiniAgarwal, @EstherTetruas, @OliviaGWatkins2, @dgrobinson) wrote about nuances we’ve observed while working with third parties on frontier model evals, and why eval standards need to account for them. https://t.co/oH95xM8DPm

2

25

8

18

4K

lujainmibrahim retweeted

25 days ago

It’s been great to have spent a few months with @woj setting up OAIF’s work on AI resilience / economic futures and getting ambitious about where this can go. Glad to have helped on this one. I feel a huge amount of uncertainty on AI’s impacts and what to do about it. But I think giving people a pathway to shape their own futures, and not be subject to arbitrary power, is really important in all worlds. Econ is a huge part of that, democracy is a huge part of that, and they’re fundamentally entwined (e.g. it’s good that the state is funded via broad-based taxes, it’s good that competition enables plurality and keeps people’s choices relevant, etc.). There’s a ton that can be done now - better, real-time labor statistics, broad and deep qualitative data collection, pathways for sharing in AI’s value for countries around the world, lots of experiments that give people an actual stake in AI growth, investment in state capacity. Tons to do, more soon.

5

86

5

22

13K

lujainmibrahim retweeted

27 days ago

we will come to see human limitations as sacred and joyous and look back on them the way you look back on your childhood

53

757

32

110

59K

lujainmibrahim retweeted

Sunny Yu @sunnyyuych

about 1 month ago

“AI, what color do I get from mixing black and white?” Why do people turn to AI for simple tasks that they could easily do themselves? In our new preprint (also to appear at CogSci 2026!), we investigate the mechanisms and dangers of people over-using AI on easy tasks.

https://t.co/1wrfd1qGet

11

126

31

72

14K

@lujainmibrahim

about 1 month ago

We use "sycophancy" to describe not one thing but a family of model behaviors. That makes it hard to measure how "sycophantic" models really are, or compare results across studies. This paper, led by @merylyemerylye, is a great effort at addressing this!

Meryl Ye @merylyemerylye

about 1 month ago

🚨 New preprint 🚨 We developed a sycophancy taxonomy based on prior literature and surveyed 106 experts. 94% agreed it's a serious problem. But they substantially disagreed about which behaviors actually count as sycophancy. Thread 🧵(1/n)

https://t.co/AeLPjOJ748

3

43

14

26

12K

1

15

2

15

3K

lujainmibrahim retweeted

Meryl Ye @merylyemerylye

about 1 month ago

🚨 New preprint 🚨 We developed a sycophancy taxonomy based on prior literature and surveyed 106 experts. 94% agreed it's a serious problem. But they substantially disagreed about which behaviors actually count as sycophancy. Thread 🧵(1/n)

https://t.co/AeLPjOJ748

3

43

14

26

12K

lujainmibrahim retweeted

Meryl Ye @merylyemerylye

about 1 month ago

Construct fragmentation extends outside of academic research. How legislation describes sycophancy differs from how researchers and companies do. If different stakeholders are targeting different behaviors, what does "less sycophantic" actually mean?

https://t.co/ipiyhXX7He

1

5

2

1

351
