William Isaac @wsisaac - Twitter Profile

William Isaac

@wsisaac

3 days ago

Great work!

Arkadiy Saakyan @rkdsaakyan

5 days ago

Excited to share #ICML2026 paper from my internship @GoogleDeepMind! AI models are deployed globally, but safety datasets are largely geographically homogeneous. What is the impact of culture on AI safety ratings? Is there impact beyond standard demographics?

1

11

3

2

875

0

3

0

1

457

wsisaac retweeted

Nathan Benaich

@nathanbenaich

18 days ago

londonmaxxing loading @raais

4

54

8

6

6K

William Isaac

@wsisaac

18 days ago

Seb Krier

Alex Imas

@alexolegimas

19 days ago

Seb Krier everybody (on Justified Posteriors with the dynamic duo of @AndreyFradkin & @SBenzell) https://t.co/KykOulxBaM

2

57

8

13

9K

0

3

0

1

371

wsisaac retweeted

Ali Eslami

@arkitus

19 days ago

Google I/O TLDR: Good improvements across the whole AI stack.

3.5 Flash (the model): More persistent than before and codes well. No wall yet.

Antigravity (the harness): Reliably runs for hours now. Early signs of hands-free self-improvement.

Spark (the interface): Finally connects a decent model and harness to your email, calendar and workspace. Instead of just answering questions it can actually do work for you. Skills and schedules and all the other claw goodness.

Omni (the future): Closes the gaps between Gemini for text and visual/audio generation variants. This is the way.

TPU8i (the hardware): Better chips to make all the above go faster.

7

98

19

25

13K

William Isaac

@wsisaac

20 days ago

🙌🏿

Verena Rieser @verena_rieser

20 days ago

I am looking forward to giving an invited keynote at @icmlconf . See you in Seoul 🇰🇷

1

52

3

1

10K

0

2

0

1

482

William Isaac

@wsisaac

23 days ago

Geoffrey is an amazing leader and colleague. It is bittersweet to see him leave AISI, but I’m excited about his new research initiative!

Geoffrey Irving

@geoffreyirving

23 days ago

A bittersweet announcement! For family reasons, I will be leaving AISI soon to move back to the Bay Area. I will be starting a new nonprofit alignment research org (more to come). I will miss this place! Here are some reflections about my time at AISI. 🧵❤️

20

705

29

125

95K

0

10

0

2

1K

William Isaac

@wsisaac

23 days ago

Reputation-based mechanisms for accountability seem to be a likely direction, esp. for agentic use cases.

Ethan Mollick

@emollick

23 days ago

Making humans responsible for their AI use seems like an incredibly reasonable way to address problems & opportunities in the use of AI for academic research, at least in the short term (autonomous scientific work will require different solutions).

26

329

29

54

31K

0

8

1

915

wsisaac retweeted

joal stein

@JoalStein

25 days ago

This paper from @sebkrier @RubenLaukkonen, et al. captures the @collect_intel agenda better than I usually do myself.

Defining what constitutes flourishing needs to be a whole of society process; constituting those values in the form of AI constitutions requires better practices and methodologies (@ahall_research is one to follow here, and I'll have something to share soon); proving out new AI-enabled collective intelligence processes that allow people to better negotiate tradeoffs and competing priorities (you know, democracy); building better institutions that enable and coordinate human flourishing; moving from red lines to greenlines (https://t.co/hCWut1yAaH)

3

37

5

11

4K

wsisaac retweeted

Stephanie Chan @scychan_brains

25 days ago

Our new paper introduces "Positive Alignment" 💛 Traditional safety alignment focuses on reducing harms -- can we create a complementary field that focuses on increasing human flourishing?

7

67

6

19

5K

wsisaac retweeted

Maty Bohacek @matybohacek

26 days ago

AI alignment has been almost exclusively focused on safety applications (i.e., avoiding harms). Today, we’re thrilled to introduce a complementary direction that explores how AI systems can be aligned, in a pluralistic way, around human flourishing as the guiding principle. https://t.co/Cm0Jhl3Bmm

5

66

7

31

9K

wsisaac retweeted

Verena Rieser @verena_rieser

26 days ago

AI responsibility & alignment has focused on "negative alignment": building guardrails to stop models from causing harm. While vital, this only establishes a behavioural floor. It's time for a new paradigm! *Positive Alignment*: Artificial Intelligence for Human Flourishing

0

13

2

1

604

wsisaac retweeted

Séb Krier

@sebkrier

26 days ago

If anyone builds it, everyone thrives. Over the past decade, a lot of important work on AI alignment has focused on avoiding harm. But freedom from harm isn't the same as freedom to flourish.

In this paper, we introduce 'Positive Alignment'. A positively aligned agent is one that helps us navigate our own value trade-offs, builds our resilience, and acts as a scaffold for human flourishing. Doing this without slipping into top-down, technocratic paternalism is the great design challenge of our time.

We think a lot more research is now needed to explore this frontier: how do we align models that actively help us thrive?

Amazing work by @RubenLaukkonen, @drmichaellevin, @weballergy, @verena_rieser, @AdamCElwood, @996roma, @FranklinMatija, @shamilch, @_fernando_rosas, @scychan_brains, @matybohacek, @sudoraohacker, and others.

https://t.co/YNL0cZqYD9

87

1K

231

732

322K

William Isaac

@wsisaac

25 days ago

Amazing work by teams across GDM!

Séb Krier

@sebkrier

26 days ago

If anyone builds it, everyone thrives. Over the past decade, a lot of important work on AI alignment has focused on avoiding harm. But freedom from harm isn't the same as freedom to flourish. In this paper, we introduce 'Positive Alignment'. A positively aligned agent is one that helps us navigate our own value trade-offs, builds our resilience, and acts as a scaffold for human flourishing. Doing this without slipping into top-down, technocratic paternalism is the great design challenge of our time. We think a lot more research is now needed to explore this frontier: how do we align models that actively help us thrive? Amazing work by @RubenLaukkonen, @drmichaellevin, @weballergy, @verena_rieser, @AdamCElwood, @996roma, @FranklinMatija, @shamilch, @_fernando_rosas, @scychan_brains, @matybohacek, @sudoraohacker, and others. https://t.co/YNL0cZqYD9

87

1K

231

732

322K

0

9

1

4

970

William Isaac

@wsisaac

27 days ago

Amazing team to collaborate with!

Hannah Rose Kirk @hannahrosekirk

27 days ago

My team at @AISecurityInst studies how frontier AI shapes what we believe, decide, and feel - and we're hiring! 🚨 The role is a 6-month RA residency in London, ideal for MScs / early PhDs in ML, psych, cog/data sci [1 June deadline] Get a taste of our recent research below 👇

8

284

39

254

25K

0

8

2

1K

wsisaac retweeted

Demis Hassabis

@demishassabis

about 1 month ago

It’s incredible! Banksy is a genius.

74

2K

130

157

211K

William Isaac

@wsisaac

about 1 month ago

@DynamicWebPaige @siggraph Very cool!

0

1

0

24

William Isaac

@wsisaac

about 2 months ago

💯

Seán Ó hÉigeartaigh

@S_OhEigeartaigh

about 2 months ago

More people should follow @m_botvinick . Extravagantly accomplished scholar across AI, neuroscience, cognition, democracy and more - his Google Scholar is a vibe. Asking the right questions about AI and democracy, and I'm v grateful he is. Just started tweeting, so play nice so he doesn't think better of it. <200 followers.

3

92

6

38

11K

0

1

0

478

William Isaac

@wsisaac

about 2 months ago

This is very exciting to hear!

Emma Zang 臧熙璐

@DrEmmaZang

2 months ago

Honestly, I’m increasingly impressed by political methodology as a field. At our AI methods workshop, ~half the submissions are coming from political science, and many of the strongest ones too. Same pattern with postdoc applications to my lab. Something is clearly working there: strong training, fast adoption of new methods, and a willingness to engage with real empirical problems. Worth paying attention to.

3

78

3

32

8K

0

3

0

1

718

William Isaac

@wsisaac

2 months ago

We are just warming up!

Oliver Klingefjord

@klingefjord

2 months ago

@FranklinMatija and @weballergy have been on 🔥🔥🔥 lately!

0

8

2

0

1K

1

9

1

2

666

William Isaac

@wsisaac

2 months ago

Matija and Nenad are developing some excellent thinking around agents and governance!

Matija Franklin

@FranklinMatija

2 months ago

Excited about our new paper: AI Agent Traps

AI agents inherit every vulnerability of the LLMs they're built on - but their autonomy, persistence, and access to tools create an entirely new attack surface: the information environment itself.

The web pages, emails, APIs, and databases agents interact with can all be weaponised against them. We introduce a taxonomy of six classes of adversarial threats - from prompt injections hidden in web pages to systemic attacks on multi-agent networks.

I’m outlining the six categories of traps in the thread below

74

630

162

526

60K

4

25

6

2K
