Xiaohu Zhu ⏹️ CSAGI @neil_csagi - Twitter Profile

Pinned Tweet

Xiaohu Zhu ⏹️ CSAGI @neil_csagi

3 months ago

Current embarrassment: all AI labs have the "pseudo preparedness".

0

5

0

1

183

neil_csagi retweeted

Seth Lazar

@sethlazar

about 22 hours ago

One of the most ritualistic phrases in the responsible AI (etc) field over the last decade has been the sanctimonious observation that *of course* all reasons ultimately apply to people, not to the AI systems themselves. Similarly, whenever AI exercises power, the catechism required you to say that it did always so *on behalf* of some human/s, never itself. I think this is probably false already; if not, it's only a matter of time.

1

16

4

6

2K

neil_csagi retweeted

Rosie Campbell

@RosieCampbell

1 day ago

The deadline for this is June 21! Please share with anyone who might be interested in contributing to empirical work on digital minds

3

35

8

9

6K

neil_csagi retweeted

Jason Weston

@jaseweston

6 months ago

Our co-improvement position paper is now on arXiv! (We've updated it, covering more existing work.) 📝: https://t.co/xnxWYoMNP7 After >27 years of research, my first position paper! Short 🧵 (1/5) follows 👇 Synopsis: it's about building AI that collaborates on AI research *with us* to solve AI faster, and to help fix the alignment problem together. How? Build the AI with those collab skills (i.e., we create benchmarks! training data! methods! etc. for that). I've been personally inspired by @Yoshua_Bengio's recent talks on safety & AI research, and also from seeing Nicholas Carlini's COLM keynote where he said we researchers can all do our bit to help (paraphrased). So – hope this helps! 🙏

jaseweston's tweet photo. Our co-improvement position paper is now on arXiv!
(We've updated it, covering more existing work.)
📝: https://t.co/xnxWYoMNP7

After >27 years of research, my first position paper!

Short 🧵 (1/5) follows 👇

Synopsis: it's about building AI that collaborates on AI research *with us* to solve AI faster, and to help fix the alignment problem together.

How? Build the AI with those collab skills (i.e., we create benchmarks! training data! methods! etc. for that).

I've been personally inspired by @Yoshua_Bengio's recent talks on safety & AI research, and also from seeing Nicholas Carlini's COLM keynote where he said we researchers can all do our bit to help (paraphrased). So – hope this helps! 🙏

7

250

41

157

41K

Who to follow

Rachel Freedman (will be @ICML2026)

@FreedmanRach

RLHF, LLMS, interpretability & safety | PhD researcher @berkeley_ai | Previously @Cambridge_Uni and @DukeU

Andrew Critch (🤖🩺🚀)

@AndrewCritchPhD

CEO @ https://t.co/xk315mSBah. AI Researcher @ Berkeley. Views my own. I also post my favorite healthy+tasty foods for free.

Victoria Krakovna

@vkrakovna

Research scientist in AI alignment at Google DeepMind. Co-founder of Future of Life Institute @FLI_org. Views are my own and do not represent GDM or FLI.

neil_csagi retweeted

Eliezer Yudkowsky ⏹️

@ESYudkowsky

5 days ago

On a first read, this paper seems far ahead of the pack in terms of (1) understanding some reasons why a task might stay difficult even in the face of gradient descent, and (2) distilling out propositions they'd need to somehow verify before they started expecting nice things.

5

238

16

163

34K

Xiaohu Zhu ⏹️ CSAGI @neil_csagi

5 days ago

@lunarwallfacer_ Can you explain it?

1

0

8

Xiaohu Zhu ⏹️ CSAGI @neil_csagi

6 days ago

Phase transition in alignment community. Safe AGI Game continues.

Geoffrey Irving

@geoffreyirving

6 days ago

We are starting a new, nonprofit alignment organization, ⊢ Sequent Research, bringing together researchers previously on UK AISI’s Alignment Team, Timaeus, and elsewhere to research how to align superintelligence. We are hiring! 🧵

geoffreyirving's tweet photo. We are starting a new, nonprofit alignment organization, ⊢ Sequent Research, bringing together researchers previously on UK AISI’s Alignment Team, Timaeus, and elsewhere to research how to align superintelligence. We are hiring! 🧵 https://t.co/UziUGbIPdU

27

961

144

418

189K

1

0

195

Xiaohu Zhu ⏹️ CSAGI @neil_csagi

6 days ago

@geoffreyirving Congrats!

0

115

neil_csagi retweeted

Cas (Stephen Casper)

@StephenLCasper

9 days ago

According to the MIT Libraries' database of theses (dating back to the 1800s), my thesis was only the 2nd in the institute's history to contain the word "shit."

StephenLCasper's tweet photo. According to the MIT Libraries' database of theses (dating back to the 1800s), my thesis was only the 2nd in the institute's history to contain the word "shit." https://t.co/csIpuxKAAU

6

163

3

25

14K

neil_csagi retweeted

Eliezer Yudkowsky ⏹️

@ESYudkowsky

11 days ago

Today is June 5th, one day to take a break from fighting each other online, and remind ourselves of our shared humanity and common goals by uniting around the one thing we all agree about: Repealing the Jones Act. https://t.co/ajQqtTdt6z

15

408

36

22

18K

neil_csagi retweeted

Holly ⏸️ Elmore

@ilex_ulmus

11 days ago

https://t.co/ogP7cjc3dc

3

37

4

22

5K

Xiaohu Zhu ⏹️ CSAGI @neil_csagi

11 days ago

What I cannot stop, I do not understand. https://t.co/oBOGwc3NXd

0

20

neil_csagi retweeted

Shane Legg

@ShaneLegg

12 days ago

A great interview with our new Director of AGI Economics @alexolegimas and the economist @pawtrammell. It's a good antidote to some of the overly simplistic narratives about the economics of AGI.

6

196

29

91

27K

neil_csagi retweeted

Brian Christian

@brianchristian

13 days ago

Just published in @PNASNews, we resolve a 50-year-old riddle from Richard Feynman's handwritten notes, prove and generalize it, and run a large-scale human study to reveal near-optimal heuristics in sequential decision problems: https://t.co/4AOM1iDqG2

4

87

20

49

8K

Xiaohu Zhu ⏹️ CSAGI @neil_csagi

13 days ago

@DavidSKrueger This episode was recorded on December 4, 2025.

0

1

0

81

neil_csagi retweeted

Tiberiu Mușat

@Tiberiu_Musat_

21 days ago

Why does deep learning generalize? What does weight decay really do? Can algorithmic information theory address these questions? In my latest preprint, I give a proof that the minimum neural weight norm matches the minimum program length (aka Kolmogorov Complexity), up to a logarithmic factor. In other words, the neural network with the smallest possible weight norm (that fits the data) must encode the shortest program (that fits the data). The result only holds for fixed-precision neural nets: infinite precision nets can store infinite information with finite (small) weights. https://t.co/eMZIGQDf2f

Tiberiu_Musat_'s tweet photo. Why does deep learning generalize? What does weight decay really do? Can algorithmic information theory address these questions?

In my latest preprint, I give a proof that the minimum neural weight norm matches the minimum program length (aka Kolmogorov Complexity), up to a logarithmic factor. In other words, the neural network with the smallest possible weight norm (that fits the data) must encode the shortest program (that fits the data).

The result only holds for fixed-precision neural nets: infinite precision nets can store infinite information with finite (small) weights.

https://t.co/eMZIGQDf2f

30

1K

153

1K

147K

neil_csagi retweeted

Elizabeth Barnes

@BethMayBarnes

25 days ago

Sometimes people outside the field say things like “The AI situation can’t be that bad, there must be experts who are on top of it”. As “an expert”, I would like to be clear that we are *not* on top of it. Some key aspects of the situation IMO:

21

1K

185

379

227K

neil_csagi retweeted

Goodfire

@GoodfireAI

26 days ago

The most popular way to interpret AI is missing the bigger picture. Models think in curved shapes. But sparse autoencoders (SAEs) work with straight lines. Can they still capture models’ curved neural geometry? Yes, but not how you might think! (1/7)

25

1K

151

769

174K

neil_csagi retweeted

steve hsu

@hsu_steve

27 days ago

AI Billionaire on Existential Risk: Jaan Tallinn, Manifold episode #112 Jaan Tallinn is a tech billionaire and founding engineer of Skype who leverages his wealth to mitigate existential risks from artificial general intelligence (AGI). He co-founded the Future of Life Institute and the Centre for the Study of Existential Risk, while making early foundational investments in frontier AI labs like DeepMind and Anthropic. 0:00 Assessing Current AI Risk Levels 02:11 Self-Sustaining AI Scenarios 07:55 Global AI Race Dynamics 41:18 Explaining the Techno-Capital Flywheel 46:45 Insider Origins of AI Safety 55:42 Race Politics and Public Fear 01:25:45 Pop Culture, Movies, and Fame 01:30:23 Big Questions for Humanity's Future

5

70

9

45

8K

neil_csagi retweeted

Alec Helbling

@alec_helbling

about 1 month ago

The Helmholtz decomposition is one of the fundamental results of vector calculus. It says any well-behaved vector field can be split into two parts, one capturing sources and sinks through divergence, and one capturing rotation through curl.

33

3K

345

2K

221K

neil_csagi retweeted

Dr. Roman Yampolskiy

@romanyam

about 1 month ago

2nd episode of The Roman Forum is an interview with AI Safety/Governance expert Connor Leahy @NPCollapse. Connor is a great speaker and is lobbying to get government to ban Superintelligence. My first virtual recording. Got a good mic, should probably hold it closer. Lots of room for improvement, but once I get familiar with software/hardware/setup things will feel a lot more natural. Enjoy and subscribe!

21

133

24

31

10K

Xiaohu Zhu ⏹️ CSAGI

@neil_csagi

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users