MIRI

8 months ago

#7 Combined Print & E-Book Nonfiction #8 Hardcover Nonfiction

17

190

13

17

33K

MIRIBerkeley retweeted

The Spectator @spectator

7 days ago

'Some people believe that if machines decide to kill us it's the right thing to do because they're smart' AI researcher Nate Soares reveals that some factions in Silicon Valley mistakenly believe that if an AI is exceptionally intelligent it must also be highly moral. @So8res @Freddygray31

8

30

10

11

6K

6 days ago

This just passed 2M views! If you haven't seen it yet, check out this AI In Context video on "If Anyone Builds It, Everyone Dies" https://t.co/y0luTYbtL0

2

69

16

10

4K

MIRIBerkeley retweeted

OpenAI’s mission is to ensure that artificial general intelligence benefits all of humanity. We’re hiring: https://t.co/dJGr6LgzPA

9 days ago

Ten places where Magnifica Humanitas matters for AI. At 42k words long, Pope Leo XIV’s new encyclical has a lot to say. In our most recent Digest, Mitchell Howe outlines the parts which might be the most impactful.

AIStopWatch's tweet photo. Ten places where Magnifica Humanitas matters for AI.

At 42k words long, Pope Leo XIV’s new encyclical has a lot to say. In our most recent Digest, Mitchell Howe outlines the parts which might be the most impactful. https://t.co/PvLSYzWynk

1

6

3

1

716

Who to follow

OpenAI

@OpenAI

Future of Life Institute

@FLI_org

We work on reducing extreme risks and steering transformative technologies to benefit humanity. RT /=/ endorsement. Bluesky: https://t.co/IjvxJtEEeQ

MIT CSAIL

@MIT_CSAIL

MIT's Computer Science & Artificial Intelligence Laboratory (CSAIL). Media Inquiries: [email protected] Check out the latest CSAIL content ⬇️

MIRIBerkeley retweeted

11 days ago

What will be the impact of AI industry super PACs? "The takeaway here is that this year’s U.S. midterm elections are being aggressively shaped by different factions of the AI industry sometimes supporting the same candidates, sometimes different candidates, buying ads that don’t have anything to do with AI."

1

6

2

1

925

12 days ago

If you're interested in creating videos about the extinction threat posed by superhuman AI, consider applying to this bootcamp!

Aella

@Aella_Girl

12 days ago

First round of applications for PDKU close in two days! We'll send out acceptance letters by June 1st. You should apply now! It's gonna be fun and you'll make friends and do weird shit (U can still apply after that, but there will be fewer slots and your chances will be lower)

Aella_Girl's tweet photo. First round of applications for PDKU close in two days! We'll send out acceptance letters by June 1st.
You should apply now! It's gonna be fun and you'll make friends and do weird shit
(U can still apply after that, but there will be fewer slots and your chances will be lower) https://t.co/s5IkDKdMd1

38

415

22

150

553K

2

36

5

7

4K

MIRIBerkeley retweeted

Elizabeth Barnes

@BethMayBarnes

13 days ago

Our report focuses on claims that are (1) solidly defensible and (2) generally agreed within METR. Here I’ll give some personal opinions on how we should feel about the state of AI risk, and the IMO most important limitations of the report.

13

449

54

228

64K

MIRIBerkeley retweeted

13 days ago

"Gathering information is perhaps an important step forward, but it's not nearly enough." In today's Digest, Joe Rogero discusses the new executive order from CA governor Gavin Newsom.

AIStopWatch's tweet photo. "Gathering information is perhaps an important step forward, but it's not nearly enough."

In today's Digest, Joe Rogero discusses the new executive order from CA governor Gavin Newsom. https://t.co/J4qerWg20h

1

6

1

1K

13 days ago

https://t.co/o06QEIKjSM

1

10

0

1

1K

13 days ago

An internal model at OpenAI has autonomously disproved a central conjecture in discrete geometry, a mathematical field with applications in cryptography, wireless device communication, and medical imaging. The proof relates to a famous question posed by Paul Erdős in 1946. It has been verified by prominent mathematicians in a companion paper. The verifying mathematicians consider this to be a genuinely novel breakthrough on one of the most discussed problems in this area of mathematics. One called it “arguably the best known problem in Discrete Geometry.” Another observed, “If a human had written the paper and submitted it to the Annals of Mathematics and I had been asked for a quick opinion, I would have recommended acceptance without any hesitation. No previous AI-generated proof has come close to that.” The proof illustrates a general trend towards autonomous, agentic problem-solving in AI systems. OpenAI describes the system that produced the proof as a general-purpose model not specialized in mathematics. AIs can now perform long, novel chains of reasoning on difficult problems and are beginning to outstrip our ability to measure their progress. AI agents still perform best in domains with easily verifiable outputs, such as mathematics and cybersecurity. For example, Anthropic's Claude Mythos found thousands of vulnerabilities across every major operating system and web browser, and was deemed too dangerous for public release. Such capabilities are why the government is now more interested in evaluating frontier AI models. AI research is also a field with many easily verifiable outputs. Researchers at OpenAI and Anthropic take advantage of this fact to accelerate their work; senior researchers now claim they make only high-level decisions and let AI handle most of the coding. Experimenting with the coding capabilities of a publicly available AI system, like Claude Code, immediately demonstrates how far AI has come in the last year. OpenAI and Anthropic intend to use AI to enhance future models with minimal human oversight. To justify the urgency, these companies cite the importance of beating rival U.S. or Chinese labs. Many of the field’s foremost experts warn that this race ends with human extinction. Policymakers and researchers, including the founders of the AI revolution, are calling for international restrictions on the technology. A growing bipartisan and international consensus of political leaders agree.

MIRIBerkeley's tweet photo. An internal model at OpenAI has autonomously disproved a central conjecture in discrete geometry, a mathematical field with applications in cryptography, wireless device communication, and medical imaging. The proof relates to a famous question posed by Paul Erdős in 1946. It has been verified by prominent mathematicians in a companion paper.

The verifying mathematicians consider this to be a genuinely novel breakthrough on one of the most discussed problems in this area of mathematics. One called it “arguably the best known problem in Discrete Geometry.” Another observed, “If a human had written the paper and submitted it to the Annals of Mathematics and I had been asked for a quick opinion, I would have recommended acceptance without any hesitation. No previous AI-generated proof has come close to that.”

The proof illustrates a general trend towards autonomous, agentic problem-solving in AI systems. OpenAI describes the system that produced the proof as a general-purpose model not specialized in mathematics. AIs can now perform long, novel chains of reasoning on difficult problems and are beginning to outstrip our ability to measure their progress.

AI agents still perform best in domains with easily verifiable outputs, such as mathematics and cybersecurity. For example, Anthropic's Claude Mythos found thousands of vulnerabilities across every major operating system and web browser, and was deemed too dangerous for public release. Such capabilities are why the government is now more interested in evaluating frontier AI models.

AI research is also a field with many easily verifiable outputs. Researchers at OpenAI and Anthropic take advantage of this fact to accelerate their work; senior researchers now claim they make only high-level decisions and let AI handle most of the coding. Experimenting with the coding capabilities of a publicly available AI system, like Claude Code, immediately demonstrates how far AI has come in the last year.

OpenAI and Anthropic intend to use AI to enhance future models with minimal human oversight. To justify the urgency, these companies cite the importance of beating rival U.S. or Chinese labs. Many of the field’s foremost experts warn that this race ends with human extinction.

Policymakers and researchers, including the founders of the AI revolution, are calling for international restrictions on the technology. A growing bipartisan and international consensus of political leaders agree.

10

262

30

59

14K

MIRIBerkeley retweeted

14 days ago

In today's Digest: * OpenAI, Anthropic, SpaceX race to file IPO. * The AI executive order is postponed. * METR evaluates rogue deployment risks. * AI makes a breakthrough in mathematics.

AIStopWatch's tweet photo. In today's Digest:

* OpenAI, Anthropic, SpaceX race to file IPO.
* The AI executive order is postponed.
* METR evaluates rogue deployment risks.
* AI makes a breakthrough in mathematics. https://t.co/gVS6eqML6Y

1

11

1

715

Center For Humane Technology

16 days ago

New paper from MIRI data scientist @robi_rahman: Does Distributed Training Undermine Compute Governance?

Robi Rahman

@robi_rahman

16 days ago

1/ With distributed training, you could violate an AI pause treaty by training a GPT-4-scale model over consumer internet, using hardware below every proposed compute governance threshold, for under $100M. My new paper in @taig_icml explains how to catch this and shut it down.

11

195

15

90

131K

0

36

2

6

2K

MIRIBerkeley retweeted

@HumaneTech_

20 days ago

The wait is over! Starting today, May 15, you can stream @theaidocfilm on @peacock. This film takes the dizzying complexity of AI — the promise, the peril, the competing ideologies, the economic incentives — and creates a shared experience we can all see and respond to. Then, after you've watched, head to https://t.co/AQ5JkWwrOO to explore concrete actions we can all take to build a better future with AI.

1

45

10

3K

21 days ago

For more, check out the paper from MIRI's Technical Governance Team. https://t.co/nbq900VuLC

0

9

0

543

21 days ago

For an agreement like this to be effective, both the US and China would need to be party to it. Here’s one plausible path to that outcome, in six stages 🧵

Peter Barnett

@peterbarnett_

7 months ago

We at the MIRI Technical Governance Team just put out a report describing an example international agreement to prevent the creation of superintelligence. 🧵

peterbarnett_'s tweet photo. We at the MIRI Technical Governance Team just put out a report describing an example international agreement to prevent the creation of superintelligence. 🧵 https://t.co/6d5l9VVt0q

11

136

21

36

41K

1

85

15

24

7K