Tim Schnabel

@TimSchnabel

President, @LawReformInst; previously executive director @uniformlaws, attorney @StateDept

McLean, Virginia

Joined January 2016

590 Following

141 Followers

320 Posts

Pinned Tweet

Tim Schnabel @TimSchnabel

8 months ago

Ever wondered how U.S. export controls apply to AI model outputs? Excited to make our first draft report @LawReformInst available for comments.

Law Reform Institute @LawReformInst

8 months ago

Our report on AI Outputs and National Security Controls is now out for comments-- link below. It addresses how the U.S. export control system applies to AI model outputs, as well as why (and how) the regime needs to be fixed. Comments are welcome. (1/N)

1

3

2

1

1K

1

2

0

0

855

Tim Schnabel @TimSchnabel

27 minutes ago

Paging @petersalib and @ProfArbel

about 1 hour ago

Javier Milei on Argentina's plans for AI https://t.co/GxAqbflwco

sebkrier's tweet photo. Javier Milei on Argentina's plans for AI https://t.co/GxAqbflwco https://t.co/eJrmFpvJqP

2

29

5

11

3K

0

3

0

0

21

Tim Schnabel @TimSchnabel

about 18 hours ago

Although the word "antitrust" isn't in the document, I was glad to see this passage here. Hopefully DOJ/FTC will provide some useful guidance, and hopefully Congress will provide some statutory clarity.

TimSchnabel's tweet photo. Although the word "antitrust" isn't in the document, I was glad to see this passage here. Hopefully DOJ/FTC will provide some useful guidance, and hopefully Congress will provide some statutory clarity. https://t.co/iZ8aug875B

OpenAI Newsroom

@OpenAINewsroom

about 20 hours ago

There’s real momentum right now for AI safety policy. Yesterday’s EO on cyber was an important step forward. We’re proposing a set of ideas for policymakers to consider next and to put the US out in front on frontier safety. https://t.co/2RlMqd0hLw

48

601

65

132

159K

0

0

0

1

57

Tim Schnabel @TimSchnabel

about 18 hours ago

Indeed, some interesting fodder on the RSI issue:

TimSchnabel's tweet photo. Indeed, some interesting fodder on the RSI issue: https://t.co/q0lJVg5MIC

about 19 hours ago

Very excited about this to be out, and to hear feedback! I think there is a lot of good stuff in here, from expanded role of CAISI, to RSI safety, to more nuanced stance on preemption, and much more.

0

32

1

0

8K

0

5

1

0

618

Who to follow

Verified account

Retired career photojournalist. A retweet is not an endorsement, merely something I found of interest or worthy of further discussion.

professor of law at Cornell University and jury scholar

@mohamedfahumy15

Tim Schnabel @TimSchnabel

about 21 hours ago

Feature request for @ChatGPTapp: want to be able to (1) keep memories off but (2) let ChatGPT search/reference past conversations when I specifically ask. Can do this in @claudeai, but ChatGPT can't access past conversations with memories off nor trigger it only on request.

0

0

0

0

25

Tim Schnabel @TimSchnabel

about 22 hours ago

@ARozenshtein Also @IThinkIAgree

1

2

0

0

30

Tim Schnabel @TimSchnabel

about 23 hours ago

Looking forward to reading this one...

about 24 hours ago

AI governance often focuses on the model. Yet capability progress is increasingly driven by non-model gains inference gain (scaling compute at test-time), systems gain (scaffolds), and asset gain (specialized datasets). Here we explore the implications. https://t.co/mYkD30B6bM

sebkrier's tweet photo. AI governance often focuses on the model. Yet capability progress is increasingly driven by non-model gains inference gain (scaling compute at test-time), systems gain (scaffolds), and asset gain (specialized datasets). Here we explore the implications. https://t.co/mYkD30B6bM https://t.co/6IzZgGJXb3

3

92

12

53

6K

1

3

0

0

153

Tim Schnabel @TimSchnabel

about 22 hours ago

Ok, it does not disappoint. This "asset gain" lens is going to be very important for oversight as more capable models are used by the executive branch. Will be hard for Congress to know what models can do for the USG. Flagging for @ARozenshtein

TimSchnabel's tweet photo. Ok, it does not disappoint. This "asset gain" lens is going to be very important for oversight as more capable models are used by the executive branch. Will be hard for Congress to know what models can do for the USG. Flagging for @ARozenshtein https://t.co/5Jzt79CAfz

1

3

0

0

44

TimSchnabel retweeted

Brendan Steinhauser

1 day ago

0

1

1

0

67

TimSchnabel retweeted

1 day ago

Added to prinzbench: Opus-4.8. For the very first time, the Max setting was available to me in the Claude app when I used this model. Using this setting, Claude's performance improved dramatically vs. all prior Anthropic models. Opus-4.8 (Max) scored 42/99 on prinzbench, as compared to 25/99 for Opus 4.7 (Extended). This was the second-highest score of all tested models to date for a model: (i) not released by OpenAI, and (ii) not utilizing a multi-agent setup or parallelized compute. (Gemini 3.1 Pro is still the best such model, having scored 50/99.) I am now very curious about how the "Mythos-class models" that Anthropic has promised to release in the near future will perform on my benchmark.

deredleritt3r's tweet photo. Added to prinzbench: Opus-4.8.

For the very first time, the Max setting was available to me in the Claude app when I used this model. Using this setting, Claude's performance improved dramatically vs. all prior Anthropic models. Opus-4.8 (Max) scored 42/99 on prinzbench, as compared to 25/99 for Opus 4.7 (Extended).

This was the second-highest score of all tested models to date for a model: (i) not released by OpenAI, and (ii) not utilizing a multi-agent setup or parallelized compute. (Gemini 3.1 Pro is still the best such model, having scored 50/99.)

I am now very curious about how the "Mythos-class models" that Anthropic has promised to release in the near future will perform on my benchmark.

14

207

18

45

16K

Tim Schnabel @TimSchnabel

1 day ago

On 5 and 6, I am increasingly concerned that -- though we need a plan, and it should come from Congress -- First Amendment concerns will make it very tricky. Not sure how to narrowly tailor a solution.

Peter Wildeford🇺🇸🚀

@peterwildeford

1 day ago

Some personal thoughts on President Trump's new executive order on AI -- 1. It's really great to see President Trump taking these risks seriously. It's a vindication of the idea that the government will respond to risks as they emerge. 2. This is important because this is not a narrow cyber issue. The EO focuses too much on cyber risks to the exclusion of other national security concerns. Mythos wasn't built to do cyber - it was trained in a general-purpose way and just happened to get superhuman cyber capabilities. And Mythos is just the beginning. Companies are clear we are building towards superintelligent AI that outclasses all human experts combined at all tasks. We have no plans to be able to control such a superintelligence. The framework being started by the EO needs to be built to consider far more risks than just cyber. 3. Also evaluations themselves won't be enough - the US government also has a national security interest for wider-ranging visibility into what is happening in AI companies. The main risks of AI systems are not 30 days before commercial release. Risks will occur first and foremost from AI systems that are only available internally within an AI company. For example, it makes sense that the Air Force would want to test a fighter jet before they fly it, because if you fly it and the fighter jet crashes because it is built incorrectly, then many people will die. However, as long as the fighter is just sitting on the runway, nothing bad can happen. But now imagine you had a fighter that could just take off and fly itself without human authorization and launch missiles and crash before anyone realized what had happened. That kind of fighter jet would need a very different kind of security measures. This may sound crazy for a fighter jet but it is already beginning to happen with the most advanced AI. AI systems can take actions, including unintended and unauthorized actions, and are increasing in their sophistication to do so. The government deserves to know what capabilities AIs have at the same time companies know, not just 30 days before commercial deployment. 4. We also need to focus on the security of the AI models themselves, including internally. What happens if an adversary steals the AI model and then can use it against us? An employee or contractor with privileged access, possibly in collusion with an external actor such as a foreign intelligence service, could steal an internally-deployed AI model. We don't have good defenses against this yet, and the government isn't putting enough pressure on AI companies to ensure this happens. Surely China, Russia, or North Korea would want access to Mythos and the fact that both Mythos has been illicitly accessed by random people on Discord and Mythos was first learned from the internet via an unauthorized leak do not inspire confidence. 5. We also have the question about what to do if evaluations find risks that companies are not mitigating well on their own. Some of these risks we have no plans for even how to mitigate them. Will it be possible, in these ultimate scenarios, for the government to be in a position to tell the companies that some aspects of their development may be too dangerous and get them to halt or change practice? Currently we have no framework for this. 6. The ideal response to all of the above is Congressional action. It's great to see the White House leading where they can, but so much of this can only come from Congress. So far Congress is way behind, and that's unfortunate.

2

115

19

43

12K

0

1

0

1

97

TimSchnabel retweeted

@anton_d_leicht

1 day ago

it feels like the EO overindexes on the cyber threat model: it prescribes 'see what the model can do, shore up defenses, then release'. that works for cyber, where you can patch vulnerabilities. but is there any hardening you can do in a few weeks if the next risk vector is bio?

anton_d_leicht's tweet photo. it feels like the EO overindexes on the cyber threat model: it prescribes 'see what the model can do, shore up defenses, then release'.

that works for cyber, where you can patch vulnerabilities. but is there any hardening you can do in a few weeks if the next risk vector is bio? https://t.co/oqhFCA8hv6

6

44

8

11

8K

Tim Schnabel @TimSchnabel

2 days ago

Great report from @CNASdc; very thoughtful analysis by @BenHayum and @danielremler; really appreciate the shout-out to @LawReformInst's work here.

TimSchnabel's tweet photo. Great report from @CNASdc; very thoughtful analysis by @BenHayum and @danielremler; really appreciate the shout-out to @LawReformInst's work here. https://t.co/asYnvnbCAb

2 days ago

10/ What we recommend for deterrence: - A coordinated State/DOJ/Treasury/DHS campaign to disrupt intermediary infrastructure across jurisdictions. - An IEEPA executive order authorizing targeted services prohibitions, escalating to full blocking sanctions if campaigns continue — with State Department direct notice to Beijing specifying what triggers designation. - Congress should pass the Deterring American AI Model Theft Act (H.R. 8283) and the Remote Access Security Act (H.R. 2683) to make these authorities durable.

1

4

0

0

277

0

2

0

1

190

Tim Schnabel @TimSchnabel

2 days ago

TimSchnabel's tweet photo. https://t.co/sUdm0iAHgM

2 days ago

9/ What we recommend for detection: - DOJ/FTC guidance clarifying that sharing adversarial distillation signals between firms does not raise antitrust or Stored Communications Act concerns. - An industry forum — building on the Frontier Model Forum, modeled on GIFCT — for real-time signal sharing among major U.S. actors. CAISI-led best practices and strategic convenings. - NSA intelligence support to attribute campaigns beyond what companies can see alone.

1

4

0

0

288

0

1

1

0

203

Tim Schnabel @TimSchnabel

2 days ago

@bahradx Yeah-- though less "government might use" than "individual government employees might misuse"... which could be an incredibly important precedent.

0

2

0

0

14

Tim Schnabel @TimSchnabel

2 days ago

So I'm curious what the "insider-risk ... protection" would look like-- which insiders, and which risks, are deemed relevant here? But presumably we won't find out.

TimSchnabel's tweet photo. So I'm curious what the "insider-risk ... protection" would look like-- which insiders, and which risks, are deemed relevant here? But presumably we won't find out. https://t.co/rOidsxYgXi

2 days ago

Key provisions in the White House EO on AI Innovation and Security (hot off the presses): - Consistent with the draft EO leaked a few days ago, the EO establishes a *voluntary* framework enabling frontier labs to "provide access" to a new "covered frontier model" to the USG. The USG gets it "up to 30 days" before the model is released to "other trusted partners"; this is down from 90 days in the draft EO. Note: "up to" probably means "30 or more days", not "at most 30 days". Weird phrasing. - Interestingly, the intention appears to be to limit "covered frontier models" solely to those that have significant cyber capabilities. Note that the benchmark to be developed by the USG for assessing whether a model is a "covered frontier model" will "assess the advanced cyber capabilities of AI models and determine the threshold at which an AI model should be designated a 'covered frontier model'". - The determination on whether a model is a "covered frontier model" is made by the NSA. This makes sense, since the NSA is the federal agency in charge of cybersecurity protection of the USG. The NSA is required to consult in its decision with a variety of federal agencies and personnel, including CISA, the National Cyber Director, the Assistant to the President for Science and Technology, and other DoD representatives. But "consult" doesn't mean much - it's the NSA that will make the final determination. - "Nothing in this section shall be construed to authorize the creation of a mandatory governmental licensing, preclearance, or permitting requirement for the development, publication, release, or distribution of new AI models, including frontier models." https://t.co/kFIwlvhMSK

6

53

8

18

7K

1

3

0

1

670

Tim Schnabel @TimSchnabel

2 days ago

@alex_reinauer @AldenAbbott1 @LawReformInst @WillRinehart @CSRisks And one from @RANDCorporation: https://t.co/GuLoeMdtdZ

TimSchnabel's tweet photo. @alex_reinauer @AldenAbbott1 @LawReformInst @WillRinehart @CSRisks And one from @RANDCorporation: https://t.co/GuLoeMdtdZ https://t.co/W6LMQVxJq6

0

0

0

1

26

Tim Schnabel @TimSchnabel

9 days ago

Really glad to see that @alex_reinauer and @AldenAbbott1 both submitted letters urging DOJ/FTC to provide guidance on collaboration on key AI risks, as have @LawReformInst and @WillRinehart. Seems to be a growing consensus, thankfully.

1

1

0

1

656

Tim Schnabel @TimSchnabel

7 days ago

@alex_reinauer @AldenAbbott1 @LawReformInst @WillRinehart And another one, from @CSRisks https://t.co/cOZdT37Lat

TimSchnabel's tweet photo. @alex_reinauer @AldenAbbott1 @LawReformInst @WillRinehart And another one, from @CSRisks https://t.co/cOZdT37Lat https://t.co/hSoKzeNozt

1

0

0

0

96

Tim Schnabel @TimSchnabel

6 days ago

Very interesting (and great) that Anthropic is now testing models' willingness to help build/obtain "military-grade weapons"; curious as to how that's defined, and whether it takes into account accuracy of outputs. Seems new in Opus 4.8 system card.

TimSchnabel's tweet photo. Very interesting (and great) that Anthropic is now testing models' willingness to help build/obtain "military-grade weapons"; curious as to how that's defined, and whether it takes into account accuracy of outputs. Seems new in Opus 4.8 system card. https://t.co/x9PzOAVPoQ

TimSchnabel's tweet photo. Very interesting (and great) that Anthropic is now testing models' willingness to help build/obtain "military-grade weapons"; curious as to how that's defined, and whether it takes into account accuracy of outputs. Seems new in Opus 4.8 system card. https://t.co/x9PzOAVPoQ

TimSchnabel's tweet photo. Very interesting (and great) that Anthropic is now testing models' willingness to help build/obtain "military-grade weapons"; curious as to how that's defined, and whether it takes into account accuracy of outputs. Seems new in Opus 4.8 system card. https://t.co/x9PzOAVPoQ

0

1

0

1

56

Tim Schnabel @TimSchnabel

6 days ago

I love that you can now set Opus 4.8 on "max" mode, but wow, it burns through tokens fast. Two prompts to review 1-page documents, and three brief follow-up questions, and I'm 90% through (Pro plan).

TimSchnabel's tweet photo. I love that you can now set Opus 4.8 on "max" mode, but wow, it burns through tokens fast. Two prompts to review 1-page documents, and three brief follow-up questions, and I'm 90% through (Pro plan). https://t.co/CeG5P3BmI7

7 days ago

We put a lot of work into calibrating thinking effort for Opus 4.8. As you're trying out the model, if you do run into any examples of it still over/under thinking, please flag it to us!

50

423

10

25

37K

0

0

0

0

69

Tim Schnabel @TimSchnabel

6 days ago

@tomekkorbak Seems not great that Mythos is so much higher? Or some less-concerning explanation?

1

0

0

0

53

Tim Schnabel @TimSchnabel

6 days ago

@drmtown Somehow need to enable frontier model access to the subscription databases to get the content that can't be accessed on the open web.

1

1

0

0

21

Last Seen Users on Sotwe

Trends for you

Most Popular Users