Daniel and 10 others

1 day ago

AI SDK now supports agent harnesses like Claude Code, Codex, and Pi with sandboxed sessions and AI SDK-compatible streams: 𝚌𝚘𝚗𝚜𝚝 𝚊𝚐𝚎𝚗𝚝 = 𝚗𝚎𝚠 𝙷𝚊𝚛𝚗𝚎𝚜𝚜𝙰𝚐𝚎𝚗𝚝({ 𝚑𝚊𝚛𝚗𝚎𝚜𝚜: 𝚌𝚕𝚊𝚞𝚍𝚎𝙲𝚘𝚍𝚎, 𝚜𝚊𝚗𝚍𝚋𝚘𝚡: 𝚌𝚛𝚎𝚊𝚝𝚎𝚅𝚎𝚛𝚌𝚎𝚕𝚂𝚊𝚗𝚍𝚋𝚘𝚡(), }); Available in canary: 𝚗𝚙𝚖 𝚒 𝚊𝚒@𝚌𝚊𝚗𝚊𝚛𝚢. We welcome your feedback as we bring agent harness portability to the ecosystem, with excellent DX. https://t.co/xojn6am7sg

850

783

340K

techwraith retweeted

AI SDK

@aisdk

2 days ago

Prevent your agent from going rogue. A policy gives your agent runtime guardrails for AI SDK tool calls. Create policies as code via Open Policy Agent.

aisdk's tweet photo. Prevent your agent from going rogue.

A policy gives your agent runtime guardrails for AI SDK tool calls. Create policies as code via Open Policy Agent. https://t.co/ZAIleux2A4

Mad-scientist investor main-questing Europe 🇪🇺 @prototype_cap 🦾 @euinc_petition 🇪🇺 🔧-prev: @producthunt @angellist @coinlist @beondeck ❤️ @susanneknoll

3 days ago

@tdinh_me @rauchg Noted!

Who to follow

Andreas Klinger 🦾

@andreasklinger

Elyse Davis

@Emoneylady

Finance & Administration @craft_ventures. Previously Finance @Zinc and @Yammer. SF peninsula native. Lover of tech, music, travel, Tahoe, and bay area sports.

Raul Tiru 🌎

@raultiru

The World Needs Our Lord and Saviour Jesus Christ. #christian #Jesus #Christianity

techwraith retweeted

shadcn

@shadcn

4 days ago

You have Claude Fable for only a few days. Here's how to make the most of it. Introducing /improve: use your most capable model to audit your codebase and write plans for cheaper models to execute later. Studies your code, figures out bugs, perf, tech debt, missing tests, what to build and writes plans any agent can run.

shadcn's tweet photo. You have Claude Fable for only a few days. Here's how to make the most of it.

Introducing /improve: use your most capable model to audit your codebase and write plans for cheaper models to execute later.

Studies your code, figures out bugs, perf, tech debt, missing tests, what to build and writes plans any agent can run.

177

382

744K

techwraith retweeted

Guillermo Rauch

@rauchg

4 days ago

Vercel CLI now allows you to: ◾ create AI Gateway API keys ◾ pass a --𝚋𝚞𝚍𝚐𝚎𝚝 to cap their spend ◾ set a --𝚛𝚎𝚏𝚛𝚎𝚜𝚑-𝚙𝚎𝚛𝚒𝚘𝚍 for the quota Think of it as virtual credit cards for AI tokens 🤖💳

rauchg's tweet photo. Vercel CLI now allows you to:
◾ create AI Gateway API keys
◾ pass a --𝚋𝚞𝚍𝚐𝚎𝚝 to cap their spend
◾ set a --𝚛𝚎𝚏𝚛𝚎𝚜𝚑-𝚙𝚎𝚛𝚒𝚘𝚍 for the quota

Think of it as virtual credit cards for AI tokens 🤖💳 https://t.co/ZOuhwIp7h5

247

37K

techwraith retweeted

Dan Greenheck

@dangreenheck

4 days ago

Jokingly asked Fable to build me Crysis in Three.js. It may not be Crysis, but the fact this is all done procedurally in basically one shot is kind of blowing my mind right now.

105

762

230K

techwraith retweeted

4 days ago

AI Gateway API keys now support budgets. Set one programmatically or configure on the API keys page: 𝚟𝚎𝚛𝚌𝚎𝚕 𝚊𝚒-𝚐𝚊𝚝𝚎𝚠𝚊𝚢 𝚊𝚙𝚒-𝚔𝚎𝚢𝚜 𝚌𝚛𝚎𝚊𝚝𝚎 \ --𝚗𝚊𝚖𝚎 𝚖𝚢-𝚔𝚎𝚢 \ --𝚋𝚞𝚍𝚐𝚎𝚝 𝟷𝟶𝟶𝟶 https://t.co/JbrlSngt6F

49K

techwraith retweeted

Taelin

@VictorTaelin

4 days ago

this is my personal singularity moment this post may sound like a paid ad. I only wish. I'm concerned, more so than happy. the world is changing, and, among the scenarios where AI goes terribly wrong, inequality is the most realistic, yet, the one Anthropic seems to be the least concerned about. I'm glad OpenAI is taking the opposite stance: *personal AGI for everyone*. I think this is a commendable position in the times we live. but who am I in the queue of the bread? anyway, Fable is here, so I'll just report my first-hour experience first of all, all my pet prompts are solved. → λ-calculus puzzles → bug questions → one-shot apps all are trivial to it. I don't have anything harder other than my ongoing work so, in the last several days, I've been toying with HVM5, a new interaction net evaluator with a faster loop. after writing the first version, I left 32 GPT-5 agents working for ~20 hours each. this resulted in up to 2x speedups, but the file size increased by 2-fold and quality decreased significantly. I then simplified the whole thing into an even simpler core, and left Opus 4.8 and GPT 5.5 optimizing it for 8 hours. Opus got a legit 6% - 34% speedup in most benches. GPT got better results, but, sadly, an unusable file. I then asked Fable to optimize it. 2 hours later, it landed a 1770% speedup in one case, 100%+ in other 4, and 22% in average. yes, in 2 hours it outperformed me, opus 4.8 and a swarm of gpt 5.5 agents, by one order of magnitude. that could not possibly be legit. "it must be hardcoding the benchmarks" (GPT trauma). so I read its explanation and what it did was, indeed, the most high impact optimization one could try first. seems like HVM5 was wasting a lot of time garbage-collecting unused branches of pattern-match nodes. I had optimized that for static mats, but not for dynamic mats. skill issue. Fable figured how to do it for these, resulting in a massive speedup in some benches but wait, is that *correct*? I'm not sure yet, it is credible, but this is the kind of thing that is very easy to get wrong on interaction nets. the problem is, when I was ready to start auditing Fable's solution so I could tell whether it was buggy or legit, it interrupted me to tell me it had found a massive bug on the code *I* had written. ... wait, what? so... for garbage collection purposes, I stored a bit on lambda term pointers that meant "the variable bound by this lambda has been freed, so, its lambda must free whatever argument it is applied to". that's fine. yet, on duplicator nodes, I also used the same bit to mean "one of the duplicated variables was freed, so, treat this dup as a passthrough no-op". so, if a lambda entered a duplicator, it would mistake the lambda's collection bit for its own, resulting in corrupted interaction! that's a mouthful, why I'm writing this? just so you can appreciate the sheer absurdity of what just happened. I didn't ask it to find bugs. I asked it for an optimization. and even if I did ask it to find bugs, this bug is so astonishingly subtle and specific, identifying it takes mastering the domain to an extent that it beyond even me. I'd easily need hours or days to fix it, *if* I ever came across it. chances are it would just go unnoticed. and Fable found it and fixed it like it was nothing, while it was busy adding a 17x speedup to a file that neither I, nor Opus 4.8, nor a fleet of GPT 5.5 managed to barely make 2x faster. oh and there is also another tab where it is also ripping through Bend's codebase and finishing everything I had to do I don't know what to say anymore this isn't about Anthropic or OpenAI, this is about our collective future as a species. the world is changing, and we need to be aware of it, and discuss how to handle this change. receipt below . . .

VictorTaelin's tweet photo. this is my personal singularity moment

this post may sound like a paid ad. I only wish. I'm concerned, more so than happy. the world is changing, and, among the scenarios where AI goes terribly wrong, inequality is the most realistic, yet, the one Anthropic seems to be the least concerned about. I'm glad OpenAI is taking the opposite stance: *personal AGI for everyone*. I think this is a commendable position in the times we live. but who am I in the queue of the bread?

anyway, Fable is here, so I'll just report my first-hour experience

first of all, all my pet prompts are solved.
→ λ-calculus puzzles
→ bug questions
→ one-shot apps
all are trivial to it.

I don't have anything harder other than my
ongoing work

so, in the last several days, I've been toying with HVM5, a new interaction net evaluator with a faster loop.

after writing the first version, I left 32 GPT-5 agents working for ~20 hours each. this resulted in up to 2x speedups, but the file size increased by 2-fold and quality decreased significantly.

I then simplified the whole thing into an even simpler core, and left Opus 4.8 and GPT 5.5 optimizing it for 8 hours. Opus got a legit 6% - 34% speedup in most benches. GPT got better results, but, sadly, an unusable file.

I then asked Fable to optimize it.

2 hours later, it landed a 1770% speedup in one case, 100%+ in other 4, and 22% in average. yes, in 2 hours it outperformed me, opus 4.8 and a swarm of gpt 5.5 agents, by one order of magnitude.

that could not possibly be legit. "it must be hardcoding the benchmarks" (GPT trauma). so I read its explanation and what it did was, indeed, the most high impact optimization one could try first. seems like HVM5 was wasting a lot of time garbage-collecting unused branches of pattern-match nodes. I had optimized that for static mats, but not for dynamic mats. skill issue. Fable figured how to do it for these, resulting in a massive speedup in some benches

but wait, is that *correct*? I'm not sure yet, it is credible, but this is the kind of thing that is very easy to get wrong on interaction nets. the problem is, when I was ready to start auditing Fable's solution so I could tell whether it was buggy or legit, it interrupted me to tell me it had found a massive bug on the code *I* had written.

... wait, what?

so... for garbage collection purposes, I stored a bit on lambda term pointers that meant "the variable bound by this lambda has been freed, so, its lambda must free whatever argument it is applied to". that's fine. yet, on duplicator nodes, I also used the same bit to mean "one of the duplicated variables was freed, so, treat this dup as a passthrough no-op". so, if a lambda entered a duplicator, it would mistake the lambda's collection bit for its own, resulting in corrupted interaction!

that's a mouthful, why I'm writing this?

just so you can appreciate the sheer absurdity of what just happened. I didn't ask it to find bugs. I asked it for an optimization. and even if I did ask it to find bugs, this bug is so astonishingly subtle and specific, identifying it takes mastering the domain to an extent that it beyond even me. I'd easily need hours or days to fix it, *if* I ever came across it. chances are it would just go unnoticed. and Fable found it and fixed it like it was nothing, while it was busy adding a 17x speedup to a file that neither I, nor Opus 4.8, nor a fleet of GPT 5.5 managed to barely make 2x faster.

oh and there is also another tab where it is also ripping through Bend's codebase and finishing everything I had to do

I don't know what to say anymore

this isn't about Anthropic or OpenAI, this is about our collective future as a species. the world is changing, and we need to be aware of it, and discuss how to handle this change.

receipt below . . .

251

679

techwraith retweeted

Malte Ubl

@cramforce

4 days ago

Huge thanks to the @AnthropicAI team for investigating `just-bash` with Mythos as part of Project Glasswing There are no serious findings. I'll issue a release with minor hardening improvements after my vacation. This is an interesting case for 2 reasons: 1. `just-bash` is a very ripe surface (it's a full implementation of bash and common utilities, bundles QuickJS, CPython, has optional filesystem access) 2. `just-bash` itself was largely written by Opus 4.5 with minimal human review (but deep hardening loops and very paranoid machine-enforced coding rules) https://t.co/z4WRuMI6KB

291

110

128K

techwraith retweeted

5 days ago

Claude Fable 5 is now on AI Gateway. A Mythos-class model for your hardest unsolved problems. 𝚖𝚘𝚍𝚎𝚕: '𝚊𝚗𝚝𝚑𝚛𝚘𝚙𝚒𝚌/𝚌𝚕𝚊𝚞𝚍𝚎-𝚏𝚊𝚋𝚕𝚎-𝟻' https://t.co/oLvVc2UvaE

techwraith retweeted

AI SDK

@aisdk

5 days ago

AI SDK 7 canary Configure tool approvals for any tool directly on ToolLoopAgent, generateText, and streamText: - tool specific approval with constants - tool specific approval with functions - generic tool approval function with custom logic

aisdk's tweet photo. AI SDK 7 canary

Configure tool approvals for any tool directly on ToolLoopAgent, generateText, and streamText:

- tool specific approval with constants
- tool specific approval with functions
- generic tool approval function with custom logic https://t.co/pULHPMlgXw

techwraith retweeted

Vercel @vercel

5 days ago

https://t.co/hUowOdv4Ci

177

128

181K

5 days ago

@andrewqu @johnyeo_ It's loops all the way down.

5 days ago

@andrewqu This is the way

190

techwraith retweeted

6 days ago

Use the observability dashboard to track AI Gateway usage ▪︎ Graph cost and requests ▪︎ Group by model or project Add tags to AI SDK calls and search with the Custom Reporting API ▪︎ Ask your agent questions about the data ▪︎ Build a custom dashboard in v0

6 days ago

@thiago_peres @vercel_dev Yep- unless your key fails for some reason and we have to fall back to ours (which is great! That means that your app will survive even if your key or provider has an issue)

techwraith retweeted

Guillermo Rauch

@rauchg

6 days ago

Vercel AI Gateway recovers on average over 1T tokens a month 🤯 Much like Stripe recovers revenue with smart retries on failed payments or credit card updates. And we do it with 0️⃣ zero markup over the labs; adding redundancy, zero-data retention enforcement, observability, usage APIs, caps, … https://t.co/OougSipbBX

352

55K

techwraith retweeted

Esteban Suárez

@EstebanSuarez

8 days ago

Vercel Sandbox persistence is now GA, so I built a demo on top of it: 𝚞𝚙. Run 𝚞𝚙 in your project: • syncs your code into a Vercel Sandbox • detects the framework, installs deps, runs your dev server • serves it at a public URL you can share • stop it, run 𝚞𝚙 . again, and it resumes from the snapshot, files and deps intact try it: https://t.co/nKN3hIjLdl

12K

techwraith retweeted