David Buxton @davidreads - Twitter Profile

I agree that resolving a ticket decouples input cost and output value. I do not think that repackaging tokens as compute units does so. Obviously it does give you more discretion to slip margin in, but you’re fundamentally still using a cost yardstick, albeit one that’s more flexible in your favor.

Tomasz Tunguz

@ttunguz

4 days ago

Sierra charges when an agent resolves a ticket, zero for failures. Devin sells Agent Compute Units, not tokens — the same abstraction Databricks & Snowflake use with credits to decouple pricing from raw compute. Margin is decoupled from the inference line. Durable.

1

15

0

14

3K

0

15

David Buxton

@davidreads

4 days ago

@paulg source: https://t.co/UWxQHn38VH

0

22

David Buxton

@davidreads

4 days ago

@paulg It's amazing how human some of the old attempts to pass the Turing test are way better than the best LLMs in terms of non-boringness (even if they can get a little weird)

davidreads's tweet photo. @paulg It's amazing how human some of the old attempts to pass the Turing test are way better than the best LLMs in terms of non-boringness (even if they can get a little weird) https://t.co/HUD8rx2VnL

1

4

0

574

David Buxton

@davidreads

4 days ago

I’ve been wondering if we need a new notation for AI. Big-O describes how computation scales. But what we’re seeing now is something different: how much agency we’re willing to hand over. Call it Big-A. A(1): “Write this function.” A(n): “Execute this workflow.” A(n²): “Keep working until the tests pass.” A(n³): “Debate another model until you’re both happy with the result.” A(n⁴): “Build this product.” A(∞?): “Achieve this outcome.” The point isn’t that these are mathematically correct. The point is that every time AI gets cheaper, we don’t seem to do the same work for less money. We move up a level of abstraction and ask for something bigger. A prompt becomes a workflow. A workflow becomes an agent. An agent becomes a team. A team becomes a project. A project becomes an outcome. Which makes me wonder whether Jevons Paradox has any natural limit in AI. Do we eventually run out of higher-order loops to automate? Or does Big-A just keep increasing forever?

0

22

David Buxton

@davidreads

4 days ago

@kakashiii111 free credits are a sugar high. they delay the moment you have to ask why the spend is what it is, they don't answer it. the teams that actually flatten the bill aren't the ones with the biggest credit line, they're the ones who can see and cap usage.

0

2

David Buxton

@davidreads

4 days ago

@dexhorthy "just build more loops" is the same instinct that shows up in the finance review as a 5-figure inference bill nobody can explain. the loop that doesn't read its own output burns tokens and trust at the same rate.

0

12

David Buxton

@davidreads

4 days ago

@kimmonismus the tell is they're building an "AI Gateway to track spend and impose token budgets" in-house. every big co is quietly arriving at the same place: the cost problem was never the model price, it's that nobody could see or cap usage. a control problem wearing a procurement costume.

0

13

David Buxton

@davidreads

4 days ago

@mattpocockuk For things like classification, summarization I don’t think it’s impossible for these to work but I also haven’t seen good results

0

1K

David Buxton

@davidreads

4 days ago

@bscholl I love this framing. Imagine the user stories: “As an enemy fighter pilot, I want to get shot down, because…”

0

3

David Buxton

@davidreads

4 days ago

@emollick Yes, the value is what you do along the way rather than the end result. Whereas in coding if you just magically write the answer and it’s right, who cares what you did to get there

0

86

David Buxton

@davidreads

4 days ago

A simulation environment is hard to build but game changing when it comes to ability to iterate If you don’t have one, you’ll be constantly holding customers’ hands while they press the “on” button on agentic stuff Curious, is anyone building a comprehensive harness for this as a standalone product?

1

0

920

David Buxton

@davidreads

4 days ago

@arshamg_ Dm me and I can show you how this is solved by https://t.co/11n97AJX2a

0

5

David Buxton

@davidreads

5 days ago

@scottastevenson Do-maxxing is a suboptimal choice when everything that can be done soon _will_ be done. The same thing done tomorrow will be half the price of doing it today. It is the era of wait-maxxing

1

0

207

David Buxton

@davidreads

5 days ago

@Suhail Just X

0

6

David Buxton

@davidreads

5 days ago

@paulg Do you know anyone doing the smart compiler? Feels like it might just be tractable with Fable+ quality coding agents

0

3

0

1K

David Buxton

@davidreads

5 days ago

@petergyang You are being massively subsidized. Long may it last. But at some point there will be a rug pull

1

0

667

David Buxton

@davidreads

5 days ago

@theo This is very task dependent. There are a lot of simple tasks - especially non-programming - that GLM will burn similar numbers of tokens on and hence be way cheaper.

0

4

0

2K

David Buxton

@davidreads

Last Seen Users on Sotwe

Trends for you

Most Popular Users