We still don't grasp LLMs. That's evident in our choice of words to anthropomorphize what's really just a statistical model.
We don't understand what the existence of a large statistical language model implies. It does not imply intelligence, but it’s something easily confound-able with it.
The training corpus was human written, with thought and intelligence, so the output is a sort of mirror of humans, of how we write and reason to some extent. the mirror is confusing because people have never seen that kind of mirror before. Once you identify these models as mirrors or the text they are given, problems like "sycophancy" and "hallucinations", described in these anthropomorphized terms sound silly.
Once you identify these models as mirrors or the text they are given, problems like "sycophancy" and "hallucinations", described in these anthropomorphized terms sound silly.
Inference will never be free or even negligibly cheap. Everyone will get a token budget.
➤ Productivity gains from tokens are hard to attribute. ➤ Consumption is easy to scale. ➤ Waste is nearly invisible. You don't know where to cut.
That's all you need for a price floor.
Tokens will forever be a scarce resource you need to allocate consciously, much like old problems: headcount for managers, capital for investments, ad budget for marketers.
Tokens won’t be different. You need a performance-adjusted budget.
In a previous era, I often made a big deal about deleting unnecessary bits in a project repository. “What’s the big deal? It’s not harmful” — but it is, like clutter on a work table.
I’m glad there’s now a bigger imperative: it costs tokens and increases perplexity!
*UBER SETS $1,500 MONTHLY CAP ON SOME AI CODING TOOLS FOR STAFF
$UBER officially reeling in the Claude budget after blowing their AI budget earlier this year.
Undoubtedly more companies to follow
I can now probably say this:
Two months ago, inside Anthropic someone suggested building a token leaderboard.
A heated internal debate followed and the decision was made to *never* ever do it… because several people inside Anthropic simply thought ahead of the consequences
This what collapse looks like. Societal culture collapse.
Every thought, trend, what you assume is mutually assumed gets pushed and compressed toward the statistical model.
A worldwide echo chamber worthy of a black mirror episode.
Alpha comes from the ability to disconnect.
engineers are realizing that any serious work requires LLM use to work *slower*
orgs are still scrambling to answer “what are we getting back from all this AI spend?”
unproven but generally accepted answer is “faster” or “more”
but the direction to look is “higher quality”
Good post!
“Using AI to write better code more slowly”: https://t.co/H6gieYNfai
This is what I've been doing with my smalloc project. Wrote all the (core) code myself while asking AIs to teach me about stuff, and now I'm repeatedly asking AIs to “Find more bugs in this.”.
Inference will never be free or even negligibly cheap. Everyone will get a token budget.
➤ Productivity gains from tokens are hard to attribute. ➤ Consumption is easy to scale. ➤ Waste is nearly invisible. You don't know where to cut.
That's all you need for a price floor.
Tokens will forever be a scarce resource you need to allocate consciously, much like old problems: headcount for managers, capital for investments, ad budget for marketers.
Tokens won’t be different. You need a performance-adjusted budget.
Hang on to your ears. When even the most hyped innovators start to U-turn, you know a correction is looming. Maybe a welcome breeze for laggard adopters
Jiujitsu is very inclusive. No fancy credentials or rich parents needed. Friends made from all sorts of backgrounds.
6yrs ago I first heard locker-room crypto trade talk. Plumbers, dentists w/ strong opinions. Was a clear sign.
Today people were raving about Cerebras’s IPO.
👀
Vet any creator profile on X with an automated team of experts.
Check engagement quality, patterns, reach (ability to break through the algorithm), promotion style, and check for red flags,
Codex anywhere and everywhere, all the time.
Now your Mac doesn’t have to be unlocked for Codex to use your computer.
From your phone, Codex can securely use apps on your Mac, even when the screen is off and locked.
https://t.co/PCGK4i7FSF
It's tough out there for inference pipeline builders:
* Groq - vulture-gutted, tiny limits & fewer models
* Cerebras - cut down to 2 models post IPO
* Together - raising prices
* DeepInfra - still unreliable latency
Hope SambaNova & Fireworks stay off this list.
Thank you. The important part is zeroing out taxes on the bottom half. Best way to put money in someone’s pocket is to not take it out in the first place. Bottom half is only 3% of total tax revenue. But it’s very meaningful to that person. Zero it out.
Didn't like the answer? Just keep prompting.
Sad reality: sycophancy isn't a marketing play, it's the feature you want even if you say otherwise.
We may get less polarized but increasingly delusional.
@davicorn@Fastmail Thanks for sharing. What’s the lifetime on that cookie? Defeats the point of automation if every/most time I use it I end up having to open the browser and click around to extract credentials. Not to mention the reliance on private (i.e. unstable) APIs here.