Igor Soarez

Verified account

@igorsoarez

Dad x2. Engineer building on my own terms, or failing at it, mostly. Consulting by day, indie projects by night. AI agent infra.

London, England

Joined March 2009

267 Following

717 Followers

3K Posts

1 day ago

We still don't grasp LLMs. That's evident in our choice of words to anthropomorphize what's really just a statistical model. We don't understand what the existence of a large statistical language model implies. It does not imply intelligence, but it’s something easily confound-able with it. The training corpus was human written, with thought and intelligence, so the output is a sort of mirror of humans, of how we write and reason to some extent. the mirror is confusing because people have never seen that kind of mirror before. Once you identify these models as mirrors or the text they are given, problems like "sycophancy" and "hallucinations", described in these anthropomorphized terms sound silly. Once you identify these models as mirrors or the text they are given, problems like "sycophancy" and "hallucinations", described in these anthropomorphized terms sound silly.

igorsoarez's tweet photo. We still don't grasp LLMs. That's evident in our choice of words to anthropomorphize what's really just a statistical model.

We don't understand what the existence of a large statistical language model implies. It does not imply intelligence, but it’s something easily confound-able with it.

The training corpus was human written, with thought and intelligence, so the output is a sort of mirror of humans, of how we write and reason to some extent. the mirror is confusing because people have never seen that kind of mirror before. Once you identify these models as mirrors or the text they are given, problems like "sycophancy" and "hallucinations", described in these anthropomorphized terms sound silly.

Once you identify these models as mirrors or the text they are given, problems like "sycophancy" and "hallucinations", described in these anthropomorphized terms sound silly.

0

0

0

0

28

2 days ago

“never expected” 😆 🙉

igorsoarez's tweet photo. “never expected” 😆 🙉 https://t.co/jqIBji5ufX

12 days ago

Inference will never be free or even negligibly cheap. Everyone will get a token budget. ➤ Productivity gains from tokens are hard to attribute. ➤ Consumption is easy to scale. ➤ Waste is nearly invisible. You don't know where to cut. That's all you need for a price floor. Tokens will forever be a scarce resource you need to allocate consciously, much like old problems: headcount for managers, capital for investments, ad budget for marketers. Tokens won’t be different. You need a performance-adjusted budget.

0

0

0

0

90

0

0

0

0

40

2 days ago

In a previous era, I often made a big deal about deleting unnecessary bits in a project repository. “What’s the big deal? It’s not harmful” — but it is, like clutter on a work table. I’m glad there’s now a bigger imperative: it costs tokens and increases perplexity!

0

1

0

0

37

2 days ago

This won’t last. They’ll need adjustable, per division and per employee budgets

Negligible Capital

@negligible_cap

3 days ago

*UBER SETS $1,500 MONTHLY CAP ON SOME AI CODING TOOLS FOR STAFF $UBER officially reeling in the Claude budget after blowing their AI budget earlier this year. Undoubtedly more companies to follow

88

1K

57

206

3M

0

0

0

0

87

Who to follow

Verified account

👾 Software Engineer @GraphyHQ 🚀 Curious. My opinions are my own.

𝖑𝖚𝖐𝖊 𝖇𝖔𝖓𝖉

Computer programmer. Work at https://t.co/Kf1ZX14PaI. Golang, Rust and Kubernetes. You can find me on the other place 🦋

Verified account

reaching light through the struggle | https://t.co/Zj8k5VSSwS

8 days ago

dealers know better than getting high on their own supply

8 days ago

I can now probably say this: Two months ago, inside Anthropic someone suggested building a token leaderboard. A heated internal debate followed and the decision was made to *never* ever do it… because several people inside Anthropic simply thought ahead of the consequences

171

8K

306

1K

1M

0

0

0

0

68

8 days ago

@mitchellh How’s this different from a ceo or director or anyone who hires or employs subject matter experts outside their own domain?

0

0

0

0

167

9 days ago

This what collapse looks like. Societal culture collapse. Every thought, trend, what you assume is mutually assumed gets pushed and compressed toward the statistical model. A worldwide echo chamber worthy of a black mirror episode. Alpha comes from the ability to disconnect.

Armin Ronacher ⇌

10 days ago

This is such a good post. https://t.co/IdmAnh18Nt

mitsuhiko's tweet photo. This is such a good post. https://t.co/IdmAnh18Nt https://t.co/kGVBOwRneQ

82

3K

432

801

100K

0

1

0

0

118

10 days ago

engineers are realizing that any serious work requires LLM use to work *slower* orgs are still scrambling to answer “what are we getting back from all this AI spend?” unproven but generally accepted answer is “faster” or “more” but the direction to look is “higher quality”

zooko🛡🦓🦓🦓 ⓩ

11 days ago

Good post! “Using AI to write better code more slowly”: https://t.co/H6gieYNfai This is what I've been doing with my smalloc project. Wrote all the (core) code myself while asking AIs to teach me about stuff, and now I'm repeatedly asking AIs to “Find more bugs in this.”.

zooko's tweet photo. Good post!

“Using AI to write better code more slowly”: https://t.co/H6gieYNfai

This is what I've been doing with my smalloc project. Wrote all the (core) code myself while asking AIs to teach me about stuff, and now I'm repeatedly asking AIs to “Find more bugs in this.”. https://t.co/61SvlI76X3

2

35

1

16

3K

0

1

0

0

85

12 days ago

Inference will never be free or even negligibly cheap. Everyone will get a token budget. ➤ Productivity gains from tokens are hard to attribute. ➤ Consumption is easy to scale. ➤ Waste is nearly invisible. You don't know where to cut. That's all you need for a price floor. Tokens will forever be a scarce resource you need to allocate consciously, much like old problems: headcount for managers, capital for investments, ad budget for marketers. Tokens won’t be different. You need a performance-adjusted budget.

0

0

0

0

90

12 days ago

@yacineMTB It looks like the way it works you just need to make a lot more to avoid having to pay them.

0

0

0

0

76

12 days ago

Hang on to your ears. When even the most hyped innovators start to U-turn, you know a correction is looming. Maybe a welcome breeze for laggard adopters

george hotz archive @geohotarchive

12 days ago

The Eternal Sloptember https://t.co/kFIW7LNhNd

78

1K

191

942

572K

0

1

0

0

97

12 days ago

Jiujitsu is very inclusive. No fancy credentials or rich parents needed. Friends made from all sorts of backgrounds. 6yrs ago I first heard locker-room crypto trade talk. Plumbers, dentists w/ strong opinions. Was a clear sign. Today people were raving about Cerebras’s IPO.

0

3

0

0

152

13 days ago

👀 Vet any creator profile on X with an automated team of experts. Check engagement quality, patterns, reach (ability to break through the algorithm), promotion style, and check for red flags,

0

2

1

1

88

13 days ago

The ultimate botnet. Built in the open, installed voluntarily, often by paying customers.

OpenAI Developers

15 days ago

Codex anywhere and everywhere, all the time. Now your Mac doesn’t have to be unlocked for Codex to use your computer. From your phone, Codex can securely use apps on your Mac, even when the screen is off and locked. https://t.co/PCGK4i7FSF

OpenAIDevs's tweet photo. Codex anywhere and everywhere, all the time.

Now your Mac doesn’t have to be unlocked for Codex to use your computer.

From your phone, Codex can securely use apps on your Mac, even when the screen is off and locked.

https://t.co/PCGK4i7FSF https://t.co/956aAtM3vl

500

8K

556

3K

3M

0

1

0

0

115

13 days ago

It's tough out there for inference pipeline builders: * Groq - vulture-gutted, tiny limits & fewer models * Cerebras - cut down to 2 models post IPO * Together - raising prices * DeepInfra - still unreliable latency Hope SambaNova & Fireworks stay off this list.

0

3

0

0

164

15 days ago

@cpinto @rodriscoll If everything ticks along - the retail investors, or even taxpayers

0

0

0

0

13

16 days ago

The writing’s bright on the wall.

16 days ago

Thank you. The important part is zeroing out taxes on the bottom half. Best way to put money in someone’s pocket is to not take it out in the first place. Bottom half is only 3% of total tax revenue. But it’s very meaningful to that person. Zero it out.

3K

54K

5K

4K

9M

0

1

0

0

98

29 days ago

Didn't like the answer? Just keep prompting. Sad reality: sycophancy isn't a marketing play, it's the feature you want even if you say otherwise. We may get less polarized but increasingly delusional.

1

2

0

0

53

about 1 month ago

@davicorn @Fastmail Thanks for sharing. What’s the lifetime on that cookie? Defeats the point of automation if every/most time I use it I end up having to open the browser and click around to extract credentials. Not to mention the reliance on private (i.e. unstable) APIs here.

1

0

0

0

37

about 1 month ago

Dear @Fastmail , when can we get support for programmatic management of Sieve rules?

2

4

0

0

833

about 1 month ago

0

0

0

0

49

Last Seen Users on Sotwe

Trends for you

Most Popular Users