Joseph Sirosh

@josephsirosh

Founder, I like to share thoughts about Authentic Generative Interaction, #GenAI and platforms for the future.

Bellevue, Washington

Joined May 2014

739 Following

8.6K Followers

2.9K Posts

josephsirosh retweeted

Techmeme

@Techmeme

2 days ago

Databricks says annualized revenue rose 80%+ YoY to $6.9B, up from $5.4B in Q4; CEO Ali Ghodsi says higher AI agent usage is increasing costs, lowering margins (@jordannovet / CNBC) (Visit Techmeme dot com for the link and full context!)

37K

Joseph Sirosh

@josephsirosh

1 day ago

Tokens Per Task is the other thing to consider as agentic harnesses co-evolve with models. @GavinSBaker Value = tasks (or task outcomes). Delivering outcomes with the fewest generated tokens in the agentic loop will be the bigger picture optimization. Costs from GDPVal-AA benchmark runs, not just in the reference harness but also in custom harnesses, might be very interesting to track.

Gavin Baker

@GavinSBaker

1 day ago

"If token throughput per watt rises faster than price per token falls, revenue per gigawatt can net expand. This is compounded by models getting smarter, which unlocks higher value tasks for end users [justifying increased costs]" Well said @ShanuMathew93 and @downingARK

776

336

70K

128
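
The arithmetic behind this exchange can be sketched in a few lines. This is a minimal illustration, not anyone's actual model: the function names, the hourly time base, and every number below are assumptions chosen only to show the direction of the effect. Revenue per gigawatt is treated as throughput per watt times price per token, and cost per task as tokens per task times price per token.

```python
# Illustrative sketch only: all figures are made-up assumptions, not data
# from the posts above.

WATTS_PER_GIGAWATT = 1e9
SECONDS_PER_HOUR = 3600


def revenue_per_gw_hour(tokens_per_sec_per_watt: float,
                        usd_per_million_tokens: float) -> float:
    """Revenue (USD) earned per hour by 1 GW of capacity."""
    tokens_per_hour = tokens_per_sec_per_watt * WATTS_PER_GIGAWATT * SECONDS_PER_HOUR
    return tokens_per_hour / 1e6 * usd_per_million_tokens


def cost_per_task(tokens_per_task: float, usd_per_million_tokens: float) -> float:
    """Token cost (USD) of completing one task in an agentic loop."""
    return tokens_per_task / 1e6 * usd_per_million_tokens


# Hypothetical scenario: throughput per watt triples while price halves.
before = revenue_per_gw_hour(tokens_per_sec_per_watt=1.0, usd_per_million_tokens=2.0)
after = revenue_per_gw_hour(tokens_per_sec_per_watt=3.0, usd_per_million_tokens=1.0)
revenue_ratio = after / before  # > 1 means revenue per gigawatt expanded

# Hypothetical harness comparison at the same price: fewer tokens per task
# lowers the cost of delivering the same outcome.
verbose_harness = cost_per_task(tokens_per_task=200_000, usd_per_million_tokens=1.0)
lean_harness = cost_per_task(tokens_per_task=50_000, usd_per_million_tokens=1.0)

print(f"revenue per GW ratio: {revenue_ratio:.2f}")
print(f"cost per task: verbose ${verbose_harness:.2f}, lean ${lean_harness:.2f}")
```

Under these assumed numbers, revenue per gigawatt grows because the throughput gain (3x) outpaces the price decline (2x), while the leaner harness delivers the same task at a quarter of the token cost.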

Joseph Sirosh

@josephsirosh

2 days ago

"HBM is not where data sits. It is where AI thinks."

Trade Whisperer

@TradexWhisperer

2 days ago

$MU $DRAM Where analysts & market got it wrong. They model HBM like it has been around for decades. HBM is a new architecture built to feed AI models context at speed. It did not exist 3 years ago in any meaningful revenue. It is not where data sits. It is where AI thinks.

274

29K

153

josephsirosh retweeted

Matei Zaharia @matei_zaharia

5 days ago

Really excited to open source a new project: Omnigent, a meta-harness for AI agents. It lets you build multi-agent coding and custom agents, sitting above Claude Code, Codex, Pi, and agent SDKs to let you compose them. It also adds live collaboration and rich control policies.

https://t.co/jwFmH8nHsZ

200

984

198K

Joseph Sirosh

@josephsirosh

4 days ago

Token capital may become real firm capital, but it will not automatically preserve human agency, broad value distribution, or institutional control. Those have to be designed, measured, audited, and governed. Once systems become more autonomous, faster, copyable, and coordinated, the likely emergent phenomenon is group agents that exceed human oversight capacity. The “learning loop” can become an automated bureaucracy moving far faster than its managers.

348

josephsirosh retweeted

Dan McAteer

@daniel_mac8

4 days ago

Okay, this is seriously cool. A team from @GoogleDeepMind, including DeepMind Cofounder Shane Legg, published a paper "From AGI to ASI". In the paper, they include instructions for an AI agent to read along with you. You can open the paper in Codex's in-app browser and have GPT-5.5 read it with you and explain all the concepts. This is the future. AI agents will be part of the target audience, and help us to understand anything we want.

444

555

53K

josephsirosh retweeted

Chamath Palihapitiya

@chamath

5 days ago

Game theory from here is super interesting: Original Mags (Google, Amazon, Microsoft, Meta) now have a serious non-zero opportunity to tank the frontier labs. Go to the government, kneecap the labs’ motion of putting the latest models out in the wild, become the trusted gatekeeper between the labs and the public at large (including internationally) by having the labs go through their clouds (AWS, GCP, Azure) and implement strict KYC to seal the deal. The frontier labs should have seen this coming years ago and implemented a robust KYC for just this moment. The fact they didn’t is kind of concerning. Why did they not do it? Best guess is because it would have changed the run-rate revenues (downward) which would have then changed funding dynamics - lower valuations, more dilution, less secondary. A valuation reset may happen now anyways, except the labs may end up with less control and more restrictions at the end of it. At the same time, everyone is already clamoring about token prices of the old models from the labs anyways… This couldn’t be a better setup for open source and neoclouds. Big question is can they meet the moment? There are too few of them and their progress seems sporadic at best.

292

259

josephsirosh retweeted

TiE Seattle @SeattleTiE

7 days ago

A Heartfelt Thanks to Our TYE Global Championship 2026 Sponsors As we welcome the world's brightest young entrepreneurs to Seattle for the TYE Global Championship on June 13, we extend our sincere gratitude to the incredible sponsors and supporters who make this event possible.

https://t.co/2bYhPP7pZi

josephsirosh retweeted

Zephyr

@zephyr_z9

9 days ago

"moving from giving AI tasks to giving it responsibilities."

200

37K

Joseph Sirosh

@josephsirosh

9 days ago

@GavinSBaker @polynoamial And test time compute resides mostly in the top of the memory hierarchy (SRAM, HBM, LPDDR). The balance of logic and memory in a rack will inexorably shift towards memory bandwidth and capacity.

367

josephsirosh retweeted

Gavin Baker

@GavinSBaker

9 days ago

Super important post from @polynoamial and the investor TLDR is: all current estimates for compute demand might be low. “We likely don't know what the capability ceiling is for modern LLMs because it's too expensive to measure. Frequently when I discuss this, people ask why we don't just evaluate with a harness that pushes test-time compute until performance plateaus. The problem is that, empirically, the plateau is very far out. Sometimes we may not observe a plateau at all within practical budgets. Notice that for the stronger models the performance improvement over time is stronger. It seems likely that as models become stronger they become more effective at operating over longer horizons. The point of plateau is pushed out, and may even disappear.” If test-time compute performance improvement over time *effectively* scales at some ratio with training…

717

424

102K

Joseph Sirosh

@josephsirosh

9 days ago

@RHouseResearch As support for the $SpaceX IPO. They're one of the early investors and will reap a huge windfall return.

522

josephsirosh retweeted

Noam Brown

@polynoamial

10 days ago

https://t.co/oWqzT12RtZ

408

963K

Joseph Sirosh

@josephsirosh

10 days ago

@firstadopter TPU 8i with Boardfly / inference topology is for MoE - it's probably just ramping at scale.

386

josephsirosh retweeted

Peter Steinberger 🦞

@steipete

11 days ago

Here’s your monthly reminder that you shouldn’t be prompting coding agents anymore. You should be designing loops that prompt your agents.

20K

14K

Joseph Sirosh

@josephsirosh

12 days ago

@jukan05 @diyas_1989 HBM is all you need.

Joseph Sirosh

@josephsirosh

12 days ago

Yes - And I have a (simplistic) thesis that at the end of the day all LLMs need is High Bandwidth Memory - and the logic coupled to the HBM is secondary - you can optimize the code that runs on the logic to make best use of what you get. So Anthropic is going for every accelerator they can get - including TPU, Trainium, Maia, and probably pricing in a way that aligns with the token throughput they can get out of each.

josephsirosh retweeted

Paul Graham

@paulg

about 7 years ago

Universities are backing themselves into a dangerous corner by becoming more expensive at the same time they're becoming less necessary.

226

13K

251

josephsirosh retweeted

TrendForce

@trendforce

20 days ago

👀 Agentic AI is driving memory demand. As inference workloads surge, how will next-gen AI servers tackle rising compute costs and capacity limits? 💡#TrendForce has raised its global memory market outlook: https://t.co/SJQ2upkDIs 🔗
