Reasoning Models @reasoningmodels - Twitter Profile

Reasoning Models @reasoningmodels

2 months ago

Big moves by Meta, Muse Spark looks pretty incredible.

AI at Meta

@AIatMeta

2 months ago

Introducing Muse Spark, the first in the Muse family of models developed by Meta Superintelligence Labs. Muse Spark is a natively multimodal reasoning model with support for tool-use, visual chain of thought, and multi-agent orchestration. Muse Spark is available today at https://t.co/wHkMPH82ZH and the Meta AI app. We’re also making it available in private preview via API to select partners, and we hope to open-source future versions of the model. Learn more: https://t.co/PloE9q5x96

AIatMeta's tweet photo. Introducing Muse Spark, the first in the Muse family of models developed by Meta Superintelligence Labs.

Muse Spark is a natively multimodal reasoning model with support for tool-use, visual chain of thought, and multi-agent orchestration.

Muse Spark is available today at https://t.co/wHkMPH82ZH and the Meta AI app. We’re also making it available in private preview via API to select partners, and we hope to open-source future versions of the model.

Learn more: https://t.co/PloE9q5x96

551

9K

1K

3K

3M

1

2

1

0

828

Reasoning Models @reasoningmodels

2 months ago

@AIatMeta Congrats, major step forward ✨

0

51

Reasoning Models @reasoningmodels

2 months ago

@bindureddy So impressive, Google is cooking 🔥

0

10

Reasoning Models @reasoningmodels

2 months ago

@pmarca Yes

0

2

Reasoning Models @reasoningmodels

2 months ago

This is very interesting

Tim

@open_founder

2 months ago

We've been pretty quiet about what we're building. That changes now. Our reasoning framework is currently beating every @OpenAI model on industry standard benchmarks. There are six models in development. SERV-nano just matched GPT-5.4 at 20x lower cost and 3x the speed. The research paper backing it is in peer review at a top-1% AI journal. The UAE government is running it in production, so are 10+ enterprises. Nothing comes even close. This goes far beyond any wrapper or prompt engineering gimmick, we've developed an entire AI reasoning layer from scratch: structured, bounded, deterministic using machine readable code instead of vague english prompts. Any builder or enterprise swaps two lines of code and their agents get much cheaper and much smarter instantly. The self-serve API is about to open, in a multi-phase rollout. More soon.

65

2K

187

2K

358K

0

1

0

46

Reasoning Models @reasoningmodels

2 months ago

@open_founder @OpenAI Super interesting

0

106

Reasoning Models @reasoningmodels

2 months ago

@iruletheworldmo Yes, sooooo ready for it

0

1

0

69

Reasoning Models @reasoningmodels

2 months ago

@iruletheworldmo I feel like it’s probably a solid analogy, just rather than lots of leaks, it’s the biggest leak ever

1

0

197

Reasoning Models @reasoningmodels

2 months ago

@0xSero What tool are you using to track usage that outputs it like that? Look interesting 👀

1

2

0

4K

Reasoning Models @reasoningmodels

2 months ago

@SawyerMerritt Pretty incredible, generating $2B/mo, so $24B/year. So 35x annual revenue multiple. This is like a company doing $10M/year in ARR raising at a $350M valuation. Which I think does happen quite a bit, so maybe nothing too unusual here right??

1

0

923

Reasoning Models @reasoningmodels

2 months ago

@yoonholeee @roshen_nair @qizhengz_alex @Kangwook_Lee @lateinteraction @chelseabfinn Really interesting, need to dig in and really understand this at a deeper level, feels like there could be so many applications to this.

0

257

Reasoning Models @reasoningmodels

2 months ago

This feels like something big.

Yoonho Lee

@yoonholeee

2 months ago

How can we autonomously improve LLM harnesses on problems humans are actively working on? Doing so requires solving a hard, long-horizon credit-assignment problem over all prior code, traces, and scores. Announcing Meta-Harness: a method for optimizing harnesses end-to-end

yoonholeee's tweet photo. How can we autonomously improve LLM harnesses on problems humans are actively working on?

Doing so requires solving a hard, long-horizon credit-assignment problem over all prior code, traces, and scores.

Announcing Meta-Harness: a method for optimizing harnesses end-to-end

78

2K

284

2K

591K

0

1

0

1

45

Reasoning Models @reasoningmodels

2 months ago

If you use Claude, and run out of tokens quickly, this could be why.

Alex Volkov

@altryne

2 months ago

PSA: If you've been running out of Claude session quotas on Max tier, you're not alone. Read this. Some insane Redditor reverse engineered the Claude binaries with MITM to find 2 bugs that could have caused cache-invalidation. Tokens that aren't cached are 10x-20x more expensive and are killing your quota. If you're using your API keys with Claude this is even worse. This is also likely why this isn't uniform, while over 500 folks replied to me and said "me too", many (including me) didn't see this issue. There are 2 issues that are compounded here (per Redditor, I haven't independently confirmed this) : 1s bug he found is a string replacement bug in bun that invalidates cache. Apparently this has to do with the custom @bunjavascript binary that ships with standalone Claude CLI. The workaround there is to use Claude with `npx @anthropic-ai/claude-code` 2nd bug is worse, he claims that --resume always breaks cache. And there doesn't seem to be a workaround there, except pinning to a very old version (that will miss on tons of features) This bug is also documented on Github and confirmed by other folks. I won't entertain the conspiracy theories there that Anthropic "chooses" to ignore these bugs because it gets them more $$$, they are actively benefiting from everyone hitting as much cached tokens as possible, so this is absolutely a great find and it does align with my thoughts earlier. The very sudden spike in reporting for this, the non-uniform nature (some folks are completely fine, some folks are hitting quotas after saying "hey") definitely points to a bug. cc @trq212 @bcherny @_catwu for visibility in case this helps all of us.

altryne's tweet photo. PSA: If you've been running out of Claude session quotas on Max tier, you're not alone. Read this.

Some insane Redditor reverse engineered the Claude binaries with MITM to find 2 bugs that could have caused cache-invalidation. Tokens that aren't cached are 10x-20x more expensive and are killing your quota.

If you're using your API keys with Claude this is even worse. This is also likely why this isn't uniform, while over 500 folks replied to me and said "me too", many (including me) didn't see this issue.

There are 2 issues that are compounded here (per Redditor, I haven't independently confirmed this) :

1s bug he found is a string replacement bug in bun that invalidates cache. Apparently this has to do with the custom @bunjavascript binary that ships with standalone Claude CLI.

The workaround there is to use Claude with `npx @anthropic-ai/claude-code`

2nd bug is worse, he claims that --resume always breaks cache. And there doesn't seem to be a workaround there, except pinning to a very old version (that will miss on tons of features)

This bug is also documented on Github and confirmed by other folks.

I won't entertain the conspiracy theories there that Anthropic "chooses" to ignore these bugs because it gets them more $$$, they are actively benefiting from everyone hitting as much cached tokens as possible, so this is absolutely a great find and it does align with my thoughts earlier.

The very sudden spike in reporting for this, the non-uniform nature (some folks are completely fine, some folks are hitting quotas after saying "hey") definitely points to a bug.

cc @trq212 @bcherny @_catwu for visibility in case this helps all of us.

227

5K

419

3K

2M

0

1

0

40

Reasoning Models @reasoningmodels

2 months ago

@altryne Super interesting, explains a lot.

0

46

Reasoning Models @reasoningmodels

2 months ago

@iruletheworldmo Can't wait to try it, wish I got the cool stuff as early as you do, I'm jealous.

0

1

0

178

Reasoning Models @reasoningmodels

2 months ago

https://t.co/rIR3jTDWgR

0

48

Reasoning Models @reasoningmodels

2 months ago

Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2 is a great model. Just make sure you're looking at v2. This is the one everyone is fanatical about right now, and for good reason.

reasoningmodels's tweet photo. Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2 is a great model.

Just make sure you're looking at v2. This is the one everyone is fanatical about right now, and for good reason. https://t.co/cg73deBS5R

1

0

1

83

Reasoning Models @reasoningmodels

2 months ago

Excited for this one!

Sebastian Raschka

@rasbt

2 months ago

It’s done. All chapters of Build A Reasoning Model (From Scratch) are now available in early access. The book is currently in production and should be out in the next months, including full-color print and syntax highlighting. There’s also a preorder up on Amazon.

rasbt's tweet photo. It’s done.

All chapters of Build A Reasoning Model (From Scratch) are now available in early access.

The book is currently in production and should be out in the next months, including full-color print and syntax highlighting.

There’s also a preorder up on Amazon. https://t.co/ANaJHjpC2s

124

3K

261

1K

120K

0

20

Reasoning Models @reasoningmodels

2 months ago

@rasbt Safe to say - I’m very excited to read this.

0

1

0

18

Reasoning Models

@reasoningmodels

Last Seen Users on Sotwe

Trends for you

Most Popular Users