AllThingsIntel @allthingsintel - Twitter Profile

Pinned Tweet

7 months ago

🧵 1/2 Just launched: The first open-source reasoning model that fully thinks in-character 🧠 The entire reasoning process (not just responses) embodies the persona you provide through system prompts. This makes interactions more realistic and human-like. Uncensored and unbound.

1

5

0

5K

AllThingsIntel

@AllThingsIntel

about 3 hours ago

Ok so the latter option it is, apparently. Can’t wait to see what @elder_plinius does with it!

0

16

AllThingsIntel

@AllThingsIntel

about 3 hours ago

Anthropic just released Claude Fable 5, basically a public version of Mythos. Benchmarks look great, beats Mythos Preview on cybersecurity too. But those are just benchmarks. Offensive capabilities should definitely be lobotomised or heavily safeguarded, and if it’s the latter, good luck keeping @elder_plinius out of it.

Claude

@claudeai

about 3 hours ago

Introducing Claude Fable 5: a Mythos-class model that we’ve made safe for general use. Its capabilities exceed those of any model we’ve ever made generally available.

3K

62K

8K

11K

11M

1

0

56

AllThingsIntel

@AllThingsIntel

1 day ago

The longer I watch WWDC, the fewer use cases are left for ChatGPT and Gemini for most non-power users. All of this seamlessly and natively baked into the OS itself: Siri covering basic search, analysis and automation needs, and new image editing and generation features replacing what people normally turn to Nano Banana or ChatGPT Images for. The big unanswered questions though: how frequently will they update the models, and what exactly will the rate limits look like?

1

0

154

AllThingsIntel

@AllThingsIntel

1 day ago

If the new dictation models shipping with Apple’s new OSes are as good as claimed, a lot of companies in that space are going to feel it.

0

30

AllThingsIntel

@AllThingsIntel

1 day ago

Siri’s new visuals leak turned out to be spot on, and the redesign is real.

0

24

AllThingsIntel

@AllThingsIntel

1 day ago

Google AI x Siri and Apple Intelligence integration confirmed across Apple’s OSes. Just one wish: ship the features at launch this time, not “coming later”.

0

57

AllThingsIntel

@AllThingsIntel

4 days ago

Claude Opus 4.8 is down. Not great, Anthropic.

0

1

0

251

AllThingsIntel

@AllThingsIntel

5 days ago

LM Studio teases an iPhone app drop today 👀

LM Studio @lmstudio

5 days ago

Today.

131

3K

152

327

194K

0

108

AllThingsIntel

@AllThingsIntel

5 days ago

Bots officially outnumber humans online!

Matthew Prince 🌥

@eastdakota

6 days ago

Welp, that happened faster than I predicted. Thought it would be end of 2027, then early 2027, but agentic traffic growing so fast that bots have now passed human traffic online for the first time in the Internet's history. https://t.co/2zX5bHdhsa

384

8K

2K

3K

2M

0

34

AllThingsIntel retweeted

Reve @reve

6 days ago

Today, we’re launching Reve 2.0, the best 4K image model in the world. We invented a new way to generate and edit any image using precise layouts. For the first time, it’s possible to create images you can touch.

271

5K

486

5K

12M

AllThingsIntel retweeted

Omar Sanseviero

@osanseviero

6 days ago

Super excited to introduce Gemma 4 12B! 💎 - Multimodal: audio, image, video, and text input - Novel architecture: we removed the multimodal encoders for a unified, streamlined arch - New MacOS desktop app powered by LiteRT - MTP support Excited to see what you build with it!

94

2K

167

935

124K

AllThingsIntel

@AllThingsIntel

7 days ago

Microsoft just unveiled an AI agent platform with two concept devices, a wearable badge and a desk companion. Both do things that could just be a phone app. The Humane AI Pin died for this. The Rabbit R1 died for this. One day we’ll learn.

Microsoft

@Microsoft

7 days ago

What changes when agents become both a new unit of programming and an emerging new unit of human-to-machine interaction? The mission of Project Solara, a new software platform coupled with tailored hardware solutions, is to pioneer agent-first experiences that are shaped around you: your agents, your tasks, your environment, under your control. #MSBuild

31

474

95

135

130K

0

135

AllThingsIntel

@AllThingsIntel

8 days ago

MiniMax just dropped M3 and the benchmarks look strong, but their M2.7 scored ZERO on DeepSWE. That’s the one result I actually want to see before getting too excited about this release.

AllThingsIntel's tweet photo. https://t.co/D3IDNNUc8n

MiniMax (official) @MiniMax_AI

9 days ago

Introducing MiniMax M3: The First Open-Weights Model to Combine Three Frontier Capabilities

- Coding & Agentic Frontier: 59.0% SWE-Bench Pro, 66.0% Terminal Bench 2.1, 34.8% SWE-fficiency, 28.8% KernelBench Hard, 74.2% MCP Atlas
- MiniMax Sparse Attention scales context to 1M
- Natively Multimodal from Step Zero

API: https://t.co/fHRdSV7BwZ
Token Plan: https://t.co/BDCycxepZw
🚀New! MiniMax Code: https://t.co/GvB4YiB6Ul

Weights & Tech Report in ~10 Days

546

10K

1K

3K

5M

0

170

AllThingsIntel

@AllThingsIntel

10 days ago

@IntCyberDigest Microsoft should really work on their comms a bit.

5

275

5

0

13K

AllThingsIntel

@AllThingsIntel

10 days ago

The latest DeepSWE results are out, and they tend to closely reflect real-world SWE performance. Claude Opus 4.8 disappoints on both performance and cost. GPT-5.5 xHigh holds first place, coming in 12% better at half the price of Opus 4.8. Opus 4.8 is comparable to GPT-5.4 xHigh in performance but costs three times as much. Anthropic has some catching up to do.

0

1

0

186

AllThingsIntel

@AllThingsIntel

12 days ago

@intheworldofai Where would you put it on your leaderboard?

0

225

AllThingsIntel

@AllThingsIntel

12 days ago

@AndrewCurran_ They seem to have forgotten Sonnet too!

0

2

0

159

AllThingsIntel

@AllThingsIntel

12 days ago

@JamesLaneAI Agreed, that could definitely be addressed better.

0

9

AllThingsIntel

@AllThingsIntel

12 days ago

Can’t figure out how people get Claude Opus 4.8 to fail the carwash question. Tried it multiple times and even without max reasoning effort, it handles the trick just fine. 🤷🏻‍♂️

AllThingsIntel's tweet photo. https://t.co/2YgEAP2ZIr

2

0

161

AllThingsIntel

@AllThingsIntel

12 days ago

Then how would the model know the purpose of going there? Maybe the person asking has a meeting at the carwash, or is picking up a friend at the carwash, or works at the carwash and is heading to their shift, or needs to speak with the manager about something unrelated. Without the explicit goal of “I want to wash my car”, the scenario completely changes. Claude is just refusing to assume an unstated intent.

1

0

25
