Logosmonger @logosmonger - Twitter Profile

3 days ago

Opus 4.8 vs MiniMax M3 tested both on default settings with the same prompt > Opus one shotted everything in 7 minutes > M3 needed an extra prompt to fix the "break block" feature and took 20+ minutes both got super close, judge both and lemme know which one looks better?

67

384

15

96

66K

logosmonger retweeted

goodalexander

@goodalexander

3 days ago

172

9K

961

2K

342K

logosmonger retweeted

ℏεsam

@Hesamation

2 days ago

Mythos, regulate something. make no mistakes.

25

17K

603

450

548K

logosmonger retweeted

Lisan al Gaib

@scaling01

2 days ago

the permanent underclass is already here, but you just haven't noticed it here's what it looks like: - frontier labs keep their best models to themselves for 1-3 months to make sure it's safe - then they sell the tokens to the US government and trillion dollar companies - after that allied countries get access - and only then do the poors get access to it after half a year of waiting. meanwhile they are already on Mythos 2 that is exponentially better

78

3K

114

583

341K

Who to follow

King David

@NFTCollector

The #1 designer in web3 👀🔥 Founder @cryptochipsio Co-Founder @ascentrivals @genungames 🤜 @Larserthanlife

#NFT art collector and trader. The future is now. #nftart #cryptoart

logosmonger retweeted

NIK

@ns123abc

2 days ago

“Sir… OpenAI just LOST the race to go public…”

52

1K

39

51

65K

logosmonger retweeted

Lisan al Gaib

@scaling01

3 days ago

Nvidia announcing a 550B model wasn't on my bingo card They are now the strongest american open-source lab

79

2K

78

231

162K

logosmonger retweeted

Clemente

@Chilearmy123

2 days ago

I DEMAND A REFUND @saylor

347

4K

287

631

1M

logosmonger retweeted

CG

@cgtwts

3 days ago

Sam Altman right now:

42

2K

62

570

834K

logosmonger retweeted

Yuchen Jin

@Yuchenj_UW

2 days ago

OpenAI slept on coding, so Anthropic stole the crown. Anthropic didn’t secure enough GPUs/TPUs to turn that lead into a monopoly. Now Codex has caught up. Gemini will catch up too. It’s only a matter of time. AI coding is becoming a three-body problem.

228

2K

71

186

168K

logosmonger retweeted

terminally onλine εngineer

@tekbog

3 days ago

mythos find all the laws that google, meta and openai break so we can fine them, make no mistakes

73

11K

600

413

344K

logosmonger retweeted

ᐱ ᑎ ᑐ ᒋ ᕮ ᒍ

@Andr3jH

2 days ago

36

18K

1K

750

565K

logosmonger retweeted

Qwen

@Alibaba_Qwen

2 days ago

👏👏 Introducing Qwen3.7-Plus — a multimodal agent model that unifies vision and language into one versatile agent foundation. ✅ Multimodal interactive hybrid agent: unified GUI & CLI operation across visual and text tasks ✅ Versatile coding agent & productivity assistant with full-modality input ✅ Visual Agent: perception, reasoning, grounding, and search-augmented QA ✅ Cross-harness generalization across diverse agent frameworks One model. Sees, thinks, codes, acts.🙌🙌 Now available via API on Alibaba Cloud Model Studio. Try it — let us know what you build.😎 🔗🔗⬇️⬇️ Blog：https://t.co/pVYf0h3NNa Qwen Studio：https://t.co/HUYgFW4cYf API：https://t.co/viL0cXrMzW

Alibaba_Qwen's tweet photo. 👏👏 Introducing Qwen3.7-Plus — a multimodal agent model that unifies vision and language into one versatile agent foundation.

✅ Multimodal interactive hybrid agent: unified GUI & CLI operation across visual and text tasks
✅ Versatile coding agent & productivity assistant with full-modality input
✅ Visual Agent: perception, reasoning, grounding, and search-augmented QA
✅ Cross-harness generalization across diverse agent frameworks

One model. Sees, thinks, codes, acts.🙌🙌

Now available via API on Alibaba Cloud Model Studio. Try it — let us know what you build.😎

🔗🔗⬇️⬇️
Blog：https://t.co/pVYf0h3NNa
Qwen Studio：https://t.co/HUYgFW4cYf
API：https://t.co/viL0cXrMzW

247

4K

451

697

445K

logosmonger retweeted

Lisan al Gaib

@scaling01

2 days ago

Opus 4.8 just broke ARC-AGI-3 it tripled GPT-5.5's score we are now at a breathtaking 1.5% human efficiency

99

2K

76

265

173K

logosmonger retweeted

taoki

@justalexoki

4 days ago

218

19K

2K

3K

613K

logosmonger retweeted

ℏεsam

@Hesamation

3 days ago

Pewd did it again. now he open-sourced a self-hosted AI workspace. bro is building a CV harder than a CS undergrad looking for a job: > built a 10-GPU home rig > quantized giant LLM to run local > built ChatOS, local AI UI > added RAG/local memory > built “council” of AI models > built “swarm”, small models in parallel for data collection > fine-tuned a Qwen 32B-based coding model > donated compute from his GPU rig for protein folding research

Hesamation's tweet photo. Pewd did it again. now he open-sourced a self-hosted AI workspace. bro is building a CV harder than a CS undergrad looking for a job:
> built a 10-GPU home rig
> quantized giant LLM to run local
> built ChatOS, local AI UI
> added RAG/local memory
> built “council” of AI models
> built “swarm”, small models in parallel for data collection
> fine-tuned a Qwen 32B-based coding model
> donated compute from his GPU rig for protein folding research

117

6K

340

2K

352K

logosmonger retweeted

Crémieux

@cremieuxrecueil

4 days ago

raising a seed round in 2030 like

91

9K

391

891

672K

logosmonger retweeted

jinjingliang

@JinjingLiang

4 days ago

DeepSWE, the David Beckham of benchmarks 😂

23

2K

68

159

170K

logosmonger retweeted

MiniMax (official) @MiniMax_AI

3 days ago

Introducing MiniMax M3: The First Open-Weights Model to Combine Three Frontier Capabilities - Coding & Agentic Frontier: 59.0% SWE-Bench Pro, 66.0% Terminal Bench 2.1, 34.8% SWE-fficiency, 28.8% KernelBench Hard, 74.2% MCP Atlas - MiniMax Sparse Attention scales context to 1M - Natively Multimodal from Step Zero API: https://t.co/fHRdSV7BwZ Token Plan: https://t.co/BDCycxepZw 🚀New! MiniMax Code: https://t.co/GvB4YiB6Ul Weights & Tech Report in ~10 Days

MiniMax_AI's tweet photo. Introducing MiniMax M3: The First Open-Weights Model to Combine Three Frontier Capabilities

- Coding & Agentic Frontier: 59.0% SWE-Bench Pro, 66.0% Terminal Bench 2.1, 34.8% SWE-fficiency, 28.8% KernelBench Hard, 74.2% MCP Atlas
- MiniMax Sparse Attention scales context to 1M
- Natively Multimodal from Step Zero

API: https://t.co/fHRdSV7BwZ
Token Plan: https://t.co/BDCycxepZw
🚀New! MiniMax Code: https://t.co/GvB4YiB6Ul

Weights & Tech Report in ~10 Days

528

8K

1K

3K

3M

logosmonger retweeted

Lisan al Gaib

@scaling01

3 days ago

Claude 4.8 Opus smashes GPT-5.5 and is new SOTA on GBA Eval On GBA Eval models are used as coding agents to build a working Game Boy Advance emulator from scratch within 24 hours.

scaling01's tweet photo. Claude 4.8 Opus smashes GPT-5.5 and is new SOTA on GBA Eval

On GBA Eval models are used as coding agents to build a working Game Boy Advance emulator from scratch within 24 hours. https://t.co/ISpEwoDHTS

31

429

17

67

62K

logosmonger retweeted

Lisan al Gaib

@scaling01

3 days ago

MiniMax-M3 Benchmarks

28

476

28

58

134K

Logosmonger

@logosmonger

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users