Opus 4.8 vs MiniMax M3
tested both on default settings with the same prompt
> Opus one shotted everything in 7 minutes
> M3 needed an extra prompt to fix the "break block" feature and took 20+ minutes
both got super close, judge both and lemme know
which one looks better?
the permanent underclass is already here, but you just haven't noticed it
here's what it looks like:
- frontier labs keep their best models to themselves for 1-3 months to make sure it's safe
- then they sell the tokens to the US government and trillion dollar companies
- after that allied countries get access
- and only then do the poors get access to it after half a year of waiting. meanwhile they are already on Mythos 2 that is exponentially better
OpenAI slept on coding, so Anthropic stole the crown.
Anthropic didn’t secure enough GPUs/TPUs to turn that lead into a monopoly. Now Codex has caught up.
Gemini will catch up too. It’s only a matter of time.
AI coding is becoming a three-body problem.
👏👏 Introducing Qwen3.7-Plus — a multimodal agent model that unifies vision and language into one versatile agent foundation.
✅ Multimodal interactive hybrid agent: unified GUI & CLI operation across visual and text tasks
✅ Versatile coding agent & productivity assistant with full-modality input
✅ Visual Agent: perception, reasoning, grounding, and search-augmented QA
✅ Cross-harness generalization across diverse agent frameworks
One model. Sees, thinks, codes, acts.🙌🙌
Now available via API on Alibaba Cloud Model Studio. Try it — let us know what you build.😎
🔗🔗⬇️⬇️
Blog:https://t.co/pVYf0h3NNa
Qwen Studio:https://t.co/HUYgFW4cYf
API:https://t.co/viL0cXrMzW
Pewd did it again. now he open-sourced a self-hosted AI workspace. bro is building a CV harder than a CS undergrad looking for a job:
> built a 10-GPU home rig
> quantized giant LLM to run local
> built ChatOS, local AI UI
> added RAG/local memory
> built “council” of AI models
> built “swarm”, small models in parallel for data collection
> fine-tuned a Qwen 32B-based coding model
> donated compute from his GPU rig for protein folding research
Claude 4.8 Opus smashes GPT-5.5 and is new SOTA on GBA Eval
On GBA Eval models are used as coding agents to build a working Game Boy Advance emulator from scratch within 24 hours.