Best models for your hardware this week.
8-12GB
- https://t.co/5SYi6D56FR incredible model, so fast, so small
16-32GB
- latest Google model, Gemma 12B: https://t.co/TLm2x2l3lk really solid performance up neck and neck with a model 2x its size from a month ago.
Jetbrains new model, best in class on livecode bench
32-96gb
- Nex-N2-Mini GPT style postrain of Qwen-35B it seems to be its class leader caveman style reasoning https://t.co/EL1ePzwI58
- Jackrong’s Qwopus is the #1 overall Q4 of Qwen3.6-27B on our benchmark suite of 5 agent + coding benchmarks (1200 samples total) https://t.co/P1gypZwufi
192gb
- Step-3.7-Flash is hard to beat, high scores, really fast inference, vision capable, later cutoff dates https://t.co/oaVf5wMILx
384gb
- Nex-N2-Pro GPT style post train of Qwen-3.5-397B incredibly strong and #1 on deepswe if their claims are right https://t.co/LsGXZRl6nh
768gb
- very promising post-train of GLM-5.1 that wins out on 8 benchmarks https://t.co/25KElLHEos
If you have an Anthropic Plan, stop what you're doing right now
Go claim your free credits on your usage page. (Link in comments)
They are gifting users the same value their plan costs, which is nice.
They have finally clarified: no third party harnesses allowed on their plans.
You can still use them, but at discounted API pricing.
Glad they finally clarified this, even if I don't like it.
New in Claude Code: auto mode.
Instead of approving every file write and bash command, or skipping permissions entirely, auto mode lets Claude make permission decisions on your behalf.
Safeguards check each action before it runs.
LiteLLM HAS BEEN COMPROMISED, DO NOT UPDATE. We just discovered that LiteLLM pypi release 1.82.8. It has been compromised, it contains litellm_init.pth with base64 encoded instructions to send all the credentials it can find to remote server + self-replicate. link below
https://t.co/1vOwERU4d8 Everyone in the replies is scared Claude will spy on them.
Meanwhile I've been giving Claude full access to my codebase, my git history, my business docs, and my decision-making process for months.
Not because I'm reckless. Because I run a real business and Claude does in one afternoon what used to take me a week.
The privacy concern isn't wrong. But the real question nobody's asking:
What's the cost of NOT letting AI help you?
For me it was 2 employees I couldn't afford anymore and a business that was dying slowly.
Computer use is just the mouse and keyboard. I already gave it the brain.
Today, we’re releasing a feature that allows Claude to control your computer: Mouse, keyboard, and screen, giving it the ability to use any app.
I believe this is especially useful if used with Dispatch, which allows you to remotely control Claude on your computer while you’re away.
What I learned building all this as a non-developer:
— Start with the problem you have today, not the platform you dream about
— Build complete, stabilize, monetize, move on. Don't iterate forever.
— If your AI can't explain what it doesn't know, it's not ready
— Copy-paste between LLMs is the most underrated integration pattern
— Every document should be readable by someone with zero context
The tools don't matter. The systems thinking does.
If you sell things, manage people, and run operations — you already have the hardest skill. The coding part is the easy part now.
I'm not a developer. I'm a business operator who sells stuff and manages people.
In the last 7 days I built:
— a persona-builder that creates AI personas
— a traceability system that tracks files across projects
— a grounded dialog protocol for multi-agent conversations
— a 55-item security audit framework
All with Claude Code + a plugin ecosystem I built myself over 3 months.
No React knowledge before this. No Python. No CS degree.
The gap isn't "can AI code." The gap is: most developers still think in sprints.
Operators think in systems.
EurekaClaw automates research papers. My stuff automates business decisions.
Same energy. Different arena.
1/n 🦞 Introducing EurekaClaw💡 — a local-first AI research agent that captures your Eureka moments before they vanish.
From idea → proof → experiment → paper — fully automated.
Local-first. Zero data leak. 🔒
Try it: https://t.co/ZJlEYvkX9h
The boring stuff that actually matters:
Every file I move between projects leaves a trace. A simple .md that says: what left, where it went, why, one-line summary.
Any AI reading that folder with zero context instantly knows what happened.
No databases. No tracking software. Just markdown files that humans and AI both read naturally.
The best systems are the ones you don't notice running.
Introducing the new @stitchbygoogle, Google’s vibe design platform that transforms natural language into high-fidelity designs in one seamless flow.
🎨Create with a smarter design agent: Describe a new business concept or app vision and see it take shape on an AI-native canvas.
⚡️ Iterate quickly: Stitch screens together into interactive prototypes and manage your brand with a portable design system.
🎤 Collaborate with voice: Use hands-free voice interactions to update layouts and explore new variations in real-time.
Try it now (Age 18+ only. Currently available in English and in countries where Gemini is supported.) → https://t.co/pmT9iHEpZa
I'm testing 3 client sites this week. If the daily SEO audits are solid and GEO tracking actually works — this becomes something I recommend to every client post-launch.
"How visible is your brand in AI search?" is a question most businesses can't answer today. That gap won't last.
I build websites for startups. Every project ends the same way:
"Site looks great. How do we get users?"
That question costs more than the site itself to answer. @askOkara threw a team of AI agents at it for $99/mo.
My actual take after digging in:
Today we're introducing the world's first AI CMO.
Enter your website and it deploys a team of agents to help you get traffic and users.
Try it now at https://t.co/KbAE6FNgzE
Reality check though: servers melted on launch day. Multiple bugs in replies. Zero public case studies with concrete numbers.
$99/mo won't replace a real CMO. But it replaces the marketing you weren't doing at all — daily SEO actions, community monitoring, AI search tracking.
For a 2-person startup with zero marketing bandwidth, going from nothing to automated daily agents is a massive jump.
The question isn't "what game should be next."
It's what happens when a client sees this and says "can you make something like this for our launch?"
A $500 AI video that took an afternoon vs. a $15K production shoot that took three weeks. Kling 3.0 is where AI video stops being a tech demo and starts being billable.
We're 6 months away from agencies losing pitches over this. The ones building the capability now won't need to justify the line item — they'll just hit play.