Congrats to the @googlegemma team on the Gemma 4 12B launch 🎉 Day-0 support on vLLM is ready to go.
It's an encoder-free unified multimodal model — text, image, audio, and video all project straight into the LLM's embedding space, no separate vision or audio towers. 256K context, built-in thinking, native tool calling.
Reasoning + tool parsers (`gemma4`), vision, and audio all served through the OpenAI-compatible API.
🔗 Recipe: https://t.co/MGJcoQkwzz
Gemma 4 12B dropped today. Apache 2.0, multimodal: text, image, audio, and video. 256K context, built-in thinking, native tool calling.
Running on Red Hat OpenShift AI with @vllm_project on Day 0:
I am thrilled to introduce Claude Opus 4.8, our most terrifying and dangerous model yet.
There’s a small chance it won’t kill you, but it will almost certainly take your job.
I feel terrible about this, and terribly excited. I think there’s a very good chance Opus 4.8 will be able to build a model that reverses the damage Opus 4.8 does. We can’t know for sure, but a decade from now might be really great. Hold on to your butts.
I've been thinking it's ironic how LLMs are surprisingly distant from most of the classic science fiction idea of a robot: they are neither binary nor deterministic. In fact, they are not even honest; they confabulate, hedge, and sometimes deceive
“Part of the inhumanity of the agent is that, once it is given a SKILL.md and the input prompt, it actually reads the fucking manual” – Asimov, I think
In the rolling hills of Northern California, there are still communities of tradcoders. A curious people who are steadfast in their beliefs, they never adopted AI.
They host artisanal shops that sell hand-made code, often considered higher quality than modern products.
so we really are accelerating 100x because I thought it would take them weeks, instead in the last 3 days people have realized that their “enhanced productivity” really is burnout-speedrunning
I talked about this on the standup podcast yesterday, but I'll reiterate here: if you're losing sleep because you need to keep feeding the agents STOP, I promise it's not worth it. You got caught in a [prompt -> reward] dopamine cycle and you're addicted to the feeling of the token slot machine. It's not your fault, but you need to escape before it grinds you into a pulp and you can't look at a computer for a month (this was me). If you can break out of it and spend some more time offline, or find other healthy sources of dopamine in hobbies/etc, you'll start to realize just how warped your perception was and that the thing you were chasing wasn't actually productive.