3D Artist and Gfx with my big Bro, Freelance.
“If wars can be started by lies, peace can be started by truth” Assange
"Divided By label, united by noticing" MM
AI went from predicting biological processes to controlling them, and now it is becoming an active participant in science.
Agents' architectures perfectly fit the scientific loop at the execution level: read literature, write and run code, generate hypotheses, test them, evaluate the results, and repeat.
Here is a broad look at this shift
Talk to Gemma 4 31B with our voice app!
It sees and searches the web faster than you blink. Thanks to @cerebras’ ultra fast inference, the LLM is almost instantaneous.
The whole stack is fully open-source, and is a drop-in replacement for OpenAI's realtime API.
Demo: https://t.co/RRCwMbkW9r
We took a 30B model and split it in two to write tokens in parallel instead of one at a time.
Introducing Nemotron-Labs-TwoTower: a diffusion language model from NVIDIA Research adapted from Nemotron-3-Nano-30B-A3B. Here’s how it works: one half holds the context, the other writes the tokens, with both reusing the pretrained model instead of training a new one from scratch.
We found it kept 98.7% of the original model’s quality at 2.42× faster generation.
"Its (Sonnet 5) performance is close to Opus 4.8, at lower prices."
So I ran 4 canvas test through both.
> Opus 4.8, 4/4 actually animating.
> Sonnet 5, 2/4 came back as static images.
And "lower price"? On the paper shredder task, Sonnet 5 spent $0.36 for a static image. Opus 4.8 spent $0.18 and it actually animated.
The 4 tests:
> Win 98 drag-to-BSOD
> Self-typing keyboard + CRT
> Letter burning
> Paper shredder
Thanks for the PRs that you are starting to send to tau τ 🤗
You guys are amazing. New providers and tutorials arriving soon to tau to help you create your own coding agents!
If you want to contribute 👇
Notes (and a Pelican) on Claude Sonnet 5 - the new tokenizer makes it ~1.4x more expensive for English, ~1.33x more expensive for Spanish but roughly the same price for Simplified Mandarin https://t.co/UUnmjtPaSi
Super excited about open-source router systems and routing models like @vllm_project semantic router: https://t.co/Gwza9jPWzr
The future is multi-models and you'll want to customize your router the same way you customize your code!
It could be the key to tilt the value capture from a few expensive frontier models to a long-tail of models (especially open-source).
More people should build those!
Always so much fun to chat with @3blue1brown
AI has been making much faster progress in math than in other fields.
As a result, mathematics is showing us, very concretely, what AI progress in other fields will look like.
Even within mathematics, there's a jagged landscape. What does it look like?
What is the nature of the most important conceptual breakthroughs in the history of mathematics, and how different are they from what AIs are currently able to do?
Does AI (on net) increase or decrease human understanding of the field?
How big is the overhang from having AIs systematically try to connect ideas already in the literature?
And what advice does Grant have for aspiring mathematicians, coders, and other students who are passionate about fields that are being most transformed upon by AI?
0:00:00 – AI is discovering new proofs. Is that AGI?
0:11:32 – The verification loop on conceptual breakthroughs can be a century long
0:26:12 – Will we understand an AI proof of the Riemann hypothesis?
0:38:08 – Can AI find the hidden bridges between fields?
0:53:48 – Why real-world tasks don’t fit into RL environments
1:07:07 – Good writing requires theory of mind that AI still lacks
1:16:02 – Why learning will still depend on human curation
Look up Dwarkesh Podcast on Spotify, Apple Podcasts, YouTube, etc.
People of https://t.co/oUoqqL9hAp. While @mitsuhiko is away in SF hugging people, I thought I'd celebrate @AnthropicAI new Sonnet 5 model and the end of the first european heatwave with a new pi release.
Sent from a euromaxxing room without AC at 10:30pm at 28°C
Was thinking if I should highlight this tweet or not, but it’s a masterclass in the amount of vitriol people face when working on open source.
Is the app great yet? No. It’s a start.
It was built by the community. Getting the iOS and Android apps working with secure pairing and push notifications - and getting both through App Review -took a surprising amount of work.
OpenClaw wasn’t acquired by OpenAI and isn’t an OpenAI product. It’s an open, independent project under the OpenClaw Foundation. OpenAI sponsors the project’s token usage; I work there.
Cristian, your tweet was just one of ~30 I woke up to today. I’d genuinely love your help making it great.
Attention is still the scarcest resource.
I’d rather spend mine encouraging people who build.
Meituan's LongCat-2.0 reportedly lands near GPT-5.5 on SWE-bench. So I threw 5 HTML canvas animation prompts at both.
🥷 Paper sliced fruit-ninja style.
💧 An ink drop diffusing in water.
🔥 A letter burning.
🗑️ Paper crumpling into a ball.
✂️ A strip-cut shredder.
Here's how they did 👇