ICYMI
Nanbeige 4.1, a 3B model released by China's equivalent of Indeed, outperforms both Qwen3-30b-A3b and Qwen 3.5 4b. It can finish long-horizon tasks with 600+ tool calls.
We are working on something similar and thinking about doing a paper-reading session. DM or comment if you're interested.
A bit of news: After nearly 9 years, I have decided to leave Google DeepMind and join Anthropic (after taking some time to recharge). I am incredibly grateful for my time at GDM. @demishassabis took a real chance letting me lead the AlphaFold team just six months after finishing my PhD, and the entire GDM team taught me so much about how to do great science. GDM is a special place, and I’ll still be excited to hear about what amazing things they discover next.
The US has outlined concerns to ASML that one of its top chip machines may have made its way into China, violating export curbs https://t.co/zujrQA0fS6