Intelligence should be open, accessible, and ready to build with, empowering every developer, everywhere.
GLM-5.2 is now available to all GLM Coding Plan users, including Lite, Pro, Max, and Team plans.
https://t.co/AedZACyzej
As our new flagship model, GLM-5.2 delivers powerful coding capabilities, usable 1M-context support, and continued strengths in long-horizon tasks.
API and Chatbot services will launch next week. The model will also be officially open-sourced next week under the MIT License.
The future of AI is open, and it belongs to the people.
Finally got my hands on the big one.
Qwen3.5-122B-A10B — 122 billion parameters. Too big for any single consumer GPU.
So I rented 4 of each... and then one professional card to see if brute force even matters.
- 1x RTX PRO 6000 (96GB): 101.4 tok/s
- 4x 5090 (128GB): 87.0 tok/s
- 4x 4090 (96GB): 25.1 tok/s
- 4x 3090 (96GB): 20.8 tok/s
One single $8,500 card beat four RTX 5090s