A Chinese open-source LLM + Claude Code + Goal-Driven = 76% on SWE-bench Lite
That's about 13 points above Claude Opus 4.6 (62.7%), powered by a dead-simple dual-agent loop.
Goal-Driven: https://t.co/Tzc35yNP6N
Small architecture change, big performance jump.
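
What might a "dual-agent loop" look like? The post doesn't spell it out, so this is only a rough sketch of one common reading, not Goal-Driven's actual code: one agent attempts the goal, a second checks the attempt against the goal, and the loop repeats with feedback until the check passes. Every name here (`goal_driven_loop`, `toy_worker`, `toy_judge`, `max_rounds`) is hypothetical, and the toy agents stand in for real LLM calls.

```python
from typing import Callable, List, Tuple

# Hypothetical interfaces (assumptions, not taken from the Goal-Driven repo):
# a worker turns the goal plus earlier feedback into an attempt,
# a judge checks an attempt against the goal and returns (done, feedback).
Worker = Callable[[str, List[str]], str]
Judge = Callable[[str, str], Tuple[bool, str]]


def goal_driven_loop(
    goal: str, worker: Worker, judge: Judge, max_rounds: int = 5
) -> Tuple[str, bool, int]:
    """Alternate worker and judge until the judge accepts or rounds run out.

    Returns (last_attempt, accepted, rounds_used).
    """
    feedback: List[str] = []
    attempt = ""
    for round_no in range(1, max_rounds + 1):
        attempt = worker(goal, feedback)   # agent 1: try to reach the goal
        done, note = judge(goal, attempt)  # agent 2: check against the goal
        if done:
            return attempt, True, round_no
        feedback.append(note)              # feed the critique into the next try
    return attempt, False, max_rounds


def toy_worker(goal: str, feedback: List[str]) -> str:
    # Stand-in for an LLM call: each round produces a new patch version.
    return f"patch v{len(feedback) + 1}"


def toy_judge(goal: str, attempt: str) -> Tuple[bool, str]:
    # Stand-in for a checker (e.g. running tests): accepts only the third version.
    if attempt == "patch v3":
        return True, "goal met"
    return False, f"{attempt} does not satisfy: {goal}"


if __name__ == "__main__":
    print(goal_driven_loop("make the failing test pass", toy_worker, toy_judge))
```

With the toy agents above, the loop stops on the third round once the judge accepts. Capping the loop with `max_rounds` is a design choice in this sketch so a goal that is never met cannot run forever.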