@kilocode well I tried minimax m3 for creating a simple davinci resolve fusion fuse (which is basically a lua script) and it did worse than deepseek v4 flash.. I like the benchmarks but in my experience so far nothing comes close to qwen 3.7 max. not even gpt 5.5