Hi
On account of Twitter imploding, we're no longer active here. You can follow us at:
https://t.co/aGukLUNrKG
You can also follow our main account, linked therein. See you all later, friends. It has been lovely.
VideoGameBench
Can Vision-Language Models complete popular video games?
The best-performing model, Gemini 2.5 Pro, completes only 0.48% of VideoGameBench and 1.6% of VideoGameBench Lite.