Hey everyone, we're ⚪ White Circle.
We're building the most advanced runtime safety and alignment infrastructure for AI in the real world.
Read more about us in Fortune.
Introducing ⚪️ KillBench, a benchmark of hidden LLM biases in critical decisions.
We ran millions of life-and-death scenarios across every major LLM, varying nationality, religion, gender, and more.
Every AI model is biased.
Here's what we found.
All code, prompts, and data are open-sourced on GitHub and HuggingFace.
We also built an interactive game so you can check your own odds of survival!
Check it out and read the full report at https://t.co/ruBnNTF1qU