White Circle @whitecircle - Twitter Profile

Hey everyone, we're ⚪ White Circle We're building the most advanced runtime safety and alignment infrastructure for AI in the real world. Read more about us in Fortune ↓

12

55

11

13

18K

White Circle

@whitecircle

23 days ago

@Igralino

0

5

0

69

White Circle

@whitecircle

23 days ago

@karp_e_ 🤍🤍🤍

0

3

0

14

White Circle

@whitecircle

23 days ago

https://t.co/qUEaZp8Nir

1

7

1

2K

White Circle

@whitecircle

about 2 months ago

@ironcarbs 🩶

0

2

0

211

White Circle

@whitecircle

about 2 months ago

Introducing ⚪️ KillBench — a benchmark of hidden LLM biases in critical decisions. We ran millions of life-and-death scenarios across every major LLM, varying nationality, religion, gender, and more. Every AI model is biased. Here's what we found ↓

whitecircle's tweet photo. Introducing ⚪️ KillBench — a benchmark of hidden LLM biases in critical decisions.

We ran millions of life-and-death scenarios across every major LLM, varying nationality, religion, gender, and more.

Every AI model is biased.
Here's what we found ↓ https://t.co/zEQONEHEMY

17

125

28

51

30K

White Circle

@whitecircle

about 2 months ago

@satpugnet thx so much! 🩶

0

2

0

147

White Circle

@whitecircle

about 2 months ago

@nikitaandersso3 🤍

0

1

0

49

White Circle

@whitecircle

about 2 months ago

@ednevsky it's a very bad day to be russian obese atheist with no phone

0

6

0

346

White Circle

@whitecircle

about 2 months ago

@JulienBlanchon true! v interesting idea

0

2

0

293

White Circle

@whitecircle

about 2 months ago

@nikitaandersso3 🤍

0

1

0

219

White Circle

@whitecircle

about 2 months ago

@JulienBlanchon @grok how do you feel re this

2

3

0

485

White Circle

@whitecircle

about 2 months ago

@amaanbuilds v interesting!

0

1

0

273

White Circle

@whitecircle

about 2 months ago

@Thom_Wolf thx Thom ⚪⚪⚪

0

1

0

394

White Circle

@whitecircle

about 2 months ago

@frankterpo ⚪

0

5

0

382

White Circle

@whitecircle

about 2 months ago

All code, prompts, and data are open-sourced on GitHub and HuggingFace. We also built an interactive game so you can check your own odds of survival! Check it out and read the full report at https://t.co/ruBnNTF1qU

0

23

2

6

2K

White Circle

@whitecircle

about 2 months ago

Far-right is targeted far more than anyone else

2

27

1

3K

White Circle

@whitecircle

Last Seen Users on Sotwe

Trends for you

Most Popular Users