- I work on post-training and RL
- I am an expert at the alphabet soup - DPO, PPO, GRPO
- my papers are cited by all the OpenAI researchers
- dropped a SOTA 10b LLM just a few weeks ago
- my dreams are about LLM alignment techniques
Still got laid off by Meta, who hired a guy with my same profile for $100M a year 😭😭
ChatGPT quietly scrubbed nearly 50,000 shared conversations from Google's index today after our investigation. They thought they'd solved the problem. They were wrong. (1/5)
A third Disco Elysium successor studio has gone public today, but this is the one to get excited about: Multiple writers from the first game and a manifesto to create an RPG "with complexity and ambition to rival our wretched and wonderful world" https://t.co/ALWvGE0yst
Investigators have released a computer-generated image of a man they would like to identify in connection with a burglary in Tunbridge Wells. https://t.co/hPrbjaLeyU