๐ฅณ๐ฅณ๐ฅณI defended my PhD thesis today! Special thanks to my wonderful advisor @zicokolter and committee members @rsalakhu@gneubig@LesterMackey!
๐๐๐I am joining @OpenAI as a researcher, super excited to keep working on frontier models and meet everyone in SF!
Introducing GPT-5.5
A new class of intelligence for real work and powering agents, built to understand complex goals, use tools, check its work, and carry more tasks through to completion. It marks a new way of getting computer work done.
Now available in ChatGPT and Codex.
Codex for (almost) everything.
It can now use apps on your Mac, connect to more of your tools, create images, learn from previous actions, remember how you like to work, and take on ongoing and repeatable tasks.
Alzheimerโs is one of medicine's hardest unsolved problems, and one of the most devastating.
At the OpenAI Foundation, we believe AI is well suited to its complexity. We're directing over $100M to scientists mapping the disease, designing drugs, & more.
I wrote about it here:
https://t.co/wOkiE78KUo
Introducing the OpenAI Safety Fellowship, a new program supporting independent research on AI safety and alignmentโand the next generation of talent.
https://t.co/vAQKvf8KyO
New OpenAI post: Can midtraining on docs about aligned AI bake in alignment priors for agents? We report an experiment where those priors are quickly washed away by RL and fail to generalize to agentic settings. But that cuts both ways: priors that AIs are misaligned fade too!
In 2023, WebArena took 7 grad students more than 6 months to build just 5 environments with 812 variable browser-use tasks.
Now, it takes under 10 hours and less than $100 per environment, with easy support for parallel generation.
Excited to introduce WebArena-Infinity: a scalable approach for automatically generating high-authenticity, high-complexity browser environments with verifiable tasks suitable for RL training and benchmarking.
Even strong open-source models that already achieve 60%+ success rates on WebArena and OSWorld complete fewer than 50% of tasks here.
Project page: https://t.co/tEtYkChMBt
Repo: https://t.co/lBg69T12xx ๐งต (1/n)
Qiang Liu, Chris Oates, and I are writing a monograph on Probabilistic Inference and Learning with Steinโs Method, and weโd love to get your feedback on the first draft