We're a part-time, virtual research program that gives students and early career professionals an opportunity to work with professional AI safety researchers.
Applications for the Generator Residency close on Monday EOD! Last chance to apply.
Fully funded, 6k stipend + travel + housing, 3 months with an extension, in-person in Berkeley. Probably the best path into AI safety for non-researcher roles.
📣 Only 3 days left to apply for Generator!
Apply by April 27 to join our inaugural cohort with advisers from AI Futures Project, BlueDot, Coefficient Giving, FAR.AI, Forethought, METR, RAND, and more!
https://t.co/nfvm4Urxe4
Announcing the Generator Residency: a 3-month residency for AI safety generalists, by @KairosAIS × @ConstellOrg.
Fully funded. In-person in Berkeley. Summer 2026.
🗓 Apply by April 27
https://t.co/0pM58jFJBP
Excited to share our new paper! We looked at when reasoning LLMs 'knew' their final answer internally vs. when it was stated in chain-of-thought. Turns out these models can be performative depending on the task!
In this work, we complement behavioral goal-directedness evals of LLM agents with a probing analysis of environment and plan representations, examining whether observed actions are consistent with models' internal beliefs, and how reasoning affects representations. Check it out!
LawZero is accepting applications as part of the SPAR Spring 2026 program!
If you're interested in studying model awareness or emergent misalignment, you can learn more and apply here: https://t.co/c9IWRLT3IX.
Applications are open until Jan 14, 2026.
Come work with me and @SPARexec to build an AI mech interp researcher to accelerate AI safety research.🧠🔬
In the last cohort, my mentees built AI agents that automatically find and refine explanations for SAE features (demo of what they built after only one month below). In this cohort, we want to push for agents that discover and explain full circuits.
Deadline is Jan 14th! 🗓️
📣 Only 2 days left to apply for this round of SPAR!
Apply by January 14 to join our largest round yet — 130+ projects with mentors from Google DeepMind, RAND, AI Security Institute, Apollo Research, SecureBio, Machine Intelligence Research Institute, and more!
Work on a part-time AI safety, AI policy, AI security, or biosecurity project. Open to students & professionals; prior research experience is not required for all projects.
I'm mentoring a SPAR project on evaluating and refining alignment targets for LLMs (constitutions, model specs, etc.) this spring! Apply by January 14 to work with me or other SPAR mentors - project details/application link ⬇️:
Does training language models on AI safety literature make them more likely to scheme?
This is one of the research questions being explored in the upcoming round of @SPARexec. A few projects I'm excited about: 🧵
The NYU Center for Mind, Ethics, and Policy is seeking research fellows to contribute to upcoming reports on legal personhood and economic rights for digital minds. Please apply if you have interest in working with us!
I'm glad to mentor again for this round of SPAR, likely with @zhonghaohe! Together let's help human-AI coevolution go a little bit better :)
⬇️🧵Here's a collection of research ideas I'd be excited to mentor projects on. Feel free to pitch yours too!
🚀 We're excited to announce that mentee applications are now open for the Spring round of the SPAR research program!
This will be our largest round ever, featuring 130+ projects across AI safety, policy, governance, security, welfare, and strategy.