Taylor W. Killian @tw_killian - Twitter Profile

Pinned Tweet

21 days ago

📣 There's never a "best" time to share important updates, especially after sitting on this for so long... I'm joining the faculty @BYU + @BYUCS this Summer as an Assistant Professor in preparation for the upcoming school year. Lots of excitement and a fair bit of nerves. 🧵

tw_killian's tweet photo. 📣 There's never a "best" time to share important updates, especially after sitting on this for so long...

I'm joining the faculty @BYU + @BYUCS this Summer as an Assistant Professor in preparation for the upcoming school year. Lots of excitement and a fair bit of nerves. 🧵 https://t.co/GZQFTmF97T

44

173

11

8

19K

tw_killian retweeted

Institute of Foundation Models

@IFM_MBZUAI

about 17 hours ago

2/4 The system decomposes deliberation into three processes: reactive execution (System I), future-state simulation via LLM-as-world-model (System II), and a learned configurator (System III) that decides when to simulate, how far ahead, and when to act directly. RL trains the configurator to plan further ahead, not more often. Allocation, not compression.

IFM_MBZUAI's tweet photo. 2/4
The system decomposes deliberation into three processes: reactive execution (System I), future-state simulation via LLM-as-world-model (System II), and a learned configurator (System III) that decides when to simulate, how far ahead, and when to act directly.

RL trains the configurator to plan further ahead, not more often. Allocation, not compression.

1

3

2

0

331

Taylor W. Killian @tw_killian

1 day ago

@ziv_ravid Hey Ravid, his skill set seems to map really cleanly to some of the problems we're working on and constructing solutions for @LilaSciences. Can you connect us? cc: @BenKompa + @AndrewLBeam

1

4

0

447

Taylor W. Killian @tw_killian

4 days ago

@andrew_n_carr #givethanks

0

1

0

565

Who to follow

Pavel Izmailov

@Pavel_Izmailov

Researcher @AnthropicAI 🤖 Assistant Professor @nyuniversity 🏙️ Previously @OpenAI #StopWar 🇺🇦

Jakob Foerster

@j_foerst

Associate Prof in ML @UniofOxford. Something Something Research Scientist @MetaAI. Something @FLAIR_Ox. Always #teamhuman. Opinions belong to the world.

Brandon Amos

@brandondamos

🧙 RL @Reflection_AI past: @MetaAi @GoogleDeepmind @SCSatCMU @Cornell_Tech

tw_killian retweeted

RedditCFB

@RedditCFB

4 days ago

Never been a better time to be earnest and love what you do.

3

6K

154

63

173K

Taylor W. Killian @tw_killian

4 days ago

@alexgshaw @harborframework Did you at least get lunch in the rail car?

1

0

51

Taylor W. Killian @tw_killian

5 days ago

@xeophon @code_star Don’t get too far ahead. I’m loving my little neighborhood

0

3

0

135

tw_killian retweeted

RL Beyond Rewards Workshop @RLBRew_RLC

7 days ago

Two BIG updates for the RLBrew Workshop at #RLC2026! 📣 1️⃣ Dual submissions are welcome 2️⃣ We’ll be awarding a Best Paper RLBrew Award 🏆 You have 2 DAYS LEFT to submit — deadline: May 29! Details: https://t.co/segLTne6Tp

1

10

6

2

982

tw_killian retweeted

Niloofar

@niloofar_mire

10 days ago

Tbh i’m kinda sick of this academic doomerism vibe consuming all of bay area and the self-aggrandizing pov that frontier labs have. Sure a lot of exciting stuff is happening but we wouldn’t be where we are wo academia & there is sth to be said about the pursuit of curiosity.

22

603

51

65

51K

Taylor W. Killian @tw_killian

11 days ago

@CatGodSandHive @joeddav Thanks CatGod, 🥰

0

37

Taylor W. Killian @tw_killian

11 days ago

My friends… get ready for mountain pictures at a higher frequency… I really don’t ever want to take them for granted. Spent a few days getting some things set up in Provo and couldn’t leave before scampering up “Khyv” Peak (that new name is going to take some getting used to)

tw_killian's tweet photo. My friends… get ready for mountain pictures at a higher frequency… I really don’t ever want to take them for granted.

Spent a few days getting some things set up in Provo and couldn’t leave before scampering up “Khyv” Peak (that new name is going to take some getting used to) https://t.co/JtjUAb9J05

Taylor W. Killian @tw_killian

21 days ago

📣 There's never a "best" time to share important updates, especially after sitting on this for so long... I'm joining the faculty @BYU + @BYUCS this Summer as an Assistant Professor in preparation for the upcoming school year. Lots of excitement and a fair bit of nerves. 🧵

44

173

11

8

19K

1

28

0

2K

tw_killian retweeted

Lara Sá Neves @larasnevess

13 days ago

SR²AM is out! Thinking longer ≠ thinking smarter. SR²AM knows which one it needs. A configurator regulates internal simulation: when to predict future states, how far, and when to skip. Result: 30B competing with 685B–1T at a fraction of the token cost. Model and code available

1

21

7

13

3K

Taylor W. Killian @tw_killian

12 days ago

@mdeng34 @fronxer @jinyuhou0 @larasnevess @varad0309 @waterluffy @ericxing Did someone say RL?! We’re RL-maxxing all the time here. Great call out Mingkai.

0

2

0

1

106

tw_killian retweeted

Continual RL Workshop @continual_learn

13 days ago

Reminder: the submission deadline is in about one week: May 29, 2026 (AoE). We look forward to your submission!

0

17

7

3

2K

tw_killian retweeted

RL_Conference @RL_Conference

12 days ago

Under review? Still welcome! The RLBrew deadline is 1 week away — great place to get early feedback on your work!

0

6

5

3K

Taylor W. Killian @tw_killian

13 days ago

@RyanBoldi Darn it Ryan you can’t be adding more great poets to my to read pile! This looks amazing. Can’t wait to try it out asap

0

1

0

319

tw_killian retweeted

Mingkai Deng

@mdeng34

13 days ago

This is a prototype using language-based world models. Stay tuned for our next steps on multimodal and physical world models. The concept of a configurator, which decides when and how deeply to engage a reasoning process, is not specific to planning, but extensible to learning and adaptation going forward. 📄 SR²AM: https://t.co/LKeXZFN8Hh 📄 SiRA: https://t.co/5JzLSEu4nO 🌐 Project: https://t.co/1CUlEdFMxY 💻 Code: https://t.co/JSBoERYHaB 🤗 SR²AM-v0.1-8B: https://t.co/b1kkuvFL6k 🤗 SR²AM-v1.0-30B: https://t.co/PES00q6a4J Joint work with @jinyuhou0, @larasnevess, @varad0309, @tw_killian, @waterluffy, @ericxing

2

59

10

46

4K

Taylor W. Killian @tw_killian

13 days ago

New work led by the inimitable @mdeng34 and @jinyuhou0. We took a fair bit of time thinking about whether an agent can assess how much effort it needs to spend on thinking through the problems it is presented. The resulting algorithm is one step to a fully adaptive future!

Mingkai Deng

@mdeng34

13 days ago

Frontier LLMs are converging on efficient, adaptive reasoning. Opus 4.7 lets the model decide how deeply to reason. GPT-5.5 achieves strong results with fewer reasoning tokens. We study a related but more structural question: what 𝗸𝗶𝗻𝗱 𝗼𝗳 𝗿𝗲𝗮𝘀𝗼𝗻𝗶𝗻𝗴 should we adapt? Last year in SiRA (upper figure), we showed that simulative reasoning (System II), which uses a 𝘄𝗼𝗿𝗹𝗱 𝗺𝗼𝗱𝗲𝗹 to evaluate consequences of actions, yields up to 124% improvement over reactive baselines (System I), and that strong reasoning models (o1, o3-mini) fail as planners without this structure. In our new paper SR²AM (lower figure), we add a learned 𝗰𝗼𝗻𝗳𝗶𝗴𝘂𝗿𝗮𝘁𝗼𝗿 (System III) that self-regulates when to simulate, how far ahead, and when to skip planning entirely. Efficient reasoning is not just shorter reasoning: it is better allocation of simulation.

mdeng34's tweet photo. Frontier LLMs are converging on efficient, adaptive reasoning. Opus 4.7 lets the model decide how deeply to reason. GPT-5.5 achieves strong results with fewer reasoning tokens.

We study a related but more structural question: what 𝗸𝗶𝗻𝗱 𝗼𝗳 𝗿𝗲𝗮𝘀𝗼𝗻𝗶𝗻𝗴 should we adapt?

Last year in SiRA (upper figure), we showed that simulative reasoning (System II), which uses a 𝘄𝗼𝗿𝗹𝗱 𝗺𝗼𝗱𝗲𝗹 to evaluate consequences of actions, yields up to 124% improvement over reactive baselines (System I), and that strong reasoning models (o1, o3-mini) fail as planners without this structure.

In our new paper SR²AM (lower figure), we add a learned 𝗰𝗼𝗻𝗳𝗶𝗴𝘂𝗿𝗮𝘁𝗼𝗿 (System III) that self-regulates when to simulate, how far ahead, and when to skip planning entirely.

Efficient reasoning is not just shorter reasoning: it is better allocation of simulation.

4

278

47

273

61K

0

7

2

3

1K

tw_killian retweeted

Isha Puri

@ishapuri101

13 days ago

It's never made sense to me that RL collapses all reward signals to a single scalar. Today, we fix that! Introducing Vector Policy Optimization: we train models to inherently optimize for the varied nature of a reward vector, creating diverse sets of answers ideal for test time search. Website and code coming soon!

11

712

67

576

68K

tw_killian retweeted

RL Beyond Rewards Workshop @RLBRew_RLC

13 days ago

Reminder! RLBRew deadline in coming up in 7 days! Submit your works soon👩‍💻 Reminder that we accept under review papers! This is a good place to discuss your ideas and get feedback from the community

RLBRew_RLC's tweet photo. Reminder! RLBRew deadline in coming up in 7 days! Submit your works soon👩‍💻

Reminder that we accept under review papers! This is a good place to discuss your ideas and get feedback from the community https://t.co/4Zkkzbh04r

0

12

5

10

4K

Taylor W. Killian

@tw_killian

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users