Alex Lewandowski @axlewandowski - Twitter Profile

about 13 hours ago

Why do we assume an RL agent can always compute the correct action immediately? Every policy is a program and thus resource bounded. In this blog, I argue why computation should be part of decision making, illustrated with some toy examples. 🔗https://t.co/tBBRc6K0CX

0

2

1

100

axlewandowski retweeted

Louis Kirsch

@LouisKirschAI

about 1 month ago

After automating AI research with @SchmidhuberAI and building AI Scientists at DeepMind, now comes the real experiment: the institution itself. Excited to co-found @inherent_labs: the recursively self-improving lab for scientific AI. https://t.co/SQjUduaG3D

24

526

41

336

72K

axlewandowski retweeted

Richard Sutton

@RichardSSutton

2 months ago

I am definitely going to this...

3

299

19

95

44K

axlewandowski retweeted

RL in Big Worlds @rlc_bigworlds

3 months ago

RL in Big Worlds is a workshop at @RL_Conference about ideas that enable agents to achieve goals in environments vastly more complex than themselves. This requires giving agents the ability to learn continually and use approximate value functions, models and policies effectively

3

185

28

118

89K

Alex Lewandowski @axlewandowski

4 months ago

@DimitrisPapail Love this work! One way to avoid training: LMs are already universal computers if you prompt them to simulate Lag systems: https://t.co/LQYeD0uvgr My followup shows even untrained LMs can do this, suggesting programmability is key to language computers: https://t.co/YhEKzyQYwo

0

1

0

1

55

axlewandowski retweeted

sorina

@robot_in_space2

5 months ago

We organized an RL competition during the first Openmind Research Institute Winter School in Malaysia. The participants were able to implement SARSA and SAC in just 2 days onboard our Embodied MuJoCo Ant! 🎉

5

202

26

41

18K

axlewandowski retweeted

Michał Bortkiewicz @m_bortkiewicz

7 months ago

Great week at #NeurIPSanDiego: packed, intense, and genuinely inspiring. Grateful for all the discussions and feedback. Now looking forward to some quieter days and cooking up new stuff 🚀

0

39

3

4

2K

axlewandowski retweeted

Erin Grant @ermgrant

7 months ago

Thrilled to announce I'll start in 2026 as faculty in Psych & CS @UAlberta + @AmiiThinks Fellow!! 🥳 Recruiting students to develop theories of cognition in natural and artificial systems 🤖💭🧠. Find me at #NeurIPS2025 workshops (talk at @CogInterp & organising @DataOnBrainMind)

9

199

21

20

25K

Alex Lewandowski @axlewandowski

7 months ago

At NeurIPS and interested in continual learning? Stop by our spotlight poster this Friday @ 11am (#508). Our work provides a computational approach to the big world hypothesis through embedded agents. Feel free to reach out if you want to meet up while in San Diego!

1

48

8

19

7K

Alex Lewandowski @axlewandowski

7 months ago

As a reviewer: I spent multiple days replying to Authors' rebuttals. I cannot overstate how eerily similar some of it felt to pointing out bugs to an LLM. I proceeded on the assumption that my critique served the scientific record; it is disheartening to hear otherwise.

Delip Rao e/σ

@deliprao

7 months ago

Hey @iclr_conf, reverting scores is unnecessary punishment for the majority of the authors who had nothing to do with this incident and had successful rebuttals. Instead of detecting collusions on your end (you have a ton of metadata) why is this everyone’s burden to bear?

7

215

29

10

39K

0

7

0

1K

Alex Lewandowski @axlewandowski

7 months ago

Language models can make convenient writing partners, if you know how to use them. But one disadvantage of their increased use in peer-review is a collapse in thought diversity. Instead of hearing from thousands of different minds, we get thousands of samples from a few.

0

6

0

234

Alex Lewandowski @axlewandowski

7 months ago

The link to your paper belongs in 1/n, not n/n. The paper is the product. Algorithms be damned.

0

5

0

716

Alex Lewandowski @axlewandowski

9 months ago

@tw_killian I have the opposite problem: everything is RL

1

0

58

Alex Lewandowski @axlewandowski

9 months ago

@jsuarez Yeah, it's not RL. It's offline RL. Calling it supervised learning is equally misleading: the targets in offline RL provide only feedback on a chosen action. In supervised learning, the target provides full feedback. Also, interaction is needed for evaluation in offline RL.

0

1

0

41

axlewandowski retweeted

Prabhat Nagarajan @prabhatmn

10 months ago

Have people seen this prescient 2001 post by @RichardSSutton on self-verification? "An AI system can create and maintain knowledge only to the extent that it can verify that knowledge itself". This sentiment underpins much LLM reasoning research today. https://t.co/gK3DOwqYm8

1

29

4

11

8K

axlewandowski retweeted

Finding The Frame Workshop @RLFrameWorkshop

10 months ago

Thanks to everyone who joined us for another great workshop! 🥳 This year we once again asked our panelists to share a paper or book that heavily influenced their perspective 📚 check out their recommendations here! https://t.co/ZVfbTWk0lg

0

16

2

8

955

axlewandowski retweeted

Mateusz Ostaszewski @MatOstasze

11 months ago

🚀 Excited to announce our paper "Balancing Expressivity and Robustness: Constrained Rational Activations for RL" will be an *oral* at #CoLLAs2025! We study how trainable rational activations boost expressivity in RL but can also harm stability:

1

18

3

5

3K

Alex Lewandowski @axlewandowski

about 1 year ago

I will be presenting a poster at RLDM on Wednesday @ 4:30pm. We show that embedding an agent implicitly constrains both it and the environment. We use this constraint to characterize continual adaptation. https://t.co/3eS6qDCD2T Feel free to reach out if you're attending RLDM!

0

35

3

8

5K

axlewandowski retweeted

Finding The Frame Workshop @RLFrameWorkshop

about 1 year ago

🚨 We extended Finding the Frame's submission deadline to June 15 AoE! 🚨 ✨We're looking for bold ideas that rethink the foundations of RL: goals, values, rewards, formalisms, and beyond🚀 More details: https://t.co/mNy0SGJtRu See you @RL_Conference !

1

18

10

2

5K

axlewandowski retweeted

Finding The Frame Workshop @RLFrameWorkshop

about 1 year ago

🚨 Reminder! Submissions for @RL_Conference's Finding the Frame are due May 30 (AoE)! We're looking for bold ideas that rethink the foundations of RL: goals, values, rewards, formalisms, and beyond. 🧠Philosophy, theory, critique welcome! 🔗More details: https://t.co/mNy0SGJtRu

0

17

11

4

7K
