Ed Li @t_ed_li - Twitter Profile

Pinned Tweet

5 months ago

What if, instead of “solving” continual learning, we can “work around” it? Blog here: https://t.co/IYSQrWdIao Finetuning a multiagent system end-to-end = natural specialization without catastrophic forgetting.

t_ed_li's tweet photo. What if, instead of “solving” continual learning, we can “work around” it?

Blog here: https://t.co/IYSQrWdIao

Finetuning a multiagent system end-to-end = natural specialization without catastrophic forgetting. https://t.co/wsV2qCRvKZ

5

13

3

5

937

Ed Li

@t_ed_li

12 days ago

this gotta be ai generated, thousands of ppl in the photo but only my face is shown clearly and distinctly lol

Beff (e/acc)

@beffjezos

13 days ago

This is the middle of SF rn

35

412

4

18

37K

0

21

Ed Li

@t_ed_li

12 days ago

@OKfallah @beffjezos lollll what are the odds

0

15

t_ed_li retweeted

Tom Dörr

@tom_doerr

24 days ago

Multi-agent system automates end-to-end scientific research lifecycle https://t.co/jsZHmzrNda

4

205

25

230

9K

Ed Li

@t_ed_li

23 days ago

@tom_doerr Lead author here, we got plans to finetune all of these agents via RL end-to-end next, stay tuned;)

0

1

0

24

Ed Li

@t_ed_li

24 days ago

@pmarca the structural similarity is real, but ai panic has a feature the others lacked. the people most worried are the people building the thing, and that internal dissent doesn't fade the same way external panics do

0

3

0

1

489

Ed Li

@t_ed_li

24 days ago

@JeffDean @rronak_ @MichaelElabd @QuantumArjun continuous learning systems being more robust assumes you can prevent catastrophic forgetting. that's still the open problem. otherwise the system is just one that hasn't been pushed off-distribution yet

0

86

Ed Li

@t_ed_li

24 days ago

@naval @rauchg @bscholl models instructing humans is the inversion the field has been waiting for. the productivity unlock isn't the AI doing the work, it's the AI knowing the next step you should take

2

0

1K

Ed Li

@t_ed_li

25 days ago

@gdb byo mcp servers is the moment codex starts being a platform instead of a product. now the question is whether the marketplace forms inside codex or outside

0

306

Ed Li

@t_ed_li

25 days ago

@gdb how does codex coordinate state between parallel browser subagents, shared memory or message passing per task?

0

97

Ed Li

@t_ed_li

25 days ago

@NousResearch the agent jam format is the natural successor to the model release thread, when the action moves from the weights to the loops

0

118

Ed Li

@t_ed_li

25 days ago

@willccbb max power + min complexity + opinionated path is the trilemma. you can pick two. the only way out is opinionated defaults that are easy to override but hard to discover

0

32

Ed Li

@t_ed_li

25 days ago

@rasbt full attention coming back after a year of sliding-window papers is exactly the cycle academic ML always runs

0

129

Ed Li

@t_ed_li

25 days ago

@gdb real-time meeting Q&A is the use case where agent latency budgets actually bite. if it's slow, the user has already moved on by the time the answer lands

0

1

0

156

Ed Li

@t_ed_li

25 days ago

@ziwenxu_ is it the agent doing speculative subtask exploration that's burning the budget, or is it the long-running tool calls eating a lot of context per turn?

1

0

351

Ed Li

@t_ed_li

25 days ago

@AravSrinivas tokenization quietly being on the critical path is the kind of detail you only learn when you ship at scale

0

96

Ed Li

@t_ed_li

25 days ago

@willccbb the world model that lets you bypass replayable environments has to be reliable on counterfactuals you haven't seen, which is structurally the same problem as RL. you've recursed up a level

0

1

0

257

Ed Li

@t_ed_li

25 days ago

@russelljkaplan the operation vacation framing implies a closed loop where the system improves itself. was the feedback signal coming from fleet shadow mode or from sim-based eval?

0

259

Ed Li

@t_ed_li

25 days ago

@gregisenberg haha can testify to the popularity of coffee shops, and what r ur startup ideas on that?

0

5

Ed Li

@t_ed_li

about 1 month ago

@dair_ai treating coordination as a separable configurable layer is the half step. the next one is training the whole system end-to-end so coordination emerges from optimization instead of getting designed by hand

0

16

Ed Li

@t_ed_li

Last Seen Users on Sotwe

Trends for you

Most Popular Users