NP @np_hard - Twitter Profile

Pinned Tweet

2 months ago

As part of @PrimeIntellect's RL residency program, I've been exploring how to do multi-agent RL using their current stack (from verifiers + prime-rl to lab experiments with hosted training/evals) and thinking about how it could be extended to support these abstractions natively. I've summarized my findings in the blog post below and I'll leave a few comments here, too...

9

415

50

466

66K

np_hard retweeted

Prime Intellect @PrimeIntellect

29 days ago

The next wave of AI will not be won by better prompts. It will be won by systems that learn from experience. Today, Prime Intellect Lab is out of beta, open for you to start training your own models. The era of self-improving agents is here.

83

2K

198

1K

1M

NP

@np_hard

about 1 month ago

@moultano This is awesome! ❤️

0

2

0

123

NP

@np_hard

2 months ago

@vonbinging

0

1

0

17

NP

@np_hard

2 months ago

Sometimes I feel like AGI is just a psyop created by SF’s real estate market

1

5

0

253

np_hard retweeted

Sinatras

@myainotez

2 months ago

https://t.co/3kq6s5Syl3

6

87

20

46

18K

NP

@np_hard

2 months ago

@maxbittker @PrimeIntellect Yeah! Maybe with something like CFR, but I haven't looked much into this. It'd probably imply many more changes.

0

1

0

96

NP

@np_hard

2 months ago

@blaiseaguera Hey, this is really cool! I read your book recently too, and enjoyed it a lot. I just shared this work yesterday, and I think it’s very related; I do mention your book early on in the post. Would love to talk if there’s an opportunity :)

NP

@np_hard

2 months ago

As part of @PrimeIntellect's RL residency program, I've been exploring how to do multi-agent RL using their current stack (from verifiers + prime-rl to lab experiments with hosted training/evals) and thinking about how it could be extended to support these abstractions natively. I've summarized my findings in the blog post below and I'll leave a few comments here, too...

9

415

50

466

66K

0

1

0

172

np_hard retweeted

will brown

@willccbb

2 months ago

veeery cool writeup digging into nuances of training, experimentation, and infra for multi-agent RL :)

6

312

18

272

35K

NP

@np_hard

2 months ago

I would like to thank @PrimeIntellect for the support with this work, specifically @willccbb, @johannes_hage, @omouamoua, @hallerite, @GottliebEli and @creet_z. Also, thanks for the feedback, @myainotez / @BillyHoy1_ / @_djdumpling!

0

16

1

3

1K

NP

@np_hard

2 months ago

I discuss some more details in the blog post (https://t.co/HR5vMgUQFD). I'm very excited to see what comes out of this and of related work in the residency, like @BillyHoy1_'s stuff. Hopefully it will spark more work on open multi-agent RL!

1

17

3

8

1K

NP

@np_hard

3 months ago

computer use

0

2

0

171
