Prannay Hebbar @pran_ker - Twitter Profile

Pinned Tweet

Prannay Hebbar

@Pran_Ker

6 days ago

There is a hackathon being held at @AGIHouseSF on some of my work on test time training and self-improving agents.

2

18

0

4

18K

Pran_Ker retweeted

Prannay Hebbar

@Pran_Ker

3 days ago

you can train lots of insanely cool things you can train it to play a Minecraft sim, it collects resources and crafts items @puffer_ai is just insanely cool

4

147

6

44

366K

Prannay Hebbar

@Pran_Ker

1 day ago

@willccbb will be missed

0

209

Prannay Hebbar

@Pran_Ker

2 days ago

Incredible to see @bcherny repost our work!

Boris Cherny

@bcherny

3 days ago

Seeing a number of benchmarks showing Opus is the best model for long-running work. Five tips for running Opus autonomously for hours/days: 1. Use auto mode for permissions, so Claude doesn’t ask for approval 2. Use dynamic workflows, to have Claude orchestrate hundreds/thousands of agents to get a task done 3. Use /goal or /loop, to nudge Claude to keep going until it’s done 4. Use Claude Code in the cloud, so you can close your laptop (easiest way is the desktop or mobile app) 5. Make sure Claude has a way to self-verify its work end to end: Claude in Chrome browser extension for web, iOS/Android sim MCP for mobile, a way to start the full web server or service for backend work

312

3K

275

4K

628K

0

1

0

109

Who to follow

Rahul

@im__rahulg

23 | Software Engineer

Intelligence that can be shared cant be scaled. @ Canopy AI Prev @KukuFMOfficial . @eloeloapp . GX 18 Ex-justVouchclub (acq.)

Prannay Hebbar

@Pran_Ker

2 days ago

@jsuarez @puffer_ai Amazing, played around with the drone env. Would love to mess around with the new envs

0

2

0

99

Prannay Hebbar

@Pran_Ker

3 days ago

you can train lots of insanely cool things you can train it to play a Minecraft sim, it collects resources and crafts items @puffer_ai is just insanely cool

Angus (dirtman)

@dirtman

3 days ago

reinforcenment learning is so cool crazy you can get it to do this in <60s thank you @puffer_ai

12

313

10

156

403K

4

147

6

44

366K

Prannay Hebbar

@Pran_Ker

3 days ago

Good weekend read: https://t.co/k5ZIKCQZUC

0

4

0

2

244

Prannay Hebbar

@Pran_Ker

3 days ago

@klyap_ Haha yeah :)

0

1

0

18

Prannay Hebbar

@Pran_Ker

4 days ago

@klyap_ Only Svgs

1

0

28

Prannay Hebbar

@Pran_Ker

5 days ago

@kokoxsu @novaholdings incredible stuff

0

115

Pran_Ker retweeted

Rishi Desai

@rishi_desai2

5 days ago

Can coding agents stay coherent over a 1 billion token budget? Can they build Slack from scratch? Rewrite a JAX codebase in PyTorch? Build a C compiler in Rust? Enter SWE-Marathon: a benchmark for autonomous long-horizon software work.

rishi_desai2's tweet photo. Can coding agents stay coherent over a 1 billion token budget?

Can they build Slack from scratch?
Rewrite a JAX codebase in PyTorch?
Build a C compiler in Rust?

Enter SWE-Marathon: a benchmark for autonomous long-horizon software work. https://t.co/K97VHyLvIX

50

676

66

283

786K

Prannay Hebbar

@Pran_Ker

6 days ago

Credit goes to the whole Hexo Team

0

1

0

112

Prannay Hebbar

@Pran_Ker

6 days ago

There is a hackathon being held at @AGIHouseSF on some of my work on test time training and self-improving agents.

2

18

0

4

18K

Prannay Hebbar

@Pran_Ker

6 days ago

Dm for access: https://t.co/nAhswjLhtq

0

1

0

102

Prannay Hebbar

@Pran_Ker

8 days ago

this isn’t talked about enough cause a lot of people are hoping the big labs will solve this and it will be available to them downstream. in my opinion there will be a lot of different styles of continual learning (in ttt, opd, etc). different problems would require different styles implemented.

0

3

0

1

112

Prannay Hebbar

@Pran_Ker

8 days ago

@_WEEXIAO 👀

0

1

0

102

Pran_Ker retweeted

llm_enjoyer

@LLMenjoyer

9 days ago

video of my training runs

4

278

18

73

30K

Prannay Hebbar

@Pran_Ker

10 days ago

@QuantumArjun @rronak_ @MichaelElabd insanely good team

0

17

Prannay Hebbar

@Pran_Ker

10 days ago

elegant solution

Zihan "Zenus" Wang

@wzenus

3 months ago

In Agent RL, models suffer from Template Collapse. They generate vast, diverse outputs (High Entropy) that lose all meaningful connection to the input prompt (Low Mutual Information). In other words, agent learn different ways to say nothing. 🚀 Introducing RAGEN-v2 -- Here's how we define and fix such silent failure modes in Agent RL. 🧵

13

270

60

216

187K

0

1

0

156

Prannay Hebbar

@Pran_Ker

10 days ago

done. cool bit: interpolating between RL checkpoints souping (didn't know this was a thing before). Also they show you can extrapolate beyond what training ever reached. I was hoping they had some results on the divergence between the souped checkpoints and the actual ones.

0

25

Prannay Hebbar

@Pran_Ker

12 days ago

Wow, huge if true going to be my weekend read

Kunhao Zheng @KunhaoZ

13 days ago

🧵 For 2 RL checkpoints trained differently, you can just weight extrapolate them and it works! Bonus: these extrapolated checkpoints are complementary policies -> Get exploration and diversity for free -> Better inference scaling when ensembling Paper: https://t.co/zU0LH0TOdm

3

122

30

76

13K

1

2

0

144

Prannay Hebbar

@Pran_Ker

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users