drubinstein @dsrubinstein - Twitter Profile

Pinned Tweet

over 1 year ago

Excited to finally share our progress in developing a reinforcement learning system to beat Pokémon Red. Our system successfully completes the game using a policy under 10M parameters, PPO, and a few novel techniques. Blog posted below

13

405

33

208

56K

drubinstein

@dsrubinstein

14 days ago

@deforestpeg Trust me. It works :)

0

1

0

69

drubinstein

@dsrubinstein

18 days ago

Tried to turn it into a skill https://t.co/U1GU5tXTog Remember to measure first, optimize second.

0

2

0

1

34

drubinstein

@dsrubinstein

18 days ago

Teaching my Claude sessions how to benchmark and profile has been a great help recently.

2

0

98

drubinstein

@dsrubinstein

18 days ago

My current use case is to profile speedruns for a tool I'm working on. Profiling has helped find inefficient navigation subroutines. It's great.

0

1

0

39

drubinstein

@dsrubinstein

about 1 month ago

@DanAdvantage Nothing to do with Reflection. This work is entirely my own.

1

0

20

drubinstein

@dsrubinstein

about 1 month ago

A few years ago, I wanted to find a way to maximize shade during my long runs in the summer. Recently, I decided to experiment with a solution. Presenting Shady Route Finder.

https://t.co/H6U3KspYqW

3

4

1

2

2K

drubinstein

@dsrubinstein

about 1 month ago

https://t.co/bFxtrDpTnq . Supports 8 cities, can tell you which side of the street to walk on, works on mobile, and even has a sunny route finder mode on desktop. I've tested some routes, but obviously not all. Any feedback would be great.

0

1

0

181

drubinstein

@dsrubinstein

about 2 months ago

@yacinelearning It's like 30% of the way there. Mostly working on a new harness

0

10

drubinstein

@dsrubinstein

3 months ago

Move over nvidia-smi, ibstat is my new best friend

2

0

255

drubinstein

@dsrubinstein

4 months ago

I dared him to try an AI-assisted native rewrite. As expected, 10k sps at best has now become 4M sps. Nice.

Dan Advantage

@DanAdvantage

4 months ago

i did start with a rudimentary implementation of pokemon stemming from a native rewrite of pokemon firered. the starting point i used gets around 4,000,000 steps per second as an rl env. here is the entire prompt (caution: long!!!):

3

19

0

3

12K

2

0

306

drubinstein

@dsrubinstein

4 months ago

Awesome! A year ago we announced our Series A. Today we’re announcing an amazing partnership with Shinsegae. Who knows what’ll come next?

Reflection @reflection_ai

4 months ago

Reflection is partnering with Shinsegae Group to build a 250-megawatt sovereign AI factory for the Republic of Korea. Open intelligence. Built on trust between allies. Owned by the nations that need it most. The future of sovereign AI. Read more in the @WSJ.

https://t.co/9o1WjozRZP

15

217

31

42

161K

1

14

0

379

drubinstein

@dsrubinstein

5 months ago

Underrated: Letting a coding agent run when you're in meetings.

0

2

0

161

drubinstein

@dsrubinstein

5 months ago

@kywch500 @jsuarez Read to learn more https://t.co/Db8Jwwx1f6

1

6

1

3

438

drubinstein

@dsrubinstein

5 months ago

Had some fun helping @kywch500 and @jsuarez simplify Pufferlib's 2048 env over the last couple of weeks. 2x better results with fewer observations, fewer rewards, and a new model architecture!

2

17

3

7K