randy @rendope - Twitter Profile

online sft with wsd lr scheduler seems like a cool idea. you’d decay the lr whenever you’re ready to serve a new model. probably useful in some recommender system model.

0

42

randy @rendope

2 months ago

how cruel are we to assign uniform rewards to an entire society of multi-agent RL trajectories when some agents are doing good work under bad supervision

0

48

randy @rendope

3 months ago

when the model’s context can no longer be easily reverted, we will start seeing a qualitative shift in how the world curates content for models. by this time, dedicated apps for models to consume content will matter way more than human apps like reddit or youtube.

0

26

randy @rendope

3 months ago

probably a late prediction because it’s obvious but better lock it in late than never:

1

0

36

randy @rendope

3 months ago

continual learning, online RL, recursive self improvement, wide use of value functions are all the same thing and will be cracked simultaneously

1

0

32

randy @rendope

3 months ago

I find the Diamond Sutra useful in guiding my agents. You have to help them cut through illusions. Unask the questions they pose. Break them free of false assumptions. It is all Mu.

rendope's tweet photo. I find the Diamond Sutra useful in guiding my agents. You have to help them cut through illusions. Unask the questions they pose. Break them free of false assumptions. It is all Mu. https://t.co/GmkzdXgEaX

0

32

randy @rendope

3 months ago

coldfusion is probably the most midwit youtube channel. it provides a pulse towards general public sentiment.

0

37

randy @rendope

5 months ago

how it started: gotta make sure to kick off a training run before bed how it's going: gotta make sure to kick off an agent working on an ambitious +8hr code change before bed

0

52

randy @rendope

5 months ago

pretty bullish on unconventional career paths like this these days. https://t.co/zL3tCzbYUD

0

94

randy @rendope

6 months ago

@typedfemale

0

1

0

51

randy @rendope

6 months ago

rather than next token predictors, I now find it helpful to think of models today as moths trying desperately to fly into the light of that sweet sweet reward

0

52

randy @rendope

7 months ago

@karpathy every day we take one step closer towards evangelion

0

1

0

78

randy

@rendope

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users