Regina @reginazs - Twitter Profile

ReginaZs retweeted

about 1 month ago

1/ New from @ScaleAILabs: Rubrics (a.k.a. checklists) have become the default reward interface for RL on open-ended tasks without final verifiable answers. But most rubric RL still relies on static aggregation: fixed human weights over criteria, summed into one scalar reward. We show that this conflates what should matter in the final answer with what can actually teach the current policy. https://t.co/H5wTQ27ulb

utkarsh4430's tweet photo. 1/ New from @ScaleAILabs: Rubrics (a.k.a. checklists) have become the default reward interface for RL on open-ended tasks without final verifiable answers.

But most rubric RL still relies on static aggregation: fixed human weights over criteria, summed into one scalar reward.

We show that this conflates what should matter in the final answer with what can actually teach the current policy.

https://t.co/H5wTQ27ulb

2

74

21

53

9K

ReginaZs retweeted

bautis @bautizita

over 1 year ago

bautizita's tweet photo. https://t.co/yAHlD7Fm41

728

56K

6K

1K

2M

ReginaZs retweeted

ramiro @_odetosink

over 1 year ago

no puedo creer q un hijo de puta un día se levantó y dijo voy a crear la escena más graciosa de la historia

59

76K

9K

6K

2M

ReginaZs retweeted

✬ a @discokisser

over 1 year ago

experiencing one direction in real time actually does make me better than you

94

53K

10K

969

824K

Who to follow

Diego Piña

@diego_0897

Periodista Deportivo. Colaboré en @juanfutbol y en los Juegos Olímpicos en @marcaclaro. Actualmente escribo para @adevaldes y soy community leader de @CMLLNFT

Eugenio Tamés

@eugeniotames

✍🏼🎙️ Periodista deportivo // @somos_FOX // @ApuntesRabona // @EditorialPuskas // M.A. @USCAnnenberg

ReginaZs retweeted

over 1 year ago

when you're financially stable in a walkable city 3 drinks deep with the love of your life...that is the moment life gets better

687

296K

30K

22K

14M

ReginaZs retweeted

nik @dfnclesslou

over 1 year ago

I don’t think I really recognized how much of that teenage girl living inside of me still held on to one direction as a safety and a comfort until now .

53

65K

11K

2K

867K

ReginaZs retweeted