Dataset curation for language models has long relied on brittle, hand-crafted rules. It's time for a more principled, automated approach.
Enter DataRater: a meta-learning framework that learns to value data based on downstream training efficiency. Great summary by Luisa below ๐
Excited to share our new paper, "DataRater: Meta-Learned Dataset Curation"!
We explore a fundamental question: How can we *automatically* learn which data is most valuable for training foundation models?
Paper: https://t.co/N2ozU2RXWb to appear @NeurIPSConf
Thread ๐
Excited to announce that our work on โDiscovering state-of-the-art RL algorithmsโ is finally published in @Nature! In this work, we meta-learned RL algorithms at scale.
Paper: https://t.co/3V4TmPTWm4
Blog: https://t.co/G65ReK2iMs
See thread ๐