How much do you think it costs to pre-train BERT Base on C4 to the point where it reaches an average score of 83.4 when fine-tuned on the GLUE tasks? You know what's coming soon from @MosaicML...
Just bumping this one more time: @MosaicML is hiring research interns this summer. If you want to study the science behind training large models and bring those capabilities within reach for everyone, apply now! https://t.co/WkUack6YdM
Access to diverse partners is crucial when training robust cooperators or evaluating ad-hoc coordination. In our top 25% #iclr2023 paper, we tackle the challenge of generating diverse cooperative policies and expose the issue of "sabotages" affecting simpler methods.
A ๐งต!