We’ll release our first trained model with Stability AI soon. If you want to start tinkering with RLHF now, we’re also helping develop TRLX: https://t.co/dzzR1pOv6e — the open source library for reinforcement learning with transformers.
Learn from the best people training large models 🧑💻
Join @Dahoas1 from @carperai@StabilityAI present trlX, one of the first open source RLHF implementations capable of fine-tuning large language models at scale.
👉 Register here: https://t.co/MHkN2cssbg
CarperAI is doing great work lowering the barrier for RLHF training (i.e. training ChatGPT-like models).
The latest release of their trlX library includes this great example, showing how to train RLHF models at scale with an open-source dataset!