📣 The OSCAR Project and @DFKI are happy to announce the release of Colossal OSCAR 1.0 📚, which is now available on the @huggingface Hub 🤗 at https://t.co/EbMtcrFtrt
Colossal OSCAR 1.0 was put together by @pjox13 as part of the @OpenGPTX collaboration.
Announcing mOSCAR, multilingual interleaved text-image corpus as part of @oscarnlp project.
Paper: https://t.co/1hhnyYCyI3
Dataset: https://t.co/hiEUJ1Q3iJ
Doc: https://t.co/KsnT5wVee2
1/6
Announcing mOSCAR, multilingual interleaved text-image corpus as part of @oscarnlp project.
Paper: https://t.co/1hhnyYCyI3
Dataset: https://t.co/hiEUJ1Q3iJ
Doc: https://t.co/KsnT5wVee2
1/6
👀 We're working on many new features for you, currently we're focusing on improving language identification, so if you want to help or contribute, please join our community 💬 on Discord: https://t.co/toLKAPje4E
📣 The OSCAR Project and @DFKI are happy to announce the release of Colossal OSCAR 1.0 📚, which is now available on the @huggingface Hub 🤗 at https://t.co/EbMtcrFtrt
Colossal OSCAR 1.0 was put together by @pjox13 as part of the @OpenGPTX collaboration.
Everybody is talking about @OpenAI - we should talk more about cool projects like @silo_AI, @oscarnlp (for multilingual #LLMs), @bazril's work on high-performance language tech, or @OpenWebSearchEU. #Metaforum has been a great opportunity to learn more.
👀 We're working on many new features for you, currently we're focusing on improving language identification, so if you want to help or contribute, please join our community 💬 on Discord: https://t.co/toLKAPje4E