Introducing Meta Omnilingual Automatic Speech Recognition (ASR), a suite of models providing ASR capabilities for over 1,600 languages, including 500 low-coverage languages never before served by any ASR system.
While most ASR systems focus on a limited set of languages that are well-represented on the internet, this release marks a major step toward building a truly universal transcription system.
🔗 Learn more: https://t.co/kwCgH9h2vA
Spoke to the doctor, and they said the symptoms expressed here are consistent with data deficiency syndrome.
Treatment is a high dose of real-world training data 🔱
Internet data fueled LLMs, but the next leap in AI will happen in the real world.
▸ Robots that clean
▸ Vehicles that see
▸ Agents that speak
Poseidon is a full-stack data layer for the supply and demand of AI training data.
Introducing Subnets in the Poseidon Litepaper ↓