"The future of AI isn't measured in size. It's measured in relevance."
- Stop Making Models Bigger, Make Them Behave — Kobie Crawford, Snorkel
https://t.co/jA6XlFscpg via @YouTube @KobieWon
🚀 Exciting news from Navdyut AI Labs! We are thrilled to announce Navdyut-Asm-32k, an optimized, native Assamese tokenizer.
By preserving complex morphological boundaries natively, this model establishes a mathematically lean tokenization gateway specifically tailored for sovereign Assamese AI infrastructure.
Performance Highlights:
It is 46.8% more efficient than Google Gemma.
It processes Assamese text using 3.6x fewer tokens than Meta's Llama 3.
A massive thank you to @ai4bharat and our incredible regional supporters for helping us collect and clean the Assamese dataset, with our foundational data being sourced from the ai4bharat/IndicCorpV2 repository!
⏳ Stay tuned: The tokenizer codebase and Hugging Face links are coming soon!
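The post does not say how its efficiency figures were computed. A minimal sketch of one common way to compare tokenizers, by counting the tokens each produces on the same text, might look like the following. The two toy tokenizers, the function names, and the sample sentence are hypothetical stand-ins for illustration; they are not Navdyut-Asm-32k, Gemma, or Llama 3.

```python
from typing import Callable, Dict, Iterable, List

# A tokenizer here is any function mapping text to a list of tokens.
Tokenizer = Callable[[str], List[str]]


def total_tokens(tokenize: Tokenizer, texts: Iterable[str]) -> int:
    """Count the tokens a tokenizer produces over a collection of texts."""
    return sum(len(tokenize(text)) for text in texts)


def compare_tokenizers(
    baseline: Tokenizer, candidate: Tokenizer, texts: List[str]
) -> Dict[str, float]:
    """Report how many fewer tokens `candidate` needs than `baseline`."""
    base = total_tokens(baseline, texts)
    cand = total_tokens(candidate, texts)
    return {
        "baseline_tokens": base,
        "candidate_tokens": cand,
        # "N x fewer tokens" style figure: baseline count / candidate count.
        "ratio": base / cand,
        # "P% fewer tokens" style figure, relative to the baseline.
        "percent_fewer": 100.0 * (base - cand) / base,
    }


# Hypothetical stand-ins (assumptions, not real model tokenizers):
def byte_tokenizer(text: str) -> List[str]:
    """One token per UTF-8 byte, a worst case for non-Latin scripts."""
    return [format(b, "02x") for b in text.encode("utf-8")]


def word_tokenizer(text: str) -> List[str]:
    """One token per whitespace-separated word."""
    return text.split()


# Illustrative Assamese sentence (an assumption, not from the dataset).
sample = ["অসম ভাৰতৰ এখন ৰাজ্য"]
stats = compare_tokenizers(byte_tokenizer, word_tokenizer, sample)
print(stats)
```

Under this definition the two headline styles are related: a ratio of 3.6x corresponds to roughly 72% fewer tokens (1 - 1/3.6). Which definition the 46.8% figure uses is not stated in the post.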
@Navdyut_AI Special thanks to @himantabiswa @CMOfficeAssam for their incredible support and for implementing Navdyut AI at the APWD (Buildings) department. Very excited for future collaborations and partnerships towards bringing Assam one step closer to becoming the AI hub of India.
We at Navdyut AI strongly believe India’s AI infrastructure can’t win with Indic translation models alone. Here is a small contribution from our end to make AI more accessible.
Follow us for more; we will share the impact Navdyut has had very soon.
@OfficialINDIAai @himantabiswa @PMOIndia @PiyushGoyal
(I guess this is what honourable Piyush Goyal sir wanted from Indian startups)
What’s the difference between founders and lunatics?
Spoiler alert: ‘There isn’t any’
Nothing’s stopping the launch. Stay tuned. We are launching BIG! @Navdyut_AI
@BhavikaKapoor5 Congress IT cell. COVERT account. With an AI-generated picture of a woman. With AI-generated text. No real content. Only whining about secularism’s [you know what 😜] 😂🤣