Argmax now runs on Google Tensor TPU, the first-ever SDK to harness this edge inference accelerator!
Tensor TPU enabled us to deploy billion-scale transformers reliably on Pixel phones without impacting battery life or resource contention with traditional workloads.
Q: How do I add support for Tensor TPUs in my app?
A: The next release of Argmax Pro SDK Kotlin will generate TPU-optimized Parakeet models.
Q: What are the code changes I need to make?
A: Installation and Google Play integration remain the same as before. No code changes required. Video tutorial for Installation can be found here: https://t.co/nHQuhomYht
Q: How do I sign up?
A: Go to https://t.co/JKRKosoBN4 and start your 14-day trial!
Argmax now runs on Google Tensor TPU, the first-ever SDK to harness this edge inference accelerator!
Tensor TPU enabled us to deploy billion-scale transformers reliably on Pixel phones without impacting battery life or resource contention with traditional workloads.
Argmax OSS Swift hits 1.0!
- Swift 6 compatibility
- Now dependency-free to streamline Enterprise procurement, minimize conflicts and security risks
- Renames the package to argmax-oss-swift, consolidating WhisperKit, SpeakerKit and TTSKit
- Battle-tested in production on millions of devices for over 26 months
Argmax OSS (formerly WhisperKit) just crossed 10M monthly on @huggingface!
- First ever Apple Silicon-only model to cross 10M
- Usage grew 10x in ~100 days
- Free, MIT Open-source and pure Swift
We are thrilled that WhisperKit reached 1 million monthly on @huggingface!
- First ever Apple Silicon-only model to reach 1M
- Usage grew 10x in 2025
- Free, MIT open-source and pure-Swift
Google just published a blog post on the real-world commercial adoption of their new on-device inference runtime, LiteRT!
Heidi Health and Argmax are highlighted as the prime example of running medical transcription on Android devices, improving reliability, speed, and privacy while maintaining parity in accuracy with the cloud alternative.
Argmax Pro SDK is first-in-market to run frontier speech models on Android devices while leveraging the previously untapped accelerators such as Edge TPUs (eTPU) on Google Pixel phones.
WhisperKit
WhisperKit is now 26 months old, and it recently crossed 6,000,000 monthly downloads: https://t.co/olc6RM3j2C
It remains the most reliable implementation of Whisper on Apple Silicon, running Large v3 Turbo in real-time on iOS used by developers and Enterprises in high-stakes industries such as healthcare!
Throwback to our original launch: https://t.co/cqRqycriRH
WhisperKit is now Argmax OSS!
As part of our continued commitment to open-source, we are releasing part of Argmax Pro SDK, extending WhisperKit beyond speech-to-text.
Argmax OSS now includes:
- SpeakerKit: Add speaker info to your transcripts with the fastest implementation of Pyannote.
- WhisperKit: One of the most popular frameworks to deploy Whisper with 6+ million monthly downloads.
- TTSKit: Run Qwen3-TTS with real-time generation and playback for voice agents and content readers.
SpeakerKit
We launched SpeakerKit last year as part of Argmax Pro SDK. We published a research paper to demonstrate that SpeakerKit is the fastest Pyannote implementation, with verified accuracy parity across 13 datasets: https://t.co/RfTIVHsLsm
SpeakerKit Pro has reached 500,000+ monthly on Hugging Face, hardened in production at scale for the past 13 months. Now that Argmax Pro SDK 2 brings real-time speaker diarization with Sortformer, we are open-sourcing the previous generation of SpeakerKit Pro with Pyannote! Check out the docs to get started: https://t.co/XYVmTjxDLd
This is what context does to your speech-to-text system!
Our new paper studies the impact of contextual information on the accuracy of leading open-source and proprietary systems.
Introducing Real-time Transcription with Nvidia Parakeet on Android!
Argmax Pro now supports Android with our brand-new Kotlin-first SDK, bringing Argmax's top-tier accuracy and real-time performance from Apple to Android.
Enjoying seamless NPU and GPU acceleration by @GoogleAI LiteRT across several major hardware vendors.
Links to blog and test app are in the replies.
Introducing Real-time Transcription with Nvidia Parakeet on Android!
Argmax Pro now supports Android with our brand-new Kotlin-first SDK, bringing Argmax's top-tier accuracy and real-time performance from Apple to Android.
Enjoying seamless NPU and GPU acceleration by @GoogleAI LiteRT across several major hardware vendors.
Links to blog and test app are in the replies.
Our blog post explains the technical breakthroughs and how Argmax became the first SDK to build with LiteRT in collaboration with @GoogleAI https://t.co/JG6xKDdVMA