We launched Katana ML https://t.co/6ClyBuAp9m in 2018 and now it is time to update the website, to explain where we are now and what we do with #MachineLearning, #MLOps, and #opensource
๐๐๐
We have a new website - https://t.co/gBCV3mzU7r
It explains what we do with ML in a simple and straightforward way. It is featuring our open source product Skipper, we are using it to run #MLOps.
#MachineLearning#MLOps
@Prince_Canuma@GoogleDeepMind Tested mlx-community/Ministral-3-14B-Instruct-2512-8bit against mlx-community/gemma-4-12B-it-8bit for structured output, Ministral wins by large margin
Meet Gemma 4 12B!
A unified, encoder-free multimodal model designed to bring high-performance intelligence directly to your laptop, and released under an Apache 2.0 license.
Bridging the gap between edge efficiency and advanced reasoning. Here is whatโs new with Gemma 4 12B: ๐
Today we're shipping our biggest MLX-VLM release yet: v0.6.0
...and we are raising ๐ธ
This one's about turning your Apple devices into real local agent machines. From your desk to your pocket.
What's new:
โก Speculative decoding everywhere โ Gemma 4 EAGLE3 + DFlash, Qwen MTP, DeepSeek V4 MTP. Faster tokens, less waiting.
๐ค Agent-ready server โ native Anthropic /v1/messages API, stateful /v1/responses, tool calls, Codex context budgets. Plug Claude Code & Codex straight into local models.
๐๏ธ New models galore โ DeepSeek V4, ZAYA1-VL, MiniCPM-V 4.6, LFM2 MoE, Step-3.7 Flash, Laguna + more.
๐จ Image gen & editing โ FLUX.2 (base + klein), PrismML Bonsai.
๐ Audio in โ Qwen3 Omni, Gemma 4 audio, base64 chat audio.
๐งฎ TurboQuant KV cache โ RHT-correct fast paths for leaner memory.
๐ฆ Modular server, better metrics, cleaner streaming.
Run real agents on the hardware already in your hands.
Github: https://t.co/1T06ur6LU5
Building Agentic AI Pipelines for Document Analysis
Two steps. Fully local.
1๏ธโฃ Sparrow Parse extracts structured data from bonds table โ Ministral 3B 14B
2๏ธโฃ Sparrow Instructor analyzes portfolio risk โ Gemma 4 31B
Orchestrated with Prefect. No data leaves your machine.
YouTube: https://t.co/oIvvjoZ4mB
GitHub: https://t.co/JZFeXQGI85
Sparrow: https://t.co/Ln6FQaTukN
Building new Sparrow UI with Claude Code is going well. Implemented file upload component, added backend code with Next.js
Migrating to shadcn from Sparrow Gradio UI:
https://t.co/nvmBzwGFyD
Coding with Claude Code is like building drag&drop forms with Visual Code. Love this feeling. I always was lazy to type all this Python or JavaScript text lines
Building new UI for Sparrow with shadcn. HTML UI mockups are designed with Claude Design.
This will come as replacement for current Gradio based UI: https://t.co/ynvdzqCX2V