your spotify cache is bigger than our largest AI model.
Bonsai: 1-bit weights. 1.7B to 8B params. 14x compression vs bf16. 8x faster on edge. 256 MB to 1.2GB. Based on Qwen 3.
we just came out of stealth. intelligence belongs at the edge and we're going to put it there.
Apache 2.0.
we compressed intelligence. more coming. @PrismML
Today, we are emerging from stealth and launching PrismML, an AI lab with Caltech origins that is centered on building the most concentrated form of intelligence.
At PrismML, we believe that the next major leaps in AI will be driven by order-of-magnitude improvements in intelligence density, not just sheer parameter count.
Our first proof point is the 1-bit Bonsai 8B, a 1-bit weight model that fits into 1.15 GBs of memory and delivers over 10x the intelligence density of its full-precision counterparts. It is 14x smaller, 8x faster, and 5x more energy efficient on edge hardware while remaining competitive with other models in its parameter-class.
We are open-sourcing the model under Apache 2.0 license, along with Bonsai 4B and 1.7B models.
When advanced models become small, fast, and efficient enough to run locally, the design space for AI changes immediately. We believe in a future of on-device agents, real-time robotics, offline intelligence and entirely new products that were previously impossible.
We are excited to share our vision with you and keep working in the future to push the frontier of intelligence to the edge.
Building momentum at Marin! Upgrading from Dense -> 129B parameter MoEs -> architecture improvements -> optimizer improvements gives our pretraining recipe an estimated 6x cumulative learning speedup, accounting for MFU. Includes community contributions. https://t.co/5dPB9uBiSp
@varunneal@CevherLIONS the best way isnt clear yet. i usually just say well grads/momentum and hvps are going into a system that should learn to whiten both. then we just precond the momentum.
@wen_kaiyue Weโve clearly over biased ourselves towards Adam in the wild and in the modded nano GPT speed run weโve over biased ourselves towards muon.
You can now train 120B+ parameter models locally on a laptop! ๐ฅ
We collabed with NVIDIA and Microsoft to bring LLM training on the 128GB unified memory RTX Spark laptop!
@norxornor I bias more towards sgd at the start and then allow for more dynamic at the end. If I know I'm going to have a lot of noise I will also adjust eps.
Bug fix! Bonsai Image generations on local MacBook MLX will be even better quality.
Turns out how you pad text matters ๐ try it out! https://t.co/vJFTG18oNf