The future of compute & AI is heterogeneous.
That future is being built by Callosum.
Today we launch with new breakthroughs made possible by heterogeneity, new scaling principles & a roadmap reimagining compute & AI infrastructure:
https://t.co/Yp1Qwt45tM
@CallosumAI is hiring in London!
We're building the infrastructure for heterogeneous compute - making many models, on many chips, behave as a single coherent, co-evolving system.
Five MTS roles open across the AI infra stack. Link below!
Great news that @UKSovereignAI has invested in Ineffable Intelligence, led by David Silver, joining @CallosumAI in the portfolio.
These are the technologies that will define the next decade (+more) of progress.
Congratulations to David and the team!
Our first investment?
💥 @CallosumAI - @DanAkarca & @achterbrain 💥
Proud to be backing Danyal, Jascha, and the team as they build one of the defining layers of next-gen AI systems. 🔥
We are at a unique moment in time for AI & compute: new accelerators and chips, HPC hardware, and new algorithms have each made strides, but we are not yet orchestrating them as a heterogeneous stack. That is what @CallosumAI is built to do, and today we are sharing our vision 🧵
Everything here is early evidence for a deeper thesis: as the problems we need to solve grow in difficulty, the systems that solve them must grow in diversity.
Heterogeneous systems - diverse models on diverse hardware, co-evolved end-to-end - unlock scaling territory that homogeneous systems cannot access.
In our next post, we formalise this into a theory - a new scaling principle.
The configuration space is vast and we have only just begun to explore it.
Welcome, Heterogeneous Intelligence.
https://t.co/t0KiP6q3eJ
Today we launched @CallosumAI.
We are building the infrastructure where heterogeneous chips & intelligence co-evolve to solve the world's hardest problems.
Today we present our first results.
Across four large problem spaces, we break SOTA and deliver orders-of-magnitude improvements in capabilities, cost and speed: 12× cheaper deep context. New web SOTA with open-source, 3× cheaper and faster. 2.4× cache speedups. 1,767× faster tool calling. This is the worst our infrastructure will ever be.
We do it by co-evolving heterogeneous chips and multi-agent intelligence - workflows aware of their hardware, models aware of their task graph, kernels aware of their output constraints. An Intelligent System.
https://t.co/t0KiP6q3eJ
None of these results came from a bigger model.
12× cheaper deep context. New web SOTA with open-source, 3× cheaper and faster. 2.4× cache speedups. 1,767× faster tool calling.
All from heterogeneity - mixed models, mixed chips, mixed scales - co-evolved end-to-end.