I hate when I hear language model researchers complaining that they can't do any interesting research anymore because they don't have enough compute. Literally stfu. This attitude wrongly paints the world as infinitely less complex and fractal than it really is. Beautiful ideas do exist at scales <10B parameters. You can't pretrain the world's best model, who cares? Go and explore the unbounded space of ideas in the range of compute you have. The world is a microcosm and you can zoom in infinitely at any part of the scale to find interesting and useful things.
b10’s inference infrastructure powers the most sophisticated app companies serving specialized models at scale. there are dozens of these companies today and there will be thousands in the future. incredible momentum + world-class talent density.
and, it’s still so early…
they're really so good. @saltyph and i spend about 10 mins of every 1:1 raving about them and imagining all the magical experiences we are going to build together.
Today we’re welcoming the @parsedlabs team to Baseten!
With the Parsed team's RL and post-training expertise, Baseten is enabling companies to own their intelligence by unifying training and inference.
Read more about it from the Parsed founders ⤵️
https://t.co/RfT8d0dUgc