AI compute and inference are increasingly $$$. How can we change the unit economics of AI to improve accessibility? It's been fun working with @prlnet to release the first model endpoint that simultaneously generates tokens **and** a digital asset that can subsidize inference! 🪙 Check it out, links below 🚀
Loved working on this project, super proud of it! Thank you @AwesomeBao, @bialjail, and @wgilpin0 for being such a great team that I learned so much from.
Excited to share our work at ICML!
Our ICML spotlight paper discovers universal redundancies in time series foundation models: the middle layers of many models can be removed without sacrificing performance 1/
We trained LoRA adapters of different ranks to understand training dynamics, finding that adapters for GSM8k live in a surprisingly vast, low-rank solution space.
This hints that some model skills are easy to learn, and training is more forgiving than we think. @hasith_v 1/6 🧵
We post-trained MedGemma to be SoTA in visual medicine ddx, outperforming Opus 4.6, Gemini 3.1 and GPT-5.4 while running at ~1/30th the cost. @getnolla Part 1 - improving visual reasoning 🧵1/6
@jxmnop This is cool, I've always been slightly uncomfortable with treating *everything* unlabeled as a negative.
I wonder if using LLMs to produce a ranking (even a somewhat noisy one) would be better than a binary classification. Perhaps we can weight each candidate according to its rank in the softmax?
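A rough sketch of what rank weighting in the softmax could look like, just to make the idea concrete. Everything here is an assumption for illustration: the function name, the `rank / n` weighting scheme, and the `strength` knob are hypothetical, not something from the original thread or any library. The idea is that unlabeled candidates the LLM ranks as likely relevant contribute less to the denominator, instead of all counting as full negatives.

```python
import math


def rank_weighted_softmax_loss(pos_score, neg_scores, neg_ranks,
                               temperature=1.0, strength=1.0):
    """Contrastive (InfoNCE-style) loss with rank-weighted negatives.

    pos_score:  similarity between the query and its labeled positive.
    neg_scores: similarities between the query and unlabeled candidates.
    neg_ranks:  hypothetical LLM-assigned ranks, 1 = most likely to be a
                hidden positive, len(neg_scores) = most clearly negative.
    strength:   0 recovers the usual loss (every candidate is a full
                negative); larger values down-weight top-ranked candidates.
    """
    n = len(neg_scores)
    # Weight in (0, 1]: the lowest-ranked candidate keeps weight 1.
    weights = [(rank / n) ** strength for rank in neg_ranks]

    pos_logit = pos_score / temperature
    # Multiplying exp(logit) by w is the same as adding log(w) to the logit.
    neg_logits = [s / temperature + math.log(w)
                  for s, w in zip(neg_scores, weights)]

    # Numerically stable log-sum-exp over the positive and all negatives.
    logits = [pos_logit] + neg_logits
    m = max(logits)
    log_denominator = m + math.log(sum(math.exp(l - m) for l in logits))
    return log_denominator - pos_logit


# Usage: three unlabeled candidates, the first ranked most likely relevant.
loss = rank_weighted_softmax_loss(2.0, [1.0, 0.5, -1.0], [1, 2, 3])
```

With `strength=0` this reduces to the standard softmax loss, so the weighting can be dialed in gradually if the LLM ranking turns out to be too noisy.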
How do time series foundation models forecast unseen dynamical systems? In new experiments, we find that small transformers learn to approximate transfer operators in-context. (1/N)
https://t.co/6YuLr8QuJD
Great to see high-quality software dev in comp bio. It still amazes me how much of computational biology is based on single-threaded processing of large .txt files with minimal application-specific optimization.
@karpathy I actually hacked nanoGPT some time ago to become a diffusion LLM. Results were pretty decent on Shakespeare with character-level tokenization.
Honestly was just surprised it even learned to spell words and pick up on basic grammar.
Link in reply
@materzynska @AIatMeta Very interested in diffusion models and social AI. Would love to talk with you. You can see more about me on my blog: https://t.co/BUGkLLEquN
@a16z @LiamFedus What are y'all's methods to verify what the LLMs are discovering? How do you make sure they're ‘understanding’ current physics correctly?
I have lots of thoughts on this as a physics student doing AI research if you want to chat
@khoomeik @periodiclabs @LiamFedus Very excited to see where Periodic will go next! Extremely bullish on trying to get tangible alpha from AI models in natural sciences--it really plays to my background of first doing physics research and then doing AI research
@CFGeek To be fair, I also think it will be hard to get it to work, and it might not work at all. But the negative result plus the RL env will leave us with things to learn from.
Cause I’m pretty confident that LLMs will be using internal reasoning techniques only a few years down the line.