Imagine every pixel on your screen, streamed live directly from a model. No HTML, no layout engine, no code. Just exactly what you want to see.
@eddiejiao_obj, @drewocarr and I built a prototype to see how this could actually work, and set out to make it real. We're calling it Flipbook. (1/5)
I can't stress enough how useful this trick has been for me in all these years
It reduces GPU memory by N equal the number of losses, at literally no cost (same speed, exactly same results down to the last decimal digit)
For example ... [1/2]
A thread of thoughts on radiance fields, from my keynote at 3DV:
Radiance fields have had 3 distinct generations. First was NeRF: just posenc and a tiny MLP. This was slow to train but worked really well, and it was unusually compressed --- The NeRF was smaller than the images.
Introducing the File Search Tool in the Gemini API, our hosted RAG solution with free storage and free query time embeddings πΎ
We are super excited about this new approach and think it will dramatically simplify the path to context aware AI systems, more details in π§΅