some news: i quit MSFT last week after 3.5 years with the inflection/microsoft AI crew -- lots of memorable times. i'm taking some time to reconnect w/ old colleagues and friends and it's been great. if you are reading this and want to catch up, DM me!!
Excited to announce that we’ve raised $1.3B to build one of the largest clusters in the world and turbocharge the creation of Pi, your personal AI.
https://t.co/p5AfRXGPan
It’s a big week! We’ve raised $1.3 billion and are building the world’s largest AI cluster (22k H100s).
We’re grateful for our investors and new funding that will help us accelerate our mission to make personal AI available to every person in the world. https://t.co/l2MPlhgqVl
We have amazing results to announce! Inflection-1 is our new best-in-class LLM powering Pi, outperforming GPT-3.5, Llama and PaLM-540B on major benchmarks commonly used for comparing LLMs. https://t.co/Dhv9eZQsa6
I've ported @rewonfc's very deep VAE (https://t.co/0lQp9BQgcI) from PyTorch to JAX/Flax! Hope other JAX users find this SOTA VAE useful as a forkable baseline... https://t.co/rJno4lcLSO.
My post on SotA image generative models was released 🥳
Featured 7 notable recent papers with emphasis on:
- VD-VAE
- VAE + discriminator (e.g. VQGAN, DC-VAE)
- Diffusion models (e.g. DDPMv2)
Plus some notes on scaling (e.g. DALL-E) and evaluation.
https://t.co/uOV6jVwdmA
It is easy to write a program
but it is difficult to create a machine
that will read those lines.
(Was looking through my journal, found this GPT-3 generation conditioned on haikus)