Catch @DarrylBarnhart speak @NVIDIAGTC today about our exciting work @LatentSpaceAI on scaling LLM with retrieval! 📚
There’ll be an open QA in 30 mins: https://t.co/C42yjpjEi1
Very cool to see Retrieval-Augmented Generation models being scaled up by @LatentSpaceAI and @awscloud. Excited to see what happens when RAG is pushed to GPT3 scale
We @LatentSpaceAI have been making exciting progress on improved neural scaling laws, accelerated by a great collaboration with Amazon!
This post is the first of many on the nuts and bolts of how we’re doing this: https://t.co/FnkWifFgOE
Did a fun panel for #AWSInnovate conference where we talk about our triumphs and missteps of starting an ML startup🤭
It's live today, check it out (need registration tho -- hey, I don't make the rules!): https://t.co/PxT7RgsdNn
TL;DR of the discussion:
💥how well a model scales is better than chasing sota
🧑🎨 combining language + vision is good for generating images (and text!)
👾 the code don’t lie — check other resources first before going deep on a research paper
We 💕 @wandb!
The multi-user reports let’s us have a collaborative research process which is really a game-changer, even more so now that we @LatentSpaceAI are all remote!
Listen to this discussion with Sarah Jane Hong @latentcodes where she talks about the research process, scaling laws, and future of large scale models on the latest @ai_untitled podcast!
Our second episode of Generally Intelligent features Latent Space co-founder @latentcodes! https://t.co/uVJeBHkPSB
We cover why using natural language prompts to render a scene is much harder than you’d expect, Sarah's own learnings and mistakes as a researcher, & much more!
The Generative Age: AI will do for content production what the internet did for distribution.
What happens when the cost to produce a movie like The Avengers goes from $350M to 30 cents?
https://t.co/NksEF9IAjQ
If you're curious for more details on the above, check out this piece I wrote on how we're collaborating remotely @LatentSpaceAI!
Being able to log ~all the things~ with @wandb and share interactive artifacts with our team has been important for our research process.
Laplacian pyramid-like visualization of activations (as described in MSG-GAN by @AnimeshKarnewar) has been useful to observe during training in order to qualitatively identify new behaviors in our models (especially subtle bugs 🐛)
Here's a snippet of how we're collaborating remotely on debugging generative models @LatentSpaceAI ! @wandb has been an invaluable tool for understanding and testing new approaches.
See how ML teams, like @LatentSpaceAI, working on cutting edge deep learning research use W&B Reports to collaborate remotely, debug models iteratively and to showcase results as mini research papers.
#machinelearning#deeplearning#100daysofmlcode
https://t.co/OkAbp5MEwz
Hey optimization folks!
Catch @DarrylBarnhart and @yaroslavvb in between #NeurIPS2019 workshops today and tomorrow, esp if you’re interested in second order methods! 🌟