We're excited to share Stable-Layers!
We train Qwen-Image-Layered further with RL for improved layerization,
using only feedback from a VLM — no paired supervision required!
Paper: https://t.co/WktmmXGNLh
Project Page: https://t.co/WnEV76afQp
We initially cared about local LLMs, but KV caches appear in more than text.
So we also investigated OCTOPUS for autoregressive video and audio transformers.
Joint work with @VikramVoleti, Simon Donné, @esx2ve
The first lab to open-source a solid pixel-space video diffusion model will be cited a bazillion times by everyone working on inverse problems esp. related to 3D/4D. With JiT/Simple(r) Diffusion the tech is mostly there, someone with more GPUs than me should make it happen (plz)
We’re hiring 3 researchers for the Stability AI 3D team — and with our new EA partnership, this is an absolutely massive opportunity.
If you’re passionate about 3D, graphics, AI models, or VLMs, consider applying below:
We need something full of interactive visualizations and hover documentation, like Distill.pub-style visualizations, for everything: math, physics, chemistry, etc.
LLMs can totally make epic interactive diagrams and visualizations at massive scale, we need entire textbooks like this.
Introducing our new work, "Foley Control: Aligning a Frozen Latent Text-to-Audio Model to Video"
It's a control model for Stable Audio to generate aligned audio from an input video.
Project Page: https://t.co/68yckLzNhR
Paper: https://t.co/MK3HFl10CY
🧵 @StabilityAI
Working on 3D, video, or image generation? We’re hiring at @StabilityAI — building next-gen creative AI tools with EA to reimagine world-building.
If you’ve been affected by recent layoffs (Meta/FAIR or elsewhere), DM me.
Around #ICCV2025? Let’s connect.
New paper & surprising result.
LLMs transmit traits to other models via hidden signals in data.
Datasets consisting only of 3-digit numbers can transmit a love for owls, or evil tendencies. 🧵
this guy's channel is so small with only a couple K views here and there
if you're interested in GPU programming and still a beginner, he's worth a look
(Simon Oz on yt)
Congratulations to @gintszilbalodis and the entire Flow film crew for the Academy Award win!
Flow is the manifestation of Blender’s mission, where a small independent team is able to create a story that moves audiences worldwide.
Thank you for the shout out! 🧡 #b3d
@tinotibaldo Short story: back in the day, as a self-taught game dev, I tried to code Uniball from scratch. It's basically 2D Rocket League. It took me a few months to nail the game's look and feel, and two years to fail miserably at the multiplayer physics. The rabbit hole goes insanely deep.
The post below (learnable lambda has to do with skip connections) reminded me of some ResNet and Transformer architecture-related lore that I thought would be fun to write up!
Blogpost: https://t.co/FkYCiTCUyL