Shimon Vainer @esx2ve - Twitter Profile

esx2ve retweeted

13 days ago

We're excited to share Stable-Layers! We train Qwen-Image-Layered further with RL for improved layerization, using only feedback from a VLM — no paired supervision required! Paper: https://t.co/WktmmXGNLh Project Page: https://t.co/WnEV76afQp

CiaraRowles1's tweet photo. We're excited to share Stable-Layers!

We train Qwen-Image-Layered further with RL for improved layerization,
using only feedback from a VLM — no paired supervision required!

Paper: https://t.co/WktmmXGNLh
Project Page: https://t.co/WnEV76afQp https://t.co/3959SZljbr

12

273

51

224

18K

esx2ve retweeted

Mark Boss @markb_boss

26 days ago

We initially cared about local LLMs, but KV caches appear in more than text. So we also investigated OCTOPUS for autoregressive video and audio transformers. Joined work with @VikramVoleti, Simon Donné, @esx2ve

0

5

3

1

272

Shimon Vainer @esx2ve

27 days ago

@Eran_Efrat היי ערן, היכן אפשר לתרום?

1

2

0

1

1K

esx2ve retweeted

Mark Boss @markb_boss

3 months ago

Current hobby: vibecoding tiny tools that eliminate annoying tasks. I open-sourced them in case they’re useful to others.

1

11

1

2

643

Who to follow

Mark Boss

@markb_boss

I’m the Co-Head of 3D & Image at Stability AI with research interests in the intersection of machine learning and computer graphics

Mauro Comi

@mauro_ai

SR at Google, prev. SR at Google DeepMind || PhD student in Machine Learning and Computer Vision

Peter Hedman

@PeterHedman3

Researcher at Meta, working on radiance fields and view-synthesis. Pixels in, pixels out!

esx2ve retweeted

Kyle Sargent

@KyleSargentAI

6 months ago

The first lab to open-source a solid pixel-space video diffusion model will be cited a bazillion times by everyone working on inverse problems esp. related to 3D/4D. With JiT/Simple(r) Diffusion the tech is mostly there, someone with more GPUs than me should make it happen (plz)

14

170

7

87

21K

Shimon Vainer @esx2ve

7 months ago

We are hiring! ☺️

Mark Boss @markb_boss

7 months ago

We’re hiring 3 researchers for the Stability AI 3D team — and with our new EA partnership, this is an absolutely massive opportunity. If you’re passionate about 3D, graphics, AI models, or VLMs consider applying below:

1

7

4

3

1K

0

159

esx2ve retweeted

Simo Ryu

@cloneofsimo

8 months ago

We need something like full of interactive visualization and hover documentation, like distill pub like visualization, for everything. Math, physics, chemistry, etc etc LLMs can totally make epic interactive diagrams and visualizations at massive scale, we need entire textbooks like this.

1

48

2

12

5K

esx2ve retweeted

CiaraRowles @CiaraRowles1

8 months ago

Introducing our new work, "Foley Control: Aligning a Frozen Latent Text-to-Audio Model to Video" It's a control model for Stable Audio to generate aligned audio from an input video. Project Page: https://t.co/68yckLzNhR Paper: https://t.co/MK3HFl10CY 🧵 @StabilityAI

1

15

1

5

727

Shimon Vainer @esx2ve

8 months ago

@mariyaivasileva Heya - we're hiring for some really cool opportunities at #StabilityAI. DM me if sounds relevant

0

1

0

62

Shimon Vainer @esx2ve

8 months ago

Read more about the EA/SAI announcement here: https://t.co/NQTqT7bxBY

0

135

Shimon Vainer @esx2ve

8 months ago

Working on 3D, video, or image generation? We’re hiring at @StabilityAI — building next-gen creative AI tools with EA to reimagine world-building. If you’ve been affected by recent layoffs (Meta/FAIR or elsewhere), DM me. Around #ICCV2025? Let’s connect.

1

0

346

Shimon Vainer @esx2ve

11 months ago

@torchcompiled It's from https://t.co/oLaPgu7l5r. Also check out concurrent work https://t.co/EvITghn4U1

1

6

0

2

1K

esx2ve retweeted

Owain Evans

@OwainEvans_UK

11 months ago

New paper & surprising result. LLMs transmit traits to other models via hidden signals in data. Datasets consisting only of 3-digit numbers can transmit a love for owls, or evil tendencies. 🧵

OwainEvans_UK's tweet photo. New paper & surprising result.
LLMs transmit traits to other models via hidden signals in data.
Datasets consisting only of 3-digit numbers can transmit a love for owls, or evil tendencies. 🧵 https://t.co/ewIxfzXOe3

280

8K

1K

5K

2M

esx2ve retweeted

ludwig

@ludwigABAP

about 1 year ago

this guy's channel is so small with only a couple K views here and there if you're interested in GPU programming and still a beginner, he's worth a look (Simon Oz on yt)

ludwigABAP's tweet photo. this guy's channel is so small with only a couple K views here and there

if you're interested in GPU programming and still a beginner, he's worth a look

(Simon Oz on yt) https://t.co/yr8lmDCuIC

26

2K

137

2K

86K

esx2ve retweeted

Blender 🔶

@Blender

over 1 year ago

Congratulations to @gintszilbalodis and the entire Flow film crew for the Academy Award win! Flow is the manifestation of Blender’s mission, where a small independent team is able to create a story that moves audiences worldwide. Thank you for the shout out! 🧡 #b3d

Blender's tweet photo. Congratulations to @gintszilbalodis and the entire Flow film crew for the Academy Award win!

Flow is the manifestation of Blender’s mission, where a small independent team is able to create a story that moves audiences worldwide.

Thank you for the shout out! 🧡 #b3d https://t.co/cz6HogrODa

142

34K

5K

1K

665K

Shimon Vainer @esx2ve

over 1 year ago

@jon_barron Try generating a simple checkerboard - it's surprisingly impossible. Many questions...

0

2

0

258

Shimon Vainer @esx2ve

over 1 year ago

@tinotibaldo https://t.co/3UERalAMsO is a great place to start losing your sanity

0

1

28

Shimon Vainer @esx2ve

over 1 year ago

@tinotibaldo Short story: back in the day, as a self taught game-dev, I've tried to code Uniball from scratch. It's basically 2D rocket league. It took me a few months to nail the game look and feel. And two years to fail miserably at the multiplayer physics. The rabbithole goes insanely deep

1

0

49

esx2ve retweeted

Lucas Beyer (bl16)

@giffmana

over 1 year ago

The post below (learnable lambda has to do with skip connections) reminded me of some ResNet and Transformer architecture-related lore that I thought would be fun to write up! Blogpost: https://t.co/FkYCiTCUyL

giffmana's tweet photo. The post below (learnable lambda has to do with skip connections) reminded me of some ResNet and Transformer architecture-related lore that I thought would be fun to write up!

Blogpost: https://t.co/FkYCiTCUyL https://t.co/spBJuT2RwI

14

368

36

335

64K

Shimon Vainer

@esx2ve

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users