Yulia Rubanova @YuliaRubanova - Twitter Profile

11 days ago

One year later with Omni and this test can pass. I saw it getting pretty close, so I tweaked the prompt: > A video of a man counting to 10 on his fingers, show the number in the corner. A new number every 1s, no dialogue other than the numbers he says. He uses two hands for numbers bigger than 5. - the model does 1 to 5 consistently well - struggles more when two hands are used, usually on 7 and 8 - if you ask it to count faster, errors increase - it keeps a good cadence

15

172

11

40

40K

YuliaRubanova retweeted

Thomas Kipf

@tkipf

8 days ago

Genuinely amazed by how many generalist visual capabilities one can squeeze out of this model

1

61

5

11

8K

Yulia Rubanova @YuliaRubanova

13 days ago

A few days post-I/O, people are starting to see what Gemini Omni is capable of. Being a part of this project from the early days, it’s been amazing to watch native multimodality open up a new space of possibilities for video generation. Give it a try.

Google DeepMind @GoogleDeepMind

18 days ago

We’re dropping Gemini Omni: our first step towards a model that can create anything from anything - starting with video. It combines Gemini’s intelligence with our generative media systems - representing a leap forward in world understanding, multimodality, and editing 🧵

413

8K

1K

2K

1M

0

5

0

727

YuliaRubanova retweeted

Miko

@Mho_23

17 days ago

not sure why nobody is talking about this but Google Omni is insane at video editing Original Video (left) vs Omni Edited Video (right) everyone is comparing it to Seedance and missing the point completely. Seedance is for generating videos from scratch. Google Omni is for editing videos that already exist. which are two completely different use cases this is like when Nano Banana 1 first came out and nobody realized how big it was going to be. this is the first AI that can actually properly edit videos.. i've generated a few hundred videos with this model and it can do literally any type of edit you can think of. changing voices, swapping characters, removing watermarks, adding captions, transitions, pop ups, whatever. if you can describe the edit you want it can do it this completely crushes every other model on the market when it comes to video editing. nothing else even comes close right now and this is just the flash model. imagine what the pro version is going to be able to do when it drops in a couple months this should have way more hype than it's getting..

40

413

35

349

28K

Who to follow

Ricky T. Q. Chen

@RickyTQChen

Research Scientist. Meta. I build simplified abstractions of the world through the lens of dynamics and flows.

Arnaud Doucet

@ArnaudDoucet1

Senior Staff Research Scientist @GoogleDeepMind. Previously @UniofOxford.

Emiel Hoogeboom

@emiel_hoogeboom

MTS at Anthropic. Previously Google Deepmind, PhD with @wellingmax at University of Amsterdam, Qualcomm Internship.

YuliaRubanova retweeted

MBZ @babaeizadeh

18 days ago

The likeness preservation and natural realism coming out of Gemini #Omni Flash is absolutely unreal 🤯 Huge shoutout to the insanely talented team for pushing these boundaries. The team absolutely cooked. 👨‍🍳🔥 If Flash is this good... imagine what Pro is about to unleash. 👀✨

2

63

13

3

3K

YuliaRubanova retweeted

Shlomi Fruchter

@shlomifruchter

18 days ago

We sat down with @OfficialLoganK @nbrichtova @doomie @gbarthmaron to talk about Gemini Omni Flash. It was pretty wild.

7

203

20

45

41K

YuliaRubanova retweeted

Jay Whang @jaywhang_

18 days ago

Super excited to see Gemini Omni finally out in the world! Having been part of this project since its inception, I've seen how its native multimodal capabilities can redefine what's possible. We're truly entering the "Nano Banana era" for video generation. Give it a try!

4

61

7

6

4K

Yulia Rubanova @YuliaRubanova

about 2 months ago

Excited to head to ICLR 2026 in Rio de Janeiro 🇧🇷! Happy to chat about video generation, controllability and world models!

1

36

1

9

2K

Yulia Rubanova @YuliaRubanova

about 2 months ago

We released Veo 3.1 Ingredients more than 6 months ago, and it is still ranking on top!

Design Arena

@Designarena

about 2 months ago

BREAKING: Veo 3.1 Fast and Veo 3.1 by @GoogleDeepMind are in 1st and 2nd place on Multi-Image to Video Arena These models can successfully reference multiple input images to create a video that users love At an average generation time of 48 seconds, they are also the two fastest video generation models Huge congrats to the @GoogleDeepMind team for this achievement!

Designarena's tweet photo. BREAKING: Veo 3.1 Fast and Veo 3.1 by @GoogleDeepMind are in 1st and 2nd place on Multi-Image to Video Arena

These models can successfully reference multiple input images to create a video that users love

At an average generation time of 48 seconds, they are also the two fastest video generation models

Huge congrats to the @GoogleDeepMind team for this achievement!

6

215

21

29

41K

0

7

0

635

Yulia Rubanova @YuliaRubanova

3 months ago

@jakecsnell @Cambridge_Uni @Cambridge_Eng Big congrats, Jake! Welcome to the UK 🇬🇧

0

1

0

114

Yulia Rubanova @YuliaRubanova

5 months ago

New version Ingredients to Video is out! Now with a portrait mode, better storytelling and consistency -- now available directly on Youtube Shorts. It is the most rewarding experience to be part of this team and put this into the hands of real users.

Google DeepMind @GoogleDeepMind

5 months ago

We’re updating Veo 3.1 Ingredients to Video to help create more expressive and dynamic clips, produce better visual consistency and more. 📽️ Here’s what’s new 🧵

49

880

123

156

439K

0

8

0

436

Yulia Rubanova @YuliaRubanova

6 months ago

Veo is officially a world simulator for robotics! 🤖🎥 We used action-conditioned Veo to evaluate robotics policies entirely in a generated video. This is a game-changer for scalable, safe robotics testing. Loved working with the team on this!

Anirudha Majumdar

@Majumdar_Ani

6 months ago

Generalist robots need a generalist evaluator. But how do you test safety without breaking things? 💥 🌎 Introducing our new work from @GoogleDeepMind: Evaluating Gemini Robotics Policies in a Veo World Simulator https://t.co/ZjvpYXFddZ 🧵👇

27

580

90

297

237K

0

15

2

0

1K

YuliaRubanova retweeted

Thomas Kipf

@tkipf

6 months ago

So excited to finally talk about this work! Veo is a surprisingly strong world simulator. We fine-tuned Veo on action-conditioned, multi-view robotics data. Key result: running a policy in the world model is strongly correlated with real-world results. A few important take-aways: 1) Veo Robotics models real-world physics and robot interactions 2) The base model's world knowledge is retained after fine-tuning and can model OOD scenarios not seen in the robotics data 3) The world model can be used to score task success or failure for a given policy 4) This proves useful for predictive red teaming: simulate dangerous or rare scenarios that would be difficult or irresponsible to execute on the real robot, and judge its performance I couldn't be more excited about where generalist video models are headed.

15

225

29

97

44K

YuliaRubanova retweeted

Aäron van den Oord

@avdnoord

7 months ago

Our second Nano Banana in three months 🚢 🚢. Super proud of the team!! Looking forward to seeing what you'll create with it. Any feedback welcome!

1

72

10

0

11K

Yulia Rubanova @YuliaRubanova

7 months ago

Try out Veo Ingredients -- now on Gemini App!

Google Gemini

@GeminiApp

7 months ago

We’re back with another update to Veo 3.1: Rolling out now on mobile and desktop, you can upload multiple reference images alongside your video prompts, to create entirely new worlds and more nuanced videos that are true to your vision.

191

3K

366

802

10M

0

4

1

0

811

Yulia Rubanova @YuliaRubanova

7 months ago

@xf1280 @techreview Congratulations, Fei! 🎉 Well deserved

0

66

Yulia Rubanova @YuliaRubanova

8 months ago

Excited to share a huge update Veo Ingredients! Now your characters can speak🎤 and sing🎶 Seriously, this opens up a whole new level of storytelling. Try it out on Flow: https://t.co/rxBBBimHhf #VeoIngredients #veo3_1

Google DeepMind @GoogleDeepMind

8 months ago

🖼️ Ingredients to video Give multiple reference images with different people and objects, and watch how Veo integrates these into a fully-formed scene - complete with sound.

2

133

10

15

28K

0

4

0

499

Yulia Rubanova @YuliaRubanova

8 months ago

Veo 3.1 got a major quality upgrade 👇 Try it out!

Arena.ai

@arena

8 months ago

🚨🎬 Big news from Video Arena! @GoogleDeepMind’s latest Veo 3.1 now ranks #1 in both Text-to-Video and Image-to-Video leaderboards. 🏆 This is a +30-point leap from Veo 3.0 → 3.1, making it the first model to break 1400 in Video Arena history! Huge congrats to the @GoogleDeepMind team for pushing the frontier of video generation forward! More details in the thread 🧵

arena's tweet photo. 🚨🎬 Big news from Video Arena!

@GoogleDeepMind’s latest Veo 3.1 now ranks #1 in both Text-to-Video and Image-to-Video leaderboards. 🏆

This is a +30-point leap from Veo 3.0 → 3.1, making it the first model to break 1400 in Video Arena history!

Huge congrats to the @GoogleDeepMind team for pushing the frontier of video generation forward!

More details in the thread 🧵

28

554

72

90

462K

0

8

0

570

Yulia Rubanova @YuliaRubanova

8 months ago

What an incredible night! Thank you @corl_conf for shaking things up and inviting both a K-pop band and a Korean traditional music band to the CoRL Banquet. It made the evening truly special.

Sourav Garg @sourav_garg_

8 months ago

K-Pop performance @corl_conf banquet by @geeniusofficial #CoRL2025

0

30

3

8

6K

0

11

1

0

2K

Yulia Rubanova @YuliaRubanova

8 months ago

Veo 3 has powerful visual reasoning capabilities out of the box 👾 It can solve puzzles, understand optical illusions, reason about gravity, lighting, color mixing and more! Veo just keeps on giving 🍏 Check out a detailed investigation by our Deepmind colleagues below 👇

Paul Vicol @PaulVicol

8 months ago

🔥Veo 3 has emergent zero-shot learning and reasoning capabilities! This multitalented model can do a huge range of interesting tasks. It understands physical properties, can manipulate objects, and can even reason. Check out more examples in this thread!

4

169

24

70

31K

1

8

0

1

824

Yulia Rubanova

@YuliaRubanova

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users