Daniel Garibi @danielgaribi - Twitter Profile

Pinned Tweet

12 months ago

Thrilled to share that our paper TokenVerse received a Best Paper Award at #SIGGRAPH2025! 🎉

about 1 year ago

Excited to share that "TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space" got accepted to SIGGRAPH 2025! It tackles disentangling complex visual concepts from as little as a single image and re-composing concepts across multiple images into a coherent result. https://t.co/MMr8Ssv9cx #SIGGRAPH2025

2

102

32

39

15K

4

20

2

0

2K

DanielGaribi retweeted

Etai Sella @etai_sella

about 1 month ago

Even today, with powerful image editing models, making fine-grained structural changes to 3D shapes remains a major challenge. In our new #SIGGRAPH2026 paper, Prox-E, we use primitive-based abstraction to leverage VLMs for precise, reasoning-based 3D editing! 👇

4

79

21

31

13K

DanielGaribi retweeted

Gal Metzer @galmetz

about 1 month ago

Excited to share our work accepted to #SIGGRAPH2026 ! Video generation models struggle with something few talk about: their transformations don't evolve smoothly. You get long boring stretches... then a sudden semantic jump where everything "catches up" at once.

7

72

27

12

6K

DanielGaribi retweeted

Shelly Golan @Shelly_Golan1

about 2 months ago

1/7 When rewards conflict, what should RL post-training of diffusion models optimize? In visual generation, objectives are often in tension: Prompt adherence can conflict with source preservation. Photorealism can conflict with stylization. In our new paper, ParetoSlider, we introduce a multi-objective RL framework that trains a single diffusion model for continuous control over competing reward objectives 🧵

4

84

28

37

11K

Who to follow

Brian Gordon

@Brian_Gordon13

Research Intern @ Google | https://t.co/YF6cq9yyny @ Tel-Aviv University

Mark Boss

@markb_boss

I’m the Co-Head of 3D & Image at Stability AI with research interests in the intersection of machine learning and computer graphics

omer tov

@omer_tov

Generative AI @ Google DeepMind

DanielGaribi retweeted

Etai Sella @etai_sella

about 2 months ago

Do you like image editing? Don't like prompt engineering? Want to see what a giraffe-duck hybrid looks like? If you answered yes at least once, you may like our new #SIGGRAPH2026 paper: LooseRoPE, which presents a new, prompt-free way to edit images using simple visual cues 👇

9

175

36

108

15K

DanielGaribi retweeted

Inbar Gat @Gatinbar

about 2 months ago

3D editing has long relied on workarounds: per-asset optimization, 2D view propagation, or hacking frozen priors. The bitter lesson is the same one image editing already learned. Train a native model, end-to-end. Introducing ShapeUP, accepted to SIGGRAPH 2026 💫

5

245

39

247

19K

DanielGaribi retweeted

Alon Wolf | Researcher @AlonWolfy

2 months ago

[1/5] Is Text Enough for Control? 🐇 Text-driven video editing lets you describe *what* to change. But what about *how much*? We introduce Adaptive-Origin Guidance (AdaOr). A joint work with @DecartAI and @TelAvivUni 🧪 accepted to #SIGGRAPH2026.

6

66

24

16

11K

DanielGaribi retweeted

Daniel Cohen-Or @DanielCohenOr1

2 months ago

Many styles are not just textures or colors — they reshape geometry. Abstraction in Style (AiS) introduces an abstraction stage before stylization, enabling extreme abstraction-driven styles previously out of reach. https://t.co/DnPpdcodGc

DanielCohenOr1's tweet photo. Many styles are not just textures or colors — they reshape geometry.

Abstraction in Style (AiS) introduces an abstraction stage before stylization, enabling extreme abstraction-driven styles previously out of reach.

https://t.co/DnPpdcodGc https://t.co/5CApisAkBc

3

134

25

57

6K

DanielGaribi retweeted

Omer Dahary

@OmerDahary

2 months ago

Modern T2I DiTs are incredibly powerful, but have a serious diversity problem. We introduce a surprisingly simple and efficient inference-time fix (+2s for Flux-dev, +1s for SD3.5-Turbo). Excited to share our SIGGRAPH 2026 (conditional) paper: “On-the-fly Repulsion in the Contextual Space for Rich Diversity in Diffusion Transformers”

OmerDahary's tweet photo. Modern T2I DiTs are incredibly powerful, but have a serious diversity problem.
We introduce a surprisingly simple and efficient inference-time fix
(+2s for Flux-dev, +1s for SD3.5-Turbo).

Excited to share our SIGGRAPH 2026 (conditional) paper:
“On-the-fly Repulsion in the Contextual Space for Rich Diversity in Diffusion Transformers”

5

95

22

61

7K

DanielGaribi retweeted

Dana Cohen Bar @DanaCohenBar

3 months ago

DLSS 5 is all over the timeline, and for good reason. In my internship at @AIatMeta we had the same idea: use a video model as a learned second-stage renderer on top of game engines. In our paper RealMaster, we make synthetic video look real while preserving scene fidelity 👇

3

114

31

30

8K

DanielGaribi retweeted

Roni Paiss @Roni_Paiss

3 months ago

Check out DynaEdit - our new training free video editing method from @GoogleDeepMind . It allows enough flexibility for the edit to induce dynamic changes without deviating from the original video. Some cool examples in the video and on our website https://t.co/nBlS1gT7L5

1

23

6

4

2K

DanielGaribi retweeted

Daniel Cohen-Or @DanielCohenOr1

3 months ago

Cycle consistency is a powerful and cool idea. Here we use it to learn image decomposition: decompose an image into components, recombine them, and enforce consistency both ways. A cool training principle with surprisingly strong results.

DanielCohenOr1's tweet photo. Cycle consistency is a powerful and cool idea.

Here we use it to learn image decomposition: decompose an image into components, recombine them, and enforce consistency both ways.

A cool training principle with surprisingly strong results. https://t.co/wP6rKGy5vO

0

132

26

67

12K

DanielGaribi retweeted

Hila Chefer

@hila_chefer

3 months ago

New research from @bfl_ml 🥳 Meet Self-Flow: our self-supervised framework for image, audio, video & world models 🤖 https://t.co/AshY8IkSEe Do generative models really need DINO to learn strong representations? We propose teaching them directly via a joint framework instead 🧵

hila_chefer's tweet photo. New research from @bfl_ml 🥳
Meet Self-Flow: our self-supervised framework for image, audio, video & world models 🤖
https://t.co/AshY8IkSEe

Do generative models really need DINO to learn strong representations? We propose teaching them directly via a joint framework instead 🧵 https://t.co/wofHy9mmGT

11

282

61

109

67K

DanielGaribi retweeted

Saar Huberman

@HubermanSaar

4 months ago

SemanticMoments - Semantic motion similarity How do you find videos with similar motion? It’s harder than it sounds. Models like VideoMAE and V-JEPA encode motion, but their embeddings are dominated by appearance. So how do we build a compact embedding for motion similarity? Joint work with @kfir99 @OPatashnik @BenaimSagie @MokadyRon

7

181

29

134

27K

DanielGaribi retweeted

maxwell jones @maxwell54650346

5 months ago

Imagine you see a cool visual effect, and want to apply that visual effect to your own video - all while preserving motion and context? You can even edit the strength of the effect as well as the input video preservation! More coming soon... 😶‍🌫️😶‍🌫️ #VideoEditing #AI

0

15

5

1

1K

DanielGaribi retweeted

shahar sarfaty @shaharsarfaty

6 months ago

The GenAI LoRA ecosystem is a dense jungle. 🌿 Introducing CARLoS 🕵️‍♂️ - a system that retrieves LoRAs by how they alter diffusion behavior, and links these metrics to key concepts in copyright law. ⚖️ 🔗 https://t.co/eKBjI8Y1GZ 📄 https://t.co/VEhqFOm0hF 🧵[1/6]

shaharsarfaty's tweet photo. The GenAI LoRA ecosystem is a dense jungle. 🌿
Introducing CARLoS 🕵️‍♂️ - a system that retrieves LoRAs by how they alter diffusion behavior, and links these metrics to key concepts in copyright law. ⚖️
🔗 https://t.co/eKBjI8Y1GZ
📄 https://t.co/VEhqFOm0hF
🧵[1/6] https://t.co/EznopXUbaM

5

25

13

2

903

DanielGaribi retweeted

Sagi Polaczek 🦜

@PolaczekSagi

6 months ago

[1/4] Sync about it… 💭✨ Editing a portrait video yet keeping it fully synced with the original across the entire sequence. Read more about Sync-LoRA: https://t.co/4sAnR8FcZ1 🚀

1

43

20

5

3K

DanielGaribi retweeted

Delip Rao e/σ

@deliprao

6 months ago

Hey @iclr_conf, reverting scores is unnecessary punishment for the majority of the authors who had nothing to do with this incident and had successful rebuttals. Instead of detecting collusions on your end (you have a ton of metadata) why is this everyone’s burden to bear?

deliprao's tweet photo. Hey @iclr_conf, reverting scores is unnecessary punishment for the majority of the authors who had nothing to do with this incident and had successful rebuttals. Instead of detecting collusions on your end (you have a ton of metadata) why is this everyone’s burden to bear? https://t.co/HHGLMXGq1h

8

215

29

10

39K

DanielGaribi retweeted

Yusuf Dalva

@yusuf_dalva

7 months ago

Introducing Canvas-to-Image (C2I): A new paradigm where you define all controls within a single RGB canvas. 🎨 We simplify complex generation into one intuitive interface. Place specific Identities, Poses, and Boxes to control exactly who appears, how they pose, and where they stand. C2I interprets your design and generates your vision faithfully. You are the designer.

15

288

44

266

30K

DanielGaribi retweeted

Ron Mokady

@MokadyRon

7 months ago

Generating an image from 1,000 words. Very excited to release Fibo 😃, the first ever open-source model trained exclusively on long, structured captions. Fibo sets a new standard for controllability and disentanglement in image generation [1/6] 🧵

MokadyRon's tweet photo. Generating an image from 1,000 words.

Very excited to release Fibo 😃, the first ever open-source model trained exclusively on long, structured captions.

Fibo sets a new standard for controllability and disentanglement in image generation

[1/6] 🧵

26

518

63

326

75K

DanielGaribi retweeted

Shai Yehezkel @YehezkelShai

7 months ago

Visual Diffusion Models are Geometric Solvers We cast geometry as images: a plain diffusion model denoises into valid solutions. It is simple, general and effective. Shown on Inscribed Square, Steiner Tree, and Maximum Area Polygonization - all classic hard problems.

YehezkelShai's tweet photo. Visual Diffusion Models are Geometric Solvers

We cast geometry as images: a plain diffusion model denoises into valid solutions. It is simple, general and effective.
Shown on Inscribed Square, Steiner Tree, and Maximum Area Polygonization - all classic hard problems. https://t.co/KzXxbk57Se

3

75

16

23

11K

Daniel Garibi

@DanielGaribi

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users