Chris Ge @ChrisGe05 - Twitter Profile

8 days ago

@_virgil19 @rohitgandikota @bfl_ml The text tokens are actually discarded at the end of each denoising step, and they receive no supervision loss during training time; the only supervision is on the image tokens. We show causality through the attention knockout and I2I-to-I2I patching experiments.

1

0

338

Chris Ge @ChrisGe05

10 days ago

For more qualitative and quantitative results, check out our paper and project page. Project: https://t.co/8z0uPQBZo3 Code: https://t.co/oCRwcHuK5x Paper: https://t.co/Ao41099M4f This work was done in collaboration with @rohitgandikota, Antonio Torralba, and @TamarRottShaham

0

10

0

7

833

Chris Ge @ChrisGe05

10 days ago

FLUX.2's @bfl_ml text tokens aren't just holding your prompt. During image editing, they absorb reference image content, and some of that absorbed content, like color and style, causally drives the output appearance. New paper 🧵👇

7

201

35

145

25K

Chris Ge @ChrisGe05

10 days ago

Our findings suggest an efficiency opportunity: for some image editing tasks, once the text tokens have absorbed the reference content, the reference image no longer needs to participate in the rest of the computation.

ChrisGe05's tweet photo. Our findings suggest an efficiency opportunity: for some image editing tasks, once the text tokens have absorbed the reference content, the reference image no longer needs to participate in the rest of the computation. https://t.co/upaElsBSHQ

1

7

0

4

960

Chris Ge @ChrisGe05

about 1 month ago

Come check out our work on better utilizing agentic coding benchmark evaluation data, presented at #ICLR2026 Agents in the Wild Workshop!

Daria Kryvosheieva @DKryvosheieva

about 1 month ago

Today’s coding agent evals = single-number benchmark accuracies. But this obscures important details: which tasks in a benchmark are harder, and why? We study agent performance at the task level, and predict how new agents perform on new tasks. 📃To appear at ICLR 2026 AIWILD!

DKryvosheieva's tweet photo. Today’s coding agent evals = single-number benchmark accuracies.

But this obscures important details: which tasks in a benchmark are harder, and why?

We study agent performance at the task level, and predict how new agents perform on new tasks.

📃To appear at ICLR 2026 AIWILD! https://t.co/N2QIAVXkOg

2

21

3

6

4K

0

6

0

285

ChrisGe05 retweeted

Fulcrum

@fulcrum_inc

2 months ago

🚨 We're open-sourcing Druids, a library for coordinating and deploying coding agents across machines. Our beta users have used Druids to work on open math problems, conduct ML "autoresearch," and make software faster.

3

225

31

227

26K

Chris Ge

@ChrisGe05

Last Seen Users on Sotwe

Trends for you

Most Popular Users