Keming Wu @Keming_Charles - Twitter Profile

Pinned Tweet

8 months ago

Why do open-source image editing models lag behind closed-source giants like GPT-Image-1, Seedream, & Google-Nano-Banana? 🤔 It’s mainly due to the quality of the training reward signal. We’re bridging the gap. Meet EditReward! 🏆

Keming_Charles's tweet photo. Why do open-source image editing models lag behind closed-source giants like GPT-Image-1, Seedream, & Google-Nano-Banana? 🤔

It’s mainly due to the quality of the training reward signal.

We’re bridging the gap. Meet EditReward! 🏆 https://t.co/gl7qe0UBFv

4

148

15

74

44K

Keming_Charles retweeted

Zuhao Yang @mwxely464

11 days ago

What if the very pretrained prior that lets an RL agent explore tools also destroys the format that made it tool-native? We name this the Tool Prior Paradox — and tame it with PARA-GRPO. 🚀 Introducing ParaVT: parallel video tool use × agentic RL.

mwxely464's tweet photo. What if the very pretrained prior that lets an RL agent explore tools also destroys the format that made it tool-native?

We name this the Tool Prior Paradox — and tame it with PARA-GRPO.

🚀 Introducing ParaVT: parallel video tool use × agentic RL. https://t.co/kvpQiU36n2

7

15

10

5

663

Keming Wu @Keming_Charles

25 days ago

🙏 If you find this useful: ⭐ Star the repo → https://t.co/Mgc3urfLGg 👍 Upvote on HF Daily Paper → https://t.co/1Hn3mrdYDS 🔁 Retweet to help us reach researchers working on world models, video gen & reward modeling

0

1

0

40

Keming Wu @Keming_Charles

25 days ago

🌍 Can today's video generators REASON about how the world should evolve — or do they just render it beautifully? Introducing WorldReasonBench: a human-aligned stress test that re-frames video generation as future world-state prediction. 🌐 https://t.co/c0HBr58WRu

Keming_Charles's tweet photo. 🌍 Can today's video generators REASON about how the world should evolve — or do they just render it beautifully?
Introducing WorldReasonBench: a human-aligned stress test that re-frames video generation as future world-state prediction.
🌐 https://t.co/c0HBr58WRu https://t.co/6kgoGmPWdY

1

17

4

6

1K

Keming Wu @Keming_Charles

25 days ago

Qualitative example — when SOTA still fails 🎬 Visually plausible ≠ world-aware. Even top models get classic prompts wrong: • "Pencil in water" — refraction direction inverted Browse all qualitative cases on the project page 👇

1

0

42

Keming_Charles retweeted

Computer Vision and Pattern Recognition Papers @CSVisionPapers

about 1 month ago

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling Keming Wu, Zuhao Yang, Kaichen Zhang, Shizun Wang, Haowei Zhu, Sicong Leng, Zhongyu Yang, Qijie Wang, … https://t.co/3PaATxbfiA [𝚌𝚜.𝙲𝚅] 💬Project: https://t.co/e06OsM7VfL

CSVisionPapers's tweet photo. Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

Keming Wu, Zuhao Yang, Kaichen Zhang, Shizun Wang, Haowei Zhu, Sicong Leng, Zhongyu Yang, Qijie Wang, …
https://t.co/3PaATxbfiA [𝚌𝚜.𝙲𝚅]
💬Project: https://t.co/e06OsM7VfL https://t.co/UZPUODjtl0

0

1

0

156

Keming_Charles retweeted

Kaichen Zhang @KaichenZhang358

about 1 month ago

Excited to share a fun project I recently collaborated on: a roadmap for thinking about where visual generation is heading next. The key question is no longer just “can it make beautiful images?”, but whether it can handle memory, interaction, and eventually world modeling.

KaichenZhang358's tweet photo. Excited to share a fun project I recently collaborated on: a roadmap for thinking about where visual generation is heading next. The key question is no longer just “can it make beautiful images?”, but whether it can handle memory, interaction, and eventually world modeling. https://t.co/PmH9Co5yGn

1

14

7

2

2K

Keming Wu @Keming_Charles

about 1 month ago

Takeaway: The future is not just higher-fidelity images. It is controllable, interactive, verifiable, and world-aware generation. arXiv: https://t.co/F40gaRS939 HF Daily Paper: https://t.co/jnC75XxbGf GitHub: https://t.co/G50LYzKrLJ WebPage: https://t.co/cB510BFxfp

0

4

0

1

126

Keming Wu @Keming_Charles

about 1 month ago

Excited to share our new roadmap: Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling What does it mean for image generation models to become truly intelligent? HF Daily Paper: https://t.co/jnC75XxbGf WebPage: https://t.co/cB510BFxfp

Keming_Charles's tweet photo. Excited to share our new roadmap:

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

What does it mean for image generation models to become truly intelligent?

HF Daily Paper: https://t.co/jnC75XxbGf
WebPage: https://t.co/cB510BFxfp https://t.co/40iduyTFvl

1

64

15

38

10K

Keming Wu @Keming_Charles

about 1 month ago

Benchmarks often reward visual quality. But real progress also needs spatial reasoning, topology, symbolic structure, and code/math-grounded correctness. We stress-test physical and causal reasoning: These examples probe the boundary between image synthesis and world modeling.

Keming_Charles's tweet photo. Benchmarks often reward visual quality.

But real progress also needs spatial reasoning, topology, symbolic structure, and code/math-grounded correctness.

We stress-test physical and causal reasoning:
These examples probe the boundary between image synthesis and world modeling. https://t.co/MbdhrzNs2e

1

3

0

155

Keming Wu @Keming_Charles

about 2 months ago

An insightful and excellent piece of work.

Lianghui Zhu @lianghui_zhu

about 2 months ago

For a decade, we've made models wider and deeper—but we've barely changed how layers *talk* to each other. Since ResNet's `x + F(x)` in 2015, the depth residual has been the only highway for inter-layer communication. It's time to upgrade the staircase. 🧵

lianghui_zhu's tweet photo. For a decade, we've made models wider and deeper—but we've barely changed how layers *talk* to each other.

Since ResNet's `x + F(x)` in 2015, the depth residual has been the only highway for inter-layer communication.

It's time to upgrade the staircase. 🧵 https://t.co/KIvzN4w9dT

18

2K

240

2K

188K

0

4

0

187

Keming_Charles retweeted

Jianyang Gao

@gaoj0017

2 months ago

The TurboQuant paper (ICLR 2026) contains serious issues in how it describes RaBitQ, including incorrect technical claims and misleading theory/experiment comparisons. We flagged these issues to the authors before submission. They acknowledged them, but chose not to fix them. The paper was later accepted and widely promoted by Google, reaching tens of millions of views. We’re speaking up now because once a misleading narrative spreads, it becomes much harder to correct. We’ve written a public comment on openreview (https://t.co/nDVjmNhATM). We would greatly appreciate your attention and help in sharing it.

98

6K

963

2K

1M

Keming Wu

@Keming_Charles

Last Seen Users on Sotwe

Trends for you

Most Popular Users