Xingchen Wan @wanxingchen_ - Twitter Profile

Han Zhou @ICLR 2026

about 2 months ago

✈️Arrived in Rio🇧🇷 for #ICLR2026 this week! Thrilled to present our work: 1. Visual Planning: Let's Think Only with Images @ Oral Session 3D Vision language models II Fri 24 Apr 10:30; Poster P4-3304 2. Multi-Agent Design Sat 25 Apr 10:30, P4-4713 Looking forward to a chat!

hanzhou032's tweet photo. ✈️Arrived in Rio🇧🇷 for #ICLR2026 this week!

Thrilled to present our work:

1. Visual Planning: Let's Think Only with Images
@ Oral Session 3D Vision language models II
Fri 24 Apr 10:30; Poster P4-3304

2. Multi-Agent Design
Sat 25 Apr 10:30, P4-4713

Looking forward to a chat! https://t.co/7Jb0mt7TRI

0

5

1

0

331

wanxingchen_ retweeted

Louis Gleeson

@aigleeson

8 months ago

🚨 Google just dropped the most advanced self-improving video AI ever built. It’s called VISTA, and it literally rewrites its own prompts to make every new generation better than the last. No retraining. No fine-tuning. Just pure test-time self-reflection. Here’s how it works: → Turns your idea into a full scene-by-scene storyboard → Generates multiple video candidates → Runs a tournament to find the best one → Then critiques itself visually, audibly, contextually before trying again Each loop = sharper visuals, tighter storytelling, more aligned motion. The results? 60% win rate vs Veo 3 and 66.4% human preference. This isn’t “text-to-video.” This is video that learns from itself.

aigleeson's tweet photo. 🚨 Google just dropped the most advanced self-improving video AI ever built.

It’s called VISTA, and it literally rewrites its own prompts to make every new generation better than the last.

No retraining. No fine-tuning. Just pure test-time self-reflection.

Here’s how it works:

→ Turns your idea into a full scene-by-scene storyboard
→ Generates multiple video candidates
→ Runs a tournament to find the best one
→ Then critiques itself visually, audibly, contextually before trying again

Each loop = sharper visuals, tighter storytelling, more aligned motion.

The results? 60% win rate vs Veo 3 and 66.4% human preference.

This isn’t “text-to-video.”

This is video that learns from itself.

66

813

134

485

84K

wanxingchen_ retweeted

AK

@_akhaliq

8 months ago

Google presents VISTA A Test-Time Self-Improving Video Generation Agent

4

208

26

97

28K

wanxingchen_ retweeted

Han Zhou @ICLR 2026

@hanzhou032

about 1 year ago

Automating Multi-Agent Design: 🧩Multi-agent systems aren’t just about throwing more LLM agents together. 🛠️They require mastering the subtle art of prompting and agent orchestration. Introducing MASS🚀- Our new agent optimization framework for better prompts and topologies!

hanzhou032's tweet photo. Automating Multi-Agent Design:

🧩Multi-agent systems aren’t just about throwing more LLM agents together.

🛠️They require mastering the subtle art of prompting and agent orchestration.

Introducing MASS🚀- Our new agent optimization framework for better prompts and topologies!

14

723

160

1K

82K

Who to follow

AutoML Conference

@automl_conf

Official account for International Conference on Automated Machine Learning. #AutoML26 Ljubljana, Slovenia

Jean-François Ton

@jeanfrancois287

ByteDance Seed @ByteDance_Seed | Senior Research Scientist working on LLMs | prev. @oxcsml @UniofOxford, @amazon, @apple, @bloomberg All opinions are my own

Aryan Deshwal

@deshwal_aryan

Assistant Professor in Computer Science @UMNews. AI to Accelerate Scientific Discovery and Engineering Design

wanxingchen_ retweeted

Yi Xu

@_yixu

about 1 year ago

🚀Let’s Think Only with Images. No language and No verbal thought.🤔 Let’s think through a sequence of images💭, like how humans picture steps in their minds🎨. We propose Visual Planning, a novel reasoning paradigm that enables models to reason purely through images.

_yixu's tweet photo. 🚀Let’s Think Only with Images.

No language and No verbal thought.🤔

Let’s think through a sequence of images💭, like how humans picture steps in their minds🎨.

We propose Visual Planning, a novel reasoning paradigm that enables models to reason purely through images.

15

1K

220

1K

230K

wanxingchen_ retweeted

elvis

@omarsar0

over 1 year ago

Astute RAG Proposes a novel RAG approach to deal with the imperfect retrieval augmentation and knowledge conflicts of LLMs. Astute RAG adaptively elicits essential information from LLMs' internal knowledge. Then it iteratively consolidates internal and external knowledge with source-awareness. Astute RAG is designed to better combine internal and external information through an interactive consolidation mechanism (i.e., identifying consistent passages, detecting conflicting information in them, and filtering out irrelevant information). (Prompts for this step provided in the paper) The explicit consolidation step addresses knowledge conflicts which is probably one of the most challenging parts of building reliable RAG systems. It really does help to know how to leverage the internal and external information of RAG systems.

omarsar0's tweet photo. Astute RAG

Proposes a novel RAG approach to deal with the imperfect retrieval augmentation and knowledge conflicts of LLMs.

Astute RAG adaptively elicits essential information from LLMs' internal knowledge. Then it iteratively consolidates internal and external knowledge with source-awareness.

Astute RAG is designed to better combine internal and external information through an interactive consolidation mechanism (i.e., identifying consistent passages, detecting conflicting information in them, and filtering out irrelevant information). (Prompts for this step provided in the paper)

The explicit consolidation step addresses knowledge conflicts which is probably one of the most challenging parts of building reliable RAG systems. It really does help to know how to leverage the internal and external information of RAG systems.

6

325

63

256

27K

Xingchen Wan @wanxingchen_

almost 2 years ago

Curious about getting AI to think more like us? Discover how to tune your prompts for better alignment with human judgment! 🧠💡 https://t.co/izY5syk338

Han Zhou @ICLR 2026

@hanzhou032

almost 2 years ago

Which output is better? [A] or [B]? LLM🤖: B❌ [B] or [A]? LLM🤖: A✅ Thrilled to share our preprint in addressing preference biases in LLM judgments!🧑‍⚖️We introduce ZEPO, a 0-shot prompt optimizer that enhances your LLM evaluators via fairness⚖️ 📰Paper: https://t.co/ZkMvJnFFMC

hanzhou032's tweet photo. Which output is better?
[A] or [B]? LLM🤖: B❌
[B] or [A]? LLM🤖: A✅

Thrilled to share our preprint in addressing preference biases in LLM judgments!🧑‍⚖️We introduce ZEPO, a 0-shot prompt optimizer that enhances your LLM evaluators via fairness⚖️

📰Paper: https://t.co/ZkMvJnFFMC https://t.co/qtz1ckZJSa

3

98

23

61

12K

0

3

1

0

454

Xingchen Wan @wanxingchen_

about 2 years ago

@xiye_nlp @UAlberta @PrincetonPLI Congrats!

0

133

wanxingchen_ retweeted

Masaki Adachi @masaki_adachi

about 2 years ago

For batch active learning, how can the algorithm determine the batch size? Use computational uncertainty via kernel quadrature! Check out our new AISTATS paper with @maosbot, @JorgensenMart, @wanxingchen_, @nguyentienvu!🎉 #ProbabilisticNumerics paper: https://t.co/ZtR0RHwT1G

0

26

6

5

3K

Xingchen Wan @wanxingchen_

over 2 years ago

@SamuelAlbanie @Cambridge_Eng @Oxford_VGG Congrats!

0

84

wanxingchen_ retweeted

Han Zhou @ICLR 2026

@hanzhou032

over 2 years ago

#ICLR2024 Proud that 🚀Batch Calibration🚀 is accepted at @iclr_conf! 🔥Batch Calibration addresses the prompt sensitivity of LLMs and enhances LLMs' robustness to ICL orders, prompt templates, and ✔️❌choice of verbalizers even in emojis👍👎 📙Paper: https://t.co/UIhPGLsuLX

2

60

13

27

6K

wanxingchen_ retweeted

Google AI

@GoogleAI

over 2 years ago

Visit the #EMNLP2023 Google booth today at 3:30 PM to learn about Universal Self-Adaptive Prompting, an automatic method tailored for zero-shot learning (while compatible with few-shot) that uses a small amount of unlabelled data & an inference-only LLM.

12

199

48

41

58K

Xingchen Wan @wanxingchen_

over 2 years ago

@linylinx Why not apply with an endorsement from RAEng which gives you ILR in 3 years instead of 5? https://t.co/YIeA0FCr88 AI/ML research is listed under RAEng, and industrial researchers are permitted: https://t.co/o7ypoaVCSH

0

68

Xingchen Wan @wanxingchen_

over 2 years ago

Work done with @ruoxi_cc Hootan Nakhost @hanjundai @eisenjulian @sercanarik @tomaspfister. Thanks to all my amazing co-authors 🙌

0

3

1

0

498

Xingchen Wan @wanxingchen_

over 2 years ago

Introducing COSP and USP: they select good demos from unlabeled samples & LLMs’ self-generated outputs as in-context examples to achieve stronger zero-shot performance 💪 Paper: https://t.co/YwqZGpKuJh (COSP: ACL Findings) + https://t.co/wJ3Xuoxhns (USP: EMNLP)

Google AI

@GoogleAI

over 2 years ago

Introducing a new approach for adaptive prompting of #LLMs that train with unlabeled samples + pseudo-demonstrations generated by the model itself to close the gap between few-shot and 0-shot performance on reasoning, NLU and language generation tasks. → https://t.co/ZtSEZOxNCc

25

480

116

137

125K

2

8

2

1

1K

wanxingchen_ retweeted

Han Zhou @ICLR 2026

@hanzhou032

over 2 years ago

🤔Curious about an efficient way to automatically find better prompts for your LLM? Thrilled to Introduce “ClaPS” which underscores the critical role of search space in prompt search accepted to #EMNLP Findings🎉. 📎Code: https://t.co/hW474orfRN 📷Paper: https://t.co/NshpmkRmwd

hanzhou032's tweet photo. 🤔Curious about an efficient way to automatically find better prompts for your LLM? Thrilled to Introduce “ClaPS” which underscores the critical role of search space in prompt search accepted to #EMNLP Findings🎉.

📎Code: https://t.co/hW474orfRN
📷Paper: https://t.co/NshpmkRmwd https://t.co/RzOZz23nnN

2

17

4

2K

Xingchen Wan @wanxingchen_

over 2 years ago

Check out Batch Calibration! Blog: https://t.co/E51FN6hDGx Paper: https://t.co/ITeJBpRCuy

Google AI

@GoogleAI

over 2 years ago

LLMs’ sensitivity to design decisions, such as template choice, can degrade performance & prevent their robust application. Introducing Batch Calibration, a simple method that mitigates this effect & improves on existing methods w/ negligible add’l cost → https://t.co/Y7VbhanA3w

22

607

120

215

158K

0

6

0

592

Xingchen Wan @wanxingchen_

almost 3 years ago

5/5 We evaluate COSP on three different LLMs in the zero-shot setting over 6 tasks to show vast improvement. Paper: https://t.co/YwqZGpKuJh. Many thanks to my fantastic coauthors: @ruoxi_cc, @sercanarik, @hanjundai and @tomaspfister.🤝

1

6

0

352

Xingchen Wan @wanxingchen_

almost 3 years ago

1/5 Wondering how to do better than “Let’s think step by step” in zero-shot? Enter COSP, a prompting technique that massively improves reasoning over 0-shot-CoT & matches few-shot in 3 LLMs – a work done during my internship at @Google & accepted to #ACL2023NLP Findings🤩

wanxingchen_'s tweet photo. 1/5 Wondering how to do better than “Let’s think step by step” in zero-shot? Enter COSP, a prompting technique that massively improves reasoning over 0-shot-CoT & matches few-shot in 3 LLMs – a work done during my internship at @Google & accepted to #ACL2023NLP Findings🤩 https://t.co/u4Z77SiKWb

1

18

6

3

3K

Xingchen Wan @wanxingchen_

almost 3 years ago

4/5 We thus use entropy (and a repetition penalty) to craft a scoring function. We run zero-shot-CoT, score the outputs and determine which outputs should serve as the LLM’s in-context demos – all done using unlabelled data & LLM’s outputs & no more laborious handcrafting 🎯

wanxingchen_'s tweet photo. 4/5 We thus use entropy (and a repetition penalty) to craft a scoring function. We run zero-shot-CoT, score the outputs and determine which outputs should serve as the LLM’s in-context demos – all done using unlabelled data & LLM’s outputs & no more laborious handcrafting 🎯 https://t.co/Mi7ZmqOAAb

1

4

0

1

487

Xingchen Wan

@wanxingchen_

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users