Zhikun Xu @JerrryKun - Twitter Profile

Treating reasoning and acting as two tools for one job folds many debates (long context vs RAG, think vs act, etc.) into a single allocation question. And framing the internalization–externalization boundary as the next design question feels exactly right. Really inspiring read!

Hongru Wang @HongruWang007

6 days ago

Two students take the same exam. Both score 100 — one solved it himself, the other Googled every answer. A semester later, the gap is huge. That's the problem with today's AI agents. I write a detailed blog to share my recent thoughts on this, mainly based on Theory of Agents. I promise this is definitely worth 30 minutes of your time. Blog: https://t.co/VCFC7RnbU6 Project: https://t.co/WFLEYhOaCl

1

48

13

41

10K

2

4

1

583

Zhikun Xu

@JerrryKun

5 days ago

That last part resonates with what motivated our ongoing project, ReuseRL: if a capability only internalizes when it is compressible, the real question becomes what gets internalized. We go looking for the atoms: the small set of reusable skills a model can absorb and build on.

1

2

0

98

Zhikun Xu

@JerrryKun

8 days ago

@Swarooprm7 We may need to define the "machine creativity/novelty" first. What about searching for some counterexamples in formal math? Concepts are created to simplify things, facilitate abstraction, and improve human understanding (efficiency, etc.). Why do AI models need novel concepts?

0

1

0

215

Zhikun Xu

@JerrryKun

17 days ago

@wzenus @YejinChoinka @jiajunwu_cs @ManlingLi_ @LINJIEFUN @chi_gui_1 @DeimosGN @qineng_wang @James_KKW @shiqi_chen17 @zhengyuan_yang Well-deserved👍

0

1

0

59

Zhikun Xu

@JerrryKun

about 2 months ago

✨Check out the paper to learn more! This paper was done by our intern @Zijun0916 last summer. A big thank you to my labmate @XiaoYe1170354 and our advisor @BenZhou96 for the support and guidance!

0

198

Zhikun Xu

@JerrryKun

about 2 months ago

LLMs memorize massive amounts of text, but can they actually apply this knowledge conceptually? 🤔 Our #ICLR 26' paper from the ARC Lab probes this in math reasoning! "CORE: Concept-Oriented Reinforcement for Bridging the Definition-Application Gap..." 🔗 https://t.co/GscQjqjTMd

JerrryKun's tweet photo. LLMs memorize massive amounts of text, but can they actually apply this knowledge conceptually? 🤔

Our #ICLR 26' paper from the ARC Lab probes this in math reasoning! "CORE: Concept-Oriented Reinforcement for Bridging the Definition-Application Gap..." 🔗 https://t.co/GscQjqjTMd https://t.co/M4ZyFGKRpp

1

9

2

0

593

Zhikun Xu

@JerrryKun

about 2 months ago

📈 The results: Consistent gains over vanilla baselines, including up to +9.3% on in-domain Textbook problems and +9.6% on out-of-domain TheoremQA. We also did ablation experiments to show the results are consistent with different models and across different benchmarks.

1

0

239

Zhikun Xu

@JerrryKun

4 months ago

@HBX_hbx @QuYuxiao Besides QuestA, there are also many other related works using a similar idea from last year: BREAD(https://t.co/O7wOGJnYPD), Scaf-GRPO(https://t.co/oug6ytphJ3), and CORE(https://t.co/1ygNSBsQC1). "Guided prefix" could be partial oracle solutions, problem-related concepts, etc.

0

1

0

99

Zhikun Xu

@JerrryKun

5 months ago

@denny_zhou Fake agi: Gemini🤖 Real agi: daughter🧒

0

2

0

314

Zhikun Xu

@JerrryKun

6 months ago

Highly resonate about this. Conceptual reasoning should be very important for LLM reasoning.

Denny Zhou

@denny_zhou

6 months ago

To the questions of “why not both?”: my dream is for LLMs to make conceptual discoveries, like Galois with group theory or Einstein with general relativity. I don’t believe breakthroughs like these would come from A* search or its more advanced version MCTS.

5

185

5

21

27K

0

777

Zhikun Xu

@JerrryKun

6 months ago

@ShashwatGoel7 👀 https://t.co/0ZaHOGxxEx

1

0

518

Zhikun Xu

@JerrryKun

6 months ago

First time at #NeurIPS2025! I’ll be in San Diego from Dec 1–6 and would love to make new friends, grab some tea🍵, and discuss LLM reasoning (math & cognition-inspired), RL, and more! Feel free to DM!

0

6

0

640

Zhikun Xu

@JerrryKun

6 months ago

@zjasper @hyperbolic_labs https://t.co/H5u5ZB51HL 👀how about testing it with some counterexamples?

0

2

0

1

212

Zhikun Xu

@JerrryKun

7 months ago

@taiwei_shi NeurIPS+1! Hope to talk with u🙋

0

1

0

159

Zhikun Xu

@JerrryKun

Last Seen Users on Sotwe

Trends for you

Most Popular Users