Ching-An Cheng @chinganc_rl - Twitter Profile

Pinned Tweet

4 months ago

LLMs have been struggling to solve search and optimization at scale when feedback is stochastic. We propose a simple solution, POLCA, using text embedding with a “provable” guarantee. Excited to see the first theoretically correct work on LLM optimization. Kudos to @XuanfeiRen

Xuanfei Ren @XuanfeiRen

4 months ago

🚀 How can we make LLM-based optimization stable and scalable when the feedback signal is stochastic? Introducing POLCA: a framework for robust, scalable stochastic generative optimization. Paper: https://t.co/xgdjISRxtE Code: https://t.co/9TRuyvxVcf 🧵👇 1/

Ching-An Cheng @chinganc_rl

3 months ago

Looking for a Google Research student researcher (PhD student) to work on LLM- and agent-related learning. Preferred background: RL/game theory, agentic systems, LLM training. The candidate will work closely with me and @allenainie. Email me if you are interested. 😀

chinganc_rl retweeted

Nan Jiang @nanjiang_cs

3 months ago

I have served as AC for NeurIPS every year since 2020. Just declined (with messages adapted from @xuanalogue). At least the organizers owe the community an explanation why they are the only major ML venue adopting such a policy.

chinganc_rl retweeted

Dan Roy

@roydanroy

10 months ago

Too close to home?
Junior researcher: I’m publishing papers at NeurIPS, my students are happy, but my chair says I’m “not impactful enough.” I don’t know what that means.
Senior researcher: What did you tell them you accomplished last year?
Junior: 3 top-tier papers, a new theoretical result on regret bounds, and an invited talk.
Senior: And what did they hear?
Junior: That I published 3 papers?
Senior: They heard “I added to the publication count, but didn’t bring in grants or visibility for the department.”
Junior: But regret bounds are impactful!
Senior: To who?
Junior: To… theorists?
Senior: Your chair spends 20 minutes a month justifying your position to the dean. Can they use regret bounds to argue for funding?
Junior: …probably not.
Senior: What external metrics did your work move?
Junior: One collaboration, one best paper award, and some citations. We don’t really track grant impact.
Senior: There’s the problem. Half your contributions are invisible by design.
Junior: But theory is necessary. The field would break without it.
Senior: I believe you. The dean doesn’t care.
Junior: That seems unfair.
Senior: It is unfair. It’s also how academia works. Chairs get grilled on grants, rankings, and prestige, not the long-run stability of ML theory.
Junior: So what should I do?
Senior: Reframe. “Secured $500K in funding to explore foundational algorithms” sounds better than “proved a tighter regret bound.”
Junior: But I don’t have that funding.
Senior: Then you’re fighting academic reality without weapons.
Junior: I don’t have time to write grants and still publish.
Senior: Most junior faculty don’t. That’s the trap — you get judged on impact but don’t get impact resources.
Junior: So what do I do?
Senior: Acknowledge the game is rigged, then play it anyway.
Junior: Meaning?
Senior: Build collaborations that attract funding. Tie your theory to hot applied areas. Translate your results into language deans understand.
Junior: That feels political.
Senior: Everything above a certain level is political. The choice isn’t political vs pure. It’s visible vs irrelevant.
Junior: What if my chair still doesn’t care?
Senior: Then you’ve learned your chair doesn’t know how to evaluate theory. That’s a different problem — one you solve by finding a better environment.
Junior: This is harder than just proving good theorems.
Senior: Proving good theorems is table stakes. Surviving academia while proving good theorems — that’s the actual job.

Ching-An Cheng @chinganc_rl

11 months ago

Happening now at #RLC2025. Join us if you’re interested in program, agents and RL.

Shao-Hua Sun @ ICML 🇰🇷 @shaohua0116

11 months ago

Kicking off #RLC2025 with our Workshop on Programmatic Reinforcement Learning! This workshop explores how programmatic representations can improve interpretability, generalization, efficiency, and safety in RL.

chinganc_rl retweeted

Brando Miranda

@BrandoHablando

12 months ago

🔄 We were nominated for Oral+top 1 in the MATH-AI workshop at #ICML! 🚨Why? ≈46% of GitHub commits are AI-generated—but can we verify them correct? 📢 VeriBench challenges agents; turn Python into Lean code! 🧵1/14 📃 Paper: https://t.co/QPCxg5lKM4

Ching-An Cheng @chinganc_rl

12 months ago

We are organizing a workshop tomorrow at #icml25. Come join us and check out the latest on programmatic representation and agent learning.

Shao-Hua Sun @ ICML 🇰🇷 @shaohua0116

12 months ago

Our #ICML2025 Programmatic Representations for Agent Learning workshop will take place tomorrow, July 18th, at the West Meeting Room 301-305, exploring how programmatic representations can make agent learning more interpretable, generalizable, efficient, and safe! Come join us!

Ching-An Cheng @chinganc_rl

12 months ago

Starting my #ICML2025. Will be here until Saturday. Looking forward to meeting everyone 😀

chinganc_rl retweeted

Allen Nie ✈️ ICML 2026 🇰🇷

@allenainie

12 months ago

Provably Learning from Language Feedback TLDR: RL theory can help us do better inference-time exploration with feedback. Work done with @wanqiao_xu, @ruijie_zheng12, @chinganc_rl, @adityamodi94, @adith387 📰 https://t.co/Zi3EwmX98R 📍EXAIT Best Paper/Oral Sat 8:45-9:30 am

Ching-An Cheng @chinganc_rl

about 1 year ago

Super excited about this work done by our former intern @wanqiao_xu. We show that Learning from Language Feedback (LLF) with LLMs can be formally studied with provable no-regret learning algorithms. This result builds a foundation toward new theories for LLM learning and optimization.

Allen Nie ✈️ ICML 2026 🇰🇷

@allenainie

about 1 year ago

Decision-making with LLM can be studied with RL! Can an agent solve a task with text feedback (OS terminal, compiler, a person) efficiently? How can we understand the difficulty? We propose a new notion of learning complexity to study learning with language feedback only. 🧵👇

Ching-An Cheng @chinganc_rl

about 1 year ago

Check out this new optimization framework (https://t.co/sN6E1jWl4n) by #DataRobot that can automatically search for "Pareto-optimal" solutions for agentic workflows. It's built on our LLM generative optimization framework #Trace. Excited to see more applications of #Trace! 😎

chinganc_rl retweeted

Shao-Hua Sun @ ICML 🇰🇷 @shaohua0116

about 1 year ago

Our ICML & RLC workshops welcome contributions using programmatic representations as policies, reward functions, skill libraries, task generators, environment models, etc., to improve interpretability, generalization, efficiency, & safety in agent learning & RL! Please retweet 🙏

Ching-An Cheng @chinganc_rl

about 1 year ago

Organizers: Shao-Hua Sun @shaohua0116, Levi Lelis @levilelis, Xinyun Chen @xinyun_chen_, Shreyas Kapur @shreyaskapur, Jiayuan Mao @maojiayuan, Ching-An Cheng @chinganc_rl, Anqi Li @AnqiLi24, Kuang-Huei Lee @kuanghueilee, and Leslie Kaelbling

Ching-An Cheng @chinganc_rl

about 1 year ago

We're organizing workshops on Programmatic Representation for Agent Learning at the upcoming #ICML2025 and #RLC2025. We welcome contributions using programs as policies, reward functions, skill libraries, task generators, environment models, etc., and more! See you soon!😀

Ching-An Cheng @chinganc_rl

about 1 year ago

Workshop on Programmatic Reinforcement Learning (RLC 2025)
- Web page: https://t.co/a8h4YObjUE
- Submission Deadline: May 30, 2025, AoE
- Author Notification: June 15, 2025, AoE
- Workshop Date: August 5, 2025 @ Edmonton, Canada

Ching-An Cheng @chinganc_rl

about 1 year ago

Started my new job at #Google Research recently. Super excited about what can be done here. 😎

chinganc_rl retweeted

RL_Conference @RL_Conference

about 1 year ago

The RLC accepted workshops list is out (link in next tweet)!
- Programmatic RL
- Causal RL
- RL and videogames
- Inductive biases and RL
and returning from last year: RL beyond rewards, finding the frame, and RL in practice!
