Qingfeng Lan @qingfeng_lan - Twitter Profile

Qingfeng Lan @qingfeng_lan

7 days ago

For long horizon tasks in LLM, PPO and GRPO are no longer suitable. Scaling off-policy algorithms will the next bet.

0

3

0

1

167

Qingfeng Lan @qingfeng_lan

3 months ago

@WenhuChen @ArminPCM Check out our loss of plasticity paper to find out: https://t.co/URynMy9Vc9

0

2

0

4

509

Qingfeng Lan @qingfeng_lan

3 months ago

Using AI doesn't magically turn NP problems into P problems.

0

1

0

106

Qingfeng Lan @qingfeng_lan

3 months ago

There are three wailing walls for intelligent systems: the exploration-exploitation dilemma, credit assignment, and continual learning.

qingfeng_lan's tweet photo. There are three wailing walls for intelligent systems: the exploration-exploitation dilemma, credit assignment, and continual learning. https://t.co/0dKQEpADY0

1

2

1

0

175

Who to follow

Alan Chan

@_achan96_

Research Fellow @GovAIOrg | AI policy | PhD from @Mila_quebec | 🇨🇦

Rupali Bhati

@BhatiRupali

PhD @Northeastern | RL | MARL | Ex @CHAI_Berkeley, @MATSprogram, @Mila_Quebec

John D. Martin

@jdmartin86

Fellow @ Openmind Research Institute. Adjunct Professor @UAlberta. Thinking about AI and RL.

Qingfeng Lan @qingfeng_lan

4 months ago

Qwen is nothing without its people. ❤️🥃

Junyang Lin

@JustinLin610

4 months ago

me stepping down. bye my beloved qwen.

2K

13K

717

1K

7M

1

4

0

520

Qingfeng Lan @qingfeng_lan

4 months ago

# Made a small contribution Putting in some extra hours on Chinese New Year's Eve – finally ready to enjoy the holidays!

Qwen

@Alibaba_Qwen

4 months ago

🚀 Qwen3.5-397B-A17B is here: The first open-weight model in the Qwen3.5 series. 🖼️Native multimodal. Trained for real-world agents. ✨Powered by hybrid linear attention + sparse MoE and large-scale RL environment scaling. ⚡8.6x–19.0x decoding throughput vs Qwen3-Max 🌍201 languages & dialects 📜Apache2.0 licensed 🔗Dive in: GitHub: https://t.co/NzNdS9joAT Chat: https://t.co/bg4tAU0Rhw API：https://t.co/YiiyKTnHoU Qwen Code: https://t.co/qqwj5nAger Hugging Face: https://t.co/wFMdX5p5um ModelScope: https://t.co/9NGXcId57a blog: https://t.co/AW8UQStXaL

Alibaba_Qwen's tweet photo. 🚀 Qwen3.5-397B-A17B is here: The first open-weight model in the Qwen3.5 series.

🖼️Native multimodal. Trained for real-world agents.
✨Powered by hybrid linear attention + sparse MoE and large-scale RL environment scaling.
⚡8.6x–19.0x decoding throughput vs Qwen3-Max
🌍201 languages & dialects
📜Apache2.0 licensed

🔗Dive in:
GitHub: https://t.co/NzNdS9joAT
Chat: https://t.co/bg4tAU0Rhw
API：https://t.co/YiiyKTnHoU
Qwen Code: https://t.co/qqwj5nAger
Hugging Face: https://t.co/wFMdX5p5um
ModelScope: https://t.co/9NGXcId57a
blog: https://t.co/AW8UQStXaL

271

5K

861

1K

1M

0

13

0

675

qingfeng_lan retweeted

Qwen

@Alibaba_Qwen

5 months ago

🚀 Introducing Qwen3-Max-Thinking, our most capable reasoning model yet. Trained with massive scale and advanced RL, it delivers strong performance across reasoning, knowledge, tool use, and agent capabilities. ✨ Key innovations: ✅ Adaptive tool-use: intelligently leverages Search, Memory & Code Interpreter without manual selection ✅ Test-time scaling: multi-round self-reflection beats Gemini 3 Pro on reasoning ✅ From complex math (98.0 on HMMT Feb) to agentic search (49.8 on HLE)—it just thinks better. 🧠 Think deeper. Solve harder. Try the adaptive reasoning experience now: https://t.co/V7RmqMaVNZ Completions API: https://t.co/Eo8DZdw4ac Responses API: https://t.co/ocUfhvT3M8 blog: https://t.co/l7MYH3pgWm

Alibaba_Qwen's tweet photo. 🚀 Introducing Qwen3-Max-Thinking, our most capable reasoning model yet. Trained with massive scale and advanced RL, it delivers strong performance across reasoning, knowledge, tool use, and agent capabilities.
✨ Key innovations:
✅ Adaptive tool-use: intelligently leverages Search, Memory & Code Interpreter without manual selection
✅ Test-time scaling: multi-round self-reflection beats Gemini 3 Pro on reasoning
✅ From complex math (98.0 on HMMT Feb) to agentic search (49.8 on HLE)—it just thinks better.

🧠 Think deeper. Solve harder.
Try the adaptive reasoning experience now: https://t.co/V7RmqMaVNZ

Completions API: https://t.co/Eo8DZdw4ac
Responses API: https://t.co/ocUfhvT3M8

blog: https://t.co/l7MYH3pgWm

197

4K

556

1K

880K

Qingfeng Lan @qingfeng_lan

8 months ago

@tydsh Crazy.

0

1

0

250

qingfeng_lan retweeted

CoLLAs 2026

@CoLLAs_Conf

about 1 year ago

📢 Just 8 weeks until #CoLLAs2025! Modern ML thrives in benchmarks but struggles in the wild. CoLLAs is where we tackle non-stationarity head-on: catastrophic forgetting, distribution shift, continual RL, online adaptation, and lifelong learning for ML. 📍 Aug 11–14 @Penn 🧠 Keynotes, Workshops, tutorials, posters & community 🔗 https://t.co/8Yyt33Ef8h #ContinualLearning #LifelongLearning #AI #MachineLearning

0

11

4

0

1K

Qingfeng Lan @qingfeng_lan

about 1 year ago

🚀RL algorithms are shaping the post-training of LLMs, but how do their objectives connect? In this blog, I explore their relationships and provide a unified perspective through the Policy Gradient Theorem—the backbone of policy gradient methods. Dive in: https://t.co/SQREPoqGH0

1

288

54

235

18K

qingfeng_lan retweeted

Amii @AmiiThinks

over 1 year ago

BREAKING: Amii Chief Scientific Advisor, Richard S. Sutton, has been awarded the A.M. Turing Award, the highest honour in computer science, alongside Andrew Barto! Read the official @TheOfficialACM announcement: https://t.co/JXDhdEsQv7 #TuringAward #AI #ReinforcementLearning

AmiiThinks's tweet photo. BREAKING: Amii Chief Scientific Advisor, Richard S. Sutton, has been awarded the A.M. Turing Award, the highest honour in computer science, alongside Andrew Barto! Read the official @TheOfficialACM announcement: https://t.co/JXDhdEsQv7

#TuringAward #AI #ReinforcementLearning https://t.co/3fpdZmROgt

5

235

50

9

33K

qingfeng_lan retweeted

CoLLAs 2026

@CoLLAs_Conf

over 1 year ago

🚨 Call for Reviewers! 🚨 Want to contribute to advancing lifelong learning research? #CoLLAs2025 is looking for expert reviewers! 📝 Help shape the field by reviewing cutting-edge research in continual and lifelong learning. 🔗 Apply here: https://t.co/nP6oE2OwPl #MachineLearning #LifelongLearning #ContinualLearning #AI #AcademicReview

0

7

5

1

955

qingfeng_lan retweeted

Mohamed Elsayed @mhmd_elsaye

over 1 year ago

Would you believe that deep RL can work without replay buffers, target networks, or batch updates? Our recent work gets deep RL agents to learn from a continuous stream of data one sample at a time without storing any sample. Joint work with @Gautham529 and @rupammahmood.

9

623

105

381

163K

qingfeng_lan retweeted

Gautham Vasan @Gautham529

over 1 year ago

Our NeurIPS paper is now on arXiv: We introduce Action Value Gradient (AVG), a novel incremental deep RL method that learns in real-time, one sample at a time — no batch updates, target networks or a replay buffer! Co-authors @mhmd_elsaye @bellingerc @white_martha @rupammahmood

2

93

21

49

10K

qingfeng_lan retweeted

Marlos C. Machado @MarlosCMachado

over 1 year ago

RLC will be held at the Univ. of Alberta, Edmonton, in 2025. I'm happy to say that we now have the conference's website out: https://t.co/ZjpvWi5jyV We'll continue to update it, and the CFP will be out soon, but the relevant dates are already there. @RL_Conference @UAlberta

0

142

35

27

11K

qingfeng_lan retweeted

CoLLAs 2026

@CoLLAs_Conf

over 1 year ago

📢 Exciting News! The Fourth Conference on Lifelong Learning Agents (CoLLAs 2025) will be held at the University of Pennsylvania (@Penn) in Philadelphia, USA 🇺🇸 🗓️ Important Dates: Abstract Deadline: Feb 21, 2025 Submission Deadline: Feb 26, 2025 Conference Dates: Aug 11 - Aug 14, 2025 We invite submissions that present new theories, methodologies, applications, or insights into algorithms and benchmarks designed for non-i.i.d. and non-stationary settings. Accepted papers will be published in the Proceedings of Machine Learning Research (PMLR). 📚 Full CFP: https://t.co/S8MwjLbOAr #CoLLAs2025 #AI #MachineLearning #ContinualLearning #LifelongLearning #ResearchConference #CallForPapers #NonStationaryLearning

CoLLAs_Conf's tweet photo. 📢 Exciting News! The Fourth Conference on Lifelong Learning Agents (CoLLAs 2025) will be held at the University of Pennsylvania (@Penn) in Philadelphia, USA 🇺🇸

🗓️ Important Dates:

Abstract Deadline: Feb 21, 2025
Submission Deadline: Feb 26, 2025
Conference Dates: Aug 11 - Aug 14, 2025

We invite submissions that present new theories, methodologies, applications, or insights into algorithms and benchmarks designed for non-i.i.d. and non-stationary settings. Accepted papers will be published in the Proceedings of Machine Learning Research (PMLR). 📚

Full CFP: https://t.co/S8MwjLbOAr

#CoLLAs2025 #AI #MachineLearning #ContinualLearning #LifelongLearning #ResearchConference #CallForPapers #NonStationaryLearning

1

71

25

26

57K

qingfeng_lan retweeted

Richard Sutton

@RichardSSutton

almost 2 years ago

A year later and our work on Loss of Plasticity is finally published, in Nature no less! The Nature version is totally rewritten and has many new results: https://t.co/QImypXpqQl Congratulations to the authors: @s_dohare @JFernandoHG @LanceLan3 @rahman_parash @rupammahmood

5

194

25

57

17K

qingfeng_lan retweeted

Amii @AmiiThinks

almost 2 years ago

It’s a deep learning problem that was ‘hidden in plain sight:’ A new Nature paper by Amii researchers explores why continual learning models can all of a sudden stop working, and what to do about it: https://t.co/WJ4Rgb8qoe #AI #ContinualLearning #MachineLearning

1

41

11

4

5K

qingfeng_lan retweeted

Marlos C. Machado @MarlosCMachado

almost 2 years ago

I couldn't be prouder of my colleagues at the @UAlberta! The work led by @s_dohare, in collaboration w/ J. F. H.-Garcia, @LanceLan3, @rahman_parash, @rupammahmood, & @RichardSSutton on continual learning and loss of plasticity is now published at @Nature! https://t.co/PaO8GPsVHq

7

134

23

34

9K

Qingfeng Lan @qingfeng_lan

almost 2 years ago

@anaik96 @mhmd_elsaye @RL_Conference @RichardSSutton nice catch!

0

2

0

341

Qingfeng Lan

@qingfeng_lan

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users