Yali Du

12 days ago

Attending @AAMASConference! I’ll be at ASI & SE Workshop 25th, C-MAS 26th, and poster on 28th PM. Topics on scalable evaluation of hundreds of LLM agents, partner selection, and counterfactual effects for social dilemmas, led by @r_j_willis Stefan, Yudi, Shuqing! Come say hi!👋

438

yalidux retweeted

23 days ago

🌟 Spring Season Finale: AI Agent Frontier Seminar 🌟 Software engineering is undergoing a radical shift as agents move toward autonomous self-improvement. 💻✨ For our final talk of the season, we are thrilled to host UIUC Prof. Lingming Zhang @LingmingZhang to present: "Towards Self-Evolving Software Intelligence." Lingming’s group has pioneered LLM-based SE work adopted by Meta, Google, OpenAI, and DeepSeek. He’ll unpack the evolution of software agents, from live coding to continuous self-improvement with SWE-RL. 📅 Friday, May 15 | 12 PM ET / 9 AM PT 🔗 https://t.co/wQGQKmu8Lt 📍 Join: https://t.co/I02pRNd0Bd🔑 Passcode: 309194 Organizers: @yalidux @ShangdingG95714 @MingJin_AI #AIAgents #SoftwareEngineering #MachineLearning #DeepSeek

MingJin_AI's tweet photo. 🌟 Spring Season Finale: AI Agent Frontier Seminar 🌟

Software engineering is undergoing a radical shift as agents move toward autonomous self-improvement. 💻✨

For our final talk of the season, we are thrilled to host UIUC Prof. Lingming Zhang @LingmingZhang to present: "Towards Self-Evolving Software Intelligence."

Lingming’s group has pioneered LLM-based SE work adopted by Meta, Google, OpenAI, and DeepSeek. He’ll unpack the evolution of software agents, from live coding to continuous self-improvement with SWE-RL.

📅 Friday, May 15 | 12 PM ET / 9 AM PT
🔗 https://t.co/wQGQKmu8Lt
📍 Join: https://t.co/I02pRNd0Bd🔑 Passcode: 309194
Organizers: @yalidux @ShangdingG95714 @MingJin_AI #AIAgents #SoftwareEngineering #MachineLearning #DeepSeek

549

yalidux retweeted

MSL@Meta. I led PoT, MMMU, MMLU-Pro, MAmmoTH, General-Reasoner, VL-Rethinker, Pixel-Reasoner. I contributed to Gemini-2.5. Prev @GoogleDeepMind.

28 days ago

As AI moves from answering questions to taking complex actions, our evals are hitting a wall. The truth is: the quality of our evaluations directly shapes the quality of the agents we train. 📊🤖 We are thrilled to host Bing Liu @vbingliu (Head of Research @ Scale AI) for the next AI Agent Frontier Seminar to present: "Eval-Driven Agentic RL." Bing will unpack the lessons learned building major benchmarks like SWE-Bench Pro and Humanity's Last Exam (HLE), and show how rubric-based reward design translates directly into better RL training. 📅 5/8 12PM ET / 9AM PT 🔗 https://t.co/YnLvxIKx8x 📍 Join: https://t.co/I02pRNd0Bd 🔑 Passcode: 309194 Organizers: @yalidux @ShangdingG95714 @MingJin_AI #AIAgents #ReinforcementLearning #MachineLearning #LLMs

MingJin_AI's tweet photo. As AI moves from answering questions to taking complex actions, our evals are hitting a wall. The truth is: the quality of our evaluations directly shapes the quality of the agents we train. 📊🤖

We are thrilled to host Bing Liu @vbingliu (Head of Research @ Scale AI) for the next AI Agent Frontier Seminar to present: "Eval-Driven Agentic RL."

Bing will unpack the lessons learned building major benchmarks like SWE-Bench Pro and Humanity's Last Exam (HLE), and show how rubric-based reward design translates directly into better RL training.

📅 5/8 12PM ET / 9AM PT
🔗 https://t.co/YnLvxIKx8x
📍 Join: https://t.co/I02pRNd0Bd
🔑 Passcode: 309194
Organizers: @yalidux @ShangdingG95714 @MingJin_AI
#AIAgents #ReinforcementLearning #MachineLearning #LLMs

366

Who to follow

Wenhu Chen

@WenhuChen

Percy Liang

@percyliang

professor of computer science @Stanford @stanfordnlp, co-founder of @togethercompute, creator of https://t.co/7R5THVogW2, co-founder of @simile_ai, pianist

Jindong Wang

@jd92wang

AI/ML professor and researcher | Ass. Professor @williamandmary, Ex Senior Researcher @MSFTResearch. Generative AI, machine learning, large language models.

about 1 month ago

@ZiyanWang98 @fangf07 Congrats! Look forward to more exciting works!

yalidux retweeted

about 1 month ago

The pendulum of AI is swinging back. As pure end-to-end behavior learning hits its limits, the field is re-integrating search and explicit reasoning. 🧠⚙️ We are incredibly honored to host MIT Prof. Leslie Kaelbling for the next AI Agent Frontier Seminar on May 1st to present: "RL: Rational Learning." She was doing agentic AI way before it was cool. Join us as she revisits the rational-agent approach to building general-purpose, human-level intelligent robots. 🤖 📅 May 1 12 ET / 9PT 🔗 https://t.co/YnLvxIKx8x📍 Join: Agentic AI Frontier Seminar: Professor · Leslie Kaelbling · MIT Zoom link: https://t.co/I02pRNd0Bd Organizers: @yalidux @ShangdingG95714 @MingJin_AI #AIAgents #ReinforcementLearning #Robotics #MachineLearning

MingJin_AI's tweet photo. The pendulum of AI is swinging back. As pure end-to-end behavior learning hits its limits, the field is re-integrating search and explicit reasoning. 🧠⚙️
We are incredibly honored to host MIT Prof. Leslie Kaelbling for the next AI Agent Frontier Seminar on May 1st to present: "RL: Rational Learning."
She was doing agentic AI way before it was cool. Join us as she revisits the rational-agent approach to building general-purpose, human-level intelligent robots. 🤖

📅 May 1 12 ET / 9PT
🔗 https://t.co/YnLvxIKx8x📍 Join: Agentic AI Frontier Seminar: Professor · Leslie Kaelbling · MIT
Zoom link:
https://t.co/I02pRNd0Bd
Organizers: @yalidux @ShangdingG95714 @MingJin_AI
#AIAgents #ReinforcementLearning #Robotics #MachineLearning

about 1 month ago

How should multi-agent learning evolve in the era of LLMs and generative agents? I’m delighted to discuss this in my invited keynote at the @iclr_conf ICLR 2026 Wokshop on Multi-Agent Learning and Generative AI, at April 27, 09:15 BRT @MALGAI_ICLR2026 https://t.co/p97GqdwIUg

349

about 1 month ago

Congrats to @ysu_nlp for the exciting launch! Fully agree with the belief in the self-learning agent!

Yu Su

@ysu_nlp

about 1 month ago

Introducing @NeoCognition, the agent lab for specialized intelligence. Everyone needs experts, but human expertise does not scale. Backed by $40M seed funding, we build self-learning agents that specialize across domains to make expertise abundant.

874

134

365

186K

343

Dharmesh Tailor @dtailor17

about 2 months ago

Safe AI workshop @UncertaintyInAI 2026 is coming to Amsterdam. Welcome to submit and spread the word! 😀🚀 Deadline: 28th May Link: https://t.co/wzLerSnXtK Co organisers @pechenizkiy @dtailor17 @EmtiyazKhan Eric Nalisnick, Christos Louizos, Alvaro H.C. Correia

about 2 months ago

📢 We are thrilled to announce the 2nd Workshop on Safe AI, co-located with @UncertaintyInAI in Amsterdam 🌟Submit your latest works in Safe AI (deadline: May 28, 2026 AoE) We welcome both extended abstracts (4 pages) and recently accepted papers (original format).

dtailor17's tweet photo. 📢 We are thrilled to announce the 2nd Workshop on Safe AI, co-located with @UncertaintyInAI in Amsterdam

🌟Submit your latest works in Safe AI (deadline: May 28, 2026 AoE)

We welcome both extended abstracts (4 pages) and recently accepted papers (original format). https://t.co/sdrqVs8uuv

390

yalidux retweeted

NeurIPS Conference

@NeurIPSConf

2 months ago

We want to speak directly to the concern many of you have expressed, and we owe you a clear explanation of what happened, why it happened, and where we stand now. We understand this situation caused genuine alarm and we take that seriously. In preparing the NeurIPS 2026 handbook, we included a link to a US government sanctions tool that covers a significantly broader set of restrictions than those NeurIPS is actually required to follow. This error was due to miscommunication between the NeurIPS Foundation and our legal team; there was never an intention to restrict participation beyond our mandatory compliance obligations. The responsibility for that error is ours as an organization, and we deeply apologize for the alarm and impact this miscommunication had on our community. We have updated the link and clarified the text of our policy, which is consistent with that of ACM and IEEE, as well as other international conferences and NeurIPS in the past. As in previous years, NeurIPS welcomes submissions from all compliant institutions and individuals. We want to reiterate that NeurIPS is a community-driven event, created by and for the community, and strives to be inclusive. The NeurIPS 2026 organizing committee was particularly saddened to learn of this institutional miscommunication. The organizing committee has taken on the responsibility of running the conference this year with the goal of fostering open communication, knowledge sharing, and global scientific discourse. We thank the community for bringing this issue to our attention and working with us through this situation.

265

504

127

138

497K

2 months ago

Don’t miss the next talk on Agentic AI Frontier seminar by Prof Sergey Levine on Robot foundation models @svlevine

2 months ago

Vision-Language-Action (VLA) models are evolving fast. How do we move robots from following basic instructions to executing complex, multi-stage tasks with sophisticated test-time reasoning? 🤖🧠 We are incredibly honored to host Sergey Levine @svlevine for the next AI Agent Frontier Seminar to present: "Robotic Foundation Models." Sergey will discuss the leap from first-generation VLAs to models that handle diverse data modalities and advanced reasoning, outlining the true frontiers of the field. Date: This Friday 3/27 12pm ET/9am PT 🔗 https://t.co/ZbDRxzkaq7 📍 Join: https://t.co/x6PIDQtKl8 🔑 Passcode: 309194 Organizers: @yalidux @ShangdingG95714 @MingJin_AIl #Robotics #AIAgents #VLA #FoundationModels

MingJin_AI's tweet photo. Vision-Language-Action (VLA) models are evolving fast. How do we move robots from following basic instructions to executing complex, multi-stage tasks with sophisticated test-time reasoning? 🤖🧠
We are incredibly honored to host Sergey Levine @svlevine for the next AI Agent Frontier Seminar to present: "Robotic Foundation Models."
Sergey will discuss the leap from first-generation VLAs to models that handle diverse data modalities and advanced reasoning, outlining the true frontiers of the field.
Date: This Friday 3/27 12pm ET/9am PT
🔗 https://t.co/ZbDRxzkaq7
📍 Join: https://t.co/x6PIDQtKl8
🔑 Passcode: 309194
Organizers: @yalidux @ShangdingG95714 @MingJin_AIl
#Robotics #AIAgents #VLA #FoundationModels

178

167

19K

yalidux retweeted

3 months ago

Join us this week for the AI Agent Frontier Seminar with Graham Neubig (@gneubig) presenting "Lessons from the Trenches in Building Agents for Software Development." The talk will cover the foundational technologies behind software-based agents, including: • Tooling for model interfaces • Rigorous evaluation benchmarks • Training agentic models • Open problems in memory, task decomposition, and human-agent interaction 📅 3/13 Friday 12pm ET 📍 Join: https://t.co/x6PIDQtKl8 🔑 Passcode: 309194 Organizers: @yalidux @ShangdingG95714 @MingJin_AI #AIAgents #SoftwareEngineering #LLMs #MachineLearning

MingJin_AI's tweet photo. Join us this week for the AI Agent Frontier Seminar with Graham Neubig (@gneubig) presenting "Lessons from the Trenches in Building Agents for Software Development."
The talk will cover the foundational technologies behind software-based agents, including:
• Tooling for model interfaces
• Rigorous evaluation benchmarks
• Training agentic models
• Open problems in memory, task decomposition, and human-agent interaction
📅 3/13 Friday 12pm ET
📍 Join: https://t.co/x6PIDQtKl8
🔑 Passcode: 309194
Organizers: @yalidux @ShangdingG95714 @MingJin_AI
#AIAgents #SoftwareEngineering #LLMs #MachineLearning

yalidux retweeted

3 months ago

🚨 Tomorrow at 12 PM ET! We are thrilled to host @lifu_huang (UC Davis) for a talk on "Goodhart’s Revenge: Reward Hacking in RL-Tuned LLMs." Are our RLHF models truly aligned, or just hacking their proxy rewards? Join us to discuss sycophancy, code gaming, and how we can fight back with robust defenses. 📍 Join: https://t.co/Tf9AwfqtEG 🔑 Passcode: 309194 Organizers: @yalidux @ShangdingG95714 @MingJin_AI #RLHF #LLMs #AIAlignment #MachineLearning

501

yalidux retweeted

3 months ago

🚀 Happening Tomorrow! 🚀 We are thrilled to host @pulkitology (MIT) at the AI Agent Frontier Seminar! 📌 "Rethinking Post Training" Pulkit will challenge the pre-training/finetuning paradigm and discuss advances in continual learning (RL Razor, Self-Distillation Learning, SEAL, and more). 📅 Friday, Feb 27 | 12 PM ET 📍 Zoom: https://t.co/Tf9AwfqtEG 📷 Passcode: 309194 Organizers: @yalidux @ShangdingG95714 @MingJin_AI #AIAgents #MachineLearning #MIT #ContinualLearning

445

yalidux retweeted

4 months ago

Is AI safety too technical and western-centric? 🤖🛡️ This Friday, we are thrilled to host @MaartenSap (CMU) at the AI Agent Frontier Seminar! Maarten will discuss making AI safety more human-centric and culturally aware, covering tool-use safety and culturally offensive non-verbal communication. 🌍 📅 Friday, Feb 13🕛 12 PM ET / 9 AM PT📍 Zoom: https://t.co/Tf9AwfqtEG🔑 Passcode: 309194 Organizers: @yalidux @ShangdingG95714 @MingJin_AI #AIAgents #AISafety #LLM

858

4 months ago

@SarveshGharat12 Yes, the paper is available here https://t.co/0ZR5GU8ngt Feedback welcomed!

4 months ago

Huge congratulations to our group — Zihao, Shuqing, Lianghao, and Richard!🎉 Big thanks to all our collaborators. We’re excited to share three RL-pure (100%) projects, focusing on multi-agent social dilemma evaluation, coalition learning, and RL exploration. Stay tuned! 🚀

yalidux's tweet photo. Huge congratulations to our group — Zihao, Shuqing, Lianghao, and Richard!🎉 Big thanks to all our collaborators. We’re excited to share three RL-pure (100%) projects, focusing on multi-agent social dilemma evaluation, coalition learning, and RL exploration. Stay tuned! 🚀 https://t.co/q6FGCjxh19

4 months ago

A huge thank you to Prof Yu Su for taking the time to share his insights with our community. Looking forward to seeing you all there! Thanks to the amazing co-organisers @ShangdingG95714 and @MingJin_AI!

410

4 months ago

Agentic AI Frontier Seminar - Excited to welcome Prof Yu Su @ysu_nlp from (Ohio State University) on Friday 6 Feb. Title: Computer Use: Modern Moravec’s Paradox Time: 2026-02-06 · 09:00–10:00（PT）｜17:00–18:00（GMT）| Join us via Zoom —https://t.co/wkSe2vcJFt

yalidux's tweet photo. Agentic AI Frontier Seminar - Excited to welcome Prof Yu Su @ysu_nlp from (Ohio State University) on Friday 6 Feb.
Title: Computer Use: Modern Moravec’s Paradox
Time: 2026-02-06 · 09:00–10:00（PT）｜17:00–18:00（GMT）|
Join us via Zoom —https://t.co/wkSe2vcJFt https://t.co/qzVU89nU3F