My RL team at @amazon NYC is looking for summer 2026 PhD interns! We apply RL to Amazon's supply chain and do publishable, open-ended research in meta learning, multi-agent RL, constrained RL, exploration, and LLMs+RL. Interested? Email your CV to [email protected] by Nov 30th!
The deadline for submitting your extended abstracts is fast approaching (Aug 21)! Having a poster will guarantee attendance and you don't want to miss out on what promises to be a really fun day with keynotes from @JohnCLangford@criticalneuro@ben_eysenbach and Ludovic Righetti!
🚨New workshop alert 🚨
Calling all RL researchers in the New York area 🧠🗽
Present your work at the first-ever New York Reinforcement Learning Workshop (NYRL), co-organized by Amazon, Columbia Business School & NYU Tandon School of Engineering.
https://t.co/v7MropcLcz
🚨 My RL team at Amazon is looking for PhD interns for this summer! We apply RL to the Amazon supply chain, and do open-ended research in topics such as exploration, multi-agent RL, OPE, and use RL with LLMs for problems such as theorem proving. DM if interested!
Excited to share our NeurIPS paper: "Sample-Efficient Agnostic Boosting" (link: https://t.co/fgXQhipeuQ)
This work advances agnostic boosting by proposing a more sample-efficient algorithm without compromising computational complexity. Here's a quick breakdown! (1/6)
LLM self-improvement has critical implications in synthetic data, post-training and test-time inference. To understand LLMs' true capability of self-improvement, we perform large-scale experiments with multiple families of LLMs, tasks and mechanisms. Here is what we found: (1/9)