Can the Environment Speak for Itself? TΒ²-GRPO: A Turn-Trajectory Group Relative Policy Optimization for Caregiver Agents
Yutong Song, Jiang Wu, Pengfei Zhang, Wenjun Huang, Honghui Xu, Nikil Dutt, Amir M. Rahmani
https://t.co/LR164ZPe0P [ππ.π°πΈ]
Hybrid E-Assessment in Higher Education: Semi-Automated Grading of Paper-Based Written Examinations
Hartwig Grabowski, Michael Canz
https://t.co/G6X584AiwL [ππ.π°πΈ ππ.π²π ππ.π²π]
A Resilience-as-a-Service assessment framework for coordinated disruption response in interdependent urban transit systems
Sara Jaber, S. M. Hassan Mahdavi, Neila Bhouri, Mostafa Ameli
https://t.co/qyn52j6sBS [ππ.π°πΈ]
Inference-Time Conformal Reasoning with Valid Factuality Control for Large Language Models
Ting Wang, Yuanjie Shi, Yan Yan, Huan Zhang
https://t.co/4qd9d0T0A8 [ππ.π°πΈ]
π¬Accepted at ICML 2026
STAR: Rethinking MoE Routing as Structure-Aware Subspace Learning
Sumin Park, Noseong Park
https://t.co/JcUfN6YT4Z [ππ.π°πΈ ππ.π»πΆ]
π¬Accepted at ICML 2026
Q-Delta: Beyond Key-Value Associative State Evolution
Sumin Park, Seojin Kim, Noseong Park
https://t.co/0iHJToLWXM [ππ.π°πΈ ππ.π»πΆ]
π¬Accepted at ICML 2026
RAILS: Verification-Native Clearing For Agentic Commerce
Adrian de Valois-Franklin, Alex Bogdan
https://t.co/XALE4CQ2a0 [ππ.π°πΈ ππ.π²π ππ.πΌπ°]
Artificial Intelligence for Mathematical Reasoning: An Integrated Survey of Language Models, Neuro-symbolic Systems, and Verified Discovery
Syed Rifat Raiyan, Mohsinul Kabir, Hasan Mahmud, Md Kamrul Hasan
https://t.co/sjxUxWROR6 [ππ.π°πΈ ππ.π²π» ππ.π²π ππ.π»πΆ]
How AI Agents Reshape Knowledge Work: Autonomy, Efficiency, and Scope
Jeremy Yang, Kate Zyskowski, Noah Yonack, Jerry Ma
https://t.co/CdXNhZS4Q9 [ππ.π°πΈ ππππ.πΆπ½]
Act As a Real Researcher: A Suite of Benchmarks Evaluating Frontier LLMs and Agentic Harnesses in Research Lifecycle
Jiayu Wang, Weijiang Lv, Bowen Fu, Jing Fu, Jiayi Song, Lingyu Zhang, Lanxuan Xue, Luodi Chen, Zepeng Xin, Kaiyu Li, β¦
https://t.co/2qTYxXnSF3 [ππ.π°πΈ]