Excited to introduce 🌠Orion: Towards Lab Automation with Computer-Using Agents.
Give it control of your lab computer💻, and it can use software, analyze any experiment images, browse databases on Chrome exactly like you, and work for hours to analyze your experiments.
🌎:https://t.co/5EAe8vEetl
��:https://t.co/D08hYrkJuG
We will have a guest talk from Cai Zhou. He is a second-year PhD in MIT EECS. "Continuous modeling in diffusion language models: HDLM and CCDD
". All are welcome to join via the following link.
https://t.co/ZlLDO5pKRH
Yifan Hou from ETH will be giving a talk titled "How Far Are We from Visual Language Understanding? Pitfalls and Progress in Vision-Language Models" at Tuesday Aug 26 2pm HKT. Link to talk: https://t.co/9TIt2Yvj2X
Jinjie Ni @NiJinjie from NUS will be giving a talk titled "Diffusion Language Models are Super Data Learners" at Friday Aug 22 11am HKT. link to talk: https://t.co/WxTUSok1in
Xinyu Yang from CMU will be giving a talk titled "Multiverse: Your Language Models Secretly
Decide How to Parallelize and Merge Generation" at Friday July 25 11am HKT (Thursday July 24 8pm PDT). Link to talk: https://t.co/Cdn9TGqWQ2
# 🚨 4B open-recipe model beats Claude-4-Opus
🔓 100% open data, recipe, model weights and code.
Introducing Polaris✨--a post-training recipe for scaling RL on advanced reasoning models.
🥳 Check out how we boost open-recipe reasoning models to incredible performance levels (65 → 79 on AIME25) through RL training on open-source data and academic-level resources.
📑Notion: https://t.co/k5ITJFzCe1
📗Blog post: https://t.co/Leth9PWSod
🤗Model & data: https://t.co/SVdfIwYTrU
💻Code: https://t.co/txg0qcywWi
Hongru Wang from CUHK will be giving a talk titled "Theory of agent: from definition to objective" at ⏰Wednesday 6.11 3pm HKT (Thursday 6.11 11am PDT). Link to talk: https://t.co/MobT0Rtsgl
🔥 Meet PromptCoT-Mamba
The first reasoning model with constant-memory inference to beat Transformers on competition-level math & code
⚡ Efficient decoding: no attention, no KV cache
⚡ +16.0% / +7.1% / +16.6% vs. s1.1-7B on AIME 24 / 25 / LiveCodeBench
🚀 Up to 3.66× faster
🎉Introducing our latest work: "ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows"
🤗 Huggingface: https://t.co/56CJS6Qzg9
🏠Homepage: https://t.co/jU6mHlFIoU
TLDR: We introduce ScienceBoard, featuring (1) a dynamic OS env with real scientific software (CLI + GUI), and (2) a human-validated benchmark spanning domains like biochem, astronomy, GIS, ATP, and more.
🧵[1/5]
Guanqi Jiang from UCSD will be giving a talk titled "Robots Pre-Train Robots: Manipulation-Centric Robotic Representation from Large-Scale Robot Datasets" at ⏰Friday 5.16 11am HKT (Thursday 5.15 8pm PDT). Link to talk: https://t.co/Wjw0A2E7HA
We are kicking off a series of seminars at @hkunlp2020. @siyan_zhao will be giving a talk titled "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning" at ⏰Friday 5.9 11am HKT (Thursday 5.8 8pm PDT). Link to talk: https://t.co/i9FsWYRNbZ