🔥 Zero-shot generalization is the dream: adapt instantly, no fine-tuning. It's why LLMs blew up—but it's not just a language modeling thing. It’s happening in RL too.
🚨 @maxsbob21's new paper dives deep into zero-shot RL under shifting dynamics—and why current methods break.
My dearest friend Sasha @how_uhh, congrats on successfully defending your AI PhD! Everyone please address him as Dr. Nikulin in the comments from now onwards!
ps: who’d have thought during our freshman year as Sociology majors [pic taken ~10 yrs ago] that I'd posting this tweet?
👀 Action fine-tuning often blinds VLA models: they lose the visual–language (VL) priors that made them smart. We show how to keep those priors intact with a tiny alignment loss. 🤖
↓
I’m asking for help. I was meant to start my PhD with @_rockt and @robertarail at UCL, but my UK background check was refused. My appeal seems unlikely to succeed, so I’m urgently searching for any PhD or research positions in academia or industry. Any help is appreciated.
Had a blast together with @how_uhh at @LeRobotHF hackathon this weekend. Built phone-based teleoperation for my SO-100 arm using pose estimation. Here’s a quick BTS of the final demo with teleop working (+ a small victory dance 🎉)
Can complex reasoning emerge directly from learned representations? In our new work, we study representations that capture both perceptual and temporal structure, enabling agents to reason without explicit planning. https://t.co/gGdnAUixcv
Enabling useful teleoperation for complex dexterous robotic arm tasks in decent working spaces proved challenging with the open source SO100 design ($100 arms). Reach, degrees of freedom, and strength were insufficient. The next price point ($1000) felt unnecessary. We’ve been customizing low-cost arms ($300) for useful dexterous manipulation. Despite some challenges the teleop results are now really promising. Maybe $200-300 is the sweet spot. Curious about others’ experiences.
As boring as it sounds, I’m slowly realizing that 90% of success is doing the obvious thing for a painfully long amount of time without convincing yourself you’re smarter than you are.
Had a blast together with @how_uhh at @LeRobotHF hackathon this weekend. Built phone-based teleoperation for my SO-100 arm using pose estimation. Here’s a quick BTS of the final demo with teleop working (+ a small victory dance 🎉)
I know 2 min is forever for social media, but check out this video. Benjamin feeds himself a whole meal with FEAST---something he is not otherwise able to do. Very proud to be part of this project led by the legendary @rkjenamani