Kyle Stachowicz @KyleStachowicz - Twitter Profile

Kyle Stachowicz @KyleStachowicz

24 days ago

Cool to see other labs getting use out of FAST!

DailyPapers

@HuggingPapers

25 days ago

AllenAI just released the MolmoAct2 FAST action tokenizer on Hugging Face.

Turns continuous robot actions into discrete tokens for training vision-language-action models.

Fully open and trained on millions of trajectories across five embodiments. https://t.co/ufBnYT510R
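
As a rough illustration of what "turning continuous actions into discrete tokens" means, here is a minimal sketch using plain uniform binning. This is a deliberately simpler technique than FAST and is not the MolmoAct2 tokenizer; the function names, the [-1, 1] action range, and the 256-bin vocabulary are all assumptions made for the example.

```python
from typing import List


def tokenize(actions: List[float], low: float = -1.0, high: float = 1.0,
             bins: int = 256) -> List[int]:
    """Map each continuous action value to an integer token in [0, bins - 1].

    Hypothetical helper: uniform binning over an assumed [low, high] range.
    """
    tokens = []
    for a in actions:
        a = min(max(a, low), high)                  # clip to the assumed range
        idx = int((a - low) / (high - low) * bins)  # index of the bin containing a
        tokens.append(min(idx, bins - 1))           # a == high falls in the last bin
    return tokens


def detokenize(tokens: List[int], low: float = -1.0, high: float = 1.0,
               bins: int = 256) -> List[float]:
    """Map each token back to the center of its bin."""
    width = (high - low) / bins
    return [low + (t + 0.5) * width for t in tokens]


actions = [-1.0, -0.25, 0.0, 0.5, 1.0]
tokens = tokenize(actions)
recovered = detokenize(tokens)
```

The round trip is lossy: each recovered value is off by at most half a bin width, which is the usual trade-off between vocabulary size and action precision.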

Kyle Stachowicz @KyleStachowicz

about 1 month ago

@GPTJustin If you have observations/tasks that look very different to the pretraining mix (you do), you'll probably need to train the whole model rather than just the action expert. Very little representation learning happens inside the action expert itself.

Kyle Stachowicz @KyleStachowicz

about 2 months ago

I love this @Ultraroboticsco OP1 design, super pragmatic choices. Tbh, the robot-on-a-robot setup maps to humans' dynamic range and usable workspace size better than a lot of full humanoids. Robots will have many shapes.

Jon Miller Schwartz

@JonMSchwartz

about 2 months ago

OP1 has all the moves. It was designed to be safe, productive, and easy to deploy. Many companies we've spoken with are concerned about legged and wheeled robots being tipped over; OP1's not getting knocked down. Many battery-powered robots need downtime to recharge or swap batteries; OP1 plugs directly into a standard power outlet and runs continuously, never needing to recharge. Many robots require difficult integrations, such as being bolted down to the floor; OP1 is on locking wheels and can be moved around easily when you need to move it. At Ultra, we're on a mission to make the world's most useful and deployable robot. OP1 is a big step (or in our case, extension) in that direction.

Kyle Stachowicz @KyleStachowicz

about 2 months ago

@fchollet linus torvalds mailing list style holywar ragebait is so 2010s and i will absolutely engage, i love jax

Kyle Stachowicz @KyleStachowicz

about 2 months ago

@bznotes integers not yet discovered internally

Kyle Stachowicz @KyleStachowicz

about 2 months ago

@HaoruXue It's not really a "rivalry" between VLA vs. WAM (vs. from-scratch) - when you have enough data, the thing that matters the most is how efficiently you learn from it, not what your model used to be before it was a policy. Most likely the best recipe will use parts of both.

Kyle Stachowicz @KyleStachowicz

about 2 months ago

Once you have an architecture that is able to soak up lots of data, the two things that determine downstream capabilities are (1) the data, and (2) your scaling constant. Lots of the tricks that would make for good academic papers just stop mattering as much at large data scales.

Kyle Stachowicz @KyleStachowicz

about 2 months ago

Really great thread from @lucy_x_shi about the origins of π0.7. We went into the project originally thinking that a hierarchical "world model"-style policy would be a great way to make the model better at generalization, but as we've scaled our data that gap has largely disappeared.

Lucy Shi @lucy_x_shi

about 2 months ago

1/ We just released π0.7 — a steerable generalist robot model with emergent capabilities. I want to share a bit of the backstory, because π0.7 taught me something surprising about where robot learning is heading. A thread on bittersweet lessons 🧵

Kyle Stachowicz @KyleStachowicz

about 2 months ago

For details on how we trained π0.7 (and some more videos of robots doing cool things - we're especially excited about the cross-embodiment transfer results), take a look at our tech report and blog post https://t.co/7egbOXjfut

Kyle Stachowicz @KyleStachowicz

about 2 months ago

Excited to share the latest model we've been training, π0.7: a highly steerable model that can be prompted to do almost any task, out of the box! This robot has never seen this air fryer - in fact, it's never seen *any* air fryer - but with some prompting it can use it perfectly!

Kyle Stachowicz @KyleStachowicz

about 2 months ago

It's not only a capable generalist policy, but it can also perform highly dexterous tasks right after pretraining! Here are some videos of the model cutting and peeling vegetables.

Kyle Stachowicz @KyleStachowicz

2 months ago

@julianboolean_ @ylecun Not sure what you mean - autoregressive is just a parameterization; it's independent of the training method (teacher-forced vs. on-policy)?

Kyle Stachowicz @KyleStachowicz

2 months ago

@julianboolean_ @ylecun Training on-policy allows you to recover from a mistake token (while training off-policy does not, necessarily)

Kyle Stachowicz @KyleStachowicz

2 months ago

@ylecun @julianboolean_ The exponential argument only applies if mistakes are unrecoverable, no?

Kyle Stachowicz @KyleStachowicz

2 months ago

@ylecun @julianboolean_ My pet peeve is posts using this slide to dunk on @ylecun. The (1-e)^n bound is about training LLMs off-policy; RL fixes it (for reasoning) by training on-policy. Kinda lame for a Turing award winner to be snarkposting instead of taking 15s to write a serious explanation though
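
The (1-e)^n argument in this thread can be made concrete with a toy model. The sketch below is an illustration only and does not come from the thread: it assumes each token independently goes wrong with probability `eps`, and adds a hypothetical `recover` probability of getting back on track at each step after a mistake. With `recover=0` it reproduces (1-eps)^n; with any `recover > 0` the on-track probability levels off near recover / (eps + recover) instead of decaying to zero.

```python
def p_on_track(eps: float, n: int, recover: float = 0.0) -> float:
    """Probability of being on track after n tokens in a two-state toy chain.

    eps:     assumed per-token probability of making a mistake while on track
    recover: assumed per-token probability of recovering once off track
             (0.0 means mistakes are unrecoverable)
    """
    p = 1.0  # start on track
    for _ in range(n):
        # stay on track with prob (1 - eps), or return from off track with prob recover
        p = p * (1.0 - eps) + (1.0 - p) * recover
    return p


unrecoverable = p_on_track(eps=0.01, n=1000)              # equals (1 - 0.01) ** 1000
recoverable = p_on_track(eps=0.01, n=1000, recover=0.1)   # approaches 0.1 / 0.11

print(f"unrecoverable: {unrecoverable:.2e}")
print(f"recoverable:   {recoverable:.3f}")
```

In this toy setting the exponential decay shows up only when `recover` is exactly zero, which matches the point made in the replies that the argument depends on mistakes being unrecoverable.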

Kyle Stachowicz @KyleStachowicz

3 months ago

@KyleVedder @vriishin @saurabhtwq @KyleMorgenstein robokyle group chat when

Kyle Stachowicz @KyleStachowicz

3 months ago

@gf_256 tl2x pipeline

Kyle Stachowicz @KyleStachowicz

3 months ago

Partial observability means that a robot policy - even with infinitely many demonstrations - will still be worse than the demonstrator. With MEM, we built a recipe to close this gap. Fantastic work led by @KarlPertsch @marceltornev @DannyDriess.

Physical Intelligence

@physical_int

3 months ago

We’ve developed a memory system for our models that provides both short-term visual memory and long-term semantic memory. Our approach allows us to train robots to perform long and complex tasks, like cleaning up a kitchen or preparing a grilled cheese sandwich from scratch 👇
