Lucas @quantbagel - Twitter Profile

Pinned Tweet

3 months ago

Robot action models shouldn't need 256 vision tokens per frame. Pi0.5 spends 400M parameters on SigLIP just to see. We replaced it with a 4.4M encoder that outputs 5 tokens — and action quality barely changes. 91x smaller. 51x fewer tokens. 7.3x faster inference.

https://t.co/GL8S6Qw3ow

22

368

31

272

23K

quantbagel retweeted

Arnaud Denis-Remillard

@dr8_unix

about 18 hours ago

@sean_pixel @interlatent @allen_ai We got the same model fully optimized on our inference engine. Running at less than 95ms full e2e with MolmoAct2-Think. Running anywhere in the US.

0

18

5

3

1K

Lucas

@quantbagel

4 days ago

On a china trip rn, feeling like the fear of the sz robots is somewhat overstated

1

4

0

169

Lucas

@quantbagel

4 days ago

We just labelled 1 million hours of visual data with a VLM at 40% the cost of other open source. If any data factory shops are interested we can get started fast

0

8

0

2

322

quantbagel retweeted

David Liu

@davidliuxyz

8 days ago

why hasn't anyone made these autonomous robots yet

85

280

7

21

19K

quantbagel retweeted

hardmaru

@hardmaru

8 days ago

For over a decade, we’ve accepted that end-to-end backprop is the only way to train deep networks. But holding the entire network in memory all at once is why AI training is hitting a resource wall. We found a new way to break the network into blocks and train them independently. The trick? Treating the network’s forward pass like a diffusion model denoising a signal. This reinterpretation slashes the memory needed to train deep models. In our #ICLR2026 paper (https://t.co/PK5h0mqQSo), we matched end-to-end performance across ViTs, DiTs, and LLMs. We did this while training just one isolated block at a time.

152

6K

646

4K

733K

quantbagel retweeted

PsudoMike 🇨🇦

@PsudoMike

13 days ago

Wealthsimple just launched a USD chequing account that works on both sides of the border. One account, two currencies, no conversion racket. Canadian banks had years to build this. They built fees instead. Now Wealthsimple has it. Good luck.

170

8K

424

2K

588K

Lucas

@quantbagel

12 days ago

Immigration is all you need

0

3

0

283

Lucas

@quantbagel

12 days ago

If you aren’t fighting for control over the lightcone ngmi

0

5

0

290

Lucas

@quantbagel

12 days ago

Landed in Canada and I haven’t had peace from drake in 12 hours

1

3

0

321

Lucas

@quantbagel

12 days ago

Coolest thing I have ever been a part of. Literally Neil Armstrong for the Noetic Dyson sphere

Prophetic

@PropheticAI

12 days ago

Meet @quantbagel. We technologically altered his perception and recall of dreams. He was a participant in our placebo/sham controlled study. User interview #3

11

136

3

74

21K

2

49

2

15

10K

quantbagel retweeted

Jason Gao

@jasonzgao

13 days ago

becoming a scholar was undoubtedly the best thing that happened to me in college. if you've ever had the desire to venture into startups, you'd be doing yourself a disservice by not applying :)

0

13

2

7K

quantbagel retweeted

Han Guo

@HanGuo97

13 days ago

LLM training is built on fast MatMuls. But many surrounding ops still run as memory-bound kernels. CODA reparameterizes them to hide in the matmul’s shadow, fused into its epilogue before results leave the chip. Bonus: LLMs can write fast CODA kernels too (approaching SoLs).

https://t.co/cOTeMUr4py

15

678

103

531

196K

Lucas

@quantbagel

13 days ago

One good girl is worth thousands of Anthropic secondaries

0

21

0

2

1K

quantbagel retweeted

Simon Eskildsen

@Sirupsen

14 days ago

turbopuffer crossed $100M run-rate in March. 19mo after $1M. Profitable & <$1M raised. Cursor・Anthropic・Notion・Cognition・Harvey・Bridgewater・Ramp・Linear・Legora・Superhuman・Atlassian・Granola. We’d be nowhere without them. We work like hell to exceed their expectations.

253

3K

149

1K

1M

quantbagel retweeted

Daniel

@danielgothits

14 days ago

these types of properties are gonna look insanely underpriced in a few years when everything is all about drone delivery, EVTOLs, autonomous airport pickup and dropoff, starlink, robotic/AI farm equipment and civil unrest keeps rising in cities

77

2K

62

903

899K

Lucas

@quantbagel

14 days ago

if any upcoming spaceX ballers wanna invest in what oai was originally supposed to be (robots), i’m on some pretty wild sht right now

0

13

1

0

763

quantbagel retweeted

Lucas

@quantbagel

14 days ago

Reflex is actively hiring world-class engineers/kernelDevs for robot inference, even if you have zero prior experience in physical AI. Smart humans figure it out fast. Please send ~3 bullet points demonstrating evidence of exceptional ability

7

63

4

37

13K
