David Mack

8 days ago

@ctorobotics @JonMSchwartz Both are key to task throughput. We’re doing interesting work to both speed up the robot but then also make it more accurate

0

16

8 days ago

@moss_here @JonMSchwartz Real Time chunking, and lots of experimenting with training

0

1

0

19

co-founder @ndea and @zapier @arcprize

8 days ago

@gracjan_goral @JonMSchwartz We generate 50 actions at 50 Hz but then execute them at 125 Hz

0

1

0

1

21

Who to follow

Mike Knoop

@mikeknoop

deepset, makers of Haystack

@deepset_ai

Creators of the @Haystack_AI open source AI Orchestration framework and Enterprise Platform, helping organizations build reliable production AI.

Haya Odeh

@HayaOdeh

I'm here to listen - cofounder and designer @replit (YC W18) where any idea can become big. Build it and test it on https://t.co/25TttCM0Vc

8 days ago

A thing i made

Jon Miller Schwartz

@JonMSchwartz

8 days ago

Everything here is 100% autonomous, 1x video playback speed. GO ULTRA mode boosts robot speed to 250%

22

229

16

63

20K

0

1

0

70

25 days ago

Nice articulation of what we've been experiencing first hand

avi

@avizurlo

25 days ago

https://t.co/8aO7KXyKpe

13

341

40

639

160K

0

2

0

140

about 1 month ago

@R2rule1 @chetan_ It possibly learned some hand-dominance from humans. Or, the stochastic flow matching happened to choose this modality. If the former, and this is something that has a production impact (unlikely), we could balance this out in the data mix and you'd get ambidextrous

0

20

about 1 month ago

When your training data has some off-task data in it..... you robot becomes more human

Chetan

@chetan_

about 1 month ago

emergent autonomous capability: fist bumps

6

97

4

19

11K

1

4

0

345

about 2 months ago

@benjamin_bolte Most of our large mature model markets have many open and closed source players (E.g. LLMs - OpenAI, Anthropic, Qwen, Lllama etc etc)

0

3

0

997

DavidHHMack retweeted

about 2 months ago

A closer look at Operator. It reaches across the entire work cell, up to 10 feet high and all the way down to the floor. Operator handles bags, mailers, and boxes with the dexterity to adapt to an endless variety of items on the fly. [1/4]

7

104

12

27

12K

about 2 months ago

🥰thanks for the @Ultraroboticsco shout-out

Y Combinator

@ycombinator

about 2 months ago

Physical Intelligence (@physical_int) is building a foundation model that can control any robot to do any task — what the team describes as the GPT moment for robotics. The company's cross-embodiment approach trains across many different robot platforms, and recent results show tasks being performed zero-shot that last year required hundreds of hours of data collection. In this episode of the @LightconePod , co-founder Quan Vuong (@QuanVng) sat down with @garrytan, @snowmaker, @sdianahu, and @harjtaggar to talk about why robotics is finally ready for its scaling moment, how PI runs its models in the cloud rather than on-device, and the playbook for what Quan sees as a Cambrian explosion of vertical robotics companies. 00:00 — Robotics just got cheaper 00:41 — The GPT moment for robotics 02:24 — Why robots didn’t work before 05:30 — The breakthrough that changed everything 09:12 — The data problem 13:33 — Robots learning without data 15:05 — Robots folding laundry (for real) 22:18 — From engineering problem → ops problem 29:12 — The startup playbook 38:46 — Thousands of robotics startups are coming

26

427

58

325

115K

0

80

about 2 months ago

We didn't design this robot in a lab, we designed it in real warehouses, doing real work

about 2 months ago

Introducing Operator, our newest industrial AI robot built to work, not demo. Operator handles your warehouse's most repetitive tasks: packing, sorting, and kitting. Up to 24 hours a day, with flexibility and consistency that allows businesses to scale quickly. This is what we've been building. ↓ [1/4]

20

380

52

179

87K

0

3

0

126

DavidHHMack retweeted

Andrew Jefferson

@EastlondonDev

about 2 months ago

Presenting Meridian: a line to connect deterministic compute and language model AI. From Neural Turing Machines and Differentiable Transformers to The Neural Computer, there’s a rich history of trying to combine traditional deterministic computation with the wildly different architecture of Artificial Intelligence. I’ve spent the last 4 weeks creating a single neural network that has the combined capabilities of a 4B param language model and a deterministic computation engine based on Web Assembly. It allows the AI deterministic integer computations up to 2^32, control flow (while loops and if statements) and a basic filesystem - all implemented as part of the transformer neural network, no external tool calls. With this architecture adding fewer than 1 million parameters to an existing 4B param language model I can take it from <20% accuracy on arithmetic with 4-digit numbers to 100% accuracy on 4 digit numbers and 99% accuracy on arithmetic up to 2^32 without adversely affecting the language model’s performance on non-mathematical tasks. The combined model can precisely execute a range of algorithms including checking number for primeness, finding the GCD of two integers and sorting arrays.

EastlondonDev's tweet photo. Presenting Meridian: a line to connect deterministic compute and language model AI.

From Neural Turing Machines and Differentiable Transformers to The Neural Computer, there’s a rich history of trying to combine traditional deterministic computation with the wildly different architecture of Artificial Intelligence.

I’ve spent the last 4 weeks creating a single neural network that has the combined capabilities of a 4B param language model and a deterministic computation engine based on Web Assembly. It allows the AI deterministic integer computations up to 2^32, control flow (while loops and if statements) and a basic filesystem - all implemented as part of the transformer neural network, no external tool calls.

With this architecture adding fewer than 1 million parameters to an existing 4B param language model I can take it from <20% accuracy on arithmetic with 4-digit numbers to 100% accuracy on 4 digit numbers and 99% accuracy on arithmetic up to 2^32 without adversely affecting the language model’s performance on non-mathematical tasks.

The combined model can precisely execute a range of algorithms including checking number for primeness, finding the GCD of two integers and sorting arrays.

16

203

43

150

44K

about 2 months ago

@icanvardar My agentic startup is very unique thank you

0

13

2 months ago

we has new thing to share!!!

2 months ago

Tune in and turn it up next week. We sure will be.

0

18

3

5

2K

0

1

0

44

DavidHHMack retweeted

Jon Miller Schwartz

@JonMSchwartz

2 months ago

Reliability and durability are one of the biggest hurdles to hyperscaling ai robots. This isn’t spoken about enough given its criticality

3

16

2

3

1K

2 months ago

@_joe_harris_ We definitely generate huge amounts of data and have very early stage infrastructure, but that doesn’t stop us training on all of the data we want to

1

0

1

299

2 months ago

@siddarthv66 Why?

0

345

DavidHHMack retweeted

Andrew Jefferson

@EastlondonDev

2 months ago

I’ve been working on combining a language model and a basic computer (based on web assembly) into a single AI model. One outcome of that is if the model generates programs or compute instructions, I don’t have to do round trips to the CPU, start a process, run the program/calculation and serialize and tokenize it before feeding the answer into the AI to get its response. I can do it in a continuous loop on the GPU. That’s already a pretty interesting performance characteristic for a certain kind of tool use BUT I realised I can do something even more interesting. When the machine outputs a compute instruction token, “multiply the two numbers at the top of the stack” for example (that’s a single token). I don’t need to wait for the compute to happen and print a response and enter it in to the input before I can generate the response token. I can just start the network generating the next token immediately after the “multiply” token was generated. Since the stack machine and the language model are part of the same neural network, running on a single GPU, I can start the model generating the next token right away and the language model first layers will run in parallel with the multiply computation (or whatever instruction it is). No waiting at all for it to compute. The output of the compute subnetwork goes into the mid layers of the language model allowing it to steer the next token generation even before it’s been emitted from the gpu and converted to human readable output. Concretely in my setup the neural wasm implementation runs in parallel with the first 10 layers of the language model and it’s working pretty well.

4

30

5

16

2K

3 months ago

So pumped to ship these guys!