csgm @csgbwk - Twitter Profile

@DuinoDu Well the LoRA you see on the viz is a semi failed run. It goes to the cupcake but fails in the grasping most times. I'm working on a better way of visualizing it but it's not easy...

0

3

csgm

@csgbwk

about 18 hours ago

Using the same reprojection system: a live preview of the motion planning output of a VLA in 3D. pi0.5's action expert plans 50 steps in advance and replans before the end of the trajectory. This is what the planned end effector trajectory looks like.

csgm

@csgbwk

3 days ago

From one Aruco marker I got the relative positions of my wrist and global cameras and of my robot's kinematics chain. The Aruco is flat on the table so I can project the wrist camera's intrinsics on the table plane, and get an estimate of the wrist cam from global pixels only.

10

237

22

165

34K

7

104

8

75

11K

csgm

@csgbwk

about 2 hours ago

@John_lussier_ For an individual probably a cleaned up log of all your conversations with agents, but not sure this is worth it for one person. For a company idk man it depends, internal documentation, agent chat logs of the best employees in the company, sky is the limit!

1

0

25

csgm

@csgbwk

about 3 hours ago

I'm pretty sure a GLM5.2 LoRA'd on an internal dataset is by far the best model in the world right now and would not cost that much compared to opus/fable/5.5 pro API costs over time. If you're in a company pitch the idea!

ARC Prize

@arcprize

about 10 hours ago

GLM-5.2 from @Zai_org on ARC-AGI (Verified) - ARC-AGI-2: 22.8%, $0.25 - ARC-AGI-1: 77.0%, $0.19 Performance is comparable with GPT-5.4 & 5.5 (Low Reasoning Effort)

arcprize's tweet photo. GLM-5.2 from @Zai_org on ARC-AGI (Verified)

- ARC-AGI-2: 22.8%, $0.25
- ARC-AGI-1: 77.0%, $0.19

Performance is comparable with GPT-5.4 & 5.5 (Low Reasoning Effort) https://t.co/beYeeTpQJR

37

848

56

138

232K

2

12

1

6

3K

csgm

@csgbwk

about 3 hours ago

Qwen: "We did not distill ChatGPT." Pangram internal representations:

Elyas Masrour

@elyasbuilds

about 12 hours ago

Did you know? Pangram learns the difference between Claude, ChatGPT, and Gemini in its internal representations, even without being trained on it! This signal is increasingly recoverable throughout the network, reaching 91% accuracy on a simple linear probe!

elyasbuilds's tweet photo. Did you know?

Pangram learns the difference between Claude, ChatGPT, and Gemini in its internal representations, even without being trained on it!

This signal is increasingly recoverable throughout the network, reaching 91% accuracy on a simple linear probe! https://t.co/ucR4XjEvB6

68

1K

98

452

112K

0

60

csgbwk retweeted

Elyas Masrour

@elyasbuilds

about 12 hours ago

Did you know? Pangram learns the difference between Claude, ChatGPT, and Gemini in its internal representations, even without being trained on it! This signal is increasingly recoverable throughout the network, reaching 91% accuracy on a simple linear probe!

68

1K

98

452

112K

csgm

@csgbwk

about 3 hours ago

@pbshgthm My friend is using SAM3D for hands only with pretty good results! Surprisingly fast if you spend a bit of time doing basic inference optimizing (he got it running at 20Hz on a 3080 iirc)

1

2

0

1

18

csgm

@csgbwk

about 4 hours ago

@VMises76153 Pi0.5 outputs joint angles, not sure if you can finetune it on cartesian positions easily. I get the end effector pos just by computing the position of the end of the FK chain

0

16

csgm

@csgbwk

about 13 hours ago

@IterIntellectus No hands makes it useless for anything except dancing

1

0

71

csgm

@csgbwk

about 14 hours ago

(all my homies love @wandb )

0

1

0

94

csgm

@csgbwk

about 14 hours ago

Night Night! Hope you grow up to have more than 50% success rate!

1

5

1

652

csgm

@csgbwk

about 14 hours ago

@yacineMTB Escaping French hellish summer by going to Singapore (literal equator)

0

209

csgm

@csgbwk

about 14 hours ago

@DuinoDu Sadly it's bad at showing what happens at the actual failure points (contacts) but it's pretty cool!

1

0

80

csgm

@csgbwk

about 16 hours ago

(quote from wife)

0

1

0

54

csgm

@csgbwk

about 16 hours ago

Alternative tweet: Robot arm with an orange tongue furiously licks a plastic cupcake

csgm

@csgbwk

about 18 hours ago

Using the same reprojection system: a live preview of the motion planning output of a VLA in 3D. pi0.5's action expert plans 50 steps in advance and replans before the end of the trajectory. This is what the planned end effector trajectory looks like.

7

104

8

75

11K

3

15

3

6

2K

csgbwk retweeted

gabriel

@gabriel1

1 day ago

STOP HOLDING BACK WHEN PROMPTING you can literally one shot whatever feature in one prompt just yap for longer. aim to describe every thing you can possibly imagine in ONE prompt and obviously use voice. i often talk for 15minutes straight

213

5K

156

1K

383K

csgm

@csgbwk

about 18 hours ago

Arm: AgileX Nero Model: pi0.5 LoRA (openpi default setup, 100 demos of pick and place one of 3 colored cupcakes in the plate) Prompt: "Place the pink cupcake in the plate"

0

1

0

190

csgbwk retweeted

kache

@yacineMTB

1 day ago

Man, these language models suck at programming. I asked it to reverse engineer this entire mystery wifi blob with two physical devices as test harnesses and after three days and a few billion tokens it only figured out how to work around all of the upper layer wifi stack

26

404

3

43

24K

csgm

@csgbwk

Last Seen Users on Sotwe

Trends for you

Most Popular Users