Musi @musingit - Twitter Profile

about 20 hours ago

I've been circling a similar problem (what does durable learning look like with LLMs in the loop) - whether capability actually transfers to the learner. Your /teach skill and the Fable release prompted me to revisit this: I kept your mission/workspace skeleton, replaced learning records with a probe-graded capability ledger scheduled by FSRS, and added a log of every time the learner reaches for the answer before attempting. Here's what I ended up building: https://t.co/OVllYKuvAM - would appreciate your feedback!

0

1

0

1

114

Musi

@musingit

1 day ago

It's 7 markdown files and one Python script, portable to any agent that reads skills. I'm now using it to learn about post-trade settlement - would love your feedback if you give it a go. https://t.co/pG1Mwq7QlM

0

8

Musi

@musingit

1 day ago

I wrote about durable learning with a model in the loop: once a model sits inside, output stops being a signal of understanding, and what actually matters is whether the capability transfers to the learner. I wanted to test @claudeai Fable, so I tried turning that into a skill.

1

0

32

Musi

@musingit

1 day ago

The really important part is agency telemetry. The skill always answers when you ask, but it logs when you reached for the answer before attempting so it's framed like a training log rather than a conscience. Over weeks your independence compounds next to your capability.

1

0

12

Who to follow

SoulDaddio

@SoulDaddio

Day trader by grind. Creator by soul TG: https://t.co/Z9cuQMQPa1

Dan Gabriel Olteanu

@danolteanu_ro

mă chinuie talentul antreprenorial de vreo 30 si ceva de ani…#numălas

7 days ago

Hey @claudeai: feature suggestion - user profiles with on/off memory per profile. Currently memory is account-wide, so either everything gets pooled together, or there's no memory across your chats. Projects and Incognito get close, but do not actually close the gap.

0

23

Musi

@musingit

23 days ago

Live at https://t.co/EkT20pO9AZ. Excited to test this over the next few weeks as I get ready for returning to sport!

0

55

Musi

@musingit

23 days ago

I'm 10 months post-ACL surgery - so the final stage of recovery (thankfully!), working on pivoting for tennis and padel. This needs un-cued training, because the hard part of those sports is reacting to things you didn't plan for. Cued, predictable drills do not help with this.

2

1

0

90

Musi

@musingit

23 days ago

Most of the work was getting the spec right - what counts as truly un-cued, where the random draws need to happen, what the active screen should look like from 3 meters away. Built it in 10 mins with @claudeai

1

0

56

Musi

@musingit

about 2 months ago

Published the spec for Prism's HTTP API and JSON log format in a SPEC.md at the project root, so anyone can build an SDK in any language against the same UI server. Python is today's official SDK; the TypeScript port is next on the list. Interested to hear feedback on the spec, especially from anyone wanting to take a swing at a community SDK before I get to TS. https://t.co/9gJTeuAwZ0

0

1

0

157

Musi

@musingit

about 2 months ago

I audited the 9 running versions of my @polymarket weather trading bot. A 285:1 longshot that resolved yes hit every version, skewing the PnL by 75%. A trade that every version catches cannot rank them - I found that the comparison was more valuable in areas of disagreement.

0

1

0

175

Musi

@musingit

about 2 months ago

@agispas Why only designers? 😁

0

3

0

90

Musi

@musingit

about 2 months ago

@aaronjmars This cannot be real

0

1

0

2K

Musi

@musingit

about 2 months ago

The harder problem is measuring transfer itself, not just retrieval. Agents could change this - a model that can probe what a learner actually understands, not just what they produce, might get closer to measuring transfer than anything we have now.

0

68

Musi

@musingit

about 2 months ago

I've been thinking about what durable learning looks like when a model can answer almost anything. Not what it means to learn faster - what it means to actually know something when the answer is always a query away.

1

0

117

Musi

@musingit

about 2 months ago

With @BrainbankSpace I've tried to design for this: AI generates & explains, the spaced repetition loop forces internalisation. The capability should live in the user, not in the tool. This handles the retrieval side, but it does not solve or measure the transfer of capability.

1

0

94

Musi

@musingit

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users