Lukas Böhm

Verified account

@lukas_undefined

Building at the boundary between possible and undefined

Joined November 2022

120 Following

27 Followers

56 Posts

@lukas_undefined

4 days ago

7 days, even at 12 hr/day, is much easier than it was a few years ago. Not only do I *want* to do that now, I actually *can*. Working with agents feels much easier cognitively. It’s a series of short mental sprints with waiting/rest periods in between instead of constant focus for hours. Using voice input helps a lot with that too.

0

1

0

0

279

@lukas_undefined

4 days ago

What happened there @thsottiaux ? Partial Codex usage reset overnight? First time I'm seeing this graph go down for a change

lukas_undefined's tweet photo. What happened there @thsottiaux ? Partial Codex usage reset overnight? First time I'm seeing this graph go down for a change https://t.co/l5A0UD2rnh

0

0

0

0

61

@lukas_undefined

6 days ago

@theo Watching this train wreck happen in real time while Codex keeps happily whirring in the background

0

1

0

0

201

@lukas_undefined

6 days ago

@mattpocockuk Commit Often, Yell “Goblins”. Clearly.

1

4

0

0

866

Who to follow

@lukas_undefined

8 days ago

More AX ergonomics: Agents must create `.spec.ts.md` sidecars for each test. They’re even closer to pure intent, easier for me to skim, and much less noisy for agents than the full `.spec.ts`. Those also make the auto-feedback loop very easy to discover: code change -> affected tests -> rerun those tests The actual `.spec.ts` can then be generated from the `.spec.ts.md` via a skill pretty reliably.

lukas_undefined's tweet photo. More AX ergonomics: Agents must create `.spec.ts.md` sidecars for each test.

They’re even closer to pure intent, easier for me to skim, and much less noisy for agents than the full `.spec.ts`.

Those also make the auto-feedback loop very easy to discover:
code change -> affected tests -> rerun those tests

The actual `.spec.ts` can then be generated from the `.spec.ts.md` via a skill pretty reliably.

0

0

0

0

65

@lukas_undefined

8 days ago

Taking this to the extreme: one seam, and it’s the UI. I’ve moved almost entirely to full E2E tests: user-perspective only, against a real pre-filled test DB that gets restored between each test. No mocks. Yes, they’re heavy and slow. But the verbosity doubles as specs, and agents have a much easier time picking up intent vs implementation.

1

5

0

1

915

@lukas_undefined

9 days ago

@thsottiaux So what you're saying is now is the time for /fast maxxing?

0

0

0

0

843

@lukas_undefined

10 days ago

I’d love a real benchmark for the “dumb zone” @mattpocockuk talks about. Not needle/haystack retrieval. More like: give the model real tasks while progressively filling context with semi-relevant prior conversation. Where does problem-solving quality actually start dropping, and how steep is the curve?

0

0

0

0

23

@lukas_undefined

10 days ago

@Dimillian In my experience, auto-running linters, IDE indexing, and tests are the biggest resource hogs. The agent’s actual code modifications shouldn’t be enough to saturate the SSD or CPU on their own, right?

0

0

0

0

852

lukas_undefined retweeted

11 days ago

@Polymarket Guys... you will not believe this...

Everlier's tweet photo. @Polymarket Guys... you will not believe this... https://t.co/Ixg7rHnsZx

35

4K

107

53

90K

@lukas_undefined

11 days ago

@Han_Akamatsu Feels like *that* Jaguar rebrand team got fired, pivoted to a design studio, and immediately landed Ferrari as a client

2

2

0

0

262

@lukas_undefined

11 days ago

@levelsio Impressive for +7 years!

0

0

0

0

81

@lukas_undefined

11 days ago

@benjamincowen Actually, that would be a very interesting tool: Redraw the price history and see how the MAs / BMSB would respond. What if we had an Oct 2025 blow-off top to $170k and then continued from there?

1

2

0

0

123

@lukas_undefined

11 days ago

@benjamincowen It’s already been very different: we never had a blow-off top. All of your MAs would look dramatically different now if we had. Given how atypical the "top" was, it feels like a pretty shaky basis for assuming the bottom will match prior cycles.

2

13

0

0

1K

@lukas_undefined

12 days ago

Being new on Twitter is wild. Days of talking to an empty room with zero reaction, then suddenly hundreds of people are actively agreeing with you. And this tweet? Probably straight back to the empty room. https://t.co/dnptFBEmaO

@lukas_undefined

13 days ago

@thsottiaux /overnight Give the goblins a task before bed. Wake up to the finished work and a little “what happened while you slept” report.

11

365

1

5

8K

0

1

0

0

41

@lukas_undefined

12 days ago

@YAHHPv2 @thsottiaux https://t.co/Vyn72JIOgt

@lukas_undefined

13 days ago

@TrentKelly2472 @thsottiaux The main point here is to use the existing batch API, aka they can process it whenever more capacity is available off-peak and in turn you get lower usage. /goal does not do this. /overnight /goal combination would be sweet though

1

26

0

0

539

0

0

0

0

90

@lukas_undefined

13 days ago

@TrentKelly2472 @thsottiaux The main point here is to use the existing batch API, aka they can process it whenever more capacity is available off-peak and in turn you get lower usage. /goal does not do this. /overnight /goal combination would be sweet though

1

26

0

0

539

@lukas_undefined

13 days ago

@Everlier At least you're no longer "absolutely right" about everything. Time to fork https://t.co/horagNxM0z for "load-bearing" I guess

0

1

0

0

82

@lukas_undefined

13 days ago

Is there already a good tool for composing AGENTS.md / CLAUDE.md from multiple sources? Thinking: shared team rules + personal prefs + repo-specific rules + tool-specific overrides, then generate/sync clean AGENTS.md, CLAUDE.md etc. Any recommendations?

0

1

0

0

45

Last Seen Users on Sotwe

Trends for you

Most Popular Users