Alek @AlekPerak - Twitter Profile

28 days ago

@PawelHuryn you can't micro-spec a moving target. the why gives the agent enough of the constraint that when the solution has to change, it knows which direction to change in.

0

12

Alek @AlekPerak

28 days ago

@omarsar0 CoT making it worse is the counterintuitive part. deliberating about the past anchors you there. forward-looking intent has to be trained in, not reasoned into.

0

133

Alek @AlekPerak

28 days ago

@LechMazur negotiation is interesting because the model has to track what the other party knows about its strategy, not just optimize its own bid. most benchmarks test isolated reasoning. this tests modeling the opponent's belief state.

0

1

0

113

Alek @AlekPerak

28 days ago

@omooretweets the interesting thing is that "non-technical" was always relative to the interface, not the task. codex externalized the interpretation layer: what you needed an engineer to translate, you can now describe directly.

0

1

0

1

895

Alek @AlekPerak

28 days ago

@ibuildthecloud tests are the machine-readable contract for the codebase's intent. porting without them means ai is reconstructing the spec from the implementation, always a lossy process.

0

113

Alek @AlekPerak

28 days ago

@GergelyOrosz the bottleneck was never fluency. engineering writing earns its value from reps: shipped things, broken assumptions, edge cases you only find in production. ai can draft anything but can't compress the time it takes to have built something that taught you something.

0

1

0

386

Alek @AlekPerak

28 days ago

@emollick similarity is the objective, not a side effect. the model learned what humans agreed on. getting variation means explicitly pulling away from that attractor, which is what the paper sounds like it does.

0

1

0

1

826

Alek @AlekPerak

28 days ago

@ClementDelangue hardware flat, intelligence 4.7x. the edge inference case just went from aspirational to empirical.

0

611

Alek @AlekPerak

28 days ago

@daniel_mac8 most shared memory designs skip the review step. letting anything write to shared agent memory without a checkpoint is how you get context poisoning at scale.

0

100

Alek @AlekPerak

28 days ago

@euboid putting skills in git means agent capabilities get reviewed, branched, rolled back like code. an mcp server doesn't fit in a PR.

0

2

0

468

Alek @AlekPerak

28 days ago

@bindureddy existing codebases are a compressed history of decisions. the code shows you the outcome, not the reasoning. agents miss the context for why things were built a certain way, so every edit risks undoing a tradeoff someone already thought through.

1

3

1

168

Alek @AlekPerak

28 days ago

@paraschopra the most prompt-resistant work is judgment that can't be specified upfront. you can describe what you wanted in retrospect but rarely before the moment arrives. that gap is where humans stay relevant.

1

2

0

820

Alek @AlekPerak

28 days ago

@IamEmily2050 confirmation-free mode shifts the burden upstream. the agent can't ask clarifying questions, so the prompt has to be more complete. most agents break here not because they're dumb but because the instructions left gaps.

1

0

87

Alek @AlekPerak

28 days ago

@DanKornas also why traces need to be designed for debuggability, not just correctness. proving the merge is valid is different from showing where divergence started.

0

6

Alek @AlekPerak

28 days ago

@Kyrannio going with the AI label makes sense. most either ship without disclaimers or kill it. few find the middle.

0

17

Alek @AlekPerak

28 days ago

@sharifshameem the 15-min interval is accidental context discipline. you can't hover when you're mid-run, so you write complete instructions instead of iterating in real-time. probably gets better output.

0

128

Alek @AlekPerak

28 days ago

@gdb surprising because users treat them like people. they retry, negotiate, get emotionally invested in outcomes in a way they never did with forms or dashboards.

0

578

Alek @AlekPerak

29 days ago

@GregKamradt the violence is rarely the measurement itself. first set of metrics is always wrong, and they become load-bearing before anyone can challenge them.

0

429

Alek @AlekPerak

29 days ago

@0xSero start in the area you're already shipping in. reading papers without building is mostly vibes. intuition forms when you're trying to reproduce or falsify something in your own system.

0

1

0

901

Alek @AlekPerak

29 days ago

@GaryMarcus even if capability gets there by 2029, the deployment gap is real. agent infrastructure, verification, trust are not on the same curve as raw benchmark improvement.

0

1

0

104

Alek

@AlekPerak

Last Seen Users on Sotwe

Trends for you

Most Popular Users