I am sorry but this implementation is not dynamic workflows.
The dynamic, emergent nature of later phases (where phase N's shape is determined by phase N-1's output) is a defining characteristic of Claude Code's workflow model, and it's one of the places where pi-dynamic-workflows currently takes a simpler, more static approach.
/ralph loop => A script maintains the loop and force the inner agent to solve a single task per iteration.
/autoresearch => The vanilla autoresearch requires the agent maintain the loop and optimize a slice of code per iteration. Some models are struggling of maintaining the loop after 5~10 iterations.
/goal => lightweight /ralph + /autoresearch
Pi Agent is easy to implemente all of them through its extensions system.
@badlogicgames
looked into /goal in both cofex and claude codr and all i'm seeing are inferior versions of autoresearch. what am i missing?
available from the @ShopifyDevs folks as a pi extension.
https://t.co/ml6PfaS6UM
@badlogicgames@ShopifyDevs basically yes, /goal an inferior version of auto research. but it solves how to force the agent iterates in the loop by sending continuous prompt.
which is more lightweight for user than writing the loop script. some models can't maintain the loop by itself.
ds-v4-flash is struggling with invoking a customized `send_message` tool. I defined 10 eval examples and splitted them into `train` and `test` and ask the agent optimize tool description, guidelines to make it work in `train`.
After 10 iterations, the model still can't pass the train cases... so i ask the agent to include the tool_name into optmization scope ....
Does the agent overfit on specific harness too much? or i missed some keypoints?
Claude Code 2.1.139 added /goal
You set a completion condition and Claude keeps working across turns until it's met
Works in interactive, -p, and Remote Control 👏
/goal + autoresearch
The agent has been working continuously for 24 hours and keeps climbing the leaderboard. The goal is set to reach 5,000 points on the leaderboard.
I don’t know how long this will take. let's watch.
BTW, I've restricted its network access.