Telexed

#0412

#0412Agents & tools Hacker News · LLMlast week

`Statewright`: visual state machines to harden coding agents

50radar

StatewrightAgent control layer — enforces state-machine steps

Reliability is improved by shrinking the agent's action space, not inflating prompts. Statewright enforces phase-specific tools and transitions, making smaller models more usable and frontier models less wasteful; worth watching, but missing hard benchmark numbers.

Each state limits tools, iteration count, and valid transitions, so planning, editing, and testing are separated by protocol rather than prompt discipline.
The reported sweet spot starts around 13B+ models: below that they can follow the machine but still lose enough context to make bad edits.
A Rust runtime handles guards and transitions without another LLM in the loop, while MCP plugins expose the workflow to tools like Claude Code.
The practical bet is cost control and fewer failure spirals: giving the model 5 tools instead of dozens reduces token waste and off-policy behavior.

Source: github.com/statewright/statewrightRead original →