Problem/Motivation
From alex_ua at #3583109-2: Explore (Claude) hooks for deterministic checks such as coding standards:
In my opinion what would be great is a fully tested and community maintained review agent that worked in any of the big players as well as on the self hosted, international, &/or smaller LLMs.
Agreed. :)
Possible implementation, also from that comment:
From my own experience I've seen the most improvements in output to leveraging skills and workflow improvements:
* planning as a prerequisite to execution, full test passes a gate for success
* agent "swarms" each with a unique role and jtbd (agents are really just other chats with specific skills,
Cli tools, & mcps loaded)
* criticism, testing, and review of both plans and execution
* clear output documents that show what was done and allow others to easily replicate the outcome
* force it to work from lists like todos and make it fill in forms to prove it succeeded.
Comments