episode traces
full execution logstool calls, outcomes
analysis agent
reads traces, spotsfailure patterns
harness patch
prompt edit, new toolor retry logic change
eval & deploy
re-run tasks, confirmmeasurable improvement