- Run history: GET /agent-evals/suites/:id/runs with limit param - Regression comparison: GET /agent-evals/suites/:id/regression - Detects 5%+ score drop between consecutive runs - Returns latest vs previous comparison + trend data - Release gate check: GET /agent-evals/suites/:id/gate - Checks if latest release-gate run passed threshold - Agent compliance report: GET /agent-evals/agents/:agentId/report - Aggregates pass rate, avg score, suite counts, recent runs - Eval scheduling: POST /agent-evals/suites/:id/schedule - Wires eval suite to job runner with cron expression - New repo functions: listRunsBySuite, listRunsByAgent - 1,324 tests passing (8 new) |
||
|---|---|---|
| .. | ||
| repository.test.ts | ||
| repository.ts | ||
| routes.test.ts | ||
| routes.ts | ||
| types.ts | ||