2 Commits

Author SHA1 Message Date
saravanakumardb1
da9ca9dc1a feat(extraction-service): add Ollama local model eval config and compare script
- Add evals/promptfoo.ollama.yaml: same 19 cases hitting the Ollama
  OpenAI-compatible API directly (no extraction-service needed); all
  assertions use inline JSON.parse(output) to handle the raw string
  response from Ollama
- Add evals/compare-evals.sh: runs Gemini + Ollama evals back-to-back and
  prints a side-by-side pass-rate comparison table
- Supports OLLAMA_MODEL env var (default: llama3.1:8b)
2026-02-19 12:19:24 -08:00
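
The commit above describes three behaviours: parsing the raw string response before asserting on it, defaulting OLLAMA_MODEL to llama3.1:8b, and printing a pass-rate comparison. A minimal Python sketch of those ideas follows. It is an analogue for illustration only and is not the contents of promptfoo.ollama.yaml or compare-evals.sh; the function names, the table layout, and the sample values in the usage note are assumptions.

```python
import json
import os

# Default taken from the commit message; everything else here is hypothetical.
DEFAULT_MODEL = "llama3.1:8b"


def resolve_model(env=None):
    """Return OLLAMA_MODEL if set and non-empty, otherwise the default."""
    env = os.environ if env is None else env
    return env.get("OLLAMA_MODEL") or DEFAULT_MODEL


def parse_output(output):
    """Parse a raw string response into Python data, like JSON.parse(output)."""
    return json.loads(output)


def pass_rate(results):
    """Fraction of passing cases; `results` is a list of booleans."""
    return sum(results) / len(results) if results else 0.0


def comparison_table(rates):
    """Format a provider -> pass-rate mapping as aligned text rows."""
    lines = [f"{'provider':<10} {'pass rate':>9}"]
    for name, rate in rates.items():
        lines.append(f"{name:<10} {rate:>9.1%}")
    return "\n".join(lines)
```

For example, `parse_output('{"priority": "high"}')["priority"]` can be compared against an expected value, and `comparison_table({"gemini": 0.75, "ollama": 0.5})` yields one row per provider. The rates in that call are made-up inputs, not measured results.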
saravanakumardb1
acd4c3542b feat(extraction-service): scaffold promptfoo eval suite with 19 test cases
- Add evals/promptfoo.yaml: HTTP provider hitting extraction-service API
  covering all 5 built-in tasks (transcript, triage, memory-insight,
  reflection-enrichment, bug-report-extraction)
- Add evals/fixtures/golden.json: machine-readable golden input/output fixtures
- Add evals/run-evals.sh: shell runner with health checks, auth token
  handling, task filtering, and CI mode
- Add evals/README.md: usage docs, prerequisites, cost estimates, CI integration
2026-02-19 12:19:16 -08:00
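
The task filtering mentioned for run-evals.sh can be pictured as selecting cases by task name. The sketch below assumes each case is a dict with a "task" key; that layout and the function name are hypothetical, since the commit does not show the structure of golden.json or the runner's flags. Only the five task names come from the commit message.

```python
# Task names are taken from the commit message; the case layout is hypothetical.
BUILT_IN_TASKS = (
    "transcript",
    "triage",
    "memory-insight",
    "reflection-enrichment",
    "bug-report-extraction",
)


def filter_cases(cases, task=None):
    """Return the cases for one task, or all cases when no task is given."""
    if task is None:
        return list(cases)
    if task not in BUILT_IN_TASKS:
        # Fail early on typos instead of silently running zero cases.
        raise ValueError(f"unknown task: {task!r}")
    return [case for case in cases if case["task"] == task]
```

Rejecting unknown task names up front is a design choice in this sketch: an empty selection would otherwise look like a run with nothing to report.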