learning_ai_common_plat/__LOCAL_LLMs
saravanakumardb1 2d9475bd15 feat(local-llm): Sprint 2 — streaming pull progress, token metrics, fixes (B2/F1,F6,B7,B8)
New features:
- B2/F1: Streaming model pull with real-time progress bar. New
  /api/ollama/pull/route.ts pipes NDJSON from Ollama stream:true.
  UI shows status, completed/total bytes, and percentage during download.
- F6: Token/s metrics after prompt generation. Parses eval_count and
  eval_duration from the final NDJSON chunk. Displays tok/s, total
  tokens, and duration in the prompt modal footer.

Bug fixes:
- B7: Parse vm_stat page size from output instead of hardcoding 16384.
  Reads 'page size of N bytes' from the first line for portability.
- B8: Whisper model discovery now scans multiple directories:
  WHISPER_MODELS_DIR env var, ~/whisper-models, /opt/homebrew/share/
  whisper-cpp/models/, ~/.cache/whisper/. Returns the first dir with
  .bin files found.
2026-02-19 15:16:33 -08:00
..
dashboard feat(local-llm): Sprint 2 — streaming pull progress, token metrics, fixes (B2/F1,F6,B7,B8) 2026-02-19 15:16:33 -08:00
docs docs(local-llm): mark Sprint 1 items complete in DASHBOARD_REVIEW.md 2026-02-19 15:13:43 -08:00
LOCAL_LLMs_setup_mac_m4_48gb.md docs(local-llm): update original setup doc to redirect to docs/ structure 2026-02-19 13:01:35 -08:00