Bug fixes:
- Fix Google Fonts build error (corporate proxy blocks fonts.gstatic.com)
by removing Geist font imports and switching to system font stack
- Fix system API 7.6s latency by caching static info (chip, GPU, brew)
with timeouts on shell commands — now responds in ~50ms
New features:
- Streaming prompt responses via NDJSON proxy (/api/ollama/stream)
with typing cursor animation and auto-scroll
- Model pull UI: input field + button to download new models
- Model delete with two-step confirmation dialog
- VRAM usage and expiry time display for loaded models
- Toast notifications (success/error/info) with slide-in animation
- Copy response button in prompt modal
- Escape key closes modals, backdrop click dismisses
- Pull/delete/show actions added to Ollama API route