learning_ai_common_plat

Author	SHA1	Message	Date
saravanakumardb1	cd6e561f1b	docs(local-llm): consolidate dashboard docs into dashboard/docs/ - Created DASHBOARD_PRD.md — full updated PRD with current 19-file architecture, all 10 API routes, UI layout, data flow, localStorage keys, security model, and v1-v3 changelog. - Created DASHBOARD_ROADMAP.md — phased implementation plan for N1-N15 improvements across 4 phases: pre-load intelligence, rich metadata, model intelligence badges, runtime metrics. Includes acceptance criteria and implementation details per item. - Updated DASHBOARD_REVIEW.md — refreshed file inventory to 19 files (~2,930 lines), fixed broken Tier B markdown table, added cross-links. - Replaced __LOCAL_LLMs/docs/05-mission-control-dashboard.md with redirect pointer to new dashboard/docs/ location. Dashboard docs are now co-located at __LOCAL_LLMs/dashboard/docs/: - DASHBOARD_PRD.md (product requirements) - DASHBOARD_REVIEW.md (audit + 39 completed items + N1-N15 proposals) - DASHBOARD_ROADMAP.md (phased implementation plan)	2026-02-19 22:54:18 -08:00
saravanakumardb1	1552006feb	fix(local-llm): proxy extraction health check through API route Move extraction service health check from direct browser fetch (http://localhost:4005/health) to server-side /api/extraction/health proxy. Eliminates ERR_CONNECTION_REFUSED console errors when the extraction service is not running locally.	2026-02-19 15:53:02 -08:00
saravanakumardb1	8bdd5ee1c8	feat(local-llm): Sprint 7 — all remaining features (F5,F7,F8,F12,F13,F15,CQ5,S3) Features: - F5: Model comparison side-by-side — after a prompt response, click any other model to compare. Responses display in two-column grid. - F7: System resource sparklines — memory usage ring buffer (30 points) with SVG sparkline component in the memory stats card. - F8: Ollama logs viewer — collapsible terminal-style panel below main grid. Fetches from /api/ollama/logs route. Color-coded by level. - F12: Whisper transcription test — file upload button in Whisper panel. Uploads audio to /api/whisper/transcribe, displays text + latency. - F13: Responsive mobile layout — p-3/sm:p-6 padding, gap-3/sm:gap-4, hidden sm:inline for header text, responsive comparison grid. - F15: Extraction service panel — health check to localhost:4005 on each refresh. Status card in right column with endpoint + service. Code quality: - CQ5: Skeleton shimmer loading UI — 4 skeleton cards shown while initial data loads. Uses CSS shimmer animation from globals.css. Security: - S3: Documented CORS/auth assumption in code comment — dashboard is local-only, no auth needed for dev tool. New files: - components/Sparkline.tsx — reusable SVG sparkline component - api/ollama/chat/route.ts — streaming chat endpoint (from Sprint 6) - api/ollama/logs/route.ts — Ollama log file reader - api/whisper/transcribe/route.ts — Whisper STT test endpoint	2026-02-19 15:44:20 -08:00
saravanakumardb1	ed93a6f0af	feat(local-llm): Sprint 6 — major feature batch (CQ2,CQ5,CQ6,P5,F4,F10,F14,F16) Code quality: - CQ2: Add CSS utility classes (text-primary/secondary/tertiary, bg-, btn-, input-base) to globals.css — reduces inline style repetition - CQ5: Add skeleton shimmer animation CSS for loading states - CQ6: Replace manual model name validation with Zod schema (PostBodySchema) in Ollama API route Performance: - P5: Eagerly warm static cache on module load — system_profiler no longer blocks first dashboard request Features: - F4: Chat mode with multi-turn conversation via new /api/ollama/chat streaming route. Chat bubble layout, system prompt input, message history. Toggle between prompt/chat modes in modal. - F10: Dark/light theme toggle with CSS var overrides in :root.light. Sun/Moon button in header, persisted in localStorage. - F14: Model tags (coding, chat, fast, vision, reasoning) with colored toggle badges in expanded model details. Persisted in localStorage. - F16: Auto-load preferred model — star toggle in expanded details. When Ollama is online but no models loaded, auto-loads the starred model. Persisted in localStorage.	2026-02-19 15:38:06 -08:00
saravanakumardb1	b1fda3a1a5	perf(local-llm): Sprint 5 — request dedup + cache TTLs (P1, P2, P3) Performance fixes: - P1: Add fetchingRef guard to fetchAll() — prevents duplicate requests from rapid Refresh button clicks or overlapping interval ticks - P2: Add 5-minute TTL to staticCache (chip, GPU, brew packages) — previously cached indefinitely per server process, now refreshes after brew upgrades without requiring a restart - P3: Add 60-second TTL cache for Ollama models disk usage (du command) — previously traversed ~/.ollama/models on every 15s refresh cycle, now reuses cached value for 60s	2026-02-19 15:28:07 -08:00
saravanakumardb1	9c2f5f3396	feat(local-llm): Sprint 4 — UX enhancements (F2, F3, F9, F11) New features: - F2: Model search/filter — search input above models list (shown when 4+ models installed). Filters by name, family, and quantization level. Press / to focus the search input. - F3: Prompt history — saves last 20 prompts to localStorage with model name and timestamp. History dropdown in prompt modal with one-click re-run. Toggle via clock icon in textarea. - F9: Modelfile viewer — expanded model details now fetch and display the Modelfile via the show action. Collapsible <details> element with syntax-highlighted pre block. - F11: Keyboard shortcuts panel — press ? to toggle. Shows all shortcuts: ? (help), R (refresh), / (search), Esc (close/cancel), Cmd+Enter (send). Shortcuts only fire when not in an input field.	2026-02-19 15:25:43 -08:00
saravanakumardb1	75a3cd0826	refactor(local-llm): Sprint 3 — component extraction, error boundary, security (CQ1,CQ3,CQ4,S1,S2) Component extraction (CQ1): - lib/types.ts: All interfaces (OllamaData, SystemData, Toast, etc.) - lib/format.ts: formatBytes, formatUptime utilities - lib/ollama-config.ts: Shared OLLAMA_URL constant - components/StatusDot.tsx: Status indicator component - components/ProgressBar.tsx: Progress bar component - page.tsx: Now imports from extracted modules, reduced from 1180 to 1077 lines (interfaces + utilities + sub-components removed) Error boundary (CQ4): - error.tsx: Next.js App Router error boundary with styled error UI, stack trace preview, and 'Try again' button Shared config (CQ3): - All 3 Ollama API routes now import OLLAMA_URL from lib/ollama-config.ts instead of duplicating the env var fallback Security (S1): - Add MODEL_NAME_RE regex validation on POST /api/ollama — rejects invalid model names before passing to Ollama API Security (S2): - Replace exec() with execFile() for brew package version check — prevents shell injection if targets list ever becomes dynamic	2026-02-19 15:21:22 -08:00
saravanakumardb1	2d9475bd15	feat(local-llm): Sprint 2 — streaming pull progress, token metrics, fixes (B2/F1,F6,B7,B8) New features: - B2/F1: Streaming model pull with real-time progress bar. New /api/ollama/pull/route.ts pipes NDJSON from Ollama stream:true. UI shows status, completed/total bytes, and percentage during download. - F6: Token/s metrics after prompt generation. Parses eval_count and eval_duration from the final NDJSON chunk. Displays tok/s, total tokens, and duration in the prompt modal footer. Bug fixes: - B7: Parse vm_stat page size from output instead of hardcoding 16384. Reads 'page size of N bytes' from the first line for portability. - B8: Whisper model discovery now scans multiple directories: WHISPER_MODELS_DIR env var, ~/whisper-models, /opt/homebrew/share/whisper-cpp/models/, ~/.cache/whisper/. Returns the first dir with .bin files found.	2026-02-19 15:16:33 -08:00
saravanakumardb1	2da67c2f74	fix(local-llm): Sprint 1 — critical dashboard bug fixes (B1,B3-B6,B9-B11,P4) Bug fixes: - B4: Escape key now respects streaming state — during active stream, Escape aborts the generation instead of closing the modal - B5: Auto-refresh (15s interval) pauses during streaming and pull operations to prevent background churn and UI flicker - B9: Add AbortController to streaming fetch — closing modal or pressing Escape cancels the underlying HTTP request, saving CPU/bandwidth - B1: Header subtitle now dynamically shows chip name and RAM from the system API instead of hardcoded 'Apple M4 Pro · 48 GB' - B11: Escape handler clears promptText and promptResponse on close - B6: Toast IDs use Date.now()+random instead of incrementing ref (prevents collision on HMR remount) - B10: Brew panel distinguishes 'Loading...' (system=null) from 'No tracked packages found' (system loaded, empty array) - B3: Remove dead non-streaming generate action from Ollama API route - P4: Add 5-second AbortController timeout to all fetchOllama() calls to prevent indefinite hangs when Ollama is unresponsive	2026-02-19 15:12:41 -08:00
saravanakumardb1	43f8103c5a	fix(local-llm): show accurate macOS memory (app vs cached vs free) Replace Node.js os.freemem() with vm_stat parsing for macOS. The old approach reported ~47.7 GB / 48 GB 'used' because os.freemem() only counts truly free pages, ignoring ~20 GB of inactive/reclaimable cache. New memory breakdown: - App Memory: active + wired + compressor (actual process usage) - Cached: inactive + purgeable + speculative (reclaimable on demand) - Available: free + cached (what apps can actually use) - Pressure: normal/warning/critical based on app memory ratio Dashboard UI updated to show app memory, cached (reclaimable) label, and pressure-based color coding on progress bars.	2026-02-19 13:22:17 -08:00
saravanakumardb1	970b565026	fix(local-llm): dashboard v2 — streaming prompts, model management, perf fixes Bug fixes: - Fix Google Fonts build error (corporate proxy blocks fonts.gstatic.com) by removing Geist font imports and switching to system font stack - Fix system API 7.6s latency by caching static info (chip, GPU, brew) with timeouts on shell commands — now responds in ~50ms New features: - Streaming prompt responses via NDJSON proxy (/api/ollama/stream) with typing cursor animation and auto-scroll - Model pull UI: input field + button to download new models - Model delete with two-step confirmation dialog - VRAM usage and expiry time display for loaded models - Toast notifications (success/error/info) with slide-in animation - Copy response button in prompt modal - Escape key closes modals, backdrop click dismisses - Pull/delete/show actions added to Ollama API route	2026-02-19 13:03:11 -08:00
saravanakumardb1	2565714c52	feat(local-llm): add Mission Control dashboard v1 Next.js 16 dashboard for monitoring and managing the local LLM stack. Runs on port 3100 with dark theme using ByteLyst design tokens. API routes: - GET/POST /api/ollama — model list, running status, load/unload/generate - GET /api/whisper — binary discovery, GGML model inventory - GET /api/system — chip info, RAM/disk usage, brew package versions Dashboard UI: - Top stats row: Ollama status, model count, Whisper status, RAM usage - Ollama models panel with load/unload actions, LOADED badge, details - System panel with progress bars for RAM and disk - Whisper.cpp panel with binary list and model inventory - Brew packages panel with version tracking - Basic prompt modal with Cmd+Enter shortcut - Auto-refresh every 15 seconds Also excludes __LOCAL_LLMs/ from root ESLint config (dashboard has its own config and uses browser globals not available in Node.js context). Tech: Next.js 16, React 19, TailwindCSS v4, Lucide icons, TypeScript	2026-02-19 13:02:48 -08:00
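
The token/s metric from Sprint 2 (F6, commit 2d9475bd15) reads eval_count and eval_duration from the final NDJSON chunk. A minimal sketch of that calculation follows, assuming eval_duration is in nanoseconds as the Ollama API documents it. The names GenerationMetrics, isFinalChunk, and metricsFromNdjson are hypothetical and are not taken from the repository.

```typescript
// Hypothetical shape for the numbers shown in the prompt modal footer.
interface GenerationMetrics {
  tokens: number;
  durationSeconds: number;
  tokensPerSecond: number;
}

interface FinalChunk {
  done: true;
  eval_count: number;
  eval_duration: number; // nanoseconds, per the Ollama API docs
}

// Narrow an arbitrary parsed JSON value to a usable final chunk.
function isFinalChunk(value: unknown): value is FinalChunk {
  if (typeof value !== "object" || value === null) return false;
  const chunk = value as Record<string, unknown>;
  const count = chunk["eval_count"];
  const duration = chunk["eval_duration"];
  return (
    chunk["done"] === true &&
    typeof count === "number" &&
    typeof duration === "number" &&
    duration > 0
  );
}

// Scan the NDJSON body from the end and compute metrics from the first
// chunk (counting backwards) that carries done plus both eval fields.
function metricsFromNdjson(body: string): GenerationMetrics | null {
  const lines = body.split("\n").filter((line) => line.trim() !== "");
  for (const line of lines.reverse()) {
    let parsed: unknown;
    try {
      parsed = JSON.parse(line);
    } catch {
      continue; // skip a partial or malformed line
    }
    if (isFinalChunk(parsed)) {
      const durationSeconds = parsed.eval_duration / 1e9;
      return {
        tokens: parsed.eval_count,
        durationSeconds,
        tokensPerSecond: parsed.eval_count / durationSeconds,
      };
    }
  }
  return null; // stream ended without a final metrics chunk
}
```

Returning null rather than zeros lets the UI hide the footer when a stream is aborted before the final chunk arrives, which fits the Escape-to-abort behaviour added in Sprint 1 (B4, B9).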

12 Commits
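
The macOS memory breakdown from commit 43f8103c5a, combined with the page-size parsing from Sprint 2 (B7), can be sketched roughly as below. This is an illustration under stated assumptions, not the repository's code: the names MemoryBreakdown and parseVmStat are hypothetical, the row labels are the ones vm_stat normally prints, and the pressure classification is left out because the commit does not state its thresholds.

```typescript
// Hypothetical result type; field names are illustrative only.
interface MemoryBreakdown {
  pageSize: number;
  appBytes: number;       // active + wired + compressor
  cachedBytes: number;    // inactive + purgeable + speculative
  availableBytes: number; // free + cached
}

// Parse raw `vm_stat` text into the breakdown described in the commit.
function parseVmStat(output: string): MemoryBreakdown {
  // B7: read 'page size of N bytes' from the header instead of hardcoding it.
  const sizeMatch = output.match(/page size of (\d+) bytes/);
  if (!sizeMatch) {
    throw new Error("vm_stat output has no page size header");
  }
  const pageSize = Number(sizeMatch[1]);

  // Look up one counter row, e.g. "Pages free:      1000."
  // A missing row counts as zero pages.
  const pages = (label: string): number => {
    const row = output.match(new RegExp(`^${label}:\\s+(\\d+)\\.?\\s*$`, "m"));
    return row ? Number(row[1]) : 0;
  };

  const app =
    pages("Pages active") +
    pages("Pages wired down") +
    pages("Pages occupied by compressor");
  const cached =
    pages("Pages inactive") +
    pages("Pages purgeable") +
    pages("Pages speculative");
  const free = pages("Pages free");

  return {
    pageSize,
    appBytes: app * pageSize,
    cachedBytes: cached * pageSize,
    availableBytes: (free + cached) * pageSize,
  };
}

// Example input with made-up page counts.
const exampleVmStat = [
  "Mach Virtual Memory Statistics: (page size of 16384 bytes)",
  "Pages free:                                1000.",
  "Pages active:                              2000.",
  "Pages inactive:                            3000.",
  "Pages speculative:                          500.",
  "Pages wired down:                          1500.",
  "Pages purgeable:                            200.",
  "Pages occupied by compressor:               300.",
].join("\n");

const exampleBreakdown = parseVmStat(exampleVmStat);
console.log(exampleBreakdown);
```

Keeping the parser a pure function of the command output means it can be exercised with canned text on any platform, while the route that actually runs vm_stat stays a thin wrapper.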