# Mission Control Dashboard — Bug & Improvement Review

> Systematic code review of `__LOCAL_LLMs/dashboard/` (6 source files, 1,395 lines)
> Last updated: Feb 19, 2026

---

## File Inventory

| File                                 | Lines | Purpose                                                              |
| ------------------------------------ | ----- | -------------------------------------------------------------------- |
| `src/app/page.tsx`                   | 1,079 | Main dashboard UI (single component)                                 |
| `src/app/globals.css`                | 91    | Design tokens, animations, base styles                               |
| `src/app/layout.tsx`                 | 20    | Root layout (metadata, dark mode)                                    |
| `src/app/api/ollama/route.ts`        | 117   | Ollama REST proxy (list, load, unload, pull, delete, show, generate) |
| `src/app/api/ollama/stream/route.ts` | 38    | Ollama streaming generate proxy (NDJSON)                             |
| `src/app/api/whisper/route.ts`       | 66    | Whisper binary + GGML model discovery                                |
| `src/app/api/system/route.ts`        | 162   | System info (chip, memory via vm_stat, disk, brew)                   |

**Stack:** Next.js 16, React 19, TailwindCSS v4, Lucide icons, TypeScript

---

## 1. Bugs

- [x] **B1. Hardcoded machine specs in header** — `page.tsx:317`
      Subtitle reads `Apple M4 Pro · 48 GB · {system?.platform}` — should use `system?.chip` and `formatBytes(system?.memory.total)` dynamically so it works on any machine.

- [x] **B2. Pull model blocks UI — no progress feedback** — `api/ollama/route.ts:84-92`
      `handlePull` calls Ollama with `stream: false`. Large models (20+ GB) block for 30+ minutes. The Next.js API route will likely timeout. Must use `stream: true` and pipe progress events to the client. _(Combined with F1.)_

- [x] **B3. Dead code: non-streaming `generate` action** — `api/ollama/route.ts:69-82`
      The `action === 'generate'` handler is unused — UI only uses `/api/ollama/stream`. Remove or keep as fallback with a comment.

- [x] **B4. Escape key closes modal during active streaming** — `page.tsx:188-197`
      Global `keydown` handler calls `setPromptModel(null)` unconditionally. Backdrop click correctly checks `!promptLoading`. Escape should also respect `promptLoading` to prevent discarding an in-flight response.

- [x] **B5. Auto-refresh (15s) fires during streaming/pull** — `page.tsx:182-185`
      `setInterval(fetchAll, 15000)` runs unconditionally. During streaming this causes background churn and potential UI flicker. Should pause while `promptLoading` or `pullLoading` is true.

- [x] **B6. Toast ID collision on HMR remount** — `page.tsx:156-159`
      `toastId.current` resets to 0 on component remount during dev. Use `Date.now()` or `crypto.randomUUID()` for robust uniqueness.

- [x] **B7. vm_stat page size hardcoded** — `api/system/route.ts:103`
      Hardcoded `16384`. Should parse from vm_stat's first line: `"(page size of NNNNN bytes)"` for portability.

- [x] **B8. Whisper models dir not configurable** — `api/whisper/route.ts:24`
      Hardcoded to `~/whisper-models`. Should scan multiple known paths (`/opt/homebrew/share/whisper-cpp/models/`, `~/whisper-models`, `~/.cache/whisper/`) or accept `WHISPER_MODELS_DIR` env var.

- [x] **B9. No AbortController for streaming fetch** — `page.tsx:250-289`
      Closing the prompt modal doesn't cancel the underlying fetch. The `reader.read()` loop continues in the background wasting CPU/bandwidth until the model finishes generating.

- [x] **B10. Brew shows "Loading..." when array is empty** — `page.tsx:936-940`
      When `system.brewPackages` is `[]` (all uninstalled), displays "Loading..." instead of "No packages found". Needs to distinguish "still fetching" vs "fetched but empty".

- [x] **B11. Prompt text not cleared on close without send** — `page.tsx:951-957`
      Backdrop click clears `promptText`, but Escape handler (B4 fix) should also clear it. Otherwise stale text persists when re-opening.

---

## 2. Code Quality

- [x] **CQ1. Monolithic 1,079-line single component** — `page.tsx`
      All interfaces, utilities, sub-components, and 900+ lines of JSX in one file. Extract to:
  - `components/` — StatusDot, ProgressBar, ToastContainer, PromptModal, OllamaModelsPanel, SystemPanel, WhisperPanel, BrewPanel
  - `lib/types.ts` — interfaces (OllamaModel, SystemData, etc.)
  - `lib/format.ts` — formatBytes, formatUptime
  - `lib/hooks.ts` — useAutoRefresh, useToasts, useOllamaActions

- [x] **CQ2. Pervasive inline styles instead of CSS/Tailwind classes** — `page.tsx` (100+ occurrences)
      Every `style={{ color: 'var(--text-tertiary)' }}` should be a utility class. Options: custom Tailwind theme mapping, or CSS utility classes in `globals.css` (e.g., `.text-muted`).

- [x] **CQ3. OLLAMA_URL duplicated** — `api/ollama/route.ts:3` + `api/ollama/stream/route.ts:3`
      Same `process.env.OLLAMA_URL || 'http://localhost:11434'` in two files. Extract to `lib/ollama-config.ts`.

- [x] **CQ4. No React Error Boundary** — `page.tsx`
      Unexpected API response shape crashes the entire dashboard. Add an `error.tsx` (Next.js App Router convention) for graceful recovery.

- [x] **CQ5. No loading skeleton / shimmer UI**
      Initial load shows "..." placeholders. Skeleton cards would be more polished.

- [x] **CQ6. No TypeScript strict null checks in API responses**
      API route handlers catch errors but return loosely typed JSON. Add Zod validation on the Ollama/system responses to prevent runtime surprises.

---

## 3. Features

- [x] **F1. Streaming pull with progress bar** _(fixes B2)_
      Use Ollama `stream: true` for `/api/pull`. Create `/api/ollama/pull/route.ts` that pipes NDJSON progress. UI shows progress bar with `completed/total` bytes, speed, and ETA.

- [x] **F2. Model search/filter**
      Search input above models list. Filter by name, family, quantization. Useful when 10+ models are installed.

- [x] **F3. Prompt history (localStorage)**
      Store last 20 prompts with model name + timestamp. Dropdown in prompt modal to re-run previous prompts.

- [x] **F4. Chat mode (multi-turn conversation)**
      Use Ollama `/api/chat` instead of `/api/generate`. Chat bubble layout with message history. System prompt input field.

- [x] **F5. Model comparison (side-by-side)**
      Send same prompt to 2 models simultaneously. Display responses side-by-side with latency/quality comparison.

- [x] **F6. Token/s metrics after generation**
      Parse `eval_count` and `eval_duration` from the final NDJSON chunk. Display tokens/second, total tokens, and latency in the response footer.

- [x] **F7. System resource sparklines (time-series)**
      Ring buffer of memory/CPU snapshots (localStorage). Render mini sparkline charts in the System panel. Spot trends over time.

- [x] **F8. Ollama server logs viewer**
      Read `~/.ollama/logs/` and display in a collapsible terminal-style panel. Filter by level. Auto-scroll.

- [x] **F9. Modelfile / template viewer**
      The `show` action already fetches Modelfile, template, and system prompt. Display in a collapsible code block in expanded model details.

- [x] **F10. Dark/light theme toggle**
      Add `:root.light` CSS variable overrides. Theme toggle with localStorage persistence. Current architecture supports this natively.

- [x] **F11. Keyboard shortcuts panel (`?` key)**
      Show all shortcuts in a modal: ⌘+Enter (send), Esc (close), R (refresh), / (search models), ? (help).

- [x] **F12. Whisper transcription test**
      Upload/record a short audio clip, transcribe locally via whisper-cli, display result with latency. Tests the full local STT pipeline.

- [x] **F13. Responsive mobile layout**
      Better breakpoints for the 4-column stats row and 3-column main grid. Collapsible sidebar on mobile.

- [x] **F14. Model tags/labels (localStorage)**
      User-defined tags (coding, fast, vision) with colored badges. Persisted in localStorage.

- [x] **F15. Extraction service integration panel**
      Show extraction-service (port 4005) health status. Run test extractions against loaded Ollama models. Bridges dashboard to LysnrAI pipeline.

- [x] **F16. Auto-load preferred model**
      Mark a model as "auto-load" (stored in localStorage). When Ollama is online but no models loaded, auto-load the preferred model.

---

## 4. Performance & Reliability

- [x] **P1. No request deduplication on Refresh** — `page.tsx:164-176`
      Rapid clicks on Refresh fire duplicate `fetchAll()` calls. Add a `fetchingRef` guard or disable the button during fetch (partially done for `actionLoading` but not for `fetchAll`).

- [x] **P2. Static cache never expires** — `api/system/route.ts:81-90`
      `staticCache` (chip, GPU, brew) lives forever in the server process. Brew package upgrades won't reflect. Add 5-minute TTL.

- [x] **P3. `du -sk ~/.ollama/models` on every refresh** — `api/system/route.ts:41`
      Traverses entire models directory every 15 seconds. Cache with 60-second TTL.

- [x] **P4. No fetch timeout on Ollama calls** — `api/ollama/route.ts:5-12`
      `fetchOllama` has no `AbortSignal` or timeout. If Ollama hangs, the dashboard hangs. Add 5-second timeout.

- [x] **P5. `system_profiler` slow on first load** — `api/system/route.ts:52-53`
      Takes ~2-3 seconds. Cached after first call, but first dashboard load waits. Consider eager background fetch on server start or return placeholder.

---

## 5. Security & Hardening

- [x] **S1. No input validation on model names** — `api/ollama/route.ts:50-51`
      `model` from request body passed directly to Ollama. Add regex validation: `^[a-zA-Z0-9._:/-]{1,256}$`.

- [x] **S2. Shell command interpolation pattern** — `api/system/route.ts:67`
      `execAsync(\`brew list --versions ${pkg}\`)`— safe today (hardcoded targets) but fragile. Use`execFile('brew', ['list', '--versions', pkg])` for safety.

- [x] **S3. No CORS or auth** _(acceptable for local-only, documented)_
      Any local process can call API routes. Fine for dev tool; document the assumption.

---

## 6. Implementation Tracker

### Sprint 1 — Critical Bug Fixes _(est. 1–2 hrs)_

| #   | ID        | Task                                      | Effort | Commit    |
| --- | --------- | ----------------------------------------- | ------ | --------- |
| 1   | - [x] B4  | Guard Escape key during streaming         | 5 min  | `2da67c2` |
| 2   | - [x] B5  | Pause auto-refresh during prompt/pull     | 10 min | `2da67c2` |
| 3   | - [x] B9  | Add AbortController to streaming fetch    | 15 min | `2da67c2` |
| 4   | - [x] B1  | Dynamic chip/RAM in header                | 5 min  | `2da67c2` |
| 5   | - [x] B11 | Clear prompt text on Escape close         | 5 min  | `2da67c2` |
| 6   | - [x] P4  | Add timeout to Ollama fetch calls         | 10 min | `2da67c2` |
| 7   | - [x] B3  | Remove dead generate action (or document) | 5 min  | `2da67c2` |
| 8   | - [x] B6  | Use Date.now() for toast IDs              | 2 min  | `2da67c2` |
| 9   | - [x] B10 | Fix brew "Loading..." vs "empty" state    | 5 min  | `2da67c2` |

### Sprint 2 — Pull Progress + Metrics _(est. 2–3 hrs)_

| #   | ID          | Task                                | Effort | Commit    |
| --- | ----------- | ----------------------------------- | ------ | --------- |
| 10  | - [x] B2+F1 | Streaming pull with progress bar    | 60 min | `2d9475b` |
| 11  | - [x] F6    | Display tokens/s after generation   | 30 min | `2d9475b` |
| 12  | - [x] B7    | Parse vm_stat page size dynamically | 10 min | `2d9475b` |
| 13  | - [x] B8    | Multi-path whisper model discovery  | 15 min | `2d9475b` |

### Sprint 3 — Component Refactor _(est. 2–3 hrs)_

| #   | ID        | Task                                    | Effort | Commit    |
| --- | --------- | --------------------------------------- | ------ | --------- |
| 14  | - [x] CQ1 | Extract components into separate files  | 90 min | `75a3cd0` |
| 15  | - [x] CQ4 | Add error.tsx Error Boundary            | 15 min | `75a3cd0` |
| 16  | - [x] CQ3 | Shared ollama-config.ts                 | 10 min | `75a3cd0` |
| 17  | - [x] CQ2 | Consolidate inline styles → CSS classes | 45 min | `ed93a6f` |
| 18  | - [x] S1  | Add model name input validation         | 10 min | `75a3cd0` |
| 19  | - [x] S2  | Replace exec → execFile for brew        | 10 min | `75a3cd0` |

### Sprint 4 — UX Enhancements _(est. 3–4 hrs)_

| #   | ID        | Task                                 | Effort | Commit    |
| --- | --------- | ------------------------------------ | ------ | --------- |
| 20  | - [x] F3  | Prompt history (localStorage)        | 45 min | `9c2f5f3` |
| 21  | - [x] F9  | Modelfile viewer in expanded details | 30 min | `9c2f5f3` |
| 22  | - [x] F4  | Chat mode (multi-turn via /api/chat) | 90 min | `ed93a6f` |
| 23  | - [x] F2  | Model search/filter                  | 30 min | `9c2f5f3` |
| 24  | - [x] F11 | Keyboard shortcuts panel             | 20 min | `9c2f5f3` |

### Sprint 5 — Integration & Polish _(est. 2–3 hrs)_

| #   | ID          | Task                       | Effort | Commit    |
| --- | ----------- | -------------------------- | ------ | --------- |
| 25  | - [x] F15   | Extraction service panel   | 60 min | `8bdd5ee` |
| 26  | - [x] F12   | Whisper transcription test | 45 min | `8bdd5ee` |
| 27  | - [x] F7    | System resource sparklines | 45 min | `8bdd5ee` |
| 28  | - [x] CQ5   | Loading skeleton UI        | 20 min | `8bdd5ee` |
| 29  | - [x] P1-P3 | Request dedup + cache TTLs | 30 min | `b1fda3a` |
| 30  | - [x] F16   | Auto-load preferred model  | 20 min | `ed93a6f` |

### Deferred (nice-to-have)

| ID        | Task                            | Notes     |
| --------- | ------------------------------- | --------- |
| - [x] F5  | Model comparison (side-by-side) | `8bdd5ee` |
| - [x] F10 | Dark/light theme toggle         | `ed93a6f` |
| - [x] F13 | Responsive mobile layout        | `8bdd5ee` |
| - [x] F14 | Model tags/labels               | `ed93a6f` |
| - [x] CQ6 | Zod validation on API responses | `ed93a6f` |
| - [x] F8  | Ollama server logs viewer       | `8bdd5ee` |
| - [x] S3  | CORS / auth (documented)        | `8bdd5ee` |

---

## 7. Commit Log

_Commits will be added here as work progresses._

| #   | Date   | Commit    | Sprint   | Items Completed                      |
| --- | ------ | --------- | -------- | ------------------------------------ |
| 1   | Feb 19 | `2da67c2` | Sprint 1 | B1, B3, B4, B5, B6, B9, B10, B11, P4 |
| 2   | Feb 19 | `2d9475b` | Sprint 2 | B2, B7, B8, F1, F6                   |
| 3   | Feb 19 | `75a3cd0` | Sprint 3 | CQ1, CQ3, CQ4, S1, S2                |
| 4   | Feb 19 | `9c2f5f3` | Sprint 4 | F2, F3, F9, F11                      |
| 5   | Feb 19 | `b1fda3a` | Sprint 5 | P1, P2, P3                           |
| 6   | Feb 19 | `ed93a6f` | Sprint 6 | CQ2, CQ6, P5, F4, F10, F14, F16      |
| 7   | Feb 19 | `8bdd5ee` | Sprint 7 | F5, F7, F8, F12, F13, F15, CQ5, S3   |

---

> **39 items total:** 11 bugs, 6 code quality, 16 features, 5 performance, 3 security
> **All 39 items completed** across 7 sprints (9 code commits + doc updates)
> **Actual total effort:** ~8 hours across 7 sprints

---

## 8. Next Wave — Model Intelligence & Pre-Load Metrics

> Proposed improvements focused on helping users make informed decisions **before** loading a model.

### Tier A — Pre-Load Decision Metrics _(est. 45 min)_

| ID  | Feature                        | Description                                                                                                                                     |
| --- | ------------------------------ | ----------------------------------------------------------------------------------------------------------------------------------------------- |
| N1  | **Estimated RAM per model**    | Approximate from disk size: Q4_K_M ≈ 1.2×disk in RAM. Show on every model card (e.g., `~22 GB RAM`), not just running models.                   |
| N2  | **"Will it fit?" indicator**   | Compare estimated RAM vs `system.memory.free + cached`. Color-code: 🟢 Fits, 🟡 Tight (80–100%), 🔴 Won't fit. Show on Load button or as badge. |
| N3  | **Aggregate loaded model RAM** | Sum VRAM of all running models. Display at top of models panel: "3 models loaded · 28.5 GB VRAM".                                               |

### Tier B — Rich Model Metadata _(est. 60 min)_

| ID  | Feature                 | Description                                                                                                                                |
| --- | ----------------------- | ------------------------------------------------------------------------------------------------------------------------------------------ | ---------------- | ---------------- | ------------------------------------------ |
| N4  | **RAM budget bar**      | Horizontal stacked bar: `[OS+Apps                                                                                                          | Model A (loaded) | Model B (loaded) | Free]`. Instant visual of memory headroom. |
| N5  | **Context window size** | Fetch `context_length` from Ollama `/api/show` → `model_info`. Display on card (e.g., `128k ctx`). Critical for knowing max prompt length. |

### Tier C — Model Intelligence Badges _(est. 45 min)_

| ID  | Feature                     | Description                                                                                                                       |
| --- | --------------------------- | --------------------------------------------------------------------------------------------------------------------------------- |
| N6  | **`<think>` warning badge** | If model is DeepSeek R1 family, show ⚠️ badge: "Emits `<think>` traces — strip before JSON.parse". Prevents silent JSON failures. |
| N7  | **Vision model indicator**  | If model is multimodal (llava, qwen2.5vl), show 👁 badge. These need image input — text-only prompts are suboptimal.              |
| N8  | **Architecture badge**      | Show model arch (llama, qwen2, phi3, deepseek2) as subtle pill on the card. Currently buried in expanded details.                 |
| N9  | **Sort/order models**       | Dropdown to sort by: name, size, parameters, running status, last modified. Currently uses Ollama's default order.                |
| N10 | **Ollama version display**  | Call `/api/version`. Show in Ollama status card. Useful for debugging model compatibility.                                        |

### Tier D — Runtime Metrics & UX _(est. 30 min)_

| ID  | Feature                           | Description                                                                                                                                    |
| --- | --------------------------------- | ---------------------------------------------------------------------------------------------------------------------------------------------- |
| N11 | **Last known tok/s per model**    | Persist `StreamMetrics.tokensPerSec` in localStorage keyed by model. Show on card (e.g., `~45 tok/s`). Compare speeds without re-benchmarking. |
| N12 | **Auto-unload countdown**         | Replace static `Expires: 3:45 PM` with live countdown: `Unloads in 4m 32s`. More actionable.                                                   |
| N13 | **Session stats per model**       | Track prompts sent + tokens generated per model in session. Show in expanded details.                                                          |
| N14 | **Delete confirmation + reclaim** | Show "Delete qwen2.5-coder:32b? Reclaim 18.5 GB disk." before deleting. Currently no confirmation.                                             |
| N15 | **Simultaneous load suggestions** | Based on available RAM, suggest which models can be co-loaded. E.g., "Can co-load llama3.1:8b + qwen2.5-coder:32b (28 GB, 20 GB free)".        |

### Implementation Plan

| Sprint | Items                   | Focus                    | Effort  |
| ------ | ----------------------- | ------------------------ | ------- |
| 8      | N1, N2, N3              | Pre-load RAM estimates   | ~45 min |
| 9      | N4, N5                  | RAM bar + context window | ~60 min |
| 10     | N6, N7, N8, N9, N10     | Badges + sort + version  | ~45 min |
| 11     | N11, N12, N13, N14, N15 | Runtime metrics + UX     | ~30 min |