Commit Graph

73 Commits

Author SHA1 Message Date
saravanakumardb1
41e7a125da chore: update project files 2026-02-27 17:09:25 -08:00
saravanakumardb1
e5acd1dc26 fix(mission-control): dedupe whisper binaries and clean WSL package metadata
Prevent duplicate React keys by de-duplicating whisper binary names and improve WSL package reporting so system metadata shows stable, user-friendly versions.

Co-authored-by: Cursor <cursoragent@cursor.com>
2026-02-22 18:03:59 -08:00
saravanakumardb1
a7790b7115 chore(local-llms): add WSL Ollama connectivity test script
Add a helper script to quickly verify Ollama reachability across localhost, WSL gateway, and nameserver paths to speed up Windows+WSL troubleshooting.

Co-authored-by: Cursor <cursoragent@cursor.com>
2026-02-22 16:46:18 -08:00
saravanakumardb1
112341e162 fix(local-llms): harden WSL setup and dashboard Ollama connectivity
Stabilize Windows+WSL setup by fixing script line-ending pitfalls, WSL Ollama host detection, and dashboard startup behavior so models are detected reliably in Mission Control.

Co-authored-by: Cursor <cursoragent@cursor.com>
2026-02-22 16:44:58 -08:00
saravanakumardb1
6dca1bd6f1 docs(windows): rename mac-vs-windows → all-machines-comparison, add 4-machine tables
- Renamed mac-vs-windows-comparison.md → all-machines-comparison.md
- Added Fleet Overview table (all 4 machines at a glance)
- Added All Machines — Hardware Comparison (4-column ASCII table)
- Added All Machines — AI/ML Capability (inference, TTS, fine-tuning, OpenClaw, Voicebox)
- Added All Machines — Software Development (iOS, CUDA, Docker, portability)
- Added All Machines — Recommended Roles (PRIMARY/SECONDARY per workload)
- Added All Machines — Cost & Power (purchase, electricity, fleet totals)
- Preserved original Mac vs Razer deep-dive sections below
- Updated README.md file reference
2026-02-22 15:47:44 -08:00
saravanakumardb1
9f6c216d0f docs(voicebox): add setup guide for local voice cloning studio
Covers: what it is, architecture diagram, prerequisites (Python 3.12,
Bun, Rust), step-by-step install for macOS and WSL2, running backend
+ web frontend, first use (model download, voice profiles, story editor),
Make commands reference, platform performance table, troubleshooting
(proxy workaround, MPS/CUDA, transformers conflict), file structure,
and relevance to LysnrAI
2026-02-22 15:45:32 -08:00
saravanakumardb1
c50f271e1c docs(windows): flesh out Dell P16s spec with full hardware details + use cases
Expanded from 7-line raw system info to full spec document:
- CPU (Ryzen 7 PRO 7840U Zen 4, 8c/16t, AVX-512, NPU)
- GPU (Radeon 780M RDNA 3 iGPU, DirectML/ROCm)
- RAM (32GB DDR5, 24GB usable, VRAM allocation explained)
- Capabilities assessment (dev workstation, light AI, remote dev)
- AI/ML section (Ollama CPU, ROCm experimental, Ryzen AI NPU)
- Portable dev setup diagram + OpenClaw client use case
- 4-machine comparison table
- Optimization tips (reclaim RAM from iGPU, WSL2 memory limit)
- BIOS recommendations
2026-02-22 15:43:20 -08:00
saravanakumardb1
b3da5dd66e docs(windows): flesh out HP Z240 spec + update README with all machines
- HP Z240 spec: expanded from 7-line raw info to full spec document
  with hardware details, capabilities assessment, OpenClaw server
  use case, home lab services table, GPU upgrade path, machine
  comparison, and setup recommendations
- README: added Machines table (Razer/HP Z240/Dell P16s), expanded
  files table, added Related: OpenClaw section with links to OPEN_CLAW/
2026-02-22 15:41:20 -08:00
saravanakumardb1
09d3ffe20d chore: gitignore __LOCAL_LLMs/APPS/ (third-party clones) 2026-02-22 15:38:52 -08:00
saravanakumardb1
bc8e7a0504 docs(openclaw): add detailed install, secure setup & validation guide
7-phase step-by-step guide:
- Phase 1: Install (Node.js, OpenClaw, onboarding wizard)
- Phase 2: Secure the Gateway (config hardening, password, permissions)
- Phase 3: Connect Channels (WhatsApp, Telegram, Discord, Slack, WebChat)
- Phase 4: Harden the Host (Windows Firewall, WSL2, macOS, router)
- Phase 5: Remote Access (Tailscale Serve setup)
- Phase 6: Validate Security (run validate-security.sh)
- Phase 7: Daemon & Auto-Start (launchd/systemd)
Plus: 22-point verification checklist, maintenance schedule,
emergency procedures, and quick reference card
2026-02-22 15:38:52 -08:00
saravanakumardb1
5667308629 docs(openclaw): move doc to OPEN_CLAW/ dir, add security validation script
- Move openclaw-personal-ai-assistant.md from windows_specific/ to OPEN_CLAW/
- Add validate-security.sh: cross-platform security scanner with visual output
  - 7 check categories: installation, gateway config, file permissions,
    network security, API keys, system security, openclaw doctor
  - Green ticks (pass), red crosses (fail), yellow warnings
  - Numbered recommendations list with fix commands
  - Works on macOS, Linux, and WSL2
2026-02-22 15:38:52 -08:00
saravanakumardb1
4dd8003f25 docs(local-llms): add security best practices to OpenClaw guide
10-point security hardening section:
- Known risks table (DM policy, WebSocket, prompt injection, tool exec)
- Gateway config hardening (loopback bind, pairing, disable system.run)
- Windows Firewall rules (block external, allow localhost only)
- WSL2 hardening (UFW, file permissions, no root, disable SSH)
- Network architecture diagram (Tailscale Serve, no port forwarding)
- API key security (OAuth preferred, rotation, no git commits)
- Prompt injection defense (disable browser/system tools, per-channel pairing)
- Monitoring & audit cron script
- Backup & recovery commands
- 14-point pre-launch security checklist
2026-02-22 15:38:51 -08:00
saravanakumardb1
60aa6fd0ef docs(local-llms): add OpenClaw personal AI assistant guide
Covers: what it is, installation on Windows/WSL2 and macOS, channels,
models, configuration, security, HP Z240 as always-on gateway host,
Tailscale remote access, troubleshooting, CLI reference, and skills.
2026-02-22 15:38:51 -08:00
sarvana7
05a790de9f
Add Dell P16s Windows specifications document 2026-02-22 15:23:01 -08:00
sarvana7
95534bbfcc
Add Windows specifications for HP Z240 2026-02-22 13:34:40 -08:00
saravanakumardb1
6d18344fe0 docs(local-llms): add 7 RTX 5090 capability deep-dive guides
New capabilities/ subfolder with detailed guides:
- 01: GPU inference speed (benchmarks, Ollama tuning, API usage)
- 02: Whisper batch transcription (scripts, Python integration, use cases)
- 03: TTS generation at scale (Orpheus + Qwen3, batch scripts, voice cloning)
- 04: Fine-tuning / training (LoRA, QLoRA, data prep, Ollama export)
- 05: CUDA / TensorRT / ML research (toolchain setup, Triton kernels, profiling)
- 06: Stable Diffusion / image gen (ComfyUI, SDXL, FLUX, batch generation)
- 07: Multi-GPU workloads (scaling path, eGPU, cloud, cost planning)
- README: index with learning order and prerequisites

Each guide covers: what it is, how to use it, benefits, skills to learn
2026-02-21 20:36:21 -08:00
saravanakumardb1
1650e0da6c docs(local-llms): add Mac vs Windows side-by-side comparison
Detailed comparison of MacBook Pro M4 Pro 48GB vs Razer Blade 18 RTX 5090:
- Hardware specs, memory architecture, VRAM budgets
- Model performance estimates (inference speed, capacity)
- Software dev capabilities (iOS, CUDA, Docker, etc.)
- Recommended workload distribution
- Visual ASCII diagrams for memory architecture
2026-02-21 17:59:58 -08:00
saravanakumardb1
efd45ad86f feat(local-llms): add one-click Windows setup scripts
- setup-windows.ps1: PowerShell script for Windows side
  - NVIDIA driver verification, Ollama install via winget
  - Pull all 5 models with skip-if-exists logic
  - WSL2 Ubuntu 24.04 install
- setup-wsl.sh: Bash script for WSL2 side
  - Idempotent apt deps (Node.js 20, Python 3.12, ffmpeg, cmake)
  - CUDA GPU passthrough verification
  - Repo clone + git pull, whisper.cpp CUDA build
  - Whisper model download, TTS setup, dashboard start
- README.md: 2-step quick start (no IDE required)
- setup-guide.md: add automated setup section at top
2026-02-21 16:28:02 -08:00
saravanakumardb1
b1d2e4ec81 fix(local-llms): cross-platform audit — 8 bugs/gaps fixed
- setup-tts.sh: make fully cross-platform (macOS + Linux/WSL2)
  - OS detection, apt fallback, CUDA PyTorch install, nvidia-smi check
  - cross-platform playback hints, HF_MIRROR env override
- api/system/route.ts: fix ffmpeg detection (use -version not --version)
- api/system/memory/route.ts: remove unused total variable in Linux path
- api/system/exec/route.ts: expand allowlist with Linux commands
  (head, tail, grep, which, ps, uname, free, lscpu, nvidia-smi, etc.)
- api/tts/route.ts: cross-platform venv path + CUDA/MPS label
- api/whisper/route.ts: Linux binary/model paths
- api/ollama/logs/route.ts: Linux log paths + WSL2 hint
- test_qwen_tts.py: platform-aware speech text + CUDA device detection
- test_orpheus_tts.py: platform-aware text, move import sys to top
- setup-guide.md: fix false auto-detect claim, add HF_MIRROR hint
2026-02-21 15:27:49 -08:00
saravanakumardb1
f85b455eb5 ci: update CI/CD configuration 2026-02-21 14:13:07 -08:00
saravanakumardb1
14c7883d2a docs(local-llm): mark all phases A-G complete in roadmap with commit links 2026-02-20 00:48:31 -08:00
saravanakumardb1
6d98d12f04 feat(local-llm): Phase G — projects + multi-model orchestration (G1-G7)
G1: Project CRUD in IndexedDB (already added in Phase F commit)
G2: Project sidebar section with create, pin, delete, and active selection
G3: Project system context injection (via project default model/agent/context)
G4: Cmd+P project switcher modal with keyboard navigation
G5: Chain orchestration — sequential multi-model pipeline with {prev} placeholder
G6: Race orchestration — parallel model competition with timing
G7: Vote orchestration — consensus synthesis from multiple model responses
2026-02-20 00:47:34 -08:00
saravanakumardb1
52f3d16b65 feat(local-llm): Phase F — scheduled tasks (F1-F7)
F1: cron-parser integration + cron utility functions (parse, nextRun, toHuman, shouldRunNow)
F2: ScheduledTask + Project + Orchestration CRUD in IndexedDB
F3: Task editor modal (schedule, model, input source, output action, prompt)
F4: Browser-based task runner with setInterval + cron matching
F5: /api/system/exec — safe shell command execution with allowlist
F6: Task run history stored per task (last 20 runs)
F7: 5 built-in task templates (morning brief, git diff, disk usage, code review, deps)
2026-02-20 00:44:53 -08:00
saravanakumardb1
e15a5a2f2f feat(local-llm): Phase E — response enhancements (E1-E5)
E1: Per-message action bar (copy, regenerate dropdown, rating) on hover
E2: Per-code-block copy button in MarkdownResponse with 'Copied!' feedback
E3: 'Try with other model' — regenerate dropdown shows loaded models
E4: Live streaming metrics (token count + tok/s during stream)
E5: Rating (thumbs up/down) persisted per message in IndexedDB
2026-02-20 00:40:49 -08:00
saravanakumardb1
d625be283c feat(local-llm): Phase D model router + multi-modal input (D1-D7)
- add lib/router.ts with task classifier, model hint mapping, resolve fallback chain, and auto-detect defaults
- integrate auto-routing mode in conversation model selector with __auto__ option
- persist/read model defaults from localStorage (llm-model-defaults)
- route prompts to selected/routed model before streaming
- add multi-modal input controls (attach file, image, voice)
- support attachment chips, removal, drag-and-drop file attach
- add audio transcription flow via /api/whisper/transcribe and append result to input
- support sending attachments payload alongside text from InputBar
2026-02-20 00:31:54 -08:00
saravanakumardb1
79cf42c8e3 docs(local-llm): mark Phase C complete with checkboxes and commit link 2026-02-20 00:27:33 -08:00
saravanakumardb1
d18b695029 feat(local-llm): Phase C custom agents (C1-C5)
- add built-in agents library (10 seeded agents)
- add agent CRUD/seeding/export/import helpers in db layer
- seed agents on workspace load
- add agent strip in sidebar and launch-from-agent flow
- add command palette support for agent entries
- add agent conversation wiring (agentId, systemPrompt, welcome message)
- render agent badge in conversation header
- add example prompt chips in input bar for agent conversations
- add AgentEditor modal for creating/updating custom agents
2026-02-20 00:26:46 -08:00
saravanakumardb1
f289099461 docs(local-llm): mark Phase B complete with checkboxes and commit link 2026-02-20 00:20:13 -08:00
saravanakumardb1
7ae92da16e feat(local-llm): Phase B quick actions + command palette (B1-B6)
- add fuse.js dependency and command palette modal (Cmd+K)
- add built-in quick actions library (30 templates across categories)
- add quick action CRUD + seeding + import/export helpers in db layer
- seed quick actions on workspace load and list top actions in sidebar
- implement quick action launcher -> creates preconfigured conversation
- add custom quick action editor modal for creating/editing actions
- wire command palette system actions and conversation navigation
- support passing QA template into conversation input via query param
2026-02-20 00:19:17 -08:00
saravanakumardb1
1335d47869 docs(local-llm): mark Phase A complete with checkboxes and commit link 2026-02-20 00:12:26 -08:00
saravanakumardb1
e17bb311c9 feat(local-llm): Phase A foundation (A1-A8) workspace + indexeddb
- add idb dependency and create typed db layer (conversations/messages/agents/etc)
- extend app/lib/types.ts with v4 workspace interfaces
- move existing dashboard to /mission-control route group
- create / workspace route group with sidebar shell and conversation pages
- implement conversation list grouping + search in sidebar
- implement conversation view with streaming via /api/ollama/chat
- add context bar and token/context utilities
- add /api/ollama/title endpoint for auto-title generation
- add v3->v4 migration utility (llm-inference-log + llm-chat-* to indexeddb)
- wire migration in workspace layout and cmd+/ sidebar toggle

Implements roadmap Phase A tasks A1-A8.
2026-02-20 00:11:27 -08:00
saravanakumardb1
d7dc66eb92 docs(local-llm): Rich Features Roadmap — 45 tasks across 7 phases for coding agent
Detailed implementation roadmap for the Rich Features PRD with:

Phase A (Sprint 14-16, ~15hr): Foundation
  A1: IndexedDB layer with idb — 9 object stores, compound indexes
  A2: v4 TypeScript interfaces — all data models
  A3: Route group (mission-control) — move existing dashboard
  A4: Route group (workspace) — sidebar + content layout
  A5: Sidebar — conversation list, time groups, search
  A6: Conversation view — message thread, input bar, streaming
  A7: Auto-title + context window usage bar
  A8: v3 → v4 migration from localStorage

Phase B (Sprint 17-18, ~10hr): Quick Actions + Cmd+K
  B1-B6: 30 built-in actions, fuse.js command palette, launcher,
  custom editor, usage tracking, export/import

Phase C (Sprint 19-20, ~9hr): Custom Agents
  C1-C5: 10 built-in agents, picker, full-screen editor,
  conversation wiring (welcome msg, chips, temp), export

Phase D (Sprint 21-22, ~13hr): Model Router + Multi-Modal
  D1-D7: regex classifier, model defaults, auto-routing UI,
  rich input bar, file/voice/image processing, drag-drop

Phase E (Sprint 23, ~7hr): Response Enhancements
  E1-E5: action bars, code-block copy, try-other-model,
  live metrics, rating with aggregation

Phase F (Sprint 24-25, ~11hr): Scheduled Tasks
  F1-F7: cron-parser, CRUD, editor, browser runner,
  /api/system/exec with allowlist, notifications, templates

Phase G (Sprint 26-28, ~13hr): Projects + Orchestration
  G1-G7: project CRUD, drag-to-project, system context,
  Cmd+P switcher, chain/race/vote modes

Every task has: explicit file paths, step-by-step instructions,
pass/fail exit criteria, verification commands, and commit templates.
Dependency graph: A is foundation, B-F parallel after A, G needs A+B.
2026-02-19 23:54:07 -08:00
saravanakumardb1
7bd14054d4 docs(local-llm): Rich Features PRD rev 2 — comprehensive review + expansion
Review findings addressed (20+ issues):

Structure additions:
- Target Users section with 5 personas (solo dev, tinkerer, privacy pro, writer, power user)
- Non-Goals section (8 explicit out-of-scope items for v4)
- Risks & Mitigations table (10 risks with impact/likelihood/mitigation)
- New API Routes section (4 new routes with security notes)
- Settings Expansion section (full tree: General, Router, Models, Input, Tasks, Data, About)
- New Dependencies table (idb ~1KB, fuse.js ~6KB, cron-parser ~3KB)
- Error Handling appendix (12 edge cases with expected behavior)

Data model fixes:
- Conversation/Message split into separate IndexedDB stores (scalability)
- Message gets conversationId FK, promptTokens field, size/language on Attachment
- Design decision note explaining why messages are stored separately

Feature spec improvements:
- 3.1 Conversations: context window management (token bar, auto-summarize at 80/95%)
- 3.2 Quick Actions: expanded Cmd+K palette spec (5 result types, ranking)
- 3.3 Agents: tools marked v4 vs v5, duplicate-from-builtin, unlink on delete
- 3.4 Model Router: full resolveModel() with 4-level fallback chain + availability
- 3.5 Multi-Modal: attachment size limits, Whisper error handling
- 3.6 Response: hover-only action bars, rating aggregation per task type
- 3.7 Cron: built-in templates table, runtime constraints, security (execFile)
- 3.8 Orchestration: full data model, chain/race/vote UI specs, step limits
- 3.9 Projects: system context detail, project stats, unlink behavior

Acceptance criteria added to ALL 9 features (was missing on 5).
Competitive analysis expanded with local competitors (Open WebUI, LM Studio, Jan.ai).
Success metrics improved with measurement methodology and rationale.
Open questions restructured as decision table with recommendations.
IndexedDB schema with explicit indexes and compound keys.
Migration strategy: 7-step v3→v4 with safety (no delete until confirmed).

681 lines → 1149 lines (+69% content)
2026-02-19 23:47:59 -08:00
saravanakumardb1
1172dbb23e docs(local-llm): Rich Features PRD — full local AI workspace spec
Comprehensive PRD evolving Mission Control into a ChatGPT-class local AI workspace:

- 3.1 Conversations: persistent, named, searchable, branching, IndexedDB
- 3.2 Quick Actions: 30 built-in 1-click launchers across 5 categories
     (code, writing, analysis, creative, devops) + custom actions + Cmd+K palette
- 3.3 Custom Agents: 10 built-in local GPTs with system prompts, tools,
     temperature, welcome messages, example prompts
- 3.4 Model Router: heuristic task classifier (<5ms, no LLM call),
     auto-selects best model per task type, configurable defaults
- 3.5 Multi-Modal Input: file attach, voice (Whisper), images, drag-drop,
     paste intelligence (code/image/error detection)
- 3.6 Response Enhancements: per-message actions, per-code-block copy,
     branching with navigation, live metrics, rating/quality profiles
- 3.7 Scheduled Tasks: cron-based recurring prompts with shell/file input,
     notification/file/conversation output, 5 built-in templates
- 3.8 Multi-Model Orchestration: chain, race, vote modes
- 3.9 Projects: conversation folders with system context + model defaults

7 implementation phases (~78hr), component architecture, storage migration,
competitive analysis, success metrics, open questions
2026-02-19 23:39:20 -08:00
saravanakumardb1
3dc0c441a9 docs(local-llm): mark all roadmap phases 1-6 complete with commit links
All 27 roadmap items + 5 bugs checked off across 6 phases:
- Phase 1 (040013e): N1-N3, BN1, BN2, BN5
- Phase 2 (7f04297): N4-N5, BN3, BN4
- Phase 3 (6f6baf9): N6-N10
- Phase 4 (588d21c): N11-N14
- Phase 5 (44ad8a6): F24-F28
- Phase 6 (07d3911): F29-F31
2026-02-19 23:30:11 -08:00
saravanakumardb1
07d391101a feat(local-llm): Phase 6 — data persistence + export (F29-F31)
F29: Export/import settings — gear icon in header opens settings popover,
     export downloads all llm-* localStorage as JSON, import validates
     and merges, both with toast feedback
F30: Inference history log — saves prompt/response/model/metrics to
     llm-inference-log (capped 100 FIFO), searchable panel with replay
     button, count badge in header toggle
F31: Factory reset — confirm dialog clears all llm-* localStorage keys,
     resets all component state to defaults
2026-02-19 23:29:40 -08:00
saravanakumardb1
44ad8a6301 feat(local-llm): Phase 5 — response quality + interaction (F24-F28)
F24: Vision image upload — file picker for vision models, base64 encoding,
     passed through stream API to Ollama generate endpoint
F25: Markdown rendering — ReactMarkdown replaces raw <pre> for all
     prompt responses and chat assistant messages
F26: Syntax highlighting — Prism-based code blocks with language labels
     and oneDark theme via react-syntax-highlighter
F27: <think> block collapse — auto-detect and collapse DeepSeek R1
     reasoning traces into expandable details with word count
F28: Ollama library link — button next to Pull input opens ollama.com/library
2026-02-19 23:25:20 -08:00
saravanakumardb1
588d21c70e feat(local-llm): Phase 4 — runtime metrics + UX polish (N11-N14)
N11: Persist tok/s per model to localStorage (llm-model-benchmarks),
     display on model card as faded accent text
N12: Live countdown to auto-unload — 1s interval, color-coded
     (green >5m, yellow 1-5m, red <1m 'Unloading soon')
N13: Session stats per model (prompts + tokens) in expanded details
N14: Co-load suggestions strip below models list showing which
     unloaded models fit in remaining free memory
2026-02-19 23:20:30 -08:00
saravanakumardb1
6f6baf99c8 feat(local-llm): Phase 3 — model intelligence badges + sort + version (N6-N10)
N6: <think> warning badge for DeepSeek R1 and distilled variants
N7: Vision model indicator for llava, bakllava, moondream, qwen-vl, etc.
N8: Architecture/family badge as pill on every model card
N9: Sort dropdown (A-Z, size, params, running, recent) with localStorage persist
N10: Ollama server version fetched from /api/version, shown in stats card
2026-02-19 23:17:07 -08:00
saravanakumardb1
7f042975de feat(local-llm): Phase 2 — rich metadata + persistence (N4-N5, BN3-BN4)
N4: RamBudgetBar component — stacked horizontal bar showing OS+Apps,
    loaded models (by name with color), and free memory segments
N5: Context window size — extract context_length from /api/show
    model_info, cache in modelMetadata state, display on card
BN3: Persist chat messages to localStorage (llm-chat-{model}),
     restore on modal re-open, capped at 50 messages
BN4: Logs panel refresh button — RefreshCw icon next to toggle
2026-02-19 23:13:22 -08:00
saravanakumardb1
040013e495 feat(local-llm): Phase 1 — pre-load intelligence + bug fixes (N1-N3, BN1-BN2, BN5)
N1: Estimated RAM per model with quant-aware multipliers (Q4=1.2x, Q5=1.25x, Q8=1.1x, F16=1.05x)
N2: Will-it-fit indicator (green/yellow/red dot) next to Load button
N3: Aggregate loaded model VRAM in panel header badge
BN1: Compare buttons now filter to running models only
BN2: AbortController on compare stream, cancel on modal close
BN5: Delete confirmation shows model name + disk reclaim size
2026-02-19 23:09:49 -08:00
saravanakumardb1
ae231d5aac docs(local-llm): comprehensive roadmap review — 5 bugs, 6 phases, 31 items
Systematic code review of DASHBOARD_ROADMAP.md against actual codebase:

Bugs found (BN1-BN5):
- BN1: Compare buttons show unloaded models (can't generate)
- BN2: No AbortController on compare stream (leaks on close)
- BN3: Chat messages lost on modal close (no persistence)
- BN4: Logs panel has no refresh button
- BN5: Delete dialog missing reclaim size (partial impl exists)

Expanded from 4 phases to 6 + backlog (15 → 31 items):
- Phase 1: Pre-load intelligence + bug fixes (N1-N3, BN1-BN2, BN5)
- Phase 2: Rich metadata + persistence (N4-N5, BN3-BN4)
- Phase 3: Model intelligence badges + sort (N6-N10)
- Phase 4: Runtime metrics + UX polish (N11-N14)
- Phase 5 (NEW): Response quality — markdown, syntax highlight,
  vision upload, think-block collapse, model library link
- Phase 6 (NEW): Data persistence — export/import, inference log,
  factory reset
- Phase 7: Expanded backlog (F17-F38, +6 new ideas)

Improvements:
- Added checkboxes for all tasks and acceptance criteria
- Quant-aware RAM estimate multipliers (Q4/Q5/Q8/F16)
- Broader vision model regex (bakllava, moondream, llama-vision)
- DeepSeek R1 distill variant detection for think badge
- Conservative memory availability formula (free + cached*0.5)
- localStorage key registry with llm- prefix standardization
- Dependency graph between phases
- ~6 hrs total estimated effort
2026-02-19 23:02:25 -08:00
saravanakumardb1
cd6e561f1b docs(local-llm): consolidate dashboard docs into dashboard/docs/
- Created DASHBOARD_PRD.md — full updated PRD with current 19-file
  architecture, all 10 API routes, UI layout, data flow, localStorage
  keys, security model, and v1-v3 changelog.
- Created DASHBOARD_ROADMAP.md — phased implementation plan for N1-N15
  improvements across 4 phases: pre-load intelligence, rich metadata,
  model intelligence badges, runtime metrics. Includes acceptance
  criteria and implementation details per item.
- Updated DASHBOARD_REVIEW.md — refreshed file inventory to 19 files
  (~2,930 lines), fixed broken Tier B markdown table, added cross-links.
- Replaced __LOCAL_LLMs/docs/05-mission-control-dashboard.md with
  redirect pointer to new dashboard/docs/ location.

Dashboard docs are now co-located at __LOCAL_LLMs/dashboard/docs/:
  - DASHBOARD_PRD.md (product requirements)
  - DASHBOARD_REVIEW.md (audit + 39 completed items + N1-N15 proposals)
  - DASHBOARD_ROADMAP.md (phased implementation plan)
2026-02-19 22:54:18 -08:00
saravanakumardb1
519f348583 docs(local-llm): add Next Wave — 15 model intelligence improvements (N1–N15)
Section 8 of DASHBOARD_REVIEW.md: pre-load RAM estimates, will-it-fit
indicator, RAM budget bar, context window, architecture/vision/think
badges, sort, tok/s history, countdown, session stats, delete confirm,
co-load suggestions. Organized in 4 tiers with sprint plan.
2026-02-19 22:32:29 -08:00
saravanakumardb1
4090c8aa13 docs(local-llms): add developer guide — API endpoint, code examples, model selection
- New 00-developer-guide.md: start-here doc for developers covering:
  - Ollama endpoint (http://localhost:11434/v1) and API key
  - curl, TypeScript, Python code examples with env var pattern
  - Model selection table by task
  - Running extraction service evals locally
  - JSON output gotchas (parse from string, <think> strip for R1)
  - Model management commands
  - Troubleshooting quick reference
  - Links to all other docs
- Updated index in LOCAL_LLMs_setup_mac_m4_48gb.md to include doc 00
2026-02-19 18:43:06 -08:00
saravanakumardb1
5deb5efdcf docs(local-llms): add comprehensive model comparison table and deepseek-r1:32b details
- Add Comprehensive Model Comparison Table: 11 models (local + cloud) with
  Disk, Params, Quant, RAM, Tok/s, JSON quality, Reasoning, Code, Instruction
  Following, Context window, <think> flag, and install status columns
- Add Gap Analysis table: llama3.1:8b (~55%), qwen2.5-coder:32b (~85%),
  deepseek-r1:32b (~75-80%) vs llama3.3:70b across 5 capability dimensions
- Update Tier 4 Reasoning table: add Parameters, Quant columns; add <think>
  warning note with link to eval doc transform pattern
- Update By Use Case table: add brain signal routing row, update extraction
  evals fallback to qwen2.5-coder:32b
2026-02-19 16:06:02 -08:00
saravanakumardb1
cfc1194079 docs(local-llms): add latency/cost comparison and deepseek-r1 transform pattern to evals doc
- Add Latency & Cost Comparison table: llama3.1:8b (~1m27s), qwen2.5-coder:32b
  (~5-8m est.), deepseek-r1:32b (~5-8m est.) vs gemini-2.5-flash (~15-25s, $0.003)
  and gpt-4o (~20-40s, $0.05-0.15) — all measured at 19 cases, concurrency=4
- Fix assertion pattern docs: single expressions required, not const/return blocks
- Add deepseek-r1 <think> strip transform pattern for promptfoo provider config
- Expand recommended models table with Disk, Reasoning, Pass Rate, and Notes columns
2026-02-19 16:05:52 -08:00
saravanakumardb1
71a7623553 docs(local-llms): expand installed models table with parameters and quantization
- Add Parameters, Quantization, and Status columns to models table
- qwen2.5-coder:32b: 32.8B params, Q4_K_M, 18.5 GB disk
- llama3.1:8b: 8B params, Q4_K_M, 4.9 GB disk (confirmed via ollama API)
2026-02-19 16:05:42 -08:00
saravanakumardb1
1552006feb fix(local-llm): proxy extraction health check through API route
Move extraction service health check from direct browser fetch
(http://localhost:4005/health) to server-side /api/extraction/health
proxy. Eliminates ERR_CONNECTION_REFUSED console errors when the
extraction service is not running locally.
2026-02-19 15:53:02 -08:00
saravanakumardb1
984630eb45 docs(local-llm): mark ALL 39 items complete in DASHBOARD_REVIEW.md
All bugs (11), code quality (6), features (16), performance (5), and
security (3) items are now checked off. Added Sprint 6 (ed93a6f) and
Sprint 7 (8bdd5ee) to commit log. Updated summary to reflect 100%
completion across 7 sprints.
2026-02-19 15:45:46 -08:00