saravanakumardb1
3561deee52
docs(local-llm): add multimodal stack, model recommendations, and troubleshooting
- docs/04-multimodal-local-stack.md: vision models (llava, qwen2.5vl, moondream2),
audio pipeline architecture, video understanding status, Kimi alternatives,
complete local AI stack diagram
- docs/07-model-recommendations.md: 6-tier model guide (coding, fast, general,
reasoning, vision, embeddings), recommended 10-model stack for M4 Pro 48GB,
use-case quick reference, hardware scaling guide
- docs/08-troubleshooting.md: corporate Forcepoint proxy workarounds, MLX warning,
JSON parse errors, slow inference, whisper-cli vs whisper-cpp naming, audio
format conversion, detection of proxy-corrupted downloads
2026-02-19 13:01:22 -08:00
saravanakumardb1
80f794dee7
docs(local-llm): add Ollama setup, extraction evals, and env vars reference
- docs/02-ollama-setup-and-models.md: installation, server config, memory management,
idle timeout, manual load/unload, OpenAI-compatible API, native API reference,
performance tuning flags (flash attention, KV cache)
- docs/06-extraction-service-evals.md: promptfoo eval suite against Ollama, 19 cases
across 5 tasks, assertion patterns for JSON string output, Python sidecar config
- docs/09-environment-variables.md: comprehensive var reference for Ollama server,
evals, Python sidecar, dashboard, whisper CLI flags, proxy/network settings
2026-02-19 13:01:05 -08:00
saravanakumardb1
464ffb92ec
docs(local-llm): add docs index, hardware specs, and whisper-cpp setup
- docs/README.md: documentation index with quick start, file structure, status table
- docs/01-hardware-and-prerequisites.md: M4 Pro 48GB specs, toolchain inventory,
disk budget, network environment (Forcepoint proxy details)
- docs/03-whisper-cpp-setup.md: whisper-cpp installation, GGML model guide,
ffmpeg audio conversion, CLI usage, real-time streaming, LysnrAI integration
2026-02-19 13:00:48 -08:00
saravanakumardb1
dd23f6cf96
docs: add local LLM setup guide for Apple Silicon Mac (48GB)
- Add __LOCAL_LLMs/LOCAL_LLMs_setup_mac_m4_48gb.md: comprehensive reference
for running Ollama on the dev Mac, covering installation (v0.16.2 via brew),
corporate proxy handling (AT&T Forcepoint), OpenAI-compatible API usage examples
(curl/Node/Python), extraction-service eval integration, Python sidecar
wiring, model recommendations by use case, troubleshooting, and an env var
reference
- Models documented: llama3.1:8b (4.9GB, default evals), qwen2.5-coder:32b
(19GB, code gen / Swift / TS)
2026-02-19 12:19:44 -08:00