learning_ai_common_plat/__LOCAL_LLMs/windows_specific
saravanakumardb1 6d18344fe0 docs(local-llms): add 7 RTX 5090 capability deep-dive guides
New capabilities/ subfolder with detailed guides:
- 01: GPU inference speed (benchmarks, Ollama tuning, API usage)
- 02: Whisper batch transcription (scripts, Python integration, use cases)
- 03: TTS generation at scale (Orpheus + Qwen3, batch scripts, voice cloning)
- 04: Fine-tuning / training (LoRA, QLoRA, data prep, Ollama export)
- 05: CUDA / TensorRT / ML research (toolchain setup, Triton kernels, profiling)
- 06: Stable Diffusion / image gen (ComfyUI, SDXL, FLUX, batch generation)
- 07: Multi-GPU workloads (scaling path, eGPU, cloud, cost planning)
- README: index with learning order and prerequisites

Each guide covers: what it is, how to use it, benefits, skills to learn
2026-02-21 20:36:21 -08:00
..
capabilities docs(local-llms): add 7 RTX 5090 capability deep-dive guides 2026-02-21 20:36:21 -08:00
mac-vs-windows-comparison.md docs(local-llms): add Mac vs Windows side-by-side comparison 2026-02-21 17:59:58 -08:00
razer-blade-18-spec.md ci: update CI/CD configuration 2026-02-21 14:13:07 -08:00
README.md feat(local-llms): add one-click Windows setup scripts 2026-02-21 16:28:02 -08:00
setup-guide.md feat(local-llms): add one-click Windows setup scripts 2026-02-21 16:28:02 -08:00
setup-windows.ps1 feat(local-llms): add one-click Windows setup scripts 2026-02-21 16:28:02 -08:00
setup-wsl.sh feat(local-llms): add one-click Windows setup scripts 2026-02-21 16:28:02 -08:00

Windows Setup — Local LLM Stack

Two scripts. Zero IDE required. Fresh machine to running dashboard in ~30 minutes.

Quick Start

Step 1: Windows Side (PowerShell as Admin)

# Allow script execution for this session
Set-ExecutionPolicy -Scope Process Bypass

# Run the Windows setup (installs Ollama, pulls models, installs WSL2)
.\setup-windows.ps1

What it does: Verifies NVIDIA drivers, installs Ollama, pulls 5 models (~52 GB), installs WSL2 Ubuntu 24.04.

After WSL2 install you may need to reboot. Ubuntu will ask for a username/password on first launch.

Step 2: WSL2 Side (Ubuntu terminal)

# One-liner — downloads and runs the WSL2 setup script
curl -fsSL https://raw.githubusercontent.com/saravanakumardb1/learning_ai_common_plat/main/__LOCAL_LLMs/windows_specific/setup-wsl.sh | bash

What it does: Installs Node.js, Python, ffmpeg, cmake → builds Whisper.cpp with CUDA → sets up TTS → starts the dashboard.

Step 3: Open Browser

http://localhost:3000

Dashboard should show all green. Done.


What Gets Installed

Component Where Size
NVIDIA drivers Windows pre-installed
Ollama Windows (native) ~200 MB
5 LLM models Windows (%USERPROFILE%\.ollama\) ~52 GB
WSL2 Ubuntu 24.04 Windows ~1 GB
Node.js 20 LTS WSL2 ~50 MB
Python 3.12 + venv WSL2 ~200 MB
whisper.cpp (CUDA) WSL2 /usr/local/bin/ ~50 MB
Whisper model WSL2 ~/whisper-models/ ~1.5 GB
SNAC decoder WSL2 (repo models/) ~76 MB
Qwen3-TTS 0.6B WSL2 (repo models/) ~1.7 GB
PyTorch CUDA WSL2 (.venv-qwen-tts/) ~2.5 GB
Dashboard deps WSL2 (dashboard/node_modules/) ~200 MB

Total: ~60 GB (mostly Ollama models)


Files in This Directory

File Purpose
README.md This file — quick start guide
setup-windows.ps1 PowerShell script — Windows-side setup
setup-wsl.sh Bash script — WSL2-side setup
setup-guide.md Detailed manual guide with troubleshooting
razer-blade-18-spec.md Full hardware specs for the Razer Blade 18

After Setup

# Daily usage — start everything
cd ~/code/mygh/learning_ai_common_plat/__LOCAL_LLMs
bash start-dashboard.sh

# Check status
bash start-dashboard.sh status

# Stop
bash start-dashboard.sh stop

# Test TTS
.venv-qwen-tts/bin/python test_orpheus_tts.py
.venv-qwen-tts/bin/python test_qwen_tts.py

Troubleshooting

See setup-guide.md for common issues:

  • Ollama not accessible from WSL2
  • CUDA not visible in WSL2
  • Slow filesystem performance