# 09 — Environment Variables Reference

> All configuration variables for Ollama, Whisper, dashboard, and evals.

---

## Ollama Server

| Variable                   | Default                  | Purpose                                                |
| -------------------------- | ------------------------ | ------------------------------------------------------ |
| `OLLAMA_HOST`              | `http://127.0.0.1:11434` | Server bind address                                    |
| `OLLAMA_MODELS`            | `~/.ollama/models`       | Model storage path                                     |
| `OLLAMA_KEEP_ALIVE`        | `5m`                     | How long to keep model loaded after last request       |
| `OLLAMA_FLASH_ATTENTION`   | `false`                  | Enable flash attention (faster, less RAM)              |
| `OLLAMA_KV_CACHE_TYPE`     | _(none)_                 | KV cache quantization (`q8_0` = smaller RAM footprint) |
| `OLLAMA_NUM_PARALLEL`      | `1`                      | Number of concurrent requests                          |
| `OLLAMA_MAX_LOADED_MODELS` | `1`                      | Max models loaded in RAM simultaneously                |
| `OLLAMA_GPU_OVERHEAD`      | _(auto)_                 | Reserved GPU memory (bytes)                            |
| `OLLAMA_ORIGINS`           | `*`                      | Allowed CORS origins                                   |
| `OLLAMA_DEBUG`             | `false`                  | Enable debug logging                                   |
| `HTTP_PROXY`               | _(system)_               | HTTP proxy for model downloads                         |
| `HTTPS_PROXY`              | _(system)_               | HTTPS proxy for model downloads                        |
| `NO_PROXY`                 | _(none)_                 | Hosts to bypass proxy                                  |

### Performance Tuning Combo

```bash
OLLAMA_FLASH_ATTENTION=1 \
OLLAMA_KV_CACHE_TYPE=q8_0 \
OLLAMA_NUM_PARALLEL=2 \
OLLAMA_KEEP_ALIVE=10m \
ollama serve
```

---

## Extraction Service Evals (promptfoo)

| Variable                | Default                     | Purpose                                 |
| ----------------------- | --------------------------- | --------------------------------------- |
| `OLLAMA_MODEL`          | `llama3.1:8b`               | Model used by `pnpm eval:ollama`        |
| `OLLAMA_BASE_URL`       | `http://localhost:11434/v1` | OpenAI-compat endpoint for promptfoo    |
| `EXTRACTION_EVAL_TOKEN` | _(none)_                    | Auth token for extraction-service evals |

### Usage

```bash
# Run evals with a different model
OLLAMA_MODEL=qwen2.5:7b pnpm eval:ollama

# Compare Gemini vs Ollama
EXTRACTION_EVAL_TOKEN=your-token pnpm eval:compare
```

---

## Python Sidecar (LangExtract)

| Variable               | Default          | Purpose                                       |
| ---------------------- | ---------------- | --------------------------------------------- |
| `LANGEXTRACT_PROVIDER` | `gemini`         | Switch to `openai_compat` for Ollama          |
| `LANGEXTRACT_BASE_URL` | _(Gemini)_       | Set to `http://localhost:11434/v1` for Ollama |
| `LANGEXTRACT_API_KEY`  | _(Gemini key)_   | Set to `ollama` for local                     |
| `LANGEXTRACT_MODEL`    | _(Gemini model)_ | Set to `llama3.1:8b` or preferred model       |

### Switch to Ollama

```bash
export LANGEXTRACT_PROVIDER=openai_compat
export LANGEXTRACT_BASE_URL=http://localhost:11434/v1
export LANGEXTRACT_API_KEY=ollama
export LANGEXTRACT_MODEL=llama3.1:8b
```

---

## Mission Control Dashboard

| Variable     | Default                  | Purpose                                |
| ------------ | ------------------------ | -------------------------------------- |
| `OLLAMA_URL` | `http://localhost:11434` | Ollama server URL (used by API routes) |
| `PORT`       | `3100`                   | Dashboard dev server port              |

### Start with Custom Ollama URL

```bash
OLLAMA_URL=http://192.168.1.100:11434 npm run dev -- -p 3100
```

---

## Whisper.cpp

Whisper.cpp uses CLI flags rather than environment variables:

| Flag              | Purpose                       | Example                                            |
| ----------------- | ----------------------------- | -------------------------------------------------- |
| `--model`         | Path to GGML model file       | `--model ~/whisper-models/ggml-large-v3-turbo.bin` |
| `--language`      | Input language                | `--language en`                                    |
| `--file`          | Audio file path               | `--file recording.wav`                             |
| `--output-json`   | Output in JSON format         | `--output-json`                                    |
| `--output-srt`    | Output as SRT subtitles       | `--output-srt`                                     |
| `--output-vtt`    | Output as VTT subtitles       | `--output-vtt`                                     |
| `--translate`     | Translate to English          | `--translate`                                      |
| `--threads`       | Number of CPU threads         | `--threads 8`                                      |
| `--processors`    | Number of processors          | `--processors 1`                                   |
| `--print-colors`  | Colorize output by confidence | `--print-colors`                                   |
| `--no-timestamps` | Omit timestamps               | `--no-timestamps`                                  |
| `--port`          | Server port (whisper-server)  | `--port 8080`                                      |

---

## Proxy / Network (Corporate)

| Variable                       | Value on This Machine            | Purpose                                           |
| ------------------------------ | -------------------------------- | ------------------------------------------------- |
| `HTTP_PROXY`                   | `http://cso.proxy.att.com:8080/` | Corporate HTTP proxy                              |
| `HTTPS_PROXY`                  | `http://cso.proxy.att.com:8080/` | Corporate HTTPS proxy                             |
| `NODE_TLS_REJECT_UNAUTHORIZED` | `0`                              | Bypass Forcepoint SSL interception for Node.js    |
| `NO_PROXY`                     | _(not set by default)_           | Add `ollama.com,registry.ollama.ai` if pulls fail |

---

## All Paths

| Path                                 | Content                     |
| ------------------------------------ | --------------------------- |
| `~/.ollama/models/`                  | Downloaded Ollama models    |
| `~/whisper-models/`                  | Whisper GGML model files    |
| `/opt/homebrew/bin/ollama`           | Ollama binary               |
| `/opt/homebrew/bin/whisper-cli`      | Whisper CLI binary          |
| `/opt/homebrew/bin/ffmpeg`           | FFmpeg binary               |
| `__LOCAL_LLMs/dashboard/`            | Mission Control Next.js app |
| `__LOCAL_LLMs/docs/`                 | This documentation          |
| `services/extraction-service/evals/` | Promptfoo eval configs      |