bytelyst-devops-tools/dashboard/web
Hermes VM 13a105ba23 feat(vm): Phase 5 closure — GPU/freshness checks, chaos validation, I/O alert
vm-health-check.sh:
- check_gpu(): nvidia-smi probe; "CPU-only" OK on this VM (no GPU)
- check_image_freshness(): flag containers running images >30d old.
  Skips third-party images (gitea, grafana, prom, mcr.microsoft, axllent,
  caddy, traefik, valkey, cadvisor) — they have their own rebuild cadence.
  Currently flags 19 stale product images (~60d old).

chaos-validation.sh:
- Monthly chaos test: kill PID 1 in chronomind-web, wait up to 35 min
  for docker-health-watchdog to detect + restart. Telegram pass/fail.
- Refuses to run if target not healthy. systemd timer fires 1st of month
  at 10:00 UTC (after 08:00 weekly digest).

vm-io-anomaly-check.sh:
- 6h avg sda write rate; transition alerts at WARN (1 GB/hr) /
  CRIT (2.5 GB/hr). De-dupes via /var/log/vm-io-anomaly-state so the
  alert fires once per transition, not every 6h. Current baseline:
  ~1.94 GB/hr (orphan-container state-file writes; see Phase 0.3).
- Reports recovery to OK when rate drops back.

vm/page.tsx: gpu + image_freshness added to CHECK_META so they render
with proper icon/label and slot into CHECK_ORDER.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-05-30 05:26:49 +00:00
..
e2e Complete Hermes ops dashboard and roadmap 2026-05-27 20:53:58 +00:00
public feat(devops): adopt trading web deployment model with docker-compose 2026-05-11 03:24:11 +00:00
src feat(vm): Phase 5 closure — GPU/freshness checks, chaos validation, I/O alert 2026-05-30 05:26:49 +00:00
.env.local.example feat(devops-web): fix responsive layout and add comprehensive dashboard pages 2026-05-11 03:10:31 +00:00
.gitignore feat(devops-web): fix responsive layout and add comprehensive dashboard pages 2026-05-11 03:10:31 +00:00
.pnpmfile.cjs feat(devops-web): add UX foundation - local package resolution and design tokens 2026-05-11 02:01:29 +00:00
Dockerfile feat(devops): restore dashboard build and log polling 2026-05-18 09:00:56 +00:00
next-env.d.ts feat: add Hermes mission control dashboard 2026-05-26 08:27:59 +00:00
next.config.js feat: add Hermes mission control dashboard 2026-05-26 08:27:59 +00:00
package-lock.json feat(dashboard/vm): Phases 1.1, 1.3, 3.1, 3.4 — VM page panels 2026-05-27 21:49:23 +00:00
package.json feat: add Hermes mission control dashboard 2026-05-26 08:27:59 +00:00
playwright.config.ts feat: add Hermes mission control dashboard 2026-05-26 08:27:59 +00:00
postcss.config.js feat(devops-web): fix responsive layout and add comprehensive dashboard pages 2026-05-11 03:10:31 +00:00
tailwind.config.ts feat(devops-web): fix responsive layout and add comprehensive dashboard pages 2026-05-11 03:10:31 +00:00
tsconfig.json feat: add Hermes mission control dashboard 2026-05-26 08:27:59 +00:00
tsconfig.tsbuildinfo fix: systematic bug fixes — code-quality parser, env key, config warnings, auth cleanup, deployment safety 2026-05-27 18:53:20 +00:00
vitest.config.ts feat: add Hermes mission control dashboard 2026-05-26 08:27:59 +00:00