Expanded from 7-line raw system info to full spec document: - CPU (Ryzen 7 PRO 7840U Zen 4, 8c/16t, AVX-512, NPU) - GPU (Radeon 780M RDNA 3 iGPU, DirectML/ROCm) - RAM (32GB DDR5, 24GB usable, VRAM allocation explained) - Capabilities assessment (dev workstation, light AI, remote dev) - AI/ML section (Ollama CPU, ROCm experimental, Ryzen AI NPU) - Portable dev setup diagram + OpenClaw client use case - 4-machine comparison table - Optimization tips (reclaim RAM from iGPU, WSL2 memory limit) - BIOS recommendations
14 KiB
Dell Latitude 16 (P16s) — Specification & Use Case Guide
Hostname:
WIN-6TAKOREL9MS· Form Factor: 16" Business Laptop · Era: 2023 (Zen 4) Primary Role: Portable development workstation — coding, meetings, light AI testing
1. System Overview
The Dell Latitude 16 (P16s) is a business-class laptop featuring AMD's Ryzen 7 PRO 7840U with integrated Radeon 780M graphics and an AI-capable NPU. It's a solid portable workstation for development, with the Radeon 780M being one of the best integrated GPUs available — capable of light AI inference via ROCm/DirectML.
Raw System Info (from Windows)
| Field | Value |
|---|---|
| Device name | WIN-6TAKOREL9MS |
| Processor | AMD Ryzen 7 PRO 7840U w/ Radeon 780M Graphics (3.30 GHz) |
| Installed RAM | 32.0 GB (23.7 GB usable) |
| Device ID | 0A30F38F-E7BC-4251-B6B4-B0318C5519E0 |
| Product ID | 00355-62015-25014-AAOEM |
| System type | 64-bit operating system, x64-based processor |
| Pen and touch | Touch support with 10 touch points |
Note: 23.7 GB usable out of 32 GB — the ~8.3 GB reserved is allocated to the Radeon 780M integrated GPU as shared VRAM. This is normal and configurable in BIOS.
2. Hardware Specifications
CPU
| Attribute | Specification |
|---|---|
| Model | AMD Ryzen 7 PRO 7840U |
| Architecture | Zen 4 (Phoenix) |
| Base Clock | 3.30 GHz |
| Boost Clock | 5.13 GHz |
| Cores / Threads | 8 / 16 |
| TDP | 15–30W (configurable cTDP) |
| Fabrication | TSMC 4nm |
| Instruction Sets | SSE4.2, AVX2, AVX-512 |
| AI Accelerator | AMD Ryzen AI (XDNA NPU, ~10 TOPS) |
| Integrated GPU | AMD Radeon 780M |
Strengths:
- 8 cores / 16 threads — strong multi-threaded performance
- AVX-512 support — useful for some ML workloads
- Ryzen AI NPU — hardware AI acceleration (Windows Copilot, ONNX Runtime)
- Excellent single-thread performance (Zen 4)
GPU (Integrated)
| Attribute | Specification |
|---|---|
| Model | AMD Radeon 780M |
| Architecture | RDNA 3 |
| Compute Units | 12 CUs (768 stream processors) |
| Clock | Up to 2.7 GHz |
| Shared VRAM | ~8 GB (from system RAM, configurable in BIOS) |
| API Support | DirectX 12, Vulkan 1.3, OpenCL 2.0 |
| AI Support | DirectML, ROCm (limited), ONNX Runtime |
Assessment: The Radeon 780M is the strongest integrated GPU in AMD's lineup. It can run small LLMs (3B–7B quantized) via DirectML/ONNX at usable speeds, though much slower than a discrete GPU.
Memory
| Attribute | Specification |
|---|---|
| Installed | 32 GB |
| Usable | 23.7 GB (rest allocated to Radeon 780M) |
| Type | DDR5 (likely 5600 MHz, dual-channel) |
| Max Supported | 32 GB (soldered, not upgradeable on most P16s configs) |
| GPU Allocation | ~8.3 GB shared VRAM (adjustable in BIOS) |
Important: RAM is likely soldered on this model. What you have is what you get — 32 GB is the ceiling.
Display
| Attribute | Specification |
|---|---|
| Size | 16" (16:10 aspect ratio) |
| Touch | Yes — 10-point multi-touch |
| Resolution | Likely 1920x1200 (WUXGA) or 2560x1600 (WQXGA) |
Network
| Attribute | Specification |
|---|---|
| WiFi | WiFi 6E (Intel AX211 or similar) |
| Ethernet | None built-in (USB-C dongle) |
| WWAN | Optional 5G/LTE (some configs) |
Power
| State | Estimated Draw |
|---|---|
| Idle | ~8–12W |
| Light Load (browsing, coding) | ~15–25W |
| Heavy Load (CPU+GPU stress) | ~45–65W |
| Battery | ~54–64 Wh (6–10 hours typical) |
3. Capabilities Assessment
What It CAN Do Well
| Use Case | Performance | Notes |
|---|---|---|
| Software development | Excellent | 8c/16t, fast SSD, great keyboard |
| Docker containers | Good | 24 GB usable RAM, 8c/16t |
| Web browsing / meetings | Excellent | Low power, quiet, good display |
| Code compilation | Good | Zen 4 single-thread is fast |
| WSL2 development | Good | AMD-V virtualization, 24 GB usable |
| Light AI inference (CPU) | Decent | Ollama CPU mode: ~8–15 tok/s on 7B |
| Light AI inference (iGPU) | Usable | DirectML/ONNX on Radeon 780M: ~10–20 tok/s on 3B–7B |
| OpenClaw Gateway | Excellent | CPU-only, lightweight |
| Portable presentations | Excellent | Touchscreen, good display |
| Remote dev (SSH/VS Code) | Excellent | Connect to HP Z240 or Razer for heavy work |
What It CANNOT Do Well
| Use Case | Why Not |
|---|---|
| Large LLM inference (>13B) | Only 24 GB usable RAM, no discrete GPU |
| GPU training / fine-tuning | No discrete GPU, ROCm support limited on iGPU |
| Whisper CUDA transcription | No NVIDIA GPU |
| TTS at scale | No discrete GPU |
| Image generation | Radeon 780M too slow for practical Stable Diffusion |
| Multi-GPU workloads | Single integrated GPU only |
4. AI / ML Capabilities (Detailed)
Ollama (CPU Mode)
# CPU inference works out of the box
ollama run llama3.2:3b # ~15-25 tok/s (fast, small model)
ollama run llama3.1:8b # ~8-15 tok/s (usable)
ollama run qwen2.5:7b # ~8-15 tok/s (usable)
ollama run llama3.1:70b # Won't fit in RAM — use Razer instead
Ollama (Radeon 780M via ROCm)
ROCm support on integrated GPUs is experimental. If it works:
# Set ROCm environment
export HSA_OVERRIDE_GFX_VERSION=11.0.0
export OLLAMA_GPU_OVERRIDE=radeon
ollama run llama3.2:3b # May get ~20-30 tok/s with iGPU assist
Realistic expectation: CPU mode is reliable. ROCm on 780M is hit-or-miss. Don't count on GPU acceleration here.
AMD Ryzen AI (NPU)
The built-in NPU (XDNA, ~10 TOPS) can accelerate:
- Windows Copilot features
- ONNX Runtime models
- Background AI tasks in supported apps
It's not useful for general LLM inference (too limited), but it offloads small AI tasks from the CPU.
5. Recommended Use Cases
Primary: Portable Development Workstation
┌──────────────────────────────────────────────────────────────────┐
│ Dell P16s — Portable Dev Setup │
│ │
│ ┌────────────────────────────────────────────────────────┐ │
│ │ Local Development │ │
│ │ │ │
│ │ • VS Code / Windsurf (TypeScript, Python) │ │
│ │ • Docker Desktop (containers, local services) │ │
│ │ • WSL2 Ubuntu (Linux tooling) │ │
│ │ • Ollama (small models, CPU mode, quick testing) │ │
│ │ • Git (all 3 repos) │ │
│ │ • Browser (dashboards, docs, meetings) │ │
│ │ │ │
│ │ For heavy GPU work → SSH/Remote Desktop to: │ │
│ │ • Razer Blade 18 (RTX 5090) — GPU inference │ │
│ │ • HP Z240 (bl1box) — always-on services │ │
│ └────────────────────────────────────────────────────────┘ │
│ │
│ Battery: 6–10 hours │
│ Weight: ~2 kg (portable) │
│ Noise: Near-silent under light load │
│ │
└──────────────────────────────────────────────────────────────────┘
Secondary: On-the-Go OpenClaw Client
When away from home, use the Dell P16s as an OpenClaw client connecting back to the HP Z240 server via Tailscale:
# Connect to HP Z240 OpenClaw Gateway via Tailscale
# (Gateway runs on bl1box, accessible anywhere)
open https://bl1box.your-tailnet.ts.net:18789
# Or run a local OpenClaw Gateway for offline use
openclaw gateway --verbose
6. Comparison with Other Machines
| Capability | Dell P16s | HP Z240 (bl1box) | Mac M4 Pro 48GB | Razer RTX 5090 |
|---|---|---|---|---|
| Role | Portable dev | Always-on server | Daily driver | ML powerhouse |
| CPU | Ryzen 7 7840U (8c/16t) | i7-7700K (4c/8t) | M4 Pro (14c) | Ultra 9 275HX (24c) |
| RAM | 32 GB DDR5 (24 usable) | 32 GB DDR4 | 48 GB unified | 64 GB DDR5 |
| GPU | Radeon 780M (iGPU) | None (HD 630) | M4 Pro (MPS) | RTX 5090 24GB |
| LLM Inference | CPU ~10 tok/s (7B) | CPU ~5 tok/s (7B) | MPS ~40 tok/s (7B) | CUDA ~80 tok/s (7B) |
| Portable? | Yes — laptop | No — tower | Yes — laptop | Yes — laptop (heavy) |
| Battery | 6–10 hours | N/A (desktop) | 12–18 hours | 2–4 hours |
| Weight | ~2.0 kg | ~11 kg | ~1.6 kg | ~3.1 kg |
| Best For | Coding on the go | 24/7 services | Everything | GPU workloads |
| Cost | ~$1,200 | ~$100 used | ~$2,500 | ~$4,500 |
7. Optimizing the Dell P16s
Reclaim RAM from iGPU
If you don't need gaming/GPU performance, reduce Radeon 780M VRAM allocation:
BIOS → Advanced → UMA Frame Buffer Size
• Default: 8 GB (leaves 24 GB for OS)
• Reduced: 2 GB (leaves 30 GB for OS)
• Minimum: 512 MB (leaves 31.5 GB for OS)
Trade-off: Less VRAM = worse iGPU performance but more RAM for Docker/VMs/Ollama.
Power Profiles
# Windows — switch power profiles
# Battery saver: longest battery life, lower performance
# Balanced: default
# Best performance: maximum CPU/GPU boost (louder fans)
# Check current profile
powercfg /getactivescheme
# List all profiles
powercfg /list
WSL2 Memory Limit
By default WSL2 can consume all available RAM. Set a limit:
# Create/edit %USERPROFILE%\.wslconfig
[wsl2]
memory=16GB
processors=6
swap=4GB
This reserves 8+ GB for Windows while giving WSL2 plenty.
8. Setup Recommendations
For Development Workstation
- Install WSL2 Ubuntu 24.04
- Install Docker Desktop (uses WSL2 backend)
- Install Ollama (native Windows) for quick model testing
- Install Tailscale for secure access to HP Z240 and Razer
- Clone all 3 repos in WSL2
- Use VS Code / Windsurf with Remote-WSL extension
For Travel / Offline
- Pre-pull small Ollama models:
ollama pull llama3.2:3b,ollama pull qwen2.5:7b - Install OpenClaw locally (optional — for offline AI assistant)
- Ensure Tailscale is configured (auto-connects when back on home network)
BIOS Recommendations
| Setting | Value | Why |
|---|---|---|
| UMA Frame Buffer | 2–4 GB | More RAM for development |
| AMD-V | Enabled | Required for WSL2/Hyper-V |
| Secure Boot | Enabled | Keep for corporate compliance |
| TPM | Enabled | Windows 11 requirement |