The archive, mapped
Every blog post and case study on the site, mapped to the principles and playbooks each piece argues for. Filter by any principle or playbook to see just the evidence behind it.
- How I run multiple OpenAI-compatible LLM endpoints on a small K3s cluster with AMD Radeon GPUs, and what I had to do to make it stable.
- SLOs for Inference: Latency, Errors, Saturation (Dec 29, 2025 · 6 min)
How to define meaningful SLOs for production inference workloads, and what to do when they break.
- Standing Up a GPU-Ready Private AI Platform (Harvester + K3s + Flux + GitLab) (Dec 29, 2025 · 6 min)
Field notes from building and operating a small private GPU platform with Harvester, K3s, and a GitLab -> Flux delivery loop.
- Hybrid/On-Prem GPU: The Boring GitOps Path (Dec 29, 2025 · 4 min)
A practical guide to running GPU workloads on-prem or hybrid, using Kubernetes and GitOps patterns that make operations boring.
- GPU Failure Modes: What Breaks and How to Debug It (Dec 29, 2025 · 5 min)
Common GPU infrastructure failures in production and how to diagnose them before they become incidents.
- GPU Cost Baseline: What to Measure, What Lies (Dec 29, 2025 · 4 min)
Before you can cut GPU costs, you need to measure them correctly. Here is what to track and what the cloud console will not tell you.
- AI Infra Readiness Audit: What I Check (and What You Get) (Dec 29, 2025 · 3 min)
A practical checklist for auditing production AI infrastructure: GPU cost baselines, reliability risks, and an executable roadmap.