Writing
Case studies and blog posts on architecture, operations, and delivery: what worked, what failed, and what is reusable.
Case Studies
Real implementations with metrics, stack details, and lessons learned.
Blog
Technical notes, implementation updates, and deeper dives.
April 4, 2026 · 8 min read
Lab
Getting Gemma 4 Running on a Radeon 7900 XTX (with and without TurboQuant)
What it took to get Gemma 4 E4B serving cleanly on Radeon through FlexInfer: a stable TRITON lane on a 7900 XTX, an experimental TurboQuant long-context lane on a second node, and the GPTQ pipeline work still underway.
gemma4 · amd · radeon · 7900xtx · +6 more
March 9, 2026 · 8 min read
Professional
Build Your Own Legs Before the Crutches Fail
AI-assisted development is useful leverage, but only if you convert borrowed competence into real judgment before the support becomes a dependency.
ai-assisted-dev · engineering · agents · developer-workflows