Contact

Work With Me

If you're trying to ship production inference (or stop GPU costs from spiking), describe what you're building and where it's getting stuck.

Send a message

Include context like your stack, current GPU/runtime setup, timeline, and what "success" looks like. I'll respond with next steps.

Typical response time: 24-48 hours