Contact

Work With Me

If you're trying to ship production inference (or stop GPU costs from spiking), describe what you're building and where it's getting stuck.

Send a message

Include context like your stack, current GPU/runtime setup, timeline, and what "success" looks like. I'll respond with next steps.

Typical response time: 24-48 hours