Writing

March 25, 2026

How I self-hosted an AI chat on my portfolio (and why I stopped)

I ran my own LLM on a GPU. Cold starts killed the experience. Here's the real tradeoff between quality, speed, and money when you self-host inference.

GovTech, Data + AI Platform, Reference Architecture, API Design